Generative AI: Databricks unveils open source large language model

March 27, 2024


Data and artificial intelligence (AI) company Databricks has unveiled DBRX, a general-purpose large language model (LLM) that it claims can outperform other open source models.

The company said DBRX outperforms existing open source LLMs such as Llama 2 70B and Mixtral-8x7B on industry benchmarks covering language understanding, programming, maths and logic.

“DBRX democratises the training and tuning of custom, high-performing LLMs for every enterprise so they no longer need to rely on a small handful of closed models,” the company said.

Ali Ghodsi, co-founder and CEO of Databricks, said DBRX enables enterprises to build “customised reasoning capabilities based on their own data”. Because DBRX beats GPT-3.5 on most benchmarks, he said it should accelerate a trend Databricks is seeing among its customers: organisations replacing proprietary models with open source models.

DBRX outperforms GPT-3.5 across language understanding (MMLU), programming (HumanEval) and maths (GSM8K), Databricks said.

DBRX was developed by Mosaic AI and trained on Nvidia DGX Cloud. Databricks optimised DBRX for efficiency with a mixture-of-experts (MoE) architecture, built on the MegaBlocks open source project. The resulting model is up to twice as compute-efficient as other available leading LLMs, the company said.
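
The efficiency gain in a mixture-of-experts design comes from running only a few "expert" sub-networks for each input rather than the whole model. The sketch below is a toy illustration of that routing idea in plain Python, not Databricks' or MegaBlocks' code: the function names, the four stand-in experts, the router weights and the choice of two active experts are all assumptions made for the example.

```python
import math

def softmax(scores):
    """Turn raw router scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(x, router_weights, experts, top_k=2):
    """Route input x to the top_k highest-scoring experts and mix their outputs.

    Hypothetical helper for illustration: router_weights holds one weight
    vector per expert, and each expert is a function from vector to vector.
    """
    # One router score per expert: dot product of its weight vector with x.
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in router_weights]
    probs = softmax(scores)

    # Keep only the top_k experts; the rest are never evaluated.
    chosen = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]

    # Renormalise the gate values over the chosen experts.
    norm = sum(probs[i] for i in chosen)
    out = [0.0] * len(x)
    for i in chosen:
        gate = probs[i] / norm
        y = experts[i](x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, chosen

# Four stand-in experts that simply scale their input (assumed for the demo).
experts = [lambda v, s=s: [s * vi for vi in v] for s in (1.0, 2.0, 3.0, 4.0)]
router_weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]

output, active = moe_forward([2.0, 1.0], router_weights, experts, top_k=2)
print("active experts:", active)
print("output:", output)
```

In this toy run only two of the four experts do any work for the input, which is the source of the compute saving; a production model applies the same idea per token inside each MoE layer, with learned routers and far larger experts.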

DBRX is available on GitHub and Hugging Face for research and commercial use. On the Databricks Platform, enterprises can interact with DBRX and build custom DBRX models on their own data. The model is also available on Amazon Web Services (AWS) and Google Cloud, as well as directly on Microsoft Azure through Azure Databricks, and is expected to be available through the Nvidia Catalog API and supported on the Nvidia NIM inference microservice.

While the model is open source, Databricks also offers services around it to help enterprises build and deploy production-quality generative AI (GenAI) applications.

“This is going to be by far the best open source model out there. It surpasses GPT-3.5 in quality and it is completely open source.”

Naveen Rao, Databricks