DBRX: A New State-of-the-Art Open LLM

Databricks has unveiled DBRX, a new open-source large language model (LLM) that has set a new benchmark for performance among established open LLMs. DBRX boasts superior capabilities in general-purpose tasks and coding, outperforming specialized models like CodeLLaMA-70B. It utilizes a fine-grained mixture-of-experts (MoE) architecture, which results in a model that is both smaller and faster—up to twice as fast as LLaMA2-70B in inference and 40% smaller than Grok-1 in parameter count. Additionally, DBRX is more FLOP-efficient during training compared to dense models.

The model’s weights are accessible on Hugging Face under an open license, and Databricks customers can use DBRX via APIs or train their own models using Databricks’ tools. DBRX has already been integrated into GenAI-powered products, showing promising results in applications like SQL.

DBRX’s training process leveraged Databricks’ suite of tools, including Apache Spark™ and MLflow, and was conducted on NVIDIA H100 GPUs. The model’s efficiency extends to inference, with optimized serving infrastructure allowing up to 150 tokens per second per user. Databricks emphasizes the importance of enterprises having control over their data and AI destiny, positioning DBRX as a key component in their next-generation GenAI products. The development of DBRX was a collaborative effort, drawing on contributions from across the Databricks team and the wider AI community.
Read more at Databricks…

DBRX: A New State-of-the-Art Open LLM

Related

The Energy Infrastructure Gap That Could Decide the AI Race

AI-Powered Security Checks: Filtering Bots Without Slowing Users

Inside the Underground World of LLM Jailbreaks

GPT-5 is Here, and It’s Not What You Expected

The AI Agent That Actually Knows How to Build ML Models

Qwen-Image: Finally, an AI That Can Actually Write

Perplexity’s Stealth Crawling Sparks Debate Over AI Web Ethics

Feeding Your Gut to Fight Fat: How Tryptophan Sparks Hormone Recovery

Putting Math Behind the Madness: A Theoretical Framework for LLM Hallucinations