Diffusion language models

Diffusion models have completely taken over generative modelling of perceptual signals — why is autoregression still the name of the game for language modelling? Can we do anything about that?

GPT-4 says:
Diffusion models have revolutionized generative modeling for perceptual signals like images, audio, and video. However, autoregression remains dominant in language modeling. This article explores the potential of diffusion models in language modeling, discussing the challenges and advantages of iterative refinement techniques. It also examines the use of continuous Gaussian diffusion for discrete data and the possibility of learning higher-level continuous representations for language modeling. While autoregression remains a tough baseline to beat, further exploration of diffusion models in language modeling could yield significant benefits.
Read more at Sander Dieleman…

Diffusion language models

Related

OpenAI Codex CLI: Executable AI Reasoning Hits Your Terminal

GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano

DolphinGemma: Unveiling the Language of the Seas with AI

Grok 3 API Debuts with Scalable Models for Code, Data, and Enterprise Tasks

Smarter GitHub Automation with the MCP Server

China Unveils GPMI: A Single-Cable Standard for 8K Video and High Power

When Weather Apps Steal Your SSH Keys

Llama 4

Tame Your Terminal: Managing AI Coding Agents with Claude Squad