Scaling Up Language Models with Agent Ensembles

A new study reveals that simply increasing the number of agents in an ensemble can boost…

Largest text-to-speech AI model yet shows ’emergent abilities’

Amazon researchers have developed BASE TTS, the largest text-to-speech model to date, with 980 million parameters…

Judge rejects most ChatGPT copyright claims from book authors

In a significant legal development, a US district judge in California has dismissed most of the…

Memory and new controls for ChatGPT

OpenAI’s ChatGPT now offers a memory feature for Enterprise and Team users, enhancing productivity by learning…

Google’s paid Gemini Advanced plan is getting mixed reviews

Google’s Gemini Advanced, the paid version of its AI assistant, is eliciting a polarized response from…

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Harnessing the power of Supervised Fine-Tuning (SFT) is crucial for the evolution of Large Language Models…

Google introduces AI-powered Gemini app and casts aside Bard

Large Language Models Learn to Self-Compose Reasoning Structures

Researchers from Google DeepMind and University of Southern California have developed a new technique called SELF-DISCOVER…

Fastest JSON Decoding for Local LLMs with Compressed Finite State Machine

Hugging Face makes it easier to create its custom chatbots.

Hugging Face has streamlined the process of creating custom chatbots with its new Hugging Chat Assistant,…

DEJAVU: 6x faster transformers’ inference

In the YouTube video “Sparse LLMs at inference: 6x faster transformers! | DEJAVU paper explained,” the…

MoE-LLaVA: Mixture-of-Experts for Large Vision-Language Models

MoE-LLaVA, a new Mixture of Experts (MoE) model for large vision-language tasks, demonstrates high performance with…

Proof of Achievement of the First Artificial General Intelligence

A groundbreaking development in artificial intelligence has been announced with the creation of the first Artificial…

Mistral CEO confirms ‘leak’ of new open source AI model nearing GPT-4 performance

The open source AI community has been abuzz with the emergence of a new large language…

ChatGPT now lets you pull other GPTs into the chat

OpenAI has introduced a new feature for ChatGPT Plus subscribers, allowing them to bring third-party GPTs…

🦅 Eagle 7B : Soaring past Transformers with 1 Trillion Tokens Across 100+ Languages (RWKV-v5)

RWKV-v5 Eagle 7B, a new 7.52 billion parameter model, has been released under the Apache 2.0…

Code LLaMA 2 70B, a ‘free’ source code generating model beating GPT-4

Brave Leo, the AI browser assistant, now features Mixtral for improved performance | Brave

Brave has updated its desktop browser to include Mixtral 8x7B as the default large language model…

OpenAI Unveils New Embedding Models, API Enhancements, and Pricing Updates

OpenAI has announced significant updates, including the launch of new embedding models, enhancements to GPT-4 Turbo,…

AlphaGeometry: An Olympiad-level AI system for geometry

AlphaGeometry, a new AI system, has achieved a significant milestone by solving complex geometry problems at…