The Dawn of 1-Bit Large Language Models

A new paper from Microsoft Research titled “The Era of 1-bit LLMs: All Large Language Models…

Phind-70B closes the code quality gap with GPT-4 while running 4x faster

Phind, the startup behind the AI assistant of the same name, has released its largest language…

Gemini 1.5: A Giant Leap in Long-Context AI

Google DeepMind unveiled its latest AI system, Gemini 1.5 Pro, representing a major advance in models’…

Scaling Up Language Models with Agent Ensembles

A new study reveals that simply increasing the number of agents in an ensemble can boost…

Large Language Models Learn to Self-Compose Reasoning Structures

Researchers from Google DeepMind and the University of Southern California have developed a new technique called SELF-DISCOVER…

New AI Breakthrough: Mixtral 8x7B Surpasses Leading Models in Performance and Efficiency

In the rapidly evolving field of artificial intelligence, a groundbreaking model named Mixtral 8x7B, developed…

Mamba: Revolutionizing Sequence Modeling with Selective State Spaces

In the recent breakthrough paper titled “Mamba: Linear-Time Sequence Modeling with Selective State Spaces,” authors…

MathCoders: Enhancing Mathematical Reasoning of Open-Source Language Models

A group of researchers from The Chinese University of Hong Kong, Shanghai Artificial Intelligence Laboratory, and…

Linux Copilot: Interacting with Linux Desktop via GPTs

The Linux Copilot project uses Generative Pretrained Transformers (GPTs) to perform tasks on your Linux desktop.…

Orca 2 has splashed!

Microsoft researchers have developed a new technique called “Cautious Reasoning” that allows smaller AI models to…

Researchers Evaluate Abstraction Abilities of Text and Multimodal Versions of GPT-4

Recent advances in large language models (LLMs) like GPT-3 and GPT-4 have led to claims that…

Boosting Code LLMs Through Innovative Multitask Fine-Tuning

A new study proposes an innovative approach to enhancing the capabilities of Code LLMs through multi-task…

Making Whisper Models Faster and Smaller Through Knowledge Distillation

Recent advances in self-supervised pre-training have led to impressive gains in speech recognition performance. Models like…

Phind’s New Model Matches GPT-4 in Coding at 5x the Speed

Phind has unveiled a new model that achieves coding abilities on par with OpenAI’s…

No-Code Tools Enable Customizable Open AI Models

A new paper titled “H2O Open Ecosystem for State-of-the-art Large Language Models” introduces two open-source libraries…

New AI system aims to improve factuality of large language model outputs

Recent advances in large language models (LLMs) like ChatGPT have demonstrated impressive capabilities in generating human-like…

Open-Source Lemur Brings Language Agents into Focus: Reasoning, Coding, and Versatility

A new open-source language model named Lemur, introduced in a paper from researchers at the University…

Rethinking Calibration for More Robust Large Language Models

Large language models (LLMs) like GPT-3 have shown impressive capabilities when prompted with instructions or given…

Automated Program Repair Deployed at Facebook

Facebook researchers have achieved a major milestone in automated program repair with the deployment of SapFix,…

New Tool-Integrated Reasoning Agents Achieve Major Gains in Mathematical Problem Solving

A new study from researchers at Tsinghua University and Microsoft presents ToRA, a series of novel…