New AI system aims to improve factuality of large language model outputs

Recent advances in large language models (LLMs) like ChatGPT have demonstrated impressive capabilities in generating human-like…

Open-Source Lemur Brings Language Agents into Focus: Reasoning, Coding, and Versatility

A new open-source language model named Lemur, introduced in a paper from researchers at the University…

Rethinking Calibration for More Robust Large Language Models

Large language models (LLMs) like GPT-3 have shown impressive capabilities when prompted with instructions or given…

Automated Program Repair Deployed at Facebook

Facebook researchers have achieved a major milestone in automated program repair with the deployment of SapFix,…

New Tool-Integrated Reasoning Agents Achieve Major Gains in Mathematical Problem Solving

A new study from researchers at Tsinghua University and Microsoft presents ToRA, a series of novel…

Improved Baselines for Visual Instruction Tuning Models

Researchers from the University of Wisconsin-Madison and Microsoft Research have developed improved baselines for visual instruction…

Borges and AI: A New Perspective on Language Models

A new paper by researchers Léon Bottou and Bernhard Schölkopf offers a novel perspective on large…

New Decoding Method Boosts Reasoning in AI Models

Researchers from UC San Diego and Meta AI have developed a new decoding method called Contrastive…

Simplifying Vision Transformers with ReLU Attention

A new paper from researchers at DeepMind explores replacing the softmax function in transformer attention with…

Simple Auto-Regressive Models Shown to be Powerful Universal Learners

Recent advancements in large language models like GPT-3 and GPT-4 have demonstrated remarkable capabilities in logical…

No More Manual Testing? ChatGPT Shows Promise for Automated Unit Test Generation

A new study from researchers at multiple Chinese universities evaluates ChatGPT’s ability to automatically generate unit…

AGENTS – An Open-source Framework for Building Autonomous Language Agents

Recent advances in large language models (LLMs) like GPT-3 and ChatGPT have enabled the development of…

GPT-4 Takes on P vs NP, Reveals Potential of LLMs in Scientific Discovery

A new study reveals that large language models like GPT-4 can make significant contributions to complex…

When Stars Align with AI: Training LLM for Astronomy Texts

Large language models like GPT-3 and PaLM have demonstrated impressive performance on many natural language tasks.…

Large Language Models Show Promise as General-Purpose Optimizers

A new paper from researchers at Google DeepMind demonstrates the potential for large language models (LLMs)…

Large Language Models Still Struggle with Reliable Code Generation

A new study from researchers at UC San Diego raises concerns about the reliability and robustness…

Can AI Find and Fix Software Vulnerabilities?

A new study evaluates the ability of large language models (LLMs) like ChatGPT to detect and…

A New System to Turn Natural Language Prompts into Deployable AI Models

A team of researchers from Carnegie Mellon University and Tsinghua University have introduced a new system…

Automated Unit Testing Reaches New Heights with ChatGPT-Based Tool ChatUniTest

Unit testing is a crucial yet often tedious task in software development. To make this process…

Knowledge Graph Prompting Enhances Multi-Document Question Answering with Large Language Models

Recent advances in large language models (LLMs) like ChatGPT have shown promising results on open-domain question…