Monitoring ChatGPT Drifts Reveals Substantial Behavior Changes Over Time

A new paper by researchers at Stanford University and UC Berkeley reveals that the behavior of…

Llama 2: An Open Large Language Model Matching Proprietary Chatbots

A new large language model called Llama 2 was recently open-sourced by researchers at Meta AI.…

[Article] Faster Transformers for Longer Context with FlashAttention-2

Researchers from Stanford University have developed a new technique called FlashAttention-2 that can significantly speed up…

[Article] Retentive Networks: The Next Evolution of Transformers for AI?

A new paper from researchers at Microsoft proposes a novel neural network architecture called Retentive Networks…

Faster Optimization with Counterintuitively Long Steps

A new study by Benjamin Grimmer at Johns Hopkins University has demonstrated that the classic gradient…

New Framework Generates Commonsense Knowledge with Smaller AI Models

Researchers at the Allen Institute for AI have developed a novel framework called I2D2 that can…

Massive Language Models Struggle to Learn Rare Facts

A new study from researchers at UNC Chapel Hill and Google Research reveals that large language…

AI system creates realistic images and art from a textual description

“An astronaut riding a horse as pencil drawing”. This and many more images can be easily…