Improving Language Model Behavior by Training on a Curated Dataset

Our latest research finds we can improve language model behavior with respect to specific behavioral values…

GPT-J-6B: 6B JAX-Based Transformer

Summary: We have released GPT-J-6B, a 6B JAX-based (Mesh) Transformer LM (GitHub). GPT-J-6B performs nearly on par with…