GitHub – ZrrSkywalker/LLaMA-Adapter: Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters


GPT-4: LLaMA-Adapter is a lightweight adaptation method for fine-tuning LLaMA into an instruction-following model using Stanford Alpaca's 52K instruction data. With only 1.2M learnable parameters, it completes fine-tuning within an hour. The method introduces a novel zero-init attention mechanism that stabilizes training in the early stages, and it can be extended to multi-modal input instructions. LLaMA-Adapter generates high-quality instruction-following responses, comparable to those of the fully fine-tuned Stanford Alpaca and Alpaca-LoRA models.
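The core idea behind zero-init attention is that learnable adaptation prompts are prepended to the keys and values of the frozen model's attention layers, while their contribution is scaled by a gating factor initialized to zero, so training starts from the pretrained model's unchanged behavior. Below is a minimal PyTorch sketch of that idea, not the repo's actual code; the module name, prompt length, and the tanh gating detail are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ZeroInitAdapterAttention(nn.Module):
    """Sketch of zero-init attention: a learnable adaptation prompt is
    prepended to keys/values, and its attention contribution is scaled
    by a gate initialized to zero, so the frozen model's output is
    untouched at step 0. Names and shapes are illustrative."""

    def __init__(self, dim: int, n_heads: int, prompt_len: int = 10):
        super().__init__()
        self.n_heads = n_heads
        self.head_dim = dim // n_heads
        # Frozen pretrained projections (weights would come from LLaMA).
        self.wq = nn.Linear(dim, dim, bias=False)
        self.wk = nn.Linear(dim, dim, bias=False)
        self.wv = nn.Linear(dim, dim, bias=False)
        self.wo = nn.Linear(dim, dim, bias=False)
        # Learnable adaptation prompt (part of the ~1.2M trainable params).
        self.prompt = nn.Parameter(torch.randn(prompt_len, dim) * 0.02)
        # Gating factor initialized to zero: adapter has no effect initially.
        self.gate = nn.Parameter(torch.zeros(n_heads))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, dim = x.shape
        p = self.prompt.unsqueeze(0).expand(b, -1, -1)       # (b, p, dim)

        def split(h):  # (b, s, dim) -> (b, heads, s, head_dim)
            return h.view(b, -1, self.n_heads, self.head_dim).transpose(1, 2)

        q = split(self.wq(x))
        k, v = split(self.wk(x)), split(self.wv(x))          # token K/V
        pk, pv = split(self.wk(p)), split(self.wv(p))        # prompt K/V

        scale = self.head_dim ** -0.5
        # Ordinary causal self-attention over the input tokens.
        scores = (q @ k.transpose(-2, -1)) * scale           # (b, h, t, t)
        causal = torch.triu(
            torch.ones(t, t, dtype=torch.bool, device=x.device), diagonal=1
        )
        scores = scores.masked_fill(causal, float("-inf"))
        attn = F.softmax(scores, dim=-1)

        # Attention over the adaptation prompt, softmaxed separately and
        # multiplied by the zero-initialized gate.
        p_scores = (q @ pk.transpose(-2, -1)) * scale        # (b, h, t, p)
        p_attn = F.softmax(p_scores, dim=-1) * torch.tanh(self.gate).view(1, -1, 1, 1)

        out = attn @ v + p_attn @ pv
        out = out.transpose(1, 2).reshape(b, t, dim)
        return self.wo(out)
```

Because the gate starts at zero, early gradient updates cannot inject noisy prompt signals into the frozen backbone, which is what stabilizes the first stages of training.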
Read more at GitHub…
