Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei
Abstract
We develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform existing open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for some closed-source models.
Key Findings
- 1Released open-source LLMs from 7B to 70B parameters competitive with closed models
- 2Fine-tuned chat models using RLHF with over 1 million human annotations
- 3Demonstrated that open-source models can approach closed-source quality
- 4Included extensive safety evaluations and red-teaming
- 5Released with a permissive license enabling commercial use
Impact & Significance
Llama 2 democratized access to high-quality LLMs, enabling thousands of open-source projects, fine-tuned variants, and enterprise deployments. It proved that powerful open-source AI models could be both safe and commercially viable.
Related Tools
Related Papers
The Llama 3 Herd of Models
Meta AI
Qwen2 Technical Report
Alibaba Cloud / Qwen Team
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepSeek AI
The Claude 3 Model Family: Opus, Sonnet, and Haiku
Anthropic