LLMJuly 18, 2023Meta AI

Llama 2: Open Foundation and Fine-Tuned Chat Models

Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei

Abstract

We develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform existing open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for some closed-source models.

Key Findings

1Released open-source LLMs from 7B to 70B parameters competitive with closed models
2Fine-tuned chat models using RLHF with over 1 million human annotations
3Demonstrated that open-source models can approach closed-source quality
4Included extensive safety evaluations and red-teaming
5Released with a permissive license enabling commercial use

Impact & Significance

Llama 2 democratized access to high-quality LLMs, enabling thousands of open-source projects, fine-tuned variants, and enterprise deployments. It proved that powerful open-source AI models could be both safe and commercially viable.

Related Tools

Llama Hugging Face Ollama

Read Full Paper

Llama 2: Open Foundation and Fine-Tuned Chat Models

Abstract

Key Findings

Impact & Significance

Related Tools

Related Papers

The Llama 3 Herd of Models

Qwen2 Technical Report

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

The Claude 3 Model Family: Opus, Sonnet, and Haiku