LLMMarch 15, 2023OpenAI
GPT-4 Technical Report
OpenAI
Abstract
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers.
Key Findings
- 1Demonstrated human-level performance on professional exams including the bar exam
- 2Introduced multimodal capabilities accepting both image and text inputs
- 3Showed significant improvements in reasoning and factuality over GPT-3.5
- 4Achieved strong performance across 26 languages
- 5Included extensive safety and alignment work through RLHF
Impact & Significance
GPT-4 represented a major leap in LLM capabilities and set new benchmarks for what AI systems could achieve on professional-grade tasks. It accelerated enterprise AI adoption and sparked a wave of multimodal AI development.
Related Tools
Related Papers
LLMJuly 23, 2024
The Llama 3 Herd of Models
Meta AI
LLMJuly 15, 2024
Qwen2 Technical Report
Alibaba Cloud / Qwen Team
EfficiencyMay 7, 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepSeek AI
LLMMarch 4, 2024
The Claude 3 Model Family: Opus, Sonnet, and Haiku
Anthropic