LLMMarch 15, 2023OpenAI

GPT-4 Technical Report

OpenAI

Abstract

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers.

Key Findings

1Demonstrated human-level performance on professional exams including the bar exam
2Introduced multimodal capabilities accepting both image and text inputs
3Showed significant improvements in reasoning and factuality over GPT-3.5
4Achieved strong performance across 26 languages
5Included extensive safety and alignment work through RLHF

Impact & Significance

GPT-4 represented a major leap in LLM capabilities and set new benchmarks for what AI systems could achieve on professional-grade tasks. It accelerated enterprise AI adoption and sparked a wave of multimodal AI development.

Related Tools

Chatgpt Openai Api

Read Full Paper

GPT-4 Technical Report

Abstract

Key Findings

Impact & Significance

Related Tools

Related Papers

The Llama 3 Herd of Models

Qwen2 Technical Report

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

The Claude 3 Model Family: Opus, Sonnet, and Haiku