LLMMarch 15, 2023OpenAI

GPT-4 Technical Report

OpenAI

Abstract

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers.

Key Findings

  • 1Demonstrated human-level performance on professional exams including the bar exam
  • 2Introduced multimodal capabilities accepting both image and text inputs
  • 3Showed significant improvements in reasoning and factuality over GPT-3.5
  • 4Achieved strong performance across 26 languages
  • 5Included extensive safety and alignment work through RLHF

Impact & Significance

GPT-4 represented a major leap in LLM capabilities and set new benchmarks for what AI systems could achieve on professional-grade tasks. It accelerated enterprise AI adoption and sparked a wave of multimodal AI development.

Related Tools

Read Full Paper