GPT-5 is OpenAI's most advanced model, offering major improvements in reasoning, code quality, and user experience. It handles complex coding tasks with minimal prompting, provides clear explanations, and introduces enhanced agentic capabilities. It is designed for logic and multi-step tasks.
Added Aug 7, 2025
Context Window
400K tokens
Max Output
128K tokens
Input Price
$1.25 per 1M tokens
Output Price
$10.00 per 1M tokens
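As a rough illustration of how the per-token prices above turn into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the example token counts are hypothetical. The sketch assumes flat per-token billing at the listed rates and ignores any discounts or surcharges a provider may apply.

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 1.25
OUTPUT_PRICE_PER_M = 10.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under flat per-token pricing.

    Hypothetical helper for illustration only; real billing may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical request: 20,000 input tokens and 2,000 output tokens.
example_cost = estimate_cost(20_000, 2_000)
print(f"Estimated cost: ${example_cost:.4f}")
```

Because output tokens cost eight times as much as input tokens at these rates, a short prompt with a long completion can cost more than a long prompt with a brief answer.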
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
21.8
Coding Index
21.2
Agentic Index
54.6
GPQA Diamond
Graduate-level scientific reasoning
68.6%
Better than 56% of models compared
HLE
Humanity's Last Exam
5.8%
Better than 48% of models compared
IFBench
Instruction-following benchmark
45.0%
Better than 56% of models compared
τ²-Bench Telecom
Conversational AI agents in dual-control scenarios
0.0%
Better than 3% of models compared
AA-LCR
Long context reasoning evaluation
63.7%
Better than 84% of models compared
GDPval-AA
Economically valuable tasks
39.8%
Better than 85% of models compared
CritPt
Research-level physics reasoning
5.7%
Better than 92% of models compared
SciCode
Python programming for scientific computing
37.8%
Better than 73% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
12.9%
Better than 54% of models compared
AIME 2025
American Invitational Mathematics Examination 2025
48.3%
Better than 47% of models compared
MMLU-Pro
Professional and academic subject knowledge
82.0%
Better than 78% of models compared
AA-Omniscience Accuracy
Proportion of correctly answered questions
40.6%
Better than 94% of models compared
Last updated May 11, 2026
Artificial Analysis
LiveCodeBench
Contamination-free coding benchmark
54.3%
Better than 61% of models compared
AA-Omniscience Hallucination Rate
Rate of incorrect answers among non-correct responses
82.1%
Better than 52% of models compared
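The two AA-Omniscience figures use different denominators, which a short Python sketch can make concrete. It follows the one-line definitions given in this listing: accuracy is the share of all questions answered correctly, and hallucination rate is the share of incorrect answers among non-correct responses. The sketch assumes that every non-correct response is either an incorrect answer or an abstention. The function names and the counts are hypothetical and are not Artificial Analysis data.

```python
def accuracy(correct: int, incorrect: int, abstained: int) -> float:
    """Share of all questions that were answered correctly."""
    total = correct + incorrect + abstained
    return correct / total if total else 0.0


def hallucination_rate(incorrect: int, abstained: int) -> float:
    """Share of non-correct responses that were wrong answers.

    Assumes non-correct responses are either incorrect or abstentions.
    """
    non_correct = incorrect + abstained
    return incorrect / non_correct if non_correct else 0.0


# Hypothetical counts for illustration only.
correct, incorrect, abstained = 400, 450, 150
print(f"Accuracy: {accuracy(correct, incorrect, abstained):.1%}")
print(f"Hallucination rate: {hallucination_rate(incorrect, abstained):.1%}")
```

Under this reading, a lower hallucination rate is better: it means the model more often declines to answer when it does not know, rather than giving a wrong answer.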