Claude Opus 4.6 is Anthropic's newest Opus model with stronger coding and agentic performance, plus support for up to a 1M-token context window.
Added Feb 5, 2026
Context Window: 1.0M tokens
Max Output: 128.0K tokens
Input Price: $5.00 per 1M tokens
Output Price: $25.01 per 1M tokens
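As a rough illustration of how the listed per-token prices translate into a per-request cost, here is a minimal sketch. It assumes flat pricing at the rates shown above, with no caching discounts or tiered rates, since the listing does not mention any. The function name `estimate_cost` and the token counts are hypothetical.

```python
# Prices as listed above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 25.01


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming flat per-token pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
cost = estimate_cost(200_000, 4_000)
print(f"${cost:.4f}")
```

At these rates the output side dominates quickly: one output token costs about as much as five input tokens.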
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index: 46.5
Coding Index: 47.6
Agentic Index: 64.2
GPQA Diamond (graduate-level scientific reasoning): 84.0%, better than 87% of models compared
HLE (Humanity's Last Exam): 18.6%, better than 85% of models compared
IFBench (instruction-following benchmark): 44.6%, better than 55% of models compared
τ²-Bench Telecom (conversational AI agents in dual-control scenarios): 84.8%, better than 79% of models compared
AA-LCR (long context reasoning evaluation): 58.3%, better than 76% of models compared
GDPval-AA (economically valuable tasks): 54.6%, better than 97% of models compared
CritPt (research-level physics reasoning): 2.8%, better than 87% of models compared
SciCode (Python programming for scientific computing): 45.7%, better than 93% of models compared
Terminal-Bench Hard (agentic coding and terminal use): 48.5%, better than 96% of models compared
AA-Omniscience Accuracy (proportion of correctly answered questions): 45.2%, better than 96% of models compared
AA-Omniscience Hallucination Rate (rate of incorrect answers among non-correct responses): 76.0%, better than 68% of models compared
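The two AA-Omniscience figures above use different denominators: accuracy is a share of all questions, while the hallucination rate is a share of non-correct responses only. The sketch below combines them into shares of all questions. It assumes both metrics are computed over the same question set and that "non-correct" covers every response that is not correct (incorrect answers plus anything else, such as abstentions). The function name `implied_shares` is hypothetical.

```python
ACCURACY = 0.452            # AA-Omniscience Accuracy, as listed
HALLUCINATION_RATE = 0.760  # incorrect / non-correct responses, as listed


def implied_shares(accuracy: float, hallucination_rate: float) -> dict:
    """Split all questions into correct, incorrect, and other non-correct shares."""
    non_correct = 1.0 - accuracy
    incorrect = non_correct * hallucination_rate
    other_non_correct = non_correct - incorrect
    return {
        "correct": accuracy,
        "incorrect": incorrect,
        "other_non_correct": other_non_correct,
    }


shares = implied_shares(ACCURACY, HALLUCINATION_RATE)
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

Under these assumptions, roughly 41.6% of all questions receive an incorrect answer and about 13.2% fall into the remaining non-correct category.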
Last updated May 11, 2026