Discover AI language models for conversations, coding, and creative writing
DeepSeek V4 Pro Cheaper
DeepSeek V4 Pro Cheaper is the same large-scale Mixture-of-Experts model from DeepSeek with a 1M-token context window, built for advanced reasoning, coding, long-horizon agent workflows, knowledge, math, and software engineering tasks. This variant is temporarily cheaper while DeepSeek has lowered its prices, and can route via Chinese providers, so data privacy is not guaranteed.
Benchmarks (Artificial Analysis)
Intelligence
51.5
Coding
47.5
Speed
30.9
Features
Context
1.0M
Max Output
384.0K
Date Added
Apr 25, 2026
Pricing
Input:
$0.44/1M
Output:
$0.87/1M
Est./msg:
$0.0009
Subscription
Included in subscription
DeepSeek V4 Pro Cheaper (Thinking)
DeepSeek V4 Pro Cheaper Thinking enables DeepSeek's chain-of-thought mode on the same large-scale Mixture-of-Experts model with a 1M-token context window, built for advanced reasoning, coding, long-horizon agent workflows, knowledge, math, and software engineering tasks. This variant is temporarily cheaper while DeepSeek has lowered its prices, and can route via Chinese providers, so data privacy is not guaranteed.
Benchmarks (Artificial Analysis)
Intelligence
51.5
Coding
47.5
Speed
30.9
Features
Context
1.0M
Max Output
384.0K
Date Added
Apr 25, 2026
Pricing
Input:
$0.44/1M
Output:
$0.87/1M
Est./msg:
$0.0009
Subscription
Included in subscription
DeepSeek V4 Pro TEE
DeepSeek V4 Pro in a Trusted Execution Environment (TEE) with attestation support. Built for advanced reasoning, coding, long-horizon agent workflows, knowledge, math, and software engineering tasks.
Benchmarks (Artificial Analysis)
Intelligence
51.5
Coding
47.5
Speed
30.9
Features
Context
800.0K
Max Output
65.5K
Date Added
Apr 25, 2026
Pricing
Input:
$1.50/1M
Output:
$5.25/1M
Est./msg:
$0.0041
Subscription
Not included in subscription
DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with a 1M-token context window, built for fast inference, high-throughput workloads, reasoning, coding, and agent workflows.
Benchmarks (Artificial Analysis)
Intelligence
46.5
Coding
38.7
Speed
62.7
Features
Context
1.0M
Max Output
384.0K
Date Added
Apr 24, 2026
Pricing
Input:
$0.14/1M
Output:
$0.28/1M
Est./msg:
$0.0003
Subscription
Included in subscription
DeepSeek V4 Flash (Thinking)
DeepSeek V4 Flash Thinking enables DeepSeek's reasoning mode on the efficiency-optimized Mixture-of-Experts model with a 1M-token context window, built for fast inference, high-throughput workloads, reasoning, coding, and agent workflows.
Benchmarks (Artificial Analysis)
Intelligence
46.5
Coding
38.7
Speed
62.7
Features
Context
1.0M
Max Output
384.0K
Date Added
Apr 24, 2026
Pricing
Input:
$0.14/1M
Output:
$0.28/1M
Est./msg:
$0.0003
Subscription
Included in subscription
DeepSeek V4 Pro
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with a 1M-token context window, built for advanced reasoning, coding, long-horizon agent workflows, knowledge, math, and software engineering tasks.
Benchmarks (Artificial Analysis)
Intelligence
51.5
Coding
47.5
Speed
30.9
Features
Context
1.0M
Max Output
384.0K
Date Added
Apr 24, 2026
Pricing
Input:
$1.74/1M
Output:
$3.48/1M
Est./msg:
$0.0035
Subscription
Included in subscription
· Uses 2x tokens per request
DeepSeek V4 Pro (Thinking)
DeepSeek V4 Pro Thinking enables DeepSeek's chain-of-thought mode on the large-scale Mixture-of-Experts model with a 1M-token context window, built for advanced reasoning, coding, long-horizon agent workflows, knowledge, math, and software engineering tasks.
Benchmarks (Artificial Analysis)
Intelligence
51.5
Coding
47.5
Speed
30.9
Features
Context
1.0M
Max Output
384.0K
Date Added
Apr 24, 2026
Pricing
Input:
$1.74/1M
Output:
$3.48/1M
Est./msg:
$0.0035
Subscription
Included in subscription
· Uses 2x tokens per request
GPT 5.5
GPT-5.5 is OpenAI's smartest and most intuitive model yet, built for agentic coding, computer use, and professional knowledge work with stronger reasoning and token efficiency.
Benchmarks (Artificial Analysis)
Intelligence
40.9
Coding
48.6
Speed
56.8
Features
Context
1.0M
Max Output
128.0K
Date Added
Apr 23, 2026
Pricing
Input:
$5.00/1M
Output:
$30.00/1M
Cache:
Read $0.50/1M
Est./msg:
$0.0200
Subscription
Not included in subscription
Qwen3.6 27B
Qwen3.6 27B is a native vision-language dense model with stronger agentic coding and STEM reasoning than Qwen 3.5 27B. It also improves spatial intelligence (including object localization/detection), plus video understanding, document OCR, and visual-agent workflows.
Benchmarks (Artificial Analysis)
Intelligence
37.1
Coding
26.6
Speed
59.9
Features
Context
260.1K
Max Output
65.5K
Date Added
Apr 23, 2026
Pricing
Input:
$0.60/1M
Output:
$3.60/1M
Est./msg:
$0.0024
Subscription
Included in subscription
Ling 2.6 1T
Ling-2.6-1T is an inclusionAI instruction model optimized for large-scale agentic and coding workloads with long-context support and structured output capabilities.
Benchmarks (Artificial Analysis)
Intelligence
33.6
Coding
33.0
Features
Context
262.1K
Max Output
32.8K
Date Added
Apr 23, 2026
Pricing
Input:
$1.00/1M
Output:
$3.00/1M
Est./msg:
$0.0025
Subscription
Not included in subscription
Tencent: Hy3 preview
Hy3 Preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels to balance speed and depth, with strong code generation performance in multi-step workflows.
Benchmarks (Artificial Analysis)
Intelligence
33.7
Coding
34.3
Speed
130.1
Context
262.1K
Max Output
262.1K
Date Added
Apr 23, 2026
Pricing
Input:
$0.30/1M
Output:
$1.20/1M
Est./msg:
$0.0009
Subscription
Included in subscription
Trinity Large Preview
Open source preview release in Arcee's Trinity Large family with a 262K context window, strong tool use, and production-grade latency for agent workflows.
Benchmarks (Artificial Analysis)
Intelligence
31.9
Coding
27.2
Speed
131.5
Features
Context
262.1K
Max Output
80.0K
Date Added
Apr 22, 2026
Pricing
Input:
$0.25/1M
Output:
$1.00/1M
Est./msg:
$0.0008
Subscription
Included in subscription
MiMo V2.5
MiMo V2.5 is Xiaomi's full-modal understanding model for agent workflows. It supports deep reasoning, tool calling, structured outputs, and web search with up to 1M context.
Benchmarks (Artificial Analysis)
Intelligence
49.0
Coding
42.1
Speed
96.0
Features
Context
1.0M
Max Output
131.1K
Date Added
Apr 22, 2026
Pricing
Input:
$0.40/1M
Output:
$2.00/1M
Est./msg:
$0.0014
Subscription
Not included in subscription
MiMo V2.5 Pro
MiMo V2.5 Pro is Xiaomi's long-context flagship general model for coding and agentic orchestration. It supports reasoning, tool calling, and structured outputs with up to 1M context.
Benchmarks (Artificial Analysis)
Intelligence
53.8
Coding
45.5
Speed
59.9
Features
Context
1.0M
Max Output
131.1K
Date Added
Apr 22, 2026
Pricing
Input:
$1.00/1M
Output:
$3.00/1M
Est./msg:
$0.0025
Subscription
Not included in subscription
Qwen3 Coder Next TEE
Qwen3 Coder Next is optimized for coding agents and local development workflows with a sparse 80B MoE architecture. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Features
Context
262.1K
Max Output
65.5K
Date Added
Apr 21, 2026
Pricing
Input:
$0.18/1M
Output:
$1.20/1M
Est./msg:
$0.0008
Subscription
Not included in subscription
Kimi K2.6 TEE
Kimi K2.6 is Moonshot AI's next-generation multimodal model designed for long-horizon coding and agentic orchestration. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Features
Context
262.1K
Max Output
65.5K
Date Added
Apr 21, 2026
Pricing
Input:
$1.50/1M
Output:
$5.25/1M
Est./msg:
$0.0041
Subscription
Not included in subscription
Ling 2.6 Flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency. It delivers performance comparable to state-of-the-art models at a similar scale while significantly reducing token usage across coding, document processing, and lightweight agent workflows.
Benchmarks (Artificial Analysis)
Intelligence
26.2
Coding
23.2
Features
Context
262.1K
Max Output
32.8K
Date Added
Apr 21, 2026
Pricing
Input:
$1.00/1M
Output:
$3.00/1M
Est./msg:
$0.0025
Subscription
Not included in subscription
Qwen3.6 Max Preview
Qwen3.6 Max Preview is Alibaba's flagship Qwen 3.6 model for complex tasks. It supports both thinking and non-thinking modes in a single model id.
Benchmarks (Artificial Analysis)
Intelligence
51.8
Coding
44.9
Speed
37.8
Context
245.8K
Max Output
65.5K
Date Added
Apr 20, 2026
Pricing
Input:
$1.30/1M
Output:
$7.80/1M
Est./msg:
$0.0052
Subscription
Not included in subscription
MiniMax M2.5 TEE
MiniMax M2.5 is a productivity-focused flagship model with stronger coding and office workflow performance (Word, Excel, PowerPoint), plus improved tool planning and token efficiency. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Benchmarks (Artificial Analysis)
Intelligence
41.9
Coding
37.4
Speed
89.1
Features
Context
196.6K
Max Output
131.1K
Date Added
Apr 20, 2026
Pricing
Input:
$0.20/1M
Output:
$1.38/1M
Est./msg:
$0.0009
Subscription
Not included in subscription
GLM 5.1 TEE
GLM-5.1 is Z.AI's next-gen model optimized for long-horizon agent workflows and deep reasoning. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Benchmarks (Artificial Analysis)
Intelligence
51.4
Coding
43.4
Speed
55.8
Context
202.8K
Max Output
65.5K
Date Added
Apr 20, 2026
Pricing
Input:
$1.50/1M
Output:
$5.25/1M
Est./msg:
$0.0041
Subscription
Not included in subscription
GLM 5.1 Thinking TEE
GLM-5.1 with extended reasoning mode for multi-step planning and tool-heavy workflows. Running inside a TEE (Trusted Execution Environment), with provider attestation support.
Benchmarks (Artificial Analysis)
Intelligence
51.4
Coding
43.4
Speed
55.8
Features
Context
202.8K
Max Output
65.5K
Date Added
Apr 20, 2026
Pricing
Input:
$1.50/1M
Output:
$5.25/1M
Est./msg:
$0.0041
Subscription
Not included in subscription
MiMo V2 Flash TEE
Xiaomi MiMo V2 Flash (309B MoE, 15B active) served with TEE-backed inference and attestation support.
Context
262.1K
Max Output
32.8K
Date Added
Apr 20, 2026
Pricing
Input:
$0.10/1M
Output:
$0.30/1M
Est./msg:
$0.0003
Subscription
Not included in subscription
Qwen3.6 35B A3B Thinking
Qwen3.6 35B A3B is a native vision-language MoE model with hybrid attention. Compared to Qwen3.5 35B A3B, Alibaba reports stronger agentic coding, mathematical and code reasoning, and better spatial understanding (including object localization and detection).
Benchmarks (Artificial Analysis)
Intelligence
43.5
Coding
35.1
Speed
186.5
Features
Context
262.1K
Max Output
16.4K
Date Added
Apr 19, 2026
Pricing
Input:
$0.20/1M
Output:
$1.00/1M
Est./msg:
$0.0007
Subscription
Included in subscription
Qwen3.6 35B A3B
Qwen3.6 35B A3B is a native vision-language MoE model with hybrid attention. Compared to Qwen3.5 35B A3B, Alibaba reports stronger agentic coding, mathematical and code reasoning, and better spatial understanding (including object localization and detection).
Benchmarks (Artificial Analysis)
Intelligence
31.5
Coding
17.6
Speed
180.8
Features
Context
262.1K
Max Output
16.4K
Date Added
Apr 17, 2026
Pricing
Input:
$0.20/1M
Output:
$1.00/1M
Est./msg:
$0.0007
Subscription
Included in subscription
Qwen3.6 Flash
Qwen3.6 Flash is Alibaba's fast native vision-language model in the Qwen 3.6 family. It improves over 3.5 Flash with stronger coding/agent performance and better spatial intelligence, including object localization and detection.
Features
Context
991.8K
Max Output
65.5K
Date Added
Apr 17, 2026
Pricing
Input:
$0.19/1M
Output:
$1.16/1M
Cache:
Read $0.02/1M · Write $0.24/1M (5m) / $0.38/1M (1h)
Est./msg:
$0.0008
Subscription
Not included in subscription
Kimi K2.6
Kimi K2.6 is an open-source, native multimodal agentic model built for long-horizon coding, coding-driven design, and large-scale task orchestration. It can turn simple prompts and visual inputs into production-ready interfaces and full-stack workflows, and is designed to coordinate complex multi-agent plans with thousands of steps across code, documents, and spreadsheets.
Benchmarks (Artificial Analysis)
Intelligence
53.9
Coding
47.1
Speed
41.2
Features
Context
256.0K
Max Output
65.5K
Date Added
Apr 16, 2026
Pricing
Input:
$0.50/1M
Output:
$2.60/1M
Est./msg:
$0.0018
Subscription
Included in subscription
· Uses 2x tokens per request
View Providers
Kimi K2.6 Thinking
Kimi K2.6 Thinking is the reasoning-optimized K2.6 variant for deeper multi-step planning and execution. It is tuned for long-horizon coding and design workflows, including complex orchestration across many specialized sub-agents and autonomous end-to-end output generation.
Benchmarks (Artificial Analysis)
Intelligence
53.9
Coding
47.1
Speed
41.2
Features
Context
256.0K
Max Output
65.5K
Date Added
Apr 16, 2026
Pricing
Input:
$0.50/1M
Output:
$2.60/1M
Est./msg:
$0.0018
Subscription
Included in subscription
· Uses 2x tokens per request
View Providers
Claude 4.7 Opus
Claude Opus 4.7 is a major upgrade for advanced software engineering, long-running complex tasks, and high-resolution vision understanding.
Benchmarks (Artificial Analysis)
Intelligence
51.8
Coding
53.1
Speed
48.9
Features
Context
1.0M
Max Output
128.0K
Date Added
Apr 16, 2026
Pricing
Input:
$5.00/1M
Output:
$25.01/1M
Cache:
Read $0.50/1M · Write $6.25/1M (5m) / $10.00/1M (1h)
Est./msg:
$0.0175
Subscription
Not included in subscription
Claude 4.7 Opus Thinking
Claude Opus 4.7 with thinking enabled (default budget: 16k tokens).
Benchmarks (Artificial Analysis)
Intelligence
57.3
Coding
52.5
Speed
65.0
Features
Context
1.0M
Max Output
128.0K
Date Added
Apr 16, 2026
Pricing
Input:
$5.00/1M
Output:
$25.01/1M
Cache:
Read $0.50/1M · Write $6.25/1M (5m) / $10.00/1M (1h)
Est./msg:
$0.0175
Subscription
Not included in subscription
Step 3.5 Flash 2603
Step 3.5 Flash 2603 is optimized for high-frequency agentic and coding workflows with improved token efficiency and faster reasoning. NOTE: This model runs via StepFun, which may log and train on your prompts.
Features
Context
256.0K
Max Output
256.0K
Date Added
Apr 14, 2026
Pricing
Input:
$0.10/1M
Output:
$0.30/1M
Est./msg:
$0.0003
Subscription
Included in subscription