AllToken/Model Rankings·7 Days
Decisions, quantified.
Top models ranked by request volume and token usage
Model Leaderboard
Ranked by routed token usage in the last 7 Days.01
Qwen 3.6 Plus
qwen3.6-plus
53.7M
aliyun-dashscope
↑ 950040.0%
View →
02
DeepSeek V4 Pro
deepseek-v4-pro
34.9M
deepseek
↓ 72.0%
View →
03
GLM 5.1
glm-5.1
8.3M
tencent-tokenhub
↑ 37370.0%
View →
04
Claude Sonnet 4.6
claude-sonnet-4-6
4.3M
anthropic
↑ 149608.0%
View →
05
DeepSeek V3.2
deepseek-v3.2
4.2M
deepseek
↑ 97442.0%
View →
06
DeepSeek V4 Flash
deepseek-v4-flash
685.2K
deepseek
↓ 94.0%
View →
07
Claude Opus 4.7
claude-opus-4-7
345.4K
anthropic
↑ 376.0%
View →
08
GPT-5.4 Nano
gpt-5.4-nano
224.5K
openai
↑ 88645.0%
View →
09
GLM 4.7 Flash
glm-4.7-flash
124.5K
bigmodel
↓ 98.0%
View →
10
GPT-5.5
gpt-5.5
76.2K
openai
↓ 94.0%
View →
Market Share
Provider share across routed token usage.Benchmarks
Intelligence IndexIntelligence
Composite benchmark index
01Claude Opus 4.6anthropic96.8
02Claude Sonnet 4.6anthropic94.5
03MiniMax M2.7tencent-tokenhub92.3
04MiniMax M2.5tencent-tokenhub89.7
05Claude Haiku 4.5anthropic87.2
06Seed 2.0 LiteUnknown85.1
07MiniMax M2.7 HighspeedUnknown84.6
08MiniMax M2.5 Highspeedminimax-official83.9
Coding
Agentic coding score
01Claude Sonnet 4.6anthropic94.1
02Claude Opus 4.6anthropic93.5
03MiniMax M2.7tencent-tokenhub88.6
04MiniMax M2.5tencent-tokenhub85.4
05Claude Haiku 4.5anthropic82.3
06Seed 2.0 LiteUnknown78.9
Math
Mathematical reasoning
01Claude Opus 4.6anthropic97.3
02MiniMax M2.7tencent-tokenhub94.8
03Claude Sonnet 4.6anthropic93.6
04MiniMax M2.5tencent-tokenhub90.2
05Claude Haiku 4.5anthropic85.5
06Seed 2.0 LiteUnknown82.7
Knowledge
MMLU benchmark score
01Claude Opus 4.6anthropic95.2
02Claude Sonnet 4.6anthropic93.8
03MiniMax M2.7tencent-tokenhub91.5
04MiniMax M2.5tencent-tokenhub88.3
05Claude Haiku 4.5anthropic86.7
06Seed 2.0 LiteUnknown84.2
Fastest Models
Highest Throughput| Model | Highest Throughput | Lowest Latency | Time to First Token |
|---|---|---|---|
MiniMax M2.7 Highspeed Unknown | 312.5 t/s | 1520 ms | 350 ms |
MiniMax M2.5 Highspeed minimax-official | 298.8 t/s | 1280 ms | 320 ms |
Claude Haiku 4.5 anthropic | 245.6 t/s | 850 ms | 180 ms |
MiniMax M2.7 tencent-tokenhub | 178.3 t/s | 2350 ms | 560 ms |
MiniMax M2.5 tencent-tokenhub | 165.4 t/s | 1780 ms | 480 ms |
Claude Sonnet 4.6 anthropic | 142.8 t/s | 1450 ms | 420 ms |
Seed 2.0 Lite Unknown | 118.5 t/s | 3200 ms | 920 ms |
Claude Opus 4.6 anthropic | 86.2 t/s | 2100 ms | 680 ms |
Categories
Ranked leaders by use case.01
Qwen 3.6 Plus
qwen3.6-plus
53.7M
aliyun-dashscope
↑ 950040.0%
View →
02
DeepSeek V4 Pro
deepseek-v4-pro
34.9M
deepseek
↓ 72.0%
View →
03
GLM 5.1
glm-5.1
8.3M
tencent-tokenhub
↑ 37370.0%
View →
04
Claude Sonnet 4.6
claude-sonnet-4-6
4.3M
anthropic
↑ 149608.0%
View →
05
DeepSeek V3.2
deepseek-v3.2
4.2M
deepseek
↑ 97442.0%
View →
06
DeepSeek V4 Flash
deepseek-v4-flash
685.2K
deepseek
↓ 94.0%
View →
07
Claude Opus 4.7
claude-opus-4-7
345.4K
anthropic
↑ 376.0%
View →
08
GPT-5.4 Nano
gpt-5.4-nano
224.5K
openai
↑ 88645.0%
View →
09
GLM 4.7 Flash
glm-4.7-flash
124.5K
bigmodel
↓ 98.0%
View →
10
GPT-5.5
gpt-5.5
76.2K
openai
↓ 94.0%
View →