AllToken / Model Rankings · 7 Days

Decisions, quantified.

Top models ranked by request volume and token usage

Model Leaderboard

Ranked by routed token usage over the last 7 days.

DeepSeek V4 Flash

deepseek-v4-flash

DeepSeek V4 Pro

deepseek-v4-pro

tencent-tokenhub

tencent-tokenhub

aliyun-dashscope

tencent-tokenhub

Provider share across routed token usage.

Benchmarks

Intelligence Index

Intelligence

Composite benchmark index

Rank	Model	Provider	Score
01	Claude Opus 4.6	anthropic	96.8
02	Claude Sonnet 4.6	anthropic	94.5
03	MiniMax M2.7	tencent-tokenhub	92.3
04	MiniMax M2.5	tencent-tokenhub	89.7
05	Claude Haiku 4.5	anthropic	87.2
06	Seed 2.0 Lite	Unknown	85.1
07	MiniMax M2.7 Highspeed	minimax-official	84.6
08	MiniMax M2.5 Highspeed	minimax-official	83.9

Coding

Agentic coding score

Rank	Model	Provider	Score
01	Claude Sonnet 4.6	anthropic	94.1
02	Claude Opus 4.6	anthropic	93.5
03	MiniMax M2.7	tencent-tokenhub	88.6
04	MiniMax M2.5	tencent-tokenhub	85.4
05	Claude Haiku 4.5	anthropic	82.3
06	Seed 2.0 Lite	Unknown	78.9

Math

Mathematical reasoning

Rank	Model	Provider	Score
01	Claude Opus 4.6	anthropic	97.3
02	MiniMax M2.7	tencent-tokenhub	94.8
03	Claude Sonnet 4.6	anthropic	93.6
04	MiniMax M2.5	tencent-tokenhub	90.2
05	Claude Haiku 4.5	anthropic	85.5
06	Seed 2.0 Lite	Unknown	82.7

Knowledge

MMLU benchmark score

Rank	Model	Provider	Score
01	Claude Opus 4.6	anthropic	95.2
02	Claude Sonnet 4.6	anthropic	93.8
03	MiniMax M2.7	tencent-tokenhub	91.5
04	MiniMax M2.5	tencent-tokenhub	88.3
05	Claude Haiku 4.5	anthropic	86.7
06	Seed 2.0 Lite	Unknown	84.2

Fastest Models

Highest Throughput

Model	Provider	Throughput	Latency	Time to First Token
MiniMax M2.7 Highspeed	minimax-official	312.5 t/s	1520 ms	350 ms
MiniMax M2.5 Highspeed	minimax-official	298.8 t/s	1280 ms	320 ms
Claude Haiku 4.5	anthropic	245.6 t/s	850 ms	180 ms
MiniMax M2.7	tencent-tokenhub	178.3 t/s	2350 ms	560 ms
MiniMax M2.5	tencent-tokenhub	165.4 t/s	1780 ms	480 ms
Claude Sonnet 4.6	anthropic	142.8 t/s	1450 ms	420 ms
Seed 2.0 Lite	Unknown	118.5 t/s	3200 ms	920 ms
Claude Opus 4.6	anthropic	86.2 t/s	2100 ms	680 ms
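
The table above is ordered by throughput, but the leader changes depending on the metric. The following is a minimal Python sketch, not part of the page itself, that re-ranks the same rows by each column. The names `ROWS`, `METRICS`, and `rank_by` are hypothetical, and the assumption that higher is better for throughput while lower is better for latency and time to first token is inferred from the column labels.

```python
# Rows transcribed from the Fastest Models table:
# (model, provider, throughput in t/s, latency in ms, time to first token in ms)
ROWS = [
    ("MiniMax M2.7 Highspeed", "minimax-official", 312.5, 1520, 350),
    ("MiniMax M2.5 Highspeed", "minimax-official", 298.8, 1280, 320),
    ("Claude Haiku 4.5", "anthropic", 245.6, 850, 180),
    ("MiniMax M2.7", "tencent-tokenhub", 178.3, 2350, 560),
    ("MiniMax M2.5", "tencent-tokenhub", 165.4, 1780, 480),
    ("Claude Sonnet 4.6", "anthropic", 142.8, 1450, 420),
    ("Seed 2.0 Lite", "Unknown", 118.5, 3200, 920),
    ("Claude Opus 4.6", "anthropic", 86.2, 2100, 680),
]

# Hypothetical metric names mapped to (column index, higher_is_better).
# Assumption: throughput is "higher is better"; latency and TTFT are "lower is better".
METRICS = {
    "throughput": (2, True),
    "latency": (3, False),
    "ttft": (4, False),
}


def rank_by(metric):
    """Return model names ordered from best to worst on the given metric."""
    column, higher_is_better = METRICS[metric]
    ordered = sorted(ROWS, key=lambda row: row[column], reverse=higher_is_better)
    return [row[0] for row in ordered]


if __name__ == "__main__":
    for metric in METRICS:
        print(metric, "leader:", rank_by(metric)[0])
```

On these figures, MiniMax M2.7 Highspeed leads on throughput, while Claude Haiku 4.5 leads on both latency and time to first token.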

Categories

Ranked leaders by use case.

DeepSeek V4 Flash

deepseek-v4-flash

DeepSeek V4 Pro

deepseek-v4-pro

tencent-tokenhub

tencent-tokenhub

aliyun-dashscope

tencent-tokenhub