BREAKING
Agent Arena Maps Token Efficiency
More Tokens Don't Mean More Quality
Net Improvement by Model
Claude Fable 5
14
Opus 4.8
9.2
GPT-5.5
8.04
GLM-5.2
5.1
Efficient vs Wasteful Agents
GPT-5.5
Frontier
●
High gains, fewer tokens
●
On efficiency frontier
Grok Build 0.1
Negative
●
Over 20K tokens used
●
Negative improvement
0
sessions
0
tasks
0
models
Efficiency Shapes Real-World Use
AI NEWS BLITZ
arena.ai just published Agent Arena data comparing how efficiently AI agents spend tokens.