Monitor AI model performance over time. See how GPT, Claude, Gemini, and others compete as the landscape evolves.
Anthropic still leads text, but the top slot flipped back to Claude Opus 4.6 Thinking at 1502 Elo. On code, Claude Opus 4.7 Thinking still leads at 1566 Elo. Meta Muse Spark is now top five on text.
Read our analysis→LMArena Elo ratings by release date
← Swipe chart to explore · Tap any model for details →
Which company held the #1 spot over time
All tracked models ranked by Elo score
| Rank | Model | Score | Details |
|---|---|---|---|
| 1 | Claude Opus 4.6 Thinking Anthropic | 1502 | |
| 2 | Claude Opus 4.7 Thinking Anthropic | 1500 | |
| 3 | Claude Opus 4.6 Anthropic | 1498 | |
| 4 | Claude Opus 4.7 Anthropic | 1492 | |
| 5 | Muse Spark Meta | 1490 | |
| 6 | Gemini 3.1 Pro Preview Google | 1489 | |
| 7 | Gemini 3 Pro Google | 1486 | |
| 8 | GPT-5.5 High OpenAI | 1484 | |
| 9 | GPT-5.4 High OpenAI | 1479 | |
| 10 | Grok 4.20 Beta1 xAI | 1479 | |
| 11 | GPT-5.2 Chat Latest OpenAI | 1477 | |
| 12 | GPT-5.5 OpenAI | 1476 | |
| 13 | Grok 4.20 Beta 0309 Reasoning xAI | 1476 | |
| 14 | Grok 4.20 Multi-Agent Beta 0309 xAI | 1475 | |
| 15 | Gemini 3 Flash Google | 1473 | |
| 16 | Claude Opus 4.5 Thinking Anthropic | 1473 | |
| 17 | ERNIE 5.1 Baidu | 1472 | |
| 18 | GLM-5.1 Z.ai | 1472 | |
| 19 | GPT-5.5 Instant OpenAI | 1472 | |
| 20 | Claude Sonnet 4.6 Anthropic | 1468 |
Trusted Data Sources
As of May 13, 2026, Claude Opus 4.6 Thinking holds the #1 position on text with 1502 Elo, followed by Claude Opus 4.7 Thinking (1501) at #2 and Claude Opus 4.6 (1498) at #3. On code, Claude Opus 4.7 Thinking leads at 1566 Elo, ahead of Claude Opus 4.7 at 1559 and Claude Opus 4.6 Thinking at 1547.
An Elo score is a rating system where models compete head-to-head. Users compare responses without knowing which model produced them, and scores adjust based on wins and losses.
Leadership has changed multiple times. OpenAI led with GPT-4 through most of 2024. Google took the lead with Gemini 3 Pro in January 2026. Anthropic has led since February 2026, first with Claude Opus 4.6, then Claude Opus 4.7, and now Claude Opus 4.6 Thinking again after the May 2026 text reshuffle. Anthropic currently holds the top 4 text slots and the top 4 code slots.
We capture daily snapshots to track how rankings evolve over time. This reveals score drift as new models enter the arena.