/
Navigation
C
Chronicles
Browse all articles
C
E
Explore
Semantic exploration
E
R
Research
Entity momentum
R
N
Nexus
Correlations & relationships
N
~
Story Arc
Topic evolution
S
Drift Map
Semantic trajectory animation
D
P
Posts
Analysis & commentary
P
Browse
@
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
?
Concept Search
Semantic similarity search
!
High Impact Stories
Top coverage by position
+
Sentiment Analysis
Positive/negative coverage
*
Anomaly Detection
Unusual coverage patterns
Analysis
vs
Rivalry Report
Compare two entities head-to-head
/\
Semantic Pivots
Narrative discontinuities
!!
Crisis Response
Event recovery patterns
Connected
Nav: C E R N
Search: /
Command: ⌘K
Embeddings: large
Company

Chatbot Arena

4 articles accelerating
Articles
4
mentions
Velocity
+100.0%
growth rate
Acceleration
+1.000
velocity change
Sources
3
publications

Coverage Timeline

2025-05-01
TechCrunch 7 related

A study from Cohere, Stanford, MIT, and Ai2 accuses LMArena of helping Meta, OpenAI, Google, and Amazon game its popular crowdsourced AI benchmark Chatbot Arena

A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI …

2025-04-18
Bloomberg 6 related

LMArena says it's starting a company, whose corporate name will be Arena Intelligence, with plans to raise money, and releases a new beta version of its website

fixing errors/bugs, improving our UI layout, and more.  To keep supporting the development and continual improvement of this platform, we're also forming a company.  Future improvements will continue ...

2024-09-08
TechCrunch

A look at LMSYS' Chatbot Arena and the issues surrounding the crowdsourced LLM benchmark platform, including biases, lack of transparency, and commercial ties

Kyle Wiggers / TechCrunch : X: @woojinrad X: Woojin Kim / @woojinrad : The AI industry is obsessed with Chatbot Arena, but it might not be the best benchmark | @TechCrunch Human raters bring their bi...

2024-03-28
Ars Technica 21 related

Anthropic's Claude 3 Opus surpassed OpenAI's GPT-4 on Chatbot Arena, a crowdsourced LLM leaderboard used by AI researchers; GPT-4 has been first since launch

Anthropic's Claude 3 is first to unseat GPT-4 since launch of Chatbot Arena in May '23.  —  On Tuesday, Anthropic's Claude 3 …

Loading articles...

Quarterly Coverage

Top Sources

Narrative

Loading narrative...

Relationships

Loading graph...