TechCrunch
Study accuses LM Arena of helping top AI labs game its benchmark
What Happened
A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some