Skip to main content
Back to Pulse
TechCrunch

Study accuses LM Arena of helping top AI labs game its benchmark

Read the full articleStudy accuses LM Arena of helping top AI labs game its benchmark on TechCrunch

What Happened

A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some

Our Take

We are tracking this story. Our take is coming soon.

What To Do

Check back for our analysis.

Cited By

React

Loading comments...