TechCrunch

Crowdsourced AI benchmarks have serious flaws, some experts say

Read the full article, "Crowdsourced AI benchmarks have serious flaws, some experts say," on TechCrunch

What Happened

AI labs are increasingly relying on crowdsourced benchmarking platforms such as Chatbot Arena to probe the strengths and weaknesses of their latest models. But some experts say that there are serious problems with this approach from an ethical and academic perspective. Over the past few years, labs

Our Take

We are tracking this story. Our take is coming soon.

What To Do

Check back for our analysis.
