Skip to main content
Back to Pulse
TechCrunch

Microsoft built a fake marketplace to test AI agents — they failed in surprising ways

Read the full articleMicrosoft built a fake marketplace to test AI agents — they failed in surprising ways on TechCrunch

What Happened

The research raises new questions about how well AI agents will perform when working unsupervised — and how quickly AI companies can make good on promises of an agentic future.

Our Take

Finally—someone publicly admitting that AI agents aren't ready. Microsoft's marketplace simulation should've been easy. Instead, agents got confused, made bad decisions, got stuck in loops. That's actual important data.

Look, the hype cycle needs a real failure. This is it. The agents didn't scale to unsupervised tasks. Autonomy isn't just better LLMs—it's a different problem entirely. This sets back the "agentic future" narrative by 18 months, minimum.

What To Do

Don't bet your business on unsupervised AI agents yet.

Cited By

React

Loading comments...