9to5MacJan 14, 2025

Meta accused of training its AI using pirated content from torrents

Read the full articleMeta accused of training its AI using pirated content from torrents on 9to5Mac

What Happened

A new day, a new controversy around artificial intelligence. This time, Meta has been accused of using pirated content from torrents to train its large language model (LLM) Llama, which powers Meta AI. The case was one of the first copyright lawsuits filed against a tech company for training AI.

Fordel's Take

this is the ugly truth we've been ignoring. they're training models on whatever garbage is floating around the net because the data is there, and copyright law hasn't caught up. it's a massive, exploitative problem baked into the LLM supply chain. they're using massive, unsecured datasets to build their moat, and it's a legal nightmare waiting to happen.

What To Do

we need clear, enforceable rules on data provenance for LLM training, or the entire field collapses into lawsuits.

Cited By

9to5Mac Meta accused of training its AI using pirated content from torrents

React

Newsletter

Get the weekly AI digest

The stories that matter, with a builder's perspective. Every Thursday.

Loading comments...