The Atlantic uncovers millions of copyrighted songs in AI training data
Summary
An investigation by The Atlantic has uncovered that millions of copyrighted songs, including tracks from artists like Taylor Swift and Bad Bunny, have been used to train AI music models. The publication developed four searchable databases, collectively containing 12 million, 9 million, and two additional sets of approximately 100,000 songs each, detailing this extensive use. Legal actions are currently underway against generative AI music platforms such as Suno and Udio, which are asserting fair use as a defense. This follows a book publishing lawsuit where piracy allegations proved more effective than copyright claims, leading to an initial \$1.5 billion settlement. The music industry may utilize The Atlantic's databases for future copyright infringement lawsuits, while streaming services are implementing varied measures to identify and label AI-generated music, amidst exploitation by scammers creating imitations.
Key takeaway
For legal professionals advising on generative AI music platforms, you must recognize the significant copyright infringement risks associated with training data. The Atlantic's findings highlight that millions of copyrighted songs are in use, making fair use defenses challenging. Consider the precedent from book publishing lawsuits where piracy claims proved more effective, and prepare for rigorous scrutiny of your model's training datasets to mitigate potential multi-billion dollar liabilities.
Key insights
Millions of copyrighted songs are embedded in AI music training data, fueling ongoing legal disputes over fair use.
Principles
- Fair use is a primary defense in AI training data lawsuits.
- Piracy allegations may gain more traction than direct copyright claims in AI training data cases.
In practice
- The Atlantic's databases offer resources for music industry copyright lawsuits.
- Streaming services are implementing varied measures against AI-generated music.
Topics
- AI Music Models
- Copyright Infringement
- Fair Use
- Generative AI
- Training Data
- Music Industry Lawsuits
Best for: CTO, VP of Engineering/Data, Executive, Legal Professional, Director of AI/ML, Tech Journalist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Dataconomy.