The Atlantic uncovers millions of copyrighted songs in AI training data

· Source: Dataconomy · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Intellectual Property & Patents · Depth: Fundamental Awareness, quick

Summary

An investigation by The Atlantic has uncovered that millions of copyrighted songs, including tracks from artists like Taylor Swift and Bad Bunny, have been used to train AI music models. The publication developed four searchable databases, collectively containing 12 million, 9 million, and two additional sets of approximately 100,000 songs each, detailing this extensive use. Legal actions are currently underway against generative AI music platforms such as Suno and Udio, which are asserting fair use as a defense. This follows a book publishing lawsuit where piracy allegations proved more effective than copyright claims, leading to an initial \$1.5 billion settlement. The music industry may utilize The Atlantic's databases for future copyright infringement lawsuits, while streaming services are implementing varied measures to identify and label AI-generated music, amidst exploitation by scammers creating imitations.

Key takeaway

For legal professionals advising on generative AI music platforms, you must recognize the significant copyright infringement risks associated with training data. The Atlantic's findings highlight that millions of copyrighted songs are in use, making fair use defenses challenging. Consider the precedent from book publishing lawsuits where piracy claims proved more effective, and prepare for rigorous scrutiny of your model's training datasets to mitigate potential multi-billion dollar liabilities.

Key insights

Millions of copyrighted songs are embedded in AI music training data, fueling ongoing legal disputes over fair use.

Principles

In practice

Topics

Best for: CTO, VP of Engineering/Data, Executive, Legal Professional, Director of AI/ML, Tech Journalist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Dataconomy.