
"Quintessential Fair Use" or Wholesale Theft? Inside the AI Music Lawsuits
This episode explores the heated legal disputes between AI music generators like Suno and Udio and the RIAA over alleged copyright infringement. It dissects the "fair use" defense, which frames AI training as learning, contrasting it with evidence suggesting AI models may be memorizing and replicating copyrighted material, including producers' watermarks. Listeners will learn about the significant financial stakes and the fundamental conflict between AI developers' data needs and creators' intellectual property rights.
Key Takeaways
- AI music generators like Suno and Udio face major lawsuits from the RIAA for alleged copyright infringement, arguing their training processes constitute "quintessential fair use."
- Forensic evidence, including the reproduction of producer audio watermarks and near-exact musical replicas, directly challenges AI companies' claims of merely "learning" patterns.
- Legal scholarship suggests that AI models themselves can be considered infringing "copies" during the training phase, shifting the legal focus from output to the model's internal architecture.
- The "build first, clear later" business model, prevalent in the AI sector, prioritizes rapid development and scaling over upfront intellectual property licensing.
- These music industry lawsuits are pivotal test cases that will establish precedents for the future of all intellectual property and the value of human creative labor in the age of AI.
Detailed Report
AI music generators, such as Suno and Udio, are currently embroiled in significant lawsuits initiated by the Recording Industry Association of America (RIAA). These lawsuits allege that the AI companies have engaged in mass infringement of copyrighted sound recordings by using them without permission to train their models. The AI developers' core defense hinges on the argument that their training process constitutes "quintessential fair use," likening their models to human students learning from vast amounts of information.
The "Fair Use" Defense Under Scrutiny
AI companies assert that the copying of copyrighted music occurs in a non-public, "back-end" process, where systems learn statistical patterns unseen by human eyes. Suno's CEO, Mikey Shulman, has framed this process as akin to a "kid writing their own rock songs after listening to the genre," attempting to anthropomorphize the AI and downplay the industrial scale of data ingestion. However, Shulman has also acknowledged that training on copyrighted music is "stock standard" practice across the AI industry.
Evidence of Direct Copying and Memorization
Forensic evidence increasingly challenges the narrative of invisible learning. Ed Newton-Rex, formerly of Stability AI and now leading the nonprofit Fairly Trained, has published detailed analyses demonstrating that these models are not just mimicking styles but are capable of "regurgitating" near-exact melodies, harmonies, and chord progressions from well-known songs. Examples include Suno-generated tracks bearing striking resemblances to works by artists like ABBA, Oasis, and Eminem, even when specific artist names are misspelled to bypass filters.
The "Smoking Gun": Audio Watermarks
Perhaps the most compelling evidence against the "invisible back-end" defense is the documented presence of producer audio watermarks (e.g., "CashMoneyAP," "Jason Derulo") in AI-generated music outputs. These embedded audio clips are used by producers to protect their instrumental tracks. Their appearance in AI-generated content serves as direct proof that the models were trained on the original, watermarked audio files, indicating wholesale ingestion and reproduction rather than abstract pattern learning.
Redefining Copying: Memorization vs. Regurgitation
Academic research further reframes the legal discussion. A 2025 *Chicago-Kent Law Review* paper, "The Files are in the Computer: On Copyright, Memorization, and Generative AI," distinguishes between "memorization" and "regurgitation." "Memorization" refers to the process where a model encodes near-exact copies of its training data into its parameters, suggesting the model itself can be considered an infringing "copy." "Regurgitation" is the subsequent generation of a near-exact copy from that memorized data. This research argues that the act of copying happens during training, embedding potential infringement within the AI's architecture, thereby challenging the defense that infringement only occurs at the output stage.
The "Build First, Clear Later" Business Model
The AI sector's prevailing ethos, often fueled by venture capital, has been to "build first, clear later." This approach prioritizes rapid development and scaling of technologies over securing upfront licensing deals for copyrighted material. An early investor in Suno reportedly stated they might not have funded the company if it had pursued licensing from the outset, believing it would stifle innovation. Despite ongoing legal battles, Suno is reportedly negotiating a new funding round that could value the company at over $2 billion, suggesting a financial bet that the cost of settlement will be less than the value generated by this approach. This business model has galvanized a broad coalition of creators under campaigns like "Stealing Isn't Innovation," advocating for responsible AI development through licensing and partnerships.
Government's Conflicting Role
A profound conflict emerges as the U.S. government plays a dual role. While major music publishers are suing AI company Anthropic for massive copyright infringement, alleging the scraping and reproduction of copyrighted song lyrics, another branch of the U.S. government is rapidly adopting Anthropic's Claude model for mission-critical operations within the Department of Defense and the Intelligence Community. This raises a critical question: if Anthropic's foundational models are found liable for "systematic piracy" by a federal judge, will national security interests provide a backdoor pardon for copyright infringement, or will the government delete its new intelligence tools?
Broader Implications for Intellectual Property
The legal battles in the music industry are not isolated; they serve as pivotal test cases for the future of all intellectual property and digital labor in the age of AI. A ruling in favor of AI companies could establish a sweeping legal precedent, potentially creating a loophole that allows any digital content—from software code and journalism to scientific research and visual art—to be freely ingested, copied, and commercialized by tech giants under the guise of an "invisible back-end process." This outcome would fundamentally devalue human creative labor, transferring immense value from individual creators to the companies owning the AI models. The core challenge lies in adapting analog-era legal doctrines, designed for human-scale actions, to the realities of automated systems capable of memorizing and reproducing vast amounts of human creative output.
Show Notes
Works Referenced
- Recording Industry Association of America (RIAA): The trade organization that represents the U.S. recording industry, currently suing AI music generators like Suno and Udio for copyright infringement.
- Suno: An AI music generation company currently facing a major copyright infringement lawsuit from the RIAA.
- Udio: Another AI music generation company targeted by the RIAA in a significant copyright infringement lawsuit.
- The Files are in the Computer: On Copyright, Memorization, and Generative AI: A 2025 *Chicago-Kent Law Review* paper by Matthew Sag that distinguishes between AI "memorization" and "regurgitation" and argues that the AI model itself can be an infringing copy.
- Fairly Trained: A non-profit organization founded by Ed Newton-Rex that advocates for ethical AI training practices and identifies AI models trained on licensed data.
- Mishcon de Reya Generative AI IP Litigation Tracker: A legal firm's resource tracking intellectual property lawsuits related to generative AI across various creative fields.
- Anthropic: An AI company developing large language models like Claude, currently facing copyright lawsuits from music publishers while also being adopted by U.S. government entities.
- Stealing Isn't Innovation: A campaign advocating for fair compensation and ethical data sourcing for creators whose work is used to train AI models.
- Mikey Shulman: CEO of Suno, who has publicly commented on the company's training practices and legal challenges.
- Ed Newton-Rex: Former VP of Audio at Stability AI and founder of Fairly Trained, known for his analyses debunking claims of AI "invisible learning."
Glossary
- AI Models: Computer systems designed to learn from data and perform tasks, such as generating music, art, or text.
- Fair Use: A legal principle in U.S. copyright law allowing limited use of copyrighted material without permission, often for purposes like criticism, comment, news reporting, teaching, scholarship, or research.
- RIAA: Acronym for the Recording Industry Association of America, a trade group representing the U.S. recording industry.
- Audio Watermark: A short, distinctive sound embedded by producers in music tracks, often to identify ownership or prevent unauthorized use.
- Memorization (AI): The process where an AI model stores near-exact copies of its training data within its internal parameters.
- Regurgitation (AI): When an AI model generates an output that is a near-exact copy of its memorized training data.
- Large Language Model (LLM): An advanced AI program trained on vast amounts of text to understand, generate, and respond to human language.
- Intellectual Property (IP): Legal rights that protect creations of the mind, such as artistic works, inventions, and designs.
- Statutory Damages: Fixed monetary awards set by law for certain legal violations, like copyright infringement, without needing to prove actual financial loss.
- Transformative Use: A type of fair use where a new work significantly changes the original material's purpose, character, or expression.