Skip to main content

Authors Sue Adobe Over AI Training With Pirated Books

Adobe Faces Lawsuit Over Alleged Use of Pirated Books in AI Training

Tech giant Adobe finds itself embroiled in controversy as Oregon author Elizabeth Lyon files a class-action lawsuit accusing the company of using illegally obtained books to train its SlimLM artificial intelligence model. The case shines new light on the ongoing battle between content creators and tech companies over copyright protections in the AI era.

The Core Allegations

Lyon, who writes nonfiction writing guides, claims Adobe incorporated her pirated works into SlimLM's training data without permission or compensation. Court documents allege Adobe relied on problematic datasets that trace back to Books3 - a collection containing approximately 191,000 copyrighted books allegedly scraped from pirate sites.

The complaint specifically targets SlimPajama-627B, the dataset Adobe acknowledges using for SlimLM's development. While publicly available, Lyon's legal team argues SlimPajama inherits copyright issues from its predecessor RedPajama, which directly incorporated Books3 content.

"Adobe took shortcuts," says Lyon's attorney Mark Rifkin. "They built commercial products using stolen creative work while bypassing proper licensing channels."

Industry-Wide Implications

This lawsuit doesn't exist in isolation. Several major tech players now face similar legal challenges:

  • Apple confronted allegations last September regarding its Apple Intelligence system
  • Anthropic settled a $1.5 billion case with authors just last month
  • Salesforce received complaints in October about its AI training practices

The pattern suggests an industry-wide reckoning may be coming regarding how AI companies source their training materials.

Why This Case Matters

The outcome could reshape how tech firms approach AI development moving forward. Currently, many rely on massive datasets scraped from various online sources with questionable copyright status. A ruling against Adobe might force companies to:

  1. Implement stricter vetting processes for training data
  2. Develop new methods to compensate content creators
  3. Potentially limit what materials they can legally use

The stakes extend beyond financial penalties - it's fundamentally about determining fair compensation models for creative work fueling today's AI revolution.

The timing couldn't be more critical as generative AI becomes increasingly dependent on vast quantities of text data.

Key Points:

  • Adobe faces class-action lawsuit over alleged use of pirated books in SlimLM training
  • Case centers on controversial Books3 dataset containing ~191K copyrighted works
  • Similar lawsuits emerging against Apple, Anthropic and Salesforce
  • Outcome could redefine copyright standards for AI training materials
  • Potential billion-dollar implications for tech industry practices

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

News

Disney Flexes Copyright Muscle as Google Bans AI-Generated Characters

Google has quietly pulled the plug on Disney character generation in its Gemini and Nano Banana AI tools following legal pressure. The move highlights growing tensions between tech giants and copyright holders in the AI era. While Disney blocked Google's use of its characters, it's reportedly cutting deals with other AI companies - showing how intellectual property is becoming both weapon and commodity.

February 12, 2026
AI CopyrightDisneyGenerative AI
OpenAI Snags Coveted GPT.com Domain in Strategic Brand Move
News

OpenAI Snags Coveted GPT.com Domain in Strategic Brand Move

OpenAI appears to have quietly acquired the premium domain GPT.com, which now redirects to ChatGPT's official site. The move mirrors their previous acquisition of Chat.com and suggests a deliberate strategy to control key digital real estate in the AI space. While unconfirmed officially, domain records show GPT.com transferred to OpenAI's preferred registrar, strengthening their brand presence as competition in generative AI intensifies.

March 2, 2026
OpenAIDomain StrategyGenerative AI
News

Amazon Bets Big on AI Content Licensing Amid Copyright Storm

Amazon is quietly developing a marketplace that could reshape how AI companies access training data. The tech giant aims to broker deals between publishers and AI firms, offering a legal alternative to today's copyright chaos. While details remain under wraps, insiders see this as Amazon's answer to Microsoft's similar initiative - and potentially a lifeline for struggling media outlets.

February 11, 2026
AI CopyrightAmazon Web ServicesContent Licensing
News

Reddit Bets Big on AI Search to Revolutionize Online Q&A

Reddit's latest earnings report reveals ambitious plans to transform its platform into an AI-powered answer hub. With search activity growing rapidly, the company is blending traditional search with generative AI capabilities. Weekly active users on Reddit search surged from 60 million to 80 million last year, while its AI Answers feature saw explosive growth from 1 million to 15 million users. The platform also plans to enhance accessibility by removing differences between logged-in and guest experiences starting Q3 2026.

February 6, 2026
RedditAI SearchGenerative AI
News

Design Startup Flora Lands $42M Boost to Revolutionize Creative Workflows

Flora, an innovative design tool shaking up traditional workflows with its node-based system, just secured $42 million in Series A funding led by Redpoint Ventures. Already adopted by major players like Alibaba and Lionsgate, Flora's unique approach lets creatives generate and iterate designs seamlessly across multiple media formats. The fresh capital will fuel team expansion and product enhancements as Flora positions itself against established giants in the increasingly competitive AI-powered design space.

January 28, 2026
Generative AIDesign ToolsStartup Funding
News

OpenAI Seeks $50 Billion Boost from Middle East Investors

OpenAI CEO Sam Altman is courting Middle Eastern investors for a massive funding round that could reach $50 billion, potentially valuing the AI pioneer between $75-83 billion. While discussions remain preliminary, the move signals OpenAI's ambitious growth plans following ChatGPT's breakout success. Analysts predict the company could generate $25 billion annually from advertising by 2030.

January 22, 2026
OpenAIAI FundingSam Altman