Privilege Saved, Facts Still Exposed: Judge Stein’s OpenAI Dataset Deletion Ruling—and What It Means for the Next Wave of AI Copyright Lawsuits
Summary
On February 6, 2026, Judge Sidney H. Stein reversed Magistrate Judge Wang's November 24, 2025, ruling that OpenAI waived attorney–client privilege regarding communications about deleting its "Books1/Books2" datasets and references to "LibGen." Judge Wang had found waiver based on OpenAI's "non-use" explanation, its "moving target" litigation posture, and its denial of willful infringement. Judge Stein, however, ruled that disclosing "non-use" was not a disclosure of privileged legal advice, OpenAI consistently asserted privilege, and merely denying willfulness does not automatically put legal advice "at issue." This decision significantly limits plaintiffs' ability to compel discovery of privileged communications in AI copyright cases, emphasizing that privilege protects legal advice, not ordinary factual statements or business reasons.
Key takeaway
For CTOs and legal counsel navigating AI copyright litigation, your teams should meticulously document data provenance and deletion processes, ensuring factual records are distinct from legal advice. This ruling clarifies that merely denying willful infringement or providing non-privileged business reasons for data handling will not automatically waive attorney–client privilege. Focus on robust data governance and preservation strategies to defend against discovery demands, rather than relying on privilege as a shield for operational facts.
Key insights
Denying willfulness or stating non-privileged facts does not waive attorney–client privilege in AI copyright disputes.
Principles
- Privilege protects legal advice, not factual statements.
- Denying willfulness does not imply reliance on counsel.
- At-issue waiver requires affirmative reliance on legal advice.
Method
To obtain discovery in AI copyright cases, focus on factual evidence, forensic data, and third-party records, rather than attempting to pierce attorney–client privilege.
In practice
- Implement aggressive ESI protocols early in litigation.
- Demand non-privileged facts about data lifecycle.
- Utilize forensic discovery and third-party records.
Topics
- Attorney-Client Privilege
- AI Copyright Lawsuits
- Data Discovery
- Dataset Deletion
- Willful Infringement
Best for: CTO, Legal Professional, Director of AI/ML, AI Ethicist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Pascal’s Substack.