Who's Really Stealing From Whom?

2026-04-17 · Source: What's AI by Louis-François Bouchard · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation, Robotics & Autonomous Systems · Depth: Intermediate, long

Summary

Anthropic, OpenAI, and Google recently accused Chinese AI labs Deepseek, Moonshot AI, and Minimax of "industrial scale distillation attacks" on their frontier models, specifically Claude and Gemini, between February 12-23, 2026. Anthropic's detailed report described "Hydra clusters" using 24,000 fake accounts and 16 million exchanges to extract capabilities like reasoning, coding, and agentic behavior. Deepseek allegedly focused on reasoning extraction, rubric-based grading for reinforcement learning, and censorship-safe alignment. Minimax primarily targeted agentic coding and tool use, while Moonshot AI focused on agentic reasoning and computer vision. This blackbox distillation involves querying models at scale via APIs to harvest outputs for training, a technique formalized in 2015 by Hinton, Vignels, and Dean.

Key takeaway

For CTOs and VPs of Engineering evaluating model development strategies, recognize that frontier model intelligence is increasingly commoditized and extractable via public APIs. While distillation offers cost-effective model training, be aware of the unclear legal landscape regarding terms of service violations versus IP law. Focus on building value into product ecosystems and workflows around models, rather than solely on raw model outputs, to mitigate the impact of widespread distillation.

Key insights

Blackbox distillation allows smaller models to learn from frontier models' API outputs, even without access to internal probabilities.

Principles

Model intelligence is extractable via public APIs.
Scaling blackbox distillation yields significant performance gains.

Method

Query a teacher model with millions of carefully chosen questions, collect its text outputs (answers, explanations, code), then train a student model to imitate these patterns.

In practice

Use frontier models as reward models for RLHF.
Generate synthetic reasoning traces for training thinking models.

Topics

Model Distillation
Blackbox Distillation
Frontier Models
AI Ethics
Copyright Infringement

Best for: CTO, VP of Engineering/Data, Research Scientist, AI Scientist, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by What's AI by Louis-François Bouchard.