Model Stealing Through the Lens of Model Multiplicity

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, quick

Summary

Model stealing attacks, where adversaries create high-fidelity surrogate models, pose a significant threat to machine learning service intellectual property. This paper challenges the conventional assumption that such surrogates offer economic leverage comparable to original services. It evaluates model stealing beyond mere fidelity, recognizing that query-based extraction provides only partial supervision, leading to a "Rashomon Set" of near-optimal surrogates. The research computes this Rashomon Set and assesses its diversity using multiplicity metrics (ambiguity, discrepancy, and Rashomon Capacity) and group fairness metrics. Experiments across tabular, medical imaging, and NLP tasks on real-world datasets reveal that despite similar fidelity to the target model, surrogate models can exhibit substantial variances in other critical performance metrics, questioning their presumed equivalence in practical deployment.

Key takeaway

For AI Security Engineers evaluating model stealing risks or IP protection strategies, you should not assume high-fidelity stolen models are functionally equivalent to your original service. Your risk assessments must account for the "Rashomon Set" of potential surrogates, which can exhibit critical performance differences despite similar accuracy. This necessitates deeper analysis beyond simple fidelity metrics to truly understand the threat.

Key insights

High-fidelity stolen models may not be functionally equivalent to originals due to model multiplicity.

Principles

Method

Compute the Rashomon Set of surrogate models, then evaluate its diversity using multiplicity metrics (ambiguity, discrepancy, Rashomon Capacity) and group fairness metrics across various tasks.

In practice

Topics

Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.