Have You Ever Seen Them? Entity-level Membership Inference through Interrogating Large Language Models
Summary
A new study by Yiran Zhu and Ziqi Yang introduces entity-level membership inference (ELMI), a novel approach to assess privacy leakage and copyright compliance risks in Large Language Models (LLMs). Unlike existing methods that focus on specific training samples, ELMI determines if information related to a real-world entity was used in an LLM's training data, drawing an analogy to human memory's ability to accumulate knowledge from scattered mentions. The researchers formalize this task for a practical label-only black-box setting, where only generated texts are observable. They instantiate five interrogation strategies that use limited entity clues to construct prompts, elicit entity-related responses, and infer membership from semantic features. Experiments on person entities demonstrate ELMI's effectiveness, achieving an AUC up to 0.97 and providing gains of 6.0% to 17.5% in Balanced Accuracy over adapted sample-level baselines.
Key takeaway
For AI Security Engineers evaluating LLM privacy, this research indicates that entity-level membership inference is a significant, practical threat. You should prioritize developing robust defenses against black-box interrogation methods that can reveal whether specific entity information was used in training, even without direct sample memorization. Implement proactive measures to mitigate risks associated with sensitive entity data exposure, as current sample-level defenses may be insufficient.
Key insights
LLMs can be interrogated to infer if entity-related information was in their training data, akin to human memory.
Principles
- LLMs accumulate entity knowledge.
- Entity-level MI is feasible.
- Black-box interrogation works.
Method
Five interrogation strategies construct prompts from limited entity clues, elicit entity-related responses, and infer membership from semantic features in generated texts in a label-only black-box setting.
In practice
- Assess LLM privacy risks.
- Evaluate copyright compliance.
- Identify specific entity data.
Topics
- Large Language Models
- Membership Inference
- Entity-level Privacy
- Black-box Attacks
- Data Leakage
- AI Security
Code references
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Security Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.