Security and Privacy of Large Language Models: Threat Taxonomy, Ethical Implications, and…
Summary
This paper, a narrative review synthesizing 251 references, addresses the security, privacy, and governance challenges introduced by Large Language Models (LLMs). It proposes a unified lifecycle model for understanding LLM risks, moving beyond traditional software vulnerabilities. Key contributions include a lifecycle-based taxonomy classifying attacks from interaction to training phases, an "alignment gap" framework explaining how pre-training probabilistic behavior conflicts with post-training alignment, and a four-layer socio-technical risk model covering technical, human, organizational, and governance aspects. The review also maps defense-in-depth strategies to various attack categories and lifecycle stages, emphasizing that LLM risks stem from systemic structural factors like probabilistic generation and large-scale data ingestion, rather than isolated code defects. It highlights issues like memorization, bias amplification, PII leakage, and the insufficiency of current regulations.
Key takeaway
For AI Security Engineers and Enterprise Architects deploying Large Language Models, you must move beyond traditional software security models. Your strategy should integrate a lifecycle-oriented defense-in-depth approach, combining technical safeguards like differential privacy and runtime monitoring with robust governance mechanisms. Focus on continuous evaluation and privacy-preserving training to address systemic risks stemming from probabilistic generation and data ingestion, ensuring responsible and secure LLM deployment.
Key insights
LLM risks are systemic, requiring a lifecycle-oriented defense-in-depth strategy across technical, human, organizational, and governance layers.
Principles
- LLM risks stem from probabilistic generation, not isolated code defects.
- Alignment techniques only adjust surface behavior, not underlying representations.
- Single technical mitigations are insufficient for LLM security.
Method
The paper proposes a unified lifecycle model for LLM risk analysis, including a lifecycle-based taxonomy, an alignment gap framework, a socio-technical risk model, and defense-in-depth mapping.
In practice
- Adopt lifecycle risk management for LLM deployments.
- Implement privacy protection at the training layer.
- Use runtime protection and deployment controls.
Topics
- LLM Security
- Data Privacy
- Threat Taxonomy
- AI Governance
- Defense-in-Depth
- Prompt Injection
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Security Engineer, Policy Maker
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by LLM on Medium.