Privacy from Symmetry: Orthogonally Equivariant Transformers for LLM Inference
Summary
A new approach, Orthogonally Equivariant Transformers, specifically ConjFormer, addresses privacy concerns in large language model (LLM) inference when sensitive text is processed by third-party providers. Current split inference methods, which send only hidden representations, are vulnerable to token recovery via nearest-neighbor search against public embedding tables. The proposed orthogonal obfuscation procedure involves the client multiplying embeddings by a secret orthogonal matrix before transmission. ConjFormer, an O(d)-equivariant transformer variant, enables correct inference under these arbitrary rotations through scalar RMSNorm and blockwise orthogonal conjugation of linear weights. Experiments on GPT-2 and Llama 3.2 1B models fine-tuned on PubMed demonstrate that this method eliminates direct cosine nearest-neighbor inversion and reduces token recovery from over 35% top-10 to at most 1.3%, with only a 0.4% perplexity increase.
Key takeaway
For Machine Learning Engineers deploying LLMs with split inference on sensitive data, this research offers a robust privacy defense. You should consider integrating orthogonal obfuscation and ConjFormer's O(d)-equivariant architecture to drastically reduce token recovery risks from hidden representations. This approach achieves strong privacy with minimal perplexity impact, making it a practical alternative to heavy cryptographic methods.
Key insights
Enforcing architectural symmetry via orthogonal obfuscation and equivariant transformers provides practical privacy for LLM inference.
Principles
- Hidden representations are vulnerable to nearest-neighbor token recovery.
- Orthogonal obfuscation can eliminate direct cosine nearest-neighbor inversion.
- Architectural symmetry offers a practical privacy defense without noise.
Method
The client applies a secret orthogonal matrix to embeddings; the server uses ConjFormer, an O(d)-equivariant transformer with scalar RMSNorm and blockwise orthogonal conjugation, to process in the rotated basis.
In practice
- Apply client-side orthogonal obfuscation to embeddings.
- Implement ConjFormer for O(d)-equivariant LLM inference.
- Fine-tune existing LLMs (e.g., GPT-2, Llama 3.2 1B) with ConjFormer.
Topics
- LLM Privacy
- Orthogonal Equivariance
- ConjFormer
- Split Inference
- Data Obfuscation
- Machine Learning Security
Best for: AI Architect, Research Scientist, CTO, AI Scientist, Machine Learning Engineer, AI Security Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.