Separable Expert Architecture: Toward Privacy-Preserving LLM Personalization via Composable Adapters and Deletable User Proxies
Summary
A novel three-layer architecture is proposed to address the computational infeasibility of individual data removal in large language models (LLMs) by decoupling personal data from shared model weights. This architecture integrates a static base model, composable domain-expert LoRA adapters for behavior shaping without user data, and per-user proxy artifacts. The deletion of these proxies enables deterministic unlearning. Evaluation on Phi-3.5-mini and Llama-3.1-8B models demonstrates effective per-user differentiation, where personal data influences outputs while remaining isolated. Verification after proxy removal shows a return to baseline (KL divergence of approximately 0.21 nats, 82-89% pass rate) and near-zero cross-user contamination. This design inherently mitigates model inversion, membership inference, and training-data extraction against shared components and is compatible with differentially private stochastic gradient descent (DP-SGD).
Key takeaway
For engineering teams building personalized LLM applications, this architecture offers a robust solution for integrating user data while ensuring privacy and compliance with data deletion requests. Your ability to deterministically remove user data by deleting proxies simplifies machine unlearning, reducing the operational overhead and legal risks associated with data retention. Consider adopting this approach to enhance user trust and streamline data governance.
Key insights
A three-layer architecture enables privacy-preserving LLM personalization through composable adapters and deletable user proxies.
Principles
- Decouple personal data from shared weights.
- Unlearning via deterministic proxy deletion.
Method
Combine a static base model, domain-expert LoRA adapters, and per-user proxy artifacts. Personalization occurs via proxies, unlearning via proxy deletion.
In practice
- Apply LoRA adapters for behavior shaping.
- Isolate user data in deletable proxies.
Topics
- Separable Expert Architecture
- LLM Personalization
- Privacy-Preserving AI
- Machine Unlearning
- LoRA Adapters
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, Machine Learning Engineer, AI Security Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.