CausShield: Sample Reconstruction-Resilient Vertical FL via Causal Representation Learning
Summary
CausShield is a novel defense mechanism designed to protect Vertical Federated Learning (VFL) from active sample reconstruction attacks, a vulnerability where existing solutions struggle with balancing model utility and privacy. Developed using insights from structural causal models (SCMs), CausShield addresses the challenge by decomposing shared representations between clients and servers into task-relevant (causal) and task-irrelevant (non-causal, privacy-sensitive) features. This approach ensures full-cycle privacy protection, mitigating early-epoch vulnerabilities common in end-to-end supervised training defenses. The decomposition is achieved through an unsupervised representation learning optimization problem. CausShield is theoretically proven to preserve the convergence behavior of standard VFL. Extensive experiments demonstrate its superior performance in privacy protection, model utility, and computational efficiency compared to seven state-of-the-art methods, including InvL (USENIX Security'25), and its robustness against advanced attacks like URVFL (NDSS'25).
Key takeaway
For Machine Learning Engineers deploying Vertical Federated Learning, you should evaluate CausShield as a robust defense against active sample reconstruction attacks. Its causal representation learning approach provides superior privacy protection and model utility compared to current SOTAs, while maintaining computational efficiency. This mitigates early-epoch vulnerabilities and ensures full-cycle privacy, making it a critical consideration for securing your VFL applications.
Key insights
CausShield enhances Vertical Federated Learning privacy by causally separating task-relevant from privacy-sensitive features, outperforming existing defenses in utility and protection.
Principles
- Causal features drive learning; non-causal features encode private data.
- Decomposing representations ensures full-cycle VFL privacy.
- Unsupervised learning can balance utility and privacy.
Method
CausShield decomposes VFL shared representations into task-relevant and task-irrelevant components. This is achieved by solving a carefully formulated optimization problem using unsupervised representation learning to balance utility and privacy.
In practice
- Implement CausShield for VFL to resist reconstruction attacks.
- Apply causal representation learning for privacy-preserving ML.
- Benchmark VFL defenses against URVFL and similar attacks.
Topics
- Vertical Federated Learning
- Causal Representation Learning
- Data Privacy
- Reconstruction Attacks
- Machine Learning Security
- Unsupervised Learning
Best for: Research Scientist, AI Scientist, AI Security Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.