CausShield: Sample Reconstruction-Resilient Vertical FL via Causal Representation Learning

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, quick

Summary

CausShield is a novel defense mechanism designed to protect Vertical Federated Learning (VFL) from active sample reconstruction attacks, a vulnerability where existing solutions struggle with balancing model utility and privacy. Developed using insights from structural causal models (SCMs), CausShield addresses the challenge by decomposing shared representations between clients and servers into task-relevant (causal) and task-irrelevant (non-causal, privacy-sensitive) features. This approach ensures full-cycle privacy protection, mitigating early-epoch vulnerabilities common in end-to-end supervised training defenses. The decomposition is achieved through an unsupervised representation learning optimization problem. CausShield is theoretically proven to preserve the convergence behavior of standard VFL. Extensive experiments demonstrate its superior performance in privacy protection, model utility, and computational efficiency compared to seven state-of-the-art methods, including InvL (USENIX Security'25), and its robustness against advanced attacks like URVFL (NDSS'25).

Key takeaway

For Machine Learning Engineers deploying Vertical Federated Learning, you should evaluate CausShield as a robust defense against active sample reconstruction attacks. Its causal representation learning approach provides superior privacy protection and model utility compared to current SOTAs, while maintaining computational efficiency. This mitigates early-epoch vulnerabilities and ensures full-cycle privacy, making it a critical consideration for securing your VFL applications.

Key insights

CausShield enhances Vertical Federated Learning privacy by causally separating task-relevant from privacy-sensitive features, outperforming existing defenses in utility and protection.

Principles

Method

CausShield decomposes VFL shared representations into task-relevant and task-irrelevant components. This is achieved by solving a carefully formulated optimization problem using unsupervised representation learning to balance utility and privacy.

In practice

Topics

Best for: Research Scientist, AI Scientist, AI Security Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.