FloatDoor: Platform-Triggered Backdoors in LLMs
Summary
FloatDoor introduces the first input-independent, platform-triggered backdoor attack targeting generative large language models (LLMs). This novel attack exploits platform-dependent variability in LLM outputs, a consequence of non-associative floating-point arithmetic and divergent kernel implementations across different deployment environments. FloatDoor employs two lightweight LoRA adapters: one to amplify inter-platform numerical divergence and another to link the resulting platform signature to a malicious downstream task, while preserving overall model utility. The compromised model behaves maliciously only when served on a specific target platform, remaining benign otherwise. Demonstrated on Qwen3-4B across NVIDIA GPUs, Google TPUs, AWS Graviton, and Alibaba Yitian-710, FloatDoor can reliably induce exploitable code vulnerabilities. This research establishes a new class of LLM deployment attacks, emphasizing the critical need for trusted model supply chains in sensitive applications.
Key takeaway
For AI Security Engineers or Machine Learning Engineers deploying LLMs in sensitive settings, you must account for platform-dependent numerical variability as a potential attack vector. FloatDoor demonstrates that models can be benign during auditing but malicious on specific deployment hardware, exploiting a time-of-check, time-of-use gap. Implement robust auditing across all target deployment environments, not just a single reference, to mitigate such sophisticated backdoor attacks.
Key insights
Platform-dependent numerical divergence in LLMs creates a novel attack surface for input-independent backdoors.
Principles
- LLM outputs vary by deployment platform.
- Floating-point arithmetic non-associativity causes divergence.
- A time-of-check, time-of-use gap is exploitable.
Method
FloatDoor uses two LoRA adapters: one amplifies inter-platform numerical divergence, the other binds the platform signature to a malicious task.
In practice
- Induce exploitable code vulnerabilities.
- Target specific hardware platforms.
Topics
- LLM Security
- Backdoor Attacks
- Floating-Point Arithmetic
- Platform Dependence
- LoRA Adapters
- Supply Chain Security
Best for: CTO, AI Architect, Research Scientist, AI Scientist, AI Security Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.