OpenAI Help: Lockdown Mode
Summary
OpenAI has launched "Lockdown Mode" for ChatGPT, a security feature first teased in February and now rolling out to eligible personal accounts (Free, Go, Plus, Pro) and self-serve ChatGPT Business accounts. This mode specifically aims to prevent the final stage of data exfiltration resulting from prompt injection attacks by restricting outbound network requests that could transfer sensitive data to an attacker. It is crucial to note that Lockdown Mode does not prevent prompt injections from appearing in content ChatGPT processes, nor does it affect the behavior or accuracy of responses influenced by such injections. The feature directly addresses the "Lethal Trifecta" by cutting off data exfiltration vectors, implying that default ChatGPT settings may not offer robust protection against determined data theft. OpenAI CISO Dane Stuckey clarified that Lockdown Mode targets users with an elevated risk profile, who prioritize security despite potential tradeoffs in functionality and utility.
Key takeaway
For AI Security Engineers evaluating LLM deployments handling sensitive data, you should enable OpenAI's Lockdown Mode for ChatGPT. This feature directly mitigates data exfiltration risks from prompt injection attacks by restricting outbound network requests. While this mode involves functionality tradeoffs, for high-risk profiles, the enhanced security against the "Lethal Trifecta" is a worthwhile compromise. You must also recognize that default ChatGPT settings may not provide robust protection against determined exfiltration attempts.
Key insights
OpenAI's Lockdown Mode prevents data exfiltration from prompt injections by limiting outbound network requests, mitigating a key "Lethal Trifecta" risk.
Principles
- Restrict data exfiltration to mitigate LLM "Lethal Trifecta."
- Deterministic, non-AI evaluated mechanisms enhance security.
- Default LLM settings may lack robust exfiltration protection.
Method
Lockdown Mode prevents data exfiltration by limiting outbound network requests, blocking the final stage of prompt injection attacks. This mechanism is deterministic and not AI-evaluated, enhancing reliability against subversion.
In practice
- Enable Lockdown Mode for elevated risk profiles.
- Accept functionality tradeoffs for enhanced security.
- Evaluate default LLM security for sensitive data.
Topics
- OpenAI
- ChatGPT
- Lockdown Mode
- Prompt Injection
- Data Exfiltration
- LLM Security
- Lethal Trifecta
Best for: CTO, VP of Engineering/Data, Executive, AI Security Engineer, Software Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Simon Willison's Weblog.