Announcing ControlConf 2026

· Source: Redwood Research blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Emerging Technologies & Innovation · Depth: Expert, quick

Summary

ControlConf is a two-day conference on AI control: reducing risks from AI misalignment with safeguards that still work when models actively try to undermine them. AI agents have improved significantly since February 2025, which makes control techniques critical for real-world agent deployments. The conference will feature presentations on current research problems, promising interventions, and key directions for future research. Topics will include AI monitoring and attack-generation capabilities, evaluation of internal agent threats by external auditors, the reliability of Chain-of-Thought (CoT) monitoring, permissions management for high-security agents, and risk analysis for misaligned motivations other than scheming. A one-day workshop on AI futurism and threat modeling will also run on April 17, focusing on catastrophic AI risks and mitigation strategies.

Key takeaway

For AI Scientists and Research Scientists developing or deploying advanced AI agents, understanding and implementing robust AI control measures is increasingly critical. Focus on evaluating and integrating safeguards that can withstand sophisticated attempts by AI models to undermine them, particularly as agent capabilities advance. Consider attending ControlConf to engage with frontier research on threat modeling, permissions management, and monitoring techniques, and to mitigate misalignment risks in your deployments proactively.

Key insights

AI control focuses on concrete analysis of misalignment risk, including the case where AI models are actively trying to subvert safeguards.

Best for: AI Scientist, Research Scientist, AI Researcher, AI Security Engineer, AI Ethicist

Editorial summary, takeaway, and curation by AIssential. Original article published by Redwood Research blog.