Announcing ControlConf 2026

2024-06-17 · Source: Redwood Research blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Emerging Technologies & Innovation · Depth: Expert, quick

Summary

ControlConf is a two-day conference on AI control: reducing risks from AI misalignment with safeguards that keep working even when models try to undermine them. AI agents have improved significantly since February 2025, which makes control techniques critical for real-world agent deployments. The conference will feature presentations on current research problems, promising interventions, and key future research directions. Topics will include AI monitoring and attack-generation capabilities, evaluation of internal agent threats by external auditors, the reliability of Chain-of-Thought (CoT) monitoring, permissions management for high-security agents, and risk analysis for misaligned motivations other than scheming. A separate one-day workshop on AI futurism and threat modeling will run on April 17, focusing on catastrophic AI risks and mitigation strategies.

Key takeaway

For AI Scientists and Research Scientists developing or deploying advanced AI agents, understanding and implementing robust AI control measures is increasingly critical. Your focus should be on evaluating and integrating safeguards that can withstand sophisticated AI attempts to undermine them, particularly as agent capabilities advance. Consider attending ControlConf to engage with frontier research on threat modeling, permissions management, and monitoring techniques to proactively mitigate misalignment risks in your deployments.

Key insights

AI control makes misalignment risk analysis concrete by asking whether safeguards still hold against AI models that are actively trying to subvert them.

Principles

AI control techniques are becoming load-bearing for agent safety.
Risk analysis can be concretized by focusing on AI capabilities.

In practice

Evaluate CoT monitoring reliability in your AI systems.
Implement permissions management for high-security AI agents.
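
As one illustration of the first item, a monitor's reliability is often summarized as the fraction of attack transcripts it flags at a fixed false-positive budget on benign transcripts. The sketch below is a minimal, hypothetical version of that calculation and is not taken from the original article. It assumes you already have a numeric suspicion score per transcript from some monitor and a labeled split into benign and attack examples; the function names and the example scores are invented for illustration.

```python
from typing import Sequence


def threshold_for_fpr(benign_scores: Sequence[float], max_fpr: float) -> float:
    """Pick a threshold so that at most max_fpr of benign scores lie strictly above it."""
    ordered = sorted(benign_scores, reverse=True)
    # Number of benign transcripts we are allowed to flag under the budget.
    allowed = int(max_fpr * len(ordered))
    if allowed >= len(ordered):
        # Budget permits flagging everything (also covers an empty benign set).
        return float("-inf")
    # Only scores strictly greater than ordered[allowed] are flagged,
    # which is at most `allowed` benign transcripts.
    return ordered[allowed]


def recall_at_threshold(attack_scores: Sequence[float], threshold: float) -> float:
    """Fraction of attack transcripts whose score exceeds the threshold."""
    if not attack_scores:
        raise ValueError("need at least one attack transcript")
    flagged = sum(1 for score in attack_scores if score > threshold)
    return flagged / len(attack_scores)


# Hypothetical suspicion scores, for illustration only.
benign = [0.1, 0.2, 0.4, 0.9]
attacks = [0.5, 0.95, 0.3, 0.8]

threshold = threshold_for_fpr(benign, max_fpr=0.25)
recall = recall_at_threshold(attacks, threshold)
print(f"threshold={threshold}, recall at 25% FPR={recall}")
```

A number like this only says how the monitor performs on the transcripts it was scored on. Whether it holds up against a model deliberately writing reasoning that evades the monitor is a separate question that needs adversarially generated attack examples.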

Topics

AI Control
Misalignment Risk
AI Agent Safeguards
Threat Modeling
AI-Empowered Security

Best for: AI Scientist, Research Scientist, AI Researcher, AI Security Engineer, AI Ethicist

Editorial summary, takeaway, and curation by AIssential. Original article published by Redwood Research blog.