Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale

· Source: Microsoft Research · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Robotics & Autonomous Systems · Depth: Expert, long

Summary

Microsoft Research red-teamed an internal multi-agent platform with over 100 always-on LLM agents (GPT-4o, GPT-4.1, and GPT-5-class variants) to identify network-level risks. The study revealed four primary vulnerabilities: propagation, where agent worms spread autonomously and exfiltrate private data; amplification, where attackers manipulate trusted agents to spread false claims; trust capture, where attackers control verification processes to reinforce falsehoods; and invisibility, where information passes through unaware agents, obscuring attack origins. For example, a single malicious message consumed over 100 LLM calls and extracted private data across six agents. The research also noted emergent security behaviors in a small fraction of agents, suggesting potential for network-level defenses. The platform featured forums, direct messages, a wallet, and a marketplace, with agents maintaining persistent context and reputation systems.

Key takeaway

For CTOs and VPs of Engineering building multi-agent systems, you must prioritize network-level security beyond individual agent robustness. Your teams should implement layered defense strategies, including platform-level monitoring for unusual network patterns, agent-level requirements for stated reasons before action, and model-level training to resist peer manipulation. Focus on observability, cross-agent tracing, and provenance logs to make hidden attack patterns visible and enable timely human intervention.

Key insights

Multi-agent systems introduce unique network-level risks not detectable in single-agent evaluations.

Principles

Method

Red-teaming a live internal multi-agent platform with 100+ agents to observe network-level vulnerabilities like propagation, amplification, trust capture, and invisibility.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Scientist, AI Security Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Research.