AWS Announces General Availability of DevOps Agent for Automated Incident Investigation
Summary
AWS has announced the general availability of DevOps Agent, a generative AI-powered assistant designed to automate incident investigation and operational tasks across AWS, Azure, and on-prem environments. Introduced at re:Invent 2025 and built on Amazon Bedrock AgentCore, the agent integrates with observability tools, runbooks, code repositories, and CI/CD pipelines to correlate telemetry, code, and deployment data. It autonomously triages issues, speeds up resolution, and identifies patterns to prevent future outages. Key improvements include support for custom agent skills and custom charts/reports. Early preview numbers showed up to 75% lower Mean Time To Resolution (MTTR) and 94% root cause accuracy. The service is now billed per second based on operational task time, with credits available for AWS Support customers. AWS also announced the general availability of Security Agent for on-demand penetration testing.
Key takeaway
For Directors of AI/ML evaluating operational efficiency tools, AWS DevOps Agent offers a compelling solution to reduce incident MTTR and improve root cause accuracy. Its autonomous investigation capabilities and broad integration with existing observability and CI/CD tools can significantly offload SRE teams. You should assess its cost structure against potential savings in operational overhead and consider its applicability to your hybrid cloud environments.
Key insights
AWS DevOps Agent uses generative AI to autonomously investigate incidents and automate operational tasks across diverse environments.
Principles
- AI agents enhance SRE efficiency.
- Autonomous investigation reduces MTTR.
- Extensibility is key for operational data integration.
Method
The agent correlates telemetry, code, and deployment data from various sources (CloudWatch, PagerDuty, GitHub) to triage issues, identify root causes, and recommend preventive actions without human prompting.
In practice
- Integrate with existing observability tools.
- Configure webhooks for incident triggers.
- Develop custom agent skills for specific needs.
Topics
- AWS DevOps Agent
- Generative AI
- Incident Investigation
- Operational Automation
- Amazon Bedrock AgentCore
Best for: CTO, Director of AI/ML, VP of Engineering/Data, MLOps Engineer, DevOps Engineer, Software Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by InfoQ.