awesome-devops-ai

Awesome DevOps AI

A curated list of AI tools, agents, MCP servers, and resources for DevOps, SRE, and Platform Engineering.

AI is changing how infrastructure is built, monitored, and operated. This list tracks tools at the intersection of AI and DevOps, from coding agents that write Terraform to AI-powered incident response that pages you with a root cause already identified.

Why this list? Engineers are adopting AI tooling quickly, but the landscape is fragmented across hundreds of repos, products, and frameworks. This list collects them in one place.

257 tools across 20 categories, updated March 2026. See the Quick Start Guide for role-based recommendations.

If this list is useful, please give it a star to help others find it.

Contents

AI Coding Agents for Infrastructure

AI-powered coding agents that help write, review, and maintain infrastructure code including Terraform, Kubernetes manifests, Dockerfiles, and CI/CD pipelines.

AI-Powered Kubernetes

AI tools specifically designed for Kubernetes cluster management, troubleshooting, and operations.

AI-Powered Terraform and IaC

Tools that bring AI capabilities to Infrastructure as Code workflows.

AI Incident Response and Troubleshooting

AI systems that detect, investigate, and remediate production incidents.

AI Monitoring and Observability

AI-enhanced monitoring, alerting, and observability platforms.

AI Security Scanning

AI-powered security tools for infrastructure, containers, and supply chain.

AI Cost Optimization

AI and automation tools for cloud cost management, FinOps, and resource optimization.

MCP Servers for DevOps

Model Context Protocol (MCP) servers that give AI assistants like Claude, ChatGPT, and Cursor access to DevOps tools and infrastructure.

AI-Powered CI/CD

AI tools that enhance continuous integration and delivery pipelines.

AI Log Analysis and Debugging

AI tools for log analysis, pattern detection, and debugging production systems.

AI Agent Frameworks for Infrastructure

General-purpose AI agent frameworks with strong infrastructure and DevOps use cases.

AI for Platform Engineering

AI tools for building internal developer platforms, service catalogs, and self-service infrastructure.

AI for Database Operations

AI tools for database management, query optimization, and data operations.

AI for Networking and Service Mesh

AI tools for network management, service mesh, and traffic engineering.

AI for Container Security and Supply Chain

AI tools for container image security, software supply chain, and build verification.

AI for Chaos Engineering and Reliability

AI tools for chaos engineering, resilience testing, and reliability validation.

AI for Cloud Migration and Modernization

AI tools that assist with cloud migration planning, execution, and application modernization.

AI for GitOps

AI tools for GitOps workflows, declarative infrastructure, and continuous reconciliation.

System Prompt and Config Templates

Ready-to-use AI agent configurations for infrastructure repositories.

Learning Resources

Articles, guides, books, certifications, and podcasts on AI for DevOps.

Articles and Guides

Books

Certifications

Podcasts

Community and Newsletters

Communities, forums, and newsletters covering AI and DevOps.

Contributing

Contributions are welcome! Please read the contribution guidelines first.

Join the discussion to suggest tools or ask questions.

Author

Hammad Haqqani - DevOps Architect and Cloud Engineer


Support

If you find this useful, consider buying me a coffee!
