awesome-devops-ai

Awesome DevOps AI


A curated list of AI tools, agents, MCP servers, and resources for DevOps, SRE, and Platform Engineering.

The AI revolution is transforming how infrastructure is built, monitored, and operated. This list tracks every meaningful tool at the intersection of AI and DevOps, from coding agents that write Terraform to AI-powered incident response that pages you with a root cause already identified.

Why this list? Engineers are adopting AI tooling faster than any technology shift in history, but the landscape is fragmented across hundreds of repos, products, and frameworks. This is one place to find them all.

459 tools across 20 categories — updated April 2026. See the Quick Start Guide for role-based recommendations.

If this list is useful, please give it a star to help others find it.

Tool of the Week

JetBrains Junie — JetBrains’ LLM-agnostic coding agent that works in the terminal, IDEs, and CI/CD pipelines. Supports OpenAI, Anthropic, Google, and Grok models out of the box. The first major IDE vendor to ship a standalone agentic CLI.

Previous picks: Gemini CLI (Google’s terminal AI agent), Goose (Block’s Rust-based autonomous agent), K8sGPT (CNCF Kubernetes diagnostics), HolmesGPT (agentic troubleshooting)

What’s New

Late April 2026 (#2) — Web research update adding 45 new tools across 9 categories and fixing the broken KubeStellar Console MCP link from PR #19. AI Coding Agents (3): Kilo Code, Plandex, Mistral Vibe. Kubernetes (3): Causely, Parity, Azure SRE Agent. Incident Response (5): Cleric, Resolve AI, NeuBird Hawkeye, Edge Delta, Traversal. Observability (2): SigNoz, Flip AI. Security (7): Aikido, ZeroPath, Pixee, Corgea, Backslash, Ghost Security, Cyera. FinOps (3): Antimetal, PointFive, ProsperOps. IaC (2): Resourcely, Terrateam. Agent Frameworks (5): Letta, Strands Agents, BeeAI, Agno, Mirascope. MCP Servers (15): Render, Fly.io, SigNoz, Coralogix, Logz.io, ClickHouse, Turso, Databricks, Harness, Twilio, Stripe, CrowdStrike Falcon, Wiz, Bitwarden, Doppler. Total: 459 tools across 20 categories.

April 2026 (Late) — Major update adding 31 new entries from deep research. AI Coding Agents (7): OpenCode (140k+ stars), Roo Code, OpenHands (77.6% SWEBench), Crush (Charm), Factory AI, Qwen Code (Alibaba), Trae (ByteDance). AI Agent Frameworks (3): Microsoft Agent Framework 1.0 (April 3 release), LangChain Deep Agents, Llama Stack (Meta). AI CI/CD and Testing (5): Mendral (YC AI DevOps engineer), Stably AI, Momentic, Cursor BugBot, Claude Code Review (Anthropic multi-agent). MCP Servers (2): Cloudflare Code Mode, Bitbucket. Kubernetes (2): NVIDIA AI Cluster Runtime, Velero (CNCF Sandbox). Terraform/IaC (1): Terracotta AI. Databases (2): Tembo, Xata. Security/Governance (3): Credo AI, Holistic AI, Microsoft Purview. Observability (1): OpenLLMetry. System Prompts (2): AGENTS.md (OpenAI), Claude Skills (Anthropic). Community (1): Agentic AI Foundation (Linux Foundation). Total: 414 tools across 20 categories.

April 2026 — Major update with 69 new entries. Added 18 new tools including JetBrains Junie, aiac, NVIDIA KAI Scheduler, llm-d, Sedai, AccuKnox, Checkmarx One, Coroot, Apache SkyWalking, Dash0, MetaGPT, testRigor, Testsigma, ControlMonkey, and Braintrust. Added 24 new MCP servers from GCP, DigitalOcean, Oracle, New Relic, Splunk, Elastic, Dynatrace, CircleCI, Buildkite, MongoDB, Redis, Neon, Supabase, Vault, Snyk, Trivy, Rootly, FireHydrant, incident.io, and Ansible. Added 8 new books, 16 new certifications (AWS, Azure, GCP, CNCF, HashiCorp, FinOps, Datadog), and the Agentic DevOps podcast. Total: 383 tools across 20 categories.

March 2026 — Added 34 new tools including Gemini CLI, Goose, Kiro, KubeAI, Kubescape, Keep, Sysdig, and Pydantic AI. Added 9 new MCP servers (Azure DevOps, GitLab, JFrog, Jenkins, Prometheus, Pulumi, Argo CD, Slack, Notion). Added 3 new CNCF projects (Kubeflow, Kubescape, KServe) and new agent frameworks (Google ADK, Pydantic AI, smolagents, DSPy, OpenClaw). Total: 314 tools across 20 categories.

February 2026 — Added 23 new tools across 12 categories to reach 280 total. New coverage for database operations, networking, and container security.

Most Starred Projects

The most popular open-source projects in this list by GitHub stars.

| Project | Category |
| --- | --- |
| Gemini CLI | AI Coding Agents |
| OpenClaw | AI Agent Frameworks |
| Grafana | AI Monitoring |
| Elasticsearch | AI Log Analysis |
| n8n | AI Agent Frameworks |
| LangChain | AI Agent Frameworks |
| Dify | AI Agent Frameworks |
| Aider | AI Coding Agents |
| K8sGPT | AI Kubernetes |
| Trivy | AI Security |
| Argo CD | AI CI/CD |
| Istio | AI Networking |
| Helm | AI GitOps |
| Falco | AI Security |

AI Coding Agents for Infrastructure

AI-powered coding agents that help write, review, and maintain infrastructure code including Terraform, Kubernetes manifests, Dockerfiles, and CI/CD pipelines.

AI-Powered Kubernetes

AI tools specifically designed for Kubernetes cluster management, troubleshooting, and operations.

AI-Powered Terraform and IaC

Tools that bring AI capabilities to Infrastructure as Code workflows.

AI Incident Response and Troubleshooting

AI systems that detect, investigate, and remediate production incidents.

AI Monitoring and Observability

AI-enhanced monitoring, alerting, and observability platforms.

AI Security Scanning

AI-powered security tools for infrastructure, containers, and supply chain.

AI Cost Optimization

AI and automation tools for cloud cost management, FinOps, and resource optimization.

MCP Servers for DevOps

Model Context Protocol servers that give AI assistants like Claude, ChatGPT, and Cursor access to DevOps tools and infrastructure.
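As a rough illustration of what that access looks like on the wire: MCP is built on JSON-RPC 2.0, and a client asks a server to run one of its tools with a `tools/call` request. The sketch below only builds such a message; it does not connect to any server. The tool name `list_pods` and its arguments are hypothetical and are not taken from any server in this list.

```python
import json
from itertools import count

# JSON-RPC request ids should be unique within a session.
_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# Hypothetical tool name and arguments, for illustration only.
message = build_tool_call("list_pods", {"namespace": "default"})
print(message)
```

In practice an MCP client library handles this framing, along with the initialization handshake and tool discovery, so a message like this is rarely written by hand.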

AI-Powered CI/CD

AI tools that enhance continuous integration and delivery pipelines.

AI Log Analysis and Debugging

AI tools for log analysis, pattern detection, and debugging production systems.

AI Agent Frameworks for Infrastructure

General-purpose AI agent frameworks with strong infrastructure and DevOps use cases.

AI for Platform Engineering

AI tools for building internal developer platforms, service catalogs, and self-service infrastructure.

AI for Database Operations

AI tools for database management, query optimization, and data operations.

AI for Networking and Service Mesh

AI tools for network management, service mesh, and traffic engineering.

AI for Container Security and Supply Chain

AI tools for container image security, software supply chain, and build verification.

AI for Chaos Engineering and Reliability

AI tools for chaos engineering, resilience testing, and reliability validation.

AI for Cloud Migration and Modernization

AI tools that assist with cloud migration planning, execution, and application modernization.

AI for GitOps

AI tools for GitOps workflows, declarative infrastructure, and continuous reconciliation.

System Prompt and Config Templates

Ready-to-use AI agent configurations for infrastructure repositories.

Learning Resources

Courses, certifications, articles, and guides on AI for DevOps.

Articles and Guides

Books

Certifications

Podcasts

Community and Newsletters

Communities, forums, and newsletters covering AI and DevOps.

Contributing

Contributions are welcome! Please read the contribution guidelines first.

Join the discussion to suggest tools or ask questions.

Author

Hammad Haqqani - DevOps Architect and Cloud Engineer


Support

If you find this useful, consider buying me a coffee!
