
Atlassian Opsgenie
Service-aware alerting, on-call, and incident orchestration.
Discover top open-source software, updated regularly with real-world adoption signals.

AI-driven agent that diagnoses cloud incidents and suggests fixes.
HolmesGPT connects LLMs with live observability data to automatically investigate alerts, pinpoint root causes, and recommend remediations across dozens of cloud and monitoring tools.

HolmesGPT is an AI‑powered assistant designed for SREs and incident responders. It links large language models with real‑time observability data, enabling automatic investigation of alerts and generation of clear, actionable remediation steps.
The agent operates through an agentic loop, pulling information from over 20 built‑in integrations—including Kubernetes, Prometheus, ArgoCD, AWS RDS, and many logging and tracing systems. It can fetch alerts from sources like AlertManager, PagerDuty, or OpsGenie, analyze logs, metrics, and configuration, then write the findings back to Slack, ticketing platforms, or GitHub pull requests.
HolmesGPT is available as a command‑line tool and via the Robusta SaaS platform. Users configure their LLM provider with an API key and can extend functionality with custom data sources or runbooks through simple YAML files or the SaaS UI.
When teams consider HolmesGPT, these hosted platforms usually appear on the same shortlist.
Looking for a hosted option? These are the services engineering teams benchmark against before choosing open source.
Alert investigation from Prometheus
HolmesGPT fetches the alert, queries metrics and logs, identifies the failing pod, and posts a concise root‑cause summary to AlertManager.
PagerDuty incident remediation
The agent retrieves the incident details, analyzes related logs, suggests a fix, and adds the recommendation as a comment on the PagerDuty ticket.
Kubernetes health check
By asking “what pods are unhealthy and why?”, HolmesGPT aggregates pod status, recent events, and log snippets to pinpoint the problematic service.
Custom runbook execution
When a known alert pattern matches a user‑provided runbook, HolmesGPT follows the steps automatically and reports completion.
Use the official CLI installer documented at holmesgpt.dev/installation/cli-installation or access the Robusta SaaS platform.
Any provider that offers an HTTP API key; configuration details are in the LLM Providers documentation.
Yes, it can post analysis to Slack, PagerDuty, OpsGenie, Jira, and GitHub (some integrations are currently beta).
Provide a YAML toolset file via the `-t` flag for the CLI or upload it through the Robusta UI.
The core project is open source; usage of external LLM APIs may incur costs, and the Robusta SaaS offers a free trial tier.
Project at a glance
ActiveLast synced 4 days ago