Wachd receives alerts from Grafana, Datadog, and Prometheus and tells you WHY it fired — not just that it fired. Self-hosted on Kubernetes. Your data never leaves your cluster.

3am. Your phone rings. You open the alert.
It says: “HighErrorRate firing.”
You still don't know where to start.
Wachd already ran the diagnosis while the alert was routing to you.
Your monitoring tool sends a webhook to Wachd. Signature validated, event queued in Redis.
Last 10 commits from GitHub. 30 min of error logs from Loki or Datadog. Metric history around the alert window.
PII stripped first. Then Ollama, vLLM, or any OpenAI-compatible model in-cluster — or Claude, OpenAI, or Gemini — builds a plain-English root cause with a suggested action.
SMS, voice call, email, and Slack — in the order you configured. Escalation if not acknowledged. Cancel when resolved.
Correlates commits, logs, and metrics into a 2-sentence answer. Powered by Ollama, Claude, OpenAI, or Gemini — your choice.
Primary, secondary, and custom rotation layers. Time-window restrictions (business hours, nights, weekends). Self-service overrides without admin.
Email immediately. SMS at the same time. Voice call after 10 minutes if unacknowledged. Each engineer sets their own preference.
Run Ollama, vLLM, or any OpenAI-compatible model in-cluster with no outbound API calls. Fully functional in isolated VPCs and classified environments.
Entra ID, Okta, and Google Workspace group mappings. Local accounts always available as break-glass. Both are first-class.
Emails, IPs, API keys, and account IDs are stripped before any data touches the AI backend. Non-negotiable, runs synchronously.
Daily scan against NVD, OSV, and CISA KEV. Cross-referenced against your team's actual dependency inventory. Auto-opens fix PRs.
Every team is fully isolated. Team A cannot see Team B's incidents, schedules, or data sources. Enforced at every query.
Helm chart with sane defaults. Supports external RDS/ElastiCache or in-cluster Postgres/Redis. Apache 2.0, no vendor lock-in.
Wachd connects to your infrastructure to understand it — not to change it. It cannot deploy, restart, scale, or modify anything in your environment. Ever.
contents:read scope only. Wachd reads commits and lock files. It cannot push, open PRs, or modify any repository.
Query-only access to Loki, Datadog, Prometheus, and Grafana. No write operations, no configuration changes.
Wachd runs inside your cluster and makes outbound API calls only. No inbound access to your services beyond the webhook endpoint.
Wachd ships with first-class Ollama support. Run Llama, Mistral, or any open model in-cluster — incident data never reaches an external API. One config line to switch backends.
# values.yaml — switch AI backend with one line
ai:
backend: ollama
model: llama3.2
ollamaEndpoint: http://ollama:11434
# vLLM, TGI, or any OpenAI-compatible endpoint:
ollamaEndpoint: http://my-vllm-service:8000
# or switch to cloud AI:
backend: claude # ollama | vllm | claude | openai | gemini
Works with your existing stack
Start free and self-hosted. Pay only when you need more.
Free
Self-hosted, unlimited teams
$99/mo
For growing engineering teams
Custom
For regulated and large-scale orgs
Email sales@wachd.io to join the waitlist
We're giving 10 engineering teams a lifetime Wachd SMB license at no cost. No monthly fee. No expiry. In exchange: use it seriously and tell us what's broken.
Or email sales@wachd.io with your company name
Deploy in under 30 minutes on any Kubernetes cluster. No vendor lock-in. No data leaves your infrastructure.
Deploy free on Kubernetes →Apache 2.0 · Self-hosted · No account required