homelab-codex-ws/hosts/piha/runtime
Oskar Kapala 039f9f7247 feat(piha): brain-watchdog — external watchdog for control-plane
Polls /summary on VPS over Tailscale every 60s; computes freshness
locally from last_update epoch (never trusts self-reported status).
Alerts via Telegram Bot API directly after 3 consecutive failures;
sends recovery message on heal. State (fail_count, alerted) persisted
to volume so debounce survives restarts.
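
The debounce described above can be sketched roughly as follows. This is a minimal illustration, not the service's actual code: the names `State`, `is_fresh`, `step`, `load_state`, `save_state` and the 180-second `MAX_AGE_SECONDS` threshold are assumptions. Only the 3-failure threshold, the `last_update` epoch field, and the persisted `fail_count`/`alerted` pair come from the commit message. HTTP polling and the Telegram call are left out.

```python
import json
from dataclasses import dataclass, asdict
from pathlib import Path

FAIL_THRESHOLD = 3      # "3 consecutive failures", per the commit message
MAX_AGE_SECONDS = 180   # assumed staleness limit; the real value is not stated


@dataclass
class State:
    fail_count: int = 0
    alerted: bool = False


def is_fresh(summary: dict, now: float, max_age: float = MAX_AGE_SECONDS) -> bool:
    """Judge freshness from the last_update epoch only; any self-reported
    status field in the payload is deliberately ignored."""
    last_update = summary.get("last_update")
    if not isinstance(last_update, (int, float)):
        return False
    return now - last_update <= max_age


def step(state: State, healthy: bool):
    """Advance the debounce by one poll.

    Returns (new_state, event), where event is "alert", "recovery" or None.
    """
    if healthy:
        # A recovery message is only due if an alert was sent earlier.
        event = "recovery" if state.alerted else None
        return State(0, False), event
    fail_count = state.fail_count + 1
    if fail_count >= FAIL_THRESHOLD and not state.alerted:
        return State(fail_count, True), "alert"
    # Below the threshold, or already alerted: stay quiet.
    return State(fail_count, state.alerted), None


def load_state(path: Path) -> State:
    """Read persisted state; fall back to a clean state if missing or corrupt."""
    try:
        raw = json.loads(path.read_text())
        return State(int(raw["fail_count"]), bool(raw["alerted"]))
    except (OSError, ValueError, KeyError, TypeError):
        return State()


def save_state(path: Path, state: State) -> None:
    """Write to a temporary file and rename it, so a crash cannot leave a
    half-written state file behind."""
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps(asdict(state)))
    tmp.replace(path)
```

In a polling loop, each 60-second tick would fetch `/summary`, treat a failed fetch or a stale `last_update` as unhealthy, call `step`, send a message when the event is not `None`, and then call `save_state` so the counters survive a container restart.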

- services/brain-watchdog/: Python service, no external deps (stdlib only)
- hosts/piha/runtime/brain-watchdog/: override with mem_limit 64m
- hosts/piha/services.yaml + inventory/topology.yaml: manifest entries

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-06-01 17:54:36 +02:00
agent-system | Fix Copy for AI: materializer fetches from control-plane API instead of Redis | 2026-05-27 16:07:51 +02:00
brain-watchdog | feat(piha): brain-watchdog — external watchdog for control-plane | 2026-06-01 17:54:36 +02:00
node-agent | fix(node-agent): correct VPS_EVENTS_HOST to actual VPS Tailscale IP | 2026-05-27 14:07:27 +02:00
stability-agent | Roll out stability agent to homelab nodes | 2026-05-17 15:54:19 +02:00