homelab-codex-ws/hosts
Oskar Kapala 039f9f7247 feat(piha): brain-watchdog — external watchdog for control-plane
Polls /summary on VPS over Tailscale every 60s; computes freshness
locally from last_update epoch (never trusts self-reported status).
Alerts via Telegram Bot API directly after 3 consecutive failures;
sends recovery message on heal. State (fail_count, alerted) persisted
to volume so debounce survives restarts.

- services/brain-watchdog/: Python service, no external deps (stdlib only)
- hosts/piha/runtime/brain-watchdog/: override with mem_limit 64m
- hosts/piha/services.yaml + inventory/topology.yaml: manifest entries

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-06-01 17:54:36 +02:00
..
chelsty-ha Fix pending actions: node_exporter, zigbee2mqtt, chelsty-ha monitoring 2026-05-27 15:10:48 +02:00
chelsty-infra feat(ha-diag-agent): test environment with dual HA Docker instances 2026-05-29 12:56:13 +02:00
piha feat(piha): brain-watchdog — external watchdog for control-plane 2026-06-01 17:54:36 +02:00
saturn Add node capability model 2026-05-11 20:46:50 +02:00
solaria fix(node-agent): correct VPS_EVENTS_HOST to actual VPS Tailscale IP 2026-05-27 14:07:27 +02:00
vps vps: mem_limit + oom_score_adj na serwisach in-repo; deploy-local stosuje override (stop OOM) 2026-06-01 14:23:58 +02:00