Update incident action items: 4/7 complete

Done: disaster recovery runbook, fallback DNS resolvers, sqlite chmod
fix (mcdsl v1.8.0), UID stabilization.
Remaining: agent recover command, boot sequencing, multi-address nodes.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
2026-04-03 09:33:41 -07:00
parent 5aceb496e8
commit 02cfb27937

View File

@@ -217,13 +217,14 @@ ownership changes.
## Action Items ## Action Items
- [ ] Write disaster recovery runbook (separate document) - [x] Write disaster recovery runbook `docs/disaster-recovery.md`
- [ ] Add fallback DNS resolvers to NixOS config (1.1.1.1, 8.8.8.8) - [x] Add fallback DNS resolvers to NixOS config → all nodes now have
1.1.1.1 and 8.8.8.8 as fallbacks after MCNS
- [ ] Implement `mcp agent recover` command (recreate containers from - [ ] Implement `mcp agent recover` command (recreate containers from
registry) registry)
- [ ] Implement boot sequencing in the agent (Phase 3+ architecture) - [ ] Implement boot sequencing in the agent (Phase 4+ architecture)
- [ ] Investigate modernc.org/sqlite `fchmod` workaround for rootless - [x] Fix modernc.org/sqlite `fchmod` → was our own `os.Chmod` in
podman `mcdsl/db/db.go`, not sqlite. Made best-effort in mcdsl v1.8.0.
- [ ] Add multi-address support to node config (Tailnet IP + LAN IP - [ ] Add multi-address support to node config (Tailnet IP + LAN IP
+ hostname) + hostname)
- [ ] Revert mcp UID to a stable value and never change it again - [x] Stabilize mcp UID → pinned at 850 with NEVER CHANGE comment