The agent connects to mc-proxy via Unix socket and automatically
registers and removes routes during deploy and stop, eliminating the
need for manual mcproxyctl invocations or hand-editing of the proxy TOML.
- New ProxyRouter abstraction wraps mc-proxy client library
- Deploy: after container starts, registers routes with mc-proxy
using host ports from the registry
- Stop: removes routes from mc-proxy before stopping container
- Config: [mcproxy] section with socket path and cert_dir
- Nil-safe: if the mc-proxy socket is not configured, route registration
is silently skipped (backward compatible)
- L7 routes use certs from convention path (<cert_dir>/<service>.pem)
- L4 routes use TLS passthrough (backend_tls=true)
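A minimal sketch of the agent config this adds; the message names only the
socket path and cert_dir keys, so the exact key name `socket` and the sample
paths here are assumptions:

```toml
# Hypothetical [mcproxy] section in the agent config.
# Omitting the section entirely disables route registration (nil-safe path).
[mcproxy]
socket = "/run/mc-proxy/control.sock"  # Unix socket the agent connects to
cert_dir = "/srv/certs"                # L7 certs resolved as <cert_dir>/<service>.pem
```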
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Extends MCP to own the full build-push-deploy lifecycle. When deploying,
the CLI checks whether each component's image tag exists in the registry
and builds and pushes it automatically if it is missing and build config
is present.
- Add Build, Push, ImageExists to runtime.Runtime interface (podman impl)
- Add mcp build <service>[/<image>] command
- Add [build] section to CLI config (workspace path)
- Add path and [build.images] to service definitions
- Wire auto-build into mcp deploy before agent RPC
- Update ARCHITECTURE.md with runtime interface and deploy auto-build docs
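The auto-build wiring above can be sketched as below. The method names
Build/Push/ImageExists come from the commit; the signatures, the
`ensureImage` helper, and the fake runtime are assumptions for illustration:

```go
package main

import "fmt"

// Runtime mirrors the Build/Push/ImageExists additions to the runtime
// interface; signatures are assumed for this sketch.
type Runtime interface {
	ImageExists(ref string) (bool, error)
	Build(path, ref string) error
	Push(ref string) error
}

// ensureImage builds and pushes ref only when it is missing from the
// registry and a build path is configured.
func ensureImage(rt Runtime, ref, buildPath string) error {
	exists, err := rt.ImageExists(ref)
	if err != nil {
		return err
	}
	if exists || buildPath == "" {
		return nil // image already present, or no build config: skip
	}
	if err := rt.Build(buildPath, ref); err != nil {
		return err
	}
	return rt.Push(ref)
}

// fakeRuntime records which steps ran, standing in for the podman impl.
type fakeRuntime struct{ built, pushed bool }

func (f *fakeRuntime) ImageExists(string) (bool, error) { return false, nil }
func (f *fakeRuntime) Build(string, string) error       { f.built = true; return nil }
func (f *fakeRuntime) Push(string) error                { f.pushed = true; return nil }

func main() {
	rt := &fakeRuntime{}
	if err := ensureImage(rt, "registry.example/web:1.2.3", "./web"); err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(rt.built, rt.pushed) // prints "true true"
}
```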
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Phase A complete: route declarations, port allocation, $PORT env vars.
Phase B in progress: agent mc-proxy route registration.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Service definitions can now declare routes per component instead of
manual port mappings:

    [[components.routes]]
    name = "rest"
    port = 8443
    mode = "l4"
The agent allocates free host ports at deploy time and injects
$PORT/$PORT_<NAME> env vars into containers. Backward compatible:
components with old-style ports= mappings work unchanged.
Changes:
- Proto: RouteSpec message, routes + env fields on ComponentSpec
- Servicedef: RouteDef parsing and validation from TOML
- Registry: component_routes table with host_port tracking
- Runtime: Env field on ContainerSpec, -e flag in BuildRunArgs
- Agent: PortAllocator (random 10000-60000, availability check),
deploy wiring for route→port mapping and env injection
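The allocator and env injection can be sketched as follows. The 10000-60000
range and the availability check come from the commit; the retry count, the
bind-to-check approach, and the exact env-var formatting are assumptions:

```go
package main

import (
	"fmt"
	"math/rand"
	"net"
	"strings"
)

// allocatePort picks a random host port in [10000, 60000) and verifies it
// is free by briefly binding it. Retry limit is an illustrative choice.
func allocatePort() (int, error) {
	for i := 0; i < 100; i++ {
		port := 10000 + rand.Intn(50000)
		l, err := net.Listen("tcp", fmt.Sprintf(":%d", port))
		if err != nil {
			continue // port in use, try another
		}
		l.Close()
		return port, nil
	}
	return 0, fmt.Errorf("no free port found after 100 attempts")
}

// routeEnv builds the $PORT_<NAME> vars injected into containers, plus a
// plain $PORT when the component has a single route (assumed convention).
func routeEnv(ports map[string]int) []string {
	env := make([]string, 0, len(ports)+1)
	for name, p := range ports {
		env = append(env, fmt.Sprintf("PORT_%s=%d", strings.ToUpper(name), p))
	}
	if len(ports) == 1 {
		for _, p := range ports {
			env = append(env, fmt.Sprintf("PORT=%d", p))
		}
	}
	return env
}

func main() {
	p, err := allocatePort()
	if err != nil {
		panic(err)
	}
	fmt.Println(p >= 10000 && p < 60000) // prints "true"
	fmt.Println(routeEnv(map[string]int{"rest": p}))
}
```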
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Drop uses_mcdsl, full image URLs, ports, network, user, restart.
Add route declarations and service-level version. Image names and
most config are now derived from conventions.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Service definitions now include [build] config (path, uses_mcdsl,
images) so MCP owns the full build-push-deploy lifecycle, replacing
mcdeploy.toml. Documents mcp build, mcp sync auto-build, image
versioning policy (explicit tags, never :latest), and workspace
convention.
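A possible shape for the documented [build] section; only path, uses_mcdsl,
and images are named in the message, so the sample values and the registry
host are placeholders:

```toml
# Hypothetical service definition fragment.
[build]
path = "web"        # build context, relative to the workspace root
uses_mcdsl = true

[build.images]
web = "registry.example/metacrypt-web:1.4.0"  # explicit tag, never :latest
```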
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Exposes two packages:
- default (mcp CLI) for operator workstations
- mcp-agent for managed nodes
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add PurgeComponent RPC to the agent service that removes stale registry
entries for components that are both gone (observed state is removed,
unknown, or exited) and unwanted (not in any current service definition).
Refuses to purge components with running or stopped containers. When all
components of a service are purged, the service row is deleted too.
Supports --dry-run to preview without modifying the database.
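The gone-and-unwanted rule above reduces to a small predicate. State names
are taken from the message; the helper itself is an illustrative sketch, not
the actual handler:

```go
package main

import "fmt"

// shouldPurge returns true only when a component is both gone (observed
// state removed, unknown, or exited) and unwanted (absent from every
// current service definition). Running or stopped containers are refused.
func shouldPurge(observedState string, inServiceDef bool) bool {
	gone := observedState == "removed" ||
		observedState == "unknown" ||
		observedState == "exited"
	return gone && !inServiceDef
}

func main() {
	fmt.Println(shouldPurge("exited", false))  // prints "true": stale entry
	fmt.Println(shouldPurge("running", false)) // prints "false": container running
	fmt.Println(shouldPurge("exited", true))   // prints "false": still wanted
}
```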
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Purge removes stale registry entries — components that are no longer
in service definitions and have no running container. Designed as an
explicit, safe operation separate from sync: sync is additive (push
desired state), purge is subtractive (remove forgotten entries).
Includes safety rules (refuses to purge running containers), dry-run
mode, agent RPC definition, and rationale for why sync should not be
made destructive.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Documents Phase 6 (deployment), bugs fixed during rollout,
remaining work organized by priority (operational, quality,
design, infrastructure), and current platform state.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Token files with trailing newlines caused gRPC "non-printable ASCII
characters" errors in the authorization header.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Extract ContainerNameFor and SplitContainerName into names.go.
ContainerNameFor handles single-component services where service
name equals component name (e.g., mc-proxy → "mc-proxy" not
"mc-proxy-mc-proxy"). SplitContainerName checks known services
from the registry before falling back to naive split on "-", fixing
mc-proxy being misidentified as service "mc" component "proxy".
Also fixes podman ps JSON parsing (the Command field is []string, not
string), a bug found during deployment.
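The two helpers can be sketched as below. The names and the mc-proxy
collapsing rule come from the commit; the knownServices parameter is an
assumption about how the registry lookup is threaded in:

```go
package main

import (
	"fmt"
	"strings"
)

// ContainerNameFor follows the <service>-<component> convention, collapsing
// the single-component case where the service and component names match.
func ContainerNameFor(service, component string) string {
	if service == component {
		return service // e.g. mc-proxy, not mc-proxy-mc-proxy
	}
	return service + "-" + component
}

// SplitContainerName checks known services from the registry first, then
// falls back to a naive split on "-". This keeps mc-proxy from being
// misread as service "mc", component "proxy".
func SplitContainerName(name string, knownServices []string) (service, component string) {
	for _, s := range knownServices {
		if name == s {
			return s, s
		}
		if strings.HasPrefix(name, s+"-") {
			return s, strings.TrimPrefix(name, s+"-")
		}
	}
	parts := strings.SplitN(name, "-", 2)
	if len(parts) == 2 {
		return parts[0], parts[1]
	}
	return name, name
}

func main() {
	fmt.Println(ContainerNameFor("mc-proxy", "mc-proxy")) // prints "mc-proxy"
	svc, comp := SplitContainerName("mc-proxy", []string{"mc-proxy"})
	fmt.Println(svc, comp) // prints "mc-proxy mc-proxy"
}
```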
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Critical fixes:
- Wire monitor subsystem to agent startup (was dead code)
- Implement NodeStatus RPC (disk, memory, CPU, runtime version, uptime)
- Deploy respects active=false (sets desired_state=stopped, not always running)
Medium fixes:
- Add Started field to runtime.ContainerInfo, populate from podman inspect
- Populate ComponentInfo.started in status handlers for uptime display
- Add Monitor field to Agent struct for graceful shutdown
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Move each command function from main.go into its own file
(deploy.go, lifecycle.go, status.go, etc.) to enable parallel
development by multiple workers without file conflicts.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Agent (P2.1): Agent struct with registry DB, runtime, and logger.
gRPC server with TLS 1.3 and MCIAS auth interceptor. Graceful
shutdown on SIGINT/SIGTERM. All RPCs return Unimplemented until
handlers are built in P2.2-P2.9.
CLI (P3.1): Full command tree with all 15 subcommands as stubs
(login, deploy, stop, start, restart, list, ps, status, sync,
adopt, service show/edit/export, push, pull, node list/add/remove).
gRPC dial helper with TLS, CA cert, and bearer token attachment.
Both gates for parallel Phase 2+3 work are now open.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Go module, Makefile with standard targets, golangci-lint v2 config,
CLAUDE.md, and empty CLI/agent binaries. Build, vet, and lint all pass.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
30 discrete tasks across 5 phases, with dependency graph and
parallelism analysis. Phase 1 (5 core libraries) is fully parallel.
Phases 2+3+4 (agent handlers, CLI commands, deployment artifacts)
support up to 8+ concurrent engineers/agents. Critical path is
proto → registry + runtime → agent deploy → integration tests.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Incorporates all 14 items from DESIGN_AUDIT.md: node registry in CLI
config, container naming convention (<service>-<component>), active
state semantics, adopt by service prefix, EventInfo service field,
version from image tag, snapshot/backup timer, exec-style alert
commands, overlay-only bind address, RPC audit logging, /srv/ ownership,
rootless podman UID mapping docs.
Three minor fixes from final review: stale adopt syntax in bootstrap
section, explicit container naming in deploy flow, clarify that list/ps
query all registered nodes.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Engineering and security review of the rewritten ARCHITECTURE.md.
14 issues identified and resolved: node registry in CLI config,
active/desired_state semantics, container naming convention
(<service>-<component>), exec-style alert commands to prevent
injection, agent binds to overlay IP only (not behind MC-Proxy),
RPC audit logging, /srv/ owned by mcp user, and others.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Major design changes from the review:
- Merge agent and watcher into a single smart per-node daemon
- CLI is a thin client with no database; service definition files
are the operator's source of truth for desired state
- Registry database lives on the agent, not the CLI
- Rename containers to components; components are independently
deployable within a service (mcp deploy metacrypt/web)
- active: true/false in service definitions; desired_state values
are running/stopped/ignore
- Server-side TLS + bearer token (not mTLS)
- Dedicated mcp user with rootless podman
- CLI commands: list (registry), ps (live), status (drift+events),
sync (push desired state)
- Agent reports node resources (disk, memory, CPU) for future scheduling
- Agent is gRPC-only (deliberate exception to REST+gRPC parity rule)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Documents 12 issues found during critical review of ARCHITECTURE.md
and their resolutions: merged agent/watcher into single smart daemon,
components model for independent deploy within services, database
lives on agent not CLI, TLS+bearer (not mTLS), desired_state=ignore
for unmanaged containers, and other clarifications.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Design spec for the Metacircular Control Plane covering master/agent
architecture, service registry with desired/observed state tracking,
container lifecycle management, service definition files, single-file
transfer scoped to /srv/<service>/, and continuous monitoring via
mcp watch with event logging and alerting.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>