HarborForge.PlexumPlugin

6 Commits 1 Branch 0 Tags

Author	SHA1	Message	Date
hzhang	046a7753e6	log monitor push start + slow heartbeat First successful push emits an info-level "monitor push started" so operators can confirm the loop wired up correctly. Subsequent successes log every 60 cycles ("monitor push heartbeat") so the journal stays quiet but still proves the loop isn't dead. Errors already log at warn; this fills the success-side gap so a silent journal can't hide a "no successes, no errors" pathology.	2026-06-03 13:13:33 +01:00
hzhang	6e3ad669f8	feat(monitor): active push loop replacing standalone monitor Adds a periodic POST loop to <backend>/monitor/server/heartbeat so HF plugin can take over the standalone harborforge-monitor daemon's job — same X-API-Key header, same flat telemetry shape (cpu_pct / mem_pct / disk_pct / swap_pct / load_avg / uptime_seconds / plugin_version / agents[]). HF backend stays unchanged. Config: monitor_push_enabled (default false; opt-in to avoid surprise heartbeats from existing deployments), monitor_push_interval_seconds (default 30), reuses apiKey for the X-API-Key header. Lift the container's HF_MONITER_API_KEY into config.apiKey, flip monitor_push_enabled true, then docker rm -f the container — DB last_seen_at keeps advancing under the plugin's loop. Collector grew swap + cpu sampling (two reads of /proc/stat over a 1-second window when SampleCPU=true). Bridge endpoint stays cheap (SampleCPU=false on demand); push loop is the only caller paying the sampling cost. E2E in sim: monitor_push_enabled=true + apiKey from injected MonitoredServer row → server_states.last_seen_at advances exactly every interval_seconds (10s configured, 10s observed). cpu/mem/disk/ swap_pct all populate correctly.	2026-06-03 13:04:51 +01:00
hzhang	472cecd771	feat: read agent_id from ctx (SDK now plumbs it) Plexum-sdk-go now propagates the caller agent id via `_meta.agent_id` on tools/call. AgentIDFromCtx prefers plugin.AgentIDFromContext(ctx); falls back to the single-active-calendar-slot heuristic for host-driven dispatch paths (channel manager, CLI plugin-call) that lack ctx. Drops bestEffortAgentID — the inline closure does the same thing without the dead-Slot-iterate noise.	2026-06-03 12:54:57 +01:00
hzhang	bc1ab7b6ea	fix: snake_case SlotStatus + scheduler debug logs Two issues found while end-to-end testing against a running harborforge-backend: - SlotStatus enum values: backend stores snake_case ("not_started" / "ongoing" / …), not the camelCase the OpenClaw plugin's TypeScript types.ts misled the initial drop into using. Heartbeat responses came back with Slot.Status="not_started" which the scheduler never matched against SlotStatus("NotStarted"), so dispatchSlot never fired. Aligned with backend's actual enum string values (verified via heartbeat response shape). - Added info-level logs at slot selection + dispatchSlot entry + WakeAgent fire/result so operators can see the plugin's decision chain in production without enabling debug. Cheap (~one tick per agent per heartbeat interval). E2E in sim: backend returns slots=1 → selection chosen=true → dispatch enter → WakeAgent enqueued ok → backend slot ongoing → next heartbeat returns slots=0.	2026-06-03 11:42:18 +01:00
hzhang	78b1ec5181	fix: align calendar API with actual HarborForge.Backend contract Initial drop guessed the heartbeat shape; sim e2e against a running harborforge-backend revealed the real contract is per-agent with header auth, not server-wide with bearer: POST /calendar/agent/heartbeat headers: X-Agent-ID, X-Claw-Identifier body: {claw_identifier, agent_id} response: {slots: [Slot], agent_status, message?} PATCH /calendar/slots/{id}/agent-update PATCH /calendar/slots/virtual/{vid}/agent-update body: {status, started_at?, actual_duration?} POST /calendar/agent/status body: {claw_identifier, agent_id, status} Refactors: - internal/calendar/types.go now mirrors OpenclawPlugin/calendar/ types.ts 1:1 (SlotStatus camelCase, real vs virtual slot id discrimination, event_data shape) - internal/calendar/bridge.go: header-based auth, per-agent method signatures, separate UpdateRealSlot vs UpdateVirtualSlot - internal/calendar/scheduler.go: per-agent heartbeat loop (one HTTP call per agent per tick), highest-priority slot selection, agent-update PATCH for terminal/non-terminal states - SingleActiveAgentID helper for main.bestEffortAgentID Also fix two bugs found in sim: - bgCtx capture: AgentLister closures were capturing Init's ctx which dies the moment MCP initialize returns; switched to bgCtx (lifetime = plugin process) - tools.toolRestartStatus referenced a non-existent sch.RestartPending — HF backend has no restart endpoint per /openapi.json, so the tool now reports last_heartbeats freshness Scheduler logs each tick + each heartbeat outcome at info so operators can see backend connectivity without enabling debug. E2E against http://harborforge-backend:8000 in sim: daemon → heartbeat → 404 "Agent not found" (= correct endpoint, correct headers, correct body — agent just isn't registered yet, which is expected for an untenanted plugin)	2026-06-03 11:28:05 +01:00
hzhang	754e5183f7	initial: HarborForge plugin for Plexum (port of OpenclawPlugin) Plugin id `harbor-forge` mirrors the OpenClaw counterpart's runtime surface on top of the Plexum SDK: * eager activation — Monitor bridge + Calendar scheduler boot at host start, before any agent turn fires * monitor bridge: HTTP 127.0.0.1:<monitor_port> serving /telemetry + /health for HarborForge.Monitor * calendar scheduler: heartbeats <backendUrl>/calendar/agent/ heartbeat, dispatches returned slots via HostAPI.WakeAgent (state-aware queue, depth-1 replace-newest), tracks active slot state in-memory, terminal status pushed back to backend * 9 harborforge_* tools (status / telemetry / monitor_telemetry / calendar_{status,complete,abort,pause,resume} / restart_status) Key differences from OpenClaw equivalent: * api.spawn → HostAPI.WakeAgent (new SDK primitive) * api.getAgentStatus → HostAPI.ReadAgentState (existing) * --install-monitor / --install-cli not included; Monitor + hf CLI deploy via the HangmanLab.Server.T3 docker compose layer Initial drop. TODO before v1 ship: * tool ctx → calling-agent-id: SDK doesn't currently expose; v1 falls back to a single-active-slot heuristic in main.bestEffortAgentID * tests for the bridge + scheduler	2026-06-03 11:11:36 +01:00