Claude Code Supervisor
Bridge between Claude Code's lifecycle hooks and your agent harness.
Architecture
Claude Code (in tmux)
โ Stop / Error / Notification
โผ
Bash pre-filter (Option D)
โ obvious cases handled directly
โ ambiguous cases pass through
โผ
Fast LLM triage (claude -p with Haiku, or local LLM)
โ classifies: FINE | NEEDS_NUDGE | STUCK | DONE | ESCALATE
โ FINE โ logged silently
โผ
Notify command (configurable)
โ openclaw wake, webhook, ntfy, script, etc.
โผ
Agent harness decides + acts
โ nudge (send-keys to tmux), wait, escalate to human
Quick Start
1. Install hooks into a project
{baseDir}/scripts/install-hooks.sh /path/to/your/project
Creates:
.claude/hooks/supervisor/โ hook scripts + triage.claude/settings.jsonโ wired into Claude Code lifecycle.claude-code-supervisor.ymlโ configuration (edit this)
2. Configure
Edit .claude-code-supervisor.yml:
triage:
command: "claude -p --no-session-persistence" # or: ollama run llama3.2
model: "claude-haiku-4-20250414"
notify:
command: "openclaw gateway call wake --params" # or: curl, ntfy, script
3. Register a supervised session
Create ~/.openclaw/workspace/supervisor-state.json (or wherever your harness keeps state):
{
"sessions": {
"my-task": {
"socket": "/tmp/openclaw-tmux-sockets/openclaw.sock",
"tmuxSession": "my-task",
"projectDir": "/path/to/project",
"goal": "Fix issue #42",
"successCriteria": "Tests pass, committed",
"maxNudges": 5,
"escalateAfterMin": 60,
"status": "running"
}
}
}
4. Launch Claude Code in tmux
SOCKET="/tmp/openclaw-tmux-sockets/openclaw.sock"
tmux -S "$SOCKET" new -d -s my-task
tmux -S "$SOCKET" send-keys -t my-task "cd /path/to/project && claude 'Fix issue #42'" Enter
Hooks fire automatically. Triage assesses. You get notified only when it matters.
How the Pre-Filter Works (Option D)
Not every hook event needs an LLM call. Bash catches the obvious cases first:
on-stop.sh
| Signal | Bash decision | LLM triage? |
|---|---|---|
max_tokens |
Always needs attention | โ Yes |
end_turn + shell prompt back |
Agent might be done | โ Yes |
end_turn + no prompt |
Agent is mid-work | โ Skip |
stop_sequence |
Normal | โ Skip |
on-error.sh
| Signal | Bash decision | LLM triage? |
|---|---|---|
| API 429 / rate limit | Transient, will resolve | โ Log only |
| API 500 | Agent likely stuck | โ Yes |
| Other tool error | Unknown severity | โ Yes |
on-notify.sh
| Signal | Bash decision | LLM triage? |
|---|---|---|
auth_* |
Internal, transient | โ Skip |
permission_prompt |
Needs decision | โ Yes |
idle_prompt |
Agent waiting | โ Yes |
Triage Classifications
The LLM returns one of:
| Verdict | Meaning | Typical action |
|---|---|---|
| FINE | Agent is working normally | Log silently, no notification |
| NEEDS_NUDGE | Transient error, should continue | Send "continue" to tmux |
| STUCK | Looping or not progressing | Try different approach or escalate |
| DONE | Task completed successfully | Report to human |
| ESCALATE | Needs human judgment | Notify human with context |
Handling Notifications (for agent harness authors)
Wake events arrive with the prefix cc-supervisor: followed by the classification:
cc-supervisor: NEEDS_NUDGE | error:api_500 | cwd=/home/user/project | ...
cc-supervisor: DONE | stopped:end_turn:prompt_back | cwd=/home/user/project | ...
Nudging via tmux
tmux -S "$SOCKET" send-keys -t "$SESSION" "continue โ the API error was transient" Enter
Escalation format
See references/escalation-rules.md for when to nudge vs escalate and quiet hours.
Watchdog (Who Watches the Watchman?)
Hooks depend on Claude Code being alive. If the session hard-crashes, hits account limits, or the process gets OOM-killed, no hooks fire. The watchdog catches this.
scripts/watchdog.sh is a pure bash script (no LLM, no Claude Code dependency) that:
- Reads
supervisor-state.jsonfor all"running"sessions - Checks: is the tmux socket alive? Is the session there? Is Claude Code still running?
- If something is dead and no hook reported it โ notifies via the configured command
- Updates
lastWatchdogAtin state for tracking
Run it on a timer. Choose your poison:
System cron:
*/15 * * * * /path/to/claude-code-supervisor/scripts/watchdog.sh
OpenClaw cron:
{
"schedule": { "kind": "every", "everyMs": 900000 },
"payload": { "kind": "systemEvent", "text": "cc-supervisor: watchdog โ run /path/to/scripts/watchdog.sh and report" },
"sessionTarget": "main"
}
systemd timer, launchd, or whatever runs periodically on your box.
The watchdog is deliberately dumb โ no LLM, no complex logic, just "is the process still there?" This means it works even when the triage model is down, the API is melting, or your account hit its limit. Belts and suspenders.
Files
scripts/install-hooks.shโ one-command setup per projectscripts/hooks/on-stop.shโ Stop event handler with bash pre-filterscripts/hooks/on-error.shโ PostToolUseFailure handler with bash pre-filterscripts/hooks/on-notify.shโ Notification handler with bash pre-filterscripts/triage.shโ LLM triage (called by hooks for ambiguous cases)scripts/lib.shโ shared config loading and notification functionsscripts/watchdog.shโ dead session detector (pure bash, no LLM dependency)references/state-patterns.mdโ terminal output pattern matching guidereferences/escalation-rules.mdโ when to nudge vs escalate vs waitsupervisor.yml.exampleโ example configuration