mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Teknium 97d54f0e4d fix(terminal): three-layer defense against watch_patterns notification spam (#15642)

* fix(terminal): three-layer defense against watch_patterns notification spam

Background processes that stack notify_on_complete=True with watch_patterns
can flood the user with duplicate, delayed notifications — matches deliver
asynchronously via the completion queue and continue arriving minutes after
the process has exited. The docstring warning against this (PR #12113) has
proven insufficient; agents still misuse the combination.

Three layered defenses, each sufficient on its own:

1. Mutual exclusion (terminal_tool.py): When both flags are set on a
   background process, drop watch_patterns with a warning. notify_on_complete
   wins because 'let me know when it's done' is the more useful signal and
   fires exactly once. Extracted as _resolve_notification_flag_conflict() so
   the rule is testable in isolation.

2. Suppress-after-exit (process_registry.py): _check_watch_patterns() now
   bails the moment session.exited is True. Post-exit chunks (buffered reads
   draining after the process is gone) no longer produce notifications. This
   is the fix flagged as future work in session 20260418_020302_79881c.

3. Global circuit breaker (process_registry.py): Per-session rate limits don't
   catch the sibling-flood case — N concurrent processes can each stay under
   8/10s and still collectively spam. New WATCH_GLOBAL_MAX_PER_WINDOW=15 cap
   trips a 30-second cooldown across ALL sessions, emits a single
   watch_overflow_tripped event, silently counts dropped events, and emits a
   watch_overflow_released summary when the cooldown ends.
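The first two defenses can be sketched roughly as follows. This is an illustrative sketch, not the code from terminal_tool.py or process_registry.py: the function signatures, the Session stand-in class, and the plain substring matching are all assumptions made for the example.

```python
import logging
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

logger = logging.getLogger(__name__)


def resolve_notification_flag_conflict(
    background: bool,
    notify_on_complete: bool,
    watch_patterns: Optional[List[str]],
) -> Tuple[bool, Optional[List[str]]]:
    """Return the effective (notify_on_complete, watch_patterns) pair.

    Hypothetical signature: the real helper may take different arguments.
    """
    if background and notify_on_complete and watch_patterns:
        # notify_on_complete wins; watch_patterns is dropped with a warning.
        logger.warning(
            "watch_patterns dropped: it cannot be combined with notify_on_complete"
        )
        return True, None
    return notify_on_complete, watch_patterns


@dataclass
class Session:
    # Hypothetical stand-in for the real process session object.
    watch_patterns: List[str] = field(default_factory=list)
    exited: bool = False


def check_watch_patterns(session: Session, chunk: str) -> List[str]:
    """Return the patterns found in chunk, or nothing once the process exited."""
    if session.exited:
        # Buffered output draining after exit must never notify.
        return []
    # Assumption: plain substring matching, chosen only to keep the sketch short.
    return [p for p in session.watch_patterns if p in chunk]
```

Keeping the resolver a pure function of its inputs is what makes the rule testable in isolation, as the commit message notes.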

Also updates the tool schema + docstring to document the new behavior.

Tests: 8 new tests covering all three fixes (suppress-after-exit x2,
mutual-exclusion resolver x4, global breaker trip/cooldown/release x2).
All 60 tests across test_watch_patterns.py, test_notify_on_complete.py,
test_terminal_tool.py pass.

Real-world trigger: self-inflicted in session 20260425_051924. Three
concurrent hermes-sweeper review subprocesses each set watch_patterns=
['failed validation', 'errored'] AND notify_on_complete=True, then iterated
over multiple items, producing enough matches in aggregate to get past the
per-session cap, with no global cap yet in place to catch them.

* fix(terminal): aggressive 1-per-15s watch_patterns rate limit + strike-3 promotion

Per Teknium's direction, the watch_patterns rate limit is now much more
aggressive and self-healing.

## New rule — per session

- HARD cap: 1 watch-match notification per 15 seconds per process.
- Any match arriving inside the cooldown window is dropped and counts as
  ONE strike for that window (many drops in the same window still = 1 strike).
- After 3 consecutive strike windows, watch_patterns is permanently disabled
  for the session and the session is auto-promoted to notify_on_complete
  semantics — exactly one notification when the process actually exits.
- A cooldown window that expires with zero drops resets the consecutive
  strike counter — healthy cadence is forgiven.
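The per-session rule above can be modeled as a small state machine. The sketch below is one possible reading of it, not the ProcessSession implementation: the class name, field names, and the choice to count a strike at the first drop of a window are assumptions, and the clock is passed in explicitly so the behavior is deterministic.

```python
from dataclasses import dataclass

COOLDOWN_SECONDS = 15.0          # 1 notification per 15 seconds
MAX_CONSECUTIVE_STRIKES = 3      # strike 3 disables watch_patterns


@dataclass
class WatchLimiter:
    """Hypothetical per-session limiter; names are illustrative only."""

    cooldown_until: float = 0.0
    strike_candidate: bool = False   # a drop already happened in this window
    consecutive_strikes: int = 0
    disabled: bool = False           # watch_patterns permanently off
    notify_on_complete: bool = False

    def on_match(self, now: float) -> bool:
        """Return True if the match at time `now` should be delivered."""
        if self.disabled:
            return False
        if now < self.cooldown_until:
            # Inside the cooldown: drop. Many drops in one window = one strike.
            if not self.strike_candidate:
                self.strike_candidate = True
                self.consecutive_strikes += 1
                if self.consecutive_strikes >= MAX_CONSECUTIVE_STRIKES:
                    # Promote to a single notification at process exit.
                    self.disabled = True
                    self.notify_on_complete = True
            return False
        # The previous window has expired; a clean window forgives past strikes.
        if not self.strike_candidate:
            self.consecutive_strikes = 0
        self.strike_candidate = False
        self.cooldown_until = now + COOLDOWN_SECONDS
        return True
```

For example, matches at t=0, 1, 16, 17, 32, and 33 seconds deliver at 0, 16, and 32, drop the rest, and the drop at t=33 is the third consecutive strike, so the session is promoted.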

## Schema + docstring rewritten

The tool schema description now gives the model explicit guidance:
- notify_on_complete is 'the right choice for almost every long-running task'
- watch_patterns is for RARE one-shot signals on LONG-LIVED processes
- Do NOT use watch_patterns with loops/batch jobs — error patterns fire every
  iteration and will hit the strike limit fast
- Mutual exclusion is stated on both parameter descriptions
- 1/15s cooldown and 3-strike promotion are stated in the watch_patterns
  description so the model sees the contract every turn

## Removed

- WATCH_MAX_PER_WINDOW (8/10s) and WATCH_OVERLOAD_KILL_SECONDS (45) — the
  new 1/15s limit subsumes both; keeping them would double-count.
- _watch_window_hits / _watch_window_start / _watch_overload_since fields
  on ProcessSession. Replaced by _watch_last_emit_at / _watch_cooldown_until
  / _watch_strike_candidate / _watch_consecutive_strikes.

## Kept

- Global circuit breaker across all sessions (15/10s → 30s cooldown) as a
  secondary safety net for concurrent siblings. Still valuable when 20
  short-lived processes each fire once — none individually violates the
  per-session limit.
- Suppress-after-exit guard.
- Mutual exclusion resolver at the tool entry point.
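As a rough illustration of the retained global circuit breaker (15 notifications per 10 seconds, then a 30-second cooldown), the sketch below uses a sliding window shared by all sessions. Only WATCH_GLOBAL_MAX_PER_WINDOW and the two event names come from this commit message; the class, the other constants' names, the event dict shape, and the lazy release (the summary is emitted on the first match after the cooldown rather than by a timer) are assumptions.

```python
from collections import deque
from typing import Deque, Dict, List

WATCH_GLOBAL_MAX_PER_WINDOW = 15
WINDOW_SECONDS = 10.0     # assumed constant name
COOLDOWN_SECONDS = 30.0   # assumed constant name


class GlobalWatchBreaker:
    """Hypothetical breaker shared across all sessions."""

    def __init__(self) -> None:
        self.recent: Deque[float] = deque()   # delivery timestamps in the window
        self.cooldown_until = 0.0
        self.tripped = False
        self.dropped = 0
        self.events: List[Dict[str, object]] = []

    def allow(self, now: float) -> bool:
        """Return True if a watch notification may be delivered at `now`."""
        if self.tripped:
            if now < self.cooldown_until:
                self.dropped += 1             # counted silently
                return False
            # Cooldown over: emit one summary, then resume normal operation.
            self.events.append(
                {"type": "watch_overflow_released", "dropped": self.dropped}
            )
            self.tripped = False
            self.dropped = 0
            self.recent.clear()
        # Forget deliveries that fell out of the sliding window.
        while self.recent and now - self.recent[0] >= WINDOW_SECONDS:
            self.recent.popleft()
        if len(self.recent) >= WATCH_GLOBAL_MAX_PER_WINDOW:
            self.tripped = True
            self.cooldown_until = now + COOLDOWN_SECONDS
            self.dropped = 1                  # the match that tripped the breaker
            self.events.append({"type": "watch_overflow_tripped"})
            return False
        self.recent.append(now)
        return True
```

Because the breaker looks only at aggregate delivery times, it catches the case where many sibling processes each fire once and none of them violates the per-session limit.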

## Tests

- 6 new tests in TestPerSessionRateLimit covering: first match delivers,
  second in cooldown suppressed, multi-drop = single strike, 3 strikes
  disables + promotes, clean window resets counter, suppressed count
  carried to next emit.
- Global circuit breaker tests rewritten to use fresh sessions instead of
  hacking removed per-window fields.
- 50/50 watch_patterns + notify_on_complete tests pass.
- 60/60 including test_terminal_tool.py pass.

2026-04-25 06:41:58 -07:00

.github

docs(website): dedicated page per bundled + optional skill (#14929 )

2026-04-23 22:22:11 -07:00

.plans

Merge PR #724 : feat: --yolo flag to bypass all approval prompts

2026-03-10 20:56:30 -07:00

acp_adapter

fix(acp): include MCP toolsets in ACP sessions

2026-04-24 03:04:42 -07:00

acp_registry

feat: restore ACP server implementation from PR #949 (#1254 )

2026-03-14 00:09:05 -07:00

agent

fix(auxiliary): generalize unsupported-parameter detector and harden max_tokens retry (#15633 )

2026-04-25 05:50:34 -07:00

assets

Update banner image to new version

2026-02-25 11:53:44 -08:00

cron

fix(cron): silent skip when context_from job has no output yet

2026-04-25 04:49:28 -07:00

datagen-config-examples

feat: add WebResearchEnv RL environment for multi-step web research

2026-03-05 14:34:36 +00:00

docker

fix(docker): fix HERMES_UID permission handling and add docker-compose.yml

2026-04-24 04:52:11 -07:00

environments

refactor: remove remaining redundant local imports (comprehensive sweep)

2026-04-21 00:50:58 -07:00

gateway

feat(tools): make discord/discord_admin opt-in, Discord-only

2026-04-25 04:51:11 -07:00

hermes_cli

fix(update): poll is-active instead of one-shot sleep(3) after gateway restart (#15639 )

2026-04-25 06:11:22 -07:00

nix

fix(nix): use --rebuild in fix-lockfiles to bypass cached FOD store paths (#15444 )

2026-04-25 06:14:32 +05:30

optional-skills

feat(optional-skills): add page-agent skill under new web-development category (#13976 )

2026-04-22 04:54:26 -07:00

packaging/homebrew

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

plans

fix(gemini): tighten native routing and streaming replay

2026-04-19 12:40:08 -07:00

plugins

refactor(spotify): convert to built-in bundled plugin under plugins/spotify (#15174 )

2026-04-24 07:06:11 -07:00

scripts

chore(release): map ash@users.noreply.github.com to ash

2026-04-25 05:27:17 -07:00

skills

fix(skills): ship google-workspace deps as [google] extra; make setup.py 3.9-parseable

2026-04-24 16:45:27 -07:00

tests

fix(terminal): three-layer defense against watch_patterns notification spam (#15642 )

2026-04-25 06:41:58 -07:00

tinker-atropos @ 65f084ee80

Add tinker-atropos submodule and update RL training tools

2026-02-04 10:36:01 -08:00

tools

fix(terminal): three-layer defense against watch_patterns notification spam (#15642 )

2026-04-25 06:41:58 -07:00

tui_gateway

fix(tui): proactive mouse disable on ConPTY + /mouse toggle command

2026-04-24 20:32:12 -07:00

ui-tui

fix(tui): proactive mouse disable on ConPTY + /mouse toggle command

2026-04-24 20:32:12 -07:00

web

chore: address copilot comments

2026-04-24 12:51:04 -04:00

website

docs: consolidate dashboard themes and plugins into Extending the Dashboard (#15530 )

2026-04-24 23:26:51 -07:00

.dockerignore

fix(docker): exclude runtime data/ from build context

2026-04-22 21:15:28 -07:00

.env.example

feat: add Ollama Cloud as built-in provider

2026-04-16 02:22:09 -07:00

.envrc

nix: add tui lockfile update script

2026-04-10 00:46:37 -04:00

.gitattributes

feat: web UI dashboard for managing Hermes Agent (#8756 )

2026-04-12 22:26:28 -07:00

.gitignore

Update .gitignore

2026-04-22 20:02:46 -07:00

.gitmodules

refactor: remove mini-swe-agent dependency — inline Docker/Modal backends (#2804 )

2026-03-24 07:30:25 -07:00

.mailmap

chore: add MestreY0d4-Uninter to AUTHOR_MAP and .mailmap

2026-04-15 15:03:28 -07:00

AGENTS.md

feat(web): add /api/pty WebSocket bridge to embed TUI in dashboard

2026-04-24 10:51:49 -04:00

batch_runner.py

fix: eliminate duplicate checkpoint entries and JSON-unsafe coercion

2026-04-24 14:32:21 -07:00

cli-config.yaml.example

fix(delegate): resolve subagent approval prompts without deadlocking parent TUI (#15491 )

2026-04-24 22:37:22 -07:00

cli.py

fix the reset of model change by /model.

2026-04-25 04:49:07 -07:00

constraints-termux.txt

feat: add tested Termux install path and EOF-aware gh auth

2026-04-09 16:24:53 -07:00

CONTRIBUTING.md

Update CONTRIBUTING.md

2026-04-23 15:08:41 -07:00

docker-compose.yml

fix(docker): safer docker-compose defaults for UID and dashboard bind

2026-04-24 04:52:11 -07:00

Dockerfile

fix(docker): reap orphaned subprocesses via tini as PID 1 (#15116 )

2026-04-24 05:22:34 -07:00

flake.lock

fix nix build

2026-04-11 15:30:37 -04:00

flake.nix

nix: add tui lockfile update script

2026-04-10 00:46:37 -04:00

hermes

fix: use argparse entrypoint in top-level launcher (#3874 )

2026-03-29 21:54:36 -07:00

hermes_constants.py

Merge branch 'main' of github.com:NousResearch/hermes-agent into feat/ink-refactor

2026-04-13 21:17:41 -05:00

hermes_logging.py

fix: detect and strip non-ASCII characters from API keys (#6843 )

2026-04-14 20:20:31 -07:00

hermes_state.py

fix(resume): redirect --resume to the descendant that actually holds the messages

2026-04-24 03:04:42 -07:00

hermes_time.py

refactor: extract shared helpers to deduplicate repeated code patterns (#7917 )

2026-04-11 13:59:52 -07:00

hermes-already-has-routines.md

docs: automation templates gallery + comparison post (#9821 )

2026-04-14 12:30:50 -07:00

LICENSE

fix: restore missing MIT license file

2026-03-07 13:43:08 -08:00

MANIFEST.in

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

mcp_serve.py

fix: point optional-dep install hints at the venv's python (#11938 )

2026-04-17 21:16:33 -07:00

mini_swe_runner.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

model_tools.py

feat(discord): split discord_server into discord + discord_admin tools

2026-04-25 04:50:14 -07:00

package-lock.json

perf(browser): upgrade agent-browser 0.13 -> 0.26, wire daemon idle timeout

2026-04-22 16:33:36 -07:00

package.json

perf(browser): upgrade agent-browser 0.13 -> 0.26, wire daemon idle timeout

2026-04-22 16:33:36 -07:00

pyproject.toml

fix(skills): ship google-workspace deps as [google] extra; make setup.py 3.9-parseable

2026-04-24 16:45:27 -07:00

README.md

docs(readme): fix stale RL submodule instructions, skills table row, test runner (#14758 )

2026-04-23 15:12:04 -07:00

RELEASE_v0.2.0.md

chore: rebuild changelog with correct time window (Feb 25 12PM PST onwards)

2026-03-12 02:33:50 -07:00

RELEASE_v0.3.0.md

chore: release v0.3.0 (v2026.3.17)

2026-03-17 00:38:48 -07:00

RELEASE_v0.4.0.md

docs: revise v0.4.0 changelog — fix feature attribution, reorder sections

2026-03-23 22:42:22 -07:00

RELEASE_v0.5.0.md

chore: release v0.5.0 (v2026.3.28) (#3568 )

2026-03-28 13:11:39 -07:00

RELEASE_v0.6.0.md

chore: release v0.6.0 (2026.3.30) (#3985 )

2026-03-30 08:29:38 -07:00

RELEASE_v0.7.0.md

chore: release v0.7.0 (2026.4.3) (#4812 )

2026-04-03 11:14:55 -07:00

RELEASE_v0.8.0.md

docs: update v0.8.0 highlights — notify_on_complete, MiMo v2 Pro, reorder

2026-04-08 04:59:45 -07:00

RELEASE_v0.9.0.md

fix: add contributor audit script + fix missed contributors (#9264 )

2026-04-13 16:31:27 -07:00

RELEASE_v0.10.0.md

chore: release v0.10.0 (2026.4.16) (#11209 )

2026-04-16 12:53:06 -07:00

RELEASE_v0.11.0.md

chore: release v0.11.0 (2026.4.23) (#14791 )

2026-04-23 15:31:59 -07:00

rl_cli.py

refactor: consolidate get_hermes_home() and parse_reasoning_effort() (#3062 )

2026-03-25 15:54:28 -07:00

run_agent.py

fix(compression): reserve system+tools headroom when aux binds threshold (#15631 )

2026-04-25 05:41:56 -07:00

SECURITY.md

docs: add terminal bypass test to Out of Scope section

2026-04-15 14:34:09 -07:00

setup-hermes.sh

fix(termux): make setup-hermes use android path

2026-04-09 16:24:53 -07:00

toolset_distributions.py

chore: fix 154 f-strings, simplify getattr/URL patterns, remove dead code (#3119 )

2026-03-25 19:47:58 -07:00

toolsets.py

feat(feishu): wire feishu doc/drive tools into hermes-feishu composite

2026-04-25 04:50:14 -07:00

trajectory_compressor.py

fix: sweep remaining provider-URL substring checks across codebase

2026-04-20 22:14:29 -07:00

utils.py

fix(agent): normalize socks:// env proxies for httpx/anthropic

2026-04-21 05:52:46 -07:00

uv.lock

chore: address copilot comments

2026-04-24 12:51:04 -04:00

README.md

Hermes Agent ☤

The self-improving AI agent built by Nous Research. It's the only agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions. Run it on a $5 VPS, a GPU cluster, or serverless infrastructure that costs nearly nothing when idle. It's not tied to your laptop — talk to it from Telegram while it works on a cloud VM.

Use any model you want — Nous Portal, OpenRouter (200+ models), NVIDIA NIM (Nemotron), Xiaomi MiMo, z.ai/GLM, Kimi/Moonshot, MiniMax, Hugging Face, OpenAI, or your own endpoint. Switch with hermes model — no code changes, no lock-in.

A real terminal interface	Full TUI with multiline editing, slash-command autocomplete, conversation history, interrupt-and-redirect, and streaming tool output.
Lives where you do	Telegram, Discord, Slack, WhatsApp, Signal, and CLI — all from a single gateway process. Voice memo transcription, cross-platform conversation continuity.
A closed learning loop	Agent-curated memory with periodic nudges. Autonomous skill creation after complex tasks. Skills self-improve during use. FTS5 session search with LLM summarization for cross-session recall. Honcho dialectic user modeling. Compatible with the agentskills.io open standard.
Scheduled automations	Built-in cron scheduler with delivery to any platform. Daily reports, nightly backups, weekly audits — all in natural language, running unattended.
Delegates and parallelizes	Spawn isolated subagents for parallel workstreams. Write Python scripts that call tools via RPC, collapsing multi-step pipelines into zero-context-cost turns.
Runs anywhere, not just your laptop	Six terminal backends — local, Docker, SSH, Daytona, Singularity, and Modal. Daytona and Modal offer serverless persistence — your agent's environment hibernates when idle and wakes on demand, costing nearly nothing between sessions. Run it on a $5 VPS or a GPU cluster.
Research-ready	Batch trajectory generation, Atropos RL environments, trajectory compression for training the next generation of tool-calling models.

Quick Install

curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash

Works on Linux, macOS, WSL2, and Android via Termux. The installer handles the platform-specific setup for you.

Android / Termux: The tested manual path is documented in the Termux guide. On Termux, Hermes installs a curated .[termux] extra because the full .[all] extra currently pulls Android-incompatible voice dependencies.

Windows: Native Windows is not supported. Please install WSL2 and run the command above.

After installation:

source ~/.bashrc    # reload shell (or: source ~/.zshrc)
hermes              # start chatting!

Getting Started

hermes              # Interactive CLI — start a conversation
hermes model        # Choose your LLM provider and model
hermes tools        # Configure which tools are enabled
hermes config set   # Set individual config values
hermes gateway      # Start the messaging gateway (Telegram, Discord, etc.)
hermes setup        # Run the full setup wizard (configures everything at once)
hermes claw migrate # Migrate from OpenClaw (if coming from OpenClaw)
hermes update       # Update to the latest version
hermes doctor       # Diagnose any issues

📖 Full documentation →

CLI vs Messaging Quick Reference

Hermes has two entry points: start the terminal UI with hermes, or run the gateway and talk to it from Telegram, Discord, Slack, WhatsApp, Signal, or Email. Once you're in a conversation, many slash commands are shared across both interfaces.

| Action | CLI | Messaging platforms |
|---|---|---|
| Start chatting | `hermes` | Run `hermes gateway setup` + `hermes gateway start`, then send the bot a message |
| Start fresh conversation | `/new` or `/reset` | `/new` or `/reset` |
| Change model | `/model [provider:model]` | `/model [provider:model]` |
| Set a personality | `/personality [name]` | `/personality [name]` |
| Retry or undo the last turn | `/retry`, `/undo` | `/retry`, `/undo` |
| Compress context / check usage | `/compress`, `/usage`, `/insights [--days N]` | `/compress`, `/usage`, `/insights [days]` |
| Browse skills | `/skills` or `/<skill-name>` | `/<skill-name>` |
| Interrupt current work | `Ctrl+C` or send a new message | `/stop` or send a new message |
| Platform-specific status | `/platforms` | `/status`, `/sethome` |

For the full command lists, see the CLI guide and the Messaging Gateway guide.

Documentation

All documentation lives at hermes-agent.nousresearch.com/docs:

| Section | What's Covered |
|---|---|
| Quickstart | Install → setup → first conversation in 2 minutes |
| CLI Usage | Commands, keybindings, personalities, sessions |
| Configuration | Config file, providers, models, all options |
| Messaging Gateway | Telegram, Discord, Slack, WhatsApp, Signal, Home Assistant |
| Security | Command approval, DM pairing, container isolation |
| Tools & Toolsets | 40+ tools, toolset system, terminal backends |
| Skills System | Procedural memory, Skills Hub, creating skills |
| Memory | Persistent memory, user profiles, best practices |
| MCP Integration | Connect any MCP server for extended capabilities |
| Cron Scheduling | Scheduled tasks with platform delivery |
| Context Files | Project context that shapes every conversation |
| Architecture | Project structure, agent loop, key classes |
| Contributing | Development setup, PR process, code style |
| CLI Reference | All commands and flags |
| Environment Variables | Complete env var reference |

Migrating from OpenClaw

If you're coming from OpenClaw, Hermes can automatically import your settings, memories, skills, and API keys.

During first-time setup: The setup wizard (hermes setup) automatically detects ~/.openclaw and offers to migrate before configuration begins.

Anytime after install:

hermes claw migrate              # Interactive migration (full preset)
hermes claw migrate --dry-run    # Preview what would be migrated
hermes claw migrate --preset user-data   # Migrate without secrets
hermes claw migrate --overwrite  # Overwrite existing conflicts

What gets imported:

SOUL.md — persona file
Memories — MEMORY.md and USER.md entries
Skills — user-created skills → ~/.hermes/skills/openclaw-imports/
Command allowlist — approval patterns
Messaging settings — platform configs, allowed users, working directory
API keys — allowlisted secrets (Telegram, OpenRouter, OpenAI, Anthropic, ElevenLabs)
TTS assets — workspace audio files
Workspace instructions — AGENTS.md (with --workspace-target)

See hermes claw migrate --help for all options, or use the openclaw-migration skill for an interactive agent-guided migration with dry-run previews.

Contributing

We welcome contributions! See the Contributing Guide for development setup, code style, and PR process.

Quick start for contributors — clone and go with setup-hermes.sh:

git clone https://github.com/NousResearch/hermes-agent.git
cd hermes-agent
./setup-hermes.sh     # installs uv, creates venv, installs .[all], symlinks ~/.local/bin/hermes
./hermes              # auto-detects the venv, no need to `source` first

Manual path (equivalent to the above):

curl -LsSf https://astral.sh/uv/install.sh | sh
uv venv venv --python 3.11
source venv/bin/activate
uv pip install -e ".[all,dev]"
scripts/run_tests.sh

RL Training (optional): The RL/Atropos integration (environments/) ships via the atroposlib and tinker dependencies pulled in by .[all,dev] — no submodule setup required.

Community

💬 Discord
📚 Skills Hub
🐛 Issues
🔌 HermesClaw — Community WeChat bridge: Run Hermes Agent and OpenClaw on the same WeChat account.

License

MIT — see LICENSE.

Built by Nous Research.

Languages

Python 88.1%

TypeScript 8.9%

TeX 1.7%

Shell 0.5%

Nix 0.3%

Other 0.5%