hermes-agent/tests at d848ea7109d62a2fc4ba6da36fc4f0366b5ded94 - hermes-agent - ling

ling/hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-03 01:07:31 +08:00

Files

History

Teknium d848ea7109 fix: circuit breaker stops CPU-burning restart loops on persistent errors

When a gateway session hits a non-retryable error (e.g. invalid model
ID → HTTP 400), the agent fails and returns. But if the session keeps
receiving messages (or something periodically recreates agents), each
attempt spawns a new AIAgent — reinitializing MCP server connections,
burning CPU — only to hit the same 400 error again. On a 4-core server,
this pegs an entire core per stuck session and accumulates 300+ minutes
of CPU time over hours.

Fix: add a per-session consecutive failure counter in the gateway runner.

- Track consecutive non-retryable failures per session key
- After 3 consecutive failures (_MAX_CONSECUTIVE_FAILURES), block
  further agent creation for that session and notify the user:
  '⚠️ This session has failed N times in a row with a non-retryable
  error. Use /reset to start a new session.'
- Evict the cached agent when the circuit breaker engages to prevent
  stale state from accumulating
- Reset the counter on successful agent runs
- Clear the counter on /reset and /new so users can recover
- Uses getattr() pattern so bare GatewayRunner instances (common in
  tests using object.__new__) don't crash

Tests:
- 8 new tests in test_circuit_breaker.py covering counter behavior,
  threshold, reset, session isolation, and bare-runner safety

Addresses #7130.

2026-04-10 21:07:10 -07:00

..

fix(acp): declare session load and resume capabilities in initialize response (#6985 )

2026-04-10 03:45:36 -07:00

feat: wire context engine plugin slot into agent and plugin system

2026-04-10 19:15:50 -07:00

fix(cli): make /status show gateway-style session status

2026-04-10 05:19:26 -07:00

feat(cron): support Discord thread_id in deliver targets

2026-04-10 03:20:05 -07:00

test(e2e): add Slack to parametrized e2e platform tests

2026-04-10 16:51:44 -07:00

environments/benchmarks

fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944 )

2026-04-07 17:28:37 -07:00

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

fix: circuit breaker stops CPU-burning restart loops on persistent errors

2026-04-10 21:07:10 -07:00

fix: no auto-activation + unified hermes plugins UI with provider categories

2026-04-10 19:15:50 -07:00

fix(honcho): migration guard for observation mode default change

2026-04-05 12:34:11 -07:00

refactor: remove mini-swe-agent dependency — inline Docker/Modal backends (#2804 )

2026-03-24 07:30:25 -07:00

feat(hindsight): feature parity, setup wizard, and config improvements

2026-04-08 23:54:15 -07:00

fix: activate fallback provider on repeated empty responses + user-visible status (#7505 )

2026-04-10 19:15:41 -07:00

fix: update tests for gws migration

2026-04-09 14:28:35 -07:00

test: add zombie process cleanup tests

2026-04-10 16:51:44 -07:00

__init__.py

A bit of restructuring for simplicity and organization

2025-10-01 23:29:25 +00:00

conftest.py

fix(tests): fix several failing/flaky tests on main (#6777 )

2026-04-09 13:17:06 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672 )

2026-03-17 02:53:33 -07:00

test_batch_runner_checkpoint.py

fix: sanitize chat payloads and provider precedence

2026-03-13 23:59:12 -07:00

test_cli_file_drop.py

fix(gateway): reject file paths in get_command() + file-drop tests (#7356 )

2026-04-10 13:06:02 -07:00

test_cli_skin_integration.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_ctx_halving_fix.py

fix(compaction): don't halve context_length on output-cap-too-large errors

2026-04-09 11:27:41 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482 )

2026-03-15 21:59:53 -07:00

test_hermes_constants.py

fix: profile paths broken in Docker — profiles go to /root/.hermes instead of mounted volume (#7170 )

2026-04-10 05:53:10 -07:00

test_hermes_logging.py

feat(nix): shared-state permission model for interactive CLI users (#6796 )

2026-04-10 03:48:42 +05:30

test_hermes_state.py

fix(state): orphan children instead of cascade-deleting in prune/delete (#6513 )

2026-04-09 02:41:56 -07:00

test_honcho_client_config.py

feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623 )

2026-04-02 15:33:51 -07:00

test_mcp_serve.py

feat: add MCP server mode — hermes mcp serve (#3795 )

2026-03-29 15:47:19 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_model_tools_async_bridge.py

fix: use per-thread persistent event loops in worker threads

2026-03-20 15:41:06 -04:00

test_model_tools.py

Add request-scoped plugin lifecycle hooks

2026-04-05 23:31:29 -07:00

test_ollama_num_ctx.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )

2026-04-07 22:23:28 -07:00

test_packaging_metadata.py

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

test_project_metadata.py

fix(nix): gate matrix extra to Linux in [all] profile (#7461 )

2026-04-11 05:59:56 +05:30

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426 ) (#7357 )

2026-04-10 13:37:45 -07:00

test_timezone.py

fix: remove 115 verified dead code symbols across 46 production files

2026-04-10 03:44:43 -07:00

test_toolset_distributions.py

test: add unit tests for 8 modules (batch 2)

2026-02-26 13:54:20 +03:00

test_toolsets.py

fix: add missing Platform.SIGNAL to toolset mappings, update test + config docs

2026-03-09 23:27:19 -07:00

test_trajectory_compressor_async.py

fix: create AsyncOpenAI lazily in trajectory_compressor to avoid closed event loop (#4013 )

2026-03-30 13:16:16 -07:00

test_trajectory_compressor.py

fix: URL-based auth for third-party Anthropic endpoints + CI test fixes (#4148 )

2026-03-30 20:36:56 -07:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00