hermes-agent/tests/agent at c850a40e4e1226b381aa9d76e71efd97807e7d8d - hermes-agent - ling

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 15:01:34 +08:00

Teknium 9d9b424390 fix: Nous Portal rate limit guard — prevent retry amplification (#10568)

When Nous returns a 429, the retry amplification chain burns up to 9
API requests per conversation turn (3 SDK retries × 3 Hermes retries),
each counting against RPH and deepening the rate limit. With multiple
concurrent sessions (cron + gateway + auxiliary), this creates a spiral
where retries keep the limit tapped indefinitely.

New module: agent/nous_rate_guard.py
- Shared file-based rate limit state (~/.hermes/rate_limits/nous.json)
- Parses reset time from x-ratelimit-reset-requests-1h,
  x-ratelimit-reset-requests, retry-after headers, or error context
- Falls back to 5-minute default cooldown if no header data
- Atomic writes (tempfile + rename) for cross-process safety
- Auto-cleanup of expired state files
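
A minimal sketch of how such a guard could work, not the actual contents of agent/nous_rate_guard.py. The function names, the `reset_at` JSON field, and the assumption that header values are plain numeric seconds are all hypothetical; only the header names, the 5-minute default, and the tempfile + rename approach come from the description above.

```python
import json
import os
import tempfile
import time
from pathlib import Path
from typing import Mapping, Optional

DEFAULT_COOLDOWN_SECONDS = 300.0  # the 5-minute fallback named above

# Header names from the commit message, checked in the order listed there.
RESET_HEADERS = (
    "x-ratelimit-reset-requests-1h",
    "x-ratelimit-reset-requests",
    "retry-after",
)


def parse_reset_seconds(headers: Mapping[str, str]) -> float:
    """Seconds until reset. ASSUMPTION: values are plain numeric seconds."""
    lowered = {k.lower(): v for k, v in headers.items()}
    for name in RESET_HEADERS:
        try:
            seconds = float(lowered[name])
        except (KeyError, TypeError, ValueError):
            continue  # header missing or not numeric: try the next one
        if seconds > 0:
            return seconds
    return DEFAULT_COOLDOWN_SECONDS


def record_rate_limit(state_path: Path, headers: Mapping[str, str],
                      now: Optional[float] = None) -> float:
    """Atomically write the reset timestamp so other processes can read it."""
    now = time.time() if now is None else now
    reset_at = now + parse_reset_seconds(headers)
    state_path.parent.mkdir(parents=True, exist_ok=True)
    # Write to a temp file in the same directory, then rename over the target.
    fd, tmp_name = tempfile.mkstemp(dir=state_path.parent, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as fh:
            json.dump({"reset_at": reset_at}, fh)
        os.replace(tmp_name, state_path)
    except BaseException:
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise
    return reset_at


def seconds_remaining(state_path: Path,
                      now: Optional[float] = None) -> Optional[float]:
    """Remaining cooldown, or None. Expired or corrupt state is deleted."""
    now = time.time() if now is None else now
    try:
        reset_at = float(json.loads(state_path.read_text())["reset_at"])
    except FileNotFoundError:
        return None
    except (ValueError, KeyError, TypeError):
        state_path.unlink(missing_ok=True)
        return None
    if reset_at <= now:
        state_path.unlink(missing_ok=True)  # auto-cleanup of expired state
        return None
    return reset_at - now
```

Writing to a temporary file in the same directory and renaming it over the target means a concurrent reader sees either the old file or the new one, never a half-written one.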

run_agent.py changes:
- Top-of-retry-loop guard: when another session already recorded Nous
  as rate-limited, skip the API call entirely. Try fallback provider
  first, then return a clear message with the reset time.
- On 429 from Nous: record rate limit state and skip further retries
  (sets retry_count = max_retries to trigger fallback path)
- On success from Nous: clear the rate limit state so other sessions
  know they can resume
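
The three run_agent.py behaviours can be illustrated with a simplified retry loop. This is a sketch under stated assumptions, not the code in run_agent.py: `RateLimited`, `InMemoryGuard`, and `run_turn` are hypothetical names, and an in-memory object stands in for the shared state file.

```python
from typing import Callable, Optional


class RateLimited(Exception):
    """Hypothetical stand-in for an HTTP 429 raised by the provider client."""


class InMemoryGuard:
    """Stand-in for the shared file-based state: a reset timestamp or None."""

    def __init__(self) -> None:
        self.reset_at: Optional[float] = None

    def record(self, reset_at: float) -> None:
        self.reset_at = reset_at

    def clear(self) -> None:
        self.reset_at = None


def run_turn(call_nous: Callable[[], str],
             call_fallback: Optional[Callable[[], str]],
             guard: InMemoryGuard,
             max_retries: int = 3,
             now: float = 0.0,
             cooldown: float = 300.0) -> str:
    retry_count = 0
    while retry_count < max_retries:
        # Top-of-loop guard: skip the API call if a limit is already recorded.
        if guard.reset_at is not None and guard.reset_at > now:
            break
        try:
            result = call_nous()
        except RateLimited:
            guard.record(now + cooldown)
            retry_count = max_retries  # skip further retries, go to fallback
            continue
        except ConnectionError:
            retry_count += 1  # ordinary transient errors still retry
            continue
        guard.clear()  # success: other sessions may resume
        return result
    if call_fallback is not None:
        return call_fallback()
    if guard.reset_at is not None:
        return f"Nous is rate-limited; retry after t={guard.reset_at:.0f}"
    return "Nous request failed after retries"
```

In this sketch a 429 ends the loop after a single call, while transient connection errors keep the normal retry budget.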

auxiliary_client.py changes:
- _try_nous() checks rate guard before attempting Nous in the auxiliary
  fallback chain. When rate-limited, returns (None, None) so the chain
  skips to the next provider instead of piling more requests onto Nous.
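
The (None, None) convention can be sketched as follows. The chain helper `resolve_first`, the `make_try_nous` factory, and the placeholder client and model values are hypothetical; only the idea that `_try_nous()` returns (None, None) when rate-limited comes from the description above.

```python
from typing import Callable, List, Optional, Tuple

# (client, model); both None when the provider should be skipped.
Resolved = Tuple[Optional[object], Optional[str]]


def make_try_nous(is_rate_limited: Callable[[], bool]) -> Callable[[], Resolved]:
    """Build a _try_nous-style resolver around a rate guard check."""
    def _try_nous() -> Resolved:
        if is_rate_limited():
            return None, None  # let the chain move on without calling Nous
        return "nous-client", "nous-model"  # placeholder values
    return _try_nous


def resolve_first(providers: List[Callable[[], Resolved]]) -> Resolved:
    """Return the first provider that yields a client, else (None, None)."""
    for attempt in providers:
        client, model = attempt()
        if client is not None:
            return client, model
    return None, None
```

For example, `resolve_first([make_try_nous(lambda: True), lambda: ("other-client", "other-model")])` skips Nous and resolves to the second provider.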

This eliminates three sources of amplification:
1. Hermes-level retries (saves 6 of 9 calls per turn)
2. Cross-session retries (cron + gateway all skip Nous)
3. Auxiliary fallback to Nous (compression/session_search skip too)
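
The "6 of 9" figure in item 1 follows from the retry counts given earlier, assuming every attempt in a turn receives a 429 and that SDK-level retries are left unchanged. The helper name is hypothetical.

```python
def requests_per_turn(sdk_attempts: int, hermes_attempts: int) -> int:
    """Worst-case API requests for one turn when every attempt gets a 429."""
    return sdk_attempts * hermes_attempts


before = requests_per_turn(sdk_attempts=3, hermes_attempts=3)  # 3 x 3
after = requests_per_turn(sdk_attempts=3, hermes_attempts=1)   # Hermes stops after one attempt
saved = before - after
```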

Includes 24 tests covering the rate guard module, header parsing,
state lifecycle, and auxiliary client integration.

2026-04-15 16:31:48 -07:00

__init__.py

test: add unit tests for 8 modules (batch 2)

2026-02-26 13:54:20 +03:00

test_anthropic_adapter.py

fix: MiniMax/Alibaba incorrectly detected as Anthropic OAuth, causing mcp_ tool prefix (#7509)

2026-04-11 00:43:01 -07:00

test_auxiliary_client.py

fix: resolve CI test failures — add missing functions, fix stale tests (#9483)

2026-04-14 01:43:45 -07:00

test_auxiliary_config_bridge.py

fix: remove legacy compression.summary_* config and env var fallbacks (#8992)

2026-04-13 04:59:26 -07:00

test_auxiliary_named_custom_providers.py

fix(agent): propagate api_mode to vision provider resolution

2026-04-13 05:02:54 -07:00

test_bedrock_adapter.py

feat: native AWS Bedrock provider via Converse API

2026-04-15 16:17:17 -07:00

test_bedrock_integration.py

feat: native AWS Bedrock provider via Converse API

2026-04-15 16:17:17 -07:00

test_compress_focus.py

fix: resolve CI test failures — add missing functions, fix stale tests (#9483)

2026-04-14 01:43:45 -07:00

test_context_compressor.py

fix(agent): route compression aux through live session runtime

2026-04-12 01:34:52 -07:00

test_context_engine.py

feat: wire context engine plugin slot into agent and plugin system

2026-04-10 19:15:50 -07:00

test_context_references.py

fix(agent): preserve quoted @file references with spaces

2026-04-10 13:05:01 -07:00

test_credential_pool_routing.py

refactor(tests): re-architect tests + fix CI failures (#5946)

2026-04-07 17:19:07 -07:00

test_credential_pool.py

fix(copilot): preserve base URL and gpt-5-mini routing

2026-04-15 15:04:14 -07:00

test_crossloop_client_cache.py

refactor(tests): re-architect tests + fix CI failures (#5946)

2026-04-07 17:19:07 -07:00

test_display_emoji.py

feat(tools): centralize tool emoji metadata in registry + skin integration

2026-03-15 20:21:21 -07:00

test_display.py

refactor(tests): re-architect tests + fix CI failures (#5946)

2026-04-07 17:19:07 -07:00

test_error_classifier.py

fix: add vLLM/local server error patterns + MCP initial connection retry (#9281)

2026-04-13 18:46:14 -07:00

test_external_skills.py

feat(skills): support external skill directories via config (#3678)

2026-03-29 00:33:30 -07:00

test_insights.py

fix: remove 115 verified dead code symbols across 46 production files

2026-04-10 03:44:43 -07:00

test_local_stream_timeout.py

fix: is_local_endpoint misses Docker/Podman DNS names (#7950)

2026-04-11 14:46:18 -07:00

test_memory_provider.py

fix(memory): discover user-installed memory providers from $HERMES_HOME/plugins/ (#10529)

2026-04-15 14:25:40 -07:00

test_memory_user_id.py

fix: resolve CI test failures — add missing functions, fix stale tests (#9483)

2026-04-14 01:43:45 -07:00

test_minimax_auxiliary_url.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983)

2026-04-07 22:23:28 -07:00

test_minimax_provider.py

fix: preserve dots in model names for OpenCode Zen and ZAI providers (#8794)

2026-04-12 21:22:59 -07:00

test_model_metadata_local_ctx.py

fix(agent): prefer Ollama Modelfile num_ctx over GGUF training max

2026-04-13 04:24:07 -07:00

test_model_metadata.py

fix: use ceiling division for token estimation, deduplicate inline formula

2026-04-11 16:33:40 -07:00

test_models_dev.py

fix: three provider-related bugs (#8161, #8181, #8147) (#8243)

2026-04-12 01:44:18 -07:00

test_nous_rate_guard.py

fix: Nous Portal rate limit guard — prevent retry amplification (#10568)

2026-04-15 16:31:48 -07:00

test_prompt_builder.py

feat: add WSL environment hint to system prompt (#8285)

2026-04-12 02:26:28 -07:00

test_prompt_caching.py

fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter

2026-03-21 16:54:43 -07:00

test_proxy_and_url_validation.py

fix(runtime): surface malformed proxy env and base URL before client init

2026-04-15 16:10:53 -07:00

test_rate_limit_tracker.py

feat: capture provider rate limit headers and show in /usage (#6541)

2026-04-09 03:43:14 -07:00

test_redact.py

fix(security): add JWT token and Discord mention redaction (#10547)

2026-04-15 16:08:52 -07:00

test_skill_commands.py

fix: sanitize Telegram command names to strip invalid characters

2026-04-06 11:27:28 -07:00

test_smart_model_routing.py

fix: hermes update causes dual gateways on macOS (launchd) (#1567)

2026-03-16 12:36:29 -07:00

test_subagent_progress.py

feat(api): structured run events via /v1/runs SSE endpoint

2026-04-05 12:05:13 -07:00

test_subdirectory_hints.py

fix(agent): catch PermissionError in subdirectory hint discovery

2026-04-09 03:10:30 -07:00

test_title_generator.py

feat: auto-generate session titles after first exchange

2026-03-17 04:14:40 -07:00

test_usage_pricing.py

feat: use endpoint metadata for custom model context and pricing (#1906)

2026-03-18 03:04:07 -07:00