hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-05 10:17:17 +08:00

Files

teknium1 885c5dc5e6 feat: context window usage warnings at 80% and 95%

Adds one-time warnings when context usage crosses critical thresholds:
- 80%: suggests /compress or /new if responses degrade
- 95%: warns of imminent errors/truncation, suggests /new

Each threshold fires at most once per session to avoid spam.
Warnings show actual token counts and percentage. Suppressed for
subagents (delegate_depth > 0) where the user can't act on them.
Always shown in CLI mode regardless of quiet_mode setting.

Inspired by OpenCode PR #152 (context window warning).

Bug fix found during live testing:
- Anthropic prompt caching reports input tokens across three fields
  (input_tokens, cache_read_input_tokens, cache_creation_input_tokens).
  The existing code only counted input_tokens, causing the context
  compressor to see ~0 tokens when caching was active. Fixed by summing
  all three fields. This also fixes context % display in the status bar
  for Anthropic users.

Changes:
- agent/context_compressor.py: add check_context_warning() with
  _warned_80/_warned_95 state tracking
- run_agent.py: call check_context_warning() after each API response,
  fix Anthropic cached token counting
- tests/test_context_warning.py: 8 tests covering thresholds,
  one-shot behavior, escalation, edge cases

Live tested with:
- Nous Portal (chat_completions mode) ✔
- Anthropic direct (anthropic_messages mode) ✔
- Interactive CLI session ✔

2026-03-16 06:30:04 -07:00

acp

feat(acp): support slash commands in ACP adapter (#1532 )

2026-03-16 05:19:36 -07:00

agent

feat(tools): centralize tool emoji metadata in registry + skin integration

2026-03-15 20:21:21 -07:00

cron

fix(cron): support per-job runtime overrides

2026-03-14 22:22:31 -07:00

fakes

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

gateway

fix(privacy): skip PII redaction on Discord/Slack (mentions need real IDs)

2026-03-16 05:58:34 -07:00

hermes_cli

feat(cli): add file path autocomplete in the input prompt (#1545 )

2026-03-16 06:07:45 -07:00

honcho_integration

refactor(honcho): remove local memory mode

2026-03-12 16:23:34 -04:00

integration

test(voice): add integration tests with real NaCl crypto and Opus codec

2026-03-15 05:20:17 -07:00

skills

fix: persist google oauth pkce for headless auth

2026-03-14 22:11:34 -07:00

tools

test: fake minisweagent for docker cwd mount regressions

2026-03-16 05:40:05 -07:00

__init__.py

A bit of restructuring for simplicity and organization

2025-10-01 23:29:25 +00:00

conftest.py

feat: add direct endpoint overrides for auxiliary and delegation

2026-03-14 21:11:37 -07:00

run_interrupt_test.py

fix(honcho): isolate session routing for multi-user gateway (#1500 )

2026-03-16 00:23:47 -07:00

test_413_compression.py

feat: improve context compaction handoff summaries (#1273 )

2026-03-14 02:33:31 -07:00

test_860_dedup.py

fix: eliminate 3x SQLite message duplication in gateway sessions (#860 )

2026-03-10 15:22:44 -07:00

test_agent_loop_tool_calling.py

fix: skip hanging tests + add global test timeout

2026-03-12 01:23:28 -07:00

test_agent_loop_vllm.py

test: restore vllm integration coverage and add dict-args regression

2026-03-15 08:02:29 -07:00

test_agent_loop.py

fix: salvage gateway dedup and executor cleanup from PR #993

2026-03-14 11:03:20 -07:00

test_anthropic_adapter.py

Merge origin/main into hermes/hermes-daa73839

2026-03-14 23:44:47 -07:00

test_anthropic_oauth_flow.py

fix: preflight Anthropic auth and prefer Claude store

2026-03-14 19:38:55 -07:00

test_anthropic_provider_persistence.py

fix: preflight Anthropic auth and prefer Claude store

2026-03-14 19:38:55 -07:00

test_api_key_providers.py

fix: exclude Coding Plan-only models from Moonshot model selection

2026-03-14 20:42:30 -07:00

test_atomic_json_write.py

test: cover atomic temp cleanup on interrupts

2026-03-14 22:31:51 -07:00

test_atomic_yaml_write.py

test: cover atomic temp cleanup on interrupts

2026-03-14 22:31:51 -07:00

test_auth_codex_provider.py

refactor(auth): transition Codex OAuth tokens to Hermes auth store

2026-03-01 19:59:24 -08:00

test_auth_nous_provider.py

Fix nous refresh token rotation failure in case where api key mint/retrieval fails

2026-03-02 17:18:15 +11:00

test_auxiliary_config_bridge.py

feat: add direct endpoint overrides for auxiliary and delegation

2026-03-14 21:11:37 -07:00

test_batch_runner_checkpoint.py

fix: sanitize chat payloads and provider precedence

2026-03-13 23:59:12 -07:00

test_cli_approval_ui.py

fix(cli): repair dangerous command approval UI

2026-03-14 11:57:44 -07:00

test_cli_init.py

fix: initialize CLI voice state for single-query mode

2026-03-14 06:31:32 -07:00

test_cli_interrupt_subagent.py

fix: use session_key instead of chat_id for adapter interrupt lookups

2026-03-12 08:35:45 -07:00

test_cli_loading_indicator.py

fix(cli): add loading indicators for slow slash commands

2026-03-10 17:31:00 -07:00

test_cli_mcp_config_watch.py

fix: auto-reload MCP tools when mcp_servers config changes without restart (#1474 )

2026-03-15 19:03:34 -07:00

test_cli_model_command.py

feat: auto-detect provider when switching models via /model (#1506 )

2026-03-16 04:34:45 -07:00

test_cli_new_session.py

fix(cli): make /new, /reset, and /clear start real fresh sessions

2026-03-13 21:53:54 -07:00

test_cli_plan_command.py

fix: save /plan output in workspace (#1381 )

2026-03-14 21:28:51 -07:00

test_cli_prefix_matching.py

fix(test): add missing session_id and _pending_input to _make_cli fixture

2026-03-14 10:33:58 -07:00

test_cli_preloaded_skills.py

feat: preload CLI skills on launch (#1359 )

2026-03-14 19:33:59 -07:00

test_cli_provider_resolution.py

fix(custom-endpoint): verify /models and suggest working /v1 base URL (#1480 )

2026-03-15 20:09:50 -07:00

test_cli_retry.py

test: lock retry replacement semantics

2026-03-14 21:19:22 -07:00

test_cli_secret_capture.py

feat: secure skill env setup on load (core #688 )

2026-03-13 03:14:04 -07:00

test_cli_skin_integration.py

fix(test): add missing voice state attrs to CLI stub in skin tests

2026-03-14 15:00:45 +03:00

test_cli_status_bar.py

feat: add persistent CLI status bar and usage details (#1522 )

2026-03-16 04:42:48 -07:00

test_codex_execution_paths.py

feat: simple fallback model for provider resilience

2026-03-08 20:22:33 -07:00

test_codex_models.py

fix: add codex forward-compat model listing

2026-03-13 21:34:01 -07:00

test_context_warning.py

feat: context window usage warnings at 80% and 95%

2026-03-16 06:30:04 -07:00

test_dict_tool_call_args.py

test: restore vllm integration coverage and add dict-args regression

2026-03-15 08:02:29 -07:00

test_display.py

fix: add upstream guard for non-dict function_args + tests for build_tool_preview

2026-03-09 21:01:40 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482 )

2026-03-15 21:59:53 -07:00

test_external_credential_detection.py

refactor(auth): transition Codex OAuth tokens to Hermes auth store

2026-03-01 19:59:24 -08:00

test_fallback_model.py

refactor: route main agent client + fallback through centralized router

2026-03-11 21:38:29 -07:00

test_file_permissions.py

security: enforce 0600/0700 file permissions on sensitive files (inspired by openclaw)

2026-03-09 02:19:32 -07:00

test_flush_memories_codex.py

fix: update all test mocks for call_llm migration

2026-03-11 21:06:54 -07:00

test_hermes_state.py

fix(cli): accept session ID prefixes for session actions

2026-03-15 04:01:56 -07:00

test_honcho_client_config.py

fix(honcho): auto-enable when API key is present

2026-03-01 03:12:37 -05:00

test_insights.py

feat: add persistent CLI status bar and usage details (#1522 )

2026-03-16 04:42:48 -07:00

test_interactive_interrupt.py

fix(honcho): isolate session routing for multi-user gateway (#1500 )

2026-03-16 00:23:47 -07:00

test_interrupt_propagation.py

fix: use session_key instead of chat_id for adapter interrupt lookups

2026-03-12 08:35:45 -07:00

test_managed_server_tool_support.py

test: fix stale CI assumptions in parser and quick-command coverage (#1236 )

2026-03-13 21:56:12 -07:00

test_minisweagent_path.py

fix: worktree-aware minisweagent path discovery + clean up requirements check (#1248 )

2026-03-13 23:39:51 -07:00

test_model_provider_persistence.py

fix: provider selection not persisting when switching via hermes model

2026-03-10 17:12:34 -07:00

test_model_tools.py

test: strengthen assertions across 3 more test files (batch 2)

2026-03-05 18:46:30 -08:00

test_openai_client_lifecycle.py

fix(honcho): isolate session routing for multi-user gateway (#1500 )

2026-03-16 00:23:47 -07:00

test_personality_none.py

feat(cli,gateway): add /personality none and custom personality support

2026-03-09 17:31:54 +03:00

test_provider_parity.py

fix: convert anthropic image content blocks

2026-03-14 23:41:20 -07:00

test_quick_commands.py

fix(gateway): support quick commands from GatewayConfig

2026-03-14 03:51:28 -07:00

test_real_interrupt_subagent.py

fix(test): patch correct method in subagent interrupt test

2026-03-12 15:05:42 -04:00

test_reasoning_command.py

fix: /reasoning command — add gateway support, fix display, persist settings (#1031 )

2026-03-12 05:38:19 -07:00

test_redirect_stdout_issue.py

fix: use session_key instead of chat_id for adapter interrupt lookups

2026-03-12 08:35:45 -07:00

test_resume_display.py

feat: display previous messages when resuming a session in CLI

2026-03-08 17:45:45 -07:00

test_run_agent_codex_responses.py

fix: add missing Responses API parameters for Codex provider

2026-03-11 04:28:31 -07:00

test_run_agent.py

fix(honcho): isolate session routing for multi-user gateway (#1500 )

2026-03-16 00:23:47 -07:00

test_runtime_provider_resolution.py

fix: restore config-saved custom endpoint resolution

2026-03-14 20:58:12 -07:00

test_setup_model_selection.py

fix(setup): remove dead code causing is_coding_plan NameError crash

2026-03-13 04:42:26 +03:00

test_timezone.py

test(cron): add cross-timezone naive timestamp regression

2026-03-14 10:33:32 -07:00

test_tool_call_parsers.py

fix: use non-greedy regex in DeepSeek V3 parser for multi-tool calls (#1300 )

2026-03-14 06:19:28 -07:00

test_toolset_distributions.py

test: add unit tests for 8 modules (batch 2)

2026-02-26 13:54:20 +03:00

test_toolsets.py

fix: add missing Platform.SIGNAL to toolset mappings, update test + config docs

2026-03-09 23:27:19 -07:00

test_trajectory_compressor.py

fix: harden trajectory compressor summary content handling

2026-03-14 11:03:25 -07:00

test_worktree_security.py

fix: harden salvaged worktree include checks

2026-03-14 21:51:27 -07:00

test_worktree.py

fix: harden salvaged worktree include checks

2026-03-14 21:51:27 -07:00