hermes-agent/tests at 0517ac3e9325a0548c3f5878185a926921be9311 - hermes-agent - ling

ling/hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Files

History

trevthefoolish 0517ac3e93 fix(agent): complete Claude Opus 4.7 API migration

Claude Opus 4.7 introduced several breaking API changes that the current
codebase partially handled but not completely. This patch finishes the
migration per the official migration guide at
https://platform.claude.com/docs/en/about-claude/models/migration-guide

Fixes NousResearch/hermes-agent#11137

Breaking-change coverage:

1. Adaptive thinking + output_config.effort — 4.7 is now recognized by
   _supports_adaptive_thinking() (extends previous 4.6-only gate).

2. Sampling parameter stripping — 4.7 returns 400 for any non-default
   temperature / top_p / top_k. build_anthropic_kwargs drops them as a
   safety net; the OpenAI-protocol auxiliary path (_build_call_kwargs)
   and AnthropicCompletionsAdapter.create() both early-exit before
   setting temperature for 4.7+ models. This keeps flush_memories and
   structured-JSON aux paths that hardcode temperature from 400ing
   when the aux model is flipped to 4.7.

3. thinking.display = "summarized" — 4.7 defaults display to "omitted",
   which silently hides reasoning text from Hermes's CLI activity feed
   during long tool runs. Restoring "summarized" preserves 4.6 UX.

4. Effort level mapping — xhigh now maps to xhigh (was xhigh→max, which
   silently over-efforted every coding/agentic request). max is now a
   distinct ceiling per Anthropic's 5-level effort model.

5. New stop_reason values — refusal and model_context_window_exceeded
   were silently collapsed to "stop" (end_turn) by the adapter's
   stop_reason_map. Now mapped to "content_filter" and "length"
   respectively, matching upstream finish-reason handling already in
   bedrock_adapter.

6. Model catalogs — claude-opus-4-7 added to the Anthropic provider
   list, anthropic/claude-opus-4.7 added at top of OpenRouter fallback
   catalog (recommended), claude-opus-4-7 added to model_metadata
   DEFAULT_CONTEXT_LENGTHS (1M, matching 4.6 per migration guide).

7. Prefill docstrings — run_agent.AIAgent and BatchRunner now document
   that Anthropic Sonnet/Opus 4.6+ reject a trailing assistant-role
   prefill (400).

8. Tests — 4 new tests in test_anthropic_adapter covering display
   default, xhigh preservation, max on 4.7, refusal / context-overflow
   stop_reason mapping, plus the sampling-param predicate. test_model_metadata
   accepts 4.7 at 1M context.

Tested on macOS 15.5 (darwin). 119 tests pass in
tests/agent/test_anthropic_adapter.py, 1320 pass in tests/agent/.

2026-04-16 10:48:20 -07:00

..

fix(acp): declare session load and resume capabilities in initialize response (#6985 )

2026-04-10 03:45:36 -07:00

fix(agent): complete Claude Opus 4.7 API migration

2026-04-16 10:48:20 -07:00

fix(tests): resolve 12 CI failures + 10 errors across 6 root causes (#11040 )

2026-04-16 06:49:36 -07:00

fix(cron): treat empty agent response as error in last_status (fixes #8585 )

2026-04-16 06:49:57 -07:00

refactor: extract shared helpers to deduplicate repeated code patterns (#7917 )

2026-04-11 13:59:52 -07:00

environments/benchmarks

fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944 )

2026-04-07 17:28:37 -07:00

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

fix(gateway): guard pending_event.channel_prompt against None in recursive _run_agent

2026-04-16 07:45:27 -07:00

fix: wire up Ollama Cloud dynamic model discovery in /model TUI picker

2026-04-16 07:17:45 -07:00

fix(honcho): strip whitespace from conclusion and delete_id inputs

2026-04-16 09:50:10 -07:00

refactor: remove dead code — 1,784 lines across 77 files (#9180 )

2026-04-13 16:32:04 -07:00

feat: sort tool search results by score and add corresponding unit test

2026-04-14 10:49:35 -07:00

fix(run_agent): prevent _create_openai_client from mutating caller kwargs

2026-04-16 07:45:22 -07:00

fix(google-workspace): normalize authorized user token writes

2026-04-16 04:22:16 -07:00

fix: follow-up for salvaged PR #10854

2026-04-16 06:42:45 -07:00

__init__.py

A bit of restructuring for simplicity and organization

2025-10-01 23:29:25 +00:00

conftest.py

fix(tests): fix several failing/flaky tests on main (#6777 )

2026-04-09 13:17:06 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672 )

2026-03-17 02:53:33 -07:00

test_batch_runner_checkpoint.py

fix: sanitize chat payloads and provider precedence

2026-03-13 23:59:12 -07:00

test_cli_file_drop.py

fix(gateway): reject file paths in get_command() + file-drop tests (#7356 )

2026-04-10 13:06:02 -07:00

test_cli_skin_integration.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_ctx_halving_fix.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_empty_model_fallback.py

fix: fall back to provider's default model when model config is empty (#8303 )

2026-04-12 03:53:30 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482 )

2026-03-15 21:59:53 -07:00

test_hermes_constants.py

fix(gateway): harden Docker/container gateway pathway

2026-04-12 16:36:11 -07:00

test_hermes_logging.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_hermes_state.py

fix(state): orphan children instead of cascade-deleting in prune/delete (#6513 )

2026-04-09 02:41:56 -07:00

test_honcho_client_config.py

feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623 )

2026-04-02 15:33:51 -07:00

test_ipv4_preference.py

feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196 )

2026-04-11 23:12:11 -07:00

test_mcp_serve.py

feat: add MCP server mode — hermes mcp serve (#3795 )

2026-03-29 15:47:19 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_model_tools_async_bridge.py

fix: use per-thread persistent event loops in worker threads

2026-03-20 15:41:06 -04:00

test_model_tools.py

feat(plugins): let pre_tool_call hooks block tool execution

2026-04-13 22:01:49 -07:00

test_ollama_num_ctx.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )

2026-04-07 22:23:28 -07:00

test_packaging_metadata.py

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

test_plugin_skills.py

feat(plugins): namespaced skill registration for plugin skill bundles

2026-04-14 10:42:58 -07:00

test_project_metadata.py

refactor(matrix): swap matrix-nio for mautrix-python dependency

2026-04-10 21:15:59 -07:00

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426 ) (#7357 )

2026-04-10 13:37:45 -07:00

test_timezone.py

fix: remove 115 verified dead code symbols across 46 production files

2026-04-10 03:44:43 -07:00

test_toolset_distributions.py

test: add unit tests for 8 modules (batch 2)

2026-02-26 13:54:20 +03:00

test_toolsets.py

fix(mcp): make server aliases explicit

2026-04-14 17:19:20 -07:00

test_trajectory_compressor_async.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_trajectory_compressor.py

fix: load credentials from HERMES_HOME .env in trajectory_compressor

2026-04-14 10:24:19 -07:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00