hermes-agent/tests at 2b728e12748e3a30273acdbef36ecad15a04f2b9 - hermes-agent - ling

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-04 17:57:28 +08:00

Teknium 2b728e1274 fix(agent): drop thinking-only assistant turns before provider call (#16959)

Adds a pre-call sanitizer that detects assistant messages containing only
reasoning (reasoning / reasoning_content, no visible content, no
tool_calls) and drops them from the API copy. Adjacent user messages
left behind are merged so role alternation is preserved for the
provider.
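
The detect, drop, and merge steps described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the function names (is_thinking_only, drop_thinking_only, merge_user_content), the "\n\n" join between merged strings, and the rule that only non-blank text parts count as visible content are all assumptions made for the example.

```python
from typing import Any

Message = dict[str, Any]

# Field names taken from the commit message; the helper names are hypothetical.
REASONING_KEYS = ("reasoning", "reasoning_content", "reasoning_details")


def _has_visible_content(content: Any) -> bool:
    """Assumption: only non-blank text counts as visible content."""
    if isinstance(content, str):
        return bool(content.strip())
    if isinstance(content, list):
        for part in content:
            if isinstance(part, str) and part.strip():
                return True
            if isinstance(part, dict) and part.get("type") == "text":
                if str(part.get("text", "")).strip():
                    return True
    return False


def is_thinking_only(msg: Message) -> bool:
    """Assistant turn that carries reasoning but no visible text or tool calls."""
    if msg.get("role") != "assistant" or msg.get("tool_calls"):
        return False
    if not any(msg.get(key) for key in REASONING_KEYS):
        return False
    return not _has_visible_content(msg.get("content"))


def _as_parts(content: Any) -> list:
    """Normalise string or list content to a new list of content parts."""
    if isinstance(content, list):
        return list(content)
    if content is None:
        return []
    return [{"type": "text", "text": str(content)}]


def merge_user_content(first: Any, second: Any) -> Any:
    """Join two user contents; string+string stays a string (assumed separator)."""
    if isinstance(first, str) and isinstance(second, str):
        return first + "\n\n" + second
    return _as_parts(first) + _as_parts(second)


def drop_thinking_only(messages: list[Message]) -> list[Message]:
    """Return a cleaned copy; the input list and its dicts are left untouched."""
    cleaned: list[Message] = []
    dropped_since_last_kept = False
    for msg in messages:
        if is_thinking_only(msg):
            dropped_since_last_kept = True
            continue
        previous = cleaned[-1] if cleaned else None
        if (
            dropped_since_last_kept
            and previous is not None
            and previous.get("role") == "user"
            and msg.get("role") == "user"
        ):
            # A drop left two user turns adjacent: merge them into one new dict.
            merged = dict(previous)
            merged["content"] = merge_user_content(
                previous.get("content"), msg.get("content")
            )
            cleaned[-1] = merged
        else:
            cleaned.append(dict(msg))  # shallow copy so callers never share dicts
        dropped_since_last_kept = False
    return cleaned
```

In this sketch, user turns are merged only when a dropped turn sat between them, so user messages that were already adjacent in the input pass through unchanged.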

Mirrors Claude Code's approach in src/utils/messages.ts
(filterOrphanedThinkingOnlyMessages + mergeAdjacentUserMessages). We
drop the whole turn rather than fabricate stub text (the '.' /
'(continued)' pattern from contributor PRs #11098, #13010, #16842 that
were rejected because they put words in the model's mouth).

The stored conversation history (self.messages) is never mutated — only
the per-call api_messages copy. Users still see the reasoning block in
the CLI/gateway transcript; only the wire copy is cleaned. Session
persistence keeps the full trace.
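
The split between stored history and wire copy might look roughly like the sketch below. AgentSketch, build_api_messages, and the simplified _is_thinking_only predicate are hypothetical names for illustration; only self.messages and the reasoning field names come from the text above.

```python
import copy
from typing import Any

Message = dict[str, Any]


def _is_thinking_only(msg: Message) -> bool:
    """Simplified predicate for this sketch (string or missing content only)."""
    if msg.get("role") != "assistant" or msg.get("tool_calls"):
        return False
    content = msg.get("content")
    blank = content is None or (isinstance(content, str) and not content.strip())
    has_reasoning = bool(msg.get("reasoning") or msg.get("reasoning_content"))
    return blank and has_reasoning


class AgentSketch:
    """Hypothetical stand-in for the agent; only `messages` mirrors the source."""

    def __init__(self, messages: list[Message]) -> None:
        self.messages = messages  # full trace: shown to the user and persisted

    def build_api_messages(self) -> list[Message]:
        # Work on a deep copy so edits to the wire payload cannot reach
        # the stored history.
        api_messages = copy.deepcopy(self.messages)
        return [m for m in api_messages if not _is_thinking_only(m)]
```

Under these assumptions, the transcript and session store keep reading from self.messages, while only the list returned by build_api_messages is sent to the provider.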

Two call sites covered:
- Main agent loop, after _sanitize_api_messages (catches every turn).
- Iteration-limit-summary fallback path.

Tests: tests/run_agent/test_thinking_only_sanitizer.py — 25 cases
covering detection (string/list content, whitespace-only, tool_calls,
reasoning_details list form), drop behavior, adjacent-user merge
(string+string, list+list, mixed), non-mutation of input dicts, and
system-message handling.

E2E live-tested against 5 providers with a poisoned history (empty
assistant message + reasoning_content): OpenRouter→Anthropic/OpenAI/
DeepSeek-R1/Qwen, native Gemini. All 5 accepted the cleaned request.
Happy-path regression (5/5) confirms the sanitizer is a no-op when no
thinking-only turn exists.

Related: #16823 (wontfix — stub-text approach rejected).

Co-authored-by: teknium1 <teknium@users.noreply.github.com>

2026-04-28 03:50:51 -07:00

fix(acp): wire HERMES_SESSION_KEY per session so sudo cache scope activates

2026-04-28 01:34:16 -07:00

revert: computer-use cua-driver (PR #16919) (#16927)

2026-04-28 01:57:21 -07:00

feat(fast): broaden /fast whitelist to all OpenAI + Anthropic models (#16883)

2026-04-28 00:44:43 -07:00

fix(cron): preserve Telegram topic targets

2026-04-28 00:44:12 -07:00

fix(gateway): coerce plaintext "restart gateway" DMs to /restart

2026-04-28 01:40:28 -07:00

environments/benchmarks

…

…

fix(session): make SQLite transcript rewrites transactional

2026-04-28 01:49:46 -07:00

feat(providers): add tencent-tokenhub provider support

2026-04-28 03:45:52 -07:00

fix(resume): redirect --resume to the descendant that actually holds the messages

2026-04-24 03:04:42 -07:00

feat(honcho): explain why when honcho_profile returns an empty card

2026-04-27 12:37:33 -07:00

fix(discord): strip RTP padding before DAVE/Opus decode (#11267)

2026-04-16 16:50:15 -07:00

feat(plugins): add bundled observability/langfuse plugin

2026-04-28 01:40:59 -07:00

fix(agent): drop thinking-only assistant turns before provider call (#16959)

2026-04-28 03:50:51 -07:00

feat(claw-migrate): harden OpenClaw import with plan-first apply, redaction, and pre-migration backup (#16911)

2026-04-28 01:50:23 -07:00

✨ feat(web): expose search result limit

2026-04-28 02:09:30 -07:00

Revert "feat(onboarding): port first-touch hints to the TUI (#16054)" (#16062)

2026-04-26 06:31:37 -07:00

fix(website): auto-wrap ASCII-art code blocks in generated skill pages (#16497)

2026-04-27 03:38:39 -07:00

__init__.py

…

conftest.py

feat(providers): add GMI Cloud as a first-class API-key provider (#11955)

2026-04-27 11:17:59 -07:00

run_interrupt_test.py

…

test_account_usage.py

feat(account-usage): add per-provider account limits module

2026-04-21 01:56:35 -07:00

test_base_url_hostname.py

security(runtime_provider): close OLLAMA_API_KEY substring-leak sweep miss (#13522)

2026-04-21 06:06:16 -07:00

test_batch_runner_checkpoint.py

test: regression coverage for checkpoint dedup and inf/nan coercion

2026-04-24 14:32:21 -07:00

test_cli_file_drop.py

fix(tui): improve macOS paste and shortcut parity

2026-04-21 08:00:00 -07:00

test_cli_skin_integration.py

fix: align status bar skin tests with upstream main

2026-04-22 13:20:02 -07:00

test_ctx_halving_fix.py

…

test_empty_model_fallback.py

…

test_evidence_store.py

…

test_hermes_constants.py

…

test_hermes_logging.py

fix(logging): attach gateway log after cli init

2026-04-26 19:01:26 -07:00

test_hermes_state.py

fix(state): repair FTS5 delete trigger and add v11 migration for tool-call indexing

2026-04-28 01:33:00 -07:00

test_honcho_client_config.py

…

test_ipv4_preference.py

…

test_mcp_serve.py

…

test_mini_swe_runner.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_minimax_model_validation.py

fix(models): validate MiniMax models against static catalog (#12611, #12460, #12399, #12547)

2026-04-19 22:44:47 -07:00

test_minisweagent_path.py

…

test_model_picker_scroll.py

…

test_model_tools_async_bridge.py

fix(core): ensure non-blocking executor shutdown on async timeout

2026-04-22 14:42:32 -07:00

test_model_tools.py

feat(hooks): add duration_ms to post_tool_call + transform_tool_result (#15429 )

2026-04-25 22:13:12 -07:00

test_ollama_num_ctx.py

…

test_packaging_metadata.py

…

test_plugin_skills.py

fix(tests): attach caplog to specific logger in 3 order-dependent tests (#11453)

2026-04-17 00:20:40 -07:00

test_project_metadata.py

build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627)

2026-04-17 13:31:53 -07:00

test_retry_utils.py

…

test_sql_injection.py

…

test_subprocess_home_isolation.py

…

test_timezone.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797)

2026-04-17 14:21:22 -07:00

test_toolset_distributions.py

…

test_toolsets.py

feat(discord): split discord_server into discord + discord_admin tools

2026-04-25 04:50:14 -07:00

test_trajectory_compressor_async.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_trajectory_compressor.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_transform_tool_result_hook.py

test: stop testing mutable data — convert change-detectors to invariants (#13363)

2026-04-20 23:20:33 -07:00

test_tui_gateway_server.py

fix(tui): /model writes HERMES_TUI_PROVIDER unconditionally (#16857) (#16897)

2026-04-28 01:17:04 -07:00

test_utils_truthy_values.py

…

test_yuanbao_integration.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00

test_yuanbao_markdown.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00

test_yuanbao_pipeline.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00

test_yuanbao_proto.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00