hermes-agent/tests at 3cc4d7374f2ca92112fabb20a12b4716f729b6cd - hermes-agent - ling

ling/hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Files

History

Kian Meng 063bc3c1e2 fix(kimi): send max_tokens, reasoning_effort, and thinking for Kimi/Moonshot

Kimi/Moonshot endpoints require explicit parameters that Hermes was not
sending, causing 'Response truncated due to output length limit' errors
and inconsistent reasoning behavior.

Root cause analysis against Kimi CLI source (MoonshotAI/kimi-cli,
packages/kosong/src/kosong/chat_provider/kimi.py):

1. max_tokens: Kimi's API defaults to a very low value when omitted.
   Reasoning tokens share the output budget — the model exhausts it on
   thinking alone.  Send 32000, matching Kimi CLI's generate() default.

2. reasoning_effort: Kimi CLI sends this as a top-level parameter (not
   inside extra_body).  Hermes was not sending it at all because
   _supports_reasoning_extra_body() returns False for non-OpenRouter
   endpoints.

3. extra_body.thinking: Kimi CLI uses with_thinking() which sets
   extra_body.thinking={"type":"enabled"} alongside reasoning_effort.
   This is a separate control from the OpenAI-style reasoning extra_body
   that Hermes sends for OpenRouter/GitHub.  Without it, the Kimi gateway
   may not activate reasoning mode correctly.

Covers api.kimi.com (Kimi Code) and api.moonshot.ai/cn (Moonshot).

Tests: 6 new test cases for max_tokens, reasoning_effort, and
extra_body.thinking under various configs.

2026-04-21 05:32:27 -07:00

..

refactor(acp): validate method_id against advertised provider in authenticate() (#13468 )

2026-04-21 03:39:55 -07:00

test(copilot-acp): patch HERMES_HOME alongside HOME in hub-block test

2026-04-21 01:31:58 -07:00

fix(cli): dispatch /steer inline while agent is running (#13354 )

2026-04-20 23:05:38 -07:00

fix(cron): run due jobs in parallel to prevent serial tick starvation (#13021 )

2026-04-20 11:53:07 -07:00

fix: follow-up for salvaged PRs #6293 , #7387 , #9091 , #13131

2026-04-20 14:56:04 -07:00

environments/benchmarks

fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944 )

2026-04-07 17:28:37 -07:00

…

test(telegram): update /cmd@botname assertion for entity-only detection

2026-04-21 03:06:56 -07:00

fix(/model): accept provider switches when /models is unreachable

2026-04-21 05:19:43 -07:00

feat(honcho): wizard cadence default 2, surface reasoning level, backwards-compat fallback

2026-04-18 22:50:55 -07:00

fix(discord): strip RTP padding before DAVE/Opus decode (#11267 )

2026-04-16 16:50:15 -07:00

feat(plugins): make all plugins opt-in by default

2026-04-20 04:46:45 -07:00

fix(kimi): send max_tokens, reasoning_effort, and thinking for Kimi/Moonshot

2026-04-21 05:32:27 -07:00

fix(google-workspace): normalize authorized user token writes

2026-04-16 04:22:16 -07:00

test(mcp): add failing tests for circuit-breaker recovery

2026-04-21 05:19:03 -07:00

fix(tui-gateway): dispatch slow RPC handlers on a thread pool (#12546 )

2026-04-19 07:47:15 -05:00

__init__.py

…

conftest.py

test(conftest): reset module-level state + unset platform allowlists (#13400 )

2026-04-21 01:33:10 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672 )

2026-03-17 02:53:33 -07:00

test_account_usage.py

feat(account-usage): add per-provider account limits module

2026-04-21 01:56:35 -07:00

test_base_url_hostname.py

fix: sweep remaining provider-URL substring checks across codebase

2026-04-20 22:14:29 -07:00

test_batch_runner_checkpoint.py

fix(batch_runner): mark discarded no-reasoning prompts as completed (#9950 )

2026-04-20 04:56:06 -07:00

test_cli_file_drop.py

fix(gateway): reject file paths in get_command() + file-drop tests (#7356 )

2026-04-10 13:06:02 -07:00

test_cli_skin_integration.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_ctx_halving_fix.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_empty_model_fallback.py

fix: fall back to provider's default model when model config is empty (#8303 )

2026-04-12 03:53:30 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482 )

2026-03-15 21:59:53 -07:00

test_hermes_constants.py

fix(gateway): harden Docker/container gateway pathway

2026-04-12 16:36:11 -07:00

test_hermes_logging.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_hermes_state.py

fix(session_search): restore same-session context when message ids are interleaved

2026-04-20 05:10:03 -07:00

test_honcho_client_config.py

feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623 )

2026-04-02 15:33:51 -07:00

test_ipv4_preference.py

feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196 )

2026-04-11 23:12:11 -07:00

test_mcp_serve.py

feat: add MCP server mode — hermes mcp serve (#3795 )

2026-03-29 15:47:19 -07:00

test_mini_swe_runner.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_minimax_model_validation.py

fix(models): validate MiniMax models against static catalog (#12611 , #12460 , #12399 , #12547 )

2026-04-19 22:44:47 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_model_tools_async_bridge.py

fix: use per-thread persistent event loops in worker threads

2026-03-20 15:41:06 -04:00

test_model_tools.py

feat(plugins): add transform_tool_result hook for generic tool-result rewriting (#12972 )

2026-04-20 03:48:08 -07:00

test_ollama_num_ctx.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )

2026-04-07 22:23:28 -07:00

test_packaging_metadata.py

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

test_plugin_skills.py

fix(tests): attach caplog to specific logger in 3 order-dependent tests (#11453 )

2026-04-17 00:20:40 -07:00

test_project_metadata.py

build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627 )

2026-04-17 13:31:53 -07:00

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426 ) (#7357 )

2026-04-10 13:37:45 -07:00

test_timezone.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_toolset_distributions.py

…

test_toolsets.py

fix(ci): unblock test suite + cut ~2s of dead Z.AI probes from every AIAgent

2026-04-19 19:18:19 -07:00

test_trajectory_compressor_async.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_trajectory_compressor.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_transform_tool_result_hook.py

test: stop testing mutable data — convert change-detectors to invariants (#13363 )

2026-04-20 23:20:33 -07:00

test_tui_gateway_server.py

fix(tui): /model picker surfaces curated list, matching classic CLI (#12671 )

2026-04-19 16:15:22 -07:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00