hermes-agent/tests at b35d692f45d5f8c4d2ba567a64daa38ebba96a1a - hermes-agent - ling

ling/hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Files

History

Ash Rowan Vale 🌿 facea84559 fix(auxiliary): retry without temperature when any provider rejects it

Universal reactive fix for 'HTTP 400: Unsupported parameter: temperature'
across all providers/models — not just Codex Responses.

The same backend can accept temperature for some models and reject it for
others (e.g. gpt-5.4 accepts but gpt-5.5 rejects on the same OpenAI
endpoint; similar patterns on Copilot, OpenRouter reasoning routes, and
Anthropic Opus 4.7+ via OAI-compat). An allow/deny-list by model name does
not scale.

call_llm / async_call_llm now detect the concrete 'unsupported parameter:
temperature' 400 and transparently retry once without temperature. Kimi's
server-managed omission and Opus 4.7+'s proactive strip stay in place —
this is the safety net for everything else.

Changes:
- agent/auxiliary_client.py: add _is_unsupported_temperature_error helper;
  wire into both sync and async call_llm paths before the existing
  max_tokens/payment/auth retry ladder
- tests/agent/test_unsupported_temperature_retry.py: 19 tests covering
  detector phrasings, sync + async retry, no-retry-without-temperature,
  and non-temperature 400s not triggering the retry

Builds on PR #15620 (codex_responses fallback) which stripped temperature
up front for that one api_mode. This PR closes the gap for every other
provider/model combo via reactive retry.

Credit: retry approach and detector originate from @BlueBirdBack's PR #15578.

Co-authored-by: BlueBirdBack <BlueBirdBack@users.noreply.github.com>

2026-04-25 05:27:17 -07:00

..

fix(acp): include MCP toolsets in ACP sessions

2026-04-24 03:04:42 -07:00

fix(auxiliary): retry without temperature when any provider rejects it

2026-04-25 05:27:17 -07:00

feat: add slash command for busy input mode

2026-04-24 15:15:26 -07:00

fix(cron): wire context_from through the update action

2026-04-25 04:49:28 -07:00

refactor(commands): drop /provider, /plan handler, and clean up slash registry (#15047 )

2026-04-24 03:10:52 -07:00

environments/benchmarks

fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944 )

2026-04-07 17:28:37 -07:00

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

fix(matrix): drop needless DeviceID import + mock put_device_id in tests

2026-04-25 07:17:03 +05:30

feat(tools): make discord/discord_admin opt-in, Discord-only

2026-04-25 04:51:11 -07:00

fix(resume): redirect --resume to the descendant that actually holds the messages

2026-04-24 03:04:42 -07:00

feat(honcho): wizard cadence default 2, surface reasoning level, backwards-compat fallback

2026-04-18 22:50:55 -07:00

fix(discord): strip RTP padding before DAVE/Opus decode (#11267 )

2026-04-16 16:50:15 -07:00

feat(hindsight): optional bank_id_template for per-agent / per-user banks

2026-04-24 03:38:17 -07:00

fix(flush_memories): strip temperature from codex_responses fallback (#15620 )

2026-04-25 05:01:25 -07:00

fix(skills): factor HERMES_HOME resolution into shared _hermes_home helper

2026-04-24 16:45:27 -07:00

feat(discord): split discord_server into discord + discord_admin tools

2026-04-25 04:50:14 -07:00

fix(tui): keep default personality neutral

2026-04-24 16:19:23 -05:00

__init__.py

…

conftest.py

test(conftest): reset module-level state + unset platform allowlists (#13400 )

2026-04-21 01:33:10 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672 )

2026-03-17 02:53:33 -07:00

test_account_usage.py

feat(account-usage): add per-provider account limits module

2026-04-21 01:56:35 -07:00

test_base_url_hostname.py

security(runtime_provider): close OLLAMA_API_KEY substring-leak sweep miss (#13522 )

2026-04-21 06:06:16 -07:00

test_batch_runner_checkpoint.py

test: regression coverage for checkpoint dedup and inf/nan coercion

2026-04-24 14:32:21 -07:00

test_cli_file_drop.py

fix(tui): improve macOS paste and shortcut parity

2026-04-21 08:00:00 -07:00

test_cli_skin_integration.py

fix: align status bar skin tests with upstream main

2026-04-22 13:20:02 -07:00

test_ctx_halving_fix.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_empty_model_fallback.py

fix: fall back to provider's default model when model config is empty (#8303 )

2026-04-12 03:53:30 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482 )

2026-03-15 21:59:53 -07:00

test_hermes_constants.py

fix(gateway): harden Docker/container gateway pathway

2026-04-12 16:36:11 -07:00

test_hermes_logging.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_hermes_state.py

feat(dashboard): track real API call count per session

2026-04-22 05:51:58 -07:00

test_honcho_client_config.py

feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623 )

2026-04-02 15:33:51 -07:00

test_ipv4_preference.py

feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196 )

2026-04-11 23:12:11 -07:00

test_mcp_serve.py

feat: add MCP server mode — hermes mcp serve (#3795 )

2026-03-29 15:47:19 -07:00

test_mini_swe_runner.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_minimax_model_validation.py

fix(models): validate MiniMax models against static catalog (#12611 , #12460 , #12399 , #12547 )

2026-04-19 22:44:47 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )

2026-04-07 17:59:42 -07:00

test_model_tools_async_bridge.py

fix(core): ensure non-blocking executor shutdown on async timeout

2026-04-22 14:42:32 -07:00

test_model_tools.py

test: regression coverage for checkpoint dedup and inf/nan coercion

2026-04-24 14:32:21 -07:00

test_ollama_num_ctx.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )

2026-04-07 22:23:28 -07:00

test_packaging_metadata.py

chore: prepare Hermes for Homebrew packaging (#4099 )

2026-03-30 17:34:43 -07:00

test_plugin_skills.py

fix(tests): attach caplog to specific logger in 3 order-dependent tests (#11453 )

2026-04-17 00:20:40 -07:00

test_project_metadata.py

build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627 )

2026-04-17 13:31:53 -07:00

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426 ) (#7357 )

2026-04-10 13:37:45 -07:00

test_timezone.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_toolset_distributions.py

…

test_toolsets.py

feat(discord): split discord_server into discord + discord_admin tools

2026-04-25 04:50:14 -07:00

test_trajectory_compressor_async.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_trajectory_compressor.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_transform_tool_result_hook.py

test: stop testing mutable data — convert change-detectors to invariants (#13363 )

2026-04-20 23:20:33 -07:00

test_tui_gateway_server.py

feat(tui): per-section visibility for the details accordion

2026-04-24 02:34:32 -05:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00