hermes-agent/tests/agent at 9de4a38ce06eff052d09772b2a975ef029bc042d - hermes-agent - ling

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 23:11:37 +08:00

Teknium 3cba81ebed fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

Kimi's gateway selects the correct temperature server-side based on the
active mode (thinking -> 1.0, non-thinking -> 0.6).  Sending any
temperature value — even the previously "correct" one — conflicts with
gateway-managed defaults.

Replaces the old approach of forcing specific temperature values (0.6
for non-thinking, 1.0 for thinking) with an OMIT_TEMPERATURE sentinel
that tells all call sites to strip the temperature key from API kwargs
entirely.

Changes:
- agent/auxiliary_client.py: OMIT_TEMPERATURE sentinel, _is_kimi_model()
  prefix check (covers all kimi-* models), _fixed_temperature_for_model()
  returns sentinel for kimi models.  _build_call_kwargs() strips temp.
- run_agent.py: _build_api_kwargs, flush_memories, and summary generation
  paths all handle the sentinel by popping/omitting temperature.
- trajectory_compressor.py: _effective_temperature_for_model returns None
  for kimi (sentinel mapped), direct client calls use kwargs dict to
  conditionally include temperature.
- mini_swe_runner.py: same sentinel handling via wrapper function.
- 6 test files updated: all 'forces temperature X' assertions replaced
  with 'temperature not in kwargs' assertions.
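
A minimal sketch of the sentinel pattern described above, not the repository's actual code. The function names below drop the leading underscore and are hypothetical stand-ins; the default temperature of 0.7 and the handling of a "vendor/" prefix are assumptions made only for illustration.

```python
from typing import Any, Dict, List, Optional

# Hypothetical stand-in for the OMIT_TEMPERATURE sentinel named in the commit.
# A unique object lets callers tell "omit the key" apart from "no override" (None).
OMIT_TEMPERATURE = object()


def is_kimi_model(model: str) -> bool:
    """Prefix check for kimi-* models.

    Assumption: an optional "vendor/" prefix is ignored before matching.
    """
    name = model.lower().rsplit("/", 1)[-1]
    return name.startswith("kimi-")


def fixed_temperature_for_model(model: str) -> Optional[object]:
    """Return the sentinel for Kimi models, or None when nothing is forced."""
    return OMIT_TEMPERATURE if is_kimi_model(model) else None


def build_call_kwargs(
    model: str, messages: List[Dict[str, Any]], temperature: float = 0.7
) -> Dict[str, Any]:
    """Build API kwargs, stripping the temperature key when the sentinel applies."""
    kwargs: Dict[str, Any] = {
        "model": model,
        "messages": messages,
        "temperature": temperature,
    }
    if fixed_temperature_for_model(model) is OMIT_TEMPERATURE:
        # Leave the choice to the gateway: send no temperature at all.
        kwargs.pop("temperature", None)
    return kwargs
```

With this shape, a test can assert `"temperature" not in build_call_kwargs("kimi-k2.6", [])` rather than checking for a specific forced value.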

Net: -76 lines (171 added, 247 removed).
Inspired by PR #13137 (@kshitijk4poor).

2026-04-20 12:23:05 -07:00

__init__.py

test: add unit tests for 8 modules (batch 2)

2026-02-26 13:54:20 +03:00

test_anthropic_adapter.py

fix(agent): downgrade xhigh→max on Anthropic pre-4.7 adaptive models

2026-04-16 12:00:56 -07:00

test_auxiliary_client_anthropic_custom.py

fix(anthropic): complete third-party Anthropic-compatible provider support (#12846)

2026-04-19 22:43:09 -07:00

test_auxiliary_client.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_auxiliary_config_bridge.py

fix: remove legacy compression.summary_* config and env var fallbacks (#8992)

2026-04-13 04:59:26 -07:00

test_auxiliary_main_first.py

feat(auxiliary): default 'auto' routing to main model for all users (#11900)

2026-04-17 19:13:23 -07:00

test_auxiliary_named_custom_providers.py

fix(tests): resolve CI test failures — pool auto-seeding, stale assertions, mock isolation

2026-04-15 22:05:21 -07:00

test_bedrock_adapter.py

feat: native AWS Bedrock provider via Converse API

2026-04-15 16:17:17 -07:00

test_bedrock_integration.py

fix(run_agent): preserve dotted Bedrock inference-profile model IDs (#11976)

2026-04-19 20:30:44 -07:00

test_codex_cloudflare_headers.py

fix(codex): pin correct Cloudflare headers and extend to auxiliary client

2026-04-19 11:59:25 -07:00

test_compress_focus.py

fix: resolve CI test failures — add missing functions, fix stale tests (#9483)

2026-04-14 01:43:45 -07:00

test_context_compressor.py

fix(context_compressor): keep tool-call arguments JSON valid when shrinking

2026-04-18 12:40:56 -07:00

test_context_engine.py

feat: wire context engine plugin slot into agent and plugin system

2026-04-10 19:15:50 -07:00

test_context_references.py

fix(agent): fall back when rg is blocked for @folder references

2026-04-20 01:56:41 -07:00

test_credential_pool_routing.py

refactor: remove smart_model_routing feature (#12732)

2026-04-19 18:12:55 -07:00

test_credential_pool.py

fix(tests): resolve CI test failures — pool auto-seeding, stale assertions, mock isolation

2026-04-15 22:05:21 -07:00

test_crossloop_client_cache.py

refactor(tests): re-architect tests + fix CI failures (#5946)

2026-04-07 17:19:07 -07:00

test_display_emoji.py

feat(tools): centralize tool emoji metadata in registry + skin integration

2026-03-15 20:21:21 -07:00

test_display.py

fix(display): render <missing old_text> in memory previews instead of empty quotes (#12852)

2026-04-19 22:45:47 -07:00

test_error_classifier.py

test(error_classifier): broaden non-string message type coverage

2026-04-20 02:40:20 -07:00

test_external_skills.py

feat(skills): support external skill directories via config (#3678)

2026-03-29 00:33:30 -07:00

test_gemini_cloudcode.py

fix(gemini): assign unique stream indices to parallel tool calls

2026-04-20 02:10:53 -07:00

test_gemini_native_adapter.py

fix(gemini): sanitize tool schemas for Google providers

2026-04-20 00:26:18 -07:00

test_insights.py

Merge branch 'main' into feat/dashboard-skill-analytics

2026-04-20 05:25:49 -07:00

test_local_stream_timeout.py

fix: is_local_endpoint misses Docker/Podman DNS names (#7950)

2026-04-11 14:46:18 -07:00

test_memory_provider.py

fix(honcho): dialectic lifecycle — defaults, retry, prewarm consumption

2026-04-18 22:50:55 -07:00

test_memory_user_id.py

fix(honcho): scope gateway sessions by runtime user id

2026-04-18 22:50:55 -07:00

test_minimax_auxiliary_url.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983)

2026-04-07 22:23:28 -07:00

test_minimax_provider.py

fix: preserve dots in model names for OpenCode Zen and ZAI providers (#8794)

2026-04-12 21:22:59 -07:00

test_model_metadata_local_ctx.py

fix(agent): prefer Ollama Modelfile num_ctx over GGUF training max

2026-04-13 04:24:07 -07:00

test_model_metadata.py

fix(agent): complete Claude Opus 4.7 API migration

2026-04-16 10:48:20 -07:00

test_models_dev.py

fix: three provider-related bugs (#8161, #8181, #8147) (#8243)

2026-04-12 01:44:18 -07:00

test_nous_rate_guard.py

fix: Nous Portal rate limit guard — prevent retry amplification (#10568)

2026-04-15 16:31:48 -07:00

test_prompt_builder.py

fix(agent): refresh skills prompt cache when disabled skills change

2026-04-19 11:16:24 -07:00

test_prompt_caching.py

fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter

2026-03-21 16:54:43 -07:00

test_proxy_and_url_validation.py

fix(runtime): surface malformed proxy env and base URL before client init

2026-04-15 16:10:53 -07:00

test_rate_limit_tracker.py

feat: capture provider rate limit headers and show in /usage (#6541)

2026-04-09 03:43:14 -07:00

test_redact.py

feat: replace kimi-k2.5 with kimi-k2.6 on OpenRouter and Nous Portal (#13148)

2026-04-20 11:49:54 -07:00

test_skill_commands.py

fix: sanitize Telegram command names to strip invalid characters

2026-04-06 11:27:28 -07:00

test_subagent_progress.py

test: update stale tests to match current code (#11963)

2026-04-17 21:35:30 -07:00

test_subdirectory_hints.py

fix(agent): catch PermissionError in subdirectory hint discovery

2026-04-09 03:10:30 -07:00

test_title_generator.py

feat: auto-generate session titles after first exchange

2026-03-17 04:14:40 -07:00

test_usage_pricing.py

feat: use endpoint metadata for custom model context and pricing (#1906)

2026-03-18 03:04:07 -07:00

test_vision_resolved_args.py

fix: pass resolved args to resolve_vision_provider_client()

2026-04-16 07:45:13 -07:00