hermes-agent/tests/run_agent at 5e0eed470fe8c4d8a4e6bd1f66365acf2a1f3e5d - hermes-agent - ling

ling/hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Files

History

Teknium 5e0eed470f fix(cache): enable prompt caching for Qwen on OpenCode/OpenCode-Go/Alibaba (#13528 )

Qwen models on OpenCode, OpenCode Go, and direct DashScope accept
Anthropic-style cache_control markers on OpenAI-wire chat completions,
but hermes only injected markers for Claude-named models. Result: zero
cache hits on every turn, full prompt re-billed — a community user
reported burning through their OpenCode Go subscription on Qwen3.6.

Extend _anthropic_prompt_cache_policy to return (True, False) — envelope
layout, not native — for the Alibaba provider family when the model name
contains 'qwen'. Envelope layout places markers on inner content blocks
(matching pi-mono's 'alibaba' cacheControlFormat) and correctly skips
top-level markers on tool-role messages (which OpenCode rejects).

Non-Qwen models on these providers (GLM, Kimi) keep their existing
behaviour — they have automatic server-side caching and don't need
client markers.

Upstream reference: pi-mono #3392 / #3393 documented this contract for
opencode-go Qwen models.

Adds 7 regression tests covering Qwen3.5/3.6/coder on each affected
provider plus negative cases for GLM/Kimi/OpenRouter-Qwen.

2026-04-21 06:40:58 -07:00

..

__init__.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

conftest.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_413_compression.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_860_dedup.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_1630_context_overflow_loop.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_agent_guardrails.py

fix(delegate): make max_concurrent_children configurable + error on excess

2026-04-10 13:38:14 -07:00

test_agent_loop_tool_calling.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_agent_loop_vllm.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_agent_loop.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_anthropic_error_handling.py

feat(providers): extend request_timeout_seconds to all client paths

2026-04-19 11:23:00 -07:00

test_anthropic_prompt_cache_policy.py

fix(cache): enable prompt caching for Qwen on OpenCode/OpenCode-Go/Alibaba (#13528 )

2026-04-21 06:40:58 -07:00

test_anthropic_third_party_oauth_guard.py

fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 )

2026-04-19 22:43:09 -07:00

test_anthropic_truncation_continuation.py

fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 )

2026-04-19 22:43:09 -07:00

test_async_httpx_del_neuter.py

fix: bound auxiliary client cache to prevent fd exhaustion in long-running gateways (#10200 ) (#10470 )

2026-04-15 13:16:28 -07:00

test_compression_boundary.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_compression_feasibility.py

fix(compression): enforce 64k floor on aux model + auto-correct threshold (#12898 )

2026-04-20 00:56:04 -07:00

test_compression_persistence.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_compression_trigger_excludes_reasoning.py

fix(compression): exclude completion tokens from compression trigger (#12026 )

2026-04-20 05:12:10 -07:00

test_compressor_fallback_update.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_concurrent_interrupt.py

fix(tests): unstick CI — sweep stale tests from recent merges (#12670 )

2026-04-19 12:39:58 -07:00

test_context_token_tracking.py

feat(providers): extend request_timeout_seconds to all client paths

2026-04-19 11:23:00 -07:00

test_create_openai_client_kwargs_isolation.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_create_openai_client_proxy_env.py

fix(agent): normalize socks:// env proxies for httpx/anthropic

2026-04-21 05:52:46 -07:00

test_create_openai_client_reuse.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_dict_tool_call_args.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_exit_cleanup_interrupt.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_fallback_model.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_flush_memories_codex.py

fix(agent): respect config timeout for flush_memories instead of hardcoded 30s

2026-04-08 18:55:33 -07:00

test_interactive_interrupt.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_interrupt_propagation.py

test: stop testing mutable data — convert change-detectors to invariants (#13363 )

2026-04-20 23:20:33 -07:00

test_invalid_context_length_warning.py

fix(tests): resolve CI test failures — pool auto-seeding, stale assertions, mock isolation

2026-04-15 22:05:21 -07:00

test_long_context_tier_429.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_memory_provider_init.py

fix(memory): keep Honcho provider opt-in

2026-04-18 22:50:55 -07:00

test_openai_client_lifecycle.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_percentage_clamp.py

fix: update 6 test files broken by dead code removal

2026-04-10 03:44:43 -07:00

test_plugin_context_engine_init.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_primary_runtime_restore.py

fix(run_agent): recover primary client on openai transport errors

2026-04-10 03:21:24 -07:00

test_provider_attribution_headers.py

feat: attribution default_headers for ai-gateway provider

2026-04-20 21:02:28 -07:00

test_provider_fallback.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_provider_parity.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157 )

2026-04-20 12:23:05 -07:00

test_real_interrupt_subagent.py

fix(tests): fix 78 CI test failures and remove dead test (#9036 )

2026-04-13 10:50:24 -07:00

test_redirect_stdout_issue.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_repair_tool_call_arguments.py

fix: extract _repair_tool_call_arguments helper, add tests, bound loop

2026-04-20 05:12:55 -07:00

test_run_agent_codex_responses.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797 )

2026-04-17 14:21:22 -07:00

test_run_agent_multimodal_prologue.py

feat(api-server): inline image inputs on /v1/chat/completions and /v1/responses (#12969 )

2026-04-20 04:16:13 -07:00

test_run_agent.py

fix(kimi): send max_tokens, reasoning_effort, and thinking for Kimi/Moonshot

2026-04-21 05:32:27 -07:00

test_sequential_chats_live.py

test: regression guards for the keepalive/transport bug class (#10933 ) (#11266 )

2026-04-16 16:36:33 -07:00

test_session_meta_filtering.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_session_reset_fix.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_steer.py

refactor(steer): simplify injection marker to 'User guidance:' prefix (#13340 )

2026-04-20 22:18:49 -07:00

test_streaming.py

fix(ci): unblock test suite + cut ~2s of dead Z.AI probes from every AIAgent

2026-04-19 19:18:19 -07:00

test_strict_api_validation.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_switch_model_context.py

fix: pass config_context_length to switch_model context compressor

2026-04-10 05:52:45 -07:00

test_token_persistence_non_cli.py

fix(tests): make AIAgent constructor calls self-contained (#11755 )

2026-04-17 12:32:03 -07:00

test_tool_arg_coercion.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_unicode_ascii_codec.py

fix: always retry on ASCII codec UnicodeEncodeError — don't gate on per-component sanitization

2026-04-15 15:03:28 -07:00