mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-04-28 06:51:16 +08:00
`_apply_model_switch_result` (the interactive `/model` picker's confirmation path) printed `ModelInfo.context_window` straight from models.dev, which reports the vendor-wide value (1.05M for gpt-5.5 on openai). ChatGPT Codex OAuth caps the same slug at 272K, so the picker showed 1M while the runtime (compressor, gateway `/model`, typed `/model <name>`) correctly used 272K: the classic "sometimes 1M, sometimes 272K" mismatch on a single model.

Both display paths now go through `resolve_display_context_length()`, matching the fix that `_handle_model_switch` received earlier. Also bump the stale last-resort fallback in `DEFAULT_CONTEXT_LENGTHS` (`gpt-5.5: 400000 -> 1050000`) to match the real OpenAI API value; the 272K Codex cap is already enforced via the Codex-OAuth branch, so the fallback now reflects what every non-Codex probe miss should see.

Tests: adds `test_apply_model_switch_result_context.py` with three scenarios (Codex cap wins, OpenRouter shows 1.05M, resolver-empty falls back to `ModelInfo`). Updates the existing non-Codex fallback test to assert 1.05M (the correct value).

## Validation

| path                            | before    | after     |
|---------------------------------|-----------|-----------|
| picker -> gpt-5.5 on Codex      | 1,050,000 | 272,000   |
| picker -> gpt-5.5 on OpenAI     | 1,050,000 | 1,050,000 |
| picker -> gpt-5.5 on OpenRouter | 1,050,000 | 1,050,000 |
| typed /model gpt-5.5 on Codex   | 272,000   | 272,000   |
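The resolution order the tests exercise can be sketched roughly as below. This is a minimal illustration, not the actual hermes-agent code: `CODEX_OAUTH_CAPS`, `display_context`, and the function signatures are hypothetical names, and the real resolver also consults probed provider metadata that is omitted here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ModelInfo:
    """Simplified stand-in for the models.dev-backed metadata record."""
    slug: str
    context_window: Optional[int]  # vendor-wide value from models.dev


# Last-resort fallback, consulted only when no provider data is available.
DEFAULT_CONTEXT_LENGTHS = {"gpt-5.5": 1_050_000}

# Provider-specific caps; ChatGPT Codex OAuth limits gpt-5.5 to 272K.
CODEX_OAUTH_CAPS = {"gpt-5.5": 272_000}  # hypothetical name


def resolve_display_context_length(slug: str, provider: str) -> Optional[int]:
    """Provider cap wins (the Codex-OAuth branch), else the static fallback."""
    if provider == "codex-oauth":
        cap = CODEX_OAUTH_CAPS.get(slug)
        if cap is not None:
            return cap
    return DEFAULT_CONTEXT_LENGTHS.get(slug)


def display_context(info: ModelInfo, provider: str) -> Optional[int]:
    """If the resolver comes up empty, fall back to ModelInfo.context_window."""
    return resolve_display_context_length(info.slug, provider) or info.context_window
```

Under these assumptions the three test scenarios fall out directly: the Codex cap wins (272K), a non-Codex provider sees the 1.05M fallback, and an unknown slug falls back to whatever `ModelInfo.context_window` reports.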