hermes-agent/hermes_cli/model_switch.py at 4db58d45d4e06fc819b8ff6729548bd9d7d02a8a

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-04 09:47:54 +08:00


Teknium 05d8f11085 fix(/model): show provider-enforced context length, not raw models.dev (#15438)

/model gpt-5.5 on openai-codex showed 'Context: 1,050,000 tokens' because
the display block used ModelInfo.context_window directly from models.dev.
Codex OAuth actually enforces 272K for the same slug, and the agent's
compressor already runs at 272K via get_model_context_length() — so the
banner + real context budget said 272K while /model lied with 1M.

Route the display context through a new resolve_display_context_length()
helper that always prefers agent.model_metadata.get_model_context_length
(which knows about Codex OAuth, Copilot, Nous caps) and only falls back
to models.dev when that returns nothing.
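The preference order described above could be sketched roughly as follows. This is an illustration only, not the code from the commit: the function name carries a `_sketch` suffix, the parameter list is hypothetical, the provider-aware lookup is passed in as a plain callable standing in for agent.model_metadata.get_model_context_length (whose real signature is not shown here), and 272K is assumed to mean exactly 272,000 tokens.

```python
from typing import Callable, Optional


def resolve_display_context_length_sketch(
    model: str,
    provider: str,
    provider_aware_lookup: Callable[[str, str], Optional[int]],
    models_dev_context_window: Optional[int],
) -> Optional[int]:
    """Pick the context length to show in the /model output.

    Hypothetical signature. `provider_aware_lookup` stands in for the
    provider-aware resolver; `models_dev_context_window` stands in for
    the raw ModelInfo.context_window value from models.dev.
    """
    # Prefer the provider-enforced value whenever the lookup yields one.
    enforced = provider_aware_lookup(model, provider)
    if enforced:
        return enforced
    # Fall back to models.dev only when the lookup returned nothing.
    return models_dev_context_window


def fake_provider_lookup(model: str, provider: str) -> Optional[int]:
    """Toy lookup table (assumed values, for illustration only)."""
    caps = {("gpt-5.5", "openai-codex"): 272_000}
    return caps.get((model, provider))


def format_context_line(tokens: Optional[int]) -> str:
    """Render the display line; 'unknown' when no value is available."""
    if tokens is None:
        return "Context: unknown"
    return f"Context: {tokens:,} tokens"
```

With the toy lookup, `format_context_line(resolve_display_context_length_sketch("gpt-5.5", "openai-codex", fake_provider_lookup, 1_050_000))` yields `Context: 272,000 tokens`, while a model the lookup does not know falls through to the models.dev figure.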

Fix applied to all 3 /model display sites:
  cli.py _handle_model_switch
  gateway/run.py picker on_model_selected callback
  gateway/run.py text-fallback confirmation

Reported by @emilstridell (Telegram, April 2026).

2026-04-24 17:21:38 -07:00

55 KiB
