hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Author	SHA1	Message	Date
TechPrototyper	3a7653dd1f	feat: Add Azure Foundry provider with OpenAI/Anthropic API mode selection Add support for Azure Foundry as a new inference provider. Azure Foundry endpoints can use either OpenAI-style (/v1/chat/completions) or Anthropic-style (/v1/messages) API formats. Changes: - Add azure-foundry to PROVIDER_REGISTRY (auth.py) - Add azure-foundry overlay in HERMES_OVERLAYS (providers.py) - Add empty model list for azure-foundry (models.py) - Add _model_flow_azure_foundry() interactive setup (main.py) - Add azure-foundry runtime resolution with api_mode support (runtime_provider.py) - Add AZURE_FOUNDRY_API_KEY and AZURE_FOUNDRY_BASE_URL env vars (config.py) Usage: hermes model -> More providers -> Azure Foundry The setup wizard prompts for: - Endpoint URL - API format (OpenAI or Anthropic-style) - API key - Model name Configuration is saved to config.yaml (model.provider, model.base_url, model.api_mode, model.default) and ~/.hermes/.env (AZURE_FOUNDRY_API_KEY).	2026-04-25 18:48:43 -07:00
Teknium	125de02056	fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844 ) Fixes #15779. Custom-provider per-model context_length (`custom_providers[].models.<id>.context_length`) is now honored across every resolution path, not just agent startup. Also adds 256K as the top probe tier and default fallback. ## What changed New helper `hermes_cli.config.get_custom_provider_context_length()` — single source of truth for the per-model override lookup, with trailing-slash-insensitive base-url matching. `agent.model_metadata.get_model_context_length()` gains an optional `custom_providers=` kwarg (step 0b — runs after explicit `config_context_length` but before every other probe). Wired through five call sites that previously either duplicated the lookup or ignored it entirely: - `run_agent.py` startup — refactored to use the new helper (dedups legacy inline loop, keeps invalid-value warning) - `AIAgent.switch_model()` — re-reads custom_providers from live config on every /model switch - `hermes_cli.model_switch.resolve_display_context_length()` — new `custom_providers=` kwarg - `gateway/run.py` /model confirmation (picker callback + text path) - `gateway/run.py` `_format_session_info` (/info) ## Context probe tiers `CONTEXT_PROBE_TIERS = [256_000, 128_000, 64_000, 32_000, 16_000, 8_000]` — was `[128_000, ...]`. `DEFAULT_FALLBACK_CONTEXT` follows tier[0], so unknown models now default to 256K. The stale `128000` literal in the OpenRouter metadata-miss path is replaced with `DEFAULT_FALLBACK_CONTEXT` for consistency. ## Repro (from #15779) ```yaml custom_providers: - name: my-custom-endpoint base_url: https://example.invalid/v1 model: gpt-5.5 models: gpt-5.5: context_length: 1050000 ``` `/model gpt-5.5 --provider custom:my-custom-endpoint` → previously "Context: 128,000", now "Context: 1,050,000". ## Tests - `tests/hermes_cli/test_custom_provider_context_length.py` — new file, 19 tests covering the helper, step-0b integration, and the 256K tier invariants - `tests/hermes_cli/test_model_switch_context_display.py` — added regression tests for #15779 through the display resolver - `tests/gateway/test_session_info.py` — updated default-fallback assertion (128K → 256K) - `tests/agent/test_model_metadata.py` — updated tier assertions for the new top tier	2026-04-25 18:47:53 -07:00
Teknium	4c591c2819	chore(release): map fqsy1416@gmail.com to EKKOLearnAI	2026-04-25 18:40:35 -07:00
Teknium	01535a4732	fix(api_server): cap stop-run wait at 5s so interrupt can't hang handler task.cancel() can't preempt the run_in_executor thread running run_conversation(), so we rely on agent.interrupt() to wake the loop. Without a timeout, a slow/unresponsive interrupt blocks the HTTP response indefinitely. Wrap the await in wait_for(shield(task), 5.0) and log a warning on timeout. Also tidy one extra space in the module docstring's /stop entry.	2026-04-25 18:40:35 -07:00
ekko	0a15dbdc43	feat(api_server): add POST /v1/runs/{run_id}/stop endpoint Add ability to interrupt a running agent via the runs API. Previously /v1/runs could start a run and subscribe to events, but there was no way to cancel it. The new endpoint stores agent and task references during execution, calls agent.interrupt() to stop LLM calls, then cancels the asyncio task. Includes 15 tests covering start, events, and stop scenarios.	2026-04-25 18:40:35 -07:00
Teknium	ce0513dd2e	chore(release): map Feranmi10 personal email	2026-04-25 18:39:55 -07:00
Oluwadare Feranmi	dc5e02ea7f	feat(cli): implement hermes update --check flag (fixes #10318 )	2026-04-25 18:39:55 -07:00
brooklyn!	ff851ba7b9	Merge pull request #15821 from NousResearch/fix/tui-ctrl-g-editor fix: external editor handoff in CLI/TUI	2026-04-25 20:37:05 -05:00
Brooklyn Nicholson	14dd8e9a72	fix(tui): address Copilot review on editor handoff - resolveEditor() now returns argv (string[]) so EDITOR='code --wait' and VISUAL='emacsclient -t' tokenize correctly into spawnSync's separate command + args. Previously the whole string was passed as argv[0] and would ENOENT. - Skip the POSIX X_OK PATH walk on Windows; return ['notepad.exe'] there since fs.constants.X_OK is not meaningful and PATHEXT-based resolution would need its own implementation. - Surface openEditor() rejections via actions.sys instead of letting them become unhandled promise rejections in the useInput callback. - Hotkey docs/comment now say Cmd/Ctrl+G to match isAction()'s platform-action-modifier behavior (Cmd on macOS, Ctrl elsewhere).	2026-04-25 20:34:24 -05:00
Wysie	1d80e92c7e	test(discord): add guild to fake e2e messages	2026-04-25 18:25:56 -07:00
Teknium	edce7522a5	chore(release): add AUTHOR_MAP entry for voidborne-d personal email	2026-04-25 18:25:13 -07:00
voidborne-d	45e1228a8a	fix(cli): suppress OSError EIO on interrupt shutdown When the user interrupts a long-running task, prompt_toolkit tries to flush stdout during emergency shutdown. If stdout is in a broken state (redirected to /dev/null, pipe closed, terminal gone), the flush raises `OSError: [Errno 5] Input/output error` which propagates unhandled and crashes the CLI. Two defense layers: 1. `_suppress_closed_loop_errors`: add `OSError` with `errno.EIO` to the asyncio exception handler, matching the existing pattern for `RuntimeError("Event loop is closed")` and `KeyError("is not registered")`. 2. Outer `except (KeyError, OSError)` block: add `errno.EIO` check before the existing string-match guards, silently suppressing the error instead of printing a misleading stdin-related message. Fixes #13710.	2026-04-25 18:25:13 -07:00
Brooklyn Nicholson	83129e72de	refactor(tui): tighten editor handoff helpers - editor.ts: collapse two private helpers into one flatMap-driven lookup, keep `isExecutable` as the only named primitive, document the fallback chain with prompt_toolkit parity - editor.test.ts: hoist the `exe` helper out of `describe`, drop the empty afterEach + dead mkdir branch, materialize expected paths before the resolveEditor call so argument evaluation order doesn't bite - useComposerState.openEditor: rmSync the mkdtemp dir (was leaking), early-return on bad exit / empty buffer, run cleanup in finally - useInputHandlers: cheap `ch.toLowerCase() === 'g'` guard before the modifier check - hermes-ink/screen.ts: pick up `npm run fix` import-sort cleanup so lint passes	2026-04-25 20:24:06 -05:00
Teknium	4d170134ef	chore(release): map nerijusn76@gmail.com to Nerijusas (#15833 )	2026-04-25 18:22:49 -07:00
nerijusas	81e01f6ee9	fix(agent): preserve Codex message items for replay	2026-04-25 18:22:06 -07:00
Brooklyn Nicholson	7fd8dc0bfb	fix: preserve prompt_toolkit editor picker and mirror it in TUI Base CLI's editor UX was better because prompt_toolkit picks the system editor first, then friendly terminal editors before vi. Do not override that with a vim-first chain. Keep the CLI on prompt_toolkit's picker and only set tempfile_suffix='.md' to avoid the complex-tempfile EEXIST path. Update the TUI resolver to match prompt_toolkit's fallback order: $VISUAL, $EDITOR, editor, nano, pico, vi, emacs.	2026-04-25 20:20:05 -05:00
Brooklyn Nicholson	d056b610b7	fix: avoid prompt_toolkit complex tempfile bug and prefer nvim first Setting buffer.tempfile = 'prompt.md' pushed prompt_toolkit into its complex-tempfile path, which creates a temp dir and then calls os.makedirs() on that same path when no subdirectory is present. That raises EEXIST before the editor can launch. Keep prompt_toolkit on the simple tempfile path with .md suffix, and make the editor fallback chain explicit on both surfaces: $VISUAL -> $EDITOR -> nvim -> vim -> vi -> nano.	2026-04-25 20:16:50 -05:00
Teknium	2536a36f6f	fix(tui): route /save through session.save JSON-RPC The cherry-picked approach serialized the UI-shaped transcript on the Node side, producing a third JSON format alongside cli.py save_conversation and tui_gateway session.save. Simpler to call the existing session.save method, which already writes the canonical agent history (raw OpenAI messages + model) to an absolute-path file. - /save still short-circuits before the slash worker - Empty transcript -> 'no conversation yet' - No active session -> 'no active session - nothing to save' - Otherwise: rpc('session.save', {session_id}) and echo back the file path - Tests updated to assert RPC contract; new test covers the no-sid case	2026-04-25 18:11:37 -07:00
helix4u	1b8ca9254f	fix(tui): save live transcript from slash command	2026-04-25 18:11:37 -07:00
Brooklyn Nicholson	db7c5735f0	fix: prefer vim over nano for $EDITOR fallback (CLI + TUI) prompt_toolkit's default editor list is: $VISUAL, $EDITOR, /usr/bin/editor, /usr/bin/nano, /usr/bin/pico, /usr/bin/vi, /usr/bin/emacs — so when neither env var is set, the base CLI launched nano. The TUI fell back to a literal 'vi'. Same Ctrl+G keystroke, two different editors. Pick the same chain on both surfaces: $VISUAL → $EDITOR → vim → vi → nano CLI: override input_area.buffer._open_file_in_editor on the TextArea once at app build time. Local to that buffer; doesn't touch os.environ or affect other subprocesses. TUI: extract resolveEditor() into ui-tui/src/lib/editor.ts. PATH walk with accessSync(X_OK), no shelling out. Six-line unit test verifies the priority order and the multi-entry PATH walk.	2026-04-25 20:11:25 -05:00
Teknium	8bbeaea6c7	fix(config): broaden api-key ref lookup to templated base_url The raw-template lookup added in PR #15817 went through `get_compatible_custom_providers(read_raw_config())`, which calls `_normalize_custom_provider_entry` → `urlparse(base_url)`. Any entry whose `base_url` is itself an env-ref (`${NEURALWATT_API_BASE}`) was dropped as 'not a valid URL', so `api_key_ref` stayed empty and the resolved secret was still written to `model.api_key` — the exact case the original Discord report described. Replace the normalizer-gated lookup with a direct read of `raw['custom_providers']` and `raw['providers']`, indexed by name (case-insensitive, optionally qualified by model) so the loaded (expanded) entry can be matched regardless of how `base_url` is written. Add an integration regression test driving the real `select_provider_and_model` entry point with the Discord-reported NeuralWatt config (`${VAR}` in both `base_url` and `api_key`). This test fails on the PR-only fix and passes with the broadened lookup.	2026-04-25 18:10:52 -07:00
helix4u	1fdc31b214	fix(config): preserve custom provider api key refs	2026-04-25 18:10:52 -07:00
Brooklyn Nicholson	5fac6c3440	fix(cli): write editor draft to prompt.md so syntax highlighting works Base CLI was handing prompt_toolkit's Buffer.open_in_editor() a default config — Buffer.tempfile_suffix and .tempfile both empty — so it created /tmp/tmpXXXXXX with no extension. nano/vim/helix all key syntax highlighting off the file extension, so the buffer rendered plain. The TUI already writes to <mkdtemp>/prompt.md and gets full markdown highlighting + a sensible title bar. Set buffer.tempfile = 'prompt.md' on the TextArea so prompt_toolkit's complex-tempfile path produces <mkdtemp>/prompt.md to match. shutil.rmtree cleanup is built-in.	2026-04-25 20:04:04 -05:00
kshitijk4poor	2c56dce0ed	fix(model): preserve custom endpoint credentials and accept cloud models not in /v1/models When switching models on a custom endpoint (ollama-launch): - Same-provider switches no longer re-resolve credentials (fixes base_url being lost for 'custom' provider on subsequent switches) - Named providers (ollama-launch) are resolved via user_providers so switch_model can find their base_url from config - Models not in the /v1/models probe but present in the user's saved provider config are accepted with a warning instead of rejected - CLI /model and TUI /model both pass user_providers/custom_providers to switch_model so the config model list is available for validation Closes #15088	2026-04-25 18:03:47 -07:00
Teknium	01cf2c65cc	chore(release): map iris@growthpillars.co to irispillars (#15825 ) Follow-up to #15533 (merged). Prevents release notes CI from attributing the contributor to the placeholder.	2026-04-25 18:02:13 -07:00
helix4u	b2d3308f98	fix(doctor): accept bare custom provider	2026-04-25 18:01:36 -07:00
Iris Jin	25ba6a4a74	fix(gateway): make reasoning session-scoped by default	2026-04-25 18:01:31 -07:00
Brooklyn Nicholson	4c797bfae9	fix(cli): accept Alt+G as Ctrl+G fallback in VSCode/Cursor terminals Same problem as the TUI: Cursor and VSCode bind Ctrl+G to "Find Next" at the editor level, so the keystroke never reaches the terminal and the prompt_toolkit-driven Hermes CLI sees nothing. Register ('escape', 'g') alongside the existing 'c-g' on the same handler so the editor handoff works inside Cursor/VSCode too. The filter (no clarify/approval/sudo/secret prompt active) is unchanged.	2026-04-25 20:01:03 -05:00
Brooklyn Nicholson	c58956a9a2	fix(tui): accept Alt+G as Ctrl+G fallback in VSCode/Cursor terminals VSCode and Cursor bind Ctrl+G to "Find Next" at the editor level, so the keystroke never reaches the embedded terminal — Ctrl+G to open \$EDITOR was effectively dead inside those IDEs. Alt+G is unbound in both editors and reaches the TUI cleanly as `\x1bg` → `key.meta && ch === 'g'` after parse-keypress. Accept it alongside the existing isAction(key, ch, 'g') check, and document the fallback in README + the hotkeys panel.	2026-04-25 19:57:17 -05:00
Brooklyn Nicholson	3944b22506	fix(tui): suspend Ink properly when opening $EDITOR via Ctrl+G The Ctrl+G handler was toggling the alt-screen by hand (`\x1b[?1049l` ... `\x1b[?1049h`) without releasing stdin or kitty keyboard mode, so the launched editor would lose keystrokes (Ink kept swallowing them) and editors that don't speak CSI-u (e.g. nano) would print "Unknown sequence" for every Ctrl-key. Switch to `withInkSuspended` from @hermes/ink, the same helper `/setup` already uses. It pauses Ink, removes stdin listeners, drops raw mode, disables kitty/modifyOtherKeys + mouse + focus reporting, runs the editor, then restores everything with a full repaint.	2026-04-25 19:54:06 -05:00
brooklyn!	489bed6f96	Merge pull request #15478 from yes999zc/fix-deepseek-reasoning-all-assistant-messages fix: DeepSeek/Kimi thinking mode requires reasoning_content on ALL assistant messages	2026-04-25 19:19:33 -05:00
FocusFlow Dev	ad0ac89478	fix: DeepSeek/Kimi thinking mode requires reasoning_content on ALL assistant messages Previously _copy_reasoning_content_for_api only padded reasoning_content when the assistant message had tool_calls. DeepSeek V4 thinking mode requires the field on every assistant turn, including plain text replies without tool_calls. - Remove the 'source_msg.get("tool_calls") and' guard - Update test: plain assistant turns now get padded for DeepSeek/Kimi Fixes #15213	2026-04-26 07:47:13 +08:00
Teknium	dc4d92f131	docs: embed tutorial videos on webhooks + auxiliary models pages (#15809 ) - webhooks.md: adds a Video Tutorial section under the intro with a responsive YouTube iframe (WNYe5mD4fY8). - configuration.md: adds a Video Tutorial subsection under Auxiliary Models with a responsive YouTube iframe (NoF-YajElIM). Both use a 16:9 aspect-ratio wrapper so the embeds scale cleanly on mobile. Verified with `npm run build` — MDX parses clean, no new warnings or broken links introduced.	2026-04-25 16:44:53 -07:00
Teknium	47420a84b9	docs(obliteratus): link YouTube video guide in SKILL.md (#15808 ) Adds a 'Video Guide' section pointing at the walkthrough of a Hermes agent abliterating Gemma with OBLITERATUS, so the agent can surface it when the user wants a visual overview before running the workflow.	2026-04-25 16:30:38 -07:00
brooklyn!	f93d4624bf	Merge pull request #15749 from Zjianru/fix/copy-reasoning-content-ordering-and-cross-provider-isolation fix(agent): ordering fix in _copy_reasoning_content_for_api — cross-provider reasoning isolation	2026-04-25 17:21:49 -05:00
codez	5ae608152e	fix: remove has_reasoning guard — inject empty reasoning_content for DeepSeek/Kimi tool_calls unconditionally	2026-04-26 06:08:54 +08:00
brooklyn!	88b65cc82a	Update run_agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-26 05:49:38 +08:00
brooklyn!	edc78e258c	Merge pull request #15766 from NousResearch/bb/tui-ssh-copy fix(tui): honor client copy shortcut over ssh	2026-04-25 15:33:17 -05:00
Brooklyn Nicholson	31d7f1951a	fix(tui): clamp copied selection bounds Clamp copied selection columns to the screen width before scanning rendered cells.	2026-04-25 15:32:45 -05:00
Brooklyn Nicholson	b1c18e5a41	refactor(tui): format screen imports Keep screen.ts import ordering aligned with the ui-tui formatter.	2026-04-25 15:26:51 -05:00
Brooklyn Nicholson	bd66e55a02	fix(tui): track rendered spaces for selection copy - add a written-cell bitmap so selection can distinguish rendered spaces from blank padding - preserve code indentation without markdown-specific rendering hacks	2026-04-25 15:21:26 -05:00
Brooklyn Nicholson	1735ced93b	fix(tui): preserve code block indentation in selection Render code indentation spaces as selectable cells so copied fenced code keeps its leading whitespace.	2026-04-25 15:17:36 -05:00
Brooklyn Nicholson	bba16943f6	fix(tui): preserve rendered indentation in selections - trim only empty edge rows instead of full selected text - bound selection paint using unwritten cells so rendered indentation remains copyable	2026-04-25 15:14:26 -05:00
Brooklyn Nicholson	132620ba3d	refactor(tui): simplify remote copy hotkey hints Use an explicit conditional table instead of spread casting for SSH copy hint rows.	2026-04-25 15:09:12 -05:00
Brooklyn Nicholson	876bb60044	fix(tui): trim whitespace-only selection chrome - clamp selection highlight to real row content so blank drag margins do not render or copy - keep successful copy actions quiet while preserving usage and failure feedback	2026-04-25 15:07:29 -05:00
Brooklyn Nicholson	a68793b6c4	refactor(tui): share remote shell detection Reuse the platform helper for SSH-aware copy hints so hotkey display and input handling cannot drift.	2026-04-25 14:55:28 -05:00
Brooklyn Nicholson	bcc5362432	fix(tui): honor client copy shortcut over ssh - accept forwarded Cmd+C for selection copy in SSH sessions even when Hermes runs on Linux - keep local Linux Alt+C from acting as copy and update TUI hotkey hints for remote shells	2026-04-25 14:44:39 -05:00
brooklyn!	283c8fd6e2	Merge pull request #15755 from NousResearch/bb/tui-model-flag fix(tui): honor launch model overrides	2026-04-25 14:30:26 -05:00
Brooklyn Nicholson	919274b60e	fix(tui): align overlay q shortcut casing Keep shared overlay close behavior consistent with pager and agents overlays by binding lowercase q only.	2026-04-25 14:26:35 -05:00
Brooklyn Nicholson	6e83d90eb4	refactor(tui): tighten overlay helpers - rename overlay help text component to match its role - share picker window math across model, session, and skills overlays	2026-04-25 14:23:45 -05:00

... 4 5 6 7 8 ...

6194 Commits