The cherry-picked approach serialized the UI-shaped transcript on the Node
side, producing a third JSON format alongside cli.py save_conversation and
tui_gateway session.save. Simpler to call the existing session.save method,
which already writes the canonical agent history (raw OpenAI messages +
model) to an absolute-path file.
- /save still short-circuits before the slash worker
- Empty transcript -> 'no conversation yet'
- No active session -> 'no active session - nothing to save'
- Otherwise: rpc('session.save', {session_id}) and echo back the file path
- Tests updated to assert RPC contract; new test covers the no-sid case
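For reference, a minimal sketch of what session.save is described as writing (the attribute and field names here are assumptions, not the real schema):

```python
# Illustrative sketch only; the real implementation lives in the gateway
# session code, and the 'model'/'messages' field names are assumptions.
import json
from pathlib import Path

def save_session(session, path: Path) -> str:
    """Write the canonical agent history: raw OpenAI messages plus the model."""
    payload = {
        "model": session.model,        # active model identifier
        "messages": session.messages,  # raw OpenAI-format message list
    }
    path = path.expanduser().resolve()              # always an absolute path
    path.write_text(json.dumps(payload, indent=2))
    return str(path)                                # echoed back by /save
```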
The raw-template lookup added in PR #15817 went through
`get_compatible_custom_providers(read_raw_config())`, which calls
`_normalize_custom_provider_entry` → `urlparse(base_url)`. Any
entry whose `base_url` is itself an env-ref (`${NEURALWATT_API_BASE}`)
was dropped as 'not a valid URL', so `api_key_ref` stayed empty and the
resolved secret was still written to `model.api_key` — the exact case
the original Discord report described.
Replace the normalizer-gated lookup with a direct read of
`raw['custom_providers']` and `raw['providers']`, indexed by name
(case-insensitive, optionally qualified by model) so the loaded
(expanded) entry can be matched regardless of how `base_url` is
written.
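A minimal sketch of the broadened lookup, assuming both sections of the raw config are name-keyed dicts (the helper name is hypothetical):

```python
def find_raw_provider_entry(raw: dict, name: str, model: str | None = None) -> dict | None:
    """Hypothetical helper: read provider entries straight from the raw
    (unexpanded) config so '${VAR}' base_urls are never filtered out."""
    wanted = name.lower()
    for section in ("custom_providers", "providers"):
        for key, entry in (raw.get(section) or {}).items():
            if key.lower() != wanted:
                continue
            # Optional model qualification: skip entries that list models
            # but do not include the requested one.
            if model and model not in (entry.get("models") or [model]):
                continue
            return entry  # may still contain '${NEURALWATT_API_BASE}' verbatim
    return None
```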
Add an integration regression test driving the real
`select_provider_and_model` entry point with the Discord-reported
NeuralWatt config (`${VAR}` in both `base_url` and `api_key`).
This test fails on the PR-only fix and passes with the broadened
lookup.
When switching models on a custom endpoint (ollama-launch):
- Same-provider switches no longer re-resolve credentials (fixes base_url
being lost for 'custom' provider on subsequent switches)
- Named providers (ollama-launch) are resolved via user_providers so
switch_model can find their base_url from config
- Models not in the /v1/models probe but present in the user's saved
provider config are accepted with a warning instead of rejected
- CLI /model and TUI /model both pass user_providers/custom_providers
to switch_model so the config model list is available for validation
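A rough sketch of the relaxed validation described in the bullets above (the probe and config parameter shapes are assumptions):

```python
import logging

logger = logging.getLogger(__name__)

def model_allowed(model: str, probed_models: set[str], user_providers: dict) -> bool:
    """Hypothetical sketch: accept a model missing from the /v1/models probe
    when the user's saved provider config still lists it."""
    if model in probed_models:
        return True
    in_config = any(model in (p.get("models") or []) for p in user_providers.values())
    if in_config:
        logger.warning("%s not reported by /v1/models; trusting provider config", model)
        return True
    return False
```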
Closes #15088
Previously _copy_reasoning_content_for_api only padded reasoning_content
when the assistant message had tool_calls. DeepSeek V4 thinking mode
requires the field on every assistant turn, including plain text replies
without tool_calls.
- Remove the 'source_msg.get("tool_calls") and' guard
- Update test: plain assistant turns now get padded for DeepSeek/Kimi
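A simplified before/after sketch of the guard (only the relevant branch is shown; padding with an empty string is an assumption about what "padded" means here):

```python
def _copy_reasoning_content_for_api(api_msg: dict, source_msg: dict) -> None:
    """Simplified sketch of the change described above."""
    if api_msg.get("role") != "assistant":
        return
    # Before: padding only happened when the turn carried tool calls:
    #   if source_msg.get("tool_calls") and "reasoning_content" not in api_msg: ...
    # After: DeepSeek V4 thinking mode wants the field on every assistant turn,
    # including plain text replies, so the tool_calls guard is dropped.
    if "reasoning_content" not in api_msg:
        api_msg["reasoning_content"] = source_msg.get("reasoning_content", "")
```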
Fixes #15213
- webhooks.md: adds a Video Tutorial section under the intro with a
responsive YouTube iframe (WNYe5mD4fY8).
- configuration.md: adds a Video Tutorial subsection under Auxiliary
Models with a responsive YouTube iframe (NoF-YajElIM).
Both use a 16:9 aspect-ratio wrapper so the embeds scale cleanly on
mobile. Verified with `npm run build` — MDX parses clean, no new
warnings or broken links introduced.
Adds a 'Video Guide' section pointing at the walkthrough of a Hermes agent
abliterating Gemma with OBLITERATUS, so the agent can surface it when the
user wants a visual overview before running the workflow.
- add a written-cell bitmap so selection can distinguish rendered spaces from blank padding (see the sketch after this list)
- preserve code indentation without markdown-specific rendering hacks
- clamp selection highlight to real row content so blank drag margins do not render or copy
- keep successful copy actions quiet while preserving usage and failure feedback
- accept forwarded Cmd+C for selection copy in SSH sessions even when Hermes runs on Linux
- keep local Linux Alt+C from acting as copy and update TUI hotkey hints for remote shells
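The actual change is TypeScript in hermes-ink; as a language-neutral illustration, a small Python sketch of the written-cell bitmap and the selection clamp:

```python
class WrittenCells:
    """Illustrative sketch: track which cells were actually rendered so
    selection can tell real spaces from blank padding."""

    def __init__(self, rows: int, cols: int) -> None:
        self.bits = [bytearray(cols) for _ in range(rows)]

    def mark(self, row: int, col: int) -> None:
        self.bits[row][col] = 1  # cell received a real glyph (even a space)

    def clamp_selection(self, row: int, start: int, end: int) -> tuple[int, int]:
        """Shrink a drag range on one row to the written span, so blank
        drag margins are neither highlighted nor copied."""
        bits = self.bits[row]
        while start < end and not bits[start]:
            start += 1
        while end > start and not bits[end - 1]:
            end -= 1
        return start, end
```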
- add reusable overlay key and help-text helpers for picker-style overlays
- make model, session, skills, and pager hints consistently support Esc/q close behavior
- expand short model aliases like sonnet/opus via static catalogs during runtime resolution at startup (see the sketch after this list)
- keep startup alias resolution network-free and add regression tests in the models and tui gateway suites
- run the requested ui-tui lint+format pass and include resulting formatting updates
- guard text-measure cache eviction key in hermes-ink so ui-tui type-check stays green
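A sketch of the network-free alias expansion (the catalog contents below are hypothetical placeholders):

```python
# Hypothetical entries; the real aliases live in the static model catalogs.
MODEL_ALIASES = {
    "sonnet": "claude-sonnet-4-5",
    "opus": "claude-opus-4-1",
}

def expand_model_alias(name: str) -> str:
    """Resolve short aliases at startup without any network calls."""
    return MODEL_ALIASES.get(name.lower(), name)
```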
The Codex Responses API rejects input_text inside assistant messages —
only output_text and refusal are valid content types for assistant role.
_chat_content_to_responses_parts() previously hardcoded all text content
to input_text regardless of the message role. When an assistant message
had list-format content (multimodal or structured), this produced invalid
input_text parts that the API rejected with:
Invalid value: 'input_text'. Supported values are: 'output_text' and 'refusal'.
Fix: add a role parameter to _chat_content_to_responses_parts() that
selects output_text for assistant messages and input_text for user
messages. Thread this through _chat_messages_to_responses_input() and
_preflight_codex_input_items().
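A condensed sketch of the role-aware mapping (function name from the description above; the body is heavily simplified):

```python
def _chat_content_to_responses_parts(content, role: str) -> list[dict]:
    """Simplified sketch: pick the text part type by message role."""
    text_type = "output_text" if role == "assistant" else "input_text"
    items = content if isinstance(content, list) else [{"type": "text", "text": content}]
    parts = []
    for item in items:
        if item.get("type") == "text":
            parts.append({"type": text_type, "text": item.get("text", "")})
        else:
            parts.append(item)  # non-text parts pass through unchanged
    return parts
```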
Fixes #15687
When a user sends /stop during a streaming API call, the outer poll loop
detects _interrupt_requested and closes the HTTP connection. However, the
inner _call() thread catches the connection error and enters its retry
loop — opening a FRESH connection without checking the interrupt flag.
On slow providers like ollama-cloud, each retry attempt blocks for the
full stream-read timeout (120s+). With 3 retry attempts this caused
510+ second delays between /stop and actual response — the agent appeared
completely unresponsive despite the stop being acknowledged.
Fix: add an _interrupt_requested check at the top of the streaming retry
loop so the agent exits immediately instead of retrying.
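A simplified sketch of the retry loop with the added check (names other than _interrupt_requested are illustrative):

```python
import time

def _stream_with_retries(self, request, max_attempts: int = 3):
    """Illustrative sketch: bail out of the streaming retry loop as soon as
    /stop has been requested instead of reopening the connection."""
    for attempt in range(max_attempts):
        if self._interrupt_requested:          # new: honor /stop before retrying
            raise InterruptedError("stream aborted by /stop")
        try:
            return self._open_stream(request)  # may block up to the read timeout
        except ConnectionError:
            if attempt == max_attempts - 1:
                raise
            time.sleep(2 ** attempt)           # backoff before the next attempt
```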
Also fix log truncation: all session key logging in gateway/run.py used
[:20] or [:30] slices, which truncated 'agent:main:telegram:dm:5690190437'
(33 chars) to 'agent:main:telegram:' — losing the identifying chat type
and user ID. Replace with full keys to make logs debuggable.
Reported by user Sidharth Pulipaka via Telegram on ollama-cloud provider.
The post-graceful-drain is-active poll used a fixed 10s timeout, but
systemd's hermes-gateway.service has RestartSec=30 — so systemd won't
respawn the unit for 30s after exit-75, and our poll gives up during
the cooldown. Result: every 'hermes update' printed
⚠ hermes-gateway drained but didn't relaunch — forcing restart
followed by a redundant 'systemctl restart' that kicked the newly-
respawning gateway again (and re-started WhatsApp / Discord a second
time in the process).
Fix: read RestartUSec from the unit via 'systemctl show' and set the
poll budget to max(10s, RestartSec + 10s slack). Units without
RestartSec set (or value=infinity) fall back to the original 10s.
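A sketch of the budget calculation, assuming a small hand-rolled parser for systemd's time spans (helper names are hypothetical):

```python
import re
import subprocess

_UNITS = {"us": 1e-6, "ms": 1e-3, "s": 1.0, "min": 60.0, "h": 3600.0}

def parse_systemd_span(value: str) -> float | None:
    """Parse spans like '30s', '100ms', '1min 30s'; None for '', 'infinity', garbage."""
    if not value or value == "infinity":
        return None
    matches = re.findall(r"([\d.]+)\s*(us|ms|s|min|h)", value)
    return sum(float(n) * _UNITS[u] for n, u in matches) if matches else None

def poll_budget(unit: str = "hermes-gateway.service") -> float:
    out = subprocess.run(
        ["systemctl", "show", unit, "--property=RestartUSec", "--value"],
        capture_output=True, text=True,
    ).stdout.strip()
    restart_sec = parse_systemd_span(out)
    return max(10.0, restart_sec + 10.0) if restart_sec is not None else 10.0
```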
Observed timeline from journalctl before fix:
08:56:22.262 old PID exits 75
08:56:32.707 systemd logs Stopped -> Started (10.4s gap, > 10s budget)
After fix the poll covers 40s — comfortably inside RestartSec + slack.
Validation:
- RestartUSec parser tested against '30s', '100ms', '1min 30s',
'infinity', '', 'garbage', '500us', '2min' — all correct.
- Against the live hermes-gateway.service: parses to 30.0s.
- tests/hermes_cli/test_update_gateway_restart.py: 41/41 pass.
Makes hermes -z usable by sweeper without mutating user config.
- Top-level -m/--model and --provider flags that apply to -z/--oneshot
(mirrors hermes chat's plumbing).
- HERMES_INFERENCE_MODEL env var as the parallel to HERMES_INFERENCE_PROVIDER
for CI / scripted invocations.
- resolve_runtime_provider() gets the requested provider; when --model is
given without --provider, detect_provider_for_model() auto-selects the
provider that serves it (same semantics as /model in an interactive session).
- --provider without --model errors out with exit 2 — carrying a config
model across to a different provider is usually wrong, and silently
picking the provider's catalog default hides the mismatch.
Config defaults still used when both flags are omitted (existing behavior).
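A condensed sketch of the precedence (detect_provider_for_model is the function named above; the config attribute names are placeholders):

```python
import os
import sys

def resolve_oneshot_model(args, config, detect_provider_for_model):
    """Sketch of the precedence described above: flags > env vars > config defaults."""
    model = args.model or os.environ.get("HERMES_INFERENCE_MODEL")
    provider = args.provider or os.environ.get("HERMES_INFERENCE_PROVIDER")

    if provider and not model:
        # Carrying the config model across to a different provider is usually wrong.
        print("error: --provider requires --model", file=sys.stderr)
        sys.exit(2)
    if model and not provider:
        provider = detect_provider_for_model(model)  # same semantics as /model
    if not model:
        model, provider = config.default_model, config.default_provider
    return model, provider
```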
Validation (all live against OpenRouter):
-z 'x' -> uses config default (opus-4.7)
-z 'x' --model haiku-4.5 -> haiku-4.5 via auto-detected openrouter
-z 'x' --model ... --provider ... -> the pair is used as given
HERMES_INFERENCE_MODEL=... -z -> haiku-4.5 via env var
-z 'x' --provider anthropic -> exits 2 with an error to stderr
* feat: add `hermes -z <prompt>` one-shot mode
Top-level flag that runs a single prompt and prints ONLY the final
response text to stdout. No banner, no spinner, no tool previews, no
session_id line — stdout is machine-readable, stderr is silent.
Tools, memory, rules, and AGENTS.md in the CWD are loaded as normal.
Approvals are auto-bypassed (sets HERMES_YOLO_MODE=1 for the call).
Bypasses cli.py entirely — goes straight to AIAgent.chat().
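A rough sketch of that path, assuming AIAgent.chat() returns the final response text as a string:

```python
import os
import sys

def run_oneshot(prompt: str) -> int:
    """Illustrative sketch: print only the final response text to stdout."""
    os.environ["HERMES_YOLO_MODE"] = "1"  # auto-bypass approvals for this call
    agent = AIAgent()                     # import path omitted; tools, memory,
                                          # rules, and AGENTS.md load as usual
    reply = agent.chat(prompt)            # no banner, spinner, or tool previews
    sys.stdout.write(reply if reply.endswith("\n") else reply + "\n")
    return 0                              # stderr stays silent on success
```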
* feat(oneshot): handle interactive-callback gaps explicitly
Document (and where needed, patch) the interactive surfaces that have
no user to answer in oneshot mode:
- clarify — inject a callback that tells the agent to pick the best
  default and continue (previously this returned a generic 'not
  available in this execution context' error that wastes a tool call);
  a sketch follows at the end of this entry
- sudo password — terminal_tool already gates on HERMES_INTERACTIVE
  (we don't set it); sudo fails gracefully
- shell hooks — HERMES_ACCEPT_HOOKS=1 auto-approves; also falls back
  to deny on non-tty stdin
- dangerous cmd — HERMES_YOLO_MODE=1 short-circuits before input()
- secret capture — tool returns gracefully when no callback is wired
Live-tested: the agent asked clarify(['red','blue']), got 'red' back, and
replied with only 'red'.
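A sketch of the injected callback, assuming the clarify tool hands it the question and option list:

```python
def oneshot_clarify_callback(question: str, options: list[str] | None = None) -> str:
    """Illustrative sketch: with no user present, steer the agent toward picking
    a sensible default itself instead of burning a tool call on an error."""
    if options:
        return (
            f"No user is available to answer {question!r} in one-shot mode. "
            f"Pick the best default from {options} and continue."
        )
    return "No user is available in one-shot mode; choose a reasonable default and continue."
```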
The AIAgent.flush_memories pre-compression save, the gateway
_flush_memories_for_session, and everything feeding them are
obsolete now that the background memory/skill review handles
persistent memory extraction.
Problems with flush_memories:
- Pre-dates the background review loop. It was the only memory-save
path when introduced; the background review now fires every 10 user
turns on CLI and gateway alike, far more often than compression or
session reset ever triggered a flush.
- Blocking and synchronous. Pre-compression flush ran on the live agent
before compression, blocking the user-visible response.
- Cache-breaking. Flush built a temporary conversation prefix
(system prompt + memory-only tool list) that diverged from the live
conversation's cached prefix, invalidating prompt caching. The
gateway variant spawned a fresh AIAgent with its own clean prompt
for each finalized session — still cache-breaking, just in a
different process.
- Redundant. Background review runs in the live conversation's
session context, gets the same content, writes to the same memory
store, and doesn't break the cache. Everything flush_memories
claimed to preserve is already covered.
What this removes:
- AIAgent.flush_memories() method (~248 LOC in run_agent.py)
- Pre-compression flush call in _compress_context
- flush_memories call sites in cli.py (/new + exit)
- GatewayRunner._flush_memories_for_session + _async_flush_memories
(and the 3 call sites: session expiry watcher, /new, /resume)
- 'flush_memories' entry from DEFAULT_CONFIG auxiliary tasks,
hermes tools UI task list, auxiliary_client docstrings
- _memory_flush_min_turns config + init
- #15631's headroom-deduction math in
_check_compression_model_feasibility (headroom was only needed
because flush dragged the full main-agent system prompt along;
the compression summariser sends a single user-role prompt so
new_threshold = aux_context is safe again)
- The dedicated test files and assertions that exercised
flush-specific paths
What this renames (with read-time backcompat on sessions.json):
- SessionEntry.memory_flushed -> SessionEntry.expiry_finalized.
The session-expiry watcher still uses the flag to avoid re-running
finalize/eviction on the same expired session; the new name
reflects what it now actually gates. from_dict() reads
'expiry_finalized' first, falls back to the legacy 'memory_flushed'
key so existing sessions.json files upgrade seamlessly.
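A sketch of the read-time backcompat (class shape reduced to the one flag):

```python
from dataclasses import dataclass

@dataclass
class SessionEntry:
    expiry_finalized: bool = False  # other fields elided in this sketch

    @classmethod
    def from_dict(cls, data: dict) -> "SessionEntry":
        # Prefer the new key; fall back to the legacy 'memory_flushed' flag so
        # existing sessions.json files upgrade seamlessly.
        finalized = data.get("expiry_finalized", data.get("memory_flushed", False))
        return cls(expiry_finalized=bool(finalized))
```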
Supersedes #15631 and #15638.
Tested: 383 targeted tests pass across run_agent/, agent/, cli/,
and gateway/ session-boundary suites. No behavior regressions —
background memory review continues to handle persistent memory
extraction on both CLI and gateway.
_check_compression_model_feasibility calls get_model_context_length
without provider=, so Codex OAuth users get 1,050,000 (from models.dev
for 'openai') instead of the actual 272,000 limit. This happens because
_infer_provider_from_url maps chatgpt.com → 'openai' (not 'openai-codex'),
skipping the Codex-specific resolution branch entirely.
Result: compression threshold set at 85% of 1.05M = 892K — conversations
never trigger compression, the context grows unbounded, and when gateway
hygiene eventually forces compression, the Codex endpoint drops the
oversized streaming request ('peer closed connection without sending
complete message body').
Fix: forward self.provider to get_model_context_length so provider-
specific resolution branches (Codex OAuth 272K, Copilot live /models,
Nous suffix-match) fire correctly.
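The fix itself is a one-argument change; a simplified sketch of the call site:

```python
def _check_compression_model_feasibility(self) -> int:
    """Simplified sketch of the corrected call (method body reduced)."""
    # Before: get_model_context_length(self.model) resolved via models.dev's
    # generic 'openai' entry and returned 1,050,000 for Codex OAuth users.
    # After: forwarding the provider lets the Codex-specific branch return 272,000.
    ctx = get_model_context_length(self.model, provider=self.provider)
    return int(ctx * 0.85)  # compression threshold
```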
Reported by user on GPT 5.5 via Codex OAuth Pro (paste.rs/vsra3).
Follow-up to PR #15658. The feature PR introduced page-scoped slots
(<page>:top / <page>:bottom inside every built-in page) but only
touched the Shell slots catalogue. Adds proper narrative coverage so
plugin authors find the feature.
Changes
- extending-the-dashboard.md:
- Frontmatter description + intro bullet now mention page-scoped slots
- New TOC entry "Augmenting built-in pages (page-scoped slots)"
- New dedicated subsection after "Replacing built-in pages"
explaining the heavy-vs-light tradeoff, listing the pages that
expose slots, and showing a worked manifest + IIFE example with
tab.hidden: true
- Cross-link from the tab.override section pointing readers to the
lighter augmentation option
- web-dashboard.md:
- Bullet mentioning "page-scoped slots (inject widgets into
built-in pages without overriding them)"
Validation
- TOC anchor "#augmenting-built-in-pages-page-scoped-slots" matches
the generated heading slug
- Code fences balanced (64, even)
- Pre-existing docusaurus build errors (skills.json, api-server.md
link) reproduce on bare main -- not introduced here