hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-28 06:51:16 +08:00

Author	SHA1	Message	Date
Brooklyn Nicholson	cd7a200e6c	perf(tui): instrument scroll fast-path decline reasons Adds scrollFastPathStats counters to render-node-to-output.ts: captures every time a ScrollBox's DECSTBM scroll hint is generated, records whether the fast path took it (blit+shift from prevScreen) or declined, and why. Exposed through hermes-ink's public exports and snapshotted on every FrameEvent so the profiler harness can correlate decline reasons with the actual patch/renderer cost per frame. This is pure observation — no behaviour change. Preparing for the virtual-history rewrite: the hypothesis was that our topSpacer/ bottomSpacer scheme disqualifies every scroll via heightDelta mismatch, but the data shows the fast path is actually taken on most scrolls (19/23 over a 6s PageUp hold through 1100 messages) — the remaining steady-state renderer cost is Yoga tree traversal, not the per-frame full redraw I initially suspected. Declines that do happen correlate with React commits that changed the mounted range mid-scroll (heightDelta=±3 to ±35). Those are the rarer cases the virtualization rewrite still needs to address. No test diffs — instrumentation-only. Build verified: `tsc --noEmit` plus the full `npm run build` compiler post-pass pass cleanly.	2026-04-26 16:45:53 -05:00
Brooklyn Nicholson	71eee26640	perf(tui): full-pipeline instrumentation + profiling harness Extends HERMES_DEV_PERF to capture the complete render pipeline, not just React commits. Adds scripts/profile-tui.py to drive repeatable hold-PageUp stress tests against a real long session. perfPane.tsx: Wires ink's onFrame callback (already plumbed through the fork) into the same perf.log as the React.Profiler samples. Captures per-phase timing (yoga calculateLayout, renderNodeToOutput, screen diff, patch optimize, stdout write) plus yoga counters (visited/measured/cache- Hits/live) and patch counts per frame. Events are tagged {src: 'react'\|'frame'} so jq can split them. logFrameEvent is undefined when HERMES_DEV_PERF is unset, so ink doesn't even attach the callback. entry.tsx: Passes logFrameEvent into render(). types/hermes-ink.d.ts: Declares FrameEvent + onFrame on RenderOptions so the ui-tui side type-checks against the plumbed-through ink option. scripts/profile-tui.py: New harness. Launches the built TUI under a PTY with the longest session in state.db resumed, holds PageUp/PageDown/etc at a configurable Hz for N seconds, then parses perf.log and prints per-phase p50/p95/p99/max plus yoga-counter summaries. Zero deps beyond stdlib. Exit 2 if nothing was captured (wiring broken). Initial findings (1106-msg session, 6s PageUp hold at 30Hz): - Steady state: 10 fps; renderer phase p99=63ms, write p99=0.2ms - 4/107 heavy frames (>=16ms), all dominated by renderNodeToOutput - One pathological 97ms frame with yoga measuring 70,415 text cells and Yoga visiting 225k nodes — the cold-unmeasured-region hit - Ink's scroll fast-path (DECSTBM blit from prevScreen) is disqualified because our spacer-based virtual history doesn't keep heightDelta in sync with scroll.delta, so every PageUp step falls through to a full 2000-4800 patch re-render instead of ~40	2026-04-26 16:36:25 -05:00
Brooklyn Nicholson	69ff201050	feat(tui): anchor todo panel above streaming output	2026-04-26 16:26:50 -05:00
Brooklyn Nicholson	2259eac49e	feat(tui): collapse completed todo panel on turn end	2026-04-26 16:24:15 -05:00
Brooklyn Nicholson	cb7cfba6de	fix(cli): surface last_active in search_sessions so -c works	2026-04-26 16:21:57 -05:00
Brooklyn Nicholson	debae25f1c	perf(tui): incremental markdown during streaming Split in-flight assistant text at the last stable block boundary so only the unclosed tail re-tokenizes per stream delta. Previously the full text was rendered as plain <Text> during streaming and only flipped to <Md> at message.complete — cheap per delta but loses live markdown formatting. New StreamingMd component holds a monotonically-growing stablePrefix in a ref (idempotent under StrictMode double-render), renders it as one <Md> that memoizes across deltas, and renders the unstable suffix as a second <Md> that re-parses on each delta. Cost per delta drops from O(total length) to O(unstable length). findStableBoundary walks back to the last "\n\n" outside an open fenced code block — splitting inside an open fence would orphan the opener and break highlighting in the prefix. Adapted from claude-code's src/components/Markdown.tsx:186 but built on our line-based tokenizer instead of marked.lexer. 9 new tests cover fence balance, boundary walk, and empty input. Part of the --tui perf audit (see audit #7).	2026-04-26 16:21:34 -05:00
Brooklyn Nicholson	bde89c169b	fix(cli): -c picks the most recently used session	2026-04-26 16:17:39 -05:00
Brooklyn Nicholson	b36007b246	feat(tui): allow collapsing archived todo panels	2026-04-26 16:15:59 -05:00
Brooklyn Nicholson	c78b528125	feat(tui): archive todos at turn end with incomplete hint	2026-04-26 16:14:58 -05:00
Brooklyn Nicholson	319c1c1691	fix(tui): inline todo in transcript, group across thinking	2026-04-26 16:09:28 -05:00
Brooklyn Nicholson	4943ea2a7c	fix(tui): merge tools into contextual shelves	2026-04-26 16:00:38 -05:00
Brooklyn Nicholson	4d3e3a738d	chore(tui): sort imports	2026-04-26 15:56:47 -05:00
Brooklyn Nicholson	a5319fb7af	test(tui): cover live todo completion flow	2026-04-26 15:56:08 -05:00
Brooklyn Nicholson	f5552f92e2	fix(tui): stabilize live todo progress	2026-04-26 15:55:38 -05:00
Brooklyn Nicholson	1566f1eecc	fix(tui): report actual session on exit	2026-04-26 15:55:01 -05:00
Brooklyn Nicholson	a30db69dd5	chore(tui): clean live progress lint	2026-04-26 15:42:07 -05:00
Brooklyn Nicholson	f6846205cc	fix(tui): isolate turn state from app render	2026-04-26 15:40:38 -05:00
Brooklyn Nicholson	6a3873942f	fix(tui): format thinking paragraphs	2026-04-26 15:38:18 -05:00
Brooklyn Nicholson	64de685d3f	test(tui): remove stale turn freeze experiment	2026-04-26 15:35:41 -05:00
Brooklyn Nicholson	cee4036e8b	fix(tui): merge tool shelves in transcript	2026-04-26 15:35:38 -05:00
Brooklyn Nicholson	cf8439263a	fix(tui): keep todo pinned outside transcript	2026-04-26 15:33:01 -05:00
Brooklyn Nicholson	3271ffbd80	fix(tui): pin todo panel above live output	2026-04-26 15:27:31 -05:00
Brooklyn Nicholson	a7831b63db	fix(tui): stabilize live progress rendering	2026-04-26 15:23:43 -05:00
Brooklyn Nicholson	d4dde6b5f2	fix(tui): restore resumed transcript lineage	2026-04-26 15:16:12 -05:00
Teknium	755a280424	chore(release): map Wang-tianhao in AUTHOR_MAP	2026-04-26 13:02:51 -07:00
Wang-tianhao	6087e04043	fix(slack): extract rich_text quotes/lists and link unfurl previews Slack's modern composer sends messages with a 'blocks' array that contains rich_text elements. When a user forwards or quotes another message, the quoted content shows up in the rich_text_quote children of that array — and is NOT included in the plain 'text' field. The agent saw only the lossy plain text and was blind to forwarded / quoted content. Same story for link unfurl previews (Notion, docs, GitHub, etc.) which Slack puts in the 'attachments' array. Two fixes in the inbound handler: 1. _extract_text_from_slack_blocks walks rich_text / rich_text_quote / rich_text_list / rich_text_preformatted trees and renders readable text ('> quoted', '• bullet', code fences), dedupes against the plain text field, and appends the extracted content so the agent sees everything. 2. Link unfurl / attachment preview extraction reads title, url, body, and footer from the 'attachments' array and appends a '📎 [title](url)\n body\n _footer_' section per preview. Skips is_msg_unfurl to avoid echoing our own Slack replies back. Routing is careful not to trust augmented text: mention gating (is_mentioned) and slash-command detection both run against the original 'text' field, so forwarded content containing '<@bot>' or '/deploy' in a quote can't trick the bot into responding in a channel it shouldn't or classifying a normal message as a command. Adjustment from original PR: dropped _serialize_slack_blocks_for_agent, which inlined a redacted JSON dump of non-rich_text blocks (section, accessory, actions, etc.) — the agent would see the raw Block Kit structure for UI-heavy alerts. It added up to 6000 characters to the prompt context on every qualifying message with no opt-out. The rich_text extraction and attachment unfurls cover the common bug-fix case (quoted/forwarded content + link previews) without the prefill tax. If a user needs block inspection later, it can return as a config opt-in. Also updates the Slack platform notes in session.py to accurately describe what the gateway inlines.	2026-04-26 13:02:51 -07:00
Teknium	4921b26945	fix(cron): keep homeassistant toolset enabled when HASS_TOKEN is set (#16208 ) After #14798 made cron honor per-platform `hermes tools` config, the `_DEFAULT_OFF_TOOLSETS` filter silently stripped `homeassistant` from cron jobs for users who'd been relying on the previous blanket toolset. Norbert's HA cron reports regressed as a result. The HA toolset is already runtime-gated by its `check_fn` (requires HASS_TOKEN to register any tools). When HASS_TOKEN is set the user has explicitly opted in — `_DEFAULT_OFF_TOOLSETS` adds nothing in that case, so stop double-gating and restore HA for cron / cli / other platforms without an explicit saved toolset list. moa and rl stay off by default (original #14798 goal preserved). Fixes HA cron regression reported by Norbert.	2026-04-26 12:55:58 -07:00
Teknium	822b507a72	chore(release): map maxims-oss in AUTHOR_MAP	2026-04-26 12:54:46 -07:00
maxims-oss	18beb69b49	fix(memory): close embedded Hindsight async client cleanly HindsightEmbedded.close() delegates to its sync client.close(). When Hermes created/used that client on the shared async loop, closing it from the main thread raises 'attached to a different loop' before aiohttp releases the session — so the ClientSession / TCPConnector leak past provider teardown. Close the embedded inner async client on the shared loop first via _run_sync(inner_client.aclose()), then let the wrapper's sync close() do its daemon/UI bookkeeping. Salvage of #14605: test placement rebased — appended TestShutdown class after TestSharedEventLoopLifecycle (which landed on main after the PR was written). Original author attribution preserved.	2026-04-26 12:54:46 -07:00
Tranquil-Flow	bf05b8f4a2	fix(gateway): clean up cached agents on shutdown (#11205 )	2026-04-26 12:51:53 -07:00
Zainan Victor Zhou	778fd1898e	fix(slack): surface attachment access diagnostics Translate Slack attachment failures into actionable user-facing notices instead of generic download errors. When a scope/auth/permission issue breaks attachment processing, the user sees: [Slack attachment notice] - Slack attachment access failed for photo.jpg. Missing scope: files:read. Update the Slack app scopes/settings and reinstall the app to the workspace. Two helpers do the translation: _describe_slack_api_error — handles SlackApiError responses (missing_scope, invalid_auth, file_not_found, access_denied, etc.) _describe_slack_download_failure — handles httpx.HTTPStatusError (401/403/404) and Slack-returns-HTML-sign-in fallbacks Wired into three existing call sites: - the Slack Connect files.info path (PR #11111) so scope errors surface instead of being logged as generic "files.info failed" - the image, audio, and document download paths so 401/403 and HTML-body responses translate into actionable notices Adjustment from original PR: dropped _probe_slack_file_access_issue, the proactive pre-download files.info probe. It added one extra Slack API call per attachment even on healthy ones, and overlapped with the existing files.info call from PR #11111. The post-failure translation path covers the same user-facing diagnostic value without the per-message tax. Also documents files:read scope more prominently in the Slack setup guide and troubleshooting table. Contributed back from https://github.com/xinbenlv/zn-hermes-agent. Closes #7015. Co-authored-by: xinbenlv <zzn+pa@zzn.im>	2026-04-26 12:47:43 -07:00
Teknium	45bfcb9e71	test: update bare-agent helper for live-runtime attrs added by #16099 Background review fork now inherits session_id, credential_pool, and status_callback from the parent (added in #16099 after this PR was written). Extend the bare-agent helper so the regression test keeps reaching the cleanup assertions instead of failing in the runtime resolver. Signed-off-by: Teknium <8425893+teknium1@users.noreply.github.com>	2026-04-26 12:45:39 -07:00
MRHwick	aa7b5acfcd	pass attribution check	2026-04-26 12:45:39 -07:00
MRHwick	36e352afa7	preserve the original comment	2026-04-26 12:45:39 -07:00
MRHwick	2d86e97a7e	fix(run_agent): shut down background review memory providers Temporary background review agents can initialize Hindsight-backed memory clients, but close() alone skips provider teardown. Shut the memory provider down before closing so aiohttp sessions do not leak at process exit. Made-with: Cursor	2026-04-26 12:45:39 -07:00
Teknium	edadeaf495	chore(release): map Satoshi-agi and kunlabs in AUTHOR_MAP	2026-04-26 12:35:16 -07:00
kunlabs	f9885130b4	fix(slack): download files in Slack Connect channels Slack Connect channels return file objects with file_access="check_file_info" and no url_private_download field (see https://docs.slack.dev/reference/objects/file-object/#slack_connect_files). These stub objects must be resolved via files.info before download can proceed. Without this the agent silently skips attachments posted in Slack Connect channels. Call files.info on every file whose file_access is check_file_info, replace the stub with the full file object, and let the existing download path continue. Warn and skip on files.info failures. Closes #11095.	2026-04-26 12:35:16 -07:00
flobo3	f414df3a56	fix(slack): include team_id in thread-context cache key	2026-04-26 12:35:16 -07:00
Satoshi-agi	c0d25df311	fix(slack): preserve thread-parent context when cron/bot posted the parent The Slack thread-context fetcher used to drop every message with a bot_id, which silently erased the thread parent whenever a cron job (or any other bot) had posted it. As a result, replies to a cron-posted summary lost all context and the agent answered as if from a blank thread. Changes: 1. gateway/platforms/slack.py::_fetch_thread_context - Keep the thread parent even when it was posted by a bot (e.g. cron summaries, third-party integrations). - Only skip our own prior bot replies to avoid circular context, matching the per-workspace bot user id via _team_bot_user_ids so multi-workspace deployments stay correct. - Keep non-self bot children (useful third-party context). 2. gateway/platforms/slack.py::_handle_slack_message - Populate MessageEvent.reply_to_text for thread replies (parity with Telegram/Discord/Feishu/WeCom). gateway.run uses this field to inject a [Replying to: "..."] prefix when the parent is not already in the session history, which is exactly the scenario triggered by cron-generated thread parents. - New helper _fetch_thread_parent_text reuses the existing thread- context cache (and its 60s TTL) to avoid duplicate conversations.replies calls; falls back to a cheap limit=1 fetch when the cache is cold. Tests: - Updated TestSlackThreadContext::test_skips_bot_messages to reflect the new behaviour (self-bot child dropped, third-party bot kept). - Added: * test_fetch_thread_context_includes_bot_parent * test_fetch_thread_context_excludes_self_bot_replies * test_fetch_thread_context_multi_workspace * test_fetch_thread_context_current_ts_excluded (regression guard) * test_fetch_thread_parent_text_from_cache * test_slack_reply_to_text_set_on_thread_reply * test_slack_reply_to_text_none_for_top_level_message Full Slack suite: 176 passed (was 169).	2026-04-26 12:35:16 -07:00
helix4u	10e36188da	fix(cli): wire approvals in background tasks	2026-04-26 12:29:48 -07:00
Teknium	6a3102f9d4	chore(release): map hhuang91 in AUTHOR_MAP	2026-04-26 12:29:02 -07:00
bde3249023	75d3eaa0e4	fix(slack): exclude U/W user IDs from explicit target regex Slack's chat.postMessage API rejects user IDs (U...) and workspace IDs (W...) — they are not valid conversation IDs. Posting to them fails because the API requires a channel ID (C/G/D). To DM a user, the sender must first call conversations.open to obtain a D... ID. Tighten _SLACK_TARGET_RE from [CGDUW] to [CGD] so the send path rejects U/W values as explicit targets and instead falls through to channel- name resolution (where they'll fail with a clear 'could not resolve' error rather than silently getting stuck in a retry loop on the API). Flip the corresponding regression test to assert U/W values are not explicit. Matches the narrower regex briandevans proposed in #15939. Co-authored-by: briandevans <brian@bde.io>	2026-04-26 12:29:02 -07:00
hhuang91	802c7acb81	fix(Slack): resolve Slack channels by raw ID and enumerate joined channels send_message(target='slack:<channel_id>') failed with "Could not resolve" because _parse_target_ref had no Slack branch — Slack's uppercase alphanumeric IDs fell through to channel-name resolution, which only matched by name. As a fallback, the agent would retry with bare target='slack' and post to the home channel instead. Three fixes: - _parse_target_ref recognizes Slack IDs (C/G/D/U/W prefix) as explicit targets so the name-resolver is bypassed entirely. - resolve_channel_name tries a case-sensitive raw-ID match before the existing name match, so any platform's IDs resolve cleanly. - _build_slack now actually calls users.conversations against each workspace's AsyncWebClient (paginated), instead of only returning session-history entries. This populates the directory with public and private channels the bot has joined, so action='list' shows them and they can also be addressed by name. Errors from one workspace don't block others. build_channel_directory becomes async (Slack web calls require it). The two async-context callers in gateway/run.py are awaited; the cron ticker thread call bridges via asyncio.run_coroutine_threadsafe. Slack bot needs channels:read and groups:read scopes for full enumeration; missing scopes degrade gracefully per-workspace. addressing #15927	2026-04-26 12:29:02 -07:00
Teknium	541cd732e8	chore(models): drop deepseek from OpenRouter and Nous Portal curated picker lists (#16197 ) Removes deepseek/deepseek-v4-pro and deepseek/deepseek-v4-flash from OPENROUTER_MODELS and _PROVIDER_MODELS['nous'], then regenerates website/static/api/model-catalog.json so the hosted picker JSON drops them too. Direct-API deepseek provider support is unchanged.	2026-04-26 12:28:17 -07:00
Teknium	4d119bb62a	test: blank platform-gating env vars in hermetic fixture load_gateway_config() has a side effect: when config.yaml contains platform-gating keys (slack.require_mention, slack.strict_mention, slack.free_response_channels, slack.allow_bots, slack.reactions, plus analogous keys for discord/telegram/whatsapp/dingtalk/matrix), it calls os.environ[KEY] = ... to bridge them to env-var form. monkeypatch.delenv doesn't track direct os.environ mutations made inside the test body, so tests that call load_gateway_config() leak those env vars into later tests on the same xdist worker. The failure mode is flaky seed-dependent: test_top_level_message_requires_mention_ even_with_session (and siblings in TestThreadReplyHandling) pass when SLACK_REQUIRE_MENTION is unset but fail when a leaked value of 'false' is present. Add the gating env vars to _HERMES_BEHAVIORAL_VARS so the hermetic autouse fixture blanks them on every test setup, closing the leak regardless of which test sets them.	2026-04-26 12:23:20 -07:00
Teknium	878c196738	chore(release): map hhhonzik in AUTHOR_MAP	2026-04-26 12:23:20 -07:00
Honza Stepanovsky	50dd67c680	fix(slack): skip _mentioned_threads registration when strict_mention is on Extends the strict_mention feature so an @mention in strict mode no longer persistently tags the thread as 'mentioned'. Without this, the thread's first mention would permanently auto-trigger the bot on every subsequent message — which is exactly what strict_mention is designed to prevent. Closes the agent-to-agent ack loop hole hhhonzik identified in #14117. Co-authored-by: hhhonzik <me@janstepanovsky.cz>	2026-04-26 12:23:20 -07:00
Ching	aea4a90f0e	feat(slack): add opt-in slack.strict_mention gate for channel threads Adds a strict_mention config option that, when enabled, requires an explicit @-mention on every message in channel threads. Disables the 'once mentioned, forever in the thread' and session-presence auto-triggers. - New _slack_strict_mention() helper (config.extra + SLACK_STRICT_MENTION env) - Bridged top-level slack.strict_mention yaml to SLACK_STRICT_MENTION env, matching require_mention/allow_bots bridging - Unit tests for the helper + config bridge	2026-04-26 12:23:20 -07:00
Teknium	897dc3a2bb	fix(install+update): add /usr/local/bin PATH guard for RHEL root non-login shells (#16191 ) * fix(install): add /usr/local/bin PATH guard for RHEL root non-login shells The FHS-layout branch assumed /usr/local/bin is on PATH for every standard shell. That holds for login shells (via /etc/profile's pathmunge) but breaks on RHEL/CentOS/Rocky/Alma 8+ root in non-login interactive shells (su, sudo -s, tmux panes, some web terminals) — /etc/bashrc does not add /usr/local/bin and /root/.bash_profile doesn't either. Result: hermes command links to /usr/local/bin/hermes but the user has to type the absolute path each time. Probe a fresh 'bash -i -c' (non-login interactive, matching the user scenario) after symlinking. If hermes isn't resolvable, append an idempotent PATH guard to /root/.bashrc and /root/.bash_profile, same grep pattern already used by the ~/.local/bin branch below. No change on distros where /usr/local/bin is already inherited. * fix(update): repair RHEL root PATH on hermes update Existing RHEL/CentOS/Rocky/Alma root installs won't be repaired by the install.sh fix alone because 'hermes update' is an in-place git pull, not a rerun of install.sh. Port the same probe + idempotent .bashrc write into cmd_update so affected users get fixed automatically on next update. _ensure_fhs_path_guard() runs after 'Update complete!': - Linux + root + FHS-layout install (command at /usr/local/bin/hermes) only - Probe: env -i bash -i -c 'command -v hermes' — fresh non-login interactive shell, same scenario the user reports - On failure, append PATH guard to /root/.bashrc and /root/.bash_profile, skipping if any uncommented PATH line already mentions /usr/local/bin - Silent no-op on macOS, non-root, legacy layout, or shells that already resolve hermes	2026-04-26 12:22:37 -07:00
Brooklyn Nicholson	350ee1bf23	refactor(tui): render progress in ordered stream timeline	2026-04-26 14:12:43 -05:00

1 2 3 4 5 ...

6189 Commits