hermes-agent-features

Author	SHA1	Message	Date
teyrebaz33	1caee06b22	fix: tool call repair — auto-lowercase, fuzzy match, helpful error on unknown tool (#520 ) - Add _repair_tool_call(): tries lowercase, normalize, then fuzzy match (difflib 0.7) - Replace 3-retry-then-abort with graceful error: model receives helpful message and self-corrects - Conversation stays alive instead of dying on hallucinated tool names Closes #520	2026-03-10 06:54:11 -07:00
teknium1	771969f747	fix: wire up enabled_tools in agent loop + simplify sandbox tool selection Completes the fix started in `8318a51` — handle_function_call() accepted enabled_tools but run_agent.py never passed it. Now both call sites in _execute_tool_calls() pass self.valid_tool_names, so each agent session uses its own tool list instead of the process-global _last_resolved_tool_names (which subagents can overwrite). Also simplifies the redundant ternary in code_execution_tool.py: sandbox_tools is already computed correctly (intersection with session tools, or full SANDBOX_ALLOWED_TOOLS as fallback), so the conditional was dead logic. Inspired by PR #663 (JasonOA888). Closes #662. Tests: 2857 passed.	2026-03-10 06:35:28 -07:00
vincent	b0a5fe8974	fix: continue after output-length truncation	2026-03-10 04:30:19 -07:00
teknium1	899dfdcfb9	Merge PR #616 : fix: retry with rebuilt payload after compression Authored by tripledoublev. After context compression on 413/400 errors, the inner retry loop was reusing the stale pre-compression api_messages payload. Fix breaks out of the inner retry loop so the outer loop rebuilds api_messages from the now-compressed messages list. Adds regression test verifying the second request actually contains the compressed payload.	2026-03-10 04:22:42 -07:00
teknium1	f16f2912cf	Merge PR #607 : fix: reset all retry counters at start of run_conversation() Authored by 0xbyt4. Adds missing resets for _incomplete_scratchpad_retries and _codex_incomplete_retries to prevent stale counters carrying over between CLI conversations.	2026-03-10 04:17:47 -07:00
teknium1	c1775de56f	feat: filesystem checkpoints and /rollback command Automatic filesystem snapshots before destructive file operations, with user-facing rollback. Inspired by PR #559 (by @alireza78a). Architecture: - Shadow git repos at ~/.hermes/checkpoints/{hash}/ via GIT_DIR - CheckpointManager: take/list/restore, turn-scoped dedup, pruning - Transparent — the LLM never sees it, no tool schema, no tokens - Once per turn — only first write_file/patch triggers a snapshot Integration: - Config: checkpoints.enabled + checkpoints.max_snapshots - CLI flag: hermes --checkpoints - Trigger: run_agent.py _execute_tool_calls() before write_file/patch - /rollback slash command in CLI + gateway (list, restore by number) - Pre-rollback snapshot auto-created on restore (undo the undo) Safety: - Never blocks file operations — all errors silently logged - Skips root dir, home dir, dirs >50K files - Disables gracefully when git not installed - Shadow repo completely isolated from project git Tests: 35 new tests, all passing (2798 total suite) Docs: feature page, config reference, CLI commands reference	2026-03-10 00:49:15 -07:00
teknium1	ee4008431a	fix: stop terminal border flashing with steady cursor and TUI spinner widget Cherry-picked and improved from PR #470 (fixes #464). Problem: On Ubuntu 24.04 with ghostty + tmux, the prompt input box border lines flash due to cursor blink and raw spinner terminal writes conflicting with prompt_toolkit's rendering. Changes: - cli.py: Add CursorShape.BLOCK to Application() to disable cursor blink - cli.py: Add thinking_callback + spinner_widget in TUI layout so thinking status displays as a proper prompt_toolkit widget instead of raw terminal writes that conflict with the TUI renderer - run_agent.py: Add thinking_callback parameter to AIAgent; when set, uses the callback instead of KawaiiSpinner for thinking display What was NOT changed (preserving existing behavior): - agent/display.py: Untouched. KawaiiSpinner _write() stdout capture, _animate() logic, and 0.12s frame interval all preserved. This protects subagent stdout redirection and keeps smooth animations for non-CLI contexts (gateway, batch runner). - Original emoji spinner types (brain/sparkle/pulse/moon/star) preserved for all non-CLI contexts. Fixes from original PR #470: - CursorShape.STEADY_BLOCK -> CursorShape.BLOCK (STEADY_BLOCK doesn't exist in prompt_toolkit 3.0.52) - Removed duplicate self._spinner_text = '' line - Removed redundant nested if-checks Tested: 2706 tests pass, interactive CLI verified via tmux.	2026-03-09 23:26:43 -07:00
teknium1	3e352f8a0d	fix: add upstream guard for non-dict function_args + tests for build_tool_preview Complements PR #453 by 0xbyt4. Adds isinstance(dict) guard in run_agent.py to catch cases where json.loads returns non-dict (e.g. null, list, string) before they reach downstream code. Also adds 15 tests for build_tool_preview covering None args, empty dicts, known/unknown tools, fallback keys, truncation, and all special-cased tools (process, todo, memory, session_search).	2026-03-09 21:01:40 -07:00
teyrebaz33	94023e6a85	feat: conditional skill activation based on tool availability Skills can now declare fallback_for_toolsets, fallback_for_tools, requires_toolsets, and requires_tools in their SKILL.md frontmatter. The system prompt builder filters skills automatically based on which tools are available in the current session. - Add _read_skill_conditions() to parse conditional frontmatter fields - Add _skill_should_show() to evaluate conditions against available tools - Update build_skills_system_prompt() to accept and apply tool availability - Pass valid_tool_names and available toolsets from run_agent.py - Backward compatible: skills without conditions always show; calling build_skills_system_prompt() with no args preserves existing behavior Closes #539	2026-03-09 23:13:39 +03:00
teknium1	1f0944de21	fix: handle non-string content from OpenAI-compatible servers (#759 ) Some local LLM servers (llama-server, etc.) return message.content as a dict or list instead of a plain string. This caused AttributeError 'dict object has no attribute strip' on every API call. Normalizes content to string immediately after receiving the response: - dict: extracts 'text' or 'content' field, falls back to json.dumps - list: extracts text parts (OpenAI multimodal content format) - other: str() conversion Applied at the single point where response.choices[0].message is read in the main agent loop, so all downstream .strip()/.startswith()/[:100] operations work regardless of server implementation. Closes #759	2026-03-09 03:32:32 -07:00
0xbyt4	4684aaffdc	merge: resolve file_tools.py conflict with origin/main Combine read/search loop detection with main's redact_sensitive_text and truncation hint features. Add tracker reset to TestSearchHints to prevent cross-test state leakage.	2026-03-09 13:21:46 +03:00
teknium1	aedb773f0d	fix: stabilize system prompt across gateway turns for cache hits Two changes to prevent unnecessary Anthropic prompt cache misses in the gateway, where a fresh AIAgent is created per user message: 1. Reuse stored system prompt for continuing sessions: When conversation_history is non-empty, load the system prompt from the session DB instead of rebuilding from disk. The model already has updated memory in its conversation history (it wrote it!), so re-reading memory from disk produces a different system prompt that breaks the cache prefix. 2. Stabilize Honcho context per session: - Only prefetch Honcho context on the first turn (empty history) - Bake Honcho context into the cached system prompt and store to DB - Remove the per-turn Honcho injection from the API call loop This ensures the system message is identical across all turns in a session. Previously, re-fetching Honcho could return different context on each turn, changing the system message and invalidating the cache. Both changes preserve the existing behavior for compression (which invalidates the prompt and rebuilds from scratch) and for the CLI (where the same AIAgent persists and the cached prompt is already stable across turns). Tests: 2556 passed (6 new)	2026-03-09 01:50:58 -07:00
teknium1	35d57ed752	refactor: unified OAuth/API-key credential resolution for fallback Split fallback provider handling into two clean registries: _FALLBACK_API_KEY_PROVIDERS — env-var-based (openrouter, zai, kimi, minimax) _FALLBACK_OAUTH_PROVIDERS — OAuth-based (openai-codex, nous) New _resolve_fallback_credentials() method handles all three cases (OAuth, API key, custom endpoint) and returns a uniform (key, url, mode) tuple. _try_activate_fallback() is now just validation + client build. Adds Nous Portal as a fallback provider — uses the same OAuth flow as the primary provider (hermes login), returns chat_completions mode. OAuth providers get credential refresh for free: the existing 401 retry handlers (_try_refresh_codex/nous_client_credentials) check self.provider, which is set correctly after fallback activation. 4 new tests (nous activation, nous no-login, codex retained). 27 total fallback tests passing, 2548 full suite.	2026-03-08 21:44:48 -07:00
teknium1	5785bd3272	feat: add openai-codex as fallback provider Codex OAuth uses a different auth flow (OAuth tokens, not env vars) and a different API mode (codex_responses, not chat_completions). The fallback now handles this specially: - Resolves credentials via resolve_codex_runtime_credentials() - Sets api_mode to codex_responses - Fails gracefully if no Codex OAuth session exists Also added to the commented-out config.yaml example. 2 new tests (codex activation + graceful failure).	2026-03-08 21:34:15 -07:00
teknium1	b3765c28d0	fix: restrict fallback providers to actual hermes providers Remove hallucinated providers (openai, deepseek, together, groq, fireworks, mistral, gemini, nous) from the fallback provider map. These don't exist in hermes-agent's provider system. The real supported providers for fallback are: openrouter (OPENROUTER_API_KEY) zai (ZAI_API_KEY) kimi-coding (KIMI_API_KEY) minimax (MINIMAX_API_KEY) minimax-cn (MINIMAX_CN_API_KEY) For any other OpenAI-compatible endpoint, users can use the base_url + api_key_env overrides in the config. Also adds Kimi User-Agent header for kimi fallback (matching the main provider system).	2026-03-08 20:49:55 -07:00
teknium1	161436cfdd	feat: simple fallback model for provider resilience When the primary model/provider fails after retries (rate limit, overload, auth errors, connection failures), Hermes automatically switches to a configured fallback model for the remainder of the session. Config (in ~/.hermes/config.yaml): fallback_model: provider: openrouter model: anthropic/claude-sonnet-4 Supports all major providers: OpenRouter, OpenAI, Nous, DeepSeek, Together, Groq, Fireworks, Mistral, Gemini — plus custom endpoints via base_url and api_key_env overrides. Design principles: - Dead simple: one fallback model, not a chain - One-shot: switches once, doesn't ping-pong back - Zero new dependencies: uses existing OpenAI client - Minimal code: ~100 lines in run_agent.py, ~5 lines in cli.py/gateway - Three trigger points: max retries exhausted, non-retryable client errors, and invalid response exhaustion Does NOT trigger on context overflow or payload-too-large errors (those are handled by the existing compression system). Addresses #737. 25 new tests, 2492 total passing.	2026-03-08 20:22:33 -07:00
teknium1	2394e18729	fix: add context to interruption messages for model awareness When the agent is interrupted, the model now receives descriptive context instead of a generic 'Operation interrupted.' string: - Tool skip messages include the tool name: '[Tool execution cancelled — terminal was skipped due to user interrupt]' '[Tool execution skipped — web_search was not started. User sent a new message]' - API call interrupts include timing: 'Operation interrupted: waiting for model response (4.2s elapsed).' - Retry/error interrupts include retry context: 'Operation interrupted: retrying API call after rate limit (retry 2/5).' 'Operation interrupted: handling API error (Timeout: connection timed out).' This helps the model understand what was happening when it was interrupted, reducing wasted iterations spent re-discovering state.	2026-03-08 18:58:23 -07:00
teknium1	60b6abefd9	feat: session naming with unique titles, auto-lineage, rich listing, resume by name - Schema v4: unique title index, migration from v2/v3 - set/get/resolve session titles with uniqueness enforcement - Auto-lineage: context compression auto-numbers titles (Task -> Task #2 -> Task #3) - resolve_session_by_title: auto-latest finds most recent continuation - list_sessions_rich: preview (first 60 chars) + last_active timestamp - CLI: -c accepts optional name arg (hermes -c 'my project') - CLI: /title command with deferred mode (set before session exists) - CLI: sessions list shows Title, Preview, Last Active, ID - 27 new tests (1844 total passing)	2026-03-08 15:20:29 -07:00
0xbyt4	9eee529a7f	fix: detect and warn on file re-read loops after context compression When context compression summarizes conversation history, the agent loses track of which files it already read and re-reads them in a loop. Users report the agent reading the same files endlessly without writing. Root cause: context compression is lossy — file contents and read history are lost in the summary. After compression, the model thinks it hasn't examined the files yet and reads them again. Fix (two-part): 1. Track file reads per task in file_tools.py. When the same file region is read again, include a _warning in the response telling the model to stop re-reading and use existing information. 2. After context compression, inject a structured message listing all files already read in the session with explicit "do NOT re-read" instruction, preserving read history across compression boundaries. Adds 16 tests covering warning detection, task isolation, summary accuracy, tracker cleanup, and compression history injection.	2026-03-08 20:44:42 +03:00
teknium1	19b6f81ee7	fix: allow Anthropic API URLs as custom OpenAI-compatible endpoints Removed the hard block on base_url containing 'api.anthropic.com'. Anthropic now offers an OpenAI-compatible /chat/completions endpoint, so blocking their URL prevents legitimate use. If the endpoint isn't compatible, the API call will fail with a proper error anyway. Removed from: run_agent.py, mini_swe_runner.py Updated test to verify Anthropic URLs are accepted.	2026-03-07 23:36:35 -08:00
Christo Mitov	4447e7d71a	fix: add Kimi Code API support (api.kimi.com/coding/v1) Kimi Code (platform.kimi.ai) issues API keys prefixed sk-kimi- that require: 1. A different base URL: api.kimi.com/coding/v1 (not api.moonshot.ai/v1) 2. A User-Agent header identifying a recognized coding agent Without this fix, sk-kimi- keys fail with 401 (wrong endpoint) or 403 ('only available for Coding Agents') errors. Changes: - Auto-detect sk-kimi- key prefix and route to api.kimi.com/coding/v1 - Send User-Agent: KimiCLI/1.0 header for Kimi Code endpoints - Legacy Moonshot keys (api.moonshot.ai) continue to work unchanged - KIMI_BASE_URL env var override still takes priority over auto-detection - Updated .env.example with correct docs and all endpoint options - Fixed doctor.py health check for Kimi Code keys Reference: https://github.com/MoonshotAI/kimi-cli (platforms.py)	2026-03-07 21:00:12 -05:00
vincent	86eed141af	fix: rebuild compressed payload before retry	2026-03-07 18:55:01 -05:00
teknium1	e64d646bad	Critical: fix bug in new subagent tool call budget to not be session-level but tool call loop level	2026-03-07 10:32:51 -08:00
teknium1	b84f9e410c	feat: default reasoning effort from xhigh to medium Reduces token usage and latency for most tasks by defaulting to medium reasoning effort instead of xhigh. Users can still override via config or CLI flag. Updates code, tests, example config, and docs.	2026-03-07 10:14:19 -08:00
teknium1	23e84de830	refactor: remove model parameter from AIAgent initialization Eliminated the model parameter from the AIAgent class initialization, streamlining the constructor and ensuring consistent behavior across agent instances. This change aligns with recent updates to the task delegation logic.	2026-03-07 09:48:19 -08:00
teknium1	5a711f32b1	fix: enhance payload and context compression handling Added logic to manage multiple compression attempts for large payloads and context length errors. Introduced limits on compression attempts to prevent infinite retries, with appropriate logging and error handling. This ensures better resilience and user feedback when facing compression issues during API calls.	2026-03-07 09:19:07 -08:00
0xbyt4	8c26a057a3	fix: reset all retry counters at start of run_conversation() _incomplete_scratchpad_retries and _codex_incomplete_retries were not reset at the start of run_conversation(). In CLI mode, where the same AIAgent instance is reused across conversations, stale counters from a previous conversation could carry over, causing premature retry exhaustion and partial responses.	2026-03-07 20:12:08 +03:00
teknium1	4d34427cc7	fix: update model version in agent configurations Updated the default model version from "anthropic/claude-sonnet-4-20250514" to "anthropic/claude-sonnet-4.6" across multiple files including AGENTS.md, batch_runner.py, mini_swe_runner.py, and run_agent.py for consistency and to reflect the latest model improvements.	2026-03-07 09:06:37 -08:00
teknium1	0a82396718	feat: shared iteration budget across parent + subagents Subagent tool calls now count toward the same session-wide iteration limit as the parent agent. Previously, each subagent had its own independent counter, so a parent with max_iterations=60 could spawn 3 subagents each doing 50 calls = 150 total tool calls unmetered. Changes: - IterationBudget: thread-safe shared counter (run_agent.py) - consume(): try to use one iteration, returns False if exhausted - refund(): give back one iteration (for execute_code turns) - Thread-safe via Lock (subagents run in ThreadPoolExecutor) - Parent creates the budget, children inherit it via delegate_tool.py - execute_code turns are refunded (don't count against budget) - Default raised from 60 → 90 to account for shared consumption - Per-child cap (50) still applies as a safety valve The per-child max_iterations (default 50) remains as a per-child ceiling, but the shared budget is the hard session-wide limit. A child stops at whichever comes first.	2026-03-07 08:16:37 -08:00
teknium1	5da55ea1e3	fix: sanitize orphaned tool-call/result pairs in message compression Enhance message compression by adding a method to clean up orphaned tool-call and tool-result pairs. This ensures that the API receives well-formed messages, preventing errors related to mismatched IDs. The new functionality includes removing orphaned results and adding stub results for missing calls, improving overall message integrity during compression.	2026-03-07 08:08:00 -08:00
teknium1	69a36a3361	Merge PR #309 : fix(timezone): timezone-aware now() for prompt, cron, and execute_code Authored by areu01or00. Adds timezone support via hermes_time.now() helper with IANA timezone resolution (HERMES_TIMEZONE env → config.yaml → server-local). Updates system prompt timestamp, cron scheduling, and execute_code sandbox TZ injection. Includes config migration (v4→v5) and comprehensive test coverage.	2026-03-07 00:04:41 -08:00
Robin Fernandes	bc091eb7ef	fix: implement Nous credential refresh on 401 error for retry logic	2026-03-07 13:34:23 +11:00
teknium1	8ae4a6f824	fix: improve handling of empty responses after tool calls - Added fallback mechanism to utilize previous content when the model generates an empty response after tool calls, reducing unnecessary API retries. - Enhanced logging to indicate when prior content is used as a final response. - Updated logic to ensure that genuine empty responses are retried appropriately, maintaining user experience.	2026-03-06 16:54:31 -08:00
teknium1	3e93db16bd	Merge PR #436 : fix: use _max_tokens_param in max-iterations retry path Authored by Farukest. Fixes #435. The retry summary in _handle_max_iterations() hardcoded max_tokens instead of using _max_tokens_param(), which returns max_completion_tokens for direct OpenAI API (required by gpt-4o, o-series). The first attempt already used _max_tokens_param correctly — only the retry path was wrong. Includes 4 tests for _max_tokens_param provider detection.	2026-03-06 04:46:24 -08:00
teknium1	c886333d32	feat: smart context length probing with persistent caching + banner display Replaces the unsafe 128K fallback for unknown models with a descending probe strategy (2M → 1M → 512K → 200K → 128K → 64K → 32K). When a context-length error occurs, the agent steps down tiers and retries. The discovered limit is cached per model+provider combo in ~/.hermes/context_length_cache.yaml so subsequent sessions skip probing. Also parses API error messages to extract the actual context limit (e.g. 'maximum context length is 32768 tokens') for instant resolution. The CLI banner now displays the context window size next to the model name (e.g. 'claude-opus-4 · 200K context · Nous Research'). Changes: - agent/model_metadata.py: CONTEXT_PROBE_TIERS, persistent cache (save/load/get), parse_context_limit_from_error(), get_next_probe_tier() - agent/context_compressor.py: accepts base_url, passes to metadata - run_agent.py: step-down logic in context error handler, caches on success - cli.py + hermes_cli/banner.py: context length in welcome banner - tests: 22 new tests for probing, parsing, and caching Addresses #132. PR #319's approach (8K default) rejected — too conservative.	2026-03-05 16:09:57 -08:00
PercyDikec	938499ddfb	fix: add missing empty-content guard after think-block stripping in retry path	2026-03-05 18:57:59 +03:00
Farukest	e25ad79d5d	fix: use _max_tokens_param in max-iterations retry path The retry summary in _handle_max_iterations hardcodes max_tokens instead of calling _max_tokens_param(). For direct OpenAI API users (gpt-4o, o-series), the correct parameter name is max_completion_tokens. The first attempt at line 2697 already uses _max_tokens_param correctly but the retry path at line 2743 was missed.	2026-03-05 17:49:37 +03:00
teknium1	41adca4e77	fix: strip internal fields from API messages in _handle_max_iterations The flush_memories() and run_conversation() code paths already stripped finish_reason and reasoning from API messages (added in `7a0b377` via PR #253), but _handle_max_iterations() was missed. It was sending raw messages.copy() which could include finish_reason, causing 422 errors on strict APIs like Mistral when the agent hit max iterations. Now strips the same internal fields consistently across all three API call sites.	2026-03-04 21:08:20 -08:00
teknium1	3220bb8aaa	Merge PR #403 : Fix context overrun crash with local LLM backends Authored by ch3ronsa. Fixes #348. Adds 'context size' (LM Studio) and 'context window' (Ollama) to context-length error detection phrases so local backend 400 errors trigger compression instead of aborting. Also removes 'error code: 400' from the non-retryable error list as defense in depth.	2026-03-04 17:48:44 -08:00
teknium1	8311e8984b	fix: preflight context compression + error handler ordering for model switches Two fixes for the case where a user switches to a model with a smaller context window while having a large existing session: 1. Preflight compression in run_conversation(): Before the main loop, estimate tokens of loaded history + system prompt. If it exceeds the model's compression threshold (85% of context), compress proactively with up to 3 passes. This naturally handles model switches because the gateway creates a fresh AIAgent per message with the current model's context length. 2. Error handler reordering: Context-length errors (400 with 'maximum context length' etc.) are now checked BEFORE the generic 4xx handler. Previously, OpenRouter's 400-status context-length errors were caught as non-retryable client errors and aborted immediately, never reaching the compression+retry logic. Reported by Sonicrida on Discord: 840-message session (2MB+) crashed after switching from a large-context model to minimax via OpenRouter.	2026-03-04 14:42:41 -08:00
Vicaversa	e9ab711b66	Fix context overrun crash with local LLM backends (fixes #348 ) Local backends (LM Studio, Ollama, llama.cpp) return HTTP 400 with messages like "Context size has been exceeded" when the context window is full. The error phrase list did not include "context size" or "context window", so these errors fell through to the generic 4xx abort handler instead of triggering compression. Changes: - Move context-length check above generic 4xx handler so it runs first (same pattern as the existing 413 check) - Add "context size" and "context window" to the phrase list - Guard 4xx handler with `not is_context_length_error` to prevent context-related 400s from being treated as non-retryable	2026-03-05 01:12:34 +03:00
teknium1	70a0a5ff4a	fix: exclude current session from session_search results session_search was returning the current session if it matched the query, which is redundant — the agent already has the current conversation context. This wasted an LLM summarization call and a result slot. Added current_session_id parameter to session_search(). The agent passes self.session_id and the search filters out any results where either the raw or parent-resolved session ID matches. Both the raw match and the parent-resolved match are checked to handle child sessions from delegation. Two tests added verifying the exclusion works and that other sessions are still returned.	2026-03-04 06:06:40 -08:00
teknium1	db0521ce0e	Merge PR #184 : feat: Home Assistant integration (REST tools + WebSocket gateway) Authored by 0xbyt4. Adds smart home control via REST tools (ha_list_entities, ha_get_state, ha_call_service) with domain blocklist and entity_id validation, plus WebSocket gateway adapter for real-time event monitoring. Also includes Gemini 3 thought_signature preservation fix (extra_content on tool calls) needed for multi-turn tool calling via OpenRouter.	2026-03-03 05:01:39 -08:00
areu01or00	a1c25046a9	fix(timezone): add timezone-aware clock across agent, cron, and execute_code	2026-03-03 18:23:40 +05:30
0xbyt4	aefc330b8f	merge: resolve conflict with main (add mcp + homeassistant extras)	2026-03-03 14:52:22 +03:00
teknium1	4f5ffb8909	fix: NoneType not iterable error when summarizing at max iterations In _handle_max_iterations, the codex_responses path set tools=None to prevent tool calls during summarization. However, the OpenAI SDK's _make_tools() treats None as a valid value (not its Omit sentinel) and tries to iterate over it, causing TypeError: 'NoneType' object is not iterable. Fix: use codex_kwargs.pop('tools', None) to remove the key entirely, so the SDK never receives it and uses its default omit behavior. Fixes #300	2026-03-03 03:42:44 -08:00
teknium1	3c13feed4c	feat: show detailed tool call args in gateway based on config Issue #263: Telegram/Discord/WhatsApp/Slack now show tool call details based on display.tool_progress in config.yaml. Changes: - gateway/run.py: 'verbose' mode shows full args (keys + JSON, 200 char max). 'all' mode preview increased from 40 to 80 chars. Added missing tool emojis (execute_code, delegate_task, clarify, skill_manage, search_files). - agent/display.py: Added execute_code, delegate_task, clarify, skill_manage to primary_args. Added 'code' and 'goal' to fallback keys. - run_agent.py: Pass function_args dict to tool_progress_callback so gateway can format based on its own verbosity config. Config usage: display: tool_progress: verbose # off \| new \| all \| verbose	2026-03-02 05:23:15 -08:00
teknium1	56b53bff6e	Merge PR #229 : fix(agent): copy conversation_history to avoid mutating caller's list Authored by Farukest. Fixes #228. # Conflicts: # tests/test_run_agent.py	2026-03-02 04:21:39 -08:00
teknium1	c4ea996612	fix: repair flush sentinel test — mock auxiliary client and add guard The TestFlushSentinelNotLeaked test from PR #227 had two issues: 1. flush_memories() uses get_text_auxiliary_client() which could bypass agent.client entirely — mock it to return (None, None) 2. No assertion that the API was actually called — added guard assert Without these fixes the test passed vacuously (API never called).	2026-03-02 03:21:08 -08:00
teknium1	e27e3a4f8a	Merge PR #223 : fix: correct off-by-one in retry exhaustion checks Authored by Farukest. Fixes #222.	2026-03-02 02:54:10 -08:00
teknium1	33ab5cec82	fix: handle None message content across codebase (fixes #276 ) The OpenAI API returns content: null on assistant messages with tool calls. msg.get('content', '') returns None when the key exists with value None, causing TypeError on len(), string concatenation, and .strip() in downstream code paths. Fixed 4 locations that process conversation messages: - agent/auxiliary_client.py:84 — None passed to API calls - cli.py:1288 — crash on content[:200] and len(content) - run_agent.py:3444 — crash on None.strip() - honcho_integration/session.py:445 — 'None' rendered in transcript 13 other instances were verified safe (already protected, only process user/tool messages, or use the safe pattern). Pattern: msg.get('content', '') → msg.get('content') or '' Fixes #276	2026-03-02 02:23:53 -08:00
Sertug17	7a0b37712f	fix(agent): strip finish_reason from assistant messages to fix Mistral 422 errors (#253 ) * fix(agent): skip reasoning param for Mistral API to prevent 422 errors * fix(agent): strip finish_reason from assistant messages to fix Mistral 422 errors	2026-03-02 00:35:03 -08:00
teknium1	45d132d098	fix(agent): remove preview truncation in assistant message output Updated the AIAgent class to print the full content of assistant messages without truncation, enhancing visibility of the messages during runtime. This change improves the clarity of communication from the agent.	2026-03-02 00:32:06 -08:00
teknium1	0512ada793	feat(agent): include tools in agent status output Added the tools attribute to the AIAgent class's status output, ensuring that the current tools used by the agent are included in the status information. This enhancement improves the visibility of the agent's capabilities during runtime.	2026-03-02 00:13:41 -08:00
teknium1	47289ba6f1	feat(agent): include system prompt in agent status output Added the system prompt to the AIAgent class's status output, ensuring that the current system prompt is included in the agent's status information. This enhancement improves visibility into the agent's configuration during runtime.	2026-03-01 23:50:54 -08:00
teknium1	e5893075f9	feat(agent): add summary handling for reasoning items Enhanced the AIAgent class to capture and normalize summary information for reasoning items. Implemented logic to handle summaries as lists, ensuring proper formatting for API interactions. Updated tests to validate the inclusion of summaries in reasoning items, both for existing and default cases.	2026-03-01 20:03:03 -08:00
teknium1	8bc2de4ab6	feat(provider-routing): add OpenRouter provider routing configuration Introduced a new `provider_routing` section in the CLI configuration to control how requests are routed across providers when using OpenRouter. This includes options for sorting providers by throughput, latency, or price, as well as allowing or ignoring specific providers, setting the order of provider attempts, and managing data collection policies. Updated relevant classes and documentation to support these features, enhancing flexibility in provider selection.	2026-03-01 18:24:27 -08:00
teknium1	92da8e7e62	feat(agent): enhance reasoning handling and configuration Added support for processing encrypted reasoning content within the AIAgent class. Introduced logic to determine reasoning effort and enable/disable reasoning based on configuration settings. Updated the kwargs to reflect these changes, ensuring proper handling of reasoning parameters during agent execution.	2026-03-01 16:15:20 -08:00
0xbyt4	3fdf03390e	Merge remote-tracking branch 'origin/main' into feature/homeassistant-integration # Conflicts: # run_agent.py	2026-03-01 11:59:12 +03:00
teknium1	177be32b7f	feat(cli): add /usage command to display session token usage Introduced a new command "/usage" in the CLI to show cumulative token usage for the current session. This includes details on prompt tokens, completion tokens, total tokens, API calls, and context state. Updated command documentation to reflect this addition. Enhanced the AIAgent class to track token usage throughout the session.	2026-03-01 00:23:19 -08:00
lila	dd69f16c3e	feat(gateway): expose subagent tool calls and thinking to user (fixes #169 ) (#186 ) When subagents run via delegate_task, the user now sees real-time progress instead of silence: CLI: tree-view activity lines print above the delegation spinner 🔀 Delegating: research quantum computing ├─ 💭 "I'll search for papers first..." ├─ 🔍 web_search "quantum computing" ├─ 📖 read_file "paper.pdf" └─ ⠹ working... (18.2s) Gateway (Telegram/Discord): batched progress summaries sent every 5 tool calls to avoid message spam. Remaining tools flushed on subagent completion. Changes: - agent/display.py: add KawaiiSpinner.print_above() to print status lines above an active spinner without disrupting animation. Uses captured stdout (self._out) so it works inside the child's redirect_stdout(devnull). - tools/delegate_tool.py: add _build_child_progress_callback() that creates a per-child callback relaying tool calls and thinking events to the parent's spinner (CLI) or progress queue (gateway). Each child gets its own callback instance, so parallel subagents don't share state. Includes _flush() for gateway batch completion. - run_agent.py: fire tool_progress_callback with '_thinking' event when the model produces text content. Guarded by _delegate_depth > 0 so only subagents fire this (prevents gateway spam from main agent). REASONING_SCRATCHPAD/think/ reasoning XML tags are stripped before display. Tests: 21 new tests covering print_above, callback builder, thinking relay, SCRATCHPAD filtering, batching, flush, thread isolation, delegate_depth guard, and prefix handling.	2026-02-28 23:18:00 -08:00
teknium1	23d0b7af6a	feat(logging): implement persistent error logging for tool failures - Introduce a separate error log for capturing warnings and errors related to tool execution, ensuring detailed inspection of issues post-failure. - Enhance error handling in the AIAgent class to log exceptions with stack traces for better debugging. - Add a similar error logging mechanism in the gateway to streamline debugging processes.	2026-02-28 22:49:58 -08:00
teknium1	95b0610f36	refactor(cli, auth): Add Codex/OpenAI OAuth Support - finalized - Replace `hermes login` with `hermes model` for selecting providers and managing authentication. - Update documentation and CLI commands to reflect the new provider selection process. - Introduce a new redaction system for logging sensitive information. - Enhance Codex model discovery by integrating API fetching and local cache. - Adjust max turns configuration logic for better clarity and precedence. - Improve error handling and user feedback during authentication processes.	2026-02-28 21:56:27 -08:00
teknium1	500f0eab4a	refactor(cli): Finalize OpenAI Codex Integration with OAuth - Enhanced Codex model discovery by fetching available models from the API, with fallback to local cache and defaults. - Updated the context compressor's summary target tokens to 2500 for improved performance. - Added external credential detection for Codex CLI to streamline authentication. - Refactored various components to ensure consistent handling of authentication and model selection across the application.	2026-02-28 21:47:51 -08:00
Teknium	5a79e423fe	Merge branch 'main' into codex/align-codex-provider-conventions-mainrepo	2026-02-28 18:13:38 -08:00
teknium1	7f7643cf63	feat(hooks): introduce event hooks system for lifecycle management Add a new hooks system allowing users to run custom code at key lifecycle points in the agent's operation. This includes support for events such as `gateway:startup`, `session:start`, `agent:step`, and more. Documentation for creating hooks and available events has been added to `README.md` and a new `hooks.md` file. Additionally, integrate step callbacks in the agent to facilitate hook execution during tool-calling iterations.	2026-02-28 17:09:26 -08:00
Teknium	31a5cd185a	Merge pull request #174 from Bartok9/fix-think-block-leakage fix: strip <think> blocks from final response to users	2026-02-28 16:43:47 -08:00
Farukest	e87859e82c	fix(agent): copy conversation_history to avoid mutating caller's list	2026-03-01 03:06:13 +03:00
Farukest	de101a8202	fix(agent): strip _flush_sentinel from API messages	2026-03-01 02:51:31 +03:00
Farukest	c33f8d381b	fix: correct off-by-one in retry exhaustion checks The retry exhaustion checks used > instead of >= to compare retry_count against max_retries. Since the while loop condition is retry_count < max_retries, the check retry_count > max_retries can never be true inside the loop. When retries are exhausted, the loop exits and falls through to response.choices[0] on an invalid response, crashing with IndexError instead of returning a proper error.	2026-03-01 02:27:26 +03:00
teknium1	2205b22409	fix(headers): update X-OpenRouter-Categories to include 'productivity'	2026-02-28 10:38:49 -08:00
0xbyt4	dfd50ceccd	fix: preserve Gemini thought_signature in tool call messages Gemini 3 thinking models attach extra_content with thought_signature to function call responses. This must be echoed back on subsequent API calls or the server rejects with a 400 error. The assistant message builder was dropping this field, causing all Gemini 3 Flash/Pro tool-calling flows to fail after the first function call.	2026-02-28 18:10:05 +03:00
teknium1	6366177118	refactor: update context compression configuration to use config.yaml and improve model handling	2026-02-28 04:46:38 -08:00
Bartok9	1e463a8e39	fix: strip <think> blocks from final response to users Fixes #149 The _strip_think_blocks() method existed but was not applied to the final_response in the normal completion path. This caused <think>...</think> XML tags to leak into user-facing responses on all platforms (CLI, Telegram, Discord, Slack, WhatsApp). Changes: - Strip think blocks from final_response before returning in normal path (line ~2600) - Strip think blocks from fallback content when salvaging from prior tool_calls turn Notes: - The raw content with think blocks is preserved in messages[] for trajectory export - this only affects the user-facing final_response - The _has_content_after_think_block() check still uses raw content before stripping, which is correct for detecting think-only responses	2026-02-28 03:06:20 -05:00
Teknium	4a9086b848	Merge branch 'main' into feat/honcho-integration	2026-02-27 23:32:49 -08:00
teknium1	50cb4d5fc7	fix(agent): update error message for unsupported Anthropic API endpoints to clarify usage of OpenRouter	2026-02-27 23:23:31 -08:00
Teknium	2bc9508b7c	Merge pull request #173 from adavyas/fix/anthropic-base-url-guard fix(agent): fail fast on Anthropic native base URLs	2026-02-27 23:22:01 -08:00
teknium1	19f28a633a	fix(agent): enhance 413 error handling and improve conversation history management in tests	2026-02-27 23:04:32 -08:00
Teknium	2c817ce4a5	Merge pull request #153 from tekelala/main fix(agent): handle 413 payload-too-large via compression instead of aborting	2026-02-27 22:57:55 -08:00
adavyas	0c0a2eb0a2	fix(agent): fail fast on Anthropic native base URLs	2026-02-27 21:19:29 -08:00
teknium1	de0829cec3	fix(cli): increase max iterations for child agents and extend API call timeout for improved reliability	2026-02-27 17:35:29 -08:00
tekelala	79bd65034c	fix(agent): handle 413 payload-too-large via compression instead of aborting The 413 "Request Entity Too Large" error from the LLM API was caught by the generic 4xx handler which aborts immediately. This is wrong for 413 — it's a payload-size issue that can be resolved by compressing conversation history. - Intercept 413 before the generic 4xx block and route to _compress_context - Exclude 413 from generic is_client_error detection - Add 'request entity too large' to context-length phrases as safety net - Add tests for 413 compression behavior Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 12:21:27 -05:00
teknium1	c77f3da0ce	Cherry-pick 6 bug fixes from PR #76 and update documentation Code fixes (run_agent.py): - Fix off-by-one in _flush_messages_to_session_db skipping one message per flush - Add clear_interrupt() to 3 early-return paths preventing stale interrupt state - Wrap handle_function_call in try/except so tool crashes don't kill the conversation - Replace fragile `is` identity check with _flush_sentinel marker for memory flush cleanup - Fix retry loop off-by-one (6 attempts not 7) - Remove redundant inline `import re`	2026-02-27 03:21:49 -08:00
Bartok Moltbot	8aa531c7fa	fix(gateway): Pass session_db to AIAgent, fixing session_search error When running via the gateway (e.g. Telegram), the session_search tool returned: {"error": "session_search must be handled by the agent loop"} Root cause: - gateway/run.py creates AIAgent without passing session_db= - self._session_db is None in the agent instance - The dispatch condition "elif function_name == 'session_search' and self._session_db" skips when _session_db is None, falling through to the generic error This fix: 1. Initializes self._session_db in GatewayRunner.__init__() 2. Passes session_db to all AIAgent instantiations in gateway/run.py 3. Adds defensive fallback in run_agent.py to return a clear error when session_db is unavailable, instead of falling through Fixes #105	2026-02-27 00:32:17 -05:00
teknium1	58fce0a37b	feat(api): implement dynamic max tokens handling for various providers - Added _max_tokens_param method in AIAgent to return appropriate max tokens parameter based on the provider (OpenAI vs. others). - Updated API calls in AIAgent to utilize the new max tokens handling. - Introduced auxiliary_max_tokens_param function in auxiliary_client for consistent max tokens management across auxiliary clients. - Refactored multiple tools to use auxiliary_max_tokens_param for improved compatibility with different models and providers.	2026-02-26 20:23:56 -08:00
Erosika	70d1abf81b	refactor: run Honcho and USER.md in tandem USER.md stays in system prompt when Honcho is active -- prefetch is additive context, not a replacement. Memory tool user observations write to both USER.md (local) and Honcho (cross-session) simultaneously.	2026-02-26 18:07:33 -05:00
Erosika	1fd0fcddb2	feat: integrate Honcho with USER.md memory system When Honcho is active: - System prompt uses Honcho prefetch instead of USER.md - memory tool target=user add routes to Honcho - MEMORY.md untouched in all cases When disabled, everything works as before. Also wires up contextTokens config to cap prefetch size.	2026-02-26 18:07:17 -05:00
Erosika	ab4bbf2fb2	feat: add Honcho AI-native memory integration Opt-in persistent cross-session user modeling via Honcho. Reads ~/.honcho/config.json as single source of truth (shared with Claude Code, Cursor, and other Honcho-enabled tools). Zero impact when disabled or unconfigured. - honcho_integration/ package (client, session manager, peer resolution) - Host-based config resolution matching claude-honcho/cursor-honcho pattern - Prefetch user context into system prompt per conversation turn - Sync user/assistant messages to Honcho after each exchange - query_user_context tool for mid-conversation dialectic reasoning - Gated activation: requires ~/.honcho/config.json with enabled=true	2026-02-26 18:07:17 -05:00
George Pickett	32070e6bc0	Merge remote-tracking branch 'origin/main' into codex/align-codex-provider-conventions-mainrepo # Conflicts: # cron/scheduler.py # gateway/run.py # tools/delegate_tool.py	2026-02-26 10:56:29 -08:00
Dean Kerr	5a569eb1b6	fix: resolve .env and config paths from HERMES_HOME, not PROJECT_ROOT The `hermes` CLI entry point (hermes_cli/main.py) and the agent runner (run_agent.py) only loaded .env from the project installation directory. After the standard installer, code lives at ~/.hermes/hermes-agent/ but config lives at ~/.hermes/ — so the .env was never found. Aligns these entry points with the pattern already used by gateway/run.py and rl_cli.py: load ~/.hermes/.env first, fall back to project root .env for dev-mode compatibility. Also fixes: - status.py checking .env existence and API keys at PROJECT_ROOT - doctor.py KeyError on tool availability (missing_vars vs env_vars) - doctor.py checking logs/ and Skills Hub at PROJECT_ROOT instead of HERMES_HOME - doctor.py redundant logs/ check (already covered by subdirectory loop) - mini-swe-agent loading config from platformdirs default instead of ~/.hermes/ Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 16:49:14 +11:00
George Pickett	74c662b63a	Harden Codex auth refresh and responses compatibility	2026-02-25 19:27:54 -08:00
George Pickett	91bdb9eb2d	Fix Codex stream fallback for Responses completion gaps	2026-02-25 19:08:11 -08:00
George Pickett	47f16505d2	Omit optional function_call id in Responses replay input	2026-02-25 19:00:11 -08:00
George Pickett	e63986b534	Harden Codex stream handling and ack continuation	2026-02-25 18:56:06 -08:00
George Pickett	ce175d7372	Fix Codex Responses continuation and schema parity	2026-02-25 18:20:41 -08:00
George Pickett	609b19b630	Add OpenAI Codex provider runtime and responses integration (without .agent/PLANS.md)	2026-02-25 18:20:38 -08:00
teknium1	e3cb957a10	refactor: streamline reasoning configuration checks in AIAgent - Simplified the logic for determining support for reasoning based on the base URL by introducing clearer variable names. - Added product attribution for the Nous Portal to the extra body of requests when applicable, enhancing tagging for better tracking.	2026-02-25 16:49:41 -08:00
teknium1	9a858b8d67	add identifier for openrouter calls	2026-02-25 16:34:47 -08:00
teknium1	d72b9eadec	More fixes for windoze	2026-02-25 15:20:42 -08:00
teknium1	f64a87209d	refactor: enhance session content handling in AIAgent and update TTS output path - Introduced a new static method `_clean_session_content` in the `AIAgent` class to convert REASONING_SCRATCHPAD tags to <think> blocks and clean up whitespace in session logs. - Updated the `_save_session_log` method to utilize the cleaned content for assistant messages, ensuring consistency in session logs. - Changed the default output directory for TTS audio files from `~/voice-memos` to `~/.hermes/audio_cache`, reflecting a more appropriate storage location.	2026-02-25 04:22:03 -08:00
teknium1	41df8ee4f5	refactor: enhance interrupt handling in AIAgent class - Updated the `clear_interrupt` method to also reset the global tool interrupt signal, improving the clarity of interrupt management within the agent. - This change ensures that all interrupt states are properly cleared, enhancing the reliability of the agent's operation.	2026-02-25 03:45:47 -08:00
teknium1	681141a526	fix: ansi escapes causing broken terminal cli output	2026-02-24 03:42:12 -08:00
teknium1	e049441d93	feat: add reasoning effort configuration for agent - Introduced a new configuration option for reasoning effort in the CLI, allowing users to specify the level of reasoning the agent should perform before responding. - Updated the CLI and agent initialization to incorporate the reasoning configuration, enhancing the agent's responsiveness and adaptability. - Implemented logic to load reasoning effort from environment variables and configuration files, providing flexibility in agent behavior. - Enhanced the documentation in the example configuration file to clarify the new reasoning effort options available.	2026-02-24 03:30:19 -08:00
teknium1	2bf96ad244	feat: add ephemeral prefill messages and system prompt loading - Implemented functionality to load ephemeral prefill messages from a JSON file, enhancing few-shot priming capabilities for the agent. - Introduced a mechanism to load an ephemeral system prompt from environment variables or configuration files, ensuring dynamic prompt adjustments at API-call time. - Updated the CLI and agent initialization to utilize the new prefill messages and system prompt, improving the overall interaction experience. - Enhanced configuration options with new environment variables for prefill messages and system prompts, allowing for greater customization without persistence.	2026-02-23 23:55:42 -08:00
teknium1	d18c753b3c	refactor: streamline scratchpad handling in AIAgent - Removed static methods for converting and checking <REASONING_SCRATCHPAD> tags, simplifying the codebase. - Replaced calls to the removed methods with direct function calls for better clarity and maintainability. - Updated trajectory saving logic to utilize a dedicated function for improved organization and readability.	2026-02-23 09:55:09 -08:00
teknium1	90af34bc83	feat: enhance interrupt handling and container resource configuration - Introduced a shared interrupt signaling mechanism to allow tools to check for user interrupts during long-running operations. - Updated the AIAgent to handle interrupts more effectively, ensuring in-progress tool calls are canceled and multiple interrupt messages are combined into one prompt. - Enhanced the CLI configuration to include container resource limits (CPU, memory, disk) and persistence options for Docker, Singularity, and Modal environments. - Improved documentation to clarify interrupt behaviors and container resource settings, providing users with better guidance on configuration and usage.	2026-02-23 02:11:33 -08:00
teknium1	c7857dc1d4	feat: enhance AIAgent's tool usage nudges and content handling - Introduced a method to strip <think> blocks from content, improving text visibility. - Implemented counters to reset nudge intervals when memory and skill tools are used, enhancing user guidance. - Captured content from turns with tool calls to provide fallback responses, ensuring continuity in conversation. - Updated nudge logic to remind users about saving memories and creating skills based on interaction patterns.	2026-02-22 21:33:28 -08:00
teknium1	6037b6a5ab	Fix session saving to DB with full conversation history (not just user/assistant messages without tool calls)	2026-02-22 17:10:24 -08:00
teknium1	db23f51bc6	feat: introduce skills management features in AIAgent and CLI - Added skills configuration options in cli-config.yaml.example, including a nudge interval for skill creation reminders. - Implemented skills guidance in AIAgent to prompt users to save reusable workflows after complex tasks. - Enhanced skills indexing in the prompt builder to include descriptions from SKILL.md files for better context. - Updated the agent's behavior to periodically remind users about potential skills during tool-calling iterations.	2026-02-22 13:28:13 -08:00
teknium1	3c6750f37b	feat: enhance memory management features in AIAgent and CLI - Added configuration options for memory nudge interval and flush minimum turns in cli-config.yaml.example. - Implemented memory flushing before conversation reset, clearing, and exit in the CLI to ensure memories are saved. - Introduced a flush_memories method in AIAgent to handle memory persistence before context loss. - Added periodic nudges to remind the agent to consider saving memories based on user interactions.	2026-02-22 10:15:17 -08:00
teknium1	e223b4ac09	Enhance agent guidance with memory and session search tools - Introduced MEMORY_GUIDANCE and SESSION_SEARCH_GUIDANCE to improve agent's contextual awareness and proactive assistance. - Updated AIAgent to conditionally include tool-aware guidance in prompts based on available tools. - Enhanced descriptions in memory and session search schemas for clearer user instructions on when to utilize these features.	2026-02-22 02:31:52 -08:00
teknium1	f072801f38	refactor: remove unused compression model variable in AIAgent - Eliminated the `compression_model` variable from the AIAgent class, as it was not being utilized. - Cleaned up the context compressor initialization for improved clarity and maintainability.	2026-02-22 02:17:33 -08:00
teknium1	ededaaa874	Hermes Agent UX Improvements	2026-02-22 02:16:11 -08:00
teknium1	51b95236f9	refactor: move model metadata functions to agent/model_metadata.py - Relocated functions related to model metadata, including fetch_model_metadata, get_model_context_length, estimate_tokens_rough, and estimate_messages_tokens_rough, to agent/model_metadata.py for better organization and maintainability. - Updated imports in run_agent.py to reflect the new location of these functions.	2026-02-21 22:34:18 -08:00
teknium1	9123cfb5dd	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
teknium1	8f6788474b	feat: enhance logging in AIAgent for quiet mode - Added functionality to suppress logging noise from specific modules when in quiet mode, improving user experience in CLI. - Updated terminal_tool.py to change the log level for fallback directory usage from warning to debug, providing clearer context without cluttering logs.	2026-02-21 12:41:05 -08:00
teknium1	c98ee98525	feat: implement interactive prompts for sudo password and command approval in CLI - Added methods for handling sudo password and dangerous command approval prompts using a callback mechanism in cli.py. - Integrated these prompts with the prompt_toolkit UI for improved user experience. - Updated terminal_tool.py to support callback registration for interactive prompts, enhancing the CLI's interactivity. - Introduced a background thread for API calls in run_agent.py to allow for interrupt handling during long-running operations. - Enhanced error handling for interrupted API calls, ensuring graceful degradation of user experience.	2026-02-21 12:15:40 -08:00
teknium1	ecb430effe	refactor: enhance API interaction and message handling in AIAgent - Introduced new methods in run_agent.py for building API keyword arguments and normalizing assistant messages from API responses. - Added functionality for compressing conversation context and managing session state in SQLite. - Improved tool call execution handling, including enhanced logging and error management. - Updated path handling in multiple platform files to utilize pathlib for better compatibility and readability.	2026-02-21 04:17:27 -08:00
teknium1	748fd3db88	refactor: enhance error handling with structured logging across multiple modules - Updated various modules including cli.py, run_agent.py, gateway, and tools to replace silent exception handling with structured logging. - Improved error messages to provide more context, aiding in debugging and monitoring. - Ensured consistent logging practices throughout the codebase, enhancing traceability and maintainability.	2026-02-21 03:32:11 -08:00
teknium1	a885d2f240	refactor: implement structured logging across multiple modules - Introduced logging functionality in cli.py, run_agent.py, scheduler.py, and various tool modules to replace print statements with structured logging. - Enhanced error handling and informational messages to improve debugging and monitoring capabilities. - Ensured consistent logging practices across the codebase, facilitating better traceability and maintenance.	2026-02-21 03:11:11 -08:00
teknium1	3555c6173d	refactor: remove temporary API payload logging and enhance session log structure - Eliminated the `_log_api_payload` method used for temporary debugging, streamlining the codebase. - Updated the `_save_session_log` method to save the full raw session, including all messages and metadata, improving the clarity and completeness of session logs. - Adjusted session log entry to include additional context such as `base_url` and `platform` for better tracking.	2026-02-21 01:26:37 -08:00
teknium1	3976962621	fix: update session logging directory path in README and code - Changed the session logging directory from `~/.hermes-agent/logs/` to `~/.hermes/sessions/` for consistency. - Updated the `run_agent.py` to reflect the new logging path, ensuring session logs are stored correctly alongside gateway sessions.	2026-02-21 01:20:18 -08:00
teknium1	b33ed9176f	feat: update database schema and enhance message persistence - Incremented schema version to 2 and added a new column `finish_reason` to the `messages` table. - Implemented a method to flush un-logged messages to the session database, ensuring data integrity during conversation interruptions. - Enhanced error handling to persist messages in various early-return scenarios, preventing data loss.	2026-02-21 00:05:39 -08:00
teknium1	70dd3a16dc	Cleanup time!	2026-02-20 23:23:32 -08:00
teknium1	cfef34f7a6	feat: add multi-provider authentication and inference provider selection - Implemented a multi-provider authentication system for the Hermes Agent, supporting OAuth for Nous Portal and traditional API key methods for OpenRouter and custom endpoints. - Enhanced CLI with commands for logging in and out of providers, allowing users to authenticate and manage their credentials easily. - Updated configuration options to select inference providers, with detailed documentation on usage and setup. - Improved status reporting to include authentication status and provider details, enhancing user awareness of their current configuration. - Added new files for authentication handling and updated existing components to integrate the new provider system.	2026-02-20 17:24:00 -08:00
teknium1	ba07d9d5e3	feat: enhance task delegation with spinner updates and progress display - Added a spinner to visually indicate task delegation progress in quiet mode, improving user experience during batch processing. - Implemented a method to update spinner text dynamically based on remaining tasks, providing real-time feedback. - Enhanced the `delegate_task` function to include per-task completion messages, ensuring clarity on task status during execution. - Updated the KawaiiSpinner class to allow message updates while running, facilitating better interaction during long-running tasks.	2026-02-20 03:23:23 -08:00
teknium1	90e5211128	feat: implement subagent delegation for task management - Introduced the `delegate_task` tool, allowing the main agent to spawn child AIAgent instances with isolated context for complex tasks. - Supported both single-task and batch processing (up to 3 concurrent tasks) to enhance task management capabilities. - Updated configuration options for delegation, including maximum iterations and default toolsets for subagents. - Enhanced documentation to provide clear guidance on using the delegation feature and its configuration. - Added comprehensive tests to ensure the functionality and reliability of the delegation logic.	2026-02-20 03:15:53 -08:00
teknium1	f9eb5edb96	refactor: rename search tool for clarity and consistency - Updated the tool name from "search" to "search_files" across multiple files to better reflect its functionality. - Adjusted related documentation and descriptions to ensure clarity in usage and expected behavior. - Enhanced the toolset definitions and mappings to incorporate the new naming convention, improving overall consistency in the codebase.	2026-02-20 02:43:57 -08:00
teknium1	783acd712d	feat: implement code execution sandbox for programmatic tool calling - Introduced a new `execute_code` tool that allows the agent to run Python scripts that call Hermes tools via RPC, reducing the number of round trips required for tool interactions. - Added configuration options for timeout and maximum tool calls in the sandbox environment. - Updated the toolset definitions to include the new code execution capabilities, ensuring integration across platforms. - Implemented comprehensive tests for the code execution sandbox, covering various scenarios including tool call limits and error handling. - Enhanced the CLI and documentation to reflect the new functionality, providing users with clear guidance on using the code execution tool.	2026-02-19 23:23:43 -08:00
teknium1	9350e26e68	feat: introduce clarifying questions tool for interactive user engagement - Added a new `clarify_tool` to enable the agent to ask structured multiple-choice or open-ended questions to users. - Implemented callback functionality for user interaction, allowing the platform to handle UI presentation. - Updated the CLI and agent to support clarify questions, including timeout handling and response management. - Enhanced toolset definitions and requirements to include the clarify tool, ensuring availability across platforms.	2026-02-19 20:06:14 -08:00
teknium1	4d5f29c74c	feat: introduce skill management tool for agent-created skills and skills migration to ~/.hermes - Added a new `skill_manager_tool` to enable agents to create, update, and delete their own skills, enhancing procedural memory capabilities. - Updated the skills directory structure to support user-created skills in `~/.hermes/skills/`, allowing for better organization and management. - Enhanced the CLI and documentation to reflect the new skill management functionalities, including detailed instructions on creating and modifying skills. - Implemented a manifest-based syncing mechanism for bundled skills to ensure user modifications are preserved during updates.	2026-02-19 18:25:53 -08:00
teknium1	3f4b494c61	refactor: streamline thinking spinner behavior in AIAgent - Updated the logic for stopping the thinking spinner to improve clarity in tool execution messages. - Removed unnecessary checks for tool calls, simplifying the spinner's stop behavior while maintaining informative output for users.	2026-02-19 01:56:04 -08:00
teknium1	56ee8a5cc6	refactor: remove 'read' action from memory tool and agent logging - Eliminated the 'read' action from the memory tool and related logging in the agent, streamlining the available actions to 'add', 'replace', and 'remove'. - Updated error messages and documentation to reflect the removal of the 'read' action, ensuring clarity in the API's usage.	2026-02-19 01:03:08 -08:00
teknium1	440c244cac	feat: add persistent memory system + SQLite session store Two-part implementation: Part A - Curated Bounded Memory: - New memory tool (tools/memory_tool.py) with MEMORY.md + USER.md stores - Character-limited (2200/1375 chars), § delimited entries - Frozen snapshot injected into system prompt at session start - Model manages pruning via replace/remove with substring matching - Usage indicator shown in system prompt header Part B - SQLite Session Store: - New hermes_state.py with SessionDB class, FTS5 full-text search - Gateway session.py rewritten to dual-write SQLite + legacy JSONL - Compression-triggered session splitting with parent_session_id chains - New session_search tool with Gemini Flash summarization of matched sessions - CLI session lifecycle (create on launch, close on exit) Also: - System prompt now cached per session, only rebuilt on compression (fixes prefix cache invalidation from date/time changes every turn) - Config version bumped to 3, hermes doctor checks for new artifacts - Disabled in batch_runner and RL environments	2026-02-19 00:57:31 -08:00
teknium1	d7cef744ec	Add autocomplete and multiline support in HermesCLI input - Introduced SlashCommandCompleter for command autocompletion, enhancing user experience by suggesting commands as users type. - Enabled multiline input with Shift+Enter, allowing users to enter longer messages more conveniently. - Implemented paste detection to handle large text inputs, saving them to temporary files and replacing them with compact references in the input area. - Updated input area styling and hint display to improve usability and feedback during agent operation.	2026-02-17 21:47:54 -08:00
teknium1	a7f52911e1	Refactor CLI output formatting in AIAgent - Removed ANSI escape codes for color in tool activity messages to simplify output. - Updated the _get_cute_tool_message method to provide a cleaner, more consistent format for various tool activities. - Enhanced readability by aligning messages and removing unnecessary complexity, ensuring a more straightforward user experience.	2026-02-17 21:29:23 -08:00
teknium1	1e31614572	Refactor tool activity messages in AIAgent for improved CLI output - Introduced ANSI escape codes for color-coded CLI messages to enhance readability. - Updated the _get_cute_tool_message method to generate clean, aligned activity lines for various tools, replacing kawaii ASCII art with a more structured format. - Simplified message construction for web tools, terminal commands, and process management, ensuring consistent and scannable output.	2026-02-17 21:26:41 -08:00
teknium1	3b615b0f7a	Enhance tool previews in AIAgent and GatewayRunner - Updated the _build_tool_preview function to include detailed previews for new tools: 'todo', 'send_message', and various 'rl_' tools, improving user feedback during task execution. - Added emoji representations for tools in GatewayRunner, including 'process', 'todo', and 'send_message', to enhance visual clarity in progress messages. - Improved handling of task management and messaging outputs, ensuring more informative and user-friendly interactions.	2026-02-17 17:11:31 -08:00
teknium1	e184f5ab3a	Add todo tool for agent task planning and management Single `todo` tool that reads (no params) or writes (provide todos array with merge flag). In-memory TodoStore on AIAgent, no system prompt mutation, behavioral guidance in tool description only. State re-injected after context compression events. Gateway sessions hydrate from conversation history. Added to all platform toolsets. Also wired into RL agent_loop.py with per-run TodoStore and fixed browser_snapshot user_task passthrough from first user message.	2026-02-17 17:02:33 -08:00
teknium1	6731230d73	Add special handling for 'process' tool in _build_tool_preview function - Enhanced the _build_tool_preview function to include specific formatting for the 'process' tool, displaying action, session_id, data, and timeout when applicable. - This update improves the clarity of tool previews, particularly for actions that require session tracking and timeout management.	2026-02-17 03:18:27 -08:00
teknium1	48b5cfd085	Add skip_context_files option to AIAgent for batch processing - Introduced a new parameter `skip_context_files` in the AIAgent class to control the inclusion of context files (SOUL.md, AGENTS.md, .cursorrules) in the system prompt. - Updated the _process_single_prompt function to set `skip_context_files` to True, preventing pollution of trajectories during batch processing and data generation.	2026-02-16 22:40:31 -08:00
teknium1	84718d183a	Add platform-specific formatting hints and identity for AIAgent - Introduced a default agent identity prompt to ensure consistent behavior across platforms. - Added platform-specific formatting hints for CLI, WhatsApp, Telegram, and Discord to guide the agent's output style. - Updated the AIAgent initialization to accept a platform parameter, enhancing adaptability to different interfaces.	2026-02-12 16:11:16 -08:00
teknium1	3099a2f53c	Add timestamp to active system prompt in AIAgent - Appended the current local date and time to the active system prompt to provide context for the model, addressing potential misinterpretations due to training cutoffs.	2026-02-12 15:59:31 -08:00
teknium1	f5be6177b2	Add Text-to-Speech (TTS) functionality with multiple providers Add tool previews Add AGENTS and SOUL.md support Add Exec Approval	2026-02-12 10:05:08 -08:00
teknium1	153cd5bb44	Refactor skills tool integration and enhance system prompt - Removed the skills_categories tool from the skills toolset, streamlining the skills functionality to focus on skills_list and skill_view. - Updated the system prompt to dynamically build a compact skills index, allowing the model to quickly reference available skills without additional tool calls. - Cleaned up related code and documentation to reflect the removal of skills_categories, ensuring clarity and consistency across the codebase.	2026-02-10 19:48:38 -08:00
teknium1	cfe2f3fe15	Implement interrupt handling for long-running tool executions in AIAgent - Added functionality to signal and terminate long-running terminal commands when a new user message is received, allowing for immediate agent response. - Introduced a global interrupt event in the terminal tool to facilitate early termination of subprocesses. - Updated the AIAgent class to handle interrupts gracefully, ensuring that remaining tool calls are skipped and appropriate messages are returned to maintain valid message sequences.	2026-02-10 16:34:27 -08:00
teknium	1b1307d0d1	Implement Anthropic prompt caching for Claude models via OpenRouter - Introduced a caching strategy that reduces input token costs by ~75% on multi-turn conversations by caching the conversation prefix. - Added functions to apply cache control markers to messages, enhancing efficiency in token usage. - Updated AIAgent to auto-enable prompt caching for Claude models, with configurable cache TTL. - Enhanced logging to track cache hit statistics when caching is active, improving monitoring of token usage.	2026-02-10 06:49:41 +00:00
teknium	dd70d57b9b	Refactor BatchRunner and AIAgent for enhanced reasoning and tool management, improved tool definitions for fileops - Updated `ALL_POSSIBLE_TOOLS` to auto-derive from `TOOL_TO_TOOLSET_MAP` for consistent schema. - Introduced `_extract_reasoning_stats` function to track reasoning coverage in assistant turns. - Enhanced `_process_batch_worker` to discard prompts with no reasoning and aggregate reasoning statistics. - Updated documentation and comments for clarity on new features and changes.	2026-02-08 20:19:14 +00:00
teknium	f12ea1bc02	Enhance BatchRunner and AIAgent with new configuration options, default model now opus 4.6, default summarizer gemini flash 3 - Added `max_tokens`, `reasoning_config`, and `prefill_messages` parameters to `BatchRunner` and `AIAgent` for improved model response control. - Updated CLI to support new options for reasoning effort and prefill messages from a JSON file. - Modified example configuration files to reflect changes in default model and summary model. - Improved error handling for loading prefill messages and reasoning configurations in the CLI. - Updated documentation to include new parameters and usage examples.	2026-02-08 10:49:24 +00:00
teknium1	3c0d0dba49	Update RL tools and enhance configuration management - Modified `model_tools.py` to update default model IDs and add new RL function `rl_test_inference`. - Enhanced `README.md` with installation instructions for submodules and updated API key usage. - Improved `rl_cli.py` to load configuration from `~/.hermes/config.yaml` and set terminal working directory for RL tools. - Updated `run_agent.py` to handle empty string arguments as empty objects for better JSON validation. - Refined installation scripts to ensure submodules are cloned and installed correctly, enhancing setup experience.	2026-02-04 13:57:59 -08:00
teknium1	9bfe185a2e	Implement interrupt handling for agent and CLI input and persistent prompt line at bottom of CLI :) - Enhanced the AIAgent class to support interrupt requests, allowing for graceful interruption of ongoing tasks and processing of new messages. - Updated the HermesCLI to manage user input in a persistent manner, enabling real-time interruption of the agent's conversation. - Introduced a mechanism in the GatewayRunner to handle incoming messages while an agent is running, allowing for immediate response to user commands. - Improved overall user experience by providing feedback during interruptions and ensuring that pending messages are processed correctly.	2026-02-03 16:15:49 -08:00
teknium1	beeb7896e0	Refactor message handling and error logging in agent and gateway - Updated the AIAgent class to extract the first user message for trajectory formatting, improving the accuracy of user queries in the trajectory format. - Enhanced the GatewayRunner to convert transcript history into the agent format, ensuring proper handling of message roles and content. - Adjusted the typing indicator refresh rate to every 2 seconds for better responsiveness. - Improved error handling in the message sending process for the Telegram adapter, implementing a fallback mechanism for Markdown parsing failures, and logging send failures for better debugging.	2026-02-03 15:42:54 -08:00
teknium1	212460289b	Enhance skills tool to have an arg so it is more reliably called, and error handling in agent - Updated the `skills_categories` function to include a `verbose` parameter, allowing users to request skill counts per category. - Modified the `handle_skills_function_call` method to pass the `verbose` argument to `skills_categories`. - Improved error handling in the `AIAgent` class by injecting a recovery message when invalid JSON arguments are detected, guiding users on how to correct their tool calls. - Enhanced the `GatewayRunner` to return a user-friendly error message if the agent fails to generate a final response, improving overall user experience.	2026-02-03 15:26:59 -08:00
teknium1	e7f0ffbf5d	Add tool progress notifications for messaging channels - Introduced a new callback mechanism in the AIAgent class to send tool progress messages during execution, enhancing user feedback in messaging platforms. - Updated the GatewayRunner to support tool progress notifications, allowing users to enable or disable this feature via environment variables. - Enhanced the CLI setup wizard to prompt users for enabling tool progress messages and selecting the notification mode (all or new), improving configuration options. - Updated relevant documentation to reflect the new features and configuration settings for tool progress notifications.	2026-02-03 14:54:43 -08:00
teknium1	7eac4ee9fe	Update agent configuration for maximum tool-calling iterations - Increased the default maximum tool-calling iterations from 20 to 60 in the CLI configuration and related files, allowing for more complex tasks. - Updated documentation and comments to reflect the new recommended range for iterations, enhancing user guidance. - Implemented backward compatibility for loading max iterations from the root-level configuration, ensuring a smooth transition for existing users. - Adjusted the setup wizard to prompt for the maximum iterations setting, improving user experience during configuration.	2026-02-03 14:48:19 -08:00
teknium1	e114f09f70	Implement reasoning extraction and enhance assistant message handling - Added a new method `_extract_reasoning` to extract reasoning content from assistant messages, accommodating multiple formats from various providers. - Updated message handling to ensure all assistant messages include reasoning content for API compatibility, preserving multi-turn reasoning context. - Enhanced logging to capture reasoning details for debugging and analysis. - Modified the TODO.md to reflect changes in planning and task management, emphasizing the need for structured task decomposition and progress tracking.	2026-02-01 22:48:18 -08:00
teknium1	9b4d9452ba	Add context compression feature for long conversations - Implemented automatic context compression to manage long conversations that approach the model's context limit. - Configured the feature to summarize middle turns while protecting the first three and last four turns, ensuring important context is retained. - Added configuration options in `cli-config.yaml` and environment variables for enabling/disabling compression and setting thresholds. - Updated documentation in `README.md`, `cli.md`, and `.env.example` to explain the context compression functionality and its configuration. - Enhanced the `cli.py` to load compression settings into environment variables, ensuring seamless integration with the CLI. - Completed the implementation of context compression as outlined in the TODO list, marking it as a significant enhancement to conversation management.	2026-02-01 18:01:31 -08:00
teknium1	bbeed5b5d1	Enhance session logging and interactive sudo support - Implemented automatic session logging, saving conversation trajectories to the `logs/` directory in JSON format, with each session having a unique identifier. - Updated the CLI to display the session ID in the welcome banner for easy reference. - Introduced an interactive sudo password prompt in CLI mode, allowing users to enter their password with a 45-second timeout, enhancing user experience during command execution. - Documented session logging and interactive sudo features in `README.md`, `cli.md`, and `cli-config.yaml.example` for better user guidance.	2026-02-01 15:36:26 -08:00
teknium1	32254d3010	Add skills guidance to system prompts in run_agent.py - Introduced a default skills guidance prompt to assist the model in checking relevant skills before technical tasks. - Updated the logic in AIAgent to auto-include skills guidance when skills tools are available, enhancing the model's contextual understanding during API calls.	2026-02-01 01:31:59 -08:00
teknium	bc76a032ba	Add a claude code-like CLI - Introduced `cli-config.yaml.example` to provide a template for configuring the CLI behavior, including model settings, terminal tool configurations, agent behavior, and toolsets. - Created `cli.py` for an interactive terminal interface, allowing users to start the Hermes Agent with various options and toolsets. - Added `hermes` launcher script for convenient CLI access. - Updated `model_tools.py` to support quiet mode for suppressing output during tool initialization and execution. - Enhanced logging in various tools to respect quiet mode, improving user experience by reducing unnecessary output. - Added `prompt_toolkit` to `requirements.txt` for improved CLI interaction capabilities. - Created `TODO.md` for future improvements and enhancements to the Hermes Agent framework.	2026-01-31 06:30:48 +00:00
teknium	8e8b6be690	Add timeout configuration for trajectory processing - Updated `trajectory_compression.yaml` to include a new `per_trajectory_timeout` setting, allowing for a timeout of 300 seconds per trajectory. This enhancement helps prevent hanging on problematic entries during processing, improving overall reliability and efficiency in trajectory handling.	2026-01-30 07:34:58 +00:00
teknium	4c05ef0ba8	Enhance logging and tool initialization for improved performance - Updated logging configuration in `run_agent.py` to suppress debug messages from additional third-party libraries, reducing noise in logs. - Enhanced shell scripts for terminal tasks to utilize Singularity for containerized execution, including pre-build SIF image logic and improved logging. - Refactored tool initialization in `mixture_of_agents_tool.py`, `vision_tools.py`, and `web_tools.py` to implement lazy loading of API clients, optimizing resource usage and error handling. - Updated ephemeral system prompts in shell scripts to provide clearer guidance on task execution and resource usage.	2026-01-29 19:59:59 +00:00
teknium	248acf715e	Add browser automation tools and enhance environment configuration - Introduced new browser automation tools in `browser_tool.py` for navigating, interacting with, and extracting content from web pages using the agent-browser CLI and Browserbase cloud execution. - Updated `.env.example` to include new configuration options for Browserbase API keys and session settings. - Enhanced `model_tools.py` and `toolsets.py` to integrate browser tools into the existing tool framework, ensuring consistent access across toolsets. - Updated `README.md` with setup instructions for browser tools and their usage examples. - Added new test script `test_modal_terminal.py` to validate Modal terminal backend functionality. - Improved `run_agent.py` to support browser tool integration and logging enhancements for better tracking of API responses.	2026-01-29 06:10:24 +00:00
teknium	ba19d530ad	Update environment configuration and enhance terminal tool integration - Updated `.env.example` to include new API keys and configuration options for the mini-swe-agent backend, including support for local, Docker, and Modal environments. - Added `.gitmodules` to include mini-swe-agent as a submodule for easier integration. - Refactored `mini_swe_runner.py` to use the updated model format and default to OpenRouter for API calls. - Enhanced `model_tools.py` to support the new terminal tool definitions and ensure compatibility with the mini-swe-agent backend. - Updated `README.md` to reflect changes in setup instructions and environment variable configurations. - Improved `terminal_tool.py` to manage execution environments and lifecycle, ensuring proper cleanup and error handling. - Introduced `terminal_hecate.py` for executing commands on MorphCloud VMs, providing an alternative backend for terminal operations.	2026-01-23 12:26:53 +00:00
teknium	b32cc4b09d	Refactor batch processing with rich progress tracking and update logging in AIAgent - Replaced tqdm with rich for enhanced visual progress tracking in batch processing. - Adjusted logging levels in AIAgent to suppress asyncio debug messages. - Modified datagen script to reduce number of workers for improved performance.	2026-01-14 14:02:59 +00:00
teknium	6e3dbb8d8b	Enhance batch processing with progress tracking and update AIAgent for OpenRouter detection - Integrated tqdm for progress tracking in batch processing, replacing map with imap_unordered for improved performance. - Added base_url attribute in AIAgent to facilitate OpenRouter detection.	2026-01-14 13:46:16 +00:00
teknium	13d360030f	Enhance tool normalization and API integration across modules - Introduced normalization functions for tool statistics and error counts to ensure consistent schema across all trajectory entries, facilitating compatibility with HuggingFace datasets. - Updated batch processing to utilize normalized tool stats and error counts, improving data integrity. - Refactored vision tools and mixture of agents tool to integrate with OpenRouter API, replacing Nous Research API references and updating model configurations. - Enabled reasoning capabilities in API calls for enhanced response quality across various tools. - Improved error handling and API key validation for OpenRouter integration.	2026-01-14 13:40:10 +00:00
teknium	66daebe88f	Implement enhanced response handling and tool call validation in run_agent - Added methods to check for meaningful content after <think> blocks and to retrieve messages up to the last complete assistant turn. - Introduced retry logic for handling truncated responses and invalid JSON arguments in tool calls, with a maximum retry limit. - Improved logging for invalid JSON and empty responses, ensuring better error tracking and handling. - Updated the batch data generation script to adjust dataset file, batch size, and ephemeral system prompt for improved context management.	2026-01-10 13:04:43 +00:00
teknium	4071ba29da	Enhance batch processing and tool validation - Added support for tracking partial results and tool error counts in batch processing. - Implemented filtering of corrupted entries during batch file combination based on valid tool names. - Updated terminal tool to improve command execution and error handling, including retry logic for transient failures. - Refactored model tools to use a simple terminal tool with no session persistence. - Improved logging and error messages for invalid API responses and tool calls. - Introduced chunked processing for large content in web tools to manage size limitations effectively.	2026-01-10 05:56:26 +00:00
Teknium	80d326310e	Merge branch 'main' into speed-upgrades	2026-01-08 01:03:34 -08:00
teknium	6af6ff2a0a	updates for stability and speed	2026-01-08 08:57:51 +00:00
hjc-puro	1614c15bb1	rate limits	2025-11-17 18:35:36 -05:00
hjc-puro	0c618482c4	add logging of prefix of tool call and tool response	2025-11-07 14:43:44 -05:00
hjc-puro	2d8f6c46f1	log first 20 chars	2025-11-07 14:08:06 -05:00
Teknium	4135cf4682	Merge branch 'main' into test	2025-11-04 19:54:40 -08:00
teknium	c82741c3d8	some cleanups	2025-11-05 03:47:17 +00:00
hjc-puro	fbd3a2fdb8	prevent leakage of morph instances between tasks	2025-11-04 03:32:43 -05:00
hjc-puro	a4db3fdee5	fix leakage	2025-11-03 17:42:23 -05:00
teknium	de9c0edc51	some bugfixes	2025-10-15 18:07:06 +00:00
teknium	d36790de91	Add ephemeral system prompt support in batch and agent runners. Update README with usage examples and documentation for the new feature. Ensure prompt is not saved to trajectories.	2025-10-08 02:33:58 +00:00
teknium	0411ca1880	Add environment configuration file, restructure tool imports, and enhance README setup instructions	2025-10-01 09:54:17 +00:00
Teknium	c5386ed7e6	add better logging when requests fail	2025-09-10 00:51:41 -07:00
Teknium	17608c1142	Update to use toolsets and make them easy to create and configure	2025-09-10 00:43:55 -07:00
Teknium	587d1cf720	Fix Web Tools, Upgrade MoA to GPT5, Add Trajectory Saving	2025-08-31 03:04:10 -07:00
Teknium	cde7e64418	add vision model tool, cli updates for exclusive and inclusive toolsets	2025-08-04 00:14:16 -07:00
hjc-puro	a49596cbb2	terminal tool	2025-07-26 04:31:17 +00:00
hjc-puro	122d8788ae	terminal tool	2025-07-25 15:15:36 +00:00
Teknium	21d80ca683	initital commit	2025-07-22 18:32:44 -07:00

... 4 5 6 7 8 ...

438 Commits