hermes-agent-features

Author	SHA1	Message	Date
Teknium	91eea7544f	refactor(creative): promote pixel-art from optional to built-in skills	2026-04-19 11:57:51 -07:00
Teknium	ea0bd81b84	feat(skills): consolidate find-nearby into maps as a single location skill find-nearby and the (new) maps optional skill both used OpenStreetMap's Overpass + Nominatim to answer the same question — 'what's near this location?' — so shipping both would be duplicate code for overlapping capability. Consolidate into one active-by-default skill at skills/productivity/maps/ that is a strict superset of find-nearby. Moves + deletions: - optional-skills/productivity/maps/ → skills/productivity/maps/ (active, no install step needed) - skills/leisure/find-nearby/ → DELETED (fully superseded) Upgrades to maps_client.py so it covers everything find-nearby did: - Overpass server failover — tries overpass-api.de then overpass.kumi.systems so a single-mirror outage doesn't break the skill (new overpass_query helper, used by both nearby and bbox) - nearby now accepts --near "<address>" as a shortcut that auto-geocodes, so one command replaces the old 'search → copy coords → nearby' chain - nearby now accepts --category (repeatable) for multi-type queries in one call (e.g. --category restaurant --category bar), results merged and deduped by (osm_type, osm_id), sorted by distance, capped at --limit - Each nearby result now includes maps_url (clickable Google Maps search link) and directions_url (Google Maps directions from the search point — only when a ref point is known) - Promoted commonly-useful OSM tags to top-level fields on each result: cuisine, hours (opening_hours), phone, website — instead of forcing callers to dig into the raw tags dict SKILL.md: - Version bumped 1.1.0 → 1.2.0, description rewritten to lead with capability surface - New 'Working With Telegram Location Pins' section replacing find-nearby's equivalent workflow - metadata.hermes.supersedes: [find-nearby] so tooling can flag any lingering references to the old skill External references updated: - optional-skills/productivity/telephony/SKILL.md — related_skills find-nearby → maps - website/docs/reference/skills-catalog.md — removed the (now-empty) 'leisure' section, added 'maps' row under productivity - website/docs/user-guide/features/cron.md — find-nearby example usages swapped to maps - tests/tools/test_cronjob_tools.py, tests/hermes_cli/test_cron.py, tests/cron/test_scheduler.py — fixture string values swapped - cli.py:5290 — /cron help-hint example swapped Not touched: - RELEASE_v0.2.0.md — historical record, left intact E2E-verified live (Nominatim + Overpass, one query each): - nearby --near "Times Square" --category restaurant --category bar → 3 results, sorted by distance, all with maps_url, directions_url, cuisine, phone, website where OSM had the tags All 111 targeted tests pass across tests/cron/, tests/tools/, tests/hermes_cli/.	2026-04-19 05:19:22 -07:00
Teknium	206a449b29	feat(webhook): direct delivery mode for zero-LLM push notifications (#12473 ) External services can now push plain-text notifications to a user's chat via the webhook adapter without invoking the agent. Set deliver_only=true on a route and the rendered prompt template becomes the literal message body — dispatched directly to the configured target (Telegram, Discord, Slack, GitHub PR comment, etc.). Reuses all existing webhook infrastructure: HMAC-SHA256 signature validation, per-route rate limiting, idempotency cache, body-size limits, template rendering with dot-notation, home-channel fallback. No new HTTP server, no new auth scheme, no new port. Use cases: Supabase/Firebase webhooks → user notifications, monitoring alert forwarding, inter-agent pings, background job completion alerts. Changes: - gateway/platforms/webhook.py: new _direct_deliver() helper + early dispatch branch in _handle_webhook when deliver_only=true. Startup validation rejects deliver_only with deliver=log. - hermes_cli/main.py + hermes_cli/webhook.go: --deliver-only flag on subscribe; list/show output marks direct-delivery routes. - website/docs/user-guide/messaging/webhooks.md: new Direct Delivery Mode section with config example, CLI example, response codes. - skills/devops/webhook-subscriptions/SKILL.md: document --deliver-only with use cases (bumped to v1.1.0). - tests/gateway/test_webhook_deliver_only.py: 14 new tests covering agent bypass, template rendering, status codes, HMAC still enforced, idempotency still applies, rate limit still applies, startup validation, and direct-deliver dispatch. Validation: 78 webhook tests pass (64 existing + 14 new). E2E verified with real aiohttp server + real urllib POST — agent not invoked, target adapter.send() called with rendered template, duplicate delivery_id suppressed. Closes the gap identified in PR #12117 (thanks to @H1an1 / Antenna team) without adding a second HTTP ingress server.	2026-04-19 05:18:19 -07:00
Teknium	66ee081dc1	skills: move 7 niche mlops/mcp skills to optional (#12474 ) Built-in → optional-skills/: mlops/training/peft → optional-skills/mlops/peft mlops/training/pytorch-fsdp → optional-skills/mlops/pytorch-fsdp mlops/models/clip → optional-skills/mlops/clip mlops/models/stable-diffusion → optional-skills/mlops/stable-diffusion mlops/models/whisper → optional-skills/mlops/whisper mlops/cloud/modal → optional-skills/mlops/modal mcp/mcporter → optional-skills/mcp/mcporter Built-in mlops training kept: axolotl, trl-fine-tuning, unsloth. Built-in mlops models kept: audiocraft, segment-anything. Built-in mlops evaluation/research/huggingface-hub/inference all kept. native-mcp stays built-in (documents the native MCP tool); mcporter was a redundant alternative CLI. Also: removed now-empty skills/mlops/cloud/ dir, refreshed skills/mlops/models/DESCRIPTION.md and skills/mcp/DESCRIPTION.md to match what's left, and synchronized both catalog pages (skills-catalog.md, optional-skills-catalog.md).	2026-04-19 05:14:17 -07:00
Teknium	6b31e20894	chore(skills): touchdesigner-mcp follow-ups - Remove orphan skills/creative/touchdesigner/references/pitfalls.md left over from the rename commit (git add-then-edit instead of git mv meant the old file never got deleted). - Honour $HERMES_HOME in setup.sh and SKILL.md setup invocation so profile-aware installs work correctly. - Fix troubleshooting.md config path to use $HERMES_HOME instead of hardcoding ~/.hermes/. - Add touchdesigner-mcp entries to skills-catalog.md and optional-skills-catalog.md for parity with blender-mcp/meme-generation.	2026-04-18 17:43:42 -07:00
kshitijk4poor	6d2fe1d624	feat: rename touchdesigner -> touchdesigner-mcp, move to optional-skills/ - Rename skill to touchdesigner-mcp (matches blender-mcp convention) - Move from skills/creative/ to optional-skills/creative/ - Fix duplicate pitfall numbering (#3 appeared twice) - Update SKILL.md cross-references for renumbered pitfalls - Update setup.sh path for new directory location	2026-04-18 17:43:42 -07:00
kshitijk4poor	6f27390fae	feat: rewrite TouchDesigner skill for twozero MCP (v2.0.0) Major rewrite of the TouchDesigner skill: - Replace custom API handler with twozero MCP (36 native tools) - Add audio-reactive GLSL proven recipe (spectrum chain, pitfalls) - Add recording checklist (FPS>0, non-black, audio cueing) - Expand pitfalls: 38 entries from real sessions (was 20) - Update network-patterns with MCP-native build scripts - Rewrite mcp-tools reference for twozero v2.774+ - Update troubleshooting for MCP-based workflow - Remove obsolete custom_api_handler.py - Generalize Environment section for all users - Remove session-specific Paired Skills section - Bump version to 2.0.0	2026-04-18 17:43:42 -07:00
kshitijk4poor	7a5371b20d	feat: add TouchDesigner integration skill New skill: creative/touchdesigner — control a running TouchDesigner instance via REST API. Build real-time visual networks programmatically. Architecture: Hermes Agent -> HTTP REST (curl) -> TD WebServer DAT -> TD Python env Key features: - Custom API handler (scripts/custom_api_handler.py) that creates a self-contained WebServer DAT + callback in TD. More reliable than the official mcp_webserver_base.tox which frequently fails module imports. - Discovery-first workflow: never hardcode TD parameter names. Always probe the running instance first since names change across versions. - Persistent setup: save the TD project once with the API handler baked in. TD auto-opens the last project on launch, so port 9981 is live with zero manual steps after first-time setup. - Works via curl in execute_code (no MCP dependency required). - Optional MCP server config for touchdesigner-mcp-server npm package. Skill structure (2823 lines total): SKILL.md (209 lines) — setup, workflow, key rules, operator reference references/pitfalls.md (276 lines) — 24 hard-won lessons references/operators.md (239 lines) — all 6 operator families references/network-patterns.md (589 lines) — audio-reactive, generative, video processing, GLSL, instancing, live performance recipes references/mcp-tools.md (501 lines) — 13 MCP tool schemas references/python-api.md (443 lines) — TD Python scripting patterns references/troubleshooting.md (274 lines) — connection diagnostics scripts/custom_api_handler.py (140 lines) — REST API handler for TD scripts/setup.sh (152 lines) — prerequisite checker Tested on TouchDesigner 099 Non-Commercial (macOS/darwin).	2026-04-18 17:43:42 -07:00
Teknium	0175ff7516	feat(skills): replace xitter with xurl — the official X API CLI (#12303 ) Swap the social-media/xitter skill (third-party wrapper around Infatoshi/x-cli) for a new social-media/xurl skill wrapping xdevplatform/xurl — the official X API CLI from the X developer platform team. Why: - xurl is officially maintained by the X dev platform team - OAuth 2.0 PKCE with auto-refresh + multi-app / multi-user support (vs. xitter's 5-env-var OAuth 1.0a + single account) - Credentials stored in ~/.xurl managed by xurl itself — no manual env var juggling for users - Substantially larger API surface: DMs, follows, blocks, mutes, media upload, streaming, and raw v2 endpoint access - Ships stronger agent-safety guardrails (forbidden-flag list, no --verbose in agent mode, never-read-~/.xurl rule) Adaptation: - Ported the openclaw SKILL.md (which the xdevplatform team seeded) to Hermes frontmatter conventions (prerequisites.commands, platforms, metadata.hermes.tags/homepage) — dropped openclaw-specific metadata - Added a Hermes-oriented one-time user setup section so the agent knows to direct the user to run auth commands themselves, never execute them with inline secrets - Preserved the mandatory secret-safety rules verbatim - Attribution block credits xdevplatform, openclaw, and the Hermes port Docs: updated website/docs/reference/skills-catalog.md to replace the xitter row with xurl.	2026-04-18 15:11:32 -07:00
Teknium	ade7958f1f	docs: add PORT_NOTES.md for baoyu-infographic Documents what changed from upstream and how to sync future updates.	2026-04-18 12:32:16 -07:00
Teknium	65c0a30a77	feat(skills): add baoyu-infographic skill — 21 layouts × 21 styles Port of baoyu-infographic from JimLiu/baoyu-skills (v1.56.1) adapted for Hermes Agent's tool ecosystem. Adaptations from upstream: - Frontmatter: openclaw metadata → hermes metadata - Usage: slash command syntax → natural language triggers - Removed EXTEND.md config system (not part of Hermes infrastructure) - AskUserQuestion → clarify tool (one question at a time) - Image generation → image_generate tool - Removed Windows-specific paths - Simplified file operations to use Hermes file tools - All 45 reference files (layouts, styles, templates) preserved intact Attribution preserved per agreement with 宝玉 (Jim Liu): - author, version, GitHub homepage URL in frontmatter Co-authored-by: Jim Liu 宝玉 <junminliu@gmail.com>	2026-04-18 12:32:16 -07:00
Teknium	73bccc94c7	skills: consolidate mlops redundancies (gguf+llama-cpp, grpo+trl, guidance→optional) (#11965 ) Three tightly-scoped built-in skill consolidations to reduce redundancy in the available_skills listing injected into every system prompt: 1. gguf-quantization → llama-cpp (merged) GGUF is llama.cpp's format; two skills covered the same toolchain. The merged llama-cpp skill keeps the full K-quant table + imatrix workflow from gguf and the ROCm/benchmarks/supported-models sections from the original llama-cpp. All 5 reference files preserved. 2. grpo-rl-training → fine-tuning-with-trl (folded in) GRPO isn't a framework, it's a trainer inside TRL. Moved the 17KB deep-dive SKILL.md to references/grpo-training.md and the working template to templates/basic_grpo_training.py. TRL's GRPO workflow section now points to both. Atropos skill's related_skills updated. 3. guidance → optional-skills/mlops/ Dropped from built-in. Outlines (still built-in) covers the same structured-generation ground with wider adoption. Listed in the optional catalog for users who specifically want Guidance. Net: 3 fewer built-in skill lines in every system prompt, zero content loss. Contributor authorship preserved via git rename detection.	2026-04-17 21:36:40 -07:00
Teknium	fdefd98aa3	docs(skills): make descriptions self-contained, not cross-dependent Previous pass assumed both skills would always be loaded together, so each description pointed at the other ('use concept-diagrams instead'). That breaks when only one skill is active — the agent reads 'use the other skill' and there is no other skill. Now each skill's description and scope section is fully self-contained: - States what it's best suited for - Lists subjects where a more specialized skill (if available) would be a better fit, naming them only as 'consider X if available' - Explicitly offers itself as a general SVG diagram fallback when no more specialized skill exists An agent loading either skill alone gets unambiguous guidance; an agent with both loaded still gets useful routing via the 'consider X if available' hints and the related_skills metadata.	2026-04-16 20:39:55 -07:00
Teknium	7d535969ff	docs(skills): make architecture-diagram vs concept-diagrams routing explicit Both skills generate SVG system diagrams, but for very different subjects and aesthetics. The old descriptions didn't make the split clear, so an agent loading either one couldn't confidently pick. Changes: - Rewrote both frontmatter descriptions to state the scope up front plus an explicit 'for X, use the other skill instead' pointer. - Added a symmetric 'When to use this skill vs <other>' decision table to the top of each SKILL.md body, so the guidance is visible whether the agent is reading frontmatter or full content. - Added architecture-diagram <-> concept-diagrams to each other's related_skills metadata. Rule of thumb baked into both skills: software/cloud infra -> architecture-diagram physical / scientific / educational -> concept-diagrams	2026-04-16 20:39:55 -07:00
Teknium	80855f964e	fix: stop hermes update from nagging about llm-wiki's wiki.path (#11222 ) llm-wiki was the only shipped skill using metadata.hermes.config, which caused 'hermes update' and 'hermes config migrate' to prompt for a wiki directory on every run — even for users who have never touched the skill — because 'enabled' is opt-out (all shipped skills count as enabled unless explicitly disabled). Declining the prompt didn't persist anything, so the nag fired again on every update. Switch llm-wiki to the env var + runtime default pattern that obsidian and google-workspace already use: WIKI_PATH env var, default $HOME/wiki. No prompting infrastructure, no config.yaml touch, no nag loop. Changes: - skills/research/llm-wiki/SKILL.md: remove metadata.hermes.config, document WIKI_PATH env var in the Wiki Location section, update the orientation snippet and initialization guidance. - Docs: replace llm-wiki's wiki.path examples with a generic 'myplugin.path' placeholder across configuration.md, features/skills.md, and creating-skills.md so users don't try to set skills.config.wiki.path expecting llm-wiki to use it. - skills-catalog.md: mention WIKI_PATH instead of skills.config.wiki.path. E2E verified: discover_all_skill_config_vars() and get_missing_skill_config_vars() both return 0 entries after this change, so the prompt branch in migrate_config() no longer fires. The metadata.hermes.config feature stays in place for third-party skills that genuinely need structured config, but built-ins now prefer env vars.	2026-04-16 13:34:16 -07:00
LeonSGP43	daef0519e9	fix(google-workspace): normalize authorized user token writes	2026-04-16 04:22:16 -07:00
Teknium	5ef0fe1665	docs: fix stale hermes login references in hermes-agent skill (#10603 ) Follow-up to #10471 — replace remaining 'hermes login --provider' references with current 'hermes auth' flow.	2026-04-15 17:43:54 -07:00
Teknium	e402906d48	fix: five HERMES_HOME profile-isolation leaks (#10570 ) * fix: show correct env var name in provider API key error (#9506) The error message for missing provider API keys dynamically built the env var name as PROVIDER_API_KEY (e.g. ALIBABA_API_KEY), but some providers use different names (alibaba uses DASHSCOPE_API_KEY). Users following the error message set the wrong variable. Fix: look up the actual env var from PROVIDER_REGISTRY before building the error. Falls back to the dynamic name if the registry lookup fails. Closes #9506 * fix: five HERMES_HOME profile-isolation leaks (#5947) Bug A: Thread session_title from session_db to memory provider init kwargs so honcho can derive chat-scoped session keys instead of falling back to cwd-based naming that merges all gateway users into one session. Bug B: Replace 14 hardcoded ~/.hermes/skills/ paths across 10 skill files with HERMES_HOME-aware alternatives (${HERMES_HOME:-$HOME/.hermes} in shell, os.environ.get('HERMES_HOME', ...) in Python). Bug C: install.sh now respects HERMES_HOME env var and adds --hermes-home flag. Previously --dir only set INSTALL_DIR while HERMES_HOME was always hardcoded to $HOME/.hermes. Bug D: Remove hardcoded ~/.hermes/honcho.json fallback in resolve_config_path(). Non-default profiles no longer silently inherit the default profile's honcho config. Falls through to ~/.honcho/config.json (global) instead. Bug E: Guard _edit_skill, _patch_skill, _delete_skill, _write_file, and _remove_file against writing to skills found in external_dirs. Skills outside the local SKILLS_DIR are now read-only from the agent's perspective. Closes #5947	2026-04-15 17:09:41 -07:00
Misturi	dedc4600dd	fix(skills): handle missing fields in Google Workspace token file gracefully instead of crashing with KeyError	2026-04-15 03:45:09 -07:00
Misturi	8bc9b5a0b4	fix(skills): use `is None` check for coordinates in find-nearby to avoid dropping valid 0.0 values	2026-04-15 03:45:09 -07:00
Teknium	ef04de3e98	docs: update tool-adding instructions for auto-discovery - AGENTS.md: 3 files → 2 files, remove _discover_tools() step - adding-tools.md: remove Step 3, note auto-discovery - architecture.md: update discovery description - tools-runtime.md: replace manual list with discover_builtin_tools() docs - hermes-agent skill: remove manual import step	2026-04-14 21:12:29 -07:00
Teknium	b24e5ee4b0	feat(google-workspace): add --from flag for custom sender display name (#9931 ) Adds --from flag to gmail send and gmail reply commands, allowing agents to customize the From header display name when sharing the same email account. Usage: --from '"Agent Name" <user@example.com>' Also syncs repo google_api.py with the deployed standalone implementation (replaces outdated gws_bridge thin wrapper), adds dedicated docs page under Features > Skills, and updates sidebar navigation. Requested by community user @Maxime44.	2026-04-14 16:55:34 -07:00
Teknium	55ce76b372	feat: add architecture-diagram skill (Cocoon AI port) (#9906 ) Port of Cocoon AI's architecture-diagram-generator (MIT) as a Hermes skill. Generates professional dark-themed system architecture diagrams as standalone HTML/SVG files. Self-contained output, no dependencies. - SKILL.md with design system specs, color palette, layout rules - HTML template with all component types, arrow styles, legend examples - Fits alongside excalidraw in creative/ category Source: https://github.com/Cocoon-AI/architecture-diagram-generator	2026-04-14 16:10:18 -07:00
Teknium	3804556cd9	fix: restore clarify toolset row removed in cherry-pick	2026-04-13 02:49:11 -07:00
Haoqing Wang	8e0ae66520	fix(skills): correct TTS/STT providers, add missing platforms/commands in hermes-agent skill Fixes verified via 5-container parallel testing against v0.8.0 codebase. Critical fixes: - TTS providers: replace nonexistent kokoro/fish with actual minimax/mistral/neutts - STT providers: add missing mistral (Voxtral Transcribe) - Testing section: remove `source venv/bin/activate` (no venv dir in project) Expanded coverage: - Provider table: 13 → 22 entries (add Gemini, xAI, Xiaomi, Qwen OAuth, MiniMax CN, etc.) - Platform list: add BlueBubbles (iMessage) and Weixin (WeChat), clarify Open WebUI - Slash commands: add 14 undocumented commands (/approve, /deny, /branch, /fast, etc.) - Toolsets: add 4 missing (messaging, search, todo, rl) - Troubleshooting: expand from 6 to 10 sections with practical deployment fixes (Copilot OAuth 403, gateway linger, WSL2 systemd, Discord intents, etc.) Minor fixes: - agent/ directory description expanded - delegation config keys completed - /restart noted as gateway-only - hermes honcho noted as plugin-dependent	2026-04-13 02:49:11 -07:00
Teknium	b22663ea69	docs: restore Orchestra Research attribution in research-paper-writing skill (#8800 ) PR #4654 replaced ml-paper-writing with research-paper-writing, preserving the writing philosophy and reference files but dropping the dedicated 'Sources Behind This Guidance' attribution table from the SKILL.md body. Re-adds: - The researcher attribution table (Nanda, Farquhar, Gopen & Swan, Lipton, Steinhardt, Perez, Karpathy) with affiliations and links to SKILL.md - Orchestra Research credit as original compiler of the writing philosophy - 'Origin & Attribution' section in sources.md documenting the full chain: Nanda blog → Orchestra skill → teknium integration → SHL0MS expansion	2026-04-12 22:03:18 -07:00
SHL0MS	801a26c014	feat(skills): add creative ideation — constraint-driven project generation Generate project ideas through creative constraints. Constraint + direction = creativity. Core skill (SKILL.md, 147 lines): - 15 curated constraints across 3 categories: developers, makers, anyone - Developer-focused prompts: 'solve your own itch', 'the CLI tool that should exist', 'automate the annoying thing', 'nothing new except glue' - Matching table: maps user mood/intent to appropriate constraints - Complete worked example with 3 concrete project ideas - Output format for consistent, actionable idea presentation Extended library (references/full-prompt-library.md, 110 lines): - 30+ additional constraints: communication, screens, philosophy, transformation, identity, scale, starting points Constraint approach inspired by wttdotm.com/prompts.html. Adapted for software development and general-purpose ideation.	2026-04-11 01:44:36 -04:00
SHL0MS	941608cdde	feat(skills): add creative divergence strategies for experimental output Adds opt-in creative thinking frameworks to ascii-video, p5js, and manim-video skills, based on Lluminate (joelsimon.net/lluminate). Only engaged when the user explicitly asks for creative, experimental, or unconventional output. Straightforward requests are unaffected. Each skill gets 2-3 strategies matched to its domain: - ascii-video: Forced Connections, Conceptual Blending, Oblique Strategies - p5js: Conceptual Blending, SCAMPER, Distance Association - manim-video: SCAMPER, Assumption Reversal Strategies sourced from creativity research (Boden, Eno, de Bono, Koestler, Fauconnier & Turner, Osborn), formalized for LLM prompting by Lluminate.	2026-04-09 21:40:16 -04:00
Teknium	e9168f917e	fix: handle HTTP errors gracefully in gws_bridge token refresh Instead of crashing with a raw urllib traceback on refresh failure, print a clean error message and suggest re-running setup.py.	2026-04-09 14:28:35 -07:00
Teknium	73eb59db8d	fix: follow-up fixes for google-workspace gws migration - Fix npm package name: @anthropic -> @googleworkspace/cli - Add Homebrew install option - Fix calendar_list to respect --start/--end args (uses raw Calendar API for date ranges, +agenda helper for default 7-day view) - Improve check_auth partial scope output (list missing scopes) - Add output format documentation with key JSON shapes - Use npm install in troubleshooting (no Rust toolchain needed) Follow-up to cherry-picked PR #6713	2026-04-09 14:28:35 -07:00
spideystreet	127b4caf0d	feat(skills): migrate google-workspace to gws CLI backend Migrate the google-workspace skill from custom Python API wrappers (google-api-python-client) to Google's official Rust CLI gws (googleworkspace/cli). Add gws_bridge.py for headless-compatible token refresh. Fix partial OAuth scope handling. Co-authored-by: spideystreet <dhicham.pro@gmail.com> Cherry-picked from PR #6713	2026-04-09 14:28:35 -07:00
Teknium	1780ad24b1	fix: normalize remaining reasoning effort orderings and add missing 'minimal' Follow-up to cherry-picked PR #6698. Fixes spots the original PR missed: - hermes_constants.py: VALID_REASONING_EFFORTS tuple ordering - gateway/run.py: _load_reasoning_config docstring + validation tuple - configuration.md and batch-processing.md: docs ordering - hermes-agent skill: /reasoning usage hint was missing 'minimal'	2026-04-09 14:20:16 -07:00
Teknium	2772d99085	fix: remove /prompt slash command — footgun via prefix expansion (#6752 ) /pr <anything> silently resolved to /prompt via the shortest-match tiebreaker in prefix expansion, permanently overwriting the system prompt and persisting to config. The command's functionality (setting agent.system_prompt) is available via config.yaml and /personality covers the common use case. Removes: CommandDef, dispatch branch, _handle_prompt_command handler, docs references, and updates subcommand extraction test.	2026-04-09 11:27:27 -07:00
Teknium	d0ffb111c2	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 ) Comprehensive cleanup across 80 files based on automated (ruff, pyflakes, vulture) and manual analysis of the entire codebase. Changes by category: Unused imports removed (~95 across 55 files): - Removed genuinely unused imports from all major subsystems - agent/, hermes_cli/, tools/, gateway/, plugins/, cron/ - Includes imports in try/except blocks that were truly unused (vs availability checks which were left alone) Unused variables removed (~25): - Removed dead variables: connected, inner, channels, last_exc, source, new_server_names, verify, pconfig, default_terminal, result, pending_handled, temperature, loop - Dropped unused argparse subparser assignments in hermes_cli/main.py (12 instances of add_parser() where result was never used) Dead code removed: - run_agent.py: Removed dead ternary (None if False else None) and surrounding unreachable branch in identity fallback - run_agent.py: Removed write-only attribute _last_reported_tool - hermes_cli/providers.py: Removed dead @property decorator on module-level function (decorator has no effect outside a class) - gateway/run.py: Removed unused MCP config load before reconnect - gateway/platforms/slack.py: Removed dead SessionSource construction Undefined name bugs fixed (would cause NameError at runtime): - batch_runner.py: Added missing logger = logging.getLogger(__name__) - tools/environments/daytona.py: Added missing Dict and Path imports Unnecessary global statements removed (14): - tools/terminal_tool.py: 5 functions declared global for dicts they only mutated via .pop()/[key]=value (no rebinding) - tools/browser_tool.py: cleanup thread loop only reads flag - tools/rl_training_tool.py: 4 functions only do dict mutations - tools/mcp_oauth.py: only reads the global - hermes_time.py: only reads cached values Inefficient patterns fixed: - startswith/endswith tuple form: 15 instances of x.startswith('a') or x.startswith('b') consolidated to x.startswith(('a', 'b')) - len(x)==0 / len(x)>0: 13 instances replaced with pythonic truthiness checks (not x / bool(x)) - in dict.keys(): 5 instances simplified to in dict - Redefined unused name: removed duplicate _strip_mdv2 import in send_message_tool.py Other fixes: - hermes_cli/doctor.py: Replaced undefined logger.debug() with pass - hermes_cli/config.py: Consolidated chained .endswith() calls Test results: 3934 passed, 17 failed (all pre-existing on main), 19 skipped. Zero regressions.	2026-04-07 10:25:31 -07:00
SHL0MS	902a02e3d5	Merge pull request #5791 from leotrs/manim-ce-reference-improvements Expand Manim CE reference docs: geometry, animations, and LaTeX environments	2026-04-07 12:15:59 -04:00
Teknium	8b861b77c1	refactor: remove browser_close tool — auto-cleanup handles it (#5792 ) * refactor: remove browser_close tool — auto-cleanup handles it The browser_close tool was called in only 9% of browser sessions (13/144 navigations across 66 sessions), always redundantly — cleanup_browser() already runs via _cleanup_task_resources() at conversation end, and the background inactivity reaper catches anything else. Removing it saves one tool schema slot in every browser-enabled API call. Also fixes a latent bug: cleanup_browser() now handles Camofox sessions too (previously only Browserbase). Camofox sessions were never auto-cleaned per-task because they live in a separate dict from _active_sessions. Files changed (13): - tools/browser_tool.py: remove function, schema, registry entry; add camofox cleanup to cleanup_browser() - toolsets.py, model_tools.py, prompt_builder.py, display.py, acp_adapter/tools.py: remove browser_close from all tool lists - tests/: remove browser_close test, update toolset assertion - docs/skills: remove all browser_close references * fix: repeat browser_scroll 5x per call for meaningful page movement Most backends scroll ~100px per call — barely visible on a typical viewport. Repeating 5x gives ~500px (~half a viewport), making each scroll tool call actually useful. Backend-agnostic approach: works across all 7+ browser backends without needing to configure each one's scroll amount individually. Breaks early on error for the agent-browser path. * feat: auto-return compact snapshot from browser_navigate Every browser session starts with navigate → snapshot. Now navigate returns the compact accessibility tree snapshot inline, saving one tool call per browser task. The snapshot captures the full page DOM (not viewport-limited), so scroll position doesn't affect it. browser_snapshot remains available for refreshing after interactions or getting full=true content. Both Browserbase and Camofox paths auto-snapshot. If the snapshot fails for any reason, navigation still succeeds — the snapshot is a bonus, not a requirement. Schema descriptions updated to guide models: navigate mentions it returns a snapshot, snapshot mentions it's for refresh/full content. * refactor: slim cronjob tool schema — consolidate model/provider, drop unused params Session data (151 calls across 67 sessions) showed several schema properties were never used by models. Consolidated and cleaned up: Removed from schema (still work via backend/CLI): - skill (singular): use skills array instead - reason: pause-only, unnecessary - include_disabled: now defaults to true - base_url: extreme edge case, zero usage - provider (standalone): merged into model object Consolidated: - model + provider → single 'model' object with {model, provider} fields. If provider is omitted, the current main provider is pinned at creation time so the job stays stable even if the user changes their default. Kept: - script: useful data collection feature - skills array: standard interface for skill loading Schema shrinks from 14 to 10 properties. All backend functionality preserved — the Python function signature and handler lambda still accept every parameter. * fix: remove mixture_of_agents from core toolsets — opt-in only via hermes tools MoA was in _HERMES_CORE_TOOLS and composite toolsets (hermes-cli, hermes-messaging, safe), which meant it appeared in every session for anyone with OPENROUTER_API_KEY set. The _DEFAULT_OFF_TOOLSETS gate only works after running 'hermes tools' explicitly. Now MoA only appears when a user explicitly enables it via 'hermes tools'. The moa toolset definition and check_fn remain unchanged — it just needs to be opted into.	2026-04-07 03:28:44 -07:00
Leo Torres	e8f6854cab	docs: expand Manim CE reference docs with additional API coverage Add geometry mobjects, movement/creation animations, and LaTeX environments to the skill's reference docs. All verified against Manim CE v0.20.1. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 11:36:13 +02:00
Teknium	f609bf277d	feat: update blogwatcher skill to JulienTant's fork (#5759 ) Replace Hyaxia/blogwatcher with JulienTant/blogwatcher-cli fork which adds: - Docker support with BLOGWATCHER_DB env var for persistent storage - SQL injection prevention - SSRF protection (blocks private IPs/metadata endpoints) - HTML scraping fallback when RSS unavailable - OPML import from Feedly/Inoreader/NewsBlur - Category filtering for articles - Direct binary downloads (no Go required) - Migration guide from original blogwatcher Binary name changed: blogwatcher -> blogwatcher-cli Community contribution by Ao (JulienTant). Closes discussion about Docker compatibility.	2026-04-06 23:59:26 -07:00
Zainan Victor Zhou	190471fdc0	docs: use HERMES_HOME in google-workspace skill examples - avoid hard-coded ~/.hermes paths in the setup and API shorthands - prefer HERMES_HOME with a sane default to /Users/peteradams/.hermes - keep the examples aligned with profile-aware Hermes installs	2026-04-06 16:50:07 -07:00
Zainan Victor Zhou	83df001d01	fix: allow google-workspace skill scripts to run directly - fall back to adding the repo root to sys.path when hermes_constants is not importable - fixes direct execution of setup.py and google_api.py from the repo checkout - keeps the upstream PR scoped to the google-workspace compatibility fix	2026-04-06 16:50:07 -07:00
Teknium	539629923c	docs(llm-wiki): add Obsidian Headless setup for servers (#5660 ) Adds obsidian-headless (npm) setup guide to the Obsidian Integration section — Node 22+, ob login, sync-create-remote, sync-setup, systemd service for continuous background sync. Covers the full headless workflow for agents running on servers syncing to Obsidian desktop on other devices.	2026-04-06 16:37:14 -07:00
Teknium	150f70f821	feat(skills): add skill config interface + llm-wiki skill (#5635 ) Skills can now declare config.yaml settings via metadata.hermes.config in their SKILL.md frontmatter. Values are stored under skills.config.* namespace, prompted during hermes config migrate, shown in hermes config show, and injected into the skill context at load time. Also adds the llm-wiki skill (Karpathy's LLM Wiki pattern) as the first skill to use the new config interface, declaring wiki.path. Skill config interface (new): - agent/skill_utils.py: extract_skill_config_vars(), discover_all_skill_config_vars(), resolve_skill_config_values(), SKILL_CONFIG_PREFIX - agent/skill_commands.py: _inject_skill_config() injects resolved values into skill messages as [Skill config: ...] block - hermes_cli/config.py: get_missing_skill_config_vars(), skill config prompting in migrate_config(), Skill Settings in show_config() LLM Wiki skill (skills/research/llm-wiki/SKILL.md): - Three-layer architecture (raw sources, wiki pages, schema) - Three operations (ingest, query, lint) - Session orientation, page thresholds, tag taxonomy, update policy, scaling guidance, log rotation, archiving workflow Docs: creating-skills.md, configuration.md, skills.md, skills-catalog.md Closes #5100	2026-04-06 13:49:13 -07:00
SHL0MS	17e2a27c51	feat(skills): add p5js creative coding skill Production pipeline for interactive and generative visual art using p5.js. Covers 7 modes: generative art, data visualization, interactive experiences, animation/motion graphics, 3D scenes, image processing, and audio-reactive. Includes: - SKILL.md with creative standard, pipeline, and critical implementation notes - 10 reference files covering core API, shapes, visual effects (noise, flow fields, particles, domain warp, attractors, L-systems, circle packing, bloom, reaction-diffusion), animation (easing, springs, state machines, scene transitions), typography, color systems, WebGL/3D/shaders, interaction, and comprehensive export pipeline - Deterministic headless frame capture via Puppeteer (noLoop + redraw) - ffmpeg render pipeline for MP4 video export - Per-clip architecture for multi-scene video production - Interactive viewer template with seed navigation and parameter controls - Performance guidance: FES disable, Math.* hot loops, per-pixel budgets - Addon library coverage: p5.brush, p5.grain, CCapture.js, p5.js-svg - fxhash/Art Blocks generative platform conventions - p5.js 2.0 migration guide (async setup, OKLCH, splineVertex, shader.modify) - 13 documented common mistakes and troubleshooting patterns 17 files, ~5,900 lines.	2026-04-06 14:39:00 -04:00
SHL0MS	a912cd4568	docs(manim-video): add 5 new reference files — design thinking, updaters, paper explainer, decorations, production quality Five new reference files expanding the skill from rendering knowledge into production methodology: animation-design-thinking.md (161 lines): When to animate vs show static, concept decomposition into visual beats, pacing rules, narration sync, equation reveal strategies, architecture diagram patterns, common design mistakes. updaters-and-trackers.md (260 lines): Deep ValueTracker mental model, lambda/time-based/always_redraw updaters, DecimalNumber and Variable live displays, animation-based updaters, 4 complete practical patterns (dot tracing, live area, connected diagram, parameter exploration). paper-explainer.md (255 lines): Full workflow for turning research papers into animations. Audience selection, 5-minute template, pre-code gates (narration, scene list, style contract), equation reveal strategies, architecture diagram building, results animation, domain-specific patterns for ML/physics/ biomedical papers. decorations.md (202 lines): SurroundingRectangle, BackgroundRectangle, Brace, arrows (straight, curved, labeled), DashedLine, Angle/RightAngle, Cross, Underline, color highlighting workflows, annotation lifecycle pattern. production-quality.md (190 lines): Pre-code, pre-render, post-render checklists. Text overlap prevention, spatial layout coordinate budget, max simultaneous elements, animation variety audit, tempo curve, color consistency, data viz minimums. Total skill now: 14 reference files, 2614 lines.	2026-04-06 13:51:36 -04:00
SHL0MS	95a044a2e0	feat(research-paper-writing): fill coverage gaps and integrate patterns from AI-Scientist, GPT-Researcher Fix duplicate step numbers (5.3, 7.3) and missing 7.5. Add coverage for human evaluation, theory/survey/benchmark/position papers, ethics/broader impact, arXiv strategy, code packaging, negative results, workshop papers, multi-author coordination, compute budgeting, and post-acceptance deliverables. Integrate ensemble reviewing with meta-reviewer and negative bias, pre-compilation validation pipeline, experiment journal with tree structure, breadth/depth literature search, context management for large projects, two-pass refinement, VLM visual review, and claim verification. New references: human-evaluation.md, paper-types.md.	2026-04-06 01:12:32 -04:00
SHL0MS	b26e7fd43a	fix(manim-video): recommend monospace fonts — proportional fonts have broken kerning in Pango Manim's Pango text renderer produces broken kerning with proportional fonts (Helvetica, Inter, SF Pro, Arial) at all sizes and resolutions. Characters overlap and spacing is inconsistent. This is a fundamental Pango limitation. Changes: - Recommend Menlo (monospace) as the default font for ALL text - Proportional fonts only acceptable for large titles (>=48, short strings) - Set minimum font_size=18 for readability - Update all code examples to use MONO='Menlo' pattern - Remove Inter/Helvetica/SF Pro from recommendations	2026-04-06 00:35:43 -04:00
SHL0MS	447ec076a4	docs(manim-video): expand references with comprehensive Manim CE and 3b1b patterns Adds 601 lines across 6 reference files, sourced from deep review of: - Manim CE v0.20.1 full reference manual - 3b1b/manim example_scenes.py and source modules - 3b1b/videos production CLAUDE.md and workflow patterns - Manim CE thematic guides (voiceover, text, configuration) animations.md: always_redraw, TracedPath, FadeTransform, TransformFromCopy, ApplyMatrix, squish_rate_func, ShowIncreasingSubsets, ShowPassingFlash, expanded rate functions mobjects.md: SVGMobject, ImageMobject, Variable, BulletedList, DashedLine, Angle/RightAngle, boolean ops, LabeledArrow, t2c/t2f/t2s/t2w per-substring styling, backstroke for readability, apply_complex_function with prepare_for_nonlinear_transform equations.md: substrings_to_isolate, multi-line equations, TransformMatchingTex with matched_keys and key_map, set_color_by_tex graphs-and-data.md: Graph/DiGraph with layout algorithms, ArrowVectorField/StreamLines, ComplexPlane/PolarPlane camera-and-3d.md: ZoomedScene with inset zoom, LinearTransformationScene for 3b1b-style linear algebra rendering.md: manim.cfg project config, self.next_section() chapter markers, manim-voiceover plugin with ElevenLabs/GTTS integration and bookmark-based audio sync	2026-04-06 00:08:17 -04:00
SHL0MS	1572956fdc	Merge pull request #4930 from SHL0MS/feat/manim-video-skill-v2 feat(skills): add manim-video skill for mathematical and technical animations	2026-04-05 16:10:30 -07:00
SHL0MS	9d885b266c	feat(skills): add manim-video skill for mathematical and technical animations Production pipeline for creating 3Blue1Brown-style animated videos using Manim Community Edition. The agent handles the full workflow: creative planning, Python code generation, rendering, scene stitching, audio muxing, and iterative refinement. Modes: concept explainers, equation derivations, algorithm visualizations, data stories, architecture diagrams, paper explainers, 3D visualizations. 9 reference files, setup verification script, README. All API references verified against ManimCommunity/manim source.	2026-04-05 19:09:37 -04:00
Teknium	cb63b5f381	feat(skills): add popular-web-designs skill with 54 website design systems (#5194 ) Curated collection of production-quality design system specifications extracted from real websites (sourced from VoltAgent/awesome-design-md). Each template captures a site's complete visual language: colors, typography, components, layout, shadows, responsive behavior, and agent-ready CSS values. Hermes-specific adaptations in every template: - Google Fonts CDN link tags for proprietary font substitutes - CSS font-family stacks with proper fallbacks - Integration notes for write_file + generative-widgets workflow - browser_vision verification reminders SKILL.md includes categorized catalog, font substitution reference table, HTML generation pattern, and design-to-use-case matching guide. Sites: Airbnb, Airtable, Apple, BMW, Cal.com, Claude, Clay, ClickHouse, Cohere, Coinbase, Composio, Cursor, ElevenLabs, Expo, Figma, Framer, HashiCorp, IBM, Intercom, Kraken, Linear, Lovable, Minimax, Mintlify, Miro, Mistral AI, MongoDB, Notion, NVIDIA, Ollama, OpenCode, Pinterest, PostHog, Raycast, Replicate, Resend, Revolut, RunwayML, Sanity, Sentry, SpaceX, Spotify, Stripe, Supabase, Superhuman, Together AI, Uber, Vercel, VoltAgent, Warp, Webflow, Wise, xAI, Zapier	2026-04-05 00:42:55 -07:00
Teknium	0fd3de2674	docs(skill): claude-code v2.2 — add cheat sheet commands, env vars, rules, advanced features (#5158 ) Expands the claude-code skill with content from official docs and community cheat sheets that was missing from v2.0: Slash commands: /cost, /btw, /plan, /loop, /batch, /security-review, /resume, /effort (with auto level), /mcp, /release-notes, /voice details Keyboard shortcuts: Alt+P (model), Alt+T (thinking), Alt+O (fast mode), Ctrl+V (paste image), Ctrl+O (transcript), Ctrl+G (external editor) Ultrathink keyword for max reasoning on a specific turn Rules directory: .claude/rules/.md and ~/.claude/rules/.md Auto-memory: ~/.claude/projects/<proj>/memory/ (25KB/200 lines limit) Environment variables: CLAUDE_CODE_EFFORT_LEVEL, MAX_THINKING_TOKENS, CLAUDE_CODE_NO_FLICKER, CLAUDE_CODE_SUBPROCESS_ENV_SCRUB MCP limits: 2KB tool desc cap, maxResultSizeChars 500K, transport types Reorganized slash commands into Session/Development/Configuration groups Reorganized keyboard shortcuts into Controls/Toggles/Multiline groups	2026-04-04 19:15:57 -07:00
Teknium	ff544526cd	docs(skill): comprehensive claude-code skill rewrite v2.0 (#5155 ) Major rewrite of the claude-code orchestration skill from 94 to 460 lines. Based on official docs research, community guides, and live experimentation. Key additions: - Two orchestration modes: Print mode (-p) vs Interactive PTY via tmux - Detailed PTY dialog handling (trust + permissions bypass patterns) - Print mode deep dive: JSON output, piped input, session resumption, --json-schema, --bare mode for CI - Complete flag reference (20+ flags organized by category) - Interactive session patterns with tmux send-keys/capture-pane - Claude's slash commands and keyboard shortcuts reference - CLAUDE.md, hooks, custom subagents, MCP, custom commands docs - Cost/performance tips (effort levels, budget caps, context mgmt) - 10 specific pitfalls discovered through live testing - 10 rules for Hermes agents orchestrating Claude Code	2026-04-04 19:00:50 -07:00
Teknium	34308e4de9	docs: improve youtube-content skill structure and workflow Clearer workflow with validation/chunking steps, expanded description with trigger terms for better agent matching, tightened error handling. Fixed stray pipe character in original PR diff. Based on PR #4778 by fernandezbaptiste. Co-authored-by: fernandezbaptiste <fernandezbaptiste@users.noreply.github.com>	2026-04-03 22:18:00 -07:00
kshitijk4poor	5359921199	refactor: simplify scope validation helpers in google workspace scripts Fix double file read bug in google_api.py _missing_scopes(), consolidate redundant _normalize_scope_values into callers, merge duplicate except blocks.	2026-04-03 17:49:18 -07:00
kshitijk4poor	37e2ef6c3f	fix: protect profile-scoped google workspace oauth tokens	2026-04-03 17:49:18 -07:00
Teknium	52ddd6bc64	refactor(skills): consolidate code verification skills into one (#4854 ) * chore: release v0.7.0 (2026.4.3) 168 merged PRs, 223 commits, 46 resolved issues, 40+ contributors. Highlights: pluggable memory providers, credential pools, Camofox browser, inline diff previews, API server session continuity, ACP MCP registration, gateway hardening, secret exfiltration blocking. * refactor(skills): consolidate code-review + verify-code-changes into requesting-code-review Merge the passive code-review checklist and the automated verification pipeline (from PR #4459 by @MorAlekss) into a single requesting-code-review skill. This eliminates model confusion between three overlapping skills. Now includes: - Static security scan (grep on diff lines) - Baseline-aware quality gates (only flag NEW failures) - Multi-language tool detection (Python, Node, Rust, Go) - Independent reviewer subagent with fail-closed JSON verdict - Auto-fix loop with separate fixer agent (max 2 attempts) - Git checkpoint and [verified] commit convention Deletes: skills/software-development/code-review/ (absorbed) Closes: #406 (independent code verification)	2026-04-03 14:13:27 -07:00
SHL0MS	b86647c295	Replace ml-paper-writing with research-paper-writing: full research pipeline skill Replaces the writing-focused ml-paper-writing skill (940 lines) with a complete end-to-end research paper pipeline (1,599 lines SKILL.md + 3,184 lines across 7 reference files). New content: - Full 8-phase pipeline: project setup, literature review, experiment design, execution/monitoring, analysis, paper drafting, review/revision, submission preparation - Iterative refinement strategy guide from autoreason research (when to use autoreason vs critique-and-revise vs single-pass, model selection) - Hermes agent integration: delegate_task parallel drafting, cronjob monitoring, memory/todo state management, skill composition - Professional LaTeX tooling: microtype, siunitx, TikZ diagram patterns, algorithm2e, subcaption, latexdiff, SciencePlots - Human evaluation design: annotation protocols, inter-annotator agreement, crowdsourcing platforms - Title, Figure 1, conclusion, appendix strategy, page budget management - Anonymization checklist, rebuttal writing, camera-ready preparation - AAAI and COLM venue coverage (checklists, reviewer guidelines) Preserved from ml-paper-writing: - All writing philosophy (Nanda, Farquhar, Gopen & Swan, Lipton, Perez) - Citation verification workflow (5-step mandatory process) - All 6 conference templates (NeurIPS, ICML, ICLR, ACL, AAAI, COLM) - Conference requirements, format conversion workflow - Proactivity/collaboration guidance Bug fixes in inherited reference files: - BibLaTeX recommendation now correctly says natbib for conferences - Bare except clauses fixed to except Exception - Jinja2 template tags removed from citation-workflow.md - Stale date caveats added to reviewer-guidelines.md	2026-04-02 16:13:26 -04:00
analista	3400098481	fix: update fetch_transcript.py for youtube-transcript-api v1.x The library removed the static get_transcript() method in v1.0. Migrate to the new instance-based fetch() API and normalize FetchedTranscriptSnippet objects back to dicts for compatibility with the rest of the script.	2026-04-01 10:49:24 -07:00
Teknium	85e96a4638	fix(skills): move unified hermes-agent skill into autonomous-ai-agents category (#4435 ) The unified skill from PR #4332 was placed at a top-level skills/hermes-agent/ directory, creating a redundant standalone category. Move it to skills/autonomous-ai-agents/hermes-agent/ alongside claude-code, codex, and opencode where it belongs.	2026-04-01 03:39:25 -07:00
Teknium	b118f607b2	feat(skills): unify hermes-agent and hermes-agent-setup into single skill (#4332 ) Merges the hermes-agent-spawning skill (autonomous-ai-agents/) and hermes-agent-setup skill (dogfood/) into a single comprehensive skills/hermes-agent/ skill. The unified skill covers: - What Hermes Agent is and how it compares to Claude Code/Codex/OpenClaw - Complete CLI reference (all subcommands and flags) - Slash command reference - Configuration guide (providers, toolsets, config sections) - Voice/STT/TTS setup - Spawning additional agent instances (one-shot and interactive PTY) - Multi-agent coordination patterns - Troubleshooting guide - Where-to-find-things lookup table with docs links - Concise contributor quick reference Removes: - skills/autonomous-ai-agents/hermes-agent/ (hermes-agent-spawning) - skills/dogfood/hermes-agent-setup/	2026-03-31 14:49:20 -07:00
SHL0MS	ab62614a89	ascii-video: add text readability techniques and external layout oracle pattern - composition.md: add text backdrop (gaussian dark mask behind glyphs) and external layout oracle pattern (browser-based text layout → JSON → Python renderer pipeline for obstacle-aware text reflow) - shaders.md: add reverse vignette shader (center-darkening for text readability) - troubleshooting.md: add diagnostic entries for text-over-busy-background readability and kaleidoscope-destroys-text pitfall	2026-03-30 18:48:22 -04:00
Teknium	5ceed021dc	feat(gateway): skill-aware slash commands, paginated /commands, Telegram 100-cap (#3934 ) * feat(gateway): skill-aware slash commands, paginated /commands, Telegram 100-cap Map active skills to Telegram's slash command menu so users can discover and invoke skills directly. Three changes: 1. Telegram menu now includes active skill commands alongside built-in commands, capped at 100 entries (Telegram Bot API limit). Overflow commands remain callable but hidden from the picker. Logged at startup when cap is hit. 2. New /commands [page] gateway command for paginated browsing of all commands + skills. /help now shows first 10 skill commands and points to /commands for the full list. 3. When a user types a slash command that matches a disabled or uninstalled skill, they get actionable guidance: - Disabled: 'Enable it with: hermes skills config' - Optional (not installed): 'Install with: hermes skills install official/<path>' Built on ideas from PR #3921 by @kshitijk4poor. * chore: move 21 niche skills to optional-skills Move specialized/niche skills from built-in (skills/) to optional (optional-skills/) to reduce the default skill count. Users can install them with: hermes skills install official/<category>/<name> Moved skills (21): - mlops: accelerate, chroma, faiss, flash-attention, hermes-atropos-environments, huggingface-tokenizers, instructor, lambda-labs, llava, nemo-curator, pinecone, pytorch-lightning, qdrant, saelens, simpo, slime, tensorrt-llm, torchtitan - research: domain-intel, duckduckgo-search - devops: inference-sh cli Built-in skills: 96 → 75 Optional skills: 22 → 43 * fix: only include repo built-in skills in Telegram menu, not user-installed User-installed skills (from hub or manually added) stay accessible via /skills and by typing the command directly, but don't get registered in the Telegram slash command picker. Only skills whose SKILL.md is under the repo's skills/ directory are included in the menu. This keeps the Telegram menu focused on the curated built-in set while user-installed skills remain discoverable through /skills and /commands.	2026-03-30 10:57:30 -07:00
Teknium	17b6000e90	feat(skills): add songwriting-and-ai-music creative skill (salvage #1901 ) (#3834 ) Adds a songwriting craft and AI music prompt engineering skill covering song structure, rhyme/meter, emotional arcs, Suno metatag reference, phonetic tricks for AI singers, parody adaptation, and production workflow. Complements existing music skills (heartmula, audiocraft, songsee) which cover model setup/usage — this one covers the creative process itself. Also removes the empty skills/music-creation/ category (only had a DESCRIPTION.md, no actual skills). Co-authored-by: 123mikeyd <123mikeyd@users.noreply.github.com>	2026-03-29 17:49:19 -07:00
Teknium	253a9adc72	docs(skills): clarify DuckDuckGo runtime requirements (#3680 ) Co-authored-by: kshitij <82637225+kshitijk4poor@users.noreply.github.com>	2026-03-29 00:17:57 -07:00
Teknium	7a3682ac3f	feat: mount skill credential files + fix env passthrough for remote backends (#3671 ) Two related fixes for remote terminal backends (Modal/Docker): 1. NEW: Credential file mounting system Skills declare required_credential_files in frontmatter. Files are mounted into Docker (read-only bind mounts) and Modal (mounts at creation + sync via exec on each command for mid-session changes). Google Workspace skill updated with the new field. 2. FIX: Docker backend now includes env_passthrough vars Skills that declare required_environment_variables (e.g. Notion with NOTION_API_KEY) register vars in the env_passthrough system. The local backend checked this, but Docker's forward_env was a separate disconnected list. Now Docker exec merges both sources, so skill-declared env vars are forwarded into containers automatically. This fixes the reported issue where NOTION_API_KEY in ~/.hermes/.env wasn't reaching the Docker container despite being registered via the Notion skill's prerequisites. Closes #3665	2026-03-28 23:53:40 -07:00
kshitij	a99c0478d0	fix(skills): move parallel-cli to optional-skills (#3673 ) parallel-cli is a paid third-party vendor skill that requires PARALLEL_API_KEY, but it was shipped in the default skills/ directory with no env-var gate. This caused it to appear in every user's system prompt even when they have no Parallel account or API key. Move it to optional-skills/ so it is only visible through the Skills Hub and must be explicitly installed. Also remove it from the default skills catalog docs.	2026-03-28 23:45:05 -07:00
Mibay	a6bc13ce13	fix(github-auth): check ~/.hermes/.env before ~/.git-credentials for token extraction (#3466 ) * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials for token extraction Users who configured their token via `hermes setup` have it stored in ~/.hermes/.env (GITHUB_TOKEN=...), not in ~/.git-credentials. On macOS with osxkeychain as the default git credential helper, ~/.git-credentials may not exist at all, causing silent 401 failures in all GitHub skills. Add ~/.hermes/.env as the first fallback in the auth detection block and the inline "Extracting the Token from Git Credentials" example. Priority order: env var → ~/.hermes/.env → ~/.git-credentials → none Part of fix for NousResearch/hermes-agent#3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464	2026-03-28 14:46:49 -07:00
Teknium	dabe3c34cc	feat(webhook): hermes webhook CLI + skill for event-driven subscriptions (#3578 ) Adds 'hermes webhook' CLI subcommand and a skill — zero new model tools. CLI commands (require webhook platform to be enabled): hermes webhook subscribe <name> [--events, --prompt, --deliver, ...] hermes webhook list hermes webhook remove <name> hermes webhook test <name> All commands gate on webhook platform being enabled in config. If not configured, prints setup instructions (gateway setup wizard, manual config.yaml, or env vars). The agent uses these via terminal tool, guided by the webhook-subscriptions skill which documents setup, common patterns (GitHub, Stripe, CI/CD, monitoring), prompt template syntax, security, and troubleshooting. Adapter enhancement: webhook.py hot-reloads dynamic subscriptions from ~/.hermes/webhook_subscriptions.json on each incoming request (mtime-gated). Static config.yaml routes always take precedence. Docs: updated webhooks.md with Dynamic Subscriptions section, added hermes webhook to cli-commands.md reference. No new model tools. No toolset changes. 24 new tests for CLI CRUD, persistence, enabled-gate, and adapter dynamic route loading.	2026-03-28 14:33:35 -07:00
Teknium	26bfdc22b4	feat: add godmode jailbreaking skill + docs (#3157 )	2026-03-26 01:37:18 -07:00
Teknium	f84230527c	docs(skill): add split, merge, search examples to ocr-and-documents skill (#2461 ) * fix: respect DashScope v1 runtime mode for alibaba Remove the hardcoded Alibaba branch from resolve_runtime_provider() that forced api_mode='anthropic_messages' regardless of the base URL. Alibaba now goes through the generic API-key provider path, which auto-detects the protocol from the URL: - /apps/anthropic → anthropic_messages (via endswith check) - /v1 → chat_completions (default) This fixes Alibaba setup with OpenAI-compatible DashScope endpoints (e.g. coding-intl.dashscope.aliyuncs.com/v1) that were broken because runtime always forced Anthropic mode even when setup saved a /v1 URL. Based on PR #2024 by @kshitijk4poor. * docs(skill): add split, merge, search examples to ocr-and-documents skill Adds pymupdf examples for PDF splitting, merging, and text search to the existing ocr-and-documents skill. No new dependencies — pymupdf already covers all three operations natively. --------- Co-authored-by: kshitijk4poor <kshitijk4poor@users.noreply.github.com>	2026-03-22 04:31:22 -07:00
Test	672e9752a0	docs: align venv path to match installer (venv/ not .venv/) The install script creates venv/ but several docs referenced .venv/, causing agents to fail with 'No such file or directory' when following AGENTS.md instructions. Fixes #2066	2026-03-19 18:16:26 -07:00
Test	7e30e97a59	chore: trim redundant trigger sentence from huggingface-hub description	2026-03-18 04:18:13 -07:00
Test	adf188c439	chore: add search to huggingface-hub skill description	2026-03-18 04:15:03 -07:00
Test	947827bba0	chore: tighten huggingface-hub skill description	2026-03-18 04:11:33 -07:00
Test	56ca84f243	feat: add huggingface-hub bundled skill Adds the Hugging Face CLI (hf) reference as a built-in skill under mlops/. Covers downloading/uploading models and datasets, repo management, SQL queries on datasets, inference endpoints, Spaces, buckets, and more. Based on the official HF skill from huggingface/skills.	2026-03-18 04:07:41 -07:00
Test	764825bbff	feat: expand hermes-agent-setup skill + tell agent about it in STT notes Skill now covers full CLI usage (hermes setup, hermes skills, hermes tools, hermes config, session management, etc.), config file reference, and expanded gateway commands. Agent context notes for STT failure now mention the hermes-agent-setup skill is available to help users configure Hermes features.	2026-03-18 03:05:17 -07:00
Test	9c0f346258	fix: direct user message on STT failure + hermes-agent-setup skill When a user sends a voice message and STT isn't configured, the gateway now sends a clear message directly to the user explaining how to set up voice transcription, rather than relying on the agent to relay an injected context note (which often gets misinterpreted). Also adds a hermes-agent-setup bundled skill covering STT/TTS setup, tool configuration, dependency installation, and troubleshooting.	2026-03-18 03:01:41 -07:00
Teknium	d132a3dfbb	feat(skills): add inference.sh skill (terminal-based, no custom tools) (#1686 ) Add inference.sh as a built-in skill that uses the terminal tool to run infsh CLI commands. No custom tools or tool registration — the skill teaches the agent how to use the infsh binary via terminal. Covers 150+ AI apps: image gen (FLUX, Reve, Seedream), video (Veo, Wan, Seedance), LLMs, search (Tavily, Exa), 3D, avatars, and more. Includes reference docs for authentication, app discovery, running apps, and CLI command reference. Based on PR #1021 by @okaris, reworked as a skill-only integration. Co-authored-by: okaris <okaris@users.noreply.github.com>	2026-03-17 03:06:53 -07:00
Teknium	c3d626eb07	Revert "feat: add inference.sh integration (infsh tool + skill) (#1682 )" (#1684 ) This reverts commit `6020db0243`.	2026-03-17 03:01:30 -07:00
Teknium	6020db0243	feat: add inference.sh integration (infsh tool + skill) (#1682 ) Add inference.sh CLI (infsh) as a tool integration, giving agents access to 150+ AI apps through a single CLI — image gen (FLUX, Reve, Seedream), video (Veo, Wan, Seedance), LLMs, search (Tavily, Exa), 3D, avatar/lipsync, and more. One API key manages all services. Tools: - infsh: run any infsh CLI command (app list, app run, etc.) - infsh_install: install the CLI if not present Registered as an 'inference' toolset (opt-in, not in core tools). Includes comprehensive skill docs with examples for all app categories. Changes from original PR: - NOT added to _HERMES_CORE_TOOLS (available via --toolsets inference) - Added 12 tests covering tool registration, command execution, error handling, timeout, JSON parsing, and install flow Inspired by PR #1021 by @okaris. Co-authored-by: okaris <okaris@users.noreply.github.com>	2026-03-17 02:59:21 -07:00
SHL0MS	63635744bf	Refactor ascii-video skill: creative-first SKILL.md, consolidate reference files	2026-03-16 20:11:12 -04:00
Teknium	dd7921d514	fix(honcho): isolate session routing for multi-user gateway (#1500 ) Salvaged from PR #1470 by adavyas. Core fix: Honcho tool calls in a multi-session gateway could route to the wrong session because honcho_tools.py relied on process-global state. Now threads session context through the call chain: AIAgent._invoke_tool() → handle_function_call() → registry.dispatch() → handler **kw → _resolve_session_context() Changes: - Add _resolve_session_context() to prefer per-call context over globals - Plumb honcho_manager + honcho_session_key through handle_function_call - Add sync_honcho=False to run_conversation() for synthetic flush turns - Pass honcho_session_key through gateway memory flush lifecycle - Harden gateway PID detection when /proc cmdline is unreadable - Make interrupt test scripts import-safe for pytest-xdist - Wrap BibTeX examples in Jekyll raw blocks for docs build - Fix thread-order-dependent assertion in client lifecycle test - Expand Honcho docs: session isolation, lifecycle, routing internals Dropped from original PR: - Indentation change in _create_request_openai_client that would move client creation inside the lock (causes unnecessary contention) Co-authored-by: adavyas <adavyas@users.noreply.github.com>	2026-03-16 00:23:47 -07:00
teknium1	4524cddc72	fix: persist google oauth pkce for headless auth Store the pending OAuth state and code verifier between --auth-url and --auth-code so the manual headless flow can reuse Flow.fetch_token() without disabling PKCE.	2026-03-14 22:11:34 -07:00
Teknium	b14a07315b	fix: save /plan output in workspace (#1381 )	2026-03-14 21:28:51 -07:00
Teknium	ff3473a37c	feat: add /plan command (#1372 ) * feat: add /plan command * refactor: back /plan with bundled skill * docs: document /plan skill	2026-03-14 21:18:17 -07:00
teknium1	3229e434b8	Merge origin/main into hermes/hermes-5d160594	2026-03-14 19:34:05 -07:00
teknium1	a6dc73fa07	docs: finish cron terminology cleanup	2026-03-14 19:20:58 -07:00
Teknium	6d2cfc24e9	Merge pull request #953 from JackTheGit/fix/docs-typos-batch4 Fix several documentation typos across training references	2026-03-14 10:26:15 -07:00
SHL0MS	66f8c2d5e8	ascii-video README: add missing sections (value fields, SDFs, coordinate transforms, temporal coherence, feedback buffer, masking, OKLAB, design patterns)	2026-03-14 11:08:10 -04:00
teknium1	d2869de477	docs: tighten Parallel CLI skill guidance Clarify that Parallel is an optional paid vendor workflow, add headless auth and context-chaining guidance, and align command examples more closely with upstream docs before salvaging PR #985.	2026-03-14 06:18:04 -07:00
kshitij	8d61ebe183	feat: add Parallel CLI research skill	2026-03-14 06:15:16 -07:00
Teknium	9525db913f	feat(skills): add X/Twitter xitter skill via upstream x-cli (#1285 ) * feat(skills): salvage xitter skill from PR #1065 Adapt the X/Twitter skill onto current main without vendoring an external CLI. Use upstream x-cli installation instructions, add a social-media category, and align credential/setup guidance with Hermes conventions. * docs(skills): explain X credential requirements in xitter skill Clarify why the official X flow needs five credentials and call out the setup/cost friction explicitly.	2026-03-14 04:00:27 -07:00
Teknium	25481d4286	feat: restore ACP server implementation from PR #949 (#1254 ) Restore the ACP editor-integration implementation that was present on the original PR branch but did not actually land in main. Includes: - acp_adapter/ server, session manager, event bridge, auth, permissions, and tool helpers - hermes acp subcommand and hermes-acp entry point - hermes-acp curated toolset - ACP registry manifest, setup guide, and ACP test suite - jupyter-live-kernel data science skill from the original branch Also updates the revived ACP code for current main by: - resolving runtime providers through the modern shared provider router - binding ACP sessions to per-session cwd task overrides - tracking duplicate same-name tool calls with FIFO IDs - restoring terminal approval callbacks after prompts - normalizing supporting docs/skill metadata Validated with tests/acp and the full pytest suite (-n0).	2026-03-14 00:09:05 -07:00
Teknium	2bf6b7ad1a	feat(skills): add Linear project management skill (#1230 ) Comprehensive Linear GraphQL API skill with API key auth (no OAuth needed). Includes all common queries (issues, projects, teams, search, filters) and mutations (create, update, assign, comment, status changes). Addresses user pain point: Linear MCP server OAuth flow is unreliable in headless agent sessions. This skill uses personal API keys which work reliably without browser-based auth flows. Requires: LINEAR_API_KEY env var (personal API key from Linear settings)	2026-03-13 21:20:32 -07:00
SHL0MS	6733a9a538	Update README	2026-03-13 19:31:29 -04:00
SHL0MS	cda5910ab0	update ascii-video skill: design patterns, local time, examples - New references/design-patterns.md: layer hierarchy (bg/content/accent), directional parameter arcs, scene concepts and visual metaphors, counter-rotating systems, wave collision, progressive fragmentation, entropy/consumption, staggered crescendo buildup, scene ordering - New references/examples.md: copy-paste-ready scenes at every complexity - Update scenes.md: local time convention (t=0 at scene start) - Update SKILL.md: add design-patterns.md to reference table - Add README.md to hermes-agent copy - Sync all reference docs with canonical source (SHL0MS/ascii-video)	2026-03-13 19:13:12 -04:00
Teknium	9f676d1394	feat(skills): add bundled opencode autonomous-agent skill Cherry-picked from PR #880 by @arceus77-7, rebased onto current main with corrections. Adds opencode skill under skills/autonomous-ai-agents/ with: - One-shot opencode run workflow - Interactive/background TUI session workflow - PR review workflow (including opencode pr command) - Parallel work patterns - TUI keybindings reference - Session/cost management - Smoke verification Tested with OpenCode v1.2.25. Fixed /exit bug (not a valid command), added missing flags (--file, --thinking, --variant), expanded docs. Co-authored-by: arceus77-7 <261276524+arceus77-7@users.noreply.github.com>	2026-03-13 08:39:21 -07:00
kshitijk4poor	ccfbf42844	feat: secure skill env setup on load (core #688 ) When a skill declares required_environment_variables in its YAML frontmatter, missing env vars trigger a secure TUI prompt (identical to the sudo password widget) when the skill is loaded. Secrets flow directly to ~/.hermes/.env, never entering LLM context. Key changes: - New required_environment_variables frontmatter field for skills - Secure TUI widget (masked input, 120s timeout) - Gateway safety: messaging platforms show local setup guidance - Legacy prerequisites.env_vars normalized into new format - Remote backend handling: conservative setup_needed=True - Env var name validation, file permissions hardened to 0o600 - Redact patterns extended for secret-related JSON fields - 12 existing skills updated with prerequisites declarations - ~48 new tests covering skip, timeout, gateway, remote backends - Dynamic panel widget sizing (fixes hardcoded width from original PR) Cherry-picked from PR #723 by kshitijk4poor, rebased onto current main with conflict resolution. Fixes #688 Co-authored-by: kshitijk4poor <kshitijk4poor@users.noreply.github.com>	2026-03-13 03:14:04 -07:00
JackTheGit	2eb778119d	Fix checkpoint_id typos and add StorageMeta example in checkpoint storage docs	2026-03-12 09:59:17 +00:00
JackTheGit	a182d12778	Fix several documentation typos across training references	2026-03-11 15:49:00 +00:00
teknium1	82113f1f1e	docs: conditional skill activation — tag duckduckgo-search as web fallback and add documentation - Tag duckduckgo-search skill with fallback_for_toolsets: [web] so it auto-hides when Firecrawl is available and auto-shows when it isn't - Add 'Conditional Activation' section to CONTRIBUTING.md with full spec, semantics, and examples for all 4 frontmatter fields - Add 'Conditional Activation (Fallback Skills)' section to the user- facing skills docs with field reference table and practical example - Update SKILL.md format examples in both docs to show the new fields Follow-up to PR #785 (conditional skill activation feature).	2026-03-11 08:47:01 -07:00
Teknium	c69adfbb17	Merge pull request #825 from JackTheGit/fix/docs-typos-batch2 Fix several documentation typos	2026-03-11 07:13:24 -07:00
SHL0MS	c358af7861	Add ASCII video skill to creative category	2026-03-10 15:54:38 -04:00
teknium1	36ac91c902	Merge PR #598 : feat(skill): expand duckduckgo-search with DDGS Python API coverage Authored by areu01or00. Adds Python DDGS library examples for text, news, images, and video search with structured return field docs.	2026-03-10 04:08:53 -07:00
JackTheGit	1db8609ac9	Fix several documentation typos	2026-03-10 08:10:16 +00:00
teknium1	6ab3ebf195	Add hermes-atropos-environments skill (bundled) Add comprehensive skill for building, testing, and debugging Hermes Agent RL environments for Atropos training. Includes: - SKILL.md: Full guide covering HermesAgentBaseEnv interface, required methods, config class, CLI modes (serve/process/evaluate), reward function patterns, common pitfalls, and minimum implementation checklist - New 'Inference Setup' section: instructs the agent to always ask the user for their inference provider (OpenRouter + model choice, self-hosted VLLM endpoint, or other OpenAI-compatible API) before running tests - references/agentresult-fields.md: AgentResult dataclass field reference - references/atropos-base-env.md: Atropos BaseEnv API reference - references/usage-patterns.md: Step-by-step patterns for process, evaluate, serve, and smoke test modes Will be auto-synced to ~/.hermes/skills/ via skills_sync.	2026-03-09 23:04:17 -07:00
teknium1	0ff7fe3ee2	Merge PR #439 : docs: fix spelling of 'publicly' Authored by JackTheGit. Simple typo fix: publically → publicly in axolotl reference docs.	2026-03-09 20:55:37 -07:00
teknium1	b9d55d5719	feat: add pokemon-player skill with battle-tested gameplay tips Comprehensive skill for playing Pokemon Red/Blue via the pokemon-agent package (NousResearch/pokemon-agent). Includes: - Full startup procedure (uv venv, server, localhost.run dashboard tunnel) - Save/load lifecycle and naming conventions - Gameplay loop with emphasis on frequent vision checks - Hard-learned navigation tips: - Use vision every 2-4 steps (RAM state is blind to obstacles) - Wait 2-3 seconds after door/stair warps for map transitions - Sidestep after exiting buildings to avoid re-entering - Hold B to speed Gen 1's slow text scrolling - Ledges are one-way — use vision to find gaps - Battle strategy, type chart, Gen 1 quirks - Memory conventions with PKM: prefix - Progression milestones through all 8 gyms + Elite Four	2026-03-09 20:29:38 -07:00
teknium1	c6b75baad0	feat: find-nearby skill and Telegram location support Adds a 'find-nearby' skill for discovering nearby places using OpenStreetMap (Overpass + Nominatim). No API keys needed. Works with: - Coordinates (from Telegram location pins) - Addresses, cities, zip codes, landmarks (auto-geocoded) - Multiple place types (restaurant, cafe, bar, pharmacy, etc.) Returns names, distances, cuisine, hours, addresses, and Google Maps links (pin + directions). 184-line stdlib-only script. Also adds Telegram location message handling: - New MessageType.LOCATION in gateway base - Telegram adapter handles LOCATION and VENUE messages - Injects lat/lon coordinates into conversation context - Prompts agent to ask what the user wants nearby Inspired by PR #422 (reimplemented with simpler script and broader skill scope — addresses/cities/zips, not just Telegram coordinates).	2026-03-09 05:31:10 -07:00
teknium1	0dafdcab86	Merge: skill reorganization + sub-category support - Sub-category support in prompt_builder.py (backwards-compatible) - Split mlops (40 skills) into 7 logical sub-categories - Merged 8 singleton categories into logical parents - Fixed 2 misplaced skills (code-review, ml-paper-writing)	2026-03-09 03:40:11 -07:00
Teknium	654e16187e	feat(mcp): add sampling support — server-initiated LLM requests (#753 ) Add MCP sampling/createMessage capability via SamplingHandler class. Text-only sampling + tool use in sampling with governance (rate limits, model whitelist, token caps, tool loop limits). Per-server audit metrics. Based on concept from PR #366 by eren-karakus0. Restructured as class-based design with bug fixes and tests using real MCP SDK types. 50 new tests, 2600 total passing.	2026-03-09 03:37:38 -07:00
teknium1	732c66b0f3	refactor: reorganize skills into sub-categories The skills directory was getting disorganized — mlops alone had 40 skills in a flat list, and 12 categories were singletons with just one skill each. Code change: - prompt_builder.py: Support sub-categories in skill scanner. skills/mlops/training/axolotl/SKILL.md now shows as category 'mlops/training' instead of just 'mlops'. Backwards-compatible with existing flat structure. Split mlops (40 skills) into 7 sub-categories: - mlops/training (12): accelerate, axolotl, flash-attention, grpo-rl-training, peft, pytorch-fsdp, pytorch-lightning, simpo, slime, torchtitan, trl-fine-tuning, unsloth - mlops/inference (8): gguf, guidance, instructor, llama-cpp, obliteratus, outlines, tensorrt-llm, vllm - mlops/models (6): audiocraft, clip, llava, segment-anything, stable-diffusion, whisper - mlops/vector-databases (4): chroma, faiss, pinecone, qdrant - mlops/evaluation (5): huggingface-tokenizers, lm-evaluation-harness, nemo-curator, saelens, weights-and-biases - mlops/cloud (2): lambda-labs, modal - mlops/research (1): dspy Merged singleton categories: - gifs → media (gif-search joins youtube-content) - music-creation → media (heartmula, songsee) - diagramming → creative (excalidraw joins ascii-art) - ocr-and-documents → productivity - domain → research (domain-intel) - feeds → research (blogwatcher) - market-data → research (polymarket) Fixed misplaced skills: - mlops/code-review → software-development (not ML-specific) - mlops/ml-paper-writing → research (academic writing) Added DESCRIPTION.md files for all new/updated categories.	2026-03-09 03:35:53 -07:00
teknium1	c21d77ca08	Merge: OBLITERATUS skill v2.0 + unified gateway compression OBLITERATUS skill (PR #408 updated): - 9 CLI methods, 28 analysis modules, 116 model presets - Default method: advanced (multi-direction SVD, norm-preserving) - Live-tested: Qwen2.5-3B 75%→0% refusal, Qwen2.5-0.5B 60%→20% - References, templates, and real-world pitfalls included Gateway compression fix (PR #739): - Unified session hygiene with agent compression config - Uses model context length × compression.threshold from config.yaml - Removed hardcoded 100k/200-msg thresholds	2026-03-09 02:59:41 -07:00
teknium1	d6c710706f	docs: add real-world testing findings to OBLITERATUS skill Added pitfalls discovered during live abliteration testing: - Models < 1B have fragmented refusal, respond poorly (0.5B: 60%→20%) - Models 3B+ work much better (3B: 75%→0% with advanced defaults) - aggressive method can backfire on small models (made it worse) - Spectral certification RED is common even when refusal rate is 0% - Fixed torch property: total_mem → total_memory	2026-03-09 02:52:54 -07:00
teknium1	a6d3becd6a	feat: update OBLITERATUS skill to v2.0 — match current repo state Major updates to reflect the current OBLITERATUS codebase: - Change default recommendation from 'informed' (experimental) to 'advanced' (reliable, well-tested multi-direction SVD) - Add new CLI commands: tourney, recommend, strategies, report, aggregate, abliterate (alias) - Add --direction-method flag (diff_means, svd, leace) - Add strategies module (embedding/FFN ablation, head pruning, layer removal) - Add evaluation module with LM Eval Harness integration - Expand analysis modules from 15 to 28 - Add Apple Silicon (MLX) support - Add study presets (quick, jailbreak, knowledge, etc.) - Add --contribute, --verify-sample-size, --preset flags - Add complete CLI command reference table - Fix torch property name: total_mem -> total_memory (caught during live testing) Tested: Successfully abliterated Qwen2.5-0.5B-Instruct using 'advanced' method — refusal rate 0.4%, coherence 1.0, model responds without refusal to test prompts.	2026-03-09 02:39:03 -07:00
teknium1	eb0b01de7b	chore: move agentmail skill to optional-skills, add API key docs AgentMail requires a third-party API key (free tier available, paid plans from $20/mo) — not appropriate for bundled skills that show up in every user's system prompt. Added a Requirements section at the top with clear instructions to add AGENTMAIL_API_KEY to ~/.hermes/.env. Streamlined setup steps to avoid duplicating the key in both .env and config.yaml.	2026-03-08 23:33:05 -07:00
teknium1	5b1528519c	Merge PR #330 : feat: add AgentMail skill for agent-owned email inboxes Authored by teyrebaz33. Closes #329.	2026-03-08 23:32:26 -07:00
teknium1	a8bf414f4a	feat: browser console/errors tool, annotated screenshots, auto-recording, and dogfood QA skill New browser capabilities and a built-in skill for agent-driven web QA. ## New tool: browser_console Returns console messages (log/warn/error/info) AND uncaught JavaScript exceptions in a single call. Uses agent-browser's 'console' and 'errors' commands through the existing session plumbing. Supports --clear to reset buffers. Verified working in both local and Browserbase cloud modes. ## Enhanced tool: browser_vision(annotate=True) New boolean parameter on browser_vision. When true, agent-browser overlays numbered [N] labels on interactive elements — each [N] maps to ref @eN. Annotation data (element name, role, bounding box) returned alongside the vision analysis. Useful for QA reports and spatial reasoning. ## Config: browser.record_sessions Auto-record browser sessions as WebM video files when enabled: - Starts recording on first browser_navigate - Stops and saves on browser_close - Saves to ~/.hermes/browser_recordings/ - Works in both local and cloud modes (verified) - Disabled by default ## Built-in skill: dogfood Systematic exploratory QA testing for web applications. Teaches the agent a 5-phase workflow: 1. Plan — accept URL, create output dirs, set scope 2. Explore — systematic crawl with annotated screenshots 3. Collect Evidence — screenshots, console errors, JS exceptions 4. Categorize — severity (Critical/High/Medium/Low) and category (Functional/Visual/Accessibility/Console/UX/Content) 5. Report — structured markdown with per-issue evidence Includes: - skills/dogfood/SKILL.md — full workflow instructions - skills/dogfood/references/issue-taxonomy.md — severity/category defs - skills/dogfood/templates/dogfood-report-template.md — report template ## Tests 21 new tests covering: - browser_console message/error parsing, clear flag, empty/failed states - browser_console schema registration - browser_vision annotate schema and flag passing - record_sessions config defaults and recording lifecycle - Dogfood skill file existence and content validation Addresses #315.	2026-03-08 21:28:12 -07:00
teknium1	99f7582175	chore: move Solana skill to optional-skills/ Solana blockchain queries are a niche use case — not needed by every user. Moved from skills/ (bundled) to optional-skills/ (installable via Skills Hub).	2026-03-08 18:52:02 -07:00
teknium1	93c5997290	Merge PR #212 : feat(skills): add Solana blockchain skill Authored by Deniz Alagoz (gizdusum). Closes #164. Will be moved to optional-skills/ and enhanced post-merge.	2026-03-08 18:51:33 -07:00
teknium1	7c30ac2141	fix: overhaul ascii-art skill with working sources (#662 ) Major issues fixed: - Removed dead APIs: artii.herokuapp.com (404 since Heroku free tier ended 2022), patorjk.com TAAG AJAX endpoint (404) - Removed unusable sources: emojicombos.com (3.3MB JS blob, not curl-accessible), asciiart.eu (art loads via JavaScript only) New working sources added: - asciified API (asciified.thelicato.io): free text-to-ASCII REST API, 250+ FIGlet fonts, returns plain text, no auth — perfect remote alternative when pyfiglet isn't installed - ascii.co.uk: classic ASCII art archive, art in <pre> tags, extractable with simple curl + Python parsing - qrenco.de: QR codes as ASCII art via curl - wttr.in: weather and moon phase as ASCII art via curl Also fixed: Tool 6 no longer relies on web_extract inside execute_code (which was the original #662 bug). All web lookups now use terminal curl which is universally available.	2026-03-08 18:09:44 -07:00
Teknium	b8120df860	Revert "feat: skill prerequisites — hide skills with unmet runtime dependencies"	2026-03-08 03:58:13 -07:00
kshitij	f210510276	feat: add prerequisites field to skill spec — hide skills with unmet dependencies Skills can now declare runtime prerequisites (env vars, CLI binaries) via YAML frontmatter. Skills with unmet prerequisites are excluded from the system prompt so the agent never claims capabilities it can't deliver, and skill_view() warns the agent about what's missing. Three layers of defense: - build_skills_system_prompt() filters out unavailable skills - _find_all_skills() flags unmet prerequisites in metadata - skill_view() returns prerequisites_warning with actionable details Tagged 12 bundled skills that have hard runtime dependencies: gif-search (TENOR_API_KEY), notion (NOTION_API_KEY), himalaya, imessage, apple-notes, apple-reminders, openhue, duckduckgo-search, codebase-inspection, blogwatcher, songsee, mcporter. Closes #658 Fixes #630	2026-03-08 13:19:32 +05:30
teknium1	313d522b61	feat: add Polymarket prediction market skill (read-only) Adds a new market-data/polymarket skill for querying Polymarket's public prediction market APIs. Pure read-only, zero authentication required, zero external dependencies (stdlib only). Includes: - SKILL.md: Agent instructions with key concepts and workflow - references/api-endpoints.md: Full API reference (Gamma, CLOB, Data APIs) - scripts/polymarket.py: CLI helper for search, trending, prices, orderbooks, price history, and recent trades Addresses #589.	2026-03-07 17:27:29 -08:00
areu01or00	ce7e7fef30	docs(skill): expand duckduckgo-search with DDGS Python API coverage Add Python DDGS library examples for all 4 search types (text, news, images, videos) with return field documentation, quick reference table, and validated gotchas. Reorganize to put Python API primary, CLI secondary. Soften Firecrawl-fallback framing. All examples validated on ddgs==9.11.2.	2026-03-07 21:15:29 +05:30
teknium1	f668e9fc75	feat: platform-conditional skill loading + Apple/macOS skills Add a 'platforms' field to SKILL.md frontmatter that restricts skills to specific operating systems. Skills with platforms: [macos] only appear in the system prompt, skills_list(), and slash commands on macOS. Skills without the field load everywhere (backward compatible). Implementation: - skill_matches_platform() in tools/skills_tool.py — core filter - Wired into all 3 discovery paths: prompt_builder.py, skills_tool.py, skill_commands.py - 28 new tests across 3 test files New bundled Apple/macOS skills (all platforms: [macos]): - imessage — Send/receive iMessages via imsg CLI - apple-reminders — Manage Reminders via remindctl CLI - apple-notes — Manage Notes via memo CLI - findmy — Track devices/AirTags via AppleScript + screen capture Docs updated: CONTRIBUTING.md, AGENTS.md, creating-skills.md, skills.md (user guide)	2026-03-07 00:47:54 -08:00
Teknium	8712dd6d1c	Merge pull request #308 from batuhankocyigit/patch-2 fix: rename misspelled directory 'fouth-edition' to 'fourth-edition'	2026-03-06 23:43:09 -08:00
teknium1	ab0f4126cf	fix: restore all removed bundled skills + fix skills sync system - Restored 21 skills removed in commits `757d012` and `740dd92`: accelerate, audiocraft, code-review, faiss, flash-attention, gguf, grpo-rl-training, guidance, llava, nemo-curator, obliteratus, peft, pytorch-fsdp, pytorch-lightning, simpo, slime, stable-diffusion, tensorrt-llm, torchtitan, trl-fine-tuning, whisper - Rewrote sync_skills() with proper update semantics: * New skills (not in manifest): copied to user dir * Existing skills (in manifest + on disk): updated via hash comparison * User-deleted skills (in manifest, not on disk): respected, not re-added * Stale manifest entries (removed from bundled): cleaned from manifest - Added sync_skills() to CLI startup (cmd_chat) and gateway startup (start_gateway) — previously only ran during 'hermes update' - Updated cmd_update output to show new/updated/cleaned counts - Rewrote tests: 20 tests covering manifest CRUD, dir hashing, fresh install, user deletion respect, update detection, stale cleanup, and name collision handling 75 bundled skills total. 2002 tests pass.	2026-03-06 15:57:30 -08:00
JackTheGit	71c0cd00e5	docs: fix spelling of 'publicly'	2026-03-05 16:46:21 +00:00
teknium1	2af2f148ab	refactor: rewrite duckduckgo-search skill for accuracy and usability Follow-up to PR #267 merge: - Fix CLI syntax: -k is keywords, -m is max results (was reversed) - Add clear trigger condition: use only when web_search tool unavailable - Remove misleading curl fallback (DuckDuckGo Instant Answer API is not a web search endpoint) - Fix package name: ddgs (renamed from duckduckgo-search) - Add workflow section for search → web_extract pipeline - Add pitfalls and limitations sections - Fix author attribution to actual contributor - Rewrite shell script as simple ddgs wrapper with availability check	2026-03-04 22:11:09 -08:00
teknium1	d19109742e	Merge PR #267 : feat(skills): add DuckDuckGo search skill as Firecrawl fallback Authored by gamedevCloudy. Adds a free web search skill for users without FIRECRAWL_API_KEY, using the ddgs library or curl.	2026-03-04 22:09:07 -08:00
teknium1	11a5a64729	feat: add emojicombos.com as primary ASCII art search source emojicombos.com has a huge curated collection of ASCII art, dot art, kaomoji, and emoji combos searchable via web_extract with a simple URL pattern: https://emojicombos.com/{term}-ascii-art No API key needed. Returns modern/meme art, pop culture references, and kaomoji alongside classic ASCII art. Added as Source A (recommended first) before asciiart.eu (Source B, classic archive). Also added GitHub Octocat API as a fun easter egg and kaomoji search to the decision flow.	2026-03-04 20:23:36 -08:00
teknium1	0dba3027c1	feat: expand ascii-art skill with cowsay, boxes, toilet, image-to-ascii Adds 5 additional tools from the awesome-ascii-art ecosystem: - cowsay: 50+ characters with speech/thought bubbles - boxes: 70+ decorative border designs, composable with pyfiglet - toilet: colored text art with rainbow/metal/border filters - ascii-image-converter: modern image-to-ASCII (PNG/JPEG/GIF/WEBP) - jp2a: lightweight JPEG-to-ASCII fallback Also adds fun extras (Star Wars telnet), resource links, and an expanded decision flow covering all 7 modes. Ref: github.com/moul/awesome-ascii-art	2026-03-04 20:16:38 -08:00
teknium1	405c7e08be	feat: enhance ascii-art skill with pyfiglet and asciiart.eu search Adds two primary modes on top of the original LLM-generation approach: - Mode 1: pyfiglet (571 fonts, pip install, no API key) for text banners - Mode 2: asciiart.eu search (11,000+ pieces) via web_extract for pre-made art - Mode 3: LLM-generated art using Unicode palette (original PR, now fallback) Includes decision flow, font recommendations, and category reference.	2026-03-04 20:01:08 -08:00
teknium1	cb36930f1d	Merge PR #209 : add ascii-art skill for creative text banners and art Authored by 0xbyt4. Initial skill with Unicode character palette and style guide for LLM-generated ASCII art.	2026-03-04 19:59:13 -08:00
teyrebaz33	6a51fd23df	feat: add AgentMail skill for agent-owned email inboxes (#329 )	2026-03-03 22:20:35 +03:00
teknium1	de0af4df66	refactor: enhance software-development skills with Hermes integration Improvements to all 5 skills adapted from obra/superpowers: - Restored anti-rationalization tables and red flags from originals (key behavioral guardrails that prevent LLMs from taking shortcuts) - Restored 'Rule of Three' for debugging (3+ failed fixes = question architecture, not keep fixing) - Restored Pattern Analysis and Hypothesis Testing phases in debugging - Restored 'Why Order Matters' rebuttals and verification checklist in TDD - Added proper Hermes delegate_task integration with real parameter examples and toolset specifications throughout - Added Hermes tool usage (search_files, read_file, terminal) for investigation and verification steps - Removed references to non-existent skills (brainstorming, finishing-a-development-branch, executing-plans, using-git-worktrees) - Removed generic language-specific sections (Go, Rust, Jest) that added bulk without agent value - Tightened prose — cut ~430 lines while adding more actionable content - Added execution handoff section to writing-plans - Consistent cross-references between the 5 skills	2026-03-03 04:08:56 -08:00
teknium1	0e1723ef74	Merge PR #137 : feat: Add Superpowers software development skills Authored by kaos35. Adds 5 software development workflow skills adapted from obra/superpowers: test-driven-development, systematic-debugging, subagent-driven-development, writing-plans, requesting-code-review.	2026-03-03 04:00:00 -08:00
BathreeNode	d10108f8ca	fix: rename misspelled directory 'fouth-edition' to 'fourth-edition' The ECMA schema directory was misspelled as 'fouth-edition' instead of 'fourth-edition'. Renamed all 4 files within to correct the path: - opc-contentTypes.xsd - opc-coreProperties.xsd - opc-digSig.xsd - opc-relationships.xsd	2026-03-03 09:21:28 +03:00
BathreeNode	8b520f9848	fix: rename misspelled directory 'fouth-edition' to 'fourth-edition' The ECMA schema directory was misspelled as 'fouth-edition' instead of 'fourth-edition'. Renamed all 4 files within to correct the path: - opc-contentTypes.xsd - opc-coreProperties.xsd - opc-digSig.xsd - opc-relationships.xsd	2026-03-03 09:20:47 +03:00
BathreeNode	a718aed1be	fix: rename misspelled directory 'fouth-edition' to 'fourth-edition' The ECMA schema directory was misspelled as 'fouth-edition' instead of 'fourth-edition'. Renamed all 4 files within to correct the path: - opc-contentTypes.xsd - opc-coreProperties.xsd - opc-digSig.xsd - opc-relationships.xsd	2026-03-03 09:20:07 +03:00
BathreeNode	5f29e7b63c	fix: rename misspelled directory 'fouth-edition' to 'fourth-edition' The ECMA schema directory was misspelled as 'fouth-edition' instead of 'fourth-edition'. Renamed all 4 files within to correct the path: - opc-contentTypes.xsd - opc-coreProperties.xsd - opc-digSig.xsd - opc-relationships.xsd	2026-03-03 09:17:13 +03:00
teknium1	63f5e14c69	docs: add comprehensive MCP documentation and examples - docs/mcp.md: Full MCP documentation covering prerequisites, configuration, transports (stdio + HTTP), security (env filtering, credential stripping), reconnection, troubleshooting, popular servers, and advanced usage - README.md: Add MCP section with quick config example and install instructions - cli-config.yaml.example: Add commented mcp_servers section with examples for stdio, HTTP, and authenticated server configs - docs/tools.md: Add MCP to Tool Categories table and MCP Tools section - skills/mcp/native-mcp/SKILL.md: Create native MCP client skill with full configuration reference, transport types, security, troubleshooting - skills/mcp/DESCRIPTION.md: Update category description to cover both native MCP client and mcporter bridge approaches	2026-03-02 18:52:33 -08:00
Aayush Chaudhary	c2d8d17285	feat(skills): add DuckDuckGo search skill as Firecrawl fallback	2026-03-02 09:08:59 +05:30
gizdusum	ec97f9ad1a	feat(skills): add Solana blockchain skill (converted from tool)	2026-02-28 23:39:39 +03:00
0xbyt4	10085041cf	feat: add ascii-art skill for creative text banners and art Unicode-based ASCII art generator skill with multiple styles (block, shadow, outlined, gradient, decorative frame), character palette reference, and usage examples. No external dependencies.	2026-02-28 22:33:10 +03:00
teknium1	0862fa96fd	refactor(domain-intel): streamline documentation and add CLI tool for domain intelligence operations	2026-02-27 23:53:24 -08:00
kaos35	2595d81733	feat: Add Superpowers software development skills Add 5 new skills for professional software development workflows, adapted from the Superpowers project ( obra/superpowers ): - test-driven-development: RED-GREEN-REFACTOR cycle enforcement - systematic-debugging: 4-phase root cause investigation - subagent-driven-development: Structured delegation with two-stage review - writing-plans: Comprehensive implementation planning - requesting-code-review: Systematic code review process These skills provide structured development workflows that transform Hermes from a general assistant into a professional software engineer with defined processes for quality assurance. Skills are organized under software-development category and follow Hermes skill format with proper frontmatter, examples, and integration guidance with existing skills.	2026-02-27 15:32:58 +01:00
FurkanL0	f9e05218ca	Create SKILL.md	2026-02-27 17:07:13 +03:00
FurkanL0	2ddda5da89	Create DESCRIPTION.md	2026-02-27 17:06:17 +03:00

1 2 3 4 5 ...

269 Commits