Features Overview

Hermes Agent는 기본 채팅을 훨씬 넘어서는 다양한 기능을 제공합니다. persistent memory, file-aware context, browser automation, voice conversation 같은 기능이 함께 동작해 Hermes를 강력한 autonomous assistant로 만듭니다.

Core

Tools & Toolsets - tools는 agent의 능력을 확장하는 함수입니다. web search, terminal execution, file editing, memory, delegation 등을 logical toolsets로 묶어 platform별로 켜거나 끌 수 있습니다.
Skills System - agent가 필요할 때 load하는 on-demand knowledge 문서입니다. skills는 token 사용량을 줄이기 위해 progressive disclosure pattern을 따르며 agentskills.io open standard와 호환됩니다.
Persistent Memory - session을 넘어 유지되는 bounded, curated memory입니다. Hermes는 MEMORY.md와 USER.md를 통해 사용자 preferences, projects, environment, 학습한 내용을 기억합니다.
Context Files - Hermes는 project 안의 context files(.hermes.md, AGENTS.md, CLAUDE.md, SOUL.md, .cursorrules)를 자동으로 찾아 load하고, 그 내용이 project 안에서 agent 행동을 조정합니다.
Context References - message에서 @ 뒤에 reference를 입력해 files, folders, git diffs, URLs를 직접 주입할 수 있습니다. Hermes는 reference를 inline으로 확장하고 content를 자동 append합니다.
Checkpoints - file을 수정하기 전에 working directory snapshot을 자동으로 만들어, 문제가 생겼을 때 /rollback으로 되돌릴 수 있는 safety net을 제공합니다.

Automation

Scheduled Tasks (Cron) - natural language 또는 cron expression으로 task를 예약합니다. jobs는 skills를 attach하고, 어느 platform으로든 result를 deliver하며, pause/resume/edit을 지원합니다.
Subagent Delegation - delegate_task tool은 격리된 context, 제한된 toolsets, 자체 terminal session을 가진 child agent instance를 spawn합니다. 기본적으로 3개의 subagent를 동시에 실행하며 설정으로 조정할 수 있습니다.
Code Execution - execute_code tool은 agent가 Hermes tools를 programmatically 호출하는 Python script를 작성하게 해, multi-step workflow를 sandboxed RPC execution을 통한 단일 LLM turn으로 압축합니다.
Event Hooks - lifecycle의 핵심 지점에서 custom code를 실행합니다. gateway hooks는 logging, alerts, webhooks를 처리하고 plugin hooks는 tool interception, metrics, guardrails를 담당합니다.
Batch Processing - 수백 또는 수천 개 prompts에 Hermes agent를 병렬 실행해 training data generation 또는 evaluation용 structured ShareGPT-format trajectory data를 생성합니다.

Media & Web

Voice Mode - CLI와 messaging platforms 전반에서 full voice interaction을 제공합니다. microphone으로 agent에게 말하고, spoken reply를 듣고, Discord voice channel에서 live voice conversation을 할 수 있습니다.
Browser Automation - Browserbase cloud, Browser Use cloud, local Chrome via CDP, local Chromium 등 여러 backend를 지원하는 full browser automation입니다. website navigation, form filling, information extraction을 수행합니다.
Vision & Image Paste - multimodal vision support입니다. clipboard image를 CLI에 붙여넣고 vision-capable model로 분석, 설명, 작업을 요청할 수 있습니다.
Image Generation - FAL.ai를 사용해 text prompt에서 image를 생성합니다. FLUX 2 Klein/Pro, GPT-Image 1.5/2, Nano Banana Pro, Ideogram V3, Recraft V4 Pro, Qwen, Z-Image Turbo 등 9개 model을 지원하며 hermes tools로 선택합니다.
Voice & TTS - 모든 messaging platform에서 text-to-speech output과 voice message transcription을 제공합니다. Edge TTS(free), ElevenLabs, OpenAI TTS, MiniMax, Mistral Voxtral, Google Gemini, xAI, NeuTTS, KittenTTS, Piper 등 10개 native provider와 custom command provider를 지원합니다.

Integrations

MCP Integration - stdio 또는 HTTP transport로 MCP server에 연결합니다. native Hermes tool을 작성하지 않고도 GitHub, databases, file systems, internal APIs의 external tools를 사용할 수 있습니다. per-server tool filtering과 sampling도 포함됩니다.
Provider Routing - 어떤 AI provider가 request를 처리할지 세밀하게 제어합니다. sorting, whitelists, blacklists, priority ordering으로 cost, speed, quality를 최적화할 수 있습니다.
Fallback Providers - primary model이 error를 만났을 때 backup LLM provider로 자동 failover합니다. vision, compression 같은 auxiliary tasks의 independent fallback도 포함됩니다.
Credential Pools - 같은 provider의 여러 API key에 calls를 분산합니다. rate limit 또는 failure 시 자동 rotation합니다.
Memory Providers - Honcho, OpenViking, Mem0, Hindsight, Holographic, RetainDB, ByteRover, Supermemory 같은 external memory backend를 연결해 built-in memory system을 넘어 cross-session user modeling과 personalization을 제공합니다.
API Server - Hermes를 OpenAI-compatible HTTP endpoint로 노출합니다. OpenAI format을 말하는 Open WebUI, LobeChat, LibreChat 등 어떤 frontend든 연결할 수 있습니다.
IDE Integration (ACP) - VS Code, Zed, JetBrains 같은 ACP-compatible editor 안에서 Hermes를 사용합니다. chat, tool activity, file diffs, terminal commands가 editor 안에 렌더링됩니다.
RL Training data - agent sessions에서 trajectory data를 생성해 reinforcement learning과 model fine-tuning에 사용할 수 있습니다. 대량 실행은 Batch Processing과 함께 사용합니다.

Customization

Personality & SOUL.md - 완전히 customizable한 agent personality입니다. SOUL.md는 system prompt에서 가장 먼저 들어가는 primary identity file이며, session별로 built-in 또는 custom /personality preset을 바꿔 쓸 수 있습니다.
Skins & Themes - CLI visual presentation을 커스터마이즈합니다. banner colors, spinner faces and verbs, response-box labels, branding text, tool activity prefix 등을 바꿀 수 있습니다.
Plugins - core code를 수정하지 않고 custom tools, hooks, integrations를 추가합니다. general plugins(tools/hooks), memory providers(cross-session knowledge), context engines(alternative context management) 세 가지 type이 있으며 unified hermes plugins interactive UI로 관리합니다.

Core​

Automation​

Media & Web​

Integrations​

Customization​

Core

Automation

Media & Web

Integrations

Customization