okikusan-public / articles / Hermes Agent — The execution engine for your Second Brain
JA EN
← articles
CONTENTS7 sections
  1. 01What Hermes is
  2. 02Overview / positioning
  3. 035 core features
  4. 04Using with Claude/Grok
  5. 05Why Obsidian integration matters
  6. 06Operational tips
  7. 07Known limits
  8. Summary
HERMES AGENT / 2026

Hermes Agent — positioning it as the execution engine of the Second Brain

An open-source self-improving autonomous AI agent released by Nous Research in February 2026. It runs a closed loop of task execution → auto-skill generation → long-term memory update, and keeps learning the user across sessions.

v0.14.0, driven by grok-4.3 via SuperGrok OAuth. The official Obsidian skill is enabled by default. For Obsidian power users, it's what turns a "static Second Brain" into a "dynamic execution agent."

Hermes Agent Nous Research Obsidian Second Brain grok-4.3 OSS / MIT 2026.05.18 · 6 min read
FIG.0 — SELF-IMPROVING LOOP
// HERMES = a resident agent with a self-improving loop ▍ HERMES OSS · MIT · v0.14.0 Execute task Forced tool use Generate skill Auto SKILL.md Update memory MEMORY / USER / SQLite Next task Better than last time // Learns across sessions. Teach once, never forgets.
Task execution → skill generation → memory update → next task. The self-improving loop that spans sessions is the heart of Hermes. Teach it once, it never forgets — and it gets better on its own.
▍ THE PROMISE

Hermes isn't just a chatbot — it's an AI agent with "self-improving loop × resident execution." Combine it with Obsidian and you get "a Second Brain that grows with you."

▍ SOURCES
▍ FROM THE FIELD — 5/18 hands-on run

This article is grounded in actually running Hermes v0.14.0 on a local Mac on 2026-05-18, wired up to an Obsidian Vault. Confirmed working with the defaults: grok-4.3 driven via SuperGrok OAuth with zero API-key billing, Vault read/write through the official Obsidian skill, 3-way parallel sub-agent execution via delegate_task in about 19.5 seconds, native gateways for Telegram/Discord/Slack/LINE/Signal/Teams and more, and reuse for Codex/Aider/Cline through hermes proxy. The feature notes below are written on top of that hands-on baseline.

▍ TL;DR
§ 01 CONTEXT

What is Hermes Agent

For individuals who use Obsidian heavily and are familiar with leading AI like Claude or Grok, Hermes Agent is positioned as "the execution engine of the Second Brain."

From a Karpathy-style LLM Wiki perspective, Obsidian is "the static layer that humans read and organize," while Hermes is "the dynamic layer that reads that knowledge to auto-execute, update, and learn." They complement each other.

▍ In one line

Hermes is an AI that "teach once, never forgets" and "gets better on its own." A resident twin that runs on cron by itself and grows new skills every time it's used.

§ 02 OVERVIEW

Overview and positioning

Hermes Agent is an open-source self-improving autonomous AI agent released by Nous Research in February 2026 (MIT license). Tagline: "personal AI infrastructure that grows with you."

Status as of May 2026:

It's not just a chatbot — it has a closed loop of task execution → auto-skill generation → long-term memory update, and is structured to keep learning the user across sessions.

§ 03 FEATURES

5 core features

>3-1Auto-generated reusable skills

Repeated work is automatically saved and refined as a SKILL.md. The second time you give the same instruction, Hermes recognizes "ah, this is the same pattern as before" and leaves behind a polished skill file. From the next time, a short instruction reproduces the same behavior.

>3-2Persistent memory mechanism

A combination of MEMORY.md / USER.md / SQLite realizes cross-session context retention and accumulation of user preferences. It remembers "you said this last time" and "this user writes like this."

FIG.3 — 3 PERSISTENCE LAYERS
// How Hermes never forgets — 3 persistence layers ▸ AUTO-GENERATED SKILL.md
Repeated steps auto-saved as a skill. Next time, brief input replays it.
▸ PERSISTENT CONTEXT MEMORY.md / USER.md
Long-term memory across sessions, plus accumulated user preferences.
▸ STRUCTURED LOG SQLite
Execution history and task state, structured for search and reference.
// All three layers working together = "teach once, never forgets" + "gets better on its own."
Three persistent layers interoperate to deliver "teach once, never forgets" and "gets better on its own." Skills / memory / history each have their role.

>3-3Strict enforcement of tool use

Tool calls are mandatory by design — "draw up a plan and stop there" isn't allowed. It autonomously combines the Obsidian skill, browser operations, terminal execution, and more to complete tasks. This is the big practical difference from Claude alone.

>3-4cron / webhook support

As a resident agent, it natively supports scheduled execution and external event triggers. Automations like "check X every morning" or "process Y when it arrives in Gmail" are wired up out of the box.

>3-5Official Obsidian skill

Reads and writes the Vault directly, enabling two-way integration with your knowledge base. The standard pattern becomes: Hermes reads notes piled up in daily/ and writes results back to knowledge/.

>3-6Parallel multi-agent execution (delegate_task)

The built-in delegate_task tool lets a parent agent spin up child agents in parallel, each with its own independent context (ThreadPoolExecutor under the hood). In the 5/18 hands-on run, three Vault notes of different genres were dispatched to three children to summarize: the run completed in roughly 19.5 seconds across all three. Because children never see the parent's conversation history, context-cost is zero on the parent side. Per-child model overrides are supported, so a mixed-routing setup (light = grok-4-fast, complex = grok-4.3, review = Claude Opus) works out of the box. Children cannot call delegate_task themselves (no grandchildren), keeping things safe.

The value of parallelism isn't only speed — it's the "integrated meta-extraction" that only emerges from the parent's vantage point over multiple child outputs. The subagent-driven-development skill (from the OSS ecosystem) also ships a plan-driven workflow with built-in two-stage review (spec compliance / quality).

§ 04 VS CLAUDE/GROK

Differences from Claude / Grok and combined-use patterns

Claude (especially Claude Code) excels at high-performance instant reasoning, large-context processing, and high-quality code generation. Hermes Agent, on the other hand, shines at residency, automation, and self-growth.

FIG.1 — CLAUDE × HERMES
// Session-based worker × resident orchestrator CLAUDE / GROK High-power worker Instant reasoning Large context High-quality code gen Per session Human-driven HERMES Resident orchestrator cron / webhook Auto skill creation Persistent memory 24/7 uptime Runs in background × ▍ Combined pattern Hermes runs cron + accumulates memory Claude handles complex reasoning / high-quality output Obsidian Vault as shared knowledge source // Not exclusive — use them together
Claude/Grok = high-performance per-session workers; Hermes = a resident orchestrator running 24/7. Different roles, so combining them is the practical answer.

The typical pattern adopted by many advanced users:

Current Hermes, running on SuperGrok OAuth + grok-4.3, combines Grok's strong reasoning as a base with the autonomy of a resident agent.

§ 05 OBSIDIAN

Why Obsidian integration matters

Obsidian is "the Second Brain that humans read and organize," Hermes is "the twin that reads that knowledge to auto-execute, update, and learn." Perfectly complementary roles.

FIG.2 — OBSIDIAN × HERMES
// Obsidian = static brain. Hermes = dynamic twin. Human Curate & organize ▸ OBSIDIAN Static Second Brain Vault daily / knowledge ▸ HERMES Dynamic execution agent SKILL.md MEMORY · USER · SQLite Read → ← Write back results Knowledge compounds: store in Obsidian → Hermes uses it → results back to the Vault
Humans organize Obsidian; Hermes reads the Vault, executes, and writes results back to the Vault. A pipeline where knowledge compounds.

Three reasons the integration matters:

▍ The execution layer of the Karpathy-style LLM Wiki

As organized in the Vault's knowledge/第二の脳 note, positioning Hermes as the "execution layer" of the Karpathy-style LLM Wiki turns Obsidian's static knowledge base into a dynamic AI agent platform.

▍ Anmoku-kou — the material worth piling into Obsidian

What accumulates in Obsidian isn't only finished tacit knowledge. What really matters is everything before it becomes knowledge — the wobbles in your thinking: hunches, uneasy reactions, half-baked hypotheses, hesitations, habits of judgment. We call this "anmoku-kou" (暗黙考), a coined term meaning roughly "implicit thoughts." Hermes reads that material, organizes it, and turns it into execution — acting as your twin.

Obsidian = a place to pile up thoughts / Hermes = an entity that runs on those thoughts. Leaving rough notes and contradictory hypotheses untouched actually makes the AI twin better in the long run.

§ 06 OPS

Operational tips

§ 07 LIMITS

Known limits

▍ THE WORLDVIEW — A Second Brain that grows with you

Obsidian is the static brain; Hermes is the dynamic twin

Hermes Agent is a platform that persists, automates, and evolves Claude's or Grok's capabilities into something tailored to you, on your own server. Combine it with Obsidian and you really do get "a Second Brain that grows with you."

As an individual — an Obsidian power user who already uses Claude / Grok daily — it's strongly recommended as the next step.

  • Accumulate in Obsidian (the static Second Brain)
  • Hermes reads and executes (the dynamic twin)
  • Write results back to the Vault so knowledge compounds (the execution → knowledge loop)

The moment these three line up, you move past "writing blog posts" and "taking notes" into a state where your brain is always running on its own.