Gormes

Cross-Project Feature Map & Long-Term Plan for Gormes-Agent

Executive Summary

Analysis of all 12 open-source projects in workspace-mineru, completed on 2026-04-30:

  • hermes-agent (Python upstream - 519K lines CLI, 702K lines agent runner)
  • honcho (Python memory/session - Peer paradigm, 3-agent memory system)
  • gbrain (TypeScript memory/runtime - Brain-first knowledge graph, minions queue)
  • browser-harness (Python browser automation - CDP, Camofox, daemon lifecycle)
  • go-browser-harness (Go browser automation - Chromedp, stateless actions)
  • mercury-agent (TypeScript CLI/Telegram - Soul-driven, permission-hardened)
  • space-agent (JavaScript browser-first - Agent reshapes interface, WebLLM)
  • picoclaw (Go lightweight agent - 18+ channels, 30+ providers, MCP)
  • go-agent-os references (8 donor repos - patterns for OAuth, retry, tools, state machines)

Gormes-agent current parity: ~20-30% of hermes-agent features


Master Feature Matrix

Category 1: Provider Adapters

Provider | Hermes | Gormes Status | Gap Severity
Anthropic | anthropic_adapter.py (1,888) | anthropic_client.go | Complete
OpenAI/ChatGPT | chat_completions.py (20,659) | http_client.go | Complete
Codex Responses | codex_responses_adapter.py (999) | codex_responses_adapter.go | Complete
AWS Bedrock | bedrock_adapter.py (1,264) | bedrock_*.go | Complete
Gemini CloudCode | gemini_cloudcode_adapter.py (903) | gemini_cloudcode.go | Complete
Google Code Assist | google_code_assist.py (452) | google_code_assist.go | Complete
OpenRouter | openrouter_client.py (1,091) | openrouter.go | Complete
Google OAuth | google_oauth.py (1,048) | google_oauth_state.go | Partial
Azure Foundry | Via azure_detect.py (11,951) | azure_foundry_*.go | Partial
LM Studio | lmstudio_reasoning.py (48) | Not implemented | Gap
Moonshot/Kimi | moonshot_schema.py | Schema only | Gap
xAI/Grok | xai_http.py (349) | Not implemented | Gap
DeepSeek | Via reasoning isolation | Partial | Gap

Learnings from other projects:

  • Picoclaw: 30+ providers with unified interface - could adopt pattern for provider registration
  • Mercury: Provider fallback chain (DeepSeek → OpenAI → Anthropic → Grok → Ollama) - resilient routing
  • GBrain: Multi-provider LLM abstraction with fallback models per provider
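
The registration and fallback patterns above could be combined roughly as follows. This is a minimal Go sketch, not code from Picoclaw, Mercury, or Gormes: the `Provider` interface, the `Chain` type, and the `stub` test double are hypothetical names introduced for illustration. The chain tries providers in order and returns the first successful reply.

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

// Provider is a hypothetical minimal interface; a real adapter would also
// carry streaming, tool calls, and usage accounting.
type Provider interface {
	Name() string
	Complete(ctx context.Context, prompt string) (string, error)
}

// Chain tries each provider in order and returns the first successful reply.
type Chain struct {
	providers []Provider
}

func NewChain(ps ...Provider) *Chain { return &Chain{providers: ps} }

// Complete returns the reply and the name of the provider that produced it.
// If every provider fails, the individual errors are joined into one.
func (c *Chain) Complete(ctx context.Context, prompt string) (reply, used string, err error) {
	var errs []error
	for _, p := range c.providers {
		// Stop early if the caller cancelled or the deadline passed.
		if ctxErr := ctx.Err(); ctxErr != nil {
			return "", "", ctxErr
		}
		out, callErr := p.Complete(ctx, prompt)
		if callErr == nil {
			return out, p.Name(), nil
		}
		errs = append(errs, fmt.Errorf("%s: %w", p.Name(), callErr))
	}
	if len(errs) == 0 {
		return "", "", errors.New("no providers configured")
	}
	return "", "", errors.Join(errs...)
}

// stub is a test double standing in for a real provider adapter.
type stub struct {
	name string
	fail bool
}

func (s stub) Name() string { return s.name }

func (s stub) Complete(_ context.Context, prompt string) (string, error) {
	if s.fail {
		return "", errors.New("unavailable")
	}
	return s.name + " answered: " + prompt, nil
}

// tryAll is a demo helper: it builds a chain and sends a fixed prompt.
func tryAll(ps ...Provider) (reply, used string, err error) {
	return NewChain(ps...).Complete(context.Background(), "ping")
}

func main() {
	reply, used, err := tryAll(stub{"deepseek", true}, stub{"openai", false}, stub{"anthropic", false})
	fmt.Println(reply, used, err)
}
```

A production chain would probably distinguish retryable errors (rate limits, timeouts) from permanent ones (bad credentials) before falling through; the sketch treats every error as a reason to try the next provider.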

Category 2: Browser Automation

Feature | Hermes (Python) | Go Harness | Gormes | Priority
Navigate/Snapshot/Click | Complete | Complete | Complete | Done
Type/Scroll/Back/Press | Complete | Complete | Complete | Done
Console/Vision/CDP | Complete | Complete | Complete | Done
Dialog/Get Images | Complete | Complete | Complete | Done
HTTP GET (no browser) | Complete | Gap | Gap | Medium
Daemon lifecycle | Complete | Gap | Gap | High
Profile management | Complete | Gap | Gap | Medium
Cloud browser (Browser Use) | Complete | Gap | Gap | Low
Tab enumeration/switch | Complete | Gap | Gap | Medium
Iframe targeting | Complete | Gap | Gap | Low
Event drain | Complete | Gap | Gap | Medium
Doctor/diagnostics | Complete | Gap | Gap | High

Learnings:

  • Space-agent: Browser-first architecture with floating windows, DOM capture with typed refs ([link 12]), nested iframe support - completely different paradigm
  • Hermes: 2,991-line browser_tool.py + 1,362-line browser_supervisor.py + 563-line browser_cdp_tool.py

Category 3: Tools Registry

Tool Category | Hermes | Gormes | Gap
Core tools | ~50 tools | ~15 tools | Major
Browser automation | 4 files, ~4,500 lines | 2 files, ~500 lines | Large
File operations | 2 files, ~102,000 lines | 1 file, ~200 lines | Massive
Terminal/Shell | terminal_tool.py (2,307) | terminal_tool.go | Medium
Code execution | code_execution_tool.py (1,609) | Not implemented | Gap
Web scraping | web_tools.py (89,105) | Partial | Massive
MCP client | mcp_tool.py (3,140) + OAuth (22K) | Partial | Large
Delegation/Subagents | delegate_tool.py (2,531) | subagent/ minimal | Large
Skills hub | skills_hub.py (118,874!) | skills/ basic | Massive
Cron/Scheduler | cronjob_tools.py (26,525) | cron/ basic | Large
TTS/Voice | voice_mode.py (38,753) | Partial | Large
Image generation | image_generation_tool.py (1,002) | image_generation.go | Complete
Vision | vision_tools.py (31,333) | Not implemented | Gap
RL Training | rl_training_tool.py (1,396) | Not implemented | Gap
Process registry | process_registry.py (61,200) | Basic | Massive
Checkpoint manager | checkpoint_manager.py (31,551) | Basic | Large
Approval workflow | approval.py (52,681) | Not implemented | Gap

Learnings from other projects:

  • Mercury: 31 built-in tools with permission-hardened execution (shell blocklist, filesystem scoping)
  • Space-agent: Skill system with metadata-driven placement (metadata.when, metadata.loaded, metadata.placement)
  • Picoclaw: Cron, web search (DuckDuckGo, Baidu, Tavily, Brave, Perplexity, SearXNG), filesystem, shell, spawn
  • GBrain: Cathedral II code navigation (call-graph edges, two-pass retrieval)

Category 4: Memory Systems

Feature | Hermes | Honcho | GBrain | Mercury | Gormes
Storage | SQLite | PostgreSQL+pgvector | SQLite+FTS5 / PostgreSQL | SQLite+FTS5 | SQLite (Goncho)
Vector search | Not core | ✅ HNSW cosine | ✅ Hybrid (RRF+cosine) | FTS5 keyword | Not yet
Typed memories | Not core | ✅ 10 types | Pages+Chunks+Links | ✅ 10 types | Session-based
Auto-extraction | Not core | ✅ Deriver agent | Auto-link extraction | ✅ Post-conversation | Not yet
Conflict resolution | Not core | ✅ Confidence-based | N/A | ✅ Confidence+recency | Not yet
Auto-pruning | Not core | Not core | Maintenance cycle | ✅ 21-day stale | Not yet
Peer paradigm | Not core | ✅ Unified users+agents | Not core | Not core | Not yet
Knowledge graph | Not core | Observation links | ✅ Auto-wiring (zero-LLM) | Not core | Not yet
Session context | ✅ Combined | ✅ Tiered | ✅ Top-5 injection | ✅ Partial

Learnings:

  • Honcho: Three-agent memory system (Deriver/Dialectic/Dreamer) with observation levels (explicit/deductive/inductive/contradiction)
  • GBrain: Brain-first lookup (5-step before external API), compiled truth + timeline pattern, tiered enrichment
  • Mercury: Second Brain with 10 typed memories, hourly heartbeat consolidation, confidence/durability scoring
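
The confidence-plus-recency conflict resolution listed for Mercury can be illustrated with a short Go sketch. The `Memory` struct and its field names are assumptions made for this example, not Mercury's or Goncho's schema; the rule implemented is "higher confidence wins, and on a tie the newer record wins".

```go
package main

import (
	"fmt"
	"time"
)

// Memory is a hypothetical typed memory record; field names are illustrative.
type Memory struct {
	Kind       string // e.g. "preference" or "goal"
	Key        string // what the memory is about
	Value      string
	Confidence float64 // assumed range 0.0 to 1.0
	UpdatedAt  time.Time
}

// Resolve picks the winner between two memories that share Kind and Key:
// higher confidence wins; on equal confidence the more recent record wins;
// on a full tie the existing record is kept.
func Resolve(existing, incoming Memory) Memory {
	switch {
	case incoming.Confidence > existing.Confidence:
		return incoming
	case incoming.Confidence < existing.Confidence:
		return existing
	case incoming.UpdatedAt.After(existing.UpdatedAt):
		return incoming
	default:
		return existing
	}
}

func main() {
	now := time.Now()
	old := Memory{Kind: "preference", Key: "editor", Value: "vim", Confidence: 0.6, UpdatedAt: now.Add(-48 * time.Hour)}
	fresh := Memory{Kind: "preference", Key: "editor", Value: "helix", Confidence: 0.6, UpdatedAt: now}
	fmt.Println(Resolve(old, fresh).Value)
}
```

Keeping the existing record on a full tie avoids needless rewrites; a real store might instead merge the two or record the contradiction, as Honcho's observation levels suggest.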

Category 5: Gateway/Channels

Channel | Hermes | Picoclaw | Gormes | Status
Discord | 173K lines | Complete
Telegram | 141K lines | Complete
Slack | 102K lines | ⚠️ Basic | Partial
Feishu/Lark | 192K lines | ⚠️ Basic | Partial
WhatsApp | 45K lines | ⚠️ Basic | Partial
WeCom | 65K lines | ⚠️ Basic | Partial
Yuanbao | 185K lines | ⚠️ Basic | Partial
Matrix | 105K lines | Gap
Signal | 50K lines | Gap
SMS | 14K lines | Gap
Email | 23K lines | Gap
DingTalk | 56K lines | ⚠️ Basic | Partial
Mattermost | 28K lines | Gap
HomeAssistant | 16K lines | ⚠️ Basic | Partial
QQ | Partial | Gap
BlueBubbles | 34K lines | Gap
LINE | Partial | Gap
VK | Partial | Gap
IRC | Partial | Gap
Webhook | 30K lines | Partial | Partial

Learnings:

  • Picoclaw: 18+ channels as donor repo for Go channel-edge work. Gateway Donor Map explicitly documents adaptation patterns.
  • Mercury: Multi-user Telegram org model (admin/member roles, pairing codes)

Category 6: CLI/Operator Surface

Feature | Hermes | Mercury | Gormes | Gap
Auth system | auth.py (4,744) | Basic | Partial | Medium
Config management | config.py (4,548) | YAML | config.go | Partial
Model switching | model_switch.py (1,588) | Provider fallback | Partial | Medium
Setup wizard | setup.py (3,488) | None | install.sh | Partial
Diagnostics | doctor.py (1,390) | None | doctor | Partial
Profiles | profiles.py (1,167) | None | Partial | Gap
Backup/restore | backup.py (926) | None | Not yet | Gap
Logs viewing | logs.py (13,346) | None | Not yet | Gap
Web server | web_server.py (3,195) | None | Not yet | Gap
Tips system | tips.py (27,607) | None | tips.go | Partial
Status display | status.py (23,006) | None | Partial | Partial
Voice commands | voice.py (21,085) | None | Not yet | Gap
Cron commands | cron.py (10,839) | Scheduler | cron/ | Partial
Skills config | skills_config.py (7,151) | Not core | Partial | Gap
Plugin commands | plugins_cmd.py (1,280) | Not core | Partial | Gap
Completion | completion.py (10,916) | Not core | Partial | Gap

Learnings:

  • Mercury: Token budget system (daily enforcement, auto-concise at 70%), loop detection (5 types), permission manifest
  • Picoclaw: Desktop (WebUI), Headless (TUI), Android APK, Docker, Termux deployment modes
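
Mercury's token budget (daily enforcement, auto-concise at 70%) might look something like the following in Go. This is a sketch under stated assumptions: the `Budget` type and its methods are hypothetical, the day boundary is taken from the local clock, and "at 70%" is read as "once usage reaches 70% of the daily limit".

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// Budget is a hypothetical daily token budget guarded by a mutex.
type Budget struct {
	mu    sync.Mutex
	limit int
	used  int
	day   string           // calendar day (YYYY-MM-DD) the counter applies to
	now   func() time.Time // injectable clock, so callers can control time
}

// NewBudget creates a budget; a nil clock falls back to time.Now.
func NewBudget(limit int, now func() time.Time) *Budget {
	if now == nil {
		now = time.Now
	}
	return &Budget{limit: limit, now: now}
}

// rollover resets the counter when the calendar day changes. Caller holds mu.
func (b *Budget) rollover() {
	if today := b.now().Format("2006-01-02"); today != b.day {
		b.day, b.used = today, 0
	}
}

// Spend records tokens and reports whether the spend was allowed.
// A spend that would exceed the daily limit is rejected and not recorded.
func (b *Budget) Spend(tokens int) bool {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.rollover()
	if tokens < 0 || b.used+tokens > b.limit {
		return false
	}
	b.used += tokens
	return true
}

// Concise reports whether usage has reached 70% of the daily limit,
// the point at which the agent would switch to shorter replies.
func (b *Budget) Concise() bool {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.rollover()
	return b.used*10 >= b.limit*7
}

// Used returns the tokens spent so far today.
func (b *Budget) Used() int {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.rollover()
	return b.used
}

func main() {
	b := NewBudget(1000, nil)
	fmt.Println(b.Spend(650), b.Concise())
	fmt.Println(b.Spend(100), b.Concise())
	fmt.Println(b.Spend(500), b.Used())
}
```

Integer arithmetic (`used*10 >= limit*7`) keeps the threshold check free of floating-point rounding.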

Category 7: Security & Safety

Feature | Hermes | Mercury | Gormes | Gap
Shell blocklist | Partial | ✅ 36+ patterns | Not yet | Critical
Filesystem scoping | Partial | ✅ Folder-level | Not yet | Critical
Permission approval | Basic | ✅ Inline y/n/always | Not yet | High
SSRF guard | url_safety.py (9,429) | Basic | browser_contract.go | Partial
Path security | path_security.py (1,322) | Basic | Not yet | Medium
Credential redaction | redact.py (392) | Basic | Partial | Medium
Website policy | website_policy.py (9,786) | Not core | Not yet | Gap
Tirith security | tirith_security.py (26,121) | Not core | Not yet | Gap
OSV vulnerability check | osv_check.py (4,925) | Not core | Not yet | Gap

Learnings:

  • Mercury: Most mature permission system - shell blocklist, cwdOnly restriction, auto-approved patterns, YAML manifest
  • Plandex (reference): Provider error classification with Retry-After parsing
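
A shell blocklist and folder-level scoping of the kind credited to Mercury above could start from something like this Go sketch. The three regular expressions are illustrative only and are not Mercury's 36+ patterns; `CheckCommand` and `InScope` are hypothetical names. The scope check is lexical: it does not resolve symlinks, which a hardened version would need to do.

```go
package main

import (
	"fmt"
	"path/filepath"
	"regexp"
	"strings"
)

// blocklist holds a few illustrative patterns; a real list would be longer
// and maintained alongside tests for each pattern.
var blocklist = []*regexp.Regexp{
	regexp.MustCompile(`\brm\s+(-\w+\s+)*/(\s|$)`),               // rm targeting the filesystem root
	regexp.MustCompile(`\bmkfs(\.\w+)?\b`),                       // formatting a filesystem
	regexp.MustCompile(`\bcurl\b[^|]*\|\s*(sudo\s+)?(ba)?sh\b`), // piping a download into a shell
}

// CheckCommand returns an error naming the first blocklist pattern that matches.
func CheckCommand(cmd string) error {
	for _, re := range blocklist {
		if re.MatchString(cmd) {
			return fmt.Errorf("command blocked by pattern %q", re.String())
		}
	}
	return nil
}

// InScope reports whether target, interpreted relative to root when it is
// not absolute, stays inside root after lexical cleaning.
func InScope(root, target string) (bool, error) {
	absRoot, err := filepath.Abs(root)
	if err != nil {
		return false, err
	}
	if !filepath.IsAbs(target) {
		target = filepath.Join(absRoot, target)
	}
	rel, err := filepath.Rel(absRoot, filepath.Clean(target))
	if err != nil {
		return false, err
	}
	escapes := rel == ".." || strings.HasPrefix(rel, ".."+string(filepath.Separator))
	return !escapes, nil
}

func main() {
	fmt.Println(CheckCommand("rm -rf /"))
	fmt.Println(CheckCommand("ls -la"))
	fmt.Println(InScope("/work", "notes/a.txt"))
	fmt.Println(InScope("/work", "../etc/passwd"))
}
```

Pattern blocklists are easy to bypass through quoting or indirection, so they are best treated as one layer beneath an approval prompt rather than as the only guard.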

Category 8: Personality/Soul Systems

Feature | Hermes | Mercury | Space | Gormes
Soul files | Not core | ✅ soul.md, persona.md, taste.md, heartbeat.md | personality.system.include.md | Not yet (Phase 6)
Identity model | Not core | Human analogy | User-editable file | Not yet
Loading | Not core | User-owned markdown | Plain text include | Not yet

Learnings:

  • Mercury’s soul system: soul=heart, persona=face, taste=palate, heartbeat=breathing
  • Space’s approach: single personality include file (simpler stepping stone)

Category 9: Scheduling & Background Tasks

Feature | Hermes | Mercury | GBrain | Gormes
Cron scheduling | cron/scheduler.py (58,318) | ✅ Cron + delayed | Not core | cron/ basic
Job queue | Postgres-backed | YAML-persisted | ✅ Minions (BullMQ-inspired) | Not yet
Task persistence | DB | YAML | Postgres rows | Not yet
Fan-out | Not core | Not core | ✅ N children + aggregator | Not yet
DAG dependencies | Not core | Not core | ✅ Parent-child with policies | Not yet
Token accounting | Not core | Not core | ✅ Per-job tracking | Not yet

Learnings:

  • GBrain Minions: Postgres-native job queue, BullMQ-inspired, zero infra, stall detection, retry with backoff, supervisor auto-restart
  • Mercury: Configurable heartbeat interval, episodic prune + second brain consolidate

Category 10: Unique Paradigms from Other Projects

Project | Unique Feature | Relevance to Gormes
Space-agent | Browser-first runtime (agent lives in browser) | Different paradigm - agent as peer to frontend vs backend orchestrator
Space-agent | Agent reshapes interface (builds pages/tools/widgets) | Could inform web dashboard capabilities
Space-agent | Layered customware (L0 firmware → L1 group → L2 user) | Multi-tenant configuration model
Space-agent | Git-backed time travel (rollback/revert) | Useful for workspace state recovery
Space-agent | WebLLM + HuggingFace (browser-side inference) | Novel - requires browser runtime
GBrain | Zero-LLM knowledge graph wiring (regex-based auto-links) | HIGH - significantly reduces LLM calls for entity resolution
GBrain | Cathedral II code navigation (call-graph edges) | HIGH - for code-aware agent capabilities
GBrain | Thin harness, fat skills (intelligence in skills not runtime) | Architectural philosophy alignment
GBrain | Fail-improve loop (logs regex failures, generates better patterns) | Self-improving deterministic classifiers
Honcho | Peer paradigm (users AND agents are “Peers”) | Multi-agent interaction model
Honcho | Three-agent memory (Deriver/Dialectic/Dreamer) | Memory quality differentiation
Honcho | Observation levels (explicit/deductive/inductive/contradiction) | Richer memory than simple key-value
Mercury | Permission-hardened shell (36+ blocklist patterns) | Critical - foundational safety
Mercury | Second Brain (10 typed memories with conflict resolution) | High - structured memory UX
Mercury | Loop detection (5 types: hard/failing/text/no-action/same-tool) | High - runaway loop prevention
Mercury | Token budget + auto-concise | Cost control
Picoclaw | Ultra-lightweight (<10MB RAM, $10 hardware) | Deployment target diversity
Picoclaw | Self-bootstrapping (95% AI-generated) | Development methodology

Strategic Recommendations for Gormes

Immediate Actions (Next 30 Days)

  1. Permission Hardening (from Mercury)

    • Port shell blocklist (36+ patterns)
    • Implement filesystem scoping (folder-level read/write)
    • Add permission approval UX (inline y/n/always)
    • Rationale: Gormes currently has no shell blocklist, filesystem scoping, or approval flow; these are foundational for safe agent operation
  2. Provider Completion

    • Complete DeepSeek/Kimi reasoning isolation
    • Implement LM Studio provider
    • Add xAI/Grok provider
    • Rationale: Provider parity is on the critical path for a Python-free normal agent turn
  3. Browser Harness Gaps

    • Implement go-browser-harness doctor subcommand
    • Add daemon lifecycle (start/stop/ensure)
    • Rationale: Core browser tools work but lack production lifecycle

Short-Term (Next 90 Days)

  1. Loop Detection (from Mercury)

    • Port 5-type loop detector (~200 lines of TypeScript)
    • Hard loop, failing loop, text repetition, no-action, same-tool
    • Rationale: Runaway loops are a real production problem
  2. Structured Memory Enhancement (from Mercury + Honcho)

    • Extend Goncho with typed memory categories (start with 6: identity, preference, goal, habit, episode, reflection)
    • Add confidence/durability scoring
    • Implement conflict resolution (confidence wins, equal → newer)
    • Rationale: Current Goncho is session-based; structured memory is major UX improvement
  3. Skill Metadata (from Space-agent)

    • Add metadata.when, metadata.loaded, metadata.placement to SKILL.md schema
    • Implement hierarchical routing skill pattern
    • Rationale: More granular skill activation control
  4. Native Prompt Builder (Hermes parity)

    • Port agent/prompt_builder.py to Go
    • Context files, model-specific guidance, skills snapshots, memory/session-search assembly
    • Rationale: Critical path for Python-free normal agent turn
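
To make the loop-detection item concrete, here is a minimal Go sketch of one of the five detector types, the same-tool case. The source does not define the five types precisely, so the definition used here (the same tool called with identical arguments a fixed number of times in a row) and the names `Call` and `Detector` are assumptions for illustration, not Mercury's implementation.

```go
package main

import "fmt"

// Call is a hypothetical record of one tool invocation.
type Call struct {
	Tool string
	Args string // serialized arguments, compared verbatim
}

// Detector flags a run of identical consecutive tool calls.
type Detector struct {
	limit  int
	last   Call
	streak int
}

// NewDetector creates a detector that trips after limit identical calls in a row.
func NewDetector(limit int) *Detector { return &Detector{limit: limit} }

// Observe records a call and reports whether the current run of identical
// calls has reached the limit. Any different call resets the run.
func (d *Detector) Observe(c Call) bool {
	if d.streak > 0 && c == d.last {
		d.streak++
	} else {
		d.last, d.streak = c, 1
	}
	return d.streak >= d.limit
}

func main() {
	d := NewDetector(3)
	read := Call{Tool: "read_file", Args: `{"path":"main.go"}`}
	for i := 0; i < 3; i++ {
		fmt.Println(d.Observe(read))
	}
}
```

The other four types would presumably track different signals (for example repeated failures or repeated assistant text) but could share the same streak-counting shape.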

Medium-Term (Next 6 Months)

  1. Context Compression Reconciliation (Hermes parity)

    • Reconcile with upstream 5006b220 changes
    • Tool-result pruning, protected head/tail invariants
    • Rationale: Compression behavior has drifted from upstream
  2. GBrain Patterns (from GBrain)

    • Zero-LLM knowledge graph wiring (regex-based auto-links)
    • Brain-first lookup (5-step before external API)
    • Rationale: Significantly reduces LLM calls, compounds knowledge
  3. Token Budget System (from Mercury)

    • Daily budget tracking with mutex-protected counter
    • Auto-concise at 70% threshold
    • Budget commands (/budget)
    • Rationale: Cost control for production deployments
  4. Credential + OAuth (Hermes parity)

    • XDG-scoped token storage
    • Google OAuth flows
    • Rationale: Required for full provider parity
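
The zero-LLM knowledge graph wiring listed under GBrain Patterns relies on regex-based link extraction. The Go sketch below shows the general idea under an assumption: it treats wiki-style `[[Target]]` and `[[Target|label]]` markers as links. GBrain's actual patterns are not given in this document, and `Edge` and `ExtractLinks` are hypothetical names.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// wikiLink matches [[Target]] and [[Target|label]]; the capture group is the target.
var wikiLink = regexp.MustCompile(`\[\[([^\[\]|]+)(?:\|[^\[\]]*)?\]\]`)

// Edge is a directed link from one page to another.
type Edge struct {
	From string
	To   string
}

// ExtractLinks returns one edge per distinct link target found in text,
// in order of first appearance, skipping self-links. No LLM call is involved.
func ExtractLinks(page, text string) []Edge {
	seen := map[string]bool{}
	var edges []Edge
	for _, m := range wikiLink.FindAllStringSubmatch(text, -1) {
		target := strings.TrimSpace(m[1])
		if target == "" || target == page || seen[target] {
			continue
		}
		seen[target] = true
		edges = append(edges, Edge{From: page, To: target})
	}
	return edges
}

func main() {
	text := "Met [[Bob]] at [[Acme Corp|Acme]]; [[Bob]] will follow up."
	for _, e := range ExtractLinks("Alice", text) {
		fmt.Printf("%s -> %s\n", e.From, e.To)
	}
}
```

Because extraction is deterministic, misses can be logged and turned into new patterns, which is the fail-improve loop noted for GBrain in the paradigms table.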

Long-Term (Next 12 Months)

  1. Cathedral II Code Navigation (from GBrain)

    • Call-graph edges, two-pass retrieval
    • 5 commands: code-callers, code-callees, code-def, code-refs, query --near-symbol
    • Rationale: Code-aware agent capabilities
  2. Minions Job Queue (from GBrain)

    • Postgres-native BullMQ-inspired queue
    • Parent-child DAGs, stall detection, retry with backoff
    • Rationale: Background task durability, subagent coordination
  3. Soul/Personality System (Phase 6)

    • User-owned markdown personality files
    • soul.md (heart), persona.md (face), taste.md (palate), heartbeat.md (breathing)
    • Rationale: Planned for gormes Phase 6; Mercury’s implementation is reference
  4. Web Dashboard (Hermes parity)

    • TypeScript/React web UI (Hermes has full web/ directory)
    • Session management, skills, chat, config, logs
    • Rationale: Hermes has a 191K-line TUI gateway server
  5. Multi-Memory Backends (from Honcho)

    • Turbopuffer, LanceDB vector store options
    • Redis caching layer
    • Rationale: Scale beyond single-node SQLite

Not Recommended for Adoption

  • Space’s browser-first architecture (fundamentally different runtime model)
  • Space’s WebLLM (requires browser runtime)
  • Mercury’s Telegram org model (Gormes gateway channels are different abstraction)
  • Picoclaw’s ultra-lightweight constraints (different target: $10 hardware vs server deployment)
  • Full Kubernetes ACP controllers (from references) - use local state machine instead

Reference Implementation Priority

From references/go-agent-os/:

Priority | Pattern | Source | Target
1 | Retry-After parsing + backoff | plandex/model_error.go | internal/hermes/retry.go
2 | Tool result truncation | nanobot/pkg/agents/truncate.go | internal/tools/truncate.go
3 | Image token estimation | nanobot/pkg/agents/tokencount.go | internal/hermes/image_routing.go
4 | Token budget tracker | axe/internal/budget/budget.go | internal/hermes/budget.go
5 | Deterministic write queue | engram/internal/mcp/write_queue.go | internal/goncho/writequeue.go
6 | SQLite/FTS5 memory schema | engram/internal/store/store.go | GONCHO enhancement
7 | State machine transitions | agentcontrolplane/task/state_machine.go | Turn lifecycle
8 | OAuth PKCE | goclaw/internal/oauth/openai.go | internal/oauth/
9 | Tool declaration schema | trpc-agent-go/tool/tool.go | internal/tools/tool.go
10 | Await-user-reply route | trpc-agent-go/agent/await_user_reply.go | internal/gateway/routing.go
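
As an illustration of the first pattern in the table, the Go sketch below parses a Retry-After header value and falls back to capped exponential backoff when the server gives no hint. It is not the plandex code: `RetryAfter`, `Backoff`, and `DefaultDelay` are hypothetical names, and the 1-second base and 60-second cap are assumed defaults. The header may carry either a number of seconds or an HTTP-date, and both forms are handled.

```go
package main

import (
	"fmt"
	"net/http"
	"strconv"
	"strings"
	"time"
)

// RetryAfter parses a Retry-After header value, which is either a
// non-negative number of seconds or an HTTP-date. ok is false when the
// value is empty or cannot be parsed.
func RetryAfter(value string, now time.Time) (delay time.Duration, ok bool) {
	value = strings.TrimSpace(value)
	if value == "" {
		return 0, false
	}
	if secs, err := strconv.Atoi(value); err == nil {
		if secs < 0 {
			return 0, false
		}
		return time.Duration(secs) * time.Second, true
	}
	if t, err := http.ParseTime(value); err == nil {
		if d := t.Sub(now); d > 0 {
			return d, true
		}
		return 0, true // the date is already in the past: retry immediately
	}
	return 0, false
}

// Backoff returns the wait before retry number attempt (0-based). A valid
// server hint takes priority; otherwise base doubles per attempt. Both are
// capped at max.
func Backoff(attempt int, hint string, base, max time.Duration, now time.Time) time.Duration {
	if d, ok := RetryAfter(hint, now); ok {
		if d > max {
			return max
		}
		return d
	}
	d := base
	for i := 0; i < attempt && d < max; i++ {
		d *= 2
	}
	if d > max {
		d = max
	}
	return d
}

// DefaultDelay applies assumed defaults: 1s base, 60s cap, current time.
func DefaultDelay(attempt int, hint string) time.Duration {
	return Backoff(attempt, hint, time.Second, 60*time.Second, time.Now())
}

func main() {
	fmt.Println(DefaultDelay(0, "5"))
	fmt.Println(DefaultDelay(3, ""))
	fmt.Println(DefaultDelay(10, ""))
}
```

Capping the server hint is a design choice made for this sketch; a caller that prefers to honor long hints could surface the error to the user instead of waiting.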

Success Metrics

  • 30 days: Permission hardening shipped, provider parity >80%, browser harness doctor working
  • 90 days: Loop detection, structured memory (6 types), skill metadata, native prompt builder
  • 6 months: Context compression reconciled, GBrain patterns (auto-links, brain-first), token budget
  • 12 months: Cathedral II, Minions queue, Soul system, Web dashboard, multi-memory backends

Current parity: ~20-30% → Target: 80%+ within 12 months