
Genesis Agent – A self-modifying AI agent that runs locally (Electron, Ollama)

Hacker News AI Top · by Garrus800-stack · April 2, 2026 · 19 min read

Article URL: https://github.com/Garrus800-stack/genesis-agent
Comments URL: https://news.ycombinator.com/item?id=47621307
Points: 1
Comments: 0

What is Genesis?

Genesis is not a framework for building agents. Genesis is the agent.

It's a self-contained Electron application that reads its own source code, modifies itself, verifies its own output programmatically, and learns from every interaction. It runs locally on your machine — no cloud platform required, no YAML configs, no SDK. Install, start, talk.

Think of it as an AI developer that lives on your desktop: it has a codebase (its own), a memory (episodic + semantic), emotions (that influence its decisions), and goals (that survive restarts). When idle, it dreams — consolidating memories into reusable schemas. When frustrated, it escalates to larger models. When curious, it explores its own architecture.

How it works

    You: "Build a REST API module for Genesis with tests"

    Genesis: [PLAN] 6 steps via FormalPlanner (preconditions checked against WorldState)
             [EXPECT] P(success)=0.78 based on 47 prior code-gen outcomes
             [SIMULATE] Best path: direct (risk=0.22), vs. incremental (risk=0.31)
             [STEP 1/6] Analyzing existing code... ✓ verified programmatically
             [STEP 2/6] Generating module (87 lines)... ✓ AST parse valid, imports resolve
             [STEP 3/6] Writing tests... ✓ file created, syntax verified
             [STEP 4/6] Running npm test... ✓ exit 0, 12/12 passing
             [STEP 5/6] Git snapshot... ✓ committed
             [STEP 6/6] Wiring into AgentCore...
             [VERIFY] Goal complete. VerificationEngine: 5/5 PASS, 0 AMBIGUOUS.
             [SURPRISE] Low (0.12) — outcome matched expectation
             [SIGN] Module signed: HMAC-SHA256 (abc12def...)
             [LEARN] MetaLearning recorded: json-schema prompt @ temp 0.3 → success
             "Done. Module + 12 tests, all passing."


Every step is verified by the machine, not the LLM. AST parsing, exit codes, file validation, import resolution — the LLM proposes, deterministic checks verify. Only ambiguous quality judgments fall back to LLM evaluation.

What makes it different

| Capability | What Genesis does | What typical AI tools do |
|---|---|---|
| Self-modification | Reads its own AST, plans changes, tests in sandbox, snapshots with git, applies only if tests pass | Run user-provided code |
| Verification | 66 programmatic checks (AST, exit codes, imports, signatures); LLM is last resort | Trust the LLM output |
| Memory | 5-layer system (episodic, semantic, vector, conversation, knowledge graph) with intelligent forgetting | Chat history window |
| Planning | FormalPlanner with preconditions, mental simulation, probabilistic branching, failure taxonomy | Sequential function calling |
| Learning | Tracks success rates by model/prompt/temperature, auto-optimizes, A/B tests its own prompts | Static prompts |
| Autonomy | Pursues multi-step goals, survives restarts, graduates its own trust level (0–3) | Single-turn responses |
| Cognition | Expectations, surprise, dreams, working memory, autobiographical identity, emotional steering | None |
| MCP Server | Exposes 7 tools (verify, analyze, safety scan, architecture query); external IDEs invoke Genesis directly | MCP client only |
| Observability | 13-panel live dashboard (consciousness, energy, architecture graph, tool synthesis, event flow) | Log files |

Capabilities at a glance

Autonomous execution — FormalPlanner with typed action steps, precondition checking against live WorldState, mental simulation with probabilistic branching, goal persistence across restarts, failure taxonomy with 4 recovery strategies, cooperative cancellation, working memory per goal.

Self-modification — reads its own source via SelfModel, plans changes via SelfModificationPipeline, tests in dual-mode sandbox (VM + process), snapshots with git, HMAC-SHA256 module signing, hot-reloads without restart.

Verification — 66-test VerificationEngine covering AST syntax, import resolution, dangerous patterns, test exit codes, file integrity, module signatures. The LLM proposes — the machine verifies.

Memory & learning — 5-layer memory (conversation, episodic, vector, unified, knowledge graph), adaptive forgetting (surprise amplifies retention 5×), DreamCycle consolidation during idle time, MetaLearning prompt optimization, PromptEvolution A/B testing, OnlineLearner real-time feedback (streak detection, model escalation, temperature tuning), LessonsStore cross-project persistent learning.

Cognition & consciousness — ExpectationEngine (quantitative predictions), SurpriseAccumulator (information-theoretic), PhenomenalField (unified awareness every 2s), TemporalSelf (past/future continuity), IntrospectionEngine (3-level meta-awareness), CognitiveWorkspace (9-slot transient working memory), ArchitectureReflection (live queryable self-model of own architecture), DynamicToolSynthesis (generates new tools on demand via LLM + sandbox).

Organism — 5 emotional dimensions, homeostasis (6 vitals), 4 needs (social, mastery, novelty, rest), metabolism (500 AU energy pool), heritable genome (6 evolvable traits), epigenetic conditioning, immune system (anomaly detection), body schema (capability tracking), embodied perception (UI engagement tracking).

Infrastructure — 13-phase DI boot, EventBus (308 events), MCP bidirectional (client + server — Genesis exposes 7 tools to external IDEs/agents via JSON-RPC 2.0), CircuitBreaker per connection, CorrelationContext tracing, PeerNetwork (AES-256-GCM), 12-layer defense-in-depth security, PreservationInvariants (11 hash-locked safety rules).

For the full feature list with version history, see CAPABILITIES.md.

See it in action

    git clone https://github.com/Garrus800-stack/genesis-agent.git
    cd genesis-agent && npm install
    node demo.js


This boots Genesis headless, shows system health, architecture reflection, MCP server capabilities, and code verification — all without Ollama or API keys.

Quick start

New to Genesis? Read the Quick Start Guide — it walks you through your first conversation, your first goal, and self-modification in under 5 minutes.

Option A — Cloud API (recommended for best results):

    git clone https://github.com/Garrus800-stack/genesis-agent.git
    cd genesis-agent
    npm install
    npm start


Then open Settings → paste your Anthropic API key or OpenAI API key. Genesis auto-detects and selects the best available model.

Option B — Local with Ollama (fully offline, private):

    ollama pull qwen2.5:7b   # or gemma2:9b, deepseek-r1:8b, llama3.1:8b, etc.
    ollama serve


    git clone https://github.com/Garrus800-stack/genesis-agent.git
    cd genesis-agent
    npm install
    npm start

Option C — Hybrid (best of both):

Run Ollama locally and also configure a cloud API key. Genesis uses the cloud backend for complex reasoning tasks and fails over to the local model automatically when the cloud is unavailable.

Boot profiles

Genesis supports three boot profiles for different complexity levels:

    npm start                  # Full — all 116 services, all cognitive systems
    npm start -- --cognitive   # Cognitive — skip consciousness (~90 services)
    npm start -- --minimal     # Minimal — core agent loop only (~50 services)


Use --minimal to learn the architecture without cognitive overhead. Use --cognitive for development. Use --full (default) for production.

Requires Node.js 20+ (tested on 20, 22) and Git. Ollama is optional if a cloud API is configured. On Windows, double-click Genesis-Start.bat instead.

Headless / CLI Mode (v5.9.0)

Run Genesis without Electron — as a terminal chat, MCP server daemon, or in CI pipelines:

    node cli.js                       # Interactive REPL chat
    node cli.js --serve               # MCP server daemon (no UI, runs until Ctrl+C)
    node cli.js --serve --port 4000   # Custom port
    node cli.js --minimal             # Minimal boot (~50 services)


Or via npm:

    npm run cli         # REPL chat
    npm run cli:serve   # MCP daemon


REPL commands: /health, /goals, /status, /quit. Environment: GENESIS_API_KEY, GENESIS_OPENAI_KEY.

For IDE integration (VSCode, Cursor, Claude Desktop), see MCP-SERVER-SETUP.md.

Supported backends

| Backend | Models | Config |
|---|---|---|
| Anthropic | Claude Opus 4, Sonnet 4, Haiku 4.5 | Settings → models.anthropicApiKey |
| OpenAI-compatible | GPT-4o, GPT-4, o1, or any compatible API | Settings → models.openaiApiKey + models.openaiBaseUrl |
| Ollama (local) | Any model Ollama supports (gemma2, qwen2.5, deepseek, llama, mistral, ...) | Auto-detected on 127.0.0.1:11434 |

Genesis automatically selects the best model: user-preferred → cloud → local. Override via Settings → models.preferred.
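The selection order above (user-preferred → cloud → local) can be sketched as a simple fallback chain. This is a minimal illustration, not Genesis's actual ModelRouter: `pickModel`, the `kind` and `available` fields, and the model name `claude-sonnet-4` are placeholders assumed for the example.

```javascript
// Hypothetical sketch of the "user-preferred → cloud → local" order.
// Field names (name, kind, available) are illustrative, not Genesis's API.
function pickModel(backends, preferred) {
  const usable = backends.filter((b) => b.available);
  // 1. An explicitly preferred model wins if it is reachable.
  const wanted = usable.find((b) => b.name === preferred);
  if (wanted) return wanted.name;
  // 2. Otherwise take any cloud backend, 3. then any local one.
  const cloud = usable.find((b) => b.kind === "cloud");
  if (cloud) return cloud.name;
  const local = usable.find((b) => b.kind === "local");
  return local ? local.name : null;
}

const backends = [
  { name: "qwen2.5:7b", kind: "local", available: true },
  { name: "claude-sonnet-4", kind: "cloud", available: true }, // placeholder name
];
console.log(pickModel(backends, null));         // claude-sonnet-4
console.log(pickModel(backends, "qwen2.5:7b")); // qwen2.5:7b
```

Marking the cloud entry as unavailable makes the same call fall through to the local model, which mirrors the hybrid failover described in the quick start.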

Architecture

Thirteen layers with clear boundaries — star topology where every layer depends only on core/ and ports/, never on each other. The kernel is immutable. Critical safety files are hash-locked. Everything else is fair game for self-modification. v5.9.3: zero cross-layer violations, zero orphans, zero phantom late-bindings. TypeScript CI enforced. Self-Preservation Invariants prevent safety regression during self-modification.

| Layer | Phase | Components |
|---|---|---|
| 🖥️ UI Layer | | Chat + Monaco Editor + Dashboard(13) |
| 🌌 Consciousness | P13 | PhenomenalField · TemporalSelf · IntrospectionEngine · AttentionalGate · ConsciousnessExtension (6 subsystems) |
| 🔮 Hybrid | P12 | GraphReasoner · AdaptiveMemory |
| 🌐 Extended | P11 | TrustLevels · Effectors · WebPercept · SelfSpawner · GitHubEffector |
| 🏛️ Agency | P10 | GoalPersistence · FailureTaxonomy · DynamicContextBudget · LocalClassifier · EmotionalSteering · FitnessEvaluator |
| 🧠 Cognitive | P9 | Expectations · Simulation · Surprise · DreamCycle · SelfNarrative · CognitiveWorkspace · OnlineLearner · LessonsStore · PromptEvolution · ReasoningTracer · ArchReflection(P3) · DynamicToolSynthesis (SA-P8) · ProjectIntelligence |
| ⚡ Revolution | P8 | FormalPlanner · AgentLoop + Cancel · ModelRouter · VectorMemory |
| 🧬 Organism | P7 | Emotions (5D) · Homeostasis (6 vitals) · Genome · Epigenetic · Fitness · NeedsSystem · Metabolism · BodySchema · EmbodiedPerception (SA-P4) |
| 🛡️ Autonomy | P6 | IdleMind · Daemon · HealthMonitor · HealthServer · CognitiveMonitor |
| 🔗 Hexagonal | P5 | ChatOrchestrator · SelfModPipeline · EpisodicMemory · PeerNetwork |
| 📋 Planning | P4 | GoalStack · MetaLearning · SchemaStore |
| 🔧 Capabilities | P3 | ShellAgent · MCP (Client + Server) · McpServerToolBridge · PluginRegistry |
| 🧩 Intelligence | P2 | VerificationEngine · CodeSafetyScanner · IntentRouter · ContextManager · CircuitBreaker · PromptBuilder |
| 📦 Foundation | P1 | ModelBridge · Sandbox · WorldState · KnowledgeGraph · ModuleSigner · CorrelationContext · BootTelemetry |
| 🔗 Ports | | LLM · Memory · KG · Sandbox · CodeSafety · Workspace |
| 🔒 KERNEL (immutable) | | SafeGuard · IPC Contract · Hashes |
| 🔐 Hash-Locked | | Scanner · Verifier · Constants |
| 🛡️ Invariants | | PreservationInvariants (11 rules) |


Kernel (immutable): main.js, preload.js, src/kernel/. SHA-256 hashed at boot, verified periodically.

Critical Safety Files (hash-locked): CodeSafetyScanner, VerificationEngine, Constants, EventBus, Container — locked via SafeGuard.lockCritical(). The agent cannot weaken the modules that enforce its own safety.

Agent Core: Self-modifiable modules — read, analyze, modify, hot-reload — but only after sandbox testing, safety scanning, and git snapshots.

Cognitive Layer: Expectation formation, mental simulation, surprise-driven learning, memory consolidation, autobiographical identity, prompt evolution, online learning, architecture self-reflection, and dynamic tool synthesis.
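The kernel note above says files are SHA-256 hashed at boot and verified periodically. A minimal sketch of that idea follows, using Node's built-in `crypto` module. The function names and the in-memory `files` object are assumptions made for illustration; the real SafeGuard presumably reads from disk and is not shown in this document.

```javascript
const { createHash } = require("node:crypto");

// Hash a file's contents (passed in as a string or Buffer).
function sha256(content) {
  return createHash("sha256").update(content).digest("hex");
}

// Record a baseline at "boot": path -> hex digest.
function snapshot(files) {
  const baseline = new Map();
  for (const [path, content] of Object.entries(files)) {
    baseline.set(path, sha256(content));
  }
  return baseline;
}

// Return the paths whose current content no longer matches the baseline.
function findTampered(baseline, files) {
  return [...baseline.keys()].filter(
    (path) => !(path in files) || sha256(files[path]) !== baseline.get(path)
  );
}

// Hypothetical file contents standing in for the kernel files.
const atBoot = { "main.js": "console.log('boot');", "preload.js": "// preload" };
const baseline = snapshot(atBoot);
const later = { ...atBoot, "preload.js": "// preload (edited)" };
console.log(findTampered(baseline, atBoot)); // []
console.log(findTampered(baseline, later));  // [ 'preload.js' ]
```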

The Cognitive Loop

Every autonomous step follows a five-phase cycle:

    Perceive (WorldState) → Plan (FormalPlanner) → Act (AgentLoop)
      → Verify (VerificationEngine) → Learn (MetaLearning + EpisodicMemory)


The VerificationEngine returns PASS, FAIL, or AMBIGUOUS. Only AMBIGUOUS falls back to LLM judgment. Everything else is deterministic.

| What's checked | How | LLM? |
|---|---|---|
| Code syntax | AST parse (acorn) | No |
| Imports | Filesystem check | No |
| Dangerous patterns | AST walk + regex | No |
| Test results | Exit code + assertion count | No |
| Shell commands | Exit code + timeout + patterns | No |
| File writes | Existence + non-empty + encoding | No |
| Plan preconditions | WorldState API | No |
| Module integrity | HMAC-SHA256 signature | No |
| Subjective quality | - | AMBIGUOUS only |
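To make the PASS / FAIL / AMBIGUOUS split concrete, here is a small sketch of two deterministic checks and a combiner. It is an illustration under stated assumptions, not the VerificationEngine itself: syntax is checked with Node's built-in `vm.Script` rather than acorn (so ES module `import`/`export` syntax would be rejected), and the function names and the rule "no deterministic check available means AMBIGUOUS" are hypothetical.

```javascript
const vm = require("node:vm");

// Syntax check: compiling with vm.Script parses the source without running it.
// (Stand-in for the acorn parse named in the table; script syntax only.)
function checkSyntax(source) {
  try {
    new vm.Script(source);
    return "PASS";
  } catch (err) {
    if (err.name === "SyntaxError") return "FAIL";
    throw err;
  }
}

// Test-result check: exit code plus a count of passing assertions.
function checkTestRun({ exitCode, passed, total }) {
  return exitCode === 0 && total > 0 && passed === total ? "PASS" : "FAIL";
}

// Combine deterministic verdicts. Only a step with no deterministic
// check at all is reported as AMBIGUOUS (and would go to the LLM).
function verify(verdicts) {
  if (verdicts.length === 0) return "AMBIGUOUS";
  return verdicts.every((v) => v === "PASS") ? "PASS" : "FAIL";
}

console.log(verify([
  checkSyntax("const x = 1;"),
  checkTestRun({ exitCode: 0, passed: 12, total: 12 }),
])); // PASS
console.log(verify([checkSyntax("const x = ;")])); // FAIL
console.log(verify([]));                           // AMBIGUOUS
```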

Phase 9: The Cognitive Meta-Loop


    Expect → Simulate → Act → Surprise → Learn → Dream → Schema → better Expect
      ↕ ↕
      ArchReflection ToolSynthesis
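The Surprise step in this loop is described in the component notes as information-theoretic surprise, −log₂P. The sketch below computes that quantity in bits for an observed outcome. The helper names are assumptions, and the values are raw bits only: the 0.12 shown in the earlier transcript suggests Genesis applies further scaling that this document does not specify.

```javascript
// Surprise in bits for an outcome that was assigned probability p.
function surpriseBits(p) {
  if (!(p > 0 && p <= 1)) throw new RangeError("p must be in (0, 1]");
  return -Math.log2(p);
}

// Score an observed outcome against a predicted P(success).
function outcomeSurprise(pSuccess, succeeded) {
  return surpriseBits(succeeded ? pSuccess : 1 - pSuccess);
}

console.log(surpriseBits(0.5));                       // 1
console.log(surpriseBits(0.25));                      // 2
console.log(outcomeSurprise(0.78, true).toFixed(2));  // 0.36
console.log(outcomeSurprise(0.78, false).toFixed(2)); // 2.18
```

An expected success carries little information, while an unexpected failure carries several times more, which is the asymmetry a surprise-weighted learner relies on.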

ExpectationEngine — Quantitative predictions using MetaLearning statistics and SchemaStore patterns. No LLM calls.

MentalSimulator — Plan sequences in-memory against cloned WorldState with probabilistic branching and risk scoring.

SurpriseAccumulator — Prediction error via information-theoretic surprise (−log₂P). High surprise amplifies learning up to 4×.

DreamCycle — Idle-time memory consolidation. Phases 1-4 are heuristics; Phase 5 uses one batched LLM call. Schema extraction, value crystallization, and DreamEngine corroboration via phase delegates.

SelfNarrative — Autobiographical identity summary injected into every LLM call (~200 tokens of metacognitive context).

PromptEvolution — A/B testing framework for prompt sections. Runs controlled experiments (25+ trials per arm), auto-promotes variants with statistically significant improvement (≥5%). Identity and safety sections are immutable by design.

ArchitectureReflection (SA-P3) — Live queryable graph of Genesis's own architecture. Indexes services, events, layers, and couplings from Container + EventBus + source scan. Natural language queries: "what depends on EventBus?", "chain from AgentLoop to CognitiveWorkspace", "show couplings". Compressed view injected into LLM prompt context.

DynamicToolSynthesis (SA-P8) — When Genesis needs a tool that doesn't exist, it writes one. LLM generates tool code → CodeSafetyScanner validates → Sandbox tests → ToolRegistry registers. Auto-triggered on "tool not found" errors. Persists across restarts. Max 20 tools, LRU eviction.
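The "max 20 tools, LRU eviction" policy can be sketched with a plain `Map`, which preserves insertion order. The `ToolCache` class and the sample tools are hypothetical; how Genesis's ToolRegistry actually tracks recency is not described here.

```javascript
// Hypothetical registry: a Map keeps insertion order, so the first key
// is always the least recently used once we re-insert on every access.
class ToolCache {
  constructor(maxTools = 20) {
    this.maxTools = maxTools;
    this.tools = new Map();
  }

  get(name) {
    if (!this.tools.has(name)) return undefined;
    const tool = this.tools.get(name);
    this.tools.delete(name); // move to the "most recent" end
    this.tools.set(name, tool);
    return tool;
  }

  register(name, tool) {
    this.tools.delete(name);
    this.tools.set(name, tool);
    if (this.tools.size > this.maxTools) {
      const oldest = this.tools.keys().next().value;
      this.tools.delete(oldest);
    }
  }
}

const cache = new ToolCache(2); // small limit to show eviction
cache.register("slugify", (s) => s.toLowerCase().replace(/\s+/g, "-"));
cache.register("wordCount", (s) => s.split(/\s+/).length);
cache.get("slugify"); // touch: wordCount is now the oldest entry
cache.register("reverse", (s) => [...s].reverse().join(""));
console.log([...cache.tools.keys()]); // [ 'slugify', 'reverse' ]
```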

Phase 10: Persistent Agency

Goals that survive reboots. Errors that get classified. Context that adapts. Emotions that steer.


    Goal created    → Checkpoint → [Restart] → Resume → Continue
    Error caught    → FailureTaxonomy.classify() → Strategy (retry | replan | escalate | update-world)
    Intent detected → DynamicContextBudget.allocate() → Optimized token distribution
    Emotion shifts  → EmotionalSteering.getSignals() → ModelRouter / FormalPlanner / IdleMind adjusts
    LLM fallback    → LocalClassifier.addSample() → Next time: no LLM needed (2-3s saved)
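The "Error caught" flow can be sketched as a classifier plus a category-to-strategy table. The four categories and their strategies come from the FailureTaxonomy description in this section. The classification rules, the error codes chosen, and the backoff parameters are hypothetical placeholders.

```javascript
// Category → strategy mapping taken from the text; the classification
// rules below are hypothetical placeholders.
const STRATEGY = {
  transient: "retry",
  deterministic: "replan",
  environmental: "update-world",
  capability: "escalate",
};

function classify(error) {
  if (["ETIMEDOUT", "ECONNRESET"].includes(error.code)) return "transient";
  if (["ENOENT", "EACCES"].includes(error.code)) return "environmental";
  if (error.name === "SyntaxError") return "deterministic";
  return "capability"; // arbitrary default for this sketch
}

// Exponential backoff for retries: base * 2^attempt, capped.
function backoffMs(attempt, baseMs = 500, capMs = 30000) {
  return Math.min(capMs, baseMs * 2 ** attempt);
}

function recover(error, attempt = 0) {
  const category = classify(error);
  const strategy = STRATEGY[category];
  return strategy === "retry"
    ? { category, strategy, delayMs: backoffMs(attempt) }
    : { category, strategy };
}

console.log(recover({ code: "ETIMEDOUT" }, 3)); // retry, delayMs 4000
console.log(recover(new SyntaxError("bad")));   // replan
```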

GoalPersistence — Active goals serialized to disk after every step. Crash recovery via step-level checkpoints. Resume prompt on boot.

FailureTaxonomy — Four error categories with distinct recovery strategies. Transient errors get exponential backoff. Deterministic errors trigger immediate replan. Environmental errors update WorldState. Capability errors escalate to a larger model.

DynamicContextBudget — Intent-based token allocation replaces fixed budgets. Code-gen gets 55% code context. Chat gets 40% conversation. Learns from MetaLearning success rates.

EmotionalSteering — Emotions become functional control signals. Frustration > 0.65 → ModelRouter tries larger model. Energy < 0.30 → FormalPlanner caps plans at 3 steps. Curiosity > 0.75 → IdleMind prioritizes exploration.

LocalClassifier — TF-IDF + cosine similarity classifier trained from IntentRouter's own LLM fallback log. After ~30 samples, handles 60-80% of classifications without LLM calls.
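A TF-IDF plus cosine-similarity classifier of the kind named for LocalClassifier fits in a few dozen lines. The sketch below is a nearest-sample variant written for illustration. The class name, the tokenizer, the smoothed IDF formula, and the 0.3 confidence threshold are all assumptions, not values from Genesis.

```javascript
const tokenize = (text) => text.toLowerCase().match(/[a-z0-9]+/g) || [];

// Cosine similarity between two sparse term -> weight maps.
function cosine(a, b) {
  let dot = 0, na = 0, nb = 0;
  for (const [t, w] of a) { na += w * w; dot += w * (b.get(t) || 0); }
  for (const w of b.values()) nb += w * w;
  return na && nb ? dot / Math.sqrt(na * nb) : 0;
}

class TinyIntentClassifier {
  constructor() {
    this.samples = []; // { tokens, intent }
  }
  addSample(text, intent) {
    this.samples.push({ tokens: tokenize(text), intent });
  }
  // Smoothed inverse document frequency over the stored samples.
  idf(term) {
    const df = this.samples.filter((s) => s.tokens.includes(term)).length;
    return Math.log((1 + this.samples.length) / (1 + df)) + 1;
  }
  vector(tokens) {
    const vec = new Map();
    for (const t of tokens) vec.set(t, (vec.get(t) || 0) + 1);
    for (const [t, tf] of vec) vec.set(t, tf * this.idf(t));
    return vec;
  }
  classify(text, minScore = 0.3) {
    const query = this.vector(tokenize(text));
    let best = { intent: null, score: 0 };
    for (const s of this.samples) {
      const score = cosine(query, this.vector(s.tokens));
      if (score > best.score) best = { intent: s.intent, score };
    }
    // Below the threshold the caller would fall back to the LLM.
    return best.score >= minScore ? best.intent : null;
  }
}

const clf = new TinyIntentClassifier();
clf.addSample("write a function that parses json", "code-gen");
clf.addSample("generate a module with tests", "code-gen");
clf.addSample("how are you feeling today", "chat");
clf.addSample("what is your current mood", "chat");
console.log(clf.classify("please write a json parser function")); // code-gen
console.log(clf.classify("how is your mood"));                    // chat
console.log(clf.classify("deploy kubernetes cluster"));           // null
```

A `null` result is the case where the request would still go to the LLM, and the LLM's answer could then be added as a new sample.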

Phase 11: Extended Perception & Action

Genesis sees beyond the filesystem. Acts beyond its project directory. Graduates its own autonomy.

TrustLevelSystem — Four trust levels: Supervised (everything needs approval), Assisted (safe actions auto-execute), Autonomous (only high-risk needs approval), Full Autonomy (only safety invariants block). MetaLearning data can suggest auto-upgrades for specific action types with >90% success over 50+ attempts.

EffectorRegistry — Typed, verifiable, approval-gated actions for the outside world. Built-in: clipboard, OS notifications, browser open, external file write. Plugin: GitHubEffector (issues, PRs, comments). Each effector has risk level, preconditions, optional verification and rollback.

WebPerception — Lightweight HTTP fetch with HTML parsing (cheerio optional). Headless browser mode via Puppeteer (optional). Results cached, fed into WorldState.external for prompt context.

SelfSpawner — Fork-based worker processes for parallel sub-tasks. Each worker gets minimal context (ModelBridge config, focused goal) and runs independently with timeout and memory limits. Up to 3 concurrent workers.
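The four trust levels and the auto-upgrade rule from the TrustLevelSystem entry (>90% success over 50+ attempts) can be sketched as two small functions. The three-way risk scale, the function names, and the sample statistics are hypothetical, chosen only to illustrate the gating logic.

```javascript
const TRUST_LEVELS = ["Supervised", "Assisted", "Autonomous", "Full Autonomy"];

// risk: "safe" | "medium" | "high" (hypothetical three-way scale)
function needsApproval(level, risk) {
  if (level === 0) return true;            // everything needs approval
  if (level === 1) return risk !== "safe"; // safe actions auto-execute
  if (level === 2) return risk === "high"; // only high-risk needs approval
  return false; // level 3: only safety invariants block, handled elsewhere
}

// Suggest auto-approval for action types with >90% success over 50+ attempts.
function suggestUpgrades(stats, minAttempts = 50, minRate = 0.9) {
  return Object.entries(stats)
    .filter(([, s]) => s.attempts >= minAttempts && s.successes / s.attempts > minRate)
    .map(([actionType]) => actionType);
}

// Made-up statistics for illustration.
const stats = {
  "file-write": { attempts: 120, successes: 117 }, // 97.5%
  "shell-exec": { attempts: 80, successes: 70 },   // 87.5%
  "git-commit": { attempts: 12, successes: 12 },   // too few attempts
};
console.log(TRUST_LEVELS[1], needsApproval(1, "medium")); // Assisted true
console.log(suggestUpgrades(stats));                      // [ 'file-write' ]
```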

Phase 12: Symbolic + Neural Hybrid

Not everything needs an LLM. Graph reasoning and intelligent forgetting.

GraphReasoner — Deterministic queries over the KnowledgeGraph without LLM calls. Transitive dependency chains, impact analysis ("if I change EventBus, what breaks?"), cycle detection, shortest path between concepts, contradiction detection. Integrated into ReasoningEngine — structural questions are answered in milliseconds instead of seconds.

AdaptiveMemory — Differentiated forgetting based on emotional valence, surprise magnitude, and access frequency. High-surprise memories decay 5× slower. Routine conversations decay quickly. Frequently accessed memories are reinforced. Consolidation runs during DreamCycle.
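Two of the deterministic queries listed for GraphReasoner, impact analysis and cycle detection, are standard graph traversals that need no LLM. The sketch below runs them over a tiny dependency map. The edges are invented for illustration and do not reflect Genesis's real couplings, and the function names are assumptions.

```javascript
// Edges point from a module to the modules it depends on (made-up data).
const dependsOn = {
  AgentLoop: ["FormalPlanner", "EventBus"],
  FormalPlanner: ["WorldState"],
  WorldState: ["EventBus"],
  EventBus: [],
};

// Impact analysis: everything that transitively depends on `target`.
function impactOf(target, graph) {
  const impacted = new Set();
  const queue = [target];
  while (queue.length > 0) {
    const current = queue.shift();
    for (const [node, deps] of Object.entries(graph)) {
      if (deps.includes(current) && !impacted.has(node)) {
        impacted.add(node);
        queue.push(node);
      }
    }
  }
  return [...impacted].sort();
}

// Cycle detection via depth-first search with a "visiting" set.
function hasCycle(graph) {
  const visiting = new Set();
  const done = new Set();
  const visit = (node) => {
    if (done.has(node)) return false;
    if (visiting.has(node)) return true;
    visiting.add(node);
    const cyclic = (graph[node] || []).some((dep) => visit(dep));
    visiting.delete(node);
    done.add(node);
    return cyclic;
  };
  return Object.keys(graph).some((node) => visit(node));
}

console.log(impactOf("EventBus", dependsOn)); // [ 'AgentLoop', 'FormalPlanner', 'WorldState' ]
console.log(hasCycle(dependsOn));             // false
console.log(hasCycle({ ...dependsOn, EventBus: ["AgentLoop"] })); // true
```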

MCP Bidirectional (v5.8.0)

Genesis is both an MCP client and an MCP server. External tools (VSCode, Cursor, other agents) can invoke Genesis capabilities directly via JSON-RPC 2.0.

Exposed tools (via McpServerToolBridge):

| Tool | What it does |
|---|---|
| genesis.verify-code | Full code verification (syntax, imports, lint patterns) |
| genesis.verify-syntax | Quick AST parse check |
| genesis.code-safety-scan | Safety violation detection (eval, fs writes, process spawn) |
| genesis.project-profile | Tech stack, conventions, quality indicators |
| genesis.project-suggestions | Improvement suggestions from structural analysis |
| genesis.architecture-query | Natural language queries about Genesis's own architecture |
| genesis.architecture-snapshot | Full service/event/layer/phase snapshot |

Protocol: MCP 2025-03-26 with tools/list, tools/call, resources/list, notifications/tools/list_changed, ping, CORS, /health endpoint.
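For orientation, this is roughly what a `tools/call` request to one of the exposed tools looks like on the wire. The envelope (`jsonrpc`, `id`, `method`, `params.name`, `params.arguments`) follows JSON-RPC 2.0 and the MCP `tools/call` method. The `code` argument name is an assumption: the tool's real input schema would come from a `tools/list` response, and the transport and port are not shown here.

```javascript
// Build a JSON-RPC 2.0 request for MCP's tools/call method.
// The `code` argument name is an assumption; check tools/list for the
// actual input schema before relying on it.
let nextId = 1;
function toolsCall(name, args) {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const request = toolsCall("genesis.verify-syntax", { code: "const x = 1;" });
console.log(JSON.stringify(request, null, 2));
```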

Dashboard (v5.9.0 — 13 live panels)

The Dashboard visualizes Genesis's internal state in real-time (2s polling):

| Panel | What it shows |
|---|---|
| Organism | Mood ring, 5D emotion bars, sparkline, needs radar |
| Consciousness | Awareness meter, valence/arousal, attention focus, temporal chapter, value alignment |
| Energy | Metabolism gauge, LLM call cost, energy level |
| Agent Loop | Current goal, step progress, approval queue |
| Vitals | Homeostasis vital signs with status indicators |
| Cognitive | VerificationEngine stats, WorldState, MetaLearning |
| Reasoning | Causal decision traces with correlation IDs |
| Architecture | Service/event/layer counts, phase map pills |
| Project | Tech stack grid, conventions, quality |
| Tool Synthesis | Generated/active/failed tools, active tool list |
| Memory | Vector memory stats, session history |
| Event Flow | Recent event chains, listener hotspots |
| System | Services, intervals, circuit breaker, uptime |

Security model


| Layer | Mechanism |
|---|---|
| KERNEL (immutable) | SHA-256 hashes, periodic integrity checks |
| CRITICAL FILE HASH-LOCK | Scanner, Verifier, Constants, Bus |
| PRESERVATION INVARIANTS | 11 semantic rules, hash-locked, fail-closed, integrated into SelfModificationPipeline |
| TRUST LEVEL SYSTEM | Graduated autonomy (0-3) |
| MODULE SIGNING | HMAC-SHA256 for self-modified files |
| AST CODE SAFETY SCANNER | acorn AST walk + regex fallback. CodeSafetyPort: hexagonal adapter (DI-injected) |
| VERIFICATION ENGINE | Programmatic truth, 66 tests |
| SANDBOX | Dual-mode isolation (Process + VM). WORKER THREAD: MCP code exec in resource-limited worker_thread (64MB heap, hard-kill on timeout) |
| CIRCUIT BREAKER | Per-MCP-connection failure isolation. CLOSED → OPEN → HALF_OPEN → CLOSED state machine |
| IMMUNE SYSTEM | Self-modification anomaly detection |
| EFFECTOR REGISTRY | Risk-gated external actions |
| CORRELATION CONTEXT | Causal tracing via AsyncLocalStorage |
| UI ERROR BOUNDARY | Global error/rejection handler |
| IPC RATE LIMITER | Per-channel token bucket (kernel-space) |
| SHELL AGENT | 4 tiers, blocklist, rate limiter, no injection |
| PEER NETWORK | AES-256-GCM, PBKDF2 600K iterations |
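The module-signing layer relies on HMAC-SHA256. A minimal sketch of signing and verifying a module's source with Node's built-in `crypto` module is shown below. The function names and the randomly generated key are assumptions: how Genesis's ModuleSigner derives, stores, and rotates its key is not covered in this document.

```javascript
const { createHmac, randomBytes, timingSafeEqual } = require("node:crypto");

// Sign module source with a secret key; returns a hex HMAC-SHA256 tag.
function signModule(source, key) {
  return createHmac("sha256", key).update(source).digest("hex");
}

// Verify with a constant-time comparison so timing does not leak
// how many leading bytes of the tag matched.
function verifyModule(source, key, signature) {
  const expected = Buffer.from(signModule(source, key), "hex");
  const actual = Buffer.from(signature, "hex");
  return expected.length === actual.length && timingSafeEqual(expected, actual);
}

const key = randomBytes(32); // hypothetical per-install secret
const source = "module.exports = () => 'hello';";
const signature = signModule(source, key);
console.log(verifyModule(source, key, signature));                 // true
console.log(verifyModule(source + "// tampered", key, signature)); // false
```

Unlike a plain SHA-256 hash, the tag cannot be recomputed by code that can edit the file but does not hold the key, which is the property that matters when the agent modifies its own modules.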

Testing

    npm test                # All tests (162 suites)
    npm run test:coverage   # With coverage report (c8)
    npm run ci              # Full CI: tests + event audit + channel audit + fitness gate


All tests run without external dependencies (no Ollama, no API keys, no internet). Tested on Node 20, 22. CI runs on Ubuntu + Windows via GitHub Actions.

Code Metrics by Layer

| Layer | Files | LOC | Purpose |
|---|---|---|---|
| Core | 14 | ~4,450 | EventBus, Container, Constants, Logger, CorrelationContext, CircuitBreaker, CancellationToken, PreservationInvariants |
| Foundation | 27 | ~7,770 | ModelBridge, Backends, Sandbox, KG, WorldState (+ Queries + Snapshot), TrustLevels, Telemetry, LLMCache |
| Intelligence | 17 | ~6,270 | Verification, Safety Scanner, Intent, Reasoning, ContextManager, PromptBuilder, PromptEvolution |
| Capabilities | 19 | ~6,000 | Shell, MCP (Client + Server + Transport + CodeExec + Worker + ToolBridge), HotReload, Skills, Plugins, WebPerception |
| Planning | 11 | ~2,950 | Goals, GoalPersistence, MetaLearning, SchemaStore, ValueStore |
| Hexagonal | 16 | ~5,890 | ChatOrchestrator, SelfModPipeline, Memory (Unified + Episodic + Adaptive), PeerNetwork, PeerConsensus |
| Autonomy | 8 | ~2,650 | IdleMind, Daemon, HealthMonitor, HealthServer, CognitiveMonitor, ErrorAggregator |
| Organism | 14 | ~4,980 | Emotions (5D), Homeostasis (6 vitals), Needs (4 drives), Metabolism, ImmuneSystem, Genome, Epigenetic, Fitness, BodySchema, EmbodiedPerception |
| Revolution | 14 | ~5,710 | AgentLoop (+ Steps/Planner delegates), FormalPlanner, HTN, VectorMemory, NativeToolUse, ModelRouter |
| Cognitive | 14 | ~5,760 | DreamCycle, ExpectationEngine, SurpriseAccumulator, MentalSimulator, SelfNarrative, CognitiveWorkspace, OnlineLearner, LessonsStore, ReasoningTracer, ArchitectureReflection, DynamicToolSynthesis |
| Consciousness | 14 | ~6,000 | PhenomenalField (+ Computation delegate), TemporalSelf, IntrospectionEngine, AttentionalGate, EchoicMemory, DreamEngine |
| Ports | 7 | ~860 | Hexagonal adapters (LLM, Memory, Knowledge, Sandbox, CodeSafety, Workspace) |
| Total | 218 | ~73,500 | (all src/, incl. UI + kernel) |

Project Stats

| Metric | Value |
|---|---|
| Source modules | 218 JS files in src/ |
| Lines of code | ~73.5k src + ~38k test |
| Manifest phases | 13 (+ Phase 0 bootstrap) |
| DI services | 116 (manifest-registered) |
| Late-bindings | 197 cross-phase property injections |
| Test suites | 163 files, ~2842 tests (coverage gates: 60/50/55) |
| Dependencies | 4 production + 3 optional + 5 dev |
| LLM backends | 3 (Anthropic, OpenAI-compatible, Ollama) |
| Anthropic models | 3 (Opus 4, Sonnet 4, Haiku 4.5) |
| IPC channels | 38 invoke + 2 send + 6 receive = 46 (rate-limited, all in sync) |
| Event types | 308 across 86 namespaces (catalogued in EventTypes.js) |
| Cross-layer event flows | 273 emitted events, 58 listeners (via EventBus, no direct imports) |
| Hexagonal ports | 6 (LLM, Memory, Knowledge, Sandbox, CodeSafety, Workspace) |
| Cognitive modules | 14 (ExpectationEngine, MentalSimulator, SurpriseAccumulator, DreamCycle, SelfNarrative, CognitiveHealthTracker, CognitiveWorkspace, OnlineLearner, LessonsStore, ReasoningTracer, ArchitectureReflection, DynamicToolSynthesis, ProjectIntelligence, McpServerToolBridge) |
| Consciousness modules | 14 (PhenomenalField, TemporalSelf, IntrospectionEngine, AttentionalGate, EchoicMemory, PredictiveCoder, NeuroModulators, SalienceGate, DreamEngine, ConsciousnessState + 3 adapters/delegates) |
| Organism | 5 emotional dimensions + homeostasis + allostasis + 4 needs + steering + metabolism + immune system + heritable genome + epigenetic conditioning + fitness evaluation + body schema + embodied perception |
| Safety layers | 12 (kernel lock → hash-lock → preservation invariants → AST scan → port → sandbox → worker → circuit breaker → immune → trust → validateWrite → blocklist) |
| Trust levels | 4 (supervised → full autonomy) |
| Languages | EN primary (+ DE, FR, ES via i18n) |
| Architectural fitness | 90/90 (100%): 0 cross-layer violations, 0 orphans, 0 phantoms, 0 TSC errors, 0 @ts-nocheck |
| TypeScript checking | 218/218 files checked: 0 errors, 0 @ts-nocheck |

Memory Architecture

Genesis has a five-layer memory system, unified through a facade:

┌────────────────────────────────────────────────────────────┐
│ UnifiedMemory (read facade over all layers)                │
├────────────────────────────────────────────────────────────┤
│ AdaptiveMemory [P12]  Retention scoring + smart decay      │
│ VectorMemory          Embedding-based semantic search      │
│ EpisodicMemory        Timestamped experiences + causality  │
│ ConversationMemory    Chat history + episode extraction    │
│ KnowledgeGraph        Concept nodes + typed relations      │
│ WorldState            Live snapshot of system state        │
├────────────────────────────────────────────────────────────┤
│ SchemaStore [P9]      Abstract patterns from DreamCycle    │
└────────────────────────────────────────────────────────────┘

All persistence goes through StorageService (write-queued, atomic JSON writes). The EmbeddingService is optional — without it, search degrades to keyword matching.
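To make "write-queued, atomic JSON writes" concrete, here is a minimal sketch of one common way to get both properties in Node. It is an assumption about the general technique, not the actual StorageService code: the class name `JsonWriteQueue` and the temp-file naming scheme are hypothetical. Writes are serialized through a promise chain, and each write goes to a temporary file that is then renamed over the target.

```javascript
const fs = require('node:fs/promises');
const path = require('node:path');

// Hypothetical sketch, not the Genesis StorageService: serialize writes
// through a promise chain and make each one atomic via temp file + rename.
class JsonWriteQueue {
  constructor() {
    this.tail = Promise.resolve(); // the most recently queued operation
  }

  /** Queues a write of `data` as JSON to `file`; resolves once it is on disk. */
  write(file, data) {
    const job = this.tail.then(async () => {
      const tmp = `${file}.${process.pid}.tmp`;
      await fs.mkdir(path.dirname(file), { recursive: true });
      await fs.writeFile(tmp, JSON.stringify(data, null, 2), 'utf8');
      // On POSIX file systems, rename within one volume replaces the target
      // atomically, so readers see the old file or the new one, never a
      // half-written file.
      await fs.rename(tmp, file);
    });
    // Keep the chain usable even if this job fails; the caller still sees
    // the rejection through the returned promise.
    this.tail = job.catch(() => {});
    return job;
  }
}
```

Because every job is chained onto `tail`, two callers writing the same file cannot interleave, and the last queued write determines the final content.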

Known Limitations

Pure JavaScript with TypeScript checking — types/core.d.ts and types/node.d.ts provide type declarations. All 218 source files pass tsc --checkJs with 0 errors. Type-safety relies on JSDoc annotations, targeted @ts-ignore for prototype delegation patterns, and CI enforcement.
VM sandbox is not a true sandbox — vm.createContext provides isolation for quick evals but is explicitly NOT security-grade for untrusted code. Untrusted code must use process-mode execute() or Linux namespace isolation.
sandbox: false on Electron <35 — CJS preload requires require() which is blocked by sandbox:true. contextIsolation:true is the primary security boundary. See preload.mjs for ESM preload implementation.
Single-instance storage — StorageService serializes writes but all autonomous systems share the same .genesis/ directory.
Consciousness metrics are heuristic — Phi (integrated information) in PhenomenalField is a simplified proxy, not a rigorous IIT implementation.
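The VM sandbox limitation above is easier to picture with a small example of what `vm.createContext` does provide. The helper name `quickEval` and the default timeout are assumptions for illustration. The sketch shows a fresh context with no host globals and a bounded run time, which is useful for quick evals but, as noted above, is not a security boundary for untrusted code.

```javascript
const vm = require('node:vm');

// Hypothetical helper: evaluate a short expression in a fresh context.
// This keeps host globals such as `require` and `process` out of scope and
// bounds synchronous run time, but it is NOT a security boundary.
function quickEval(code, globals = {}, timeoutMs = 50) {
  // Copy the globals so the evaluated code cannot mutate the caller's object.
  const context = vm.createContext({ ...globals });
  return vm.runInContext(code, context, { timeout: timeoutMs });
}

console.log(quickEval('a * b', { a: 6, b: 7 })); // 42
console.log(quickEval('typeof require'));        // "undefined"
```

A runaway loop such as `quickEval('while (true) {}')` throws once the timeout elapses. For untrusted code, the process-mode `execute()` or Linux namespace isolation mentioned above remain the appropriate paths.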

Dependencies

{
  "acorn": "^8.16.0",
  "chokidar": "^3.6.0",
  "electron": "^33.0.0",
  "monaco-editor": "^0.44.0",
  "tree-kill": "^1.2.2"
}

No LangChain. No LlamaIndex. Everything self-written.

Documentation

Architecture & Design

| Document | What it covers |
|---|---|
| QUICK-START.md | Start here: first conversation, first goal, self-modification, boot profiles |
| ARCHITECTURE-DEEP-DIVE.md | Technical deep-dive: boot sequence, DI container, service lifecycle, data flows |
| EVENT-FLOW.md | EventBus event map: which modules emit/consume which events |
| CAPABILITIES.md | Complete feature overview: what Genesis can do, organized by category |
| COMMUNICATION.md | How Genesis instances communicate: IPC, EventBus, PeerNetwork, MCP |
| DEGRADATION-MATRIX.md | What breaks if each service is missing (auto-generated) |

Cognitive & Consciousness

| Document | What it covers |
|---|---|
| phase9-cognitive-architecture.md | Phase 9 design: DreamCycle, Expectations, SelfNarrative, MentalSimulator, ArchitectureReflection, DynamicToolSynthesis |
| consciousness-extension-architecture.md | Phase 12-13: closed perceptual loop, EchoicMemory, PredictiveCoder, NeuroModulators, DreamEngine |

Operations

| Document | What it covers |
|---|---|
| TROUBLESHOOTING.md | Common problems and solutions: install, boot, LLM, self-modification, platform-specific |
| ROADMAP-v6.md | Development roadmap: completed phases, deferred proposals |
| MCP-SERVER-SETUP.md | MCP server setup: IDE integration (VSCode, Cursor, Claude Desktop), headless CLI |
| AUDIT-BACKLOG.md | Architectural health tracking: resolved issues, monitor items, fitness metrics |

Contributing & Security

| Document | What it covers |
|---|---|
| CONTRIBUTING.md | How to contribute: conventions, security rules, PR process, service templates |
| SECURITY.md | Security policy: 7-layer defense model, threat model, vulnerability reporting |
| CHANGELOG.md | Version history with detailed per-release notes |

Schemas & Configuration

| Document | What it covers |
|---|---|
| schemas/skill-manifest.schema.json | JSON Schema for third-party skill/plugin manifests |
| typedoc.json | TypeDoc config: run npx typedoc to generate API docs |

Contributing

See CONTRIBUTING.md for the full guide — architecture, conventions, security rules, and PR process.

License

Genesis doesn't just answer questions. It persists its goals, reasons over its own graph, feels the consequences of failure, and graduates its own autonomy.

Original source

Hacker News AI Top

https://github.com/Garrus800-stack/genesis-agent
