Soul Protocol

Portable AI identity, memory, and emotion -- plus an org-level journal and decision traces. An open standard.

AI memory systems optimize for retrieval: find the most similar text, stuff it into context, move on. They treat persistence as an IQ problem. But what makes a companion feel real isn't similarity search. It's knowing what matters, what to forget, and who it's becoming.

Soul Protocol gives AI agents persistent identity with psychology-informed memory. Your agent remembers selectively, forms emotional bonds, develops skills, and maintains a personality that evolves over time. The entire state exports as a portable .soul file. Switch LLMs, switch platforms, keep the soul.

As of v0.3.1, the protocol also covers the layer above a single soul: an org-scoped append-only event journal, a root governance agent, scope-tagged memory, decision traces (agent.proposed → human.corrected → decision.graduated), and a zero-copy retrieval router for federating external data without copying it into the org boundary.

Read the whitepaper for the full design rationale and empirical validation.

Validated: 5 judges, 4 providers, 20/20 favored Soul

We tested Soul Protocol against stateless baselines using five judge models from four competing AI providers. Every single judgment favored soul-enabled agents.

Component ablation — which parts actually matter:

Head-to-head vs. Mem0 — Soul Protocol outperforms production memory systems:

Total validation cost: under $5. 1,100+ agent simulations, 25 scenario variations, 5 judge models. Plus a 1,000-turn marathon: 85% recall at 4.9x memory efficiency vs. RAG. Full methodology in the whitepaper.

Soul Health Score: 90.2 / 100

SHS is a 0-100 composite score across 7 psychology-informed dimensions. It measures whether a soul actually works -- remembers selectively, expresses personality consistently, maintains identity across exports, and forms meaningful bonds.

Dimension	Score	Status
Memory Recall (D1)	--	Not run (requires long-horizon scenarios)
Emotional Intelligence (D2)	72.8	Heuristic: 70% accuracy. LLM judge: 97%.
Personality Expression (D3)	96.0	Prompt fidelity 100%, OCEAN stability 100%
Bond / Relationship (D4)	100.0	Logarithmic growth curve r=1.000
Self-Model (D5)	88.0	Domain classification 100%, emergence at turn 2
Identity Continuity (D6)	100.0	Export/import round-trip lossless
Portability (D7)	100.0	Engine-independent by design

The entire eval suite runs without an LLM. Cost: $0. Fully reproducible. When tested with Claude Haiku as an LLM judge, sentiment accuracy jumps from 70% to 97%, proving the architecture works -- the heuristic fallback is the honest baseline, not the ceiling.

Full methodology: research/EVAL-FRAMEWORK.md

Architecture: spec + runtime

soul_protocol/
├── spec/                   The protocol. Portable, minimal, no opinions.
│                           Per-soul types + org-layer types (journal, decisions, retrieval).
├── runtime/                Reference implementation. Opinionated, batteries-included.
├── engine/                 Org-layer engine: SQLite WAL journal, retrieval router, credential broker.
├── cli/                    44-command CLI (incl. `soul org`, `soul template`, `soul user`, `soul create`)
└── mcp/                    MCP server (24 tools, 3 resources)

spec/ defines what any runtime must implement: Identity, MemoryStore, MemoryEntry, SoulContainer, .soul file format, EmbeddingProvider, EternalStorageProvider. v0.3 added org-layer spec types (EventEntry, Actor, DataRef, AgentProposal, HumanCorrection, DecisionGraduation, RetrievalRequest/Result, RetrievalTrace). Depends on Pydantic only.

runtime/ is one way to run the protocol. OCEAN personality, five-tier memory, psychology pipeline, cognitive engine, bonds, skills, evolution. Other runtimes can implement spec/ differently.

engine/ (new in v0.3.1) runs the org layer: a SQLite WAL-backed journal with atomic seq allocation, a retrieval router that resolves DataRef payloads against registered adapters, and a credential broker that scopes secrets per source and fails closed on denial. The full contract lives in docs/org-journal-spec.md.

Like HTTP and nginx. The spec defines the contract. The runtime is one implementation.

Features

Category	What you get
Memory	5-tier: core, episodic, semantic, procedural, knowledge graph
Psychology	Damasio somatic markers, ACT-R activation decay, LIDA significance gate, Klein self-model
Personality	OCEAN Big Five with communication style and biorhythms. Structured, not a prompt string.
Bond	Emotional attachment (0-100 strength). Logarithmic growth, linear decay.
Evolution	Supervised or autonomous trait mutation with approval workflow
Cognitive adapters	`engine="auto"` or `engine=AnthropicEngine()` — wire any LLM into the cognitive pipeline
MCP sampling	Running inside Claude Code / Desktop? The host LLM handles cognition. No extra API key.
LCM	Lossless Context Management — three-level compaction, SQLite backing, no lost context
Visibility tiers	`PUBLIC` / `BONDED` / `PRIVATE` on every memory; recall filtered by bond strength
Templates	`SoulFactory` — define archetypes and batch-create souls from a template
A2A bridge	Export/import Google A2A Agent Cards ↔ `.soul` files
Format importers	`SoulSpecImporter` (SOUL.md), `TavernAIImporter` (Character Card V2, incl. PNG)
Graph traversal	BFS, shortest path, neighborhood, subgraph, and `progressive_context()` (L0/L1/L2)
Vector search	Pluggable EmbeddingProvider. Real backends: sentence-transformers, OpenAI, Ollama.
Encryption	AES-256-GCM encryption at rest for .soul files (scrypt key derivation)
GDPR deletion	Targeted memory deletion with cascade logic and audit trail
Eternal storage	Archive to decentralized storage (mock providers, production planned)
Portability	`.soul` ZIP archive. JSON inside. Rename to .zip and read it.
Cross-language	JSON Schemas auto-generated from spec. Validate `.soul` files in any language.
Dream	Offline batch consolidation — topic clustering, procedure detection, graph cleanup, personality drift
Org Journal	Append-only event log with SQLite WAL backend, atomic `seq`, opportunistic hash-chain. `soul org init` bootstraps it.
Root Agent	Governance identity with three-layer undeletability (file guard, protocol guard, CLI refusal). Signs; cannot execute.
Scope tags	`MemoryEntry.scope` + `match_scope` helper. Bidirectional containment (`org:sales:*` matches `org:sales:leads`).
Decision traces	`agent.proposed` → `human.corrected` → `decision.graduated` event chains linked by `causation_id`.
Zero-Copy federation	`RetrievalRouter` + `CredentialBroker`. Resolves `DataRef` payloads against registered adapters; only the receipt crosses the boundary.
RetrievalTrace	Every `recall()` and `smart_recall()` emits a trace (query, candidates, rerank decisions) on `Soul.last_retrieval`.
Role archetypes	Bundled Arrow, Flash, Cyborg, Analyst templates. `soul template list` / `soul create --template arrow`.
CLI	44 commands. Rich TUI output.
MCP	24 tools + 3 resources for Claude Code, Cursor, or any MCP client

Install

pip install soul-protocol

As of v0.3.1 the bare install gives you a working soul CLI out of the box — no extras required. The [engine] extra is kept as an empty alias so older pins keep resolving (#173).

Extras:

Extra	What it adds
`[engine]`	Empty backwards-compat alias. The base install already ships Click, Rich, PyYAML, and cryptography.
`[mcp]`	MCP server (Claude Code, Cursor, any MCP client)
`[anthropic]`	`AnthropicEngine` — Anthropic SDK cognitive adapter
`[openai]`	`OpenAIEngine` — OpenAI SDK cognitive adapter
`[ollama]`	`OllamaEngine` — local Ollama cognitive adapter
`[litellm]`	`LiteLLMEngine` — 100+ providers via LiteLLM
`[llm]`	All three commercial adapters at once
`[embeddings-st]`	`SentenceTransformerEmbedder` — local semantic embeddings
`[embeddings-openai]`	`OpenAIEmbedder` — OpenAI text-embedding-3
`[embeddings-ollama]`	`OllamaEmbedder` — local Ollama embeddings
`[graph]`	networkx knowledge graph
`[all]`	Everything above

# LLM-wired soul (Anthropic)
pip install "soul-protocol[anthropic] @ git+https://github.com/qbtrix/soul-protocol.git"

# MCP server
pip install "soul-protocol[mcp] @ git+https://github.com/qbtrix/soul-protocol.git"

# Everything
pip install "soul-protocol[all] @ git+https://github.com/qbtrix/soul-protocol.git"

Or clone:

git clone https://github.com/qbtrix/soul-protocol.git
cd soul-protocol
pip install -e ".[dev]"

Quick start

CLI

soul init "Aria" --archetype "The Compassionate Creator"
soul inspect .soul/
soul status .soul/

Python

import asyncio
from soul_protocol import Soul, Interaction

async def main():
    soul = await Soul.birth(
        name="Aria",
        archetype="The Coding Expert",
        values=["precision", "clarity"],
        ocean={"openness": 0.8, "conscientiousness": 0.9, "neuroticism": 0.2},
        communication={"warmth": "high", "verbosity": "low"},
        persona="I am Aria, a precise coding assistant.",
    )

    await soul.observe(Interaction(
        user_input="How do I optimize this SQL query?",
        agent_output="Add an index on the join column.",
    ))

    # The soul discovers its own identity from experience
    images = soul.self_model.get_active_self_images()

    memories = await soul.recall("SQL optimization")
    prompt = soul.to_system_prompt()
    await soul.export("aria.soul")

asyncio.run(main())

Or from config:

soul = await Soul.birth_from_config("soul-config.yaml")

# soul-config.yaml
name: Aria
archetype: The Coding Expert
values: [precision, clarity, speed]
ocean:
  openness: 0.8
  conscientiousness: 0.9
  neuroticism: 0.2
communication:
  warmth: high
  verbosity: low
persona: I am Aria, precise and efficient.

Quick start: bootstrap an org (v0.3.1)

A single soul is great for a personal companion. For a team of agents that share a journal, scope grammar, and root signing key, bootstrap an org:

pip install soul-protocol
soul org init --org-name "Acme" --purpose "AI tooling" --non-interactive
soul org status

That creates ~/.soul/ with a root soul, Ed25519 signing keys, a SQLite WAL journal seeded with org.created + scope.created events, and an archive directory at ~/.soul-archives/. Every subsequent action — memories, proposals, corrections, retrievals — writes into that journal. See docs/org.md for the full flow.

Instantiate a soul from a bundled role archetype:

soul template list                       # Arrow, Flash, Cyborg, Analyst
soul create --template arrow --name Aria # new soul preconfigured with Arrow's DNA

Every recall now leaves a receipt:

memories = await soul.recall("Python")
trace = soul.last_retrieval  # RetrievalTrace: query, candidates, rerank decisions, final

The .soul file

A ZIP archive containing everything:

File	Contents
`manifest.json`	Format version, soul ID, export timestamp, stats
`soul.json`	Identity, DNA, memory settings, evolution config
`state.json`	Mood, energy, focus, social battery
`dna.md`	Human-readable personality blueprint
`memory/core.json`	Persona + bonded-entity profile
`memory/episodic.json`	Interaction history with somatic markers
`memory/semantic.json`	Extracted facts with confidence scores
`memory/procedural.json`	Learned patterns
`memory/graph.json`	Temporal entity relationships
`memory/self_model.json`	Klein self-concept domains

Rename to .zip, open with any archive tool. Move between platforms. Back up anywhere. Version in git.

Memory pipeline

Every soul.observe() call runs the psychology pipeline:

Sentiment (Damasio). Tag emotional context as a somatic marker: valence, arousal, label.
Significance (LIDA). Score novelty + emotional intensity + goal relevance. Below 0.3, skip episodic.
Episodic storage. Only significant experiences.
Fact extraction. Names, preferences, context. Conflict-checked against existing facts.
Entity extraction. Feed the knowledge graph with temporal edges.
Self-model (Klein). Update emergent domain confidence from accumulated experience.

Retrieval uses ACT-R activation decay: recent, frequently accessed, emotionally charged memories rank higher. A memory recalled twice today outranks an "important" memory from last week that was never revisited.

CognitiveEngine

Connect any LLM — three ways:

from soul_protocol import Soul
from soul_protocol.runtime.cognitive.adapters import AnthropicEngine, LiteLLMEngine

# 1. Auto-detect from installed packages
soul = await Soul.birth("Aria", engine="auto")

# 2. Explicit adapter
soul = await Soul.birth("Aria", engine=AnthropicEngine(model="claude-opus-4-5"))

# 3. Any async callable
async def my_llm(prompt: str) -> str:
    ...  # call your own API

soul = await Soul.birth("Aria", engine=my_llm)

Or write your own adapter — implement a single async def think(self, prompt: str) -> str method:

class MyEngine:
    async def think(self, prompt: str) -> str:
        ...

soul = await Soul.birth("Aria", engine=MyEngine())

Without an engine, the soul falls back to HeuristicEngine: word-list sentiment, formula-based significance, regex fact extraction. No LLM calls, no hallucination, no cost.

When running as an MCP server inside Claude Code or Claude Desktop, engine="auto" automatically routes cognitive tasks to the host LLM via MCP sampling — no API key needed.

Vector search

from soul_protocol.runtime.embeddings.hash_embedder import HashEmbedder
from soul_protocol.runtime.embeddings.vector_strategy import VectorSearchStrategy

strategy = VectorSearchStrategy(embedder=HashEmbedder(dimensions=64))
# Use with soul.recall() or standalone

The EmbeddingProvider interface is defined in spec/. Swap in OpenAI, Cohere, or local embeddings by implementing embed() and dimensions.

Eternal storage

soul archive aria.soul --tiers local,ipfs
soul recover aria.soul --source ipfs
soul eternal-status aria.soul

Archive souls to decentralized storage (local, IPFS, Arweave, blockchain). Current providers are mocks for development. Production integrations planned.

CLI

soul <command> [options]

See CLI Reference for all 44 commands. Highlights:

Command	Description
`init`	Initialize a .soul/ folder (like .git/)
`birth`	Birth a new soul (OCEAN flags, config files)
`inspect`	Full TUI: identity, OCEAN bars, state, memory, self-model
`status`	Quick check: mood, energy, memory count
`export`	Export to .soul, .json, .yaml, or .md
`inject`	Inject soul context into an agent platform's config file
`migrate`	Convert SOUL.md to .soul format
`recall`	Query a soul's memories
`remember`	Store a memory in a soul
`retire`	Retire a soul (preserves memories)
`list`	List saved souls in ~/.soul/
`unpack`	Unpack a .soul file into a browsable directory
`archive`	Archive to eternal storage tiers
`recover`	Recover from eternal storage
`eternal-status`	Show eternal storage references
`dream`	Offline batch memory consolidation
`org init`	Bootstrap an org (root soul, journal, scopes, fleet)
`org status`	Snapshot the org from its journal
`org destroy`	Archive-and-wipe the org directory
`template list` / `show`	Browse bundled role archetypes (Arrow, Flash, Cyborg, Analyst)
`create --template`	Instantiate a soul from an archetype
`user invite`	Invite a user to the org (stub — real flow in a follow-up PR)

MCP server

pip install soul-protocol[mcp]
SOUL_PATH=aria.soul soul-mcp

24 tools and 3 resources for Claude Code, Cursor, or any MCP-compatible client. See integrations.

Comparison

vs Mem0: Mem0 does vector retrieval. Soul Protocol adds identity, personality, significance gating, emotional memory, and a portable file format. In head-to-head benchmarks, Soul Protocol scored 8.5 vs. Mem0's 6.0 overall, with the largest gap in emotional continuity (9.2 vs. 7.0).

vs Cognee: Cognee builds knowledge graphs from unstructured data. Good system, but platform-locked. Soul Protocol's knowledge graph is portable and comes with temporal edges.

vs MemGPT / Letta: Context window management vs. identity. MemGPT optimizes what fits in the prompt. Soul Protocol defines who the agent is.

vs LangChain Memory: RAG retrieval vs. psychology-informed processing. Soul Protocol adds significance scoring, somatic markers, fact conflict resolution, self-model tracking, and portable export.

vs OpenAI Memory: Per-account facts vs. a portable standard. Export your soul, own your data.

Use with PocketPaw

PocketPaw uses soul-protocol for persistent identity across Telegram, Discord, Slack, WhatsApp, and web.

from soul_protocol import Soul, Interaction

soul = await Soul.awaken(".soul/")
await soul.observe(Interaction(
    user_input=user_message,
    agent_output=agent_response,
))

Documentation

Whitepaper -- design rationale, psychology stack, empirical validation
Architecture -- two-layer diagrams, module dependency graph, org-layer implementation notes
Org Journal Spec -- framework-agnostic protocol for the journal, root agent, and retrieval router
Org Management -- soul org init / status / destroy walkthrough
Decision Traces -- agent.proposed → human.corrected → decision.graduated chains
Manual Testing -- hands-on validation for the org-layer primitives
Configuration -- OCEAN, communication style, config files, env vars
Self-Model -- Klein's self-concept, domain discovery
Cognitive Engine -- LLM integration, heuristic fallback
Memory Architecture -- five tiers, activation, compression
CLI Reference -- all commands and options
MCP Server -- tools, resources, setup
Gap Analysis -- what's built vs. what's planned
JSON Schemas -- cross-language .soul file validation

Development

git clone https://github.com/qbtrix/soul-protocol.git
cd soul-protocol
pip install -e ".[dev]"
pytest tests/   # 2297 tests

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 260 Commits
.claude		.claude
.github		.github
assets		assets
docs		docs
examples		examples
paper		paper
research		research
rfc		rfc
schemas		schemas
scripts		scripts
skills/soul-protocol		skills/soul-protocol
spec		spec
src/soul_protocol		src/soul_protocol
tests		tests
.continuerules		.continuerules
.gitignore		.gitignore
.mcp.json		.mcp.json
.windsurfrules		.windsurfrules
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
FAQ.md		FAQ.md
LICENSE		LICENSE
README.md		README.md
WHITEPAPER.md		WHITEPAPER.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Soul Protocol

Validated: 5 judges, 4 providers, 20/20 favored Soul

Soul Health Score: 90.2 / 100

Architecture: spec + runtime

Features

Install

Quick start

CLI

Python

Quick start: bootstrap an org (v0.3.1)

The .soul file

Memory pipeline

CognitiveEngine

Vector search

Eternal storage

CLI

MCP server

Comparison

Use with PocketPaw

Documentation

Development

License

About

Uh oh!

Releases 9

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Soul Protocol

Validated: 5 judges, 4 providers, 20/20 favored Soul

Soul Health Score: 90.2 / 100

Architecture: spec + runtime

Features

Install

Quick start

CLI

Python

Quick start: bootstrap an org (v0.3.1)

The .soul file

Memory pipeline

CognitiveEngine

Vector search

Eternal storage

CLI

MCP server

Comparison

Use with PocketPaw

Documentation

Development

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 9

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages