MetaCortex

MetaCortex is a serverless MCP memory service backed by Firestore vector search and deployed through Firebase Cloud Functions 2nd Gen.

Why Metacortex?

Persistent memory across any MCP client (ChatGPT web, Claude, etc.) with zero infrastructure management.

Tip

Brain & Body: MetaCortex resides in the cloud as the memory core, while Autonomous agents such as OpenClaw act as its local manifest (body). See the full Architecture & Use Cases for details.

The practical target is a remote MCP server that chat clients such as ChatGPT web or Claude web can use for:

searching what the project already knows
saving new durable memories from chat
fetching the full stored memory behind a search result

How It Works

1. Chat clients use a narrow memory contract

MetaCortex gives browser clients a three-tool memory contract. The first write comes through remember_context, which stores canonical text and lifecycle metadata on the server side.

2. Retrieval stays on the same remote backend

The same scoped MCP contract makes retrieval available later through search_context and fetch_context. ChatGPT and Claude both follow the same server-side retrieval flow, each with its own token and origin allowlist.

3. Firestore stores durable memory records plus vectors

Every saved memory lands in Firestore with searchable metadata and the corresponding vector embedding. That keeps the retrieval layer serverless while still exposing durable semantic search.

4. Scoped client profiles keep browser access safe

Browser clients never connect to the admin surface directly. Each client profile gets its own endpoint, token, origin allowlist, and allowed tool list so public chat surfaces only see the read/write tools they actually need.

Important constraint

As of March 10, 2026, Cloud Functions production deployment requires the Firebase Blaze plan. The original Spark-only production target from the initial spec is not compatible with current Firebase Functions deployment rules, though low-traffic usage can still remain close to zero cost within Blaze no-cost quotas.

Primary use cases

This project is set up for these workflows:

A chat client asks, "What do we already know about auth/session handling?" The model calls search_context.
The search results include stable id values and external artifact refs when available. The model can call fetch_context with that same id for the one result it wants in full.
A user says, "Remember that we use Ktor for shared Android and iOS networking." The model calls remember_context.
A user shares a screenshot and says to save it for later retrieval. The model calls remember_context with image input plus artifact_refs if the real asset lives in storage.

Tool strategy

The current MCP surface is intentionally split between:

a 3-tool client-facing contract for browser-hosted chat clients
a smaller admin-only maintenance surface documented later in this README

That means the server currently exposes 4 MCP tools total, but normal browser clients should only see 3 of them.

Client-facing tools

This is the public/browser contract:

remember_context The single write tool for normal chat use. The client supplies the memory text, optional topic, optional draft=true for rough notes, optional image input, and optional artifact_refs. The server fills in sensible defaults.
search_context Vector search over stored memories. Results include stable id values and artifact refs when available.
fetch_context Fetch one memory by id after remember_context or search_context.

Why `remember_context` Is The Write Tool

remember_context keeps the public write surface simple:

topic is the public label and maps to the stored module_name internally
normal writes store canonical memory as active
draft=true stores draft material as wip

Explicit lifecycle overrides and cleanup flows are part of the admin maintenance surface described later in this README.

Metadata model

The main public metadata field is topic. Stored records also carry lifecycle metadata such as branch_state, but that is primarily for admin cleanup and filtering and is documented later in Admin Cleanup And Consolidation.

`topic`

Public topic or subsystem label for MCP clients. Internally this is stored as module_name.

Examples:

auth
billing
kmp-networking
ui-settings

If omitted, the server defaults it to general.

Images

This project supports image-backed memories, but it does not store raw image bytes for later download.

What happens today:

the image is normalized into retrieval text by Gemini
that text is embedded and stored
optional artifact_refs can point to the real asset, for example gs://bucket/path.png
search results and fetched records return those artifact refs when they exist

That means the practical image flow is:

save a screenshot with remember_context
store the real asset elsewhere
include its artifact_refs
let semantic search find the memory
let the client follow the returned artifact ref to the actual screenshot

Endpoints

Default Streamable HTTP MCP endpoint: /metaCortexMcp/mcp
Client-scoped Streamable HTTP MCP endpoint: /metaCortexMcp/clients/<clientId>/mcp

Security model:

the default /mcp endpoint is the admin endpoint
clients/<clientId> endpoints let you expose smaller toolsets to specific consumers
MCP_ALLOWED_ORIGINS applies only to the default admin endpoint
browser CORS should be configured per client profile through MCP_CLIENT_PROFILES_JSON[].allowedOrigins
leave MCP_ALLOWED_ORIGINS empty unless you intentionally want browser access to the admin endpoint

Browser Client Setup

For browser-hosted MCP clients, register the scoped endpoint, not the admin endpoint:

ChatGPT web URL: https://<FUNCTION_BASE_URL>/clients/chatgpt-web/mcp?auth_token=<YOUR_CHATGPT_TOKEN>
Claude web URL: https://<FUNCTION_BASE_URL>/clients/claude-web/mcp
bearer token: the token value from the matching client profile
allowed browser origins: the matching profile's allowedOrigins

Do not register https://<FUNCTION_BASE_URL>/mcp with ChatGPT web or Claude web. That endpoint is the admin surface and uses MCP_ADMIN_TOKEN.

Use separate client profiles per browser client:

chatgpt-web with allowedOrigins=["https://chatgpt.com"]
claude-web with allowedOrigins=["https://claude.ai"]

For agent-based clients such as OpenClaw, use a dedicated non-browser scoped profile instead of the admin endpoint. A recommended operating model is documented in docs/OPENCLAW_MEMORY_OPS.md.

Connecting to ChatGPT

ChatGPT's current MCP UI does not support configuring custom Authorization: Bearer headers. To work around this security limitation, MetaCortex supports passing the token securely via the URL.

Open ChatGPT Web or Desktop.
Open Settings -> Connected Apps (or MCP Settings).
Click "Add new App" or "Connect MCP Server".
Set Auth Type to No Authentication.
Set the MCP URL to your tokenized endpoint: https://<FUNCTION_BASE_URL>/clients/chatgpt-web/mcp?auth_token=<YOUR_CHATGPT_TOKEN>

MetaCortex will validate the token from the URL and reject unauthenticated requests even though ChatGPT is configured for "No Auth".

Connecting to Claude

Depending on your Claude client (e.g., experimental web extensions or custom UIs), you can configure the connection in two ways:

Option 1: Standard Headers (Preferred)

MCP URL: https://<FUNCTION_BASE_URL>/clients/claude-web/mcp
Auth Type: Bearer Token / Service Token
Token: Bearer <YOUR_CLAUDE_TOKEN>

Option 2: Tokenized URL (If headers are unsupported)

Auth Type: No Authentication
MCP URL: https://<FUNCTION_BASE_URL>/clients/claude-web/mcp?auth_token=<YOUR_CLAUDE_TOKEN>

Tool Contract

The v1 client-facing tools return one TextContent block whose text is a single JSON object.

`remember_context`

Minimal text memory:

{
  "content": "We use Ktor for shared Android and iOS networking.",
  "topic": "kmp-networking"
}

Typical result:

{
  "item": {
    "id": "abc123",
    "content": "We use Ktor for shared Android and iOS networking.",
    "metadata": {
      "topic": "kmp-networking",
      "branch_state": "active",
      "modality": "text",
      "created_at": "2026-03-14T12:00:00.000Z",
      "updated_at": "2026-03-14T12:00:00.000Z"
    }
  },
  "write_status": "created"
}

Use item.id directly with fetch_context.

Image-backed memory with an external asset reference:

{
  "content": "Settings screen screenshot for the Compose UI.",
  "topic": "ui-settings",
  "artifact_refs": ["gs://your-bucket/settings-screen.png"],
  "image_base64": "<base64 image bytes>",
  "image_mime_type": "image/png"
}

`search_context`

Example input:

{
  "query": "shared networking for android and ios",
  "filter_topic": "kmp-networking",
  "filter_state": "active"
}

Typical result:

{
  "matches": [
    {
      "id": "abc123",
      "summary": "We use Ktor for shared Android and iOS networking.",
      "score": 0.92,
      "content_preview": "We use Ktor for shared Android and iOS networking.",
      "metadata": {
        "topic": "kmp-networking",
        "branch_state": "active",
        "modality": "text",
        "created_at": "2026-03-14T12:00:00.000Z",
        "updated_at": "2026-03-14T12:00:00.000Z"
      }
    }
  ],
  "applied_filters": {
    "filter_topic": "kmp-networking",
    "filter_state": "active"
  }
}

If an item has external refs, they appear in metadata.artifact_refs.

If nothing matches, the result is:

{
  "matches": [],
  "applied_filters": {
    "filter_topic": null,
    "filter_state": "active"
  }
}

`fetch_context`

Preferred input: pass the same id returned by remember_context or search_context.

Example input:

{
  "id": "abc123"
}

Typical result:

{
  "item": {
    "id": "abc123",
    "content": "We use Ktor for shared Android and iOS networking.",
    "metadata": {
      "topic": "kmp-networking",
      "branch_state": "active",
      "modality": "text",
      "created_at": "2026-03-14T12:00:00.000Z",
      "updated_at": "2026-03-14T12:00:00.000Z"
    }
  }
}

Search Behavior

search_context does one exact metadata filter step and one vector step:

filter_state is always applied before nearest-neighbor search
filter_topic, when present, is an exact match on the stored topic label
vector search then runs Firestore findNearest() with cosine distance
the result count is limit when provided, otherwise SEARCH_RESULT_LIMIT
the default state is active unless the client profile allows and requests another state

fetch_context can still fail with 403 if the document exists but its branch_state is outside that client profile's allowedFilterStates.

Write Constraints

Write behavior that matters in production:

request bodies are limited to 1mb, including base64 image data
content or image_base64 is required
image_mime_type is required whenever image_base64 is provided
images are normalized into retrieval text and embedded as text; raw image bytes are not stored for download
if you want the real asset later, store it elsewhere and include artifact_refs
exact duplicate writes within the current idempotency window are replay-safe and reuse the existing memory id
duplicate suppression is intentionally light and based on the normalized write fingerprint, not semantic similarity

remember_context defaults:

omitted topic becomes general
omitted draft and omitted lifecycle overrides store branch_state=active
draft=true stores branch_state=wip

Explicit lifecycle overrides are part of the admin maintenance surface described later in this README.

Admin Cleanup And Consolidation

This section is for operators using the admin endpoint. Browser-hosted clients can usually ignore it.

Admin-only maintenance surface:

deprecate_context Soft-delete obsolete memories by setting branch_state=deprecated and recording superseded_by.

Internal maintenance workflow:

WIP review and consolidation are internal workflows, not part of the public browser contract.

Lifecycle states:

active Canonical memory that normal search should return.
wip Draft memory awaiting consolidation.
merged Incorporated memory that is no longer the main active record.
deprecated Obsolete memory kept only for history and audit.

Recommended usage:

Browser clients save durable memories with remember_context.
Agent clients such as OpenClaw should use a dedicated scoped client profile with remember_context, search_context, and fetch_context only.
Use draft=true only for provisional notes that should not appear in normal active search.
WIP review and consolidation stay in internal maintenance workflows.
Admin flows can set explicit branch_state when they need non-default lifecycle control.
After writing the canonical replacement, admins can mark obsolete records with the admin-only deprecate_context tool.

Current lifecycle behavior:

remember_context defaults to active, supports draft=true for wip, and also accepts explicit branch_state for advanced writes
draft and branch_state are mutually exclusive
deprecate_context does not delete data; it sets branch_state=deprecated and records superseded_by
merged exists as a searchable historical state for explicit admin writes
client profiles can restrict visible lifecycle states through allowedFilterStates

Observability

After deployment, there are three places to look:

memory_vectors in Firestore shows the current memory corpus
memory_events in Firestore shows client-attributed tool usage over time
Cloud Logging shows request failures and structured tool-event logs

memory_events records one document per tool call and one document per ingress rejection. Events include:

client_id
event_type
status
timestamp
latency_ms
a compact request summary
either a compact response summary, an error, or a request rejection reason

Examples:

public tool payloads use id for fetchable memory identifiers
remember_context events record the written id, topic, branch_state, and modality
search_context events record the requested filters, result_count, and returned result_ids
fetch_context events record which id was read
deprecate_context events record id, superseding_id, and previous_state
rejected browser/admin requests record reason=origin_not_allowed or reason=unauthorized Traceability is by client profile id, so:
admin endpoint traffic is attributed to client_id=default
ChatGPT web traffic is attributed to client_id=chatgpt-web
Claude web traffic is attributed to client_id=claude-web

What is intentionally not stored in observability events:

full memory bodies
full image bytes
raw image downloads

Search events do include a short query_preview, but the observability collection is designed to track behavior, not duplicate the corpus.

Quick start

Install dependencies:
```
npm --prefix functions install
```

Create local env vars:

cp functions/.env.example functions/.env

For browser-hosted clients, set a scoped client profile in functions/.env or functions/.env.prod:

MCP_CLIENT_PROFILES_JSON=[{"id":"chatgpt-web","token":"replace-chatgpt-token","allowedTools":["remember_context","search_context","fetch_context"],"allowedFilterStates":["active"],"allowedOrigins":["https://chatgpt.com"]},{"id":"claude-web","token":"replace-claude-token","allowedTools":["remember_context","search_context","fetch_context"],"allowedFilterStates":["active"],"allowedOrigins":["https://claude.ai"]}]

Run verification:

npm --prefix functions test
npm --prefix functions run build

Start emulators:
```
npm --prefix functions run serve
```

Optional MCP smoke test:

cd functions
MCP_BASE_URL="http://127.0.0.1:5001/demo-open-brain/us-central1/metaCortexMcp/mcp" \
MCP_ADMIN_TOKEN="replace-me" \
MCP_SMOKE_MODE="admin-read-write" \
npm run smoke

Browser-client flow:

cd functions
MCP_BASE_URL="http://127.0.0.1:5001/demo-open-brain/us-central1/metaCortexMcp/clients/chatgpt-web/mcp" \
MCP_ADMIN_TOKEN="replace-chatgpt-token" \
MCP_SMOKE_MODE="browser-read-write" \
npm run smoke

Repeat with /clients/claude-web/mcp and the Claude token to verify Claude separately.

Deployment

Deployment playbook: docs/DEPLOYMENT.md

For the next production deployment session, start with:

cd /Users/nick/git/metacortex
./scripts/deploy-session-preflight.sh

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.agent/skills		.agent/skills
.agents/skills		.agents/skills
.claude		.claude
docs		docs
functions		functions
journey-kit		journey-kit
scripts		scripts
.gitignore		.gitignore
.node-version		.node-version
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
firebase.json		firebase.json
firestore-debug.log		firestore-debug.log
firestore.indexes.json		firestore.indexes.json
firestore.rules		firestore.rules
metacortexplan.md		metacortexplan.md
skills-lock.json		skills-lock.json

Folders and files

Latest commit

History

Repository files navigation

MetaCortex

Why Metacortex?

How It Works

1. Chat clients use a narrow memory contract

2. Retrieval stays on the same remote backend

3. Firestore stores durable memory records plus vectors

4. Scoped client profiles keep browser access safe

Important constraint

Primary use cases

Tool strategy

Client-facing tools

Why remember_context Is The Write Tool

Metadata model

topic

Images

Endpoints

Browser Client Setup

Connecting to ChatGPT

Connecting to Claude

Tool Contract

remember_context

search_context

fetch_context

Search Behavior

Write Constraints

Admin Cleanup And Consolidation

Observability

Quick start

Deployment

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Why `remember_context` Is The Write Tool

`topic`

`remember_context`

`search_context`

`fetch_context`

Packages