bmfote
v0.11.7
One memory across Claude Code, Cursor, Messages API, and Managed Agents — the context layer your AI tools share.
The problem: your AI is in silos
Every AI tool you run lives in its own context.
- Claude Code on your laptop has its own history in ~/.claude/.
- Cursor / Windsurf have their own local histories, inside the app.
- A Messages API script you ran yesterday stored nothing, anywhere.
- A Managed Agents session runs in Anthropic's cloud and has no idea any of the above exist.
None of them can see into the others. That's not a bug in Anthropic's design — it's a consequence of each product being built by a different team for a different job. The result: you tell every tool the same things every day, and nothing compounds.
bmfote is one shared memory across all of them. Every Claude Code turn, every Messages API call, and every Managed Agents run reads from and writes to the same searchable store. Ask any agent on any surface "what was the ICP we agreed on last Tuesday?" and it finds the answer no matter where the original conversation happened.
vs. Anthropic Managed Agents memory stores
Anthropic shipped built-in memory stores for Managed Agents in April 2026. They're good — auto-invoked, with versioning, redaction, and a console UI bmfote doesn't have. But they only connect one silo (Managed Agents) to itself. bmfote is the bridge across all four.
| | Managed Agents memory stores | bmfote |
|---|---|---|
| Auto-invoked during a session | ✅ | ✅ (Claude Code hooks) |
| Versioning, redaction, console UI | ✅ | ❌ |
| Claude Code history | ❌ | ✅ |
| Cursor / Windsurf history | ❌ | ✅ (via MCP) |
| Messages API agents | ❌ | ✅ (bmfote-client) |
| Managed Agents sessions | ✅ | ✅ (bmfote-agent CLI) |
| Bridge between all four surfaces | ❌ | ✅ |
| Your data, your infra | ❌ | ✅ |
| Multi-user / team-shared (architecture) | ❌ | ✅ (workspace_id; UI coming) |
Use Managed Agents memory stores if your agents live entirely inside /v1/sessions and you want Anthropic to manage versioning and redaction for you.
Use bmfote if you want one memory across every surface an agent can run on, owned by you, survivable if you ever leave Anthropic's walled garden.
Key Features
- 🌉 One memory, four surfaces — Claude Code, Cursor (via MCP), the Messages API, the Claude Agent SDK, and Anthropic Managed Agents all read from and write to the same searchable pool. The only memory layer that isn't captive to /v1/sessions.
- 🪝 Zero-glue for Claude Code — hooks auto-record every session; MCP tools auto-recall on SessionStart.
- 🐍 Any Python agent, same surface — pip install bmfote-client gives Messages API and Agent SDK agents the same recall + write loop Claude Code gets for free.
- 🧠 Agent-initiated writes — agents call remember() to persist what matters, not just passively recall.
- 🔒 Your Turso, your token, your data — self-hosted, bring-your-own-bearer. AGPL server (no closed-SaaS re-hosts) + MIT client (drop into any agent codebase, proprietary or not).
Five MCP tools ship out of the box: search_memory, find_error, get_context, get_recent, remember. Retrieval is SQLite FTS5 with BM25 ranking — no vector DB, no embedding pipeline, no re-index step.
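The retrieval model is simple enough to sketch with stock Python. This is an illustrative example of FTS5 + BM25, not bmfote's actual schema; the table and column names here are made up (the real schema lives in engine/schema.sql):

```python
import sqlite3

# Illustrative sketch of the retrieval approach: SQLite FTS5 with BM25
# ranking, no embeddings, no external index. Names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE messages_fts USING fts5(content)")
conn.executemany(
    "INSERT INTO messages_fts(content) VALUES (?)",
    [
        ("We agreed the ICP is mid-market SaaS teams",),
        ("Deploy notes for the staging cluster",),
        ("ICP follow-up: exclude agencies for now",),
    ],
)
# bm25() returns a score where lower (more negative) means a better
# match, so ordering ascending puts the best hits first.
rows = conn.execute(
    "SELECT content, bm25(messages_fts) AS score "
    "FROM messages_fts WHERE messages_fts MATCH ? ORDER BY score",
    ("ICP",),
).fetchall()
```

Because FTS5 lives inside SQLite, inserts are searchable immediately; there is no embedding pipeline or re-index step to run.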
How It Works
- Write — every Claude Code turn is captured by a PostToolUse hook and streamed to a Turso database. Non-Claude-Code agents do the same via bmfote-client, or by calling remember() mid-turn as an MCP tool.
- Search — a FastAPI server exposes BM25 full-text search over every message, session, and tool call you've ever had with any agent.
- Recall — five MCP tools are auto-registered in Claude Code and reachable over HTTP by any MCP-speaking agent (Cursor, Managed Agents, custom Agent SDK apps).
- Bridge — because recall is HTTP + MCP and writes are SDK-based, the same memory is reachable from every surface an agent can run on. No surface owns it.
See CLAUDE.md for architecture details — schema, FTS5 triggers, embedded-replica vs direct-Turso modes.
Proof: one memory, multiple agent identities
The cross-surface story only matters if it actually works across different agents, not just the same agent on different days. Here's the reproduction, live against the real APIs.
The three-run proof
Three sessions in project n8n-managed-agent, two different agent identities, one shared store:
| # | Agent | Prompt | Behavior |
|---|---|---|---|
| 1 | Agent A | "What do you remember about the bmfote project?" | Retrieved context, answered, persisted turn. |
| 2 | Agent A | "What did you tell me last time?" | Called get_context(uuid=...-agent) on Run 1's UUID, summarized it back. |
| 3 | Fresh Agent B (zero prior sessions) | "Summarize every previous run and cite session IDs." | Made 15 MCP calls including direct UUID lookups of both Run 1 and Run 2, reconstructed the full history, attributed it to Agent A, even flagged garbage rows from an earlier debug session as "known artifacts." |
A brand-new agent identity with no prior history correctly surfaced another agent's work. The substrate is the store, not the agent — memory is portable across agent identities, not tied to any one of them.
The workflow that produced it
Use n8n as a visual orchestration layer, Anthropic Managed Agents as the runtime, bmfote as the shared memory substrate. No SDK, no deployment glue — just HTTP Request nodes.
Ten n8n nodes — HTTP Request + one IF + one Wait + one Code:
Manual Trigger
→ Create Anthropic session POST /v1/sessions
→ Send user message POST /v1/sessions/{id}/events
→ Stash session id (Set)
→ Wait 3s ←──┐
→ Poll events GET /v1/sessions/{id}/events
→ IF last == session.status_idle ├─ false → back to Wait
└─ true → Extract final answer (Code)
→ Create bmfote session POST /api/sessions
→ Persist user message POST /api/messages
→ Persist agent message POST /api/messages

Every request to Anthropic sends:
x-api-key: <ANTHROPIC_API_KEY>
anthropic-version: 2023-06-01
anthropic-beta: managed-agents-2026-04-01

Every request to bmfote sends Authorization: Bearer <BMFOTE_TOKEN>. The Create Session body references an agent_id you previously created with bmfote-agent create, plus the shared env_id and vault_id the CLI auto-provisions.
Why it matters
The agent invoked from n8n already has bmfote wired as an MCP server with always_allow permission (courtesy of bmfote-agent create). When the session runs, the agent calls search_memory, get_context, and remember mid-turn against bmfote — all orchestrated by Anthropic's infrastructure, not your machine. n8n never needs to know bmfote exists; it only sees the final answer. The persist-back nodes then write the Q/A pair into the same messages table Claude Code uses, so the next run — from any agent on any surface — can find it.
Prerequisites
- A bmfote server reachable from Anthropic's infrastructure (not localhost) — Anthropic's servers call the MCP endpoint, not your laptop.
- An agent_id from bmfote-agent create. The CLI reuses the bmfote-default vault and env for every agent.
- ANTHROPIC_API_KEY, BMFOTE_URL, BMFOTE_TOKEN exposed to n8n as credentials.
- n8n running anywhere — self-hosted, n8n Cloud, or Docker.
Quick Start
Prerequisites
- A running bmfote server (URL + API_TOKEN). Don't have one? See Host your own server — ~5 minutes.
- Claude Code installed on this machine.
Connect a machine to an existing bmfote deployment with one command:
npx bmfote setup --url https://your-bmfote-server --token <API_TOKEN>

Restart Claude Code. Context from previous sessions will automatically appear in new ones, and every new session will be saved back. The command is idempotent: safe to re-run, and you only need it once per machine.
This command:
- Registers an MCP server (bmfote-memory) that exposes 5 memory tools
- Installs hooks at ~/.claude/hooks/bmfote-*.sh for automatic session sync
- Writes ~/.claude/bmfote.env with the URL and token
- Merges hook entries into ~/.claude/settings.json
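For reference, the generated ~/.claude/bmfote.env holds just the two values you passed to setup; something like this (illustrative):

```
BMFOTE_URL=https://your-bmfote-server
BMFOTE_TOKEN=<API_TOKEN>
```

The hooks source this file, so rotating the token is a one-line edit with no re-install.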
Use from any agent SDK
bmfote isn't Claude-Code-only. If your agent runs on the Messages API, the Claude Agent SDK, or Anthropic Managed Agents, install the Python client and you get the same recall + write surface with no code specific to Claude Code.
pip install -e ./client
export BMFOTE_URL=https://your-bmfote-server
export BMFOTE_TOKEN=...

Anthropic Managed Agents — the hardest silo to bridge
Managed Agents don't expose client-side hooks, so the integration flips: the agent itself calls remember and search_memory as MCP tools against bmfote. bmfote ships a bmfote-agent CLI that handles the whole wiring — vault + credential, environment with allowed_hosts, agent config with mcp_servers + mcp_toolset + always_allow — in one command.
# Create a memory-only agent wired to bmfote (idempotent — reruns are no-ops)
bmfote-agent create \
--name "my agent" \
--system "You are a memory retrieval agent backed by bmfote."
# Run it with a prompt; returns the final agent response
bmfote-agent run <agent_id> "What did we decide about Acme last week?"
# Audit or retrofit an agent created elsewhere
bmfote-agent doctor <agent_id> --fix
bmfote-agent list

The CLI reads BMFOTE_URL, BMFOTE_TOKEN (from npx bmfote setup), and ANTHROPIC_API_KEY from your shell. Shared resources — a bmfote-default vault and bmfote-default-env environment — are discovered by name and created on first use, so there is no separate setup step.
All paths write into the same messages table as Claude Code sessions. See client/README.md for the full surface, failure semantics, and limitations.
Claude Agent SDK — no glue code
Plug the bmfote MCP server into options for reads, register hooks for writes, done:
from claude_agent_sdk import ClaudeAgentOptions, query
from bmfote_client import agent_sdk_hooks
options = ClaudeAgentOptions(
mcp_servers={
"bmfote": {
"type": "http",
"url": "https://your-bmfote-server/mcp/",
"headers": {"Authorization": f"Bearer {BMFOTE_TOKEN}"},
}
},
hooks=agent_sdk_hooks(project="ops-agent"),
)
async for msg in query(prompt="Continue yesterday's investigation", options=options):
    ...

The agent gets search_memory, find_error, get_context, get_recent, and remember as tools automatically, and every user prompt + tool call is recorded back.
Messages API — the full loop
Day 1 and Day 2 of an agent that continues its own research:
import anthropic
from bmfote_client import Client, record_exchange
bmfote = Client()
session = bmfote.session(project="research-agent")
# 1. Recall prior memory
prior = session.recall("competitor pricing research", limit=10)
# 2. Run the turn with that context in the system prompt
ac = anthropic.Anthropic()
user = "Continue from where we left off."
response = ac.messages.create(
model="claude-sonnet-4-5",
max_tokens=2048,
system=f"You are a research agent.\n\n{prior}",
messages=[{"role": "user", "content": user}],
)
# 3. Write the new turn back
record_exchange(session, user, response)
session.close()

Or let the agent choose when to recall by exposing bmfote as tools it can call mid-turn:
from bmfote_client import TOOL_SPECS, handle_tool_use
response = ac.messages.create(
model="claude-sonnet-4-5",
max_tokens=2048,
tools=TOOL_SPECS, # search_memory, find_error, get_context, get_recent, remember
messages=[{"role": "user", "content": "What did we decide about Acme last week?"}],
)
# Handle any tool_use blocks with handle_tool_use(block, client=bmfote)

For teams — the shape of what's next
bmfote is currently a single-user primitive with multi-user architecture. The workspace_id column landed recently; the surface area to use it has not. If you're running a Claude-centric team of 1–5 people who are comfortable with a self-hosted server, bmfote is deployable today. Beyond that, the gaps below are the roadmap.
Works today
- Multi-tenant row isolation at the database level (workspace_id on every message)
- Self-hosted deployment to any Docker host your team can reach
- Shared bearer token across all team members
- One store, N agents — different Claude Code machines, Managed Agents sessions, and Messages API scripts all see the same pool
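What row-level isolation means in practice can be sketched with plain SQLite. The table shape below is illustrative only, not bmfote's actual schema:

```python
import sqlite3

# Illustrative only: tenants share one database, but every read is
# scoped by workspace_id, so they never see each other's rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE messages (workspace_id TEXT, content TEXT)")
conn.executemany(
    "INSERT INTO messages VALUES (?, ?)",
    [("team-a", "alpha note"), ("team-b", "beta note")],
)
team_a_rows = conn.execute(
    "SELECT content FROM messages WHERE workspace_id = ?", ("team-a",)
).fetchall()  # team-b's rows never appear
```

The missing piece, per the roadmap below, is the surface area on top of this: per-user tokens and roles that decide which workspace_id a caller may query.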
Not shipped yet
- Team-invite flow / per-user bearer tokens with role-based access
- Web dashboard for non-technical users
- ChatGPT and Copilot adapters (Claude-speaking tools only today)
- Author attribution beyond session metadata
- Audit log / change history
If any of the gaps above would block your team, open an issue — the team direction is the explicit next phase and your use case will shape what ships first.
Host your own server
bmfote is self-hosted. You need a Turso database and any Docker-compatible host (Railway, Fly, Render, bare Docker). ~5 minutes end-to-end.
Step 0 — Install the Turso CLI
brew install tursodatabase/tap/turso
turso auth login

(Non-macOS install instructions: https://docs.turso.tech/cli/installation)
Step 1 — Clone the repo
git clone https://github.com/bmfote/bmfote && cd bmfote

Keep this shell open. Every command below runs from inside this directory.
Step 2 — Create a Turso database
turso db create bmfote-memory
turso db show bmfote-memory --url # -> libsql://...
turso db tokens create bmfote-memory --expiration none

Save the URL and token. You'll pass them to the server as environment variables.
Step 3 — Apply the schema and generate an API token
turso db shell bmfote-memory < engine/schema.sql
openssl rand -hex 32 # save this — every client needs it

Step 4 — Deploy the server
The server is a single Dockerfile. Pick your provider.
All commands below must be run from inside the cloned bmfote directory (same shell as Step 1). Your provider CLI needs to see the Dockerfile.
railway init
railway service # link or create a service
railway variables --set TURSO_DATABASE_URL=libsql://...
railway variables --set TURSO_AUTH_TOKEN=...
railway variables --set API_TOKEN=...
railway up
railway domain # your public URL

Railway distinguishes projects from services. railway init creates a project but does not always link a service. If later commands complain No service linked or No services found, run railway service and pick or create one, then re-run the failing command.
fly launch --no-deploy
fly secrets set TURSO_DATABASE_URL=libsql://... \
TURSO_AUTH_TOKEN=... \
API_TOKEN=...
fly deploy

docker build -t bmfote .
docker run -d -p 8000:8000 \
-e TURSO_DATABASE_URL=libsql://... \
-e TURSO_AUTH_TOKEN=... \
-e API_TOKEN=... \
bmfote

Required environment variables
| Var | Required | Purpose |
|---|---|---|
| TURSO_DATABASE_URL | yes | libsql://... from turso db show |
| TURSO_AUTH_TOKEN | yes | from turso db tokens create |
| API_TOKEN | yes | shared secret clients send as Authorization: Bearer |
| PORT | no | defaults to 8000; providers set this automatically |
The server fails closed: it refuses to start without API_TOKEN in cloud mode.
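The fail-closed behavior amounts to a startup guard along these lines (a sketch, not the actual server code):

```python
import os

def require_api_token(env=os.environ) -> str:
    """Illustrative sketch of the fail-closed check: in cloud mode the
    server refuses to start without a bearer secret configured, rather
    than silently serving unauthenticated requests."""
    token = env.get("API_TOKEN", "")
    if not token:
        raise RuntimeError("API_TOKEN is not set; refusing to start in cloud mode")
    return token
```

Failing at startup beats failing at request time: a misconfigured deployment is loudly broken instead of quietly open.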
Verify
curl https://your-domain/health
curl -H "Authorization: Bearer $API_TOKEN" https://your-domain/api/stats

Troubleshooting
- railway up says Dockerfile not found or no build context — you're not inside the cloned bmfote directory. cd into it and retry.
- railway up / railway variables says No service linked or No services found — run railway service, pick or create a service, then re-run the failing command.
- fly launch offers to generate a Dockerfile — decline. The repo already ships one; make sure you ran fly launch from inside bmfote/.
- turso db shell errors on engine/schema.sql: No such file — you're not in the repo root. cd bmfote and retry.
- curl /health returns connection refused — the container failed to start. Check provider logs; the most common cause is a missing API_TOKEN, which makes the server fail closed.
- curl /api/stats returns 401 — the API_TOKEN on the server does not match the token in your Authorization: Bearer ... header.
- /api/stats returns zeros or empty — schema was not applied. Re-run turso db shell bmfote-memory < engine/schema.sql from the repo root.
Local development
source .venv/bin/activate # Python 3.12
python -m engine.server # starts on PORT from .env (default 8026)

Local dev uses an embedded libSQL replica at engine/local-replica.db that syncs to your Turso database. Auth is optional locally (no API_TOKEN required).
License
bmfote uses a split license:
- Server, hooks, installer, and CLI — GNU AGPL-3.0. If you modify bmfote and run it as a network service, AGPL-3.0 requires you to make your modified source available to your users.
- Python client library (client/) — MIT. Free to embed in proprietary agent code with no copyleft obligations.
The server is AGPL so commercial re-hosters can't take bmfote, add private features, and compete as a closed SaaS. The client is MIT so you can drop it into any agent codebase — proprietary or not — without license friction.
Built with FastMCP | Powered by Turso (libSQL) | AGPL-3.0 + MIT (split)
