npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

squish-memory

v1.6.0

Published

Local-first memory runtime for Claude Code, Codex, ChatGPT, and AI agents. Searchable session history, durable memory, and cross-tool sync. No external LLM required.

Downloads

1,415

Readme

Squish -- Memory Runtime for AI Agents

npm version License: MIT npm downloads CI

4.5k+ npm downloads. Local-first. No external LLM required.

Squish is a local-first memory runtime for AI coding agents. It gives every AI agent persistent memory using local embeddings with zero LLM dependency. If you use Claude Code, Cursor, Codex, Copilot, Gemini CLI, or any MCP-compatible tool and want your agents to remember decisions, constraints, and context across sessions -- Squish is the MCP memory server that works locally and optionally syncs to the cloud.

npm install -g squish-memory && squish install --all

Squish provides AI agent memory that persists between sessions, across agents, and across machines. It is a local-first MCP server with built-in embeddings, a knowledge graph, and hybrid retrieval -- no external database or API key required. Use it for free locally, or enable Squish Cloud for cross-device sync.


The Problem: Agents Forget Everything

Every AI coding agent starts from zero when a new session begins. The architecture decision from last week, the config you spent an hour debugging, the preference you mentioned yesterday -- gone.

Built-in memory files like CLAUDE.md and .cursorrules help, but they have hard limits. They cap out around 200 lines, require manual curation, and do not work across agents. You end up copy-pasting the same context into every tool.

Squish gives you persistent memory for coding agents that scales without limits. No manual maintenance. No token waste. No agent lock-in.

Three Layers of Memory

| Layer | What It Does | Command | |-------|-------------|---------| | Recall | Durable memory -- decisions, preferences, constraints that persist across sessions | squish recall | | Sessions | Searchable history -- past agent runs you can inspect for evidence and context | squish sessions search | | Remember | Write to long-term memory -- store new facts, decisions, observations | squish remember |

Token Cost Comparison

| Method | Token Usage | Cost per Session | Cross-Agent | Auto-Capture | |--------|-------------|------------------|-------------|--------------| | Paste full context | ~2,000 tokens | $0.06 - $0.12 | No | No | | LLM-summarized context | ~500 tokens | $0.02 - $0.05 | No | No | | CLAUDE.md / .cursorrules | ~200 lines max | Free | No | No | | Squish (local) | ~50-200 tokens | $0.00 | Yes | Yes | | Squish (Cloud) | ~50-200 tokens | $0.00 | Yes | Yes |

Squish retrieves only the relevant memories for the current task. The average context injection is 50-200 tokens -- a fraction of what you would paste manually.


Quick Start

Step 1: Install

npm install -g squish-memory && squish install --all

This installs the Squish CLI, MCP server, and plugin hooks for all detected agents.

Step 2: Work

Start your coding agent as usual. Squish runs in the background, auto-capturing decisions, constraints, preferences, and context.

squish remember "We chose PostgreSQL for team mode" --type decision
squish recall "project decisions"

Step 3: Search Past Sessions

After a few sessions, search your agent history:

squish sessions search "postgres migration"
squish sessions related --repo-path .

Step 4: Restart

Close your session and open a new one. Your agent picks up where you left off -- all context is restored automatically.

squish context    # See what your agent remembers
squish stats      # Check memory health

Works locally free. Optional cloud sync available at squishplugin.dev.


Works with Every Agent

Squish works with any AI coding agent that supports MCP (Model Context Protocol) or HTTP connections. One memory server, shared across all of them.

| Agent | Integration Method | Notes | |-------|-------------------|-------| | Claude Code | MCP server + plugin | Auto-captures via hooks | | Codex CLI | MCP server | OpenAI's CLI agent | | GitHub Copilot CLI | MCP server | VS Code integration | | Cursor | MCP server | Editor + agent | | Gemini CLI | MCP server | Google's CLI agent | | OpenCode | MCP server + hooks | Auto-capture + MCP tools | | Cline | MCP server | VS Code extension | | Goose | MCP server | Block's agent | | Kilo Code | MCP server | VS Code extension | | Windsurf | MCP server | Codeium's editor | | Roo Code | MCP server | VS Code extension | | Claude Desktop | MCP server | Desktop app | | Aider | MCP server | Terminal pair programmer | | ChatGPT | MCP server (via Squish Cloud) | Cloud sync required | | VS Code (Copilot) | MCP server | Via MCP extension |

Works with any agent that speaks MCP or HTTP. One server, memories shared across all of them.

MCP Server Configuration

Add Squish to any MCP-compatible client:

{
  "mcpServers": {
    "squish": {
      "command": "squish-mcp",
      "args": ["--http", "--port", "8767"],
      "env": {
        "SQUISH_DB_PATH": "./squish-data"
      }
    }
  }
}

For cloud-connected agents:

{
  "mcpServers": {
    "squish-cloud": {
      "type": "url",
      "url": "https://api.squishplugin.dev/mcp",
      "headers": {
        "Authorization": "Bearer <your-token>"
      }
    }
  }
}

Why Squish

Most memory tools need a second LLM for embeddings and retrieval. That means extra API costs, latency, and infrastructure you have to manage.

Squish uses local embeddings by default. Zero LLM dependency. 1-5ms latency. $0 runtime cost in local mode.

Comparison

| Feature | Squish | Built-in (CLAUDE.md) | agentmemory | mem0 | |---------|--------|----------------------|-------------|------| | Auto-capture | Yes (hooks) | Manual | Yes (12 hooks) | Manual API | | Local embeddings | Yes (default) | N/A | Yes | No (cloud) | | External DB required | No (SQLite) | No | Yes (iii-engine) | Yes (Qdrant) | | MCP tools | 15 | 0 | 53 | 9 | | Knowledge graph | Yes | No | Yes | No | | Cross-agent sync | Yes (Cloud) | No | No | API-based | | Price | Free local / $9/mo cloud | Free | Free | $249/mo Pro | | Setup time | 30 seconds | 5 minutes | 15 minutes | 30 minutes | | Data ownership | Full (local SQLite) | Git repo | External DB | Cloud vendor |


Features

Memory Intelligence

  • Auto-captures decisions, constraints, and preferences as you work
  • Restores relevant context when an agent restarts
  • Handles contradictions and temporal facts with expiration
  • Graph-boosted retrieval connects related memories across sessions
  • Contradiction detection flags conflicting information
  • Temporal reasoning tracks when facts were true vs. now
  • Confidence scoring adjusts memory relevance over time
  • Decay system automatically ages low-value memories

Session Search

  • Search previous Claude Code, Codex, and OpenCode sessions
  • Find related sessions by project path or file overlap
  • Inspect past decisions, errors, and commands as evidence
  • Separate from long-term memory -- raw session history, not distilled facts

Interfaces

  • CLI: squish remember, recall, inspect, context, stats, search, sessions
  • MCP Server: 15 tools for any MCP client -- recall, health, graph, recency, maintenance
  • Web UI: Local dashboard at localhost:37777 for visualizing memories
  • Cloud Dashboard: Analytics and management at squishplugin.dev

Storage

  • SQLite (local, default) or PostgreSQL (team mode)
  • Hybrid retrieval: keyword + semantic similarity with RRF fusion
  • AES-256-GCM encryption for sensitive memories
  • Places routing: organize memories by project, feature, or context
  • Full-text search with BM25 ranking
  • Vector search with TF-IDF embeddings (768-dimensional)

Memory Pipeline

Squish uses a 4-stage pipeline to process memories:

  1. Capture -- Filters noisy tool output, promotes what matters (decisions, constraints, preferences)
  2. Filter -- Deduplicates, resolves contradictions, scores importance
  3. Store -- Persists to SQLite/PostgreSQL with graph relationships and embeddings
  4. Retrieve -- Hybrid search combines keyword, semantic, recency, and importance scoring

Architecture

Agent Action
    |
    v
[1. Capture] -----> Filter noisy output
    |               Promote decisions, constraints, preferences
    v
[2. Store] -------> SQLite / PostgreSQL
    |               Embeddings (local TF-IDF)
    |               Knowledge graph edges
    |               Places routing
    v
[3. Retrieve] ----> Keyword search (BM25)
    |               Semantic search (cosine similarity)
    |               Recency weighting
    |               RRF fusion scoring
    v
[4. Context] -----> Inject relevant memories into agent context
                    50-200 tokens average
                    Auto-decay old/low-value memories

Three-Layer Memory Model

+------------------+     +------------------+     +------------------+
|   RECALL         |     |   SESSIONS       |     |   REMEMBER       |
|   (durable)      |     |   (evidence)     |     |   (write)        |
|                  |     |                  |     |                  |
|  Decisions       |     |  Past agent runs |     |  Store new facts |
|  Preferences     |     |  Searchable      |     |  Auto-classify   |
|  Constraints     |     |  Raw history     |     |  Graph update    |
|  Beliefs         |     |  Related repos   |     |  Place routing   |
+------------------+     +------------------+     +------------------+
        |                        |                        |
        v                        v                        v
   squish recall          squish sessions          squish remember
   squish_recall          search/show/list         squish_remember

Storage Layer

SQLite (default)               PostgreSQL (team mode)
    |                               |
    v                               v
+------------------+          +------------------+
| memories         |          | memories         |
| associations     |          | associations     |
| embeddings       |          | embeddings       |
| graph_edges      |          | graph_edges      |
| places           |          | places           |
| sessions         |          | sessions         |
+------------------+          +------------------+
| AES-256-GCM     |          | AES-256-GCM     |
| encryption       |          | encryption       |
+------------------+          +------------------+

Squish Cloud

Persistent memory across ChatGPT, Claude Desktop, Claude Code, and local agents. One account, synchronized everywhere.

  ChatGPT          Claude Desktop     Claude Code       Local Agents
 [OAuth 2.1]       [OAuth 2.1]     [Streamable HTTP]  [MCP / CLI]
      +-------------------+---------------+------------------+
                          |
                  Squish Cloud API
                          |
                 [PostgreSQL + Encrypted Storage]
                          |
                  Admin Dashboard & Analytics

Cloud features: OAuth 2.1 + PKCE login, cross-platform sync, team workspaces, admin dashboard, priority support.

Pricing

| Tier | Price | Storage | Users | Features | |------|-------|---------|-------|----------| | Local | Free | Local SQLite | 1 | Full memory, CLI, MCP, Web UI | | Cloud Solo | $9/mo | 50 MB synced | 1 | Cloud sync, OAuth, dashboard | | Cloud Pro | $29/mo | 250 MB synced | 1 | Pro features, priority support | | Team | $99/mo | 1 GB shared | Up to 10 | Shared workspaces, admin | | Founder Pass | $99/yr | Pro features | 1 | Launch-only annual pricing |

Sign up at squishplugin.dev -- 30 seconds, no credit card needed.

Founder Pass is a launch-only offer. $99/year instead of $348/year (Pro monthly).


Installation Guides

  • Claude Code -- MCP server + plugin hooks for auto-capture
  • OpenCode -- MCP server + hooks for OpenCode agent
  • OpenClaw -- MCP server setup for OpenClaw

Quick install for all detected agents:

npm install -g squish-memory && squish install --all

Squish auto-detects which agents you have installed and configures hooks for each one.


Benchmarks

Squish is tested against real-world memory retrieval tasks and synthetic benchmarks.

| Metric | Result | Notes | |--------|--------|-------| | Core Tests | 9/9 passed (100%) | All memory operations | | LoCoMo Memory | 65% | 100 REAL questions from locomo10.json | | Throughput | 39 ops/sec | With local embeddings | | Total Time | 230ms | For 9 core tests | | Package Size | 283 KB | Lightweight footprint | | Latency (embed) | 6.6ms | Local TF-IDF embeddings | | Latency (search) | 6.1ms | Hybrid retrieval |

Full benchmark details: docs/BENCHMARK.md


Documentation

| Document | Description | |----------|-------------| | CLI Reference | All CLI commands and options | | MCP Server | 15 MCP tools and configuration | | Architecture | System design and data flow | | Decay System | How memories age and lose relevance | | Scoring | Importance and relevance scoring | | Environment Config | Environment variables and settings | | Plugin Architecture | Hook system and agent integration | | Quick Start | Getting started guide | | Agent Comparison | Squish vs other memory tools | | Contributing | How to contribute | | Release Notes | Changelog and version history |


FAQ

What is Squish?

Squish is a local-first memory runtime for AI coding agents. It gives your agents persistent memory using local embeddings with zero LLM dependency. Think of it as a brain that persists between sessions -- your agents remember decisions, constraints, preferences, and context without you having to re-explain everything. In v1.6.0, Squish also searches past agent sessions as evidence, so agents can inspect prior work instead of starting from zero.

Does Squish require an API key?

No. Squish works locally by default with zero API keys. It uses local embeddings (TF-IDF) and SQLite storage. An API key is only needed if you want to use Squish Cloud for cross-device sync.

How does Squish compare to mem0 or agentmemory?

Squish is the only option that works locally with zero external dependencies. mem0 requires Qdrant (a vector database) and cloud API calls. agentmemory requires iii-engine. Squish uses SQLite and local embeddings by default. See the full comparison in the Why Squish section above.

Can I use Squish with multiple AI agents?

Yes. Squish works with any MCP-compatible agent. One memory server is shared across Claude Code, Cursor, Codex, Copilot, Gemini CLI, and any other agent that supports MCP. Memories are available to all connected agents.

Is my data private with Squish?

Yes. In local mode, all data stays on your machine in an encrypted SQLite database. Nothing is sent to any cloud service. AES-256-GCM encryption protects sensitive memories. In cloud mode, data is encrypted in transit and at rest.

What databases does Squish support?

Squish supports SQLite (default, local) and PostgreSQL (team mode). SQLite requires zero configuration. PostgreSQL is used for team workspaces and shared memory across multiple users.

What is the difference between recall and sessions?

squish recall searches your long-term memory -- distilled facts, decisions, and preferences that Squish has captured and organized. squish sessions search searches raw past agent runs -- the actual messages, commands, and file changes from previous Claude Code, Codex, or OpenCode sessions. Recall gives you what the system decided to remember. Sessions give you the evidence.


Contributing

See docs/CONTRIBUTING.md for guidelines on how to contribute to Squish.


License

MIT -- see LICENSE for details.