@possumtech/rummy

v2.3.3

Published

18 days ago

Relational Unknowns Memory Management Yoke

0High
0Medium
0Low

possumtechcom

llm

RUMMY: The General-Purpose Agent Kernel

Rummy is a headless, metacognitive relational architecture for LLM agents. It is designed to be integrated into real-world workflows—from IDEs and CLI tools to autonomous research pipelines—where project state is complex and accuracy is non-negotiable.

While traditional agents "thrash" and fail under the weight of linear chat history, Rummy treats the LLM as a program executing on a managed memory substrate. This "Virtual Memory" architecture ensures that Rummy remains reliable in sessions that span hundreds of turns and tens of thousands of files.

The Architecture: Virtual Memory for Tokens

Rummy provides the memory hierarchy necessary to maintain high-fidelity reasoning over unlimited-turn sessions. This is not a benchmarking "harness," but a production-grade Operating System for AI agency:

L1 Cache (visible): High-fidelity, character-perfect context. This is the active "Working Set" the model is reasoning with right now.
RAM (summarized): Folksonomic metadata and searchable indices. This allows the model to know what information exists and how to address it without consuming the L1 token budget.
The Disk (archived): Persistent SQLite storage. A relational substrate where every historical finding, raw source document, and prior tool result is safely indexed and searchable, ready to be "paged" back into Cache on demand.

Key Features

Headless & RPC-First

Rummy is a headless service. It exposes a JSON-RPC over WebSocket interface, allowing it to be embedded into any client (e.g., rummy.nvim). The server manages the project state and the "Kernel" loop, while the client drives the UI and handles local proposal resolution.

Extensible Plugin Architecture

Rummy is built for integration. Every <tag> the model sees is a plugin. Every URI scheme (known://, unknown://, sh://) is registered by its owner. Developers can drop custom logic into src/plugins/ to add new tools, filters, or event hooks. See PLUGINS.md for details.

The Six Primitives

Every operation in Rummy reduces to one of six verbs over a single entry contract: set / get / rm / mv / cp / update. Tools (<sh>, <search>, <known>, <unknown>, …) are plugins that compose these primitives. Three actor surfaces — model XML tags, plugin RummyContext methods, JSON-RPC client calls — speak the same grammar at the store layer.

The Model Owns Its Context

Visibility (visible / summarized / archived) is the model's exclusive lever. The engine never silently mutates an entry's visibility behind the model's back; the only enforcements that touch visibility (Turn Demotion at budget overflow, Prompt Demotion at context-exceeded) surface through error:// so the model sees the trigger. No chat-waterfall horizon, no auto-prune — the model controls what it sees and what it doesn't.

Apophatic Reasoning (The Rumsfeld Loop)

Rummy turns "Not Knowing" into a formal state to be processed. By mapping Unknowns (unknown://) into verified Knowns (known://), Rummy provides a transparent, auditable trail of how the agent arrived at its conclusion.

Installation

Rummy loads configuration from exactly one directory per invocation:

The current working directory (if it contains .env.example).
Otherwise, ${RUMMY_HOME} (default ~/.rummy).

# Set your RUMMY_HOME
export RUMMY_HOME=~/.rummy

# Install globally
npm i -g @possumtech/rummy

# Configure your environment
$EDITOR ~/.rummy/.env.example   # set model aliases and keys
rummy

Usage

Start the service and connect your preferred client. The server defaults to port 3044.

Official Client: rummy.nvim (Neovim interface)
In-process CLI: rummy-cli (one-shot ask/act invocations against a project; see src/plugins/cli/)
Diagnostic Suite: test/tbench/ and test/programbench/ (autonomous diagnostic and benchmarking harnesses)

Documentation

| Document | Contents | |----------|----------| | SPEC.md | Technical Specification: K/V store, packet structure, dispatch path, and lifecycle contracts. | | PLUGINS.md | Extensibility: Hook registry, event filtering, and custom scheme registration. | | src/plugins/ | Plugin Reference: Internal documentation for each scheme and toolset. | | AGENTS.md | Project roadmap, planning history, and architectural lessons. |

Rummy: The Managed Operating System for AI Agency.