nimble-mcp
v0.7.0
Published
A token-efficient MCP server that lazy loads tools and proxies tool calls
Downloads
247
Readme
nimble
A token-efficient MCP server that lazy loads tools and proxies tool calls
- comes with a local dashboard to connect and configure MCP servers
- proxies tool calls with 3 simple top-level tools:
list-tools,get-tool,execute-tool - supports tool summarization by LLM, you may also customize your own tool summaries
How it works
MCP clients naively include all tool descriptions and schemas on the context window. This results in excessive token consumption even when you only use a few tools. Multiply this over several MCP servers and your chat session becomes bloated. It's not only costly but also unsable as you'll quickly run into the model's token limit.
nimble solves the problem by allowing tools to be lazy loaded only when they're needed. The idea is connect to a unified MCP server with concise tool summaries and let LLM discover and expand full tool description when needed.
When tested with popular MCP servers (Notion, Linear, Figma, etc.), we see over 99% token savings on initial load and 90% during typical chat sessions.
Installation
nimble runs over stdio. Configure your MCP client to launch it:
{
"mcpServers": {
"nimble-mcp": {
"command": "npx",
"args": ["-y", "nimble-mcp"],
"env": {
"NIMBLE_ENCRYPTION_KEY": "your-encryption-key",
"NIMBLE_UI_PORT": "3333"
}
}
}
}The NIMBLE_UI_PORT determines the port for the local config server (default: http://localhost:3333).
The NIMBLE_ENCRYPTION_KEY is used to encrypt server credentials (access & refresh tokens), which are stored in local sqlite db.
Add OpenAI env vars here if you want LLM summaries to be automatically inferred when connecting a server.
Example:
{
"mcpServers": {
"nimble-mcp": {
"command": "npx",
"args": ["-y", "nimble-mcp"],
"env": {
"NIMBLE_ENCRYPTION_KEY": "your-encryption-key",
"OPENAI_API_KEY": "sk-...",
"OPENAI_MODEL": "gpt-5-mini"
}
}
}
}Tools
list-toolsprovides a list of available tools and a brief summary of each toolget-toolretrieves a given tool's detailed description and schemaexecute-toolperforms the tool call that proxies the input argument to the original tool server
Quick guide
Once configuration in your MCP client complete, open the config dashboard (http://localhost:3333/) in the browser to setup MCP connections.
Add a server and authenticate
If you provided an OPENAI_API_KEY, the summaries will be automatically inferred by LLM (OpenAI for now). Otherwise, the first sentence from the description will be used. You may also customize this by clicking on the tool and modify the summary from the tool modal
You can also toggle tools on/off
Test out the server from the included MCP client
Repeat the process to add more MCP servers.
Development
Create a .env file in the repo root (see .env.example) to manage env vars locally.
npm run devScripts
Server:
npm run build
npm run dev
npm testUI:
npm run ui:build
npm run ui:dev
npm run ui:previewConfig UI
The server also hosts a local config UI on http://127.0.0.1:3333.
The UI reads and writes the SQLite DB.
Build UI once:
npm run ui:buildOr build UI + server:
npm run buildRun UI dev server:
npm run ui:devLLM Summaries
Optional: auto-generate tool summaries on connect using OpenAI.
OPENAI_API_KEY=sk-... \
OPENAI_MODEL=gpt-5-mini \
npm run devStorage
nimble stores configuration and tool cache in a local SQLite database.
Default DB path:
./nimble.sqliteOverride with:
NIMBLE_DB_PATH=/path/to/nimble.sqliteOAuth flow by URL/transport (manual):
NIMBLE_ENCRYPTION_KEY=your-encryption-key node dist/index.js --server-url https://mcp.notion.com/mcp --transport streamableHttpIf the server does not exist yet, this will auto-add a default OAuth entry and use a local callback at http://127.0.0.1:8787/callback.
Publish
npm login
npm publish --access public