viberag
v0.3.3
Local code RAG for AI coding assistants - semantic search via MCP server

VIBERAG MCP Server
Free, Open Source, Local / Offline Capable, Container-Free Semantic Search For Your Codebase
VibeRAG is a fully local, offline-capable MCP server for codebase search.
- Semantic codebase search
- Keyword codebase search (BM25)
- Hybrid codebase search (with tunable parameters)
VibeRAG automatically indexes your codebase into a local, container-free vector database (LanceDB). Every time you make a change, the index is updated automatically.
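Conceptually, incremental indexing boils down to comparing a snapshot of content hashes against the previous pass and re-embedding only what changed. The sketch below is illustrative only (the names and in-memory maps are hypothetical, not VibeRAG's internal API):

```typescript
// Conceptual sketch of incremental indexing: only files whose content
// hash changed since the last pass are re-embedded. All names here are
// illustrative, not VibeRAG internals.
type FileSnapshot = Map<string, string>; // path -> content hash

function filesToReindex(previous: FileSnapshot, current: FileSnapshot): string[] {
  const changed: string[] = [];
  for (const [path, hash] of current) {
    if (previous.get(path) !== hash) changed.push(path); // new or modified file
  }
  return changed;
}

const prev: FileSnapshot = new Map([["auth.ts", "h1"], ["db.ts", "h2"]]);
const curr: FileSnapshot = new Map([["auth.ts", "h1"], ["db.ts", "h3"], ["api.ts", "h4"]]);
console.log(filesToReindex(prev, curr)); // only db.ts and api.ts need re-embedding
```

Deleted files would also be pruned from the index in a real implementation; the point is that unchanged files never pay the embedding cost twice.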
Install
```bash
npm install -g viberag
```
Quick Start
```bash
# Initialize in your project
cd your-project
viberag

# Run the initialization wizard to configure embeddings, run initial
# indexing, and automatically configure MCP server integration.
/init

# In addition to allowing agents to search via the MCP server,
# you can search yourself via the CLI.
/search authentication handler
```
Example
When using a coding agent like Claude Code, add `use viberag` to your prompt.
```
> How is authentication handled in this repo? use viberag
```
Tip: include "use viberag" in your prompt to ensure your agent uses viberag's codebase search features. Most agents will select MCP tools as appropriate, but sometimes they need a little help with explicit prompting.
Features
- CLI-based setup - CLI commands and wizards for setup, editor integration, and configuration
- Semantic code search - Find code by meaning, not just keywords
- Flexible embeddings - Local model (offline, free) or cloud providers (Gemini, Mistral, OpenAI)
- MCP server - Works with Claude Code, Cursor, VS Code Copilot, and more
- Automatic incremental indexing - Watches for file changes and reindexes only what has changed, in real time
- Multi-language support - TypeScript, JavaScript, Python, Go, Rust, and more
How It Works
Your coding agent would normally fall back on Search / Grep / Find and guess at relevant search terms. VibeRAG indexes the codebase into a local vector database (LanceDB) and uses semantic search to find relevant code snippets even when the search terms are not exact.
When searching for "authentication", VibeRAG finds related snippets such as "login", "logout", and "register", as well as function and class names like AuthDependency and APIKeyCache.
This gives comprehensive coverage of your codebase, so you don't miss files and features relevant to your change or refactor.
Great for Monorepos
Semantic search is especially useful in monorepos, where you may be trying to understand how different parts of the codebase interact. VibeRAG can find all the pieces with fewer searches, fewer tokens, and less time spent searching.
Embedding Models
You can use a locally run embedding model (Qwen3-Embedding-0.6B) so that nothing leaves your machine.
SOTA API-based embeddings from Gemini, OpenAI, and Mistral are also supported.
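Whichever provider you pick, search works the same way: the query is embedded into a vector and compared against the stored chunk vectors. A minimal sketch of the cosine-similarity comparison that underlies semantic search (toy 3-dimensional vectors for illustration; real embeddings have hundreds to thousands of dimensions):

```typescript
// Cosine similarity between two embedding vectors: close to 1.0 means
// semantically similar, close to 0.0 means unrelated. Toy vectors only.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

const query = [0.9, 0.1, 0.2];      // embedding of "authentication"
const loginChunk = [0.8, 0.2, 0.1]; // a login handler, semantically close
const cssChunk = [0.1, 0.9, 0.3];   // a stylesheet, unrelated
console.log(cosineSimilarity(query, loginChunk) > cosineSimilarity(query, cssChunk)); // true
```

This is why a search for "authentication" can surface a login handler that never contains the literal word.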
MCP Server
VibeRAG includes an MCP server that integrates with AI coding tools.
IDE / Agent Setup Wizard
Run /mcp-setup in the VibeRAG CLI for interactive setup. The wizard attempts to automatically configure your coding agents and editors with viberag's MCP server settings.
```bash
# Start viberag
$ viberag

# Run the setup wizard (after having initialized with /init)
$ /mcp-setup
```
The automatic configuration wizard:
```
╭───────────────────────────────────────────────────────────────╮
│ MCP Setup Wizard                                              │
│                                                               │
│ Select AI coding tool(s) to configure:                        │
│ (Space to toggle, Enter to confirm)                           │
│                                                               │
│ > [x] Claude Code (auto-setup)                                │
│   [ ] Cursor (auto-setup)                                     │
│   [ ] Gemini CLI (global config)                              │
│   [ ] JetBrains IDEs (manual setup)                           │
│   [ ] OpenAI Codex (global config)                            │
│   [ ] OpenCode (global config)                                │
│   [ ] Roo Code (auto-setup)                                   │
│   [ ] VS Code Copilot (auto-setup)                            │
│   [ ] Windsurf (global config)                                │
│   [ ] Zed (global config)                                     │
│                                                               │
│ 1 selected | ↑/↓ move, Space toggle, Enter confirm, Esc cancel│
╰───────────────────────────────────────────────────────────────╯
```
The wizard can auto-configure project-level configs and merge into global configs.
Manual Setup Instructions
The following sections describe manual MCP server setup configurations for various editors and agents.
Claude Code
CLI Command:
```bash
claude mcp add viberag -- npx viberag-mcp
```
Global Config: `~/.claude.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Project Config: `.mcp.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Verify: Run /mcp in Claude Code and look for "viberag: connected".
Cursor
Global Config: `~/.cursor/mcp.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Project Config: `.cursor/mcp.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Verify: Settings → Cursor Settings → MCP, verify "viberag" shows with the toggle enabled.
Gemini CLI
CLI Command:
```bash
gemini mcp add viberag -- npx viberag-mcp
```
Global Config: `~/.gemini/settings.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Project Config: `.gemini/settings.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Verify: Run /mcp in Gemini CLI and look for "viberag" in the server list.
JetBrains IDEs
UI Setup:
- Open Settings → Tools → AI Assistant → MCP
- Click "Add Server"
- Set name: `viberag`
- Set command: `npx`
- Set args: `viberag-mcp`
Verify: Settings → Tools → AI Assistant → MCP, verify "viberag" shows green in the Status column.
OpenAI Codex
CLI Command:
```bash
codex mcp add viberag -- npx -y viberag-mcp
```
Global Config: `~/.codex/config.toml`
```toml
[mcp_servers.viberag]
command = "npx"
args = ["-y", "viberag-mcp"]
```
Note: The `-y` flag is required for npx to auto-confirm package installation.
Verify: Run /mcp in the Codex TUI and look for "viberag" in the server list.
OpenCode
Global Config: `~/.config/opencode/opencode.json` (Linux/macOS) or `%APPDATA%/opencode/opencode.json` (Windows)
```json
{
  "mcp": {
    "viberag": {
      "type": "local",
      "command": ["npx", "-y", "viberag-mcp"]
    }
  }
}
```
Project Config: `opencode.json`
```json
{
  "mcp": {
    "viberag": {
      "type": "local",
      "command": ["npx", "-y", "viberag-mcp"]
    }
  }
}
```
Note: OpenCode uses the `"mcp"` key and requires `"type": "local"` with the command as an array.
Verify: Check the MCP servers list in OpenCode, verify "viberag" appears and is enabled.
Roo Code
Global Config: UI only: click the MCP icon in the Roo Code pane header → Edit Global MCP
Project Config: `.roo/mcp.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Verify: Click the MCP icon in the Roo Code pane header, verify "viberag" appears in the server list.
VS Code Copilot
Global Config: Add to User `settings.json` under `mcp.servers`:
```json
{
  "mcp": {
    "servers": {
      "viberag": {
        "command": "npx",
        "args": ["viberag-mcp"]
      }
    }
  }
}
```
Project Config: `.vscode/mcp.json`
```json
{
  "servers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Note: VS Code uses `"servers"` instead of `"mcpServers"`.
Required: Enable Agent Mode in VS Code settings:
- Settings → search `chat.agent.enabled` → check the box, OR
- Add `"chat.agent.enabled": true` to your User `settings.json`
Verify: Cmd/Ctrl+Shift+P → "MCP: List Servers", verify "viberag" appears.
Windsurf
Global Config: `~/.codeium/windsurf/mcp_config.json`
```json
{
  "mcpServers": {
    "viberag": {
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Verify: Click the Plugins icon in the Cascade panel, verify "viberag" shows in the plugin list.
Zed
Global Config: `~/.config/zed/settings.json`
```json
{
  "context_servers": {
    "viberag": {
      "source": "custom",
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Project Config: `.zed/settings.json`
```json
{
  "context_servers": {
    "viberag": {
      "source": "custom",
      "command": "npx",
      "args": ["viberag-mcp"]
    }
  }
}
```
Note: Zed uses `"context_servers"` instead of `"mcpServers"` and requires `"source": "custom"` for non-extension servers.
Verify: Open the Agent Panel settings, verify "viberag" shows a green indicator.
Exposed MCP Tools
| Tool | Description |
| -------------------------- | ---------------------------------------------------- |
| codebase_search | Semantic, keyword, or hybrid search for code |
| codebase_parallel_search | Run multiple search strategies in parallel |
| viberag_index | Index the codebase (incremental by default) |
| viberag_status | Get index status, file count, and embedding provider |
| viberag_watch_status | Get file watcher status for auto-indexing |
codebase_search
The primary search tool. Finds code by meaning, not just keywords.
Search Modes:
| Mode | Best For |
| ------------ | -------------------------------------------- |
| hybrid | Most queries (default) - combines both |
| semantic | Conceptual queries ("how does auth work?") |
| exact | Symbol names, specific strings |
| definition | Direct symbol lookup ("where is X defined?") |
| similar | Find code similar to a snippet |
Key Parameters:
- `query` - Natural language search query
- `mode` - Search mode (default: `hybrid`)
- `limit` - Max results (default: 10, max: 100)
- `bm25_weight` - Balance keyword vs semantic (0-1, default: 0.3)
- `filters` - Path, type, and metadata filters
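The `bm25_weight` parameter can be pictured as a weighted blend of the two rankings' normalized scores. The sketch below is an illustrative model of the parameter's intent, not VibeRAG's actual fusion internals:

```typescript
// Illustrative hybrid scoring: blend a normalized BM25 keyword score
// with a semantic similarity score. bm25_weight = 0.3 means 30% keyword,
// 70% semantic. This models the parameter's intent, not VibeRAG internals.
function hybridScore(semantic: number, bm25: number, bm25Weight = 0.3): number {
  return (1 - bm25Weight) * semantic + bm25Weight * bm25;
}

// A semantically strong chunk with no exact keyword hit still outranks
// a chunk that matches keywords exactly but is semantically weak:
console.log(hybridScore(0.8, 0.0) > hybridScore(0.2, 0.9)); // true
```

Raising `bm25_weight` toward 1 favors exact keyword matches; lowering it toward 0 favors conceptual similarity.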
Example:
```json
{
  "query": "authentication middleware",
  "mode": "hybrid",
  "limit": 15,
  "filters": {
    "path_not_contains": ["test", "mock"],
    "is_exported": true
  }
}
```
codebase_parallel_search
Run multiple search strategies simultaneously and merge results. Best for comprehensive exploration.
Use Cases:
- Compare semantic vs keyword results
- Search related concepts together
- Test different weight settings
Example:
```json
{
  "searches": [
    {"query": "authentication", "mode": "semantic", "limit": 10},
    {"query": "auth login JWT", "mode": "exact", "limit": 10},
    {"query": "user session", "mode": "hybrid", "bm25_weight": 0.5, "limit": 10}
  ],
  "merge_results": true,
  "merge_strategy": "rrf",
  "merged_limit": 20
}
```
viberag_index
Manually trigger indexing. Normally not needed as file watching handles updates automatically.
Parameters:
- `force` - Full reindex, ignoring cache (default: `false`)
viberag_status
Check index health and configuration.
Returns:
- File count, chunk count
- Embedding provider and dimensions
- Schema version
- Last update timestamp
- Warmup status (ready, initializing, etc.)
viberag_watch_status
Check the file watcher for auto-indexing.
Returns:
- Whether watching is active
- Number of files being watched
- Pending changes count
- Last update timestamp
CLI Commands
VibeRAG includes a CLI for initialization, indexing, setup, and other operations you may want to control manually outside of agent use.
| Command | Description |
| ----------------- | --------------------------------------------------------- |
| /init | Initialize VibeRAG (configure embeddings, index codebase) |
| /index | Index the codebase (incremental) |
| /reindex | Force full reindex |
| /search <query> | Semantic search |
| /status | Show index status |
| /mcp-setup | Configure MCP server for AI tools |
| /clean | Remove VibeRAG from project |
| /help | Show all commands |
Embedding Providers
Choose your embedding provider during /init:
Local Model - Offline, Free
| Model | Quant | Download | RAM |
| ---------- | ----- | -------- | ------ |
| Qwen3-0.6B | Q8 | ~700MB | ~1.5GB |
- Works completely offline, no API key required
- Initial indexing may take time; future updates are incremental
- Works great for code and natural language (docs, docstrings, code comments, etc.)
Cloud Providers - Fastest, Best Quality
| Provider | Model | Dims | Cost | Get API Key |
| -------- | ---------------------- | ---- | --------- | ---------------- |
| Gemini | gemini-embedding-001 | 1536 | Free tier | Google AI Studio |
| Mistral | codestral-embed | 1536 | $0.10/1M | Mistral Console |
| OpenAI | text-embedding-3-small | 1536 | $0.02/1M | OpenAI Platform |
- Gemini - Free tier available, great for getting started
- Mistral - Code-optimized embeddings for technical content
- OpenAI - Fast and reliable with low cost
API keys are entered during the /init wizard and stored securely in .viberag/config.json (automatically added to .gitignore).
How It Works
- Parsing - Tree-sitter extracts functions, classes, and semantic chunks
- Embedding - Code chunks are embedded using local or API-based models
- Storage - Vectors stored in LanceDB (local, no server required)
- Search - Hybrid search combines vector similarity + full-text search
- MCP - AI tools query the index via the MCP protocol
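The `rrf` merge strategy exposed by codebase_parallel_search is reciprocal rank fusion: each result is scored by the sum of reciprocal ranks across the result lists it appears in. A minimal sketch of standard RRF (the conventional k = 60 constant is an assumption; VibeRAG's exact constant isn't documented here):

```typescript
// Reciprocal rank fusion: each result earns 1 / (k + rank) per list it
// appears in, so results ranked well in several lists float to the top.
// k = 60 is the conventional constant from the original RRF formulation.
function rrfMerge(rankings: string[][], k = 60): string[] {
  const scores = new Map<string, number>();
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + index + 1));
    });
  }
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}

const semantic = ["auth.ts", "session.ts", "login.ts"]; // vector ranking
const keyword = ["login.ts", "auth.ts", "jwt.ts"];      // BM25 ranking
console.log(rrfMerge([semantic, keyword])[0]); // "auth.ts" - ranked well in both lists
```

RRF needs only ranks, not raw scores, which is why it merges keyword and vector results cleanly despite their incomparable score scales.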
AI Agent Best Practices
VibeRAG works best when AI agents use sub-agents for exploration tasks. This keeps the main conversation context clean and uses ~8x fewer tokens.
Why Sub-Agents?
When an AI calls viberag directly, all search results expand the main context. Sub-agents run searches in isolated context windows and return only concise summaries.
| Approach | Context Usage | Token Efficiency |
| -------------------- | ------------- | ---------------- |
| Direct viberag calls | 24k tokens | Baseline |
| Sub-agent delegation | 3k tokens | 8x better |
Platform-Specific Guidance
Claude Code
```
# For exploration tasks, use the Task tool:
Task(subagent_type='Explore', prompt='Use viberag to find how authentication works')

# For parallel comprehensive search, launch multiple Tasks together:
Task(subagent_type='Explore', prompt='Search auth patterns')  # runs in parallel
Task(subagent_type='Explore', prompt='Search login flows')    # with the one above
```
Add to your CLAUDE.md:
```
When exploring the codebase, use Task(subagent_type='Explore') and instruct it
to use codebase_search or codebase_parallel_search. This keeps the main context clean.
```
VS Code Copilot
- Use Agent HQ to delegate exploration to background agents
- Background agents can iterate with viberag without blocking your session
- Use `/delegate` to hand off exploration tasks to the Copilot coding agent
Cursor
- Enable Agent mode for multi-step exploration
- Agent mode can orchestrate multiple viberag searches autonomously
- Consider the Sub-Agents MCP server for Claude Code-style delegation
Windsurf
- Cascade automatically plans multi-step tasks
- Enable Turbo Mode for autonomous exploration
- Cascade's planning agent will orchestrate viberag calls efficiently
Roo Code
- Use Architect mode for exploration and understanding
- Boomerang tasks coordinate complex multi-mode workflows
- Each mode (Architect, Code, Debug) can use viberag with focused context
Gemini CLI
- Create extensions that scope viberag tools for specific tasks
- Extensions can bundle viberag with custom prompts for specialized exploration
- Use `gemini mcp add viberag`, then reference it in extension configs
OpenAI Codex
- Use Agents SDK to orchestrate viberag as an MCP tool
- Codex can run as an MCP server itself for multi-agent setups
- Approval modes control how autonomously Codex explores
JetBrains IDEs
- Junie agent handles multi-step exploration autonomously
- Claude Agent integration provides sub-agent-like capabilities
- Access viberag through AI Chat with multi-agent support
Zed
- Use External Agents (Claude Code, Codex, Gemini CLI) for exploration
- Set `auto_approve` in settings for autonomous agent operation
- ACP (Agent Client Protocol) enables BYO agent integration
Quick Lookup vs Exploration
| Task Type | Recommended Approach |
| ------------------------------- | ----------------------------------------------- |
| "Where is function X defined?" | Direct codebase_search with mode='definition' |
| "What file handles Y?" | Direct codebase_search - single query |
| "How does authentication work?" | Sub-agent - needs multiple searches |
| "Find all API endpoints" | Sub-agent or codebase_parallel_search |
| "Understand the data flow" | Sub-agent - iterative exploration |
For Platforms Without Sub-Agents
Use codebase_parallel_search to run multiple search strategies in a single call:
```json
{
  "searches": [
    {"query": "authentication", "mode": "semantic"},
    {"query": "auth login", "mode": "exact"},
    {"query": "user session", "mode": "hybrid", "bm25_weight": 0.5}
  ],
  "merge_results": true,
  "merge_strategy": "rrf"
}
```
This provides comprehensive coverage without multiple round-trips.