simple-milvus-mcp

v1.0.11

Published

6 months ago

MCP server for Milvus vector database with semantic and full-text search capabilities

0High
0Medium
0Low

blevinstein

milvus mcp vector database semantic-search ai embeddings

Simple Milvus MCP

⚠️ NOTE: This is NOT the official MCP server for Milvus
The official MCP server is available at: https://github.com/zilliztech/mcp-server-milvus

Why This Alternative MCP Server?

This simplified MCP server was created for specific use cases where the official server may not be the best fit:

Primary Use Case: Multi-Tenant Collection Management

Multiple MCP servers can use the same Milvus database instance with segregated storage
Each server instance can operate on different collections (e.g., per-user, per-account, per-application)
Enables cost-effective shared infrastructure while maintaining data isolation
Perfect for SaaS applications where each customer needs their own vector space

Secondary Benefit: Agent-Friendly Simplified Interface

Agents and LLMs can struggle with too many tool options, leading to poor decision-making
This server provides just 3 focused tools (store_memory, search_memory, forget_memory)
Simplified interface makes it easier to combine with other MCP servers and capabilities
Optimized for memory/knowledge management workflows rather than full database administration

A Model Context Protocol (MCP) server for Milvus vector database that provides semantic and full-text search capabilities using both dense embeddings and BM25 sparse vectors.

Features

Semantic Search: Vector-based similarity search using dense embeddings
Full-text Search: BM25-based keyword search using sparse vectors
Memory Management: Store, search, and delete documents/memories with auto-generated IDs
Flexible Embedding Models: Support for OpenAI, Vertex AI, and Google embedding models
Automatic Schema Management: Collections are created automatically with proper BM25 configuration
Configurable Collections: Use default collection or specify per-operation

Prerequisites

Milvus Server: Running Milvus 2.5+ instance

API Keys: Required environment variables for embedding models:

# For Google models (default)
export GEMINI_API_KEY="your_google_api_key"
  
# For OpenAI models
export OPENAI_API_KEY="your_openai_api_key"
  
# For Vertex AI models
export VERTEX_PROJECT_ID="your-gcp-project"
export VERTEX_LOCATION="us-central1"
export VERTEX_CREDENTIALS='{"type":"service_account","project_id":"your-project","private_key":"-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----\n","client_email":"[email protected]",...}'

Installation

# Install dependencies
pnpm install

# Build the package
pnpm run build

# Test the installation
pnpm run test

Usage

Running the MCP Server

# Using npx (after building)
npx simple-milvus-mcp --host localhost --port 19530

# With a default collection
npx simple-milvus-mcp --collection my_memories

# Using OpenAI embeddings
npx simple-milvus-mcp --embedding-model openai/text-embedding-3-small

# Development mode (without building)
pnpm run dev --help

Command Line Options

--host: Milvus server host (default: localhost)
--port: Milvus server port (default: 19530)
--collection: Default collection name (optional - collections created as needed)
--embedding-model: Embedding model to use (default: google/text-embedding-004)

Available Tools

1. `store_memory`

Store a document/memory in Milvus with automatic embedding generation and ID creation.

Parameters:

content (string, required): The text content to store
metadata (object, optional): Additional metadata to store with the memory
collection (string, optional): Collection name (if not set as default)

Response:

{
  "success": true,
  "operation": "store",
  "result": {
    "id": "mem_1234567890_abc123def",
    "collection": "my_memories",
    "content_length": 42,
    "embedding_dimensions": 768,
    "embedding_model": "google/text-embedding-004",
    "metadata": {"topic": "AI"},
    "created_at": "2024-01-01T12:00:00.000Z"
  }
}

2. `search_memory`

Search for memories/documents using semantic or full-text search.

Parameters:

query (string, required): Search query text
mode (string, optional): Search mode - semantic or fulltext (default: semantic)
limit (number, optional): Maximum number of results (default: 10)
collection (string, optional): Collection name (if not set as default)

Response:

{
  "success": true,
  "operation": "search",
  "result": {
    "query": "machine learning",
    "mode": "semantic",
    "count": 2,
    "memories": [
      {
        "id": "mem_1234567890_abc123def",
        "content": "Machine learning is a subset of AI...",
        "similarity": 0.92,
        "metadata": {"topic": "AI"},
        "created_at": "2024-01-01T12:00:00.000Z"
      }
    ]
  }
}

3. `forget_memory`

Delete a memory/document from Milvus using its auto-generated ID.

Parameters:

id (string, required): Auto-generated ID of the memory to delete (format: mem_timestamp_randomstring)
collection (string, optional): Collection name (if not set as default)

Response:

{
  "success": true,
  "operation": "delete",
  "result": {
    "id": "mem_1234567890_abc123def",
    "collection": "my_memories"
  }
}

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

Simple Milvus MCP

Why This Alternative MCP Server?

Features

Prerequisites

Installation

Usage

Running the MCP Server

Command Line Options

Available Tools

1. store_memory

2. search_memory

3. forget_memory

1. `store_memory`

2. `search_memory`

3. `forget_memory`