strapi-content-embeddings

v0.2.0

Published

a day ago

Strapi v5 plugin for vector embeddings with OpenAI and Neon PostgreSQL. Enables semantic search, RAG chat, and MCP (Model Context Protocol) integration.


codingafterthirty

strapi strapi-plugin embeddings vector-search rag openai neon pgvector mcp semantic-search

Strapi Content Embeddings

A Strapi v5 plugin that creates vector embeddings from your content using OpenAI and stores them in Neon PostgreSQL with pgvector. Enables semantic search, RAG (Retrieval-Augmented Generation) chat, and MCP (Model Context Protocol) integration for AI assistants like Claude Desktop.

Features

Vector Embeddings: Generate embeddings from your content using OpenAI's embedding models
Neon PostgreSQL Storage: Store embeddings in Neon DB with pgvector for efficient similarity search
RAG Chat Interface: Built-in chat widget to ask questions about your content
MCP Server: Expose your embeddings to AI assistants via Model Context Protocol
Content Manager Integration: Create embeddings directly from any content type's edit view
Standalone Embeddings: Create embeddings independent of content types
Multiple Embedding Models: Support for OpenAI's text-embedding-3-small, text-embedding-3-large, and text-embedding-ada-002
Database Sync: Sync embeddings from Neon DB to Strapi via admin UI or API endpoints
Automatic Chunking: Split large content into multiple embeddings with overlap for context preservation
Content Preprocessing: Automatically strips HTML and Markdown formatting for cleaner embeddings

Requirements

Strapi v5.x
Node.js 18+
OpenAI API key
Neon PostgreSQL database with pgvector extension

Installation

npm install strapi-content-embeddings
# or
yarn add strapi-content-embeddings

Configuration

1. Enable the Plugin

Add the plugin to your config/plugins.ts (or config/plugins.js):

export default ({ env }) => ({
  "strapi-content-embeddings": {
    enabled: true,
    config: {
      openAIApiKey: env("OPENAI_API_KEY"),
      neonConnectionString: env("NEON_CONNECTION_STRING"),
      // Optional: Choose embedding model (default: "text-embedding-3-small")
      embeddingModel: env("EMBEDDING_MODEL", "text-embedding-3-small"),
    },
  },
});

2. Set Environment Variables

Add the following to your .env file:

OPENAI_API_KEY=sk-your-openai-api-key
NEON_CONNECTION_STRING=postgresql://user:password@your-neon-host/dbname?sslmode=require
# Optional
EMBEDDING_MODEL=text-embedding-3-small

3. Get Your Neon Connection String

Sign up at Neon
Create a new project
Navigate to your project's Connection Details
Copy the connection string (it should look like postgresql://user:password@your-neon-host/dbname?sslmode=require)

The plugin will automatically:

Enable the pgvector extension
Create the embeddings_documents table
Set up HNSW indexes for fast similarity search

MCP Integration

This plugin exposes an MCP (Model Context Protocol) server that allows AI assistants like Claude Desktop to search your embeddings.

MCP Endpoint

POST /api/strapi-content-embeddings/mcp

Available MCP Tools

| Tool | Description | Trigger |
|------|-------------|---------|
| rag_query | Ask questions and get AI-generated answers from your content | /rag [question] |
| semantic_search | Find semantically similar content | /rag search [query] |
| list_embeddings | List all stored embeddings | - |
| get_embedding | Get a specific embedding by ID | - |
| create_embedding | Create a new embedding | - |
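For clients that talk to the endpoint directly instead of going through mcp-remote, MCP messages are JSON-RPC 2.0 requests, and tool invocations use the tools/call method. The sketch below only builds such a request body for create_embedding, using the title, content, and autoChunk arguments that appear elsewhere in this README. The helper name buildToolCall is hypothetical, and a real session would also need the MCP initialization handshake, which is not shown here.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: builds the request body for one tool invocation.
function buildToolCall(
  id: number,
  toolName: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: toolName, arguments: args },
  };
}

const createRequest = buildToolCall(1, "create_embedding", {
  title: "My Long Document",
  content: "... very long content ...",
  autoChunk: true,
});

// Serialized body that would be POSTed to /api/strapi-content-embeddings/mcp
const requestBody = JSON.stringify(createRequest);
```

In practice mcp-remote and Claude Desktop construct these messages for you; the sketch is mainly useful when debugging the endpoint by hand.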

Claude Desktop Configuration

Add to your Claude Desktop config (~/Library/Application Support/Claude/claude_desktop_config.json):

{
  "mcpServers": {
    "strapi-content-embeddings": {
      "command": "npx",
      "args": [
        "mcp-remote",
        "https://your-strapi-url.com/api/strapi-content-embeddings/mcp",
        "--header",
        "Authorization: Bearer YOUR_STRAPI_API_TOKEN"
      ]
    }
  }
}

Usage in Claude Desktop

Type /rag followed by your question to search your embeddings:

/rag What is Strapi?
/rag Who is Paul Bratslavsky?

Available Embedding Models

| Model | Dimensions | Description |
|-------|------------|-------------|
| text-embedding-3-small | 1536 | Fast, cost-effective (default) |
| text-embedding-3-large | 3072 | Higher accuracy, more expensive |
| text-embedding-ada-002 | 1536 | Legacy model |
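Because the embedding column is created with a fixed vector size, the model's dimension count matters when switching models. A minimal sketch of a lookup and compatibility check follows; the names EMBEDDING_DIMENSIONS, dimensionsFor, and isCompatible are hypothetical and not part of the plugin's API.

```typescript
// Vector sizes for the models this README lists as supported.
const EMBEDDING_DIMENSIONS: Record<string, number> = {
  "text-embedding-3-small": 1536,
  "text-embedding-3-large": 3072,
  "text-embedding-ada-002": 1536,
};

// Hypothetical helper: returns the expected vector length for a model name.
function dimensionsFor(model: string): number {
  const dims = EMBEDDING_DIMENSIONS[model];
  if (dims === undefined) {
    throw new Error(`Unsupported embedding model: ${model}`);
  }
  return dims;
}

// Hypothetical check: true when stored vectors match the configured model's size.
function isCompatible(storedDimensions: number, model: string): boolean {
  return storedDimensions === dimensionsFor(model);
}
```

Under this check, vectors created with text-embedding-3-small would be flagged as incompatible with text-embedding-3-large, which is one reason to keep the model consistent between creation and query.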

Usage

Admin Panel

Create Embeddings Page

Navigate to Content Embeddings in the Strapi admin sidebar to:

View all existing embeddings
Create new standalone embeddings
Delete embeddings
Search and filter embeddings

Content Manager Integration

When editing any content type, you'll see an Embeddings panel in the right sidebar:

Create Embedding: Generate an embedding from the current content
View Embedding: Navigate to the embedding details
Update Embedding: Update the embedding with current content changes

Chat Widget

Click the robot icon in the bottom-right corner to open the RAG chat interface:

Ask questions about your embedded content
View source documents used to generate answers
Navigate to source embeddings

Programmatic Usage

Create an Embedding

const result = await strapi
  .plugin("strapi-content-embeddings")
  .service("embeddings")
  .createEmbedding({
    data: {
      title: "My Document",
      content: "This is the content to embed...",
      collectionType: "api::article.article", // optional
      fieldName: "content", // optional
      metadata: { customField: "value" }, // optional
    },
  });

Query Embeddings (RAG)

const response = await strapi
  .plugin("strapi-content-embeddings")
  .service("embeddings")
  .queryEmbeddings("What is this document about?");

// response.text - The AI-generated answer
// response.sourceDocuments - The documents used for context

Similarity Search

const documents = await strapi
  .plugin("strapi-content-embeddings")
  .service("embeddings")
  .similaritySearch("search query", 4); // returns top 4 similar documents

API Endpoints

All endpoints require admin authentication.

| Method | Endpoint | Description |
|--------|----------|-------------|
| POST | /strapi-content-embeddings/embeddings/create-embedding | Create a new embedding |
| PUT | /strapi-content-embeddings/embeddings/update-embedding/:id | Update an existing embedding |
| DELETE | /strapi-content-embeddings/embeddings/delete-embedding/:id | Delete an embedding |
| GET | /strapi-content-embeddings/embeddings/find | List all embeddings |
| GET | /strapi-content-embeddings/embeddings/find/:id | Get a single embedding |
| GET | /strapi-content-embeddings/embeddings/embeddings-query?query=... | RAG query |
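The RAG query endpoint takes the question as a query-string parameter, so it needs URL encoding. A small sketch, assuming only the path listed in the endpoint table; the helper name buildRagQueryPath is hypothetical, and how the admin authentication is attached depends on your client.

```typescript
const EMBEDDINGS_QUERY_PATH =
  "/strapi-content-embeddings/embeddings/embeddings-query";

// Hypothetical helper: builds the request path for a RAG query,
// percent-encoding the question so spaces and "?" survive transport.
function buildRagQueryPath(question: string): string {
  return `${EMBEDDINGS_QUERY_PATH}?query=${encodeURIComponent(question)}`;
}

const ragPath = buildRagQueryPath("What is Strapi?");
// "/strapi-content-embeddings/embeddings/embeddings-query?query=What%20is%20Strapi%3F"
```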

Database Sync (Neon to Strapi)

The plugin provides endpoints to sync embeddings from Neon DB (source of truth) to Strapi. These endpoints are designed to be triggered manually or via cron jobs.

Sync Endpoints

| Method | Endpoint | Description |
|--------|----------|-------------|
| GET/POST | /api/strapi-content-embeddings/sync | Sync embeddings from Neon to Strapi |
| GET | /api/strapi-content-embeddings/sync/status | Check sync status without making changes |

Query Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| dryRun | boolean | false | Preview changes without applying them |
| removeOrphans | boolean | false | Remove Strapi entries that don't exist in Neon |
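When calling the sync endpoint from a script, the two flags can be assembled into the URL programmatically. The sketch below is illustrative only: buildSyncUrl and SyncOptions are hypothetical names, and it assumes the two parameters can be combined in one request, which this README does not state explicitly.

```typescript
interface SyncOptions {
  dryRun?: boolean;
  removeOrphans?: boolean;
}

// Hypothetical helper: builds the sync URL, adding only the flags that are
// enabled (both are documented as defaulting to false).
function buildSyncUrl(baseUrl: string, options: SyncOptions = {}): string {
  const params: string[] = [];
  if (options.dryRun) params.push("dryRun=true");
  if (options.removeOrphans) params.push("removeOrphans=true");
  const root = baseUrl.replace(/\/+$/, ""); // drop trailing slashes
  const query = params.length > 0 ? `?${params.join("&")}` : "";
  return `${root}/api/strapi-content-embeddings/sync${query}`;
}

const previewUrl = buildSyncUrl("http://localhost:1337", { dryRun: true });
// "http://localhost:1337/api/strapi-content-embeddings/sync?dryRun=true"
```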

Usage Examples

Check sync status:

curl "http://localhost:1337/api/strapi-content-embeddings/sync/status" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Response:

{
  "neonCount": 150,
  "strapiCount": 145,
  "inSync": false,
  "missingInStrapi": 5,
  "missingInNeon": 0,
  "contentDifferences": 2
}
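A script that polls the status endpoint may want to turn that payload into a short message. The following sketch assumes the response has exactly the fields shown in the example above; SyncStatus and summarizeSyncStatus are hypothetical names.

```typescript
// Mirrors the fields of the example status response.
interface SyncStatus {
  neonCount: number;
  strapiCount: number;
  inSync: boolean;
  missingInStrapi: number;
  missingInNeon: number;
  contentDifferences: number;
}

// Hypothetical helper: turns a status payload into a one-line summary.
function summarizeSyncStatus(s: SyncStatus): string {
  if (s.inSync) return "In sync";
  const parts: string[] = [];
  if (s.missingInStrapi > 0) parts.push(`${s.missingInStrapi} missing in Strapi`);
  if (s.missingInNeon > 0) parts.push(`${s.missingInNeon} missing in Neon`);
  if (s.contentDifferences > 0) {
    parts.push(`${s.contentDifferences} with content differences`);
  }
  return `Out of sync: ${parts.join(", ")}`;
}

const exampleStatus: SyncStatus = {
  neonCount: 150,
  strapiCount: 145,
  inSync: false,
  missingInStrapi: 5,
  missingInNeon: 0,
  contentDifferences: 2,
};

const statusSummary = summarizeSyncStatus(exampleStatus);
// "Out of sync: 5 missing in Strapi, 2 with content differences"
```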

Dry run (preview changes):

curl "http://localhost:1337/api/strapi-content-embeddings/sync?dryRun=true" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Run sync:

curl "http://localhost:1337/api/strapi-content-embeddings/sync" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Sync and remove orphans:

curl "http://localhost:1337/api/strapi-content-embeddings/sync?removeOrphans=true" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Sync Response

{
  "success": true,
  "timestamp": "2024-01-07T12:00:00.000Z",
  "neonCount": 150,
  "strapiCount": 150,
  "actions": {
    "created": 5,
    "updated": 2,
    "orphansRemoved": 0
  },
  "details": {
    "created": ["doc1 (Title 1)", "doc2 (Title 2)"],
    "updated": ["doc3 (Title 3)"],
    "orphansRemoved": []
  },
  "errors": []
}

Cron Job Example

# Sync every hour
0 * * * * curl -s "https://your-strapi.com/api/strapi-content-embeddings/sync" \
  -H "Authorization: Bearer YOUR_API_TOKEN" >> /var/log/embeddings-sync.log

Content Chunking

For large content that exceeds the recommended size for embeddings (~4000 characters / ~1000 tokens), the plugin supports automatic chunking.

How Chunking Works

Smart Splitting: Content is split at natural boundaries (paragraphs, sentences, words) to preserve meaning
Overlap: Chunks include overlapping content (default: 200 chars) to maintain context between chunks
Metadata: Each chunk stores metadata linking it to the original content and other chunks
Titles: Chunk titles include part numbers (e.g., "My Document [Part 1/3]")
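The behaviour described above can be approximated with a short function. This is a simplified sketch, not the plugin's actual implementation: chunkText, chunkTitle, and TextChunk are hypothetical names, and the choice to look for a boundary only in the second half of each window is an assumption made for illustration.

```typescript
interface TextChunk {
  text: string;
  startOffset: number;
  endOffset: number;
}

// Splits text into chunks of at most chunkSize characters, preferring to end
// a chunk at a paragraph, sentence, or word boundary, and starting each new
// chunk chunkOverlap characters before the previous one ended.
function chunkText(text: string, chunkSize = 4000, chunkOverlap = 200): TextChunk[] {
  if (chunkSize <= 0 || chunkOverlap < 0 || chunkOverlap >= chunkSize) {
    throw new Error("Require chunkSize > 0 and 0 <= chunkOverlap < chunkSize");
  }
  const chunks: TextChunk[] = [];
  let start = 0;
  while (start < text.length) {
    let end = Math.min(start + chunkSize, text.length);
    if (end < text.length) {
      const window = text.slice(start, end);
      const minBreak = Math.floor(chunkSize / 2);
      // Try the coarsest boundary first: paragraph, then sentence, then word.
      for (const sep of ["\n\n", ". ", " "]) {
        const idx = window.lastIndexOf(sep);
        if (idx >= minBreak) {
          end = start + idx + sep.length;
          break;
        }
      }
    }
    chunks.push({ text: text.slice(start, end), startOffset: start, endOffset: end });
    if (end >= text.length) break;
    // Step back by the overlap, but always move forward by at least one character.
    start = Math.max(end - chunkOverlap, start + 1);
  }
  return chunks;
}

// Formats a chunk title in the "[Part i/n]" style described above.
function chunkTitle(title: string, index: number, total: number): string {
  return `${title} [Part ${index + 1}/${total}]`;
}
```

With the defaults, a 10,000-character text with no natural boundaries yields three chunks starting at offsets 0, 3800, and 7600.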

Configuration

Add chunking options to your plugin config:

// config/plugins.ts
export default ({ env }) => ({
  "strapi-content-embeddings": {
    enabled: true,
    config: {
      openAIApiKey: env("OPENAI_API_KEY"),
      neonConnectionString: env("NEON_CONNECTION_STRING"),
      // Chunking options
      chunkSize: 4000,      // Max characters per chunk (default: 4000)
      chunkOverlap: 200,    // Overlap between chunks (default: 200)
      autoChunk: false,     // Auto-chunk large content globally (default: false)
    },
  },
});

Using Chunking

Via MCP Tool

{
  "tool": "create_embedding",
  "arguments": {
    "title": "My Long Document",
    "content": "... very long content ...",
    "autoChunk": true
  }
}

Programmatic Usage

// Create with automatic chunking
const result = await strapi
  .plugin("strapi-content-embeddings")
  .service("embeddings")
  .createChunkedEmbedding({
    data: {
      title: "My Long Document",
      content: "... very long content ...",
    },
  });

console.log(result);
// {
//   entity: { ... first chunk ... },
//   chunks: [ ... all chunks ... ],
//   totalChunks: 5,
//   wasChunked: true
// }

// Or use createEmbedding with autoChunk flag
const embedding = await strapi
  .plugin("strapi-content-embeddings")
  .service("embeddings")
  .createEmbedding({
    data: {
      title: "My Document",
      content: "... long content ...",
      autoChunk: true,  // Enable chunking
    },
  });

Chunk Metadata

Each chunk embedding includes metadata:

{
  "isChunk": true,
  "chunkIndex": 0,
  "totalChunks": 5,
  "startOffset": 0,
  "endOffset": 4200,
  "originalTitle": "My Long Document",
  "parentDocumentId": "abc123",
  "estimatedTokens": 1050
}
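In the example above, estimatedTokens (1050) equals the chunk length (4200 characters) divided by four, which matches the "~4000 characters / ~1000 tokens" guideline. The sketch below builds metadata on that assumption; whether the plugin uses exactly this heuristic is not confirmed here, and estimateTokens and buildChunkMetadata are hypothetical names.

```typescript
interface ChunkMetadata {
  isChunk: true;
  chunkIndex: number;
  totalChunks: number;
  startOffset: number;
  endOffset: number;
  originalTitle: string;
  parentDocumentId: string;
  estimatedTokens: number;
}

// Rough heuristic (assumption): about 4 characters per token.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Hypothetical helper: assembles the metadata object for one chunk.
function buildChunkMetadata(
  chunkText: string,
  chunkIndex: number,
  totalChunks: number,
  startOffset: number,
  originalTitle: string,
  parentDocumentId: string
): ChunkMetadata {
  return {
    isChunk: true,
    chunkIndex,
    totalChunks,
    startOffset,
    endOffset: startOffset + chunkText.length,
    originalTitle,
    parentDocumentId,
    estimatedTokens: estimateTokens(chunkText),
  };
}
```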

Content Preprocessing

The plugin automatically preprocesses content before creating embeddings to improve semantic search quality. This is enabled by default.

What Gets Cleaned

HTML tags: Stripped while preserving text content
Markdown syntax: Headers (#), bold (**), italic (*), links, lists, code blocks
Whitespace: Normalized (multiple spaces/newlines collapsed)

Why Preprocess?

Raw markdown/HTML formatting adds noise to embeddings without adding semantic meaning:

Input:  "## Features\n- **Fast** search\n- <b>Reliable</b>"
Output: "Features: Fast search. Reliable"

Both strings carry the same meaning, but the cleaned version contains no markup noise and so tends to produce better embeddings for search.
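As a rough illustration of this kind of cleaning, the sketch below strips HTML tags and common Markdown syntax with regular expressions. It is not the plugin's preprocessing code: stripFormatting is a hypothetical name, the patterns cover only simple cases, and unlike the example above it does not insert punctuation between the cleaned parts.

```typescript
// Simplified cleaner: removes HTML tags and basic Markdown markers,
// then collapses whitespace. Not robust against nested or malformed markup.
function stripFormatting(input: string): string {
  return input
    .replace(/<[^>]+>/g, " ")                  // HTML tags
    .replace(/\[([^\]]*)\]\([^)]*\)/g, "$1")   // [text](url) becomes text
    .replace(/^#{1,6}\s+/gm, "")               // heading markers
    .replace(/^\s*[-*+]\s+/gm, "")             // list bullets
    .replace(/\*{1,2}([^*]+)\*{1,2}/g, "$1")   // bold and italic
    .replace(/`/g, "")                         // inline code ticks
    .replace(/\s+/g, " ")                      // collapse whitespace
    .trim();
}

const cleaned = stripFormatting("## Features\n- **Fast** search\n- <b>Reliable</b>");
// "Features Fast search Reliable"
```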

Configuration

Preprocessing is enabled by default. To disable:

// config/plugins.ts
export default ({ env }) => ({
  "strapi-content-embeddings": {
    enabled: true,
    config: {
      openAIApiKey: env("OPENAI_API_KEY"),
      neonConnectionString: env("NEON_CONNECTION_STRING"),
      preprocessContent: false,  // Disable preprocessing
    },
  },
});

Note: The original content is always preserved in Strapi. Preprocessing only affects the text sent to OpenAI for embedding generation.

Admin Sync UI

The plugin includes a built-in sync interface accessible from the admin panel. Click the Sync button in the Content Embeddings page header.

Available Operations

| Operation | Description |
|-----------|-------------|
| Check Status | Compare the Neon and Strapi databases and show counts and differences |
| Sync from Neon | Import embeddings from Neon to Strapi (with preview option) |
| Recreate All | Delete all Neon embeddings and recreate them from Strapi data |

Sync Workflow

Click Sync button to open the sync modal
View current sync status (Neon vs Strapi counts)
Select Sync from Neon operation
Click Preview Sync to see what changes would be made
Review the results (Created/Updated/Removed counts)
Click Apply Changes to execute the sync

Sync Options

Dry Run: Preview changes without applying them (enabled by default)
Remove Orphans: Delete Strapi entries that don't exist in Neon

How It Works

Embedding Creation: When you create an embedding, the content is sent to OpenAI's embedding API to generate a vector representation (1536 or 3072 dimensions depending on the model).
Storage: The embedding vector is stored in Neon PostgreSQL using the pgvector extension, along with the content and metadata.
Similarity Search: When querying, the search query is converted to an embedding and compared against stored embeddings using cosine similarity via pgvector's HNSW index.
RAG Response: For chat queries, the most relevant documents are retrieved and passed to GPT-4o-mini as context to generate an accurate response.
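The similarity step can be illustrated with a brute-force version of cosine ranking. In the plugin this comparison happens inside PostgreSQL through pgvector and its HNSW index, so the code below is only a conceptual sketch; StoredDocument, cosineSimilarity, and topK are hypothetical names.

```typescript
interface StoredDocument {
  id: string;
  content: string;
  embedding: number[];
}

// Cosine similarity: dot product divided by the product of the vector norms.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // avoid dividing by zero
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns the k documents most similar to the query vector, best first.
function topK(query: number[], docs: StoredDocument[], k: number): StoredDocument[] {
  return docs
    .map((doc) => ({ doc, score: cosineSimilarity(query, doc.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.doc);
}
```

An exhaustive scan like this is linear in the number of stored vectors; an HNSW index trades exactness for speed by searching an approximate neighbour graph instead.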

Database Schema

The plugin creates an embeddings_documents table in your Neon database:

CREATE TABLE embeddings_documents (
  id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
  content TEXT,
  metadata JSONB,
  embedding vector(1536)  -- or 3072 for text-embedding-3-large
);

Indexes:

HNSW index on embedding for fast similarity search
GIN index on metadata for filtering
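For reference, statements along the following lines would produce the table and indexes described above. This is a sketch under assumptions: the README does not show the exact SQL the plugin runs, the index names are made up for illustration, and buildSchemaStatements is a hypothetical helper. The vector_cosine_ops operator class is pgvector's operator class for cosine distance.

```typescript
// Hypothetical helper: returns setup SQL for a given embedding size.
// Index names are illustrative; the plugin's actual names may differ.
function buildSchemaStatements(dimensions: number): string[] {
  if (!Number.isInteger(dimensions) || dimensions <= 0) {
    throw new Error("dimensions must be a positive integer");
  }
  return [
    "CREATE EXTENSION IF NOT EXISTS vector;",
    [
      "CREATE TABLE IF NOT EXISTS embeddings_documents (",
      "  id UUID PRIMARY KEY DEFAULT gen_random_uuid(),",
      "  content TEXT,",
      "  metadata JSONB,",
      `  embedding vector(${dimensions})`,
      ");",
    ].join("\n"),
    "CREATE INDEX IF NOT EXISTS embeddings_documents_embedding_idx " +
      "ON embeddings_documents USING hnsw (embedding vector_cosine_ops);",
    "CREATE INDEX IF NOT EXISTS embeddings_documents_metadata_idx " +
      "ON embeddings_documents USING gin (metadata);",
  ];
}

const schemaStatements = buildSchemaStatements(1536);
```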

Permissions

The plugin registers the following RBAC permissions:

plugin::strapi-content-embeddings.read - View embeddings
plugin::strapi-content-embeddings.create - Create embeddings
plugin::strapi-content-embeddings.update - Update embeddings
plugin::strapi-content-embeddings.delete - Delete embeddings
plugin::strapi-content-embeddings.chat - Use the RAG chat feature

Configure these in Settings > Roles for each admin role.

Troubleshooting

Embeddings not being created

Check that OPENAI_API_KEY is set correctly
Check that NEON_CONNECTION_STRING is valid
Look for errors in the Strapi console

Chat returns "cannot find the answer"

Ensure embeddings exist in the database
Try creating more specific content
Check that the embedding model matches between creation and query

Connection errors

Verify your Neon connection string includes ?sslmode=require
Check that your Neon project is active (not paused)
Ensure the pgvector extension is enabled

MCP not connecting

Verify the MCP endpoint URL is correct
Check the Authorization header has a valid Strapi API token
Ensure the plugin is properly configured and Strapi is running

License

MIT
