@gleanwork/connector-mcp

v0.5.0

Published

3 months ago

MCP server for AI-assisted Glean connector development

Downloads

0High
0Medium
0Low

mcp mcp-server glean connector

@gleanwork/connector-mcp

An MCP server for AI-assisted Glean connector development. Add it to your IDE's MCP config and use Claude Code, Cursor, or any MCP-compatible AI assistant to scaffold, schema-map, generate, and test Glean connectors — without leaving your editor.

Prerequisites

Node.js ≥ 20
Python + uv (for run_connector and Copier scaffolding)
A Glean API token

Setup

Claude Code

Add to .claude/mcp.json in your project (or ~/.claude/mcp.json globally):

{
  "mcpServers": {
    "local": {
      "command": "npx",
      "args": [
        "-y",
        "@gleanwork/connector-mcp"
      ],
      "type": "stdio"
    }
  }
}

For setup instructions for Cursor, VS Code, Windsurf, Goose, Codex, JetBrains, Gemini CLI, OpenCode, and more, see docs/setup.md.

Environment Variables

These are set in the MCP server config, not in your connector project.

| Variable | Required | Description | | ------------------------------- | -------- | -------------------------------------------------------------------------------------- | | GLEAN_PROJECT_PATH | No | Default project directory; overridden by create_connector | | GLEAN_CONNECTOR_TEMPLATE_PATH | No | Path to a custom Copier template (defaults to copier-glean-connector in workspace) | | GLEAN_WORKER_COMMAND | No | Command to start the Python worker (default: uv run python -m glean.indexing.worker) | | GLEAN_WORKER_REQUEST_TIMEOUT | No | Max seconds to wait for a worker JSON-RPC response (default: 30) | | GLEAN_WORKER_SHUTDOWN_TIMEOUT | No | Seconds to wait for graceful worker shutdown before SIGKILL (default: 5) |

GLEAN_INSTANCE and GLEAN_API_TOKEN belong in your connector project's .env file — create_connector generates a .env.example to get you started.

Quick Start

In your AI assistant, try:

"I want to build a Glean connector for our Salesforce Opportunities data. The API uses OAuth2 bearer tokens and returns paginated JSON. Let's start."

Or call get_started — the assistant will ask what you're connecting and walk you through the rest.

Core Workflow

Six steps from zero to a running connector. The assistant guides you through each one.

| Step | What you're doing | Tool | | ---- | -------------------------------------------------------- | ---------------------------------------- | | 1 | Scaffold the project | create_connector | | 2 | Configure the data source | set_config | | 3 | Define the schema (infer from a sample file or write it) | infer_schema + update_schema | | 4 | Map fields and verify required Glean fields are covered | confirm_mappings + validate_mappings | | 5 | Generate the Python connector code | build_connector | | 6 | Run the connector and inspect results | run_connector + inspect_execution |

Tool Reference

Project Setup

| Tool | Description | | ------------------ | --------------------------------------------------------------------------- | | get_started | Open the guided workflow; the assistant asks what you're connecting | | create_connector | Scaffold a new connector project and set the active session path | | set_config | Write connector config (auth, endpoint, pagination) to .glean/config.json | | get_config | Read .glean/config.json |

Schema

| Tool | Description | | --------------- | --------------------------------------------------------------------- | | infer_schema | Parse a .csv, .json, or .ndjson file and return field analysis | | get_schema | Read current .glean/schema.json | | update_schema | Write field definitions to .glean/schema.json | | analyze_field | Deep-dive on a single field: samples, type, Glean mapping suggestions |

Field Mapping

| Tool | Description | | ------------------- | ------------------------------------------------------ | | get_mappings | Show source schema and Glean entity model side-by-side | | confirm_mappings | Save field mapping decisions to .glean/mappings.json | | validate_mappings | Check mappings for missing required Glean fields |

Build & Run

| Tool | Description | | ------------------- | ----------------------------------------------------------------------------------------------- | | build_connector | Generate src/{module}/connector.py, models.py, mock_data.json from schema+mappings+config | | run_connector | Start async connector execution; returns execution_id immediately | | inspect_execution | Poll execution status; returns records, validation issues, logs | | manage_recording | Record/replay/list/delete connector data recordings |

Project Layout

After create_connector, your project directory looks like:

my-connector/
├── src/
│   └── {module_name}/
│       ├── connector.py    ← generated by build_connector
│       ├── models.py       ← generated TypedDict for source data
│       └── mock_data.json  ← sample data for local testing
├── CLAUDE.md           ← workflow guidance (for Claude Code users)
└── .glean/
    ├── schema.json     ← field schema
    ├── mappings.json   ← field mappings to Glean entity model
    ├── config.json     ← connector configuration
    ├── executions/     ← execution results (written on completion)
    └── recordings/     ← captured API responses for replay

MCP Resource

The server exposes a connector://workflow resource that returns the full authoring guide. Your AI assistant can fetch it at session start for workflow context.

Known Limitations

Single project per MCP session. The server tracks one active project at a time. To switch projects, set GLEAN_PROJECT_PATH in the MCP server config or restart the server after running create_connector for the new project.
Execution state is in-memory. Active execution history is lost when the MCP server restarts. Completed execution results written to .glean/executions/ persist on disk, but any in-progress executions must be re-run.

Troubleshooting

`spawn uv ENOENT`

uv is not installed or is not on your PATH. The server requires uv to scaffold connector projects and to run the Python worker.

Install it following the official uv instructions, then verify:

uv --version

If uv is installed but not on the PATH seen by your IDE, add it explicitly in the MCP server env config or set GLEAN_WORKER_COMMAND to the full path of an alternative command.

`Copier template not found`

The server could not locate the copier-glean-connector template. By default it looks for the template alongside this package in the Glean workspace or clones it from github.com/gleanwork over SSH.

Set the GLEAN_CONNECTOR_TEMPLATE_PATH environment variable to the absolute path of a local checkout of the template:

{
  "env": {
    "GLEAN_CONNECTOR_TEMPLATE_PATH": "/path/to/copier-glean-connector"
  }
}

`Worker exited` / `glean.indexing.worker` module not found

The Python worker process exited immediately. This usually means one of:

You are not inside a Copier-scaffolded connector project. The run_connector tool must be called after create_connector has set up the project directory with the correct pyproject.toml and dependencies.
The glean-indexing-sdk is not installed in the project's virtual environment. Run uv sync inside the connector project directory to install dependencies.
Wrong working directory. Ensure GLEAN_PROJECT_PATH points to the connector project root, or run create_connector first to set the active session path automatically.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@gleanwork/connector-mcp

Prerequisites

Setup

Claude Code

Environment Variables

Quick Start

Core Workflow

Tool Reference

Project Setup

Schema

Field Mapping

Build & Run

Project Layout

MCP Resource

Known Limitations

Troubleshooting

spawn uv ENOENT

Copier template not found

Worker exited / glean.indexing.worker module not found

`spawn uv ENOENT`

`Copier template not found`

`Worker exited` / `glean.indexing.worker` module not found