# CubeAPM MCP Server
A Model Context Protocol (MCP) server for CubeAPM - enabling AI assistants like Claude to query your observability data including traces, metrics, and logs.
## What is this?
This MCP server connects AI assistants (like Claude) to your CubeAPM instance, allowing you to:
- Query logs using natural language that gets translated to LogsQL
- Analyze metrics with PromQL-compatible queries
- Search and inspect traces to debug distributed systems
- Monitor your services through conversational interfaces
## Installation

### From NPM (Recommended)
```bash
npm install -g cubeapm-mcp
```

### From Source

```bash
git clone https://github.com/TechnicalRhino/cubeapm-mcp.git
cd cubeapm-mcp
npm install
npm run build
```

## Quick Start
### 1. Configure Claude Code

Add the following to your Claude Code settings (`~/.claude/settings.json`):
```json
{
  "mcpServers": {
    "cubeapm": {
      "command": "npx",
      "args": ["-y", "cubeapm-mcp"],
      "env": {
        "CUBEAPM_HOST": "your-cubeapm-server.com"
      }
    }
  }
}
```

### 2. Restart Claude Code
After updating the settings, restart Claude Code to load the MCP server.
### 3. Start Querying
You can now ask Claude questions like:
- "Show me error logs from the payment-service in the last hour"
- "What's the p99 latency for the checkout API?"
- "Find traces where duration > 5s in production"
- "Get the full trace for trace ID abc123def456"
## Configuration

| Environment Variable | Default | Description |
|----------------------|---------|-------------|
| `CUBEAPM_URL` | - | Full URL to CubeAPM (e.g., `https://cube.example.com`). Takes precedence over the HOST/PORT settings. |
| `CUBEAPM_HOST` | `localhost` | CubeAPM server hostname or IP (used if `CUBEAPM_URL` is not set) |
| `CUBEAPM_QUERY_PORT` | `3140` | Port for query APIs (traces, metrics, logs) |
| `CUBEAPM_INGEST_PORT` | `3130` | Port for ingestion APIs |
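To make the precedence rule concrete, here is a minimal sketch of how the settings above could combine into a base URL. `resolveBaseUrl` is a hypothetical helper written for illustration, not part of the cubeapm-mcp source:

```typescript
// Illustrative only: resolveBaseUrl is a hypothetical helper showing the
// documented precedence (CUBEAPM_URL wins over CUBEAPM_HOST/CUBEAPM_QUERY_PORT).
function resolveBaseUrl(env: Record<string, string | undefined>): string {
  if (env.CUBEAPM_URL) {
    // Full URL takes precedence; strip a trailing slash for consistency.
    return env.CUBEAPM_URL.replace(/\/$/, "");
  }
  const host = env.CUBEAPM_HOST ?? "localhost";     // documented default
  const port = env.CUBEAPM_QUERY_PORT ?? "3140";    // documented default
  return `http://${host}:${port}`;
}

console.log(resolveBaseUrl({ CUBEAPM_URL: "https://cube.example.com" }));
console.log(resolveBaseUrl({ CUBEAPM_HOST: "cubeapm.internal.company.com" }));
```

With no variables set at all, this sketch falls back to `http://localhost:3140`, matching the defaults in the table.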
### Example Configurations

**Local Development:**
```json
{
  "mcpServers": {
    "cubeapm": {
      "command": "npx",
      "args": ["-y", "cubeapm-mcp"],
      "env": {
        "CUBEAPM_HOST": "localhost"
      }
    }
  }
}
```

**Production (with full URL):**

```json
{
  "mcpServers": {
    "cubeapm": {
      "command": "npx",
      "args": ["-y", "cubeapm-mcp"],
      "env": {
        "CUBEAPM_URL": "https://cubeapm.internal.company.com"
      }
    }
  }
}
```

**Production (with host/port):**

```json
{
  "mcpServers": {
    "cubeapm": {
      "command": "npx",
      "args": ["-y", "cubeapm-mcp"],
      "env": {
        "CUBEAPM_HOST": "cubeapm.internal.company.com",
        "CUBEAPM_QUERY_PORT": "3140"
      }
    }
  }
}
```

## Available Tools
### Logs

| Tool | Description |
|------|-------------|
| `query_logs` | Query logs using LogsQL syntax, with a time range and result limit |

**Parameters:**

- `query` - LogsQL query string (e.g., `{service="api"} error`)
- `start` - Start time (RFC3339 or Unix timestamp)
- `end` - End time (RFC3339 or Unix timestamp)
- `limit` - Maximum entries to return (default: 100)
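As an illustration of how these parameters fit together, the sketch below assembles a `query_logs` argument object and applies the documented default limit. `buildQueryLogsArgs` is a hypothetical helper, not part of the package's API:

```typescript
// Hypothetical helper illustrating the query_logs parameters above;
// not part of the cubeapm-mcp source.
interface QueryLogsArgs {
  query: string;  // LogsQL query string
  start: string;  // RFC3339 or Unix timestamp
  end: string;    // RFC3339 or Unix timestamp
  limit: number;  // maximum entries to return
}

function buildQueryLogsArgs(
  query: string,
  start: string,
  end: string,
  limit?: number
): QueryLogsArgs {
  // The documented default limit is 100.
  return { query, start, end, limit: limit ?? 100 };
}

const args = buildQueryLogsArgs(
  '{service="api"} error',
  "2024-01-01T00:00:00Z",
  "2024-01-01T01:00:00Z"
);
console.log(args.limit); // 100 (default applied)
```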
### Metrics

| Tool | Description |
|------|-------------|
| `query_metrics_instant` | Execute a PromQL query at a single point in time |
| `query_metrics_range` | Execute a PromQL query over a time range |

**Instant Query Parameters:**

- `query` - PromQL expression
- `time` - Evaluation timestamp
- `step` - Optional time window in seconds

**Range Query Parameters:**

- `query` - PromQL expression
- `start` / `end` - Time range
- `step` - Resolution in seconds
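For range queries, `step` controls how many data points come back, so it is often derived from the window size. A small sketch of one common heuristic (the helper name is mine, not part of the package):

```typescript
// Derive a range-query step (in seconds) from the time window and a
// target number of data points. chooseStep is a hypothetical helper.
function chooseStep(startUnix: number, endUnix: number, targetPoints = 100): number {
  const windowSec = endUnix - startUnix;
  // At least 1-second resolution; round up so the point count never
  // exceeds targetPoints.
  return Math.max(1, Math.ceil(windowSec / targetPoints));
}

// A 1-hour window at ~100 points gives a 36-second step.
console.log(chooseStep(1_700_000_000, 1_700_003_600)); // 36
```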
### Traces

| Tool | Description |
|------|-------------|
| `search_traces` | Search traces by service, environment, or custom query |
| `get_trace` | Fetch complete trace details by trace ID |

**Search Parameters (Required):**

- `query` - Search query (default: `*` for wildcard)
- `env` - Environment filter (default: `UNSET`)
- `service` - Service name filter (required, case-sensitive)
- `start` / `end` - Time range (RFC3339 or Unix timestamp)

**Search Parameters (Optional):**

- `limit` - Maximum results (default: 20)
- `spanKind` - Filter by span type: `server`, `client`, `consumer`, `producer`
- `sortBy` - Sort by `duration` (useful for finding slow traces)

**Get Trace Parameters:**

- `trace_id` - Hex-encoded trace ID
- `start` / `end` - Time range to search within
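The defaults and the required `service` filter can be summarized in code. This is a hedged sketch for illustration only; `buildSearchTracesArgs` is a hypothetical helper, not the package's actual validation logic:

```typescript
// Hypothetical illustration of the search_traces parameters above:
// applies the documented defaults and enforces the required service filter.
interface SearchTracesArgs {
  query: string;
  env: string;
  service: string;
  start: string;
  end: string;
  limit: number;
}

function buildSearchTracesArgs(
  service: string,
  start: string,
  end: string,
  opts: { query?: string; env?: string; limit?: number } = {}
): SearchTracesArgs {
  if (!service) {
    // service is required and case-sensitive per the docs above.
    throw new Error('service is required (case-sensitive, e.g. "Kratos-Prod")');
  }
  return {
    query: opts.query ?? "*",  // documented default: wildcard
    env: opts.env ?? "UNSET",  // documented default
    service,
    start,
    end,
    limit: opts.limit ?? 20,   // documented default
  };
}

console.log(buildSearchTracesArgs("Kratos-Prod", "0", "1").env); // "UNSET"
```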
### Ingestion

| Tool | Description |
|------|-------------|
| `ingest_metrics_prometheus` | Send metrics in Prometheus text exposition format |
### Prompts

Pre-defined templates for common observability tasks:

| Prompt | Description |
|--------|-------------|
| `investigate-service` | Comprehensive service investigation: checks errors, latency, and traces |
| `check-latency` | Get P50, P95, and P99 latency percentiles for a service |
| `find-slow-traces` | Find the slowest traces to identify performance bottlenecks |
**Usage Example:**

```
Use the investigate-service prompt for Kratos-Prod
```

### Resources

Readable resources exposing CubeAPM data and configuration:

| Resource URI | Description |
|--------------|-------------|
| `cubeapm://config` | Current CubeAPM connection configuration |
| `cubeapm://query-patterns` | Query patterns and naming conventions reference |
## CubeAPM Query Patterns

### Metrics (PromQL / MetricsQL)

CubeAPM uses naming conventions that differ from standard OpenTelemetry:

| What | CubeAPM Convention |
|------|--------------------|
| Metric prefix | `cube_apm_*` (e.g., `cube_apm_calls_total`, `cube_apm_latency_bucket`) |
| Service label | `service` (NOT `server` or `service_name`) |
| Common labels | `env`, `service`, `span_kind`, `status_code`, `http_code` |
#### Histogram Queries (P50, P90, P95, P99)

CubeAPM uses VictoriaMetrics-style histograms with `vmrange` labels instead of Prometheus `le` buckets:
```
# ✅ Correct - use histogram_quantiles() with vmrange
histogram_quantiles("phi", 0.95, sum by (vmrange, service) (
  increase(cube_apm_latency_bucket{service="MyService", span_kind="server"}[5m])
))

# ❌ Wrong - standard Prometheus syntax won't work
histogram_quantile(0.95, sum by (le) (rate(http_request_duration_bucket[5m])))
```

**Note:** Latency values are returned in seconds (0.05 = 50ms).
### Logs (LogsQL)

#### Stream Selectors

Log labels vary by source. Run a bare `*` query first to discover the available labels:

| Source | Common Labels |
|--------|---------------|
| Lambda functions | `faas.name`, `faas.arn`, `env`, `aws.lambda_request_id` |
| Services | `service_name`, `level`, `host` |
```
# Discover all labels
*

# Lambda function logs
{faas.name="my-lambda-prod"}

# Regex match
{faas.name=~".*-prod"}

# Text filter with boolean operators
{faas.name=~".*"} AND "error" AND NOT "retry"
```

#### Pipe Operators
Chain pipes after any query using `|`:
| Pipe | Syntax | Description |
|------|--------|-------------|
| copy | \| copy src AS dst | Copy field value |
| drop | \| drop field1, field2 | Remove fields from output |
| extract_regexp | \| extract_regexp "(?P<name>re)" | Extract via named capture groups |
| join | \| join by (field) (...subquery...) | Join with subquery results |
| keep | \| keep field1, field2 | Keep only specified fields |
| limit | \| limit N | Return at most N results |
| math | \| math result = f1 + f2 | Arithmetic (+, -, *, /, %) |
| rename | \| rename src AS dst | Rename a field |
| replace | \| replace (field, "old", "new") | Substring replacement |
| replace_regexp | \| replace_regexp (field, "re", "repl") | Regex replacement |
| sort | \| sort by (field) [asc\|desc] | Sort results |
| stats | \| stats <func> as alias [by (fields)] | Aggregate results |
| unpack_json | \| unpack_json | Extract fields from JSON body |
#### Stats Functions

Used with the `| stats` pipe:

| Function | Description |
|----------|-------------|
| `avg(field)` | Arithmetic mean |
| `count()` | Total matching entries |
| `count_empty(field)` | Entries where the field is empty |
| `count_uniq(field)` | Number of distinct values |
| `max(field)` | Maximum value |
| `median(field)` | Median (50th percentile) |
| `min(field)` | Minimum value |
| `quantile(p, field)` | p-th quantile (e.g., `quantile(0.95, duration)`) |
| `sum(field)` | Sum of values |
#### Example Log Queries

```
# Count errors per Lambda function
{faas.name=~".*"} AND "error" | stats count() as errors by (faas.name)

# Top 10 slowest requests
{service_name="my-service"} | sort by (duration) desc | limit 10

# Extract and aggregate from JSON logs
{service_name="api"} | unpack_json | stats avg(response_time) as avg_rt by (endpoint)
```

### Traces
Trace queries use the same pipe syntax as logs: `{stream_selector} | pipe1 | pipe2`

**Important notes:**

- `query`, `env`, and `service` are REQUIRED parameters
- Duration is in milliseconds (not seconds, as in metrics)
- `p95` is NOT a valid stats function; use `quantile(0.95, duration)` instead
- Service names are case-sensitive (e.g., `"Kratos-Prod"`, not `"kratos"`)
#### Example Trace Queries
```
# P95 latency for a service
{service="Kratos-Prod", span_kind="server"} | stats quantile(0.95, duration) as p95_ms

# Error count by endpoint
{service="Kratos-Prod", status_code="ERROR"} | stats count() as errors by (http_route)

# Slowest spans
{service="Kratos-Prod"} | sort by (duration) desc | limit 20
```

## Example Natural Language Queries
### Logs

```
"Show me logs from webhook-lambda-prod"
"Find all logs containing 'timeout' in the last hour"
"Count errors per Lambda function in the last 24h"
```

### Metrics
```
"What's the P95 latency for the Kratos-Prod service?"
"Show me the error rate for all services"
"List all available services in CubeAPM"
```

### Traces
```
"Find the P95 latency for Kratos-Prod using trace stats"
"Show me traces with errors in the production environment"
"Get the full waterfall for trace ID abc123"
```

## Development
```bash
# Clone the repository
git clone https://github.com/TechnicalRhino/cubeapm-mcp.git
cd cubeapm-mcp

# Install dependencies
npm install

# Run in development mode (with hot reload)
npm run dev

# Build for production
npm run build

# Test the build
npm start
```

## How It Works
```
┌─────────────────┐   MCP Protocol    ┌─────────────────┐     HTTP API     ┌─────────────────┐
│   Claude / AI   │◄─────────────────►│   cubeapm-mcp   │◄────────────────►│     CubeAPM     │
│    Assistant    │ (stdio transport) │    MCP Server   │   (REST calls)   │      Server     │
└─────────────────┘                   └─────────────────┘                  └─────────────────┘
```

The MCP server:

1. Receives tool calls from the AI assistant via stdio
2. Translates them into CubeAPM HTTP API requests
3. Returns formatted results to the assistant
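The translation step can be sketched as a mapping from tool name to HTTP request URL. The endpoint paths below are assumptions for illustration only (Prometheus-style paths; CubeAPM's real API routes may differ), and no request is actually sent:

```typescript
// Sketch of translating an MCP tool call into a CubeAPM HTTP request URL.
// The /api/v1/query and /api/v1/query_range paths are ASSUMED,
// Prometheus-style endpoints used purely for illustration; consult the
// CubeAPM API documentation for the real routes.
function toQueryUrl(
  baseUrl: string,
  toolName: string,
  args: Record<string, string>
): string {
  // Hypothetical mapping from tool name to endpoint path.
  const paths: Record<string, string> = {
    query_metrics_instant: "/api/v1/query",
    query_metrics_range: "/api/v1/query_range",
  };
  const path = paths[toolName];
  if (!path) throw new Error(`unknown tool: ${toolName}`);
  const params = new URLSearchParams(args); // Node 18+ global
  return `${baseUrl}${path}?${params.toString()}`;
}

console.log(toQueryUrl("http://localhost:3140", "query_metrics_instant", { query: "up" }));
// http://localhost:3140/api/v1/query?query=up
```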
## Requirements
- Node.js 18+
- CubeAPM instance (self-hosted or cloud)
- Claude Code or any MCP-compatible client
## Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
## License
MIT License - see LICENSE file for details.
