@intentsolutionsio/databricks-pack
v1.1.0
Claude Code skill pack for Databricks - 24 skills covering lakehouse platform, Spark, and ML pipelines (v1, DEPRECATED — rebuilt as 5 live-detection skills + MCP in v2.0.0)
Databricks Skill Pack
24 production-ready skills for the Databricks Lakehouse Platform — Unity Catalog, Delta Lake, MLflow, Spark SQL, Asset Bundles, and the full REST API.
[!WARNING] This pack is being rebuilt: v1.x is deprecated. Every v1 skill below carries a deprecation banner and will be removed in v2.0.0, which replaces the 24 documentation-style skills with 5 live-detection skills + a shared databricks-workspace-mcp server that runs against your own workspace. If you have any databricks-* skill in your CLAUDE.md, read Migration: v1 → v2 before upgrading. The v2 rebuild ships on the databricks-pack slug (no rename).
Installation
/plugin install databricks-pack@claude-code-plugins-plus
What It Does
This pack gives Claude Code deep operational knowledge of Databricks: real REST API endpoints (/api/2.1/jobs/*, /api/2.0/clusters/*), real Python SDK patterns (databricks-sdk WorkspaceClient, typed error handling), real Spark SQL (Auto Loader, MERGE INTO, OPTIMIZE, Liquid Clustering), and real deployment workflows (Declarative Automation Bundles, GitHub Actions CI/CD).
Every skill contains working code, not placeholder templates.
Skills (24)
Standard (S01-S12)
| Skill | What It Covers |
|-------|---------------|
| databricks-install-auth | CLI v2, Python SDK, PAT/OAuth U2M/OAuth M2M, profiles |
| databricks-hello-world | First cluster, notebook upload, runs/submit, SQL warehouse |
| databricks-local-dev-loop | Databricks Connect v2, pytest fixtures, Asset Bundle sync |
| databricks-sdk-patterns | Singleton client, typed errors, cluster lifecycle, job builder |
| databricks-core-workflow-a | Medallion ETL: Auto Loader, MERGE upserts, DLT pipelines |
| databricks-core-workflow-b | Feature Store, MLflow tracking, model registry, serving endpoints |
| databricks-common-errors | OOM, concurrent writes, permissions, schema mismatch, 429 |
| databricks-debug-bundle | Diagnostic tar.gz: cluster events, run output, driver logs |
| databricks-rate-limits | Exponential backoff, token-bucket, idempotent submissions |
| databricks-security-basics | Secret scopes, token rotation, column masking, audit queries |
| databricks-prod-checklist | Pre-deploy checklist, job YAML, rollback procedure |
| databricks-upgrade-migration | DBR version upgrade, Hive-to-Unity-Catalog, protocol upgrade |
Pro (P13-P18)
| Skill | What It Covers |
|-------|---------------|
| databricks-ci-integration | GitHub Actions, bundle validation, unit tests, OIDC auth |
| databricks-deploy-integration | Declarative Automation Bundles, targets, variables, permissions |
| databricks-webhooks-events | Notification destinations, SQL alerts, system table auditing |
| databricks-performance-tuning | Cluster sizing, AQE, Liquid Clustering, Z-order, query plans |
| databricks-cost-tuning | Billing tables, cluster policies, spot instances, instance pools |
| databricks-reference-architecture | Lakehouse layout, Unity Catalog hierarchy, maintenance jobs |
Flagship (F19-F24)
| Skill | What It Covers |
|-------|---------------|
| databricks-multi-env-setup | Dev/staging/prod profiles, per-env secrets, Terraform |
| databricks-observability | System tables, job health, cost-per-job, SQL alerts, Prometheus |
| databricks-incident-runbook | Triage script, decision tree, evidence collection, postmortem |
| databricks-data-handling | GDPR deletion, PII masking, retention enforcement, row-level security |
| databricks-enterprise-rbac | SCIM groups, Unity Catalog grants, cluster policies, service principals |
| databricks-migration-deep-dive | Hadoop/Snowflake/Redshift migration, schema conversion, cutover |
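
The databricks-rate-limits skill above lists exponential backoff for 429 responses. The following is a minimal sketch of that technique using only the Python standard library; it is not the skill's actual code. `RateLimited`, `call_with_backoff`, and all parameter names are hypothetical and exist only for illustration.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical error standing in for an HTTP 429 response."""

    def __init__(self, retry_after=None):
        super().__init__("rate limited")
        # Seconds the server asked us to wait, if it said so.
        self.retry_after = retry_after


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep, rng=random.random):
    """Call fn(), retrying on RateLimited with exponential backoff.

    The wait before retry k (0-based) is a random fraction of
    min(max_delay, base_delay * 2**k), unless the error carries an
    explicit retry_after value, which is used as-is.
    """
    for attempt in range(max_attempts):
        try:
            return fn()
        except RateLimited as exc:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            if exc.retry_after is not None:
                delay = exc.retry_after
            else:
                cap = min(max_delay, base_delay * 2 ** attempt)
                delay = cap * rng()  # full jitter spreads out concurrent retries
            sleep(delay)
```

The `sleep` and `rng` parameters are injectable so the retry schedule can be checked in a unit test without real waiting.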
Key APIs Covered
| API | Endpoints |
|-----|-----------|
| Jobs API 2.1 | POST /api/2.1/jobs/create, runs/submit, run-now |
| Clusters API 2.0 | create, list, start, delete, events |
| SQL Statement API | execute-statement |
| Unity Catalog | catalogs, schemas, tables, grants |
| DBFS / Files API | put, get, list |
| Secrets API | create-scope, put-secret, list-acls |
| SCIM API | groups, users, service-principals |
| Model Serving | serving-endpoints/create, query |
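
As a rough illustration of the Jobs API 2.1 row in the table above, this sketch builds (but does not send) a `POST /api/2.1/jobs/run-now` request with the Python standard library. The function name, host, token, and job ID are placeholders chosen for this example; the pack's own skills may use the databricks-sdk client instead.

```python
import json
import urllib.request


def build_run_now_request(host, token, job_id):
    """Build a Jobs API 2.1 run-now request without sending it."""
    url = host.rstrip("/") + "/api/2.1/jobs/run-now"
    body = json.dumps({"job_id": job_id}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            # Databricks REST calls authenticate with a bearer token.
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Placeholder values; substitute your own workspace URL, token, and job ID.
request = build_run_now_request(
    "https://example.cloud.databricks.com", "placeholder-token", 123
)
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would send it; building it separately keeps the example safe to run without a workspace.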
Usage
Skills trigger automatically on Databricks topics:
- "Set up Databricks auth" -- databricks-install-auth
- "Build a Delta Lake pipeline" -- databricks-core-workflow-a
- "Deploy my Databricks job" -- databricks-deploy-integration
- "Optimize my Spark queries" -- databricks-performance-tuning
- "Set up Unity Catalog permissions" -- databricks-enterprise-rbac
- "Migrate from Snowflake" -- databricks-migration-deep-dive
Architecture
Why two MCP servers, not one
The v2 rebuild deliberately splits its functionality across two MCP servers instead of using a single shared one. This question comes up often on contributor PRs, so it is answered once here.
- Databricks managed SQL MCP: serves system.* reads (cost data, query history, streaming progress). Operated by Databricks; we consume it.
- Custom workspace MCP: serves cluster events, instance pools, pipeline event logs, external locations, storage credentials. Operated by this pack.
The two authenticate independently. Losing access to one does not disable the other; cost-leak-hunter (SQL MCP) and cluster-forensics (workspace MCP) fail independently when their respective MCP is unavailable. Single skills can be installed without pulling in the other MCP's dependency surface.
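
The independent-failure behavior described above can be sketched as a small availability check. This is an illustration only, not the pack's code: the `Skill` type, the `usable_skills` function, and the `SQL_MCP` label are invented for this example, and only the two skill-to-MCP pairings stated in this section are included.

```python
from dataclasses import dataclass

SQL_MCP = "databricks-managed-sql-mcp"      # label invented for this sketch
WORKSPACE_MCP = "databricks-workspace-mcp"  # server name used in this README


@dataclass(frozen=True)
class Skill:
    name: str
    mcp: str  # the single MCP server this skill depends on


SKILLS = [
    Skill("databricks-cost-leak-hunter", SQL_MCP),
    Skill("databricks-cluster-forensics", WORKSPACE_MCP),
]


def usable_skills(mcp_up):
    """Return the names of skills whose MCP server is reported available.

    mcp_up maps an MCP name to True/False; a missing entry counts as down.
    """
    return [s.name for s in SKILLS if mcp_up.get(s.mcp, False)]


# Losing the workspace MCP leaves the SQL-backed skill usable.
print(usable_skills({SQL_MCP: True, WORKSPACE_MCP: False}))
```

Because each skill names exactly one MCP dependency in this model, an outage on one server never removes a skill that relies on the other.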
The full scope-boundary rationale, including the 8 → 6 endpoint cut and the auth-flow decisions, is in 000-docs/013-AT-ADEC-epic1-mcp-scope-adjustment.md. Consult that document for any "why is this skill not pulling X?" question.
Thanks to @Gingiris-1031 (#795) for surfacing the isolation-story framing that made this section necessary.
Migration: v1 → v2
Version 2.0.0 is a ground-up rebuild. The 24 v1 skills described Databricks ops;
the 5 v2 skills run them — live detection against your own workspace via a shared
databricks-workspace-mcp server (control plane) composed with the Databricks managed SQL
MCP (system.* reads). Rationale: 000-docs/007-AT-ADEC-databricks-v2-cto-decision.md
and 000-docs/013-AT-ADEC-epic1-mcp-scope-adjustment.md.
Timeline: 1.1.0 (this release — deprecation banners) → 2.0.0 (5 skills + MCP, v1
skills removed with tombstones) → 2.1.0 (tombstones cleaned up). Users on auto-update get a
2–4 week window on 1.1.0 to read these banners before 2.0.0 lands.
Where each v1 skill goes
| v1 skill | v2 destination |
|----------|----------------|
| databricks-cost-tuning | databricks-cost-leak-hunter |
| databricks-performance-tuning | databricks-cost-leak-hunter + databricks-cluster-forensics |
| databricks-incident-runbook | databricks-cluster-forensics + databricks-streaming-guardian |
| databricks-observability | databricks-streaming-guardian |
| databricks-upgrade-migration | databricks-cluster-forensics (DBR-upgrade triage) |
| databricks-debug-bundle · databricks-deploy-integration · databricks-local-dev-loop · databricks-ci-integration | databricks-bundle-medic |
| databricks-migration-deep-dive · databricks-multi-env-setup · databricks-enterprise-rbac | databricks-uc-migration-pilot |
| databricks-security-basics | databricks-uc-migration-pilot + databricks-bundle-medic (identity/secrets) |
| databricks-hello-world · databricks-install-auth · databricks-sdk-patterns · databricks-core-workflow-a · databricks-core-workflow-b · databricks-common-errors · databricks-prod-checklist · databricks-rate-limits · databricks-webhooks-events · databricks-reference-architecture · databricks-data-handling | Cut — no direct replacement (setup folds into the MCP .env.sops + each skill's ## Prerequisites; checklists/architecture move into v2 references/; error catalogs ship per-skill) |
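
To check a CLAUDE.md (or any other text) for v1 skill references before upgrading, the mapping table above can be turned into a lookup. The script below is a hypothetical helper, not something shipped with the pack; `V1_TO_V2`, `CUT`, and `find_v1_references` are names made up for this sketch, and the data is copied from the table.

```python
import re

# v1 skill -> v2 destination(s), copied from the migration table.
V1_TO_V2 = {
    "databricks-cost-tuning": ["databricks-cost-leak-hunter"],
    "databricks-performance-tuning": ["databricks-cost-leak-hunter",
                                      "databricks-cluster-forensics"],
    "databricks-incident-runbook": ["databricks-cluster-forensics",
                                    "databricks-streaming-guardian"],
    "databricks-observability": ["databricks-streaming-guardian"],
    "databricks-upgrade-migration": ["databricks-cluster-forensics"],
    "databricks-debug-bundle": ["databricks-bundle-medic"],
    "databricks-deploy-integration": ["databricks-bundle-medic"],
    "databricks-local-dev-loop": ["databricks-bundle-medic"],
    "databricks-ci-integration": ["databricks-bundle-medic"],
    "databricks-migration-deep-dive": ["databricks-uc-migration-pilot"],
    "databricks-multi-env-setup": ["databricks-uc-migration-pilot"],
    "databricks-enterprise-rbac": ["databricks-uc-migration-pilot"],
    "databricks-security-basics": ["databricks-uc-migration-pilot",
                                   "databricks-bundle-medic"],
}

# Skills cut with no direct replacement map to an empty list.
CUT = [
    "databricks-hello-world", "databricks-install-auth",
    "databricks-sdk-patterns", "databricks-core-workflow-a",
    "databricks-core-workflow-b", "databricks-common-errors",
    "databricks-prod-checklist", "databricks-rate-limits",
    "databricks-webhooks-events", "databricks-reference-architecture",
    "databricks-data-handling",
]
V1_TO_V2.update({name: [] for name in CUT})

# Matches whole skill-style names such as "databricks-cost-tuning".
SKILL_REF = re.compile(r"databricks-[a-z0-9]+(?:-[a-z0-9]+)*")


def find_v1_references(text):
    """Map each v1 skill name found in text to its v2 destinations."""
    found = {}
    for name in SKILL_REF.findall(text):
        if name in V1_TO_V2:
            found[name] = V1_TO_V2[name]
    return found


sample = "Use databricks-cost-tuning and databricks-hello-world for setup."
for v1, v2 in find_v1_references(sample).items():
    print(v1, "->", ", ".join(v2) if v2 else "cut, no direct replacement")
```

In practice you would pass the contents of your own CLAUDE.md as `text`; v2 skill names and the pack slug are ignored because they are not keys in the mapping.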
The 5 v2 skills
| v2 skill | What it does (live) |
|----------|---------------------|
| databricks-cost-leak-hunter (pilot) | "$X/month wasted" audit from your own system.billing.usage: idle clusters, All-Purpose-vs-Jobs, instance-pool waste, DLT tier, tag-based chargeback |
| databricks-cluster-forensics | Cold-start / launch-failure / Photon / DBR-upgrade triage from cluster events |
| databricks-streaming-guardian | Delta + Liquid Clustering + Structured Streaming + DLT health |
| databricks-uc-migration-pilot | Unity Catalog readiness + IAM/SCIM + access tracing (HMS deadline Sept 30, 2026) |
| databricks-bundle-medic | Asset Bundles deploy diagnostics + CMK rotation + PrivateLink audit |
Design Records
All architecture decisions, pain research, and pressure tests for this pack live in 000-docs/. Index: 000-INDEX.md.
License
MIT
