homingo

v0.11.0

Published

16 hours ago

Homingo — the homing instinct for your AI skills. Detect, diagnose, and fix routing drift.

Downloads

555

0High
0Medium
0Low

rk-yen

ai skills routing drift lint audit llm cli devtools

The Problem

You build 10 AI skills and routing works great. You build 50 and the model starts picking the wrong one — "Tax Optimizer" fires for invoice summaries, "Legal Review" and "Legal Compliance" blur together. Descriptions accumulate overlap silently. There is no tooling to detect this.

Homingo is the orchestration layer that sits on top of your AI platform's skill primitives. It turns "I hope the model picks the right tool" into "I have verified the model will pick the right tool."

Quick Start

# Install
npm install -g homingo

# Init/Configure
homingo init

# Run a read-only diagnostic on your skill fleet
homingo audit

# Pre-deploy validation with fix suggestions
homingo lint

# Test a single skill before shipping it
homingo lint --skill my-new-skill

Commands

`homingo init`

Interactive setup. Creates a .homingo/config.json and scaffolds a skills directory with a sample SKILL.md.

homingo init

`homingo audit`

Read-only fleet diagnostic. No writes, no rewrites, just truth.

homingo audit --skills-dir ./skills

Outputs a conflict map showing which skill pairs have routing overlap, an estimated fleet error rate, severity-ranked pair list, and the top offenders. The report opens as a self-contained HTML file.

| Flag | Description | |------|-------------| | --all-pairs | Test every skill pair (expensive, thorough) | | --prompts <n> | Adversarial prompts per pair (default: 25) | | --model <model> | LLM model to use | | --enhanced | Bigram matching for improved overlap detection | | --concurrency <n> | Max parallel API calls (default: 5) | | --dry-run | Show pairs without making API calls | | --json | Output JSON instead of terminal table | | --no-open | Don't auto-open the HTML report |

`homingo lint`

Pre-deploy validation. Tests routing accuracy, detects scope overload, suggests description rewrites, and proposes skill sharding when descriptions cover too many intents.

# Fleet-wide lint
homingo lint

# Single-skill lint (test one skill against its neighbors)
homingo lint --skill invoice-summary

# Auto-fix descriptions and write shard plans
homingo lint --fix

| Flag | Description | |------|-------------| | --skill <name> | Test one skill against its neighbors instead of the full fleet | | --threshold <pct> | Accuracy threshold to pass (default: 90) | | --neighbors <n> | Max neighbors in single-skill mode (default: 5) | | --fix | Apply suggested rewrites and shard plans to SKILL.md files | | --force | Run scope overload checks on all skills, not just flagged ones | | --prompts <n> | Adversarial prompts per pair (default: 25) | | --model <model> | LLM model to use | | --enhanced | Bigram matching for improved overlap detection | | --concurrency <n> | Max parallel API calls (default: 5) | | --dry-run | Show pairs and scope checks without making API calls | | --json | Output JSON only | | --no-open | Don't auto-open the HTML report |

`homingo logs`

Browse past audit and lint results. Opens an HTML viewer with tabs for each command type.

homingo logs         # html output
homingo logs --json  # raw json data

How It Works

Homingo's core engine is the Shadow Router; an adversarial prompt generator combined with a routing simulator.

Skill Fleet
    │
    ▼
┌─────────────────────┐
│   Pair Selector     │  Identifies which skill pairs have overlapping descriptions
└─────────┬───────────┘
          │
          ▼
┌─────────────────────┐
│  Prompt Generator   │  Creates adversarial prompts at the boundary between two skills
└─────────┬───────────┘
          │
          ▼
┌─────────────────────┐
│  Routing Simulator  │  Asks the LLM: "Given these skills, which one handles this prompt?"
└─────────┬───────────┘
          │
          ▼
┌─────────────────────┐
│   Conflict Scorer   │  Aggregates results into accuracy scores and severity ratings
└─────────┬───────────┘
          │
          ▼
┌─────────────────────┐
│ Description Rewriter│  Suggests targeted rewrites to reduce routing confusion
└─────────────────────┘

Skill Format

Each skill is a directory with a SKILL.md file using YAML frontmatter:

---
name: invoice-summary
description: "Generates concise summaries of invoice documents, extracting key fields like total amount, due date, vendor name, and line items."
---

# Invoice Summary

Additional context, examples, or documentation for the skill.

Overlap Detection

Homingo uses a two-stage approach:

Heuristic pair selection: Jaccard similarity on tokenized descriptions + name keyword overlap. Optional bigram matching with --enhanced. This runs locally with no API calls.
LLM-based routing simulation: For selected pairs, Homingo generates adversarial prompts and asks the model to route them. This measures actual routing accuracy, not just textual similarity.

Scope Overload Detection

When a skill description exceeds 1024 characters or shows signs of multi-intent scope (multiple semicolons, excessive clauses, "and [verb]" patterns), Homingo flags it as overloaded and proposes a shard plan: splitting into focused sub-skills plus an orchestrator.

Configuration

Homingo stores its configuration in .homingo/config.json:

{
  "model": "claude-sonnet-4-20250514",
  "skillsDir": "./skills",
  "shadowRouter": {
    "promptsPerPair": 25,
    "minPrompts": 10,
    "accuracyThreshold": 90,
    "maxIterations": 3
  },
  "output": {
    "reportDir": "./reports",
    "format": "both"
  }
}

Provider Support

Homingo supports both Anthropic and OpenAI models. The provider is auto-detected from the model name:

claude-* → Anthropic
gpt-*, o1-*, o3-* → OpenAI

Reports

Every audit and lint run generates:

JSON metadata: run parameters, token usage, results
Self-contained HTML report: no external dependencies, opens in any browser

Reports are stored in the configured reportDir (default: ./reports/) with automatic retention of the last 10 runs per command. Use homingo logs to browse them.

Requirements

Node.js ≥ 22
An API key for Anthropic or OpenAI

Development

git clone https://github.com/homingo/homingo.github.io.git
cd homingo
npm install
npm run build
npm test

npm run typecheck   # TypeScript type checking
npm run lint        # ESLint
npm run format      # Prettier
npm run test:watch  # Vitest in watch mode

Security

See SECURITY.md for our security policy and how to report vulnerabilities.

Contributing

See CONTRIBUTING.md for guidelines on how to contribute.

Special thanks to Chris Mikelson for his support.

Thanks to all contributors:

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme