@auxot/model-registry
v1.1.4
Published
Curated catalog of GGUF models for the Auxot GPU worker ecosystem
Readme
@auxot/model-registry
Shared model registry package for worker-cli and web applications.
This package contains curated information about LLM models available for use with worker-cli, including model metadata, Hugging Face IDs, VRAM requirements, and capabilities.
Structure
registry.json- Generated JSON file containing model definitions (committed to repo)src/types.ts- TypeScript type definitionssrc/schemas.ts- Zod validation schemassrc/loader.ts- Runtime loader for registry.jsonsrc/query.ts- Query functions for filtering and searching modelsscripts/build-registry.ts- Build script to generate registry.json from Hugging Face
Usage
import { loadRegistry, getModels, getModelById, suggestModelsForVRAM } from '@auxot/model-registry';
// Load the registry
const registry = loadRegistry();
// Get all models
const allModels = getModels(registry);
// Filter models
const chatModels = getModels(registry, { capabilities: ['chat'] });
const largeModels = getModels(registry, { min_vram_gb: 20 });
// Get a specific model
const model = getModelById(registry, 'qwen2.5-coder-30b-q4_k_m');
// Suggest models for VRAM budget
const suggestions = suggestModelsForVRAM(registry, 24, 1); // 24GB VRAM, parallelism 1Building the Registry
To generate or update registry.json, run:
pnpm --filter @auxot/model-registry run build:registryThis will scan Hugging Face for compatible models and generate the registry JSON file.
The registry is committed to the repository, so it's available at runtime without needing to run the build script.
Registry JSON Structure
See src/types.ts for the complete TypeScript definitions. The registry contains:
version- Registry version (semver)generated_at- ISO timestamp when registry was generatedmodels- Array of model entries with:id- Unique identifiermodel_name- Normalized model namehuggingface_id- Full Hugging Face repo IDquantization- Quantization level (e.g. "Q4_K_M")family- Model family (MoE or Dense)parameters- Parameter count (e.g. "30B")default_context_size- Default context window sizevram_requirements_gb- Estimated VRAM neededcapabilities- Model capabilities (chat, vision, embedding, code)file_name- GGUF filenamefile_size_bytes- File size (optional)
