npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@dev-mn/budgetai

v1.0.4

Published

Claude Code × NVIDIA NIM — Free AI Coding, Zero Compromise

Readme

⚡ budgetai

AI Cost Saver for Coding — Route Claude Code to NVIDIA NIM for free

npm version npm downloads License: MIT Node.js NestJS NVIDIA NIM

PRs Welcome GitHub Stars GitHub Issues

🚀 Drop-in proxy that routes Claude Code CLI to NVIDIA NIM's free inference backend.
Get 40 req/min of production-grade AI code completion — no Anthropic subscription required.

Quick Start · Configuration · Models · Troubleshooting


💸 The Story Behind This

I've been deep in the AI coding tools rabbit hole for a while now.

Started with GitHub Copilot, moved to Cursor, tried Claude Code with max mode, bounced between a dozen tools chasing the "best" experience. At some point I stopped counting subscriptions and just... paid. Month after month.

Then the bill came. $5,000+ in API token costs. Not including subscriptions.

That was the moment I sat down and asked myself: do I actually need to be paying this much?

So I started researching. Digging through free tiers, open-weight models, inference providers. Turns out NVIDIA NIM offers surprisingly capable models completely free — with a real API, low latency, and no credit card required for the free tier.

budgetai is what came out of that research. It's a proxy that makes Claude Code CLI talk to NIM's backend instead of Anthropic's — so you get the same familiar UX, for free.

⚠️ Honest disclaimer: This is not a perfect solution. It's a work in progress. The models aren't Claude, the quality varies, and there are rate limits. I'm actively researching better approaches and will keep improving this. If you're hitting limits or have ideas, open an issue — let's figure it out together.


✨ Features

| Feature | Details | |---|---| | 🎯 Drop-in Replacement | Fully Anthropic API-compatible — Claude Code works as-is | | 🆓 Free Inference | NVIDIA NIM free tier: 40 req/min, no credit card needed | | 🔀 Per-Model Routing | Route Opus / Sonnet / Haiku to different NIM models | | ⚡ Trivial Request Optimization | 5 categories intercepted locally — zero latency, zero quota | | 🛡️ Smart Rate Limiting | Rolling window throttle + automatic 429 handling | | 🧠 Thinking Token Support | Parses <think> tags into native Anthropic-style blocks |


📦 Installation

# Recommended: install globally from npm
npm install -g @dev-mn/budgetai

# Verify
budgetai --version
git clone https://github.com/dev-mn/budgetai.git
cd budgetai
npm install && npm run build

🚀 Quick Start

1 — Get your NVIDIA API key

Head to build.nvidia.com/settings/api-keys and create a free key.

2 — Initialize config

budgetai init

This creates ~/.config/budgetai/.env. Open it and paste your key:

NVIDIA_NIM_API_KEY=nvapi-xxxxxxxxxxxxxxxxxxxxxxx

3 — Start the proxy

budgetai start
✅ Claude Code NIM Proxy running on port 8082
   Model   : nvidia_nim/z-ai/glm4.7
   Base URL : https://integrate.api.nvidia.com/v1

   export ANTHROPIC_BASE_URL=http://localhost:8082
   Then run: claude

4 — Run Claude Code

Open a second terminal:

export ANTHROPIC_BASE_URL=http://localhost:8082
claude

Windows (PowerShell)

$env:ANTHROPIC_BASE_URL="http://localhost:8082"
claude

VSCode Extension

  1. Open Settings → search claude-code.environmentVariables
  2. Click Edit in settings.json and add:
"claudeCode.environmentVariables": [
  { "name": "ANTHROPIC_BASE_URL", "value": "http://localhost:8082" }
]
  1. Reload extensions — done! ✅

⚙️ Configuration

All config lives in ~/.config/budgetai/.env:

| Variable | Default | Description | |---|---|---| | NVIDIA_NIM_API_KEY | required | Your key from build.nvidia.com | | NIM_MODEL | nvidia_nim/z-ai/glm4.7 | Default model | | NIM_BASE_URL | https://integrate.api.nvidia.com/v1 | NIM endpoint | | PORT | 8082 | Local proxy port | | RATE_LIMIT | 40 | Requests per minute | | RATE_LIMIT_WINDOW | 60 | Window in seconds | | ENABLE_THINKING | true | Parse thinking tokens |

Per-Model Routing

Map each Claude tier to a different NIM model:

MODEL_OPUS="nvidia_nim/minimaxai/minimax-m2.5"
MODEL_SONNET="nvidia_nim/qwen/qwen3.5-397b-a17b"
MODEL_HAIKU="nvidia_nim/z-ai/glm4.7"

🧩 NVIDIA NIM Models

| Model | Tag | Notes | |---|---|---| | GLM 4.7 | nvidia_nim/z-ai/glm4.7 | ⚡ Default — fast & reliable | | MiniMax M2.5 | nvidia_nim/minimaxai/minimax-m2.5 | 🏆 High quality | | Qwen 3.5 397B | nvidia_nim/qwen/qwen3.5-397b-a17b | 🔥 Largest model | | Kimi K2.5 | nvidia_nim/moonshotai/kimi-k2.5 | 🌙 Great for long context | | Step 3.5 Flash | nvidia_nim/stepfun-ai/step-3.5-flash | ⚡ Fastest alternative |

Browse the full catalog → build.nvidia.com/explore/discover


🖥️ CLI Reference

budgetai init      # Initialize config file
budgetai start     # Start the proxy server
budgetai config    # Show current configuration
budgetai --help    # Show help
budgetai --version # Show version

🔌 API Endpoints

| Method | Path | Description | |---|---|---| | POST | /v1/messages | Main streaming endpoint | | POST | /v1/messages/count_tokens | Token counting | | GET | /v1/models | List available models | | GET | /health | Health check | | GET | / | Proxy info | | POST | /stop | Stop server |


🛠️ Troubleshooting

budgetai init
nano ~/.config/budgetai/.env
# Add: NVIDIA_NIM_API_KEY=your-key-here

NVIDIA NIM free tier allows 40 req/min. The proxy will automatically retry after the window resets (60s). You can also switch to a different NIM model or API key.

Make sure the env variable is actually set in the terminal where you run claude:

echo $ANTHROPIC_BASE_URL
# Should print: http://localhost:8082

If it's empty, re-run export ANTHROPIC_BASE_URL=http://localhost:8082 before launching Claude Code.


👩‍💻 Development

npm run dev        # Dev mode with hot reload
npm run build      # Production build
npm run typecheck  # Type checking
npm run lint       # Lint

🤝 Contributing

PRs and issues are welcome! Please open an issue first for major changes.

PRs Welcome


Made with ❤️ · MIT License · Powered by NVIDIA NIM 🟢