npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

doppler-gpu

v0.2.0

Published

Browser-native WebGPU inference engine for local intent and inference loops

Readme

doppler-gpu

Inference and training on raw WebGPU. Pure JS + WGSL.

Try the live demo | npm | docs

Phase-latency comparison on one workload across models

Quick start

import { doppler } from 'doppler-gpu';

// Stream tokens
const model = await doppler.load('gemma3-270m');
for await (const token of model.generate('Describe WebGPU briefly')) {
  process.stdout.write(token);
}

// One-shot
const text = await model.generateText('Explain WebGPU in one sentence');

// LoRA hot-swap
await model.loadLoRA('https://oneshift-twoshift-redshift-blueshift.com/manifest.json');

Registry IDs resolve to hosted RDRR artifacts from Clocksmith/rdrr by default. Tokens stream from a native AsyncGenerator. See the canonical Root API guide.

Why Doppler

JS → WGSL → WebGPU. Direct JavaScript orchestration into native WebGPU kernels, avoiding ONNX runtimes, WASM blobs, and bridge layers.

for await streaming. Generation uses a native AsyncGenerator that fits normal app control flow.

LoRA hot-swap. Swap adapters at runtime without reloading the base model.

Independent model instances. Run multiple models concurrently. Each owns its pipeline, buffers, and KV cache.

Supported models

All models below are verified with deterministic greedy decoding on WebGPU hardware. Registry IDs resolve to hosted RDRR artifacts automatically.

| Model | Registry ID | Quant | Params | | --- | --- | --- | --- | | Gemma 3 270M IT | gemma3-270m | Q4K | 270M | | Gemma 3 1B IT | gemma3-1b | Q4K | 1B | | TranslateGemma 4B IT | translategemma-4b-it-q4k-ehf16-af32 | Q4K | 4B | | EmbeddingGemma 300M | google-embeddinggemma-300m-q4k-ehf16-af32 | Q4K | 300M | | Qwen 3.5 0.8B | qwen-3-5-0-8b-q4k-ehaf16 | Q4K | 0.8B | | Qwen 3.5 2B | qwen-3-5-2b-q4k-ehaf16 | Q4K | 2B | | LFM2.5 1.2B Instruct | lfm2-5-1-2b-instruct-q4k-ehf16-af32 | Q4K | 1.2B |

Additional model families (Llama 3, DeepSeek, Gemma 4 MoE, Mixtral, and others) have conversion configs ready but are not yet cataloged. See the full model support matrix for details.

Under the hood

  • Sharded weight loading via OPFS moves multi-GB weights into VRAM without blocking the main thread.
  • Quantized inference (Q4K, F16) runs practical model sizes on consumer GPUs.
  • Kernel hot-swap between prefill and decode paths with zero graph recompilation.
  • Config-driven runtime with explicit profiles, kernel-path selection, and sampling.

Documentation

Environment requirements

  • WebGPU is required.
  • Supported runtimes: WebGPU-capable browsers, or Node with a WebGPU provider.
  • Chrome / Edge 113+ supported.
  • Firefox support varies (typically behind a flag).
  • Safari support is evolving.

License

Apache License 2.0 (Apache-2.0). See LICENSE and NOTICE.