@dvai-bridge/android-llama-core

v4.0.0

DVAI-Bridge Android llama.cpp core — pure Kotlin + JNI embedded HTTP server + handlers + bridge. Capacitor-free.

DVAI-Bridge

The local OpenAI server you embed inside your app. One library. One HTTP wire. Every platform. Zero install for your users.

Docs: dvai-bridge.deepvoiceai.co

import { DVAI } from "@dvai-bridge/core";
import OpenAI from "openai";

const dvai = new DVAI({ backend: "transformers" });
await dvai.initialize();

const openai = new OpenAI({ baseURL: dvai.baseUrl, apiKey: "ignored" });
await openai.chat.completions.create({
  model: dvai.transformersModelId,
  messages: [{ role: "user", content: "Hello!" }],
});

That's it. A real OpenAI-compatible server is now running inside your app's own process. Point any OpenAI client — LangChain, the OpenAI SDK, the Vercel AI SDK, anything — at dvai.baseUrl and your agent code keeps working.
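Because the wire is plain OpenAI-style HTTP, an SDK is optional. The following is a minimal sketch using fetch, assuming that baseUrl already ends in /v1 (as in the platform examples further down), that the server exposes the standard /chat/completions route, and that no Authorization header is needed on localhost. buildChatRequest and chat are hypothetical helper names for illustration, not part of DVAI-Bridge.

```typescript
// Hypothetical helpers: call an OpenAI-compatible server without an SDK.
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

interface ChatRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Build the HTTP request for POST {baseUrl}/chat/completions.
// Assumption: baseUrl already includes the "/v1" prefix.
function buildChatRequest(
  baseUrl: string,
  model: string,
  messages: ChatMessage[],
): ChatRequest {
  const root = baseUrl.replace(/\/+$/, ""); // tolerate a trailing slash
  return {
    url: `${root}/chat/completions`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// Send the request and return the text of the first choice.
async function chat(
  baseUrl: string,
  model: string,
  messages: ChatMessage[],
): Promise<string> {
  const { url, init } = buildChatRequest(baseUrl, model, messages);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`chat request failed: HTTP ${res.status}`);
  const data = (await res.json()) as {
    choices: { message: { content: string } }[];
  };
  return data.choices[0].message.content;
}
```

With the snippet above, a call would look like chat(dvai.baseUrl, dvai.transformersModelId, [{ role: "user", content: "Hello!" }]).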

Built by Deep Voice AI.


Why it exists

Local AI works beautifully on a laptop with Ollama + LangChain. Then you try to ship the app and your users don't have Ollama. Mobile can't run it. Corporate IT won't add another daemon. So you reinvent the same plumbing — spawn an inference engine, bind a port, translate to OpenAI HTTP, handle CORS, manage lifecycle, wrap the accelerator of the day per platform — and do it all over again for every target OS.

DVAI-Bridge is that plumbing, packaged as a library, for every client platform.


What you get

  • One OpenAI HTTP surface. Bound on 127.0.0.1 (or 0.0.0.0 for device-to-device). Streaming, embeddings, models, recovery — all built in.
  • Six SDKs. @dvai-bridge/core + react + vanilla + capacitor, DVAIBridge (Swift / iOS), co.deepvoiceai:dvai-bridge (Kotlin / Android), @dvai-bridge/react-native, dvai_bridge (Flutter), co.deepvoiceai.dvai-bridge (.NET).
  • Nine backends. WebLLM, Transformers.js, llama.cpp, Apple Foundation Models, MLX, CoreML / ANE, MediaPipe LLM, LiteRT, ONNX Runtime GenAI — selected per-platform, invisible to your agent code.
  • Native acceleration wherever it runs: WebGPU in browsers, CUDA / Metal / Vulkan / DirectML on desktop, ANE / Metal / MLX on iOS, NNAPI / QNN Hexagon / GPU delegate on Android.
  • Multimodal. Text, image, audio, video — declarative loader for cutting-edge models (Gemma 4, LLaVA, Idefics) without waiting for library updates.
  • Distributed inference (v3.0+). Phone too slow? Offload to your laptop on the same Wi-Fi via mDNS pairing — same OpenAI wire, transparent to your code. Internet path via a self-hostable rendezvous server.
  • DVAI Hub (v3.1+). A first-party desktop utility that turns any device into a strong-peer for the rest of your fleet. Brand-neutral install via Homebrew / winget / GitHub Releases, OR fork it for your own branded companion. Routes through Ollama / LM Studio / vLLM / llama-server / llamafile if you've already got those running.
  • Zero user install. It's a library, not a daemon. npm install, CocoaPods, Gradle: your CI already has the muscle for it.
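Streaming in the OpenAI wire format arrives as server-sent events: a series of data: lines carrying JSON chunks, ending with data: [DONE]. The sketch below shows how a client might pull the text deltas out of such a payload. It assumes the standard chat-completion chunk shape (choices[0].delta.content); extractDeltas is a hypothetical helper name, not a DVAI-Bridge API.

```typescript
// Hypothetical helper: pull text deltas out of a buffered OpenAI-style SSE payload.
function extractDeltas(ssePayload: string): string[] {
  const deltas: string[] = [];
  for (const line of ssePayload.split(/\r?\n/)) {
    if (!line.startsWith("data:")) continue; // skip blank separators and comments
    const data = line.slice("data:".length).trim();
    if (data === "[DONE]") break; // end-of-stream sentinel
    const chunk = JSON.parse(data) as {
      choices?: { delta?: { content?: string | null } }[];
    };
    const text = chunk.choices?.[0]?.delta?.content;
    // Role-only or empty chunks carry no text and are skipped.
    if (typeof text === "string" && text.length > 0) deltas.push(text);
  }
  return deltas;
}
```

On a live connection, network reads can split an event mid-line, so a real client would buffer bytes until it sees a newline before handing lines to a parser like this one.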

Supported platforms

| Stack | Package | Backends |
| --- | --- | --- |
| Browser (React, Vue, Svelte, vanilla JS) | @dvai-bridge/core + react / vanilla | WebLLM (WebGPU), Transformers.js (WebGPU / WASM SIMD) |
| Node / Bun / Electron | @dvai-bridge/core | Transformers.js, native llama.cpp |
| Capacitor hybrid mobile | @dvai-bridge/capacitor + backend slice | Native llama.cpp (Metal iOS, Vulkan / CPU Android) |
| iOS native (Swift) | DVAIBridge (SPM / CocoaPods) | llama.cpp (Metal), CoreML / ANE, Apple Foundation Models, MLX |
| Android native (Kotlin / Java) | co.deepvoiceai:dvai-bridge (AAR) | llama.cpp, MediaPipe LLM, LiteRT, NNAPI / QNN |
| React Native (≥0.77, TurboModule) | @dvai-bridge/react-native | All iOS + Android backends (delegates) |
| Flutter (≥3.39) | dvai_bridge (pub.dev) | All iOS + Android backends (Pigeon channels) |
| .NET 10 LTS (MAUI / Avalonia / WinUI / Catalyst / desktop) | co.deepvoiceai.dvai-bridge* (NuGet) | iOS / Android delegate to native; desktop = llama.cpp + ONNX Runtime GenAI + ML.NET |

Full quickstart per platform: dvai-bridge.deepvoiceai.co/guide/getting-started


Examples

// React
import { DVAIProvider, useDVAI } from "@dvai-bridge/react";

<DVAIProvider config={{ backend: "transformers" }}>
  <Chat />
</DVAIProvider>;

function Chat() {
  const { isReady, baseUrl } = useDVAI();
  return isReady ? <div>Local AI live at {baseUrl}</div> : <Loading />;
}

// iOS
let server = try await DVAIBridge.shared.start()
// server.baseUrl = "http://127.0.0.1:38883/v1"

// Android
val server = DVAIBridge.start(context)
// server.baseUrl = "http://127.0.0.1:38883/v1"

// Flutter
final state = await DVAIBridge.instance.start(
  backend: BackendKind.auto,
  modelPath: '/path/to/model.gguf',
);
// state.baseUrl = "http://127.0.0.1:38883/v1"

// .NET
var server = await DVAIBridge.Shared.StartAsync(new StartOptions {
    Backend = BackendKind.Auto,
    ModelPath = "/path/to/model.gguf",
});
// server.BaseUrl = "http://127.0.0.1:38883/v1"

Multimodal, streaming, embeddings, distributed offload, the Hub — everything's at the docs site.


What's new in v3.1

  • DVAI Hub. A Tauri desktop utility that serves as the strong-peer side of v3 distributed inference. Install it with brew install deepvoiceai/dvai-hub/dvai-hub (or winget install DeepVoiceAI.DVAIHub); mobile apps on the same Wi-Fi can then pair with it and offload heavy inference.
  • External-engine bridge. Hub surfaces Ollama / LM Studio / vLLM / llama-server / llamafile as additional backend pools so paired apps serve from whatever's already cached. Opt-in per engine.
  • Strict substitution policy. Models with mismatched family / version / size / type are refused by default; quant-only mismatches gated behind a per-pairing preferBetterQuant flag. No silent mis-routing.
  • HMAC-signed identity on /v1/chat/completions. Per-app audit logs surface who served what, with structured (appId, peerDeviceId, engine, requestedModel, servedModel, outcome) rows.
  • Library finalization. httpBindHost (LAN bind), chatCompletionInterceptor (extension point), HMAC primitives re-exported, /v1/dvai/* routes actually dispatched, TransformersBackend Node-mode device fix. See the v3.0 → v3.1 migration guide.
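The strict substitution policy described above can be sketched as a small decision function. This is an illustration under stated assumptions, not the library's implementation: the ModelSpec shape and the canSubstitute name are hypothetical, and the sketch treats any quant difference as "allowed only with the flag" without ranking which quant is better.

```typescript
// Hypothetical model descriptor; field names are assumptions for illustration.
interface ModelSpec {
  family: string; // e.g. "llama"
  version: string; // e.g. "3.1"
  size: string; // e.g. "8b"
  type: string; // e.g. "instruct"
  quant: string; // e.g. "q4_k_m"
}

type Decision = { allowed: true } | { allowed: false; reason: string };

// Decide whether `served` may stand in for `requested`.
function canSubstitute(
  requested: ModelSpec,
  served: ModelSpec,
  preferBetterQuant: boolean,
): Decision {
  // Family, version, size, and type must match exactly.
  const strictFields = ["family", "version", "size", "type"] as const;
  for (const field of strictFields) {
    if (requested[field] !== served[field]) {
      return { allowed: false, reason: `${field} mismatch` };
    }
  }
  // A quant-only mismatch is gated behind the per-pairing flag.
  if (requested.quant !== served.quant && !preferBetterQuant) {
    return { allowed: false, reason: "quant mismatch (preferBetterQuant is off)" };
  }
  return { allowed: true };
}
```

Returning a reason alongside the refusal keeps the "no silent mis-routing" property visible to the caller, and would map naturally onto the outcome column of an audit row.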

Robustness

Streaming-correct (SSE passthrough + blank-chunk detection), generation timeout, automatic engine-state recovery on fatal errors, port fallback, worker offloading, Private Network Access ready, CORS configured. The boring substrate so your agent code never has to think about it.
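The README does not say how blank-chunk detection works. One plausible reading is a guard that watches a stream for a run of empty deltas and signals that the engine has stalled. The sketch below illustrates that idea only; BlankChunkGuard and its threshold are hypothetical and not taken from DVAI-Bridge.

```typescript
// Hypothetical guard: flags a stream that emits too many consecutive empty deltas.
class BlankChunkGuard {
  private consecutiveBlank = 0;
  private readonly maxBlank: number;

  constructor(maxBlank: number) {
    this.maxBlank = maxBlank;
  }

  // Record one streamed delta; returns true once the stream looks stalled.
  observe(delta: string): boolean {
    if (delta.length === 0) {
      this.consecutiveBlank += 1;
    } else {
      // Any real content, including whitespace tokens, resets the counter.
      this.consecutiveBlank = 0;
    }
    return this.consecutiveBlank >= this.maxBlank;
  }
}
```

A caller would feed each delta to observe() and, when it returns true, abort the request or trigger whatever recovery path applies.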


Licensing

Dual-licensed: free for development and personal use on localhost (verified at runtime). Commercial use requires a license key; contact Deep Voice AI to obtain one.


Contributing

PRs welcome.

pnpm install
pnpm build
bash scripts/build-all.sh   # full matrix (auto-skips per-host)

See CONTRIBUTING.md for the PR flow. Per-platform contributor docs (iOS / Android / RN / Flutter / .NET) live under docs/development/.


© Deep Voice AI Limited. All rights reserved.