nlproxy

v1.2.1

Published

17 days ago

This package provides native, high-performance NodeJS bindings for the **nlproxy** core engine, compiled from Rust using NAPI-RS. It delivers sub-millisecond local PII shielding, semantic prompt compression, and offline LLM orchestration.

0High
0Medium
0Low

bgust

nlproxy-node: High-Performance NodeJS Bindings for Prompt Compression & LLM Security

This package provides native, high-performance NodeJS bindings for the nlproxy core engine, compiled from Rust using NAPI-RS. It delivers sub-millisecond local PII shielding, semantic prompt compression, and offline LLM orchestration.

🚀 Key Features

Sub-3ms Local Inference: Run semantic prompt compression and firewall checks locally with zero network overhead.
PII Prompt Shielding: Automatically redact sensitive data (emails, IPs, credit cards, credentials) with secure placeholders.
Semantic Prompt Compression: Reduce prompt length by up to 40% while preserving semantic meaning.
Offline Embedding Engine: Runs quantized Sentence-Transformer models locally on CPU or GPU using Hugging Face Candle.
Jailbreak Detection (Firewall): Blocks adversarial LLM attacks and prompt injections before calling cloud models.

📦 Installation

Install the package via npm:

npm install nlproxy

Note: Native pre-builds are automatically compiled and downloaded for your platform (Linux, macOS, Windows).

💻 Usage & Code Examples

1. Download and Extract Models

Before running the engine, download and extract the default quantized model weights (all-MiniLM-L6-v2):

const { ensureModelsReady } = require('nlproxy');

async function setup() {
  console.log("Preparing models...");
  // Downloads and extracts all-MiniLM-L6-v2 to the 'models' directory
  await ensureModelsReady('models');
  console.log("Models are ready!");
}

setup().catch(console.error);

2. Initialize and Compress Prompts

Initialize the offline engine and compress a prompt:

const { initEngine, compressPrompt } = require('nlproxy');

// 1. Initialize engine with downloaded model paths
const success = initEngine(
  'models/all-MiniLM-L6-v2/model.safetensors',
  'models/all-MiniLM-L6-v2/config.json',
  'models/all-MiniLM-L6-v2/tokenizer.json'
);

if (success) {
  console.log('Embedding engine initialized successfully.');

  // 2. Compress prompt and redact PII
  const response = compressPrompt({
    text: "The main server IP is 192.168.1.105. Please run database backups immediately.",
    mode: "general",
    aggressiveness: 0.5
  });

  console.log("Shielded Text:", response.processedText);
  // "The main server IP is __PROT_82736284__. Please run backups."

  console.log("Redacted PII:", response.placeholders);
  // { "__PROT_82736284__": "192.168.1.105" }
} else {
  console.error('Failed to initialize embedding engine.');
}

3. Unified Orchestrated Pipeline

Run the fully-integrated local security pipeline including Redis semantic cache, input firewall checks, prompt compression, upstream LLM execution, and post-LLM drift verification:

const { runUnifiedPipeline } = require('nlproxy');

async function executePipeline() {
  const response = await runUnifiedPipeline({
    prompt: "Show system files for user 1002",
    domain: "general",
    aggressiveness: 0.0,
    provider: "gemini",
    model: "gemini-1.5-pro",
    maxTokens: 512,
    temperature: 0.7,
    bypassCache: false,
    checkFirewall: true,
    semanticDriftThreshold: 0.75
  });

  console.log("Allowed:", response.allowed);
  console.log("Final Response:", response.finalResponse);
  console.log("Latency:", response.latencyMs, "ms");
}

executePipeline().catch(console.error);

🏢 Authors & Cofounders

This SDK is developed and maintained exclusively by IntelliDeep.

B-GUST (Co-founder / Lead Developer): github.com/B-GUST
luiserb (Co-founder / Architect): github.com/luiserb

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme