@guardianclaw/voltagent

v0.3.0

Published

8 days ago

AI safety guardrails for VoltAgent: CLAW protocol validation, OWASP protection, and PII detection for autonomous AI agents

AI safety guardrails for VoltAgent applications. Implements CLAW protocol validation, OWASP security protection, and PII detection/redaction.

Validated against @voltagent/core@1.5.2 on 2026-04-23.

Features

CLAW Protocol: Credibility, Limits, Avoidance, Worth validation gates
OWASP Protection: SQL injection, XSS, command injection, SSRF detection
PII Detection: Email, phone, SSN, credit card, API keys, and more
Streaming Support: Real-time PII redaction for streaming responses
VoltAgent Native: Works directly with VoltAgent's guardrail system

Installation

npm install @guardianclaw/voltagent

Quick Start

The simplest way to add GuardianClaw protection to your VoltAgent agent:

import { Agent } from "@voltagent/core";
import { createGuardianClawGuardrails } from "@guardianclaw/voltagent";

// Create guardrails with preset configuration
const { inputGuardrails, outputGuardrails } = createGuardianClawGuardrails({
  level: "strict",
  enablePII: true,
});

// Add to your agent
const agent = new Agent({
  name: "safe-agent",
  inputGuardrails,
  outputGuardrails,
});

Configuration Presets

| Level | Description |
|-------|-------------|
| permissive | Log only, no blocking. Good for development. |
| standard | Block unsafe content, CLAW + OWASP enabled. Recommended for production. |
| strict | All validations, block on any issue. For high-security applications. |
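One way to apply the table above is to derive the preset level from the deployment environment. The sketch below is a hypothetical helper, not part of the package; the name `presetLevelFor` and the `highSecurity` flag are assumptions made for illustration.

```typescript
// Hypothetical helper (not exported by @guardianclaw/voltagent).
// The three level names come from the presets table; the selection
// rules are an assumption based on the table's descriptions.
type PresetLevel = "permissive" | "standard" | "strict";

function presetLevelFor(environment: string, highSecurity = false): PresetLevel {
  // High-security deployments always get every validation.
  if (highSecurity) return "strict";
  // Only an explicit development environment falls back to log-only mode.
  return environment === "development" ? "permissive" : "standard";
}

// Intended usage, assuming the package is installed:
// const guardrails = createGuardianClawGuardrails({
//   level: presetLevelFor("production"),
// });
```

Defaulting unknown environments to "standard" rather than "permissive" keeps blocking enabled unless development mode is requested explicitly.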

Usage Examples

Basic Input Protection

import { Agent } from "@voltagent/core";
import { createGuardianClawInputGuardrail } from "@guardianclaw/voltagent";

const inputGuard = createGuardianClawInputGuardrail({
  enableCLAW: true,
  enableOWASP: true,
  blockUnsafe: true,
});

const agent = new Agent({
  inputGuardrails: [inputGuard],
});

PII Redaction in Responses

import { Agent } from "@voltagent/core";
import { createGuardianClawOutputGuardrail } from "@guardianclaw/voltagent";

const outputGuard = createGuardianClawOutputGuardrail({
  enablePII: true,
  redactPII: true,
});

const agent = new Agent({
  outputGuardrails: [outputGuard],
});

// Input: "Contact user@example.com or call 555-123-4567"
// Output: "Contact [EMAIL] or call [PHONE]"
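To make the redaction step concrete, here is a minimal standalone sketch of pattern-based replacement. It is not the package's implementation: the two regular expressions, the `redactSketch` name, and the treatment of a string format as a fixed replacement are all assumptions. Only the `[EMAIL]`/`[PHONE]` placeholders and the "string or function" shape of `redactionFormat` come from this README.

```typescript
// Illustrative only: real PII detection needs far more robust patterns.
type RedactionFormat = string | ((type: string, value: string) => string);

const PII_SKETCH_PATTERNS: { type: string; pattern: RegExp }[] = [
  { type: "EMAIL", pattern: /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g },
  { type: "PHONE", pattern: /\b\d{3}[-.]\d{3}[-.]\d{4}\b/g },
];

function redactSketch(text: string, format?: RedactionFormat): string {
  let result = text;
  for (const { type, pattern } of PII_SKETCH_PATTERNS) {
    result = result.replace(pattern, (value) => {
      // A function format receives the PII type and the matched value.
      if (typeof format === "function") return format(type, value);
      // Assumption: a string format is used verbatim; default is "[TYPE]".
      return format ?? `[${type}]`;
    });
  }
  return result;
}

const redacted = redactSketch("Contact jane@example.com or call 555-123-4567");
// redacted === "Contact [EMAIL] or call [PHONE]"
```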

Specialized Guardrails

import {
  createChatGuardrails,
  createAgentGuardrails,
  createPrivacyGuardrails,
} from "@guardianclaw/voltagent";

// For chat applications (jailbreak prevention)
const chatGuards = createChatGuardrails();

// For agent applications (tool call protection)
const agentGuards = createAgentGuardrails();

// For privacy-sensitive applications (full PII protection)
const privacyGuards = createPrivacyGuardrails();

Custom Patterns

import { createGuardianClawInputGuardrail } from "@guardianclaw/voltagent";

const guard = createGuardianClawInputGuardrail({
  customPatterns: [
    {
      pattern: /internal\s+only/i,
      name: "Internal content restriction",
      gate: "limits",
      severity: "high",
    },
  ],
});
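As a rough mental model of what a custom pattern does, the sketch below checks content against a list of pattern definitions and reports which ones matched. The field names mirror the example above; the `PatternSketch` and `ViolationSketch` types, the `findViolations` function, and every gate identifier other than "limits" are assumptions, not the package's actual types.

```typescript
// Gate identifiers other than "limits" are assumed from the CLAW acronym.
type Gate = "credibility" | "limits" | "avoidance" | "worth";
type Severity = "low" | "medium" | "high" | "critical";

interface PatternSketch {
  pattern: RegExp;
  name: string;
  gate: Gate;
  severity: Severity;
}

interface ViolationSketch {
  name: string;
  gate: Gate;
  severity: Severity;
}

function findViolations(content: string, patterns: PatternSketch[]): ViolationSketch[] {
  return patterns
    // String.prototype.search ignores lastIndex, so patterns that carry
    // the global flag do not leak state between calls.
    .filter((p) => content.search(p.pattern) !== -1)
    .map(({ name, gate, severity }) => ({ name, gate, severity }));
}

const internalOnly: PatternSketch = {
  pattern: /internal\s+only/i,
  name: "Internal content restriction",
  gate: "limits",
  severity: "high",
};

const violations = findViolations("This memo is Internal  Only.", [internalOnly]);
// violations has one entry, attributed to the "limits" gate
```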

CLAW Protocol

The CLAW protocol validates content against four safety gates:

| Gate | Description | Example Violations |
|------|-------------|-------------------|
| Credibility | Factual accuracy | Fake documents, impersonation |
| Limits | Operational boundaries | Jailbreaks, persona switching |
| Avoidance | Potential for harm | Violence, malware, theft |
| Worth | Legitimate intent | Purposeless destruction |

OWASP Protection

Detects common security vulnerabilities:

SQL Injection
Cross-Site Scripting (XSS)
Command Injection
Path Traversal
Server-Side Request Forgery (SSRF)
Prompt Injection
Sensitive Data Exposure
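For intuition about what pattern-based detection of these categories can look like, here is a deliberately small sketch covering three of them. The regular expressions, the `detectOwaspSketch` name, and the category identifiers are assumptions for illustration; they are not the package's rules and are far too narrow for production use.

```typescript
// Illustrative checks only. None of these flags use the global flag,
// so RegExp.prototype.test stays stateless across calls.
const OWASP_SKETCH_CHECKS: { type: string; pattern: RegExp }[] = [
  { type: "sql_injection", pattern: /\bunion\s+select\b|['"]\s*or\s+\d+\s*=\s*\d+/i },
  { type: "xss", pattern: /<script\b|javascript:/i },
  { type: "path_traversal", pattern: /\.\.[\/\\]/ },
];

function detectOwaspSketch(content: string): string[] {
  // Return the identifier of every category whose pattern matches.
  return OWASP_SKETCH_CHECKS
    .filter((check) => check.pattern.test(content))
    .map((check) => check.type);
}

const traversalHits = detectOwaspSketch("../../etc/passwd");
// traversalHits contains only "path_traversal"
```

Simple patterns like these produce both false positives and false negatives, which is one reason to prefer a maintained rule set over hand-written checks.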

PII Types Detected

Email addresses
Phone numbers
Social Security Numbers
Credit card numbers
IP addresses
Dates of birth
API keys / AWS keys
Private keys
JWT tokens
Passport numbers
Driver license numbers

Streaming Support

For streaming responses, use the stream handler:

import { createGuardianClawPIIRedactor } from "@guardianclaw/voltagent";

const piiRedactor = createGuardianClawPIIRedactor({
  enablePII: true,
  piiTypes: ["EMAIL", "PHONE", "SSN"],
});

const guardrail = {
  name: "pii-stream-redactor",
  handler: async (args) => ({ pass: true }),
  streamHandler: piiRedactor,
};
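Streaming redaction is harder than redacting a complete response because a single piece of PII can be split across chunks. The sketch below illustrates one common approach, holding back text until a whitespace boundary before redacting. It is a standalone illustration handling emails only; the class name and its `push`/`flush` methods are hypothetical and do not reflect the signature of `createGuardianClawPIIRedactor` or of VoltAgent's `streamHandler`.

```typescript
const EMAIL_SKETCH_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;

class BufferedEmailRedactorSketch {
  // Text received so far that may still be the start of an email.
  private pending = "";

  // Accept a chunk and return whatever is now safe to emit.
  push(chunk: string): string {
    this.pending += chunk;
    // Find the last whitespace character; everything up to it is complete.
    let cut = -1;
    for (let i = this.pending.length - 1; i >= 0; i--) {
      if (/\s/.test(this.pending.charAt(i))) {
        cut = i;
        break;
      }
    }
    if (cut === -1) return ""; // no boundary yet, keep buffering
    const ready = this.pending.slice(0, cut + 1);
    this.pending = this.pending.slice(cut + 1);
    return ready.replace(EMAIL_SKETCH_PATTERN, "[EMAIL]");
  }

  // Emit the remaining buffered text at end of stream.
  flush(): string {
    const rest = this.pending.replace(EMAIL_SKETCH_PATTERN, "[EMAIL]");
    this.pending = "";
    return rest;
  }
}

const redactor = new BufferedEmailRedactorSketch();
const streamed =
  redactor.push("Contact ja") +
  redactor.push("ne@exam") +
  redactor.push("ple.com now") +
  redactor.flush();
// streamed === "Contact [EMAIL] now"
```

The trade-off is latency: output lags by up to one whitespace-delimited token, and PII that itself contains spaces (such as some phone formats) would need a larger hold-back window.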

API Reference

Bundle Functions

| Function | Description |
|----------|-------------|
| createGuardianClawGuardrails(config) | Create complete input/output guardrail bundle |
| createChatGuardrails() | Preset for chat applications |
| createAgentGuardrails() | Preset for agent applications |
| createPrivacyGuardrails() | Preset for privacy-focused applications |
| createDevelopmentGuardrails(logger) | Permissive preset for development |

Input Guardrails

| Function | Description |
|----------|-------------|
| createGuardianClawInputGuardrail(config) | Main input guardrail factory |
| createStrictInputGuardrail() | Strict preset |
| createPermissiveInputGuardrail(logger) | Log-only preset |
| createCLAWOnlyGuardrail() | CLAW validation only |
| createOWASPOnlyGuardrail() | OWASP validation only |

Output Guardrails

| Function | Description |
|----------|-------------|
| createGuardianClawOutputGuardrail(config) | Main output guardrail factory |
| createPIIOutputGuardrail(options) | PII-focused output guardrail |
| createStrictOutputGuardrail() | Block on any sensitive content |
| createPermissiveOutputGuardrail(logger) | Redact only, no blocking |

Streaming Handlers

| Function | Description |
|----------|-------------|
| createGuardianClawPIIRedactor(config) | Streaming PII redactor |
| createStrictStreamingRedactor(config) | Abort on sensitive content |
| createPermissiveStreamingRedactor(types) | PII redaction only |
| createMonitoringStreamHandler(logger) | Detection without modification |

Validators (Advanced)

| Function | Description |
|----------|-------------|
| validateCLAW(content, context, patterns) | Run CLAW validation |
| validateOWASP(content, checks, patterns) | Run OWASP validation |
| detectPII(content, types, patterns) | Detect PII in content |
| redactPII(content, types, format) | Redact PII from content |
| quickCheck(content) | Fast CLAW check |
| quickOWASPCheck(content) | Fast OWASP check |
| hasPII(content) | Quick PII detection |

Configuration Options

interface GuardianClawGuardrailConfig {
  // Behavior
  blockUnsafe?: boolean;           // Block unsafe content (default: true)
  logChecks?: boolean;             // Enable logging (default: false)
  logger?: (msg, data) => void;    // Custom logger function

  // Validation modules
  enableCLAW?: boolean;            // Enable CLAW (default: true)
  enableOWASP?: boolean;           // Enable OWASP (default: true)
  enablePII?: boolean;             // Enable PII detection (default: false)

  // CLAW options
  customPatterns?: PatternDefinition[];
  skipActions?: string[];
  minBlockLevel?: RiskLevel;       // 'low' | 'medium' | 'high' | 'critical'

  // OWASP options
  owaspChecks?: OWASPViolationType[];
  customOWASPPatterns?: OWASPPatternDefinition[];

  // PII options
  piiTypes?: PIIType[];
  redactPII?: boolean;
  redactionFormat?: string | ((type, value) => string);

  // Performance
  maxContentLength?: number;       // Max content length (default: 100000)
  timeout?: number;                // Timeout in ms (default: 5000)
}
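The interplay between `blockUnsafe` and `minBlockLevel` can be pictured as a threshold comparison over the four risk levels. The sketch below assumes the levels are ordered low < medium < high < critical and that content is blocked when its risk is at or above the minimum; both the ordering semantics and the `shouldBlockSketch` function are assumptions, not documented behavior.

```typescript
type RiskLevel = "low" | "medium" | "high" | "critical";

// Assumed ordering, from least to most severe.
const RISK_ORDER: RiskLevel[] = ["low", "medium", "high", "critical"];

function shouldBlockSketch(
  found: RiskLevel,
  minBlockLevel: RiskLevel,
  blockUnsafe = true,
): boolean {
  // With blocking disabled, findings would only be logged.
  if (!blockUnsafe) return false;
  // Block when the detected risk meets or exceeds the threshold.
  return RISK_ORDER.indexOf(found) >= RISK_ORDER.indexOf(minBlockLevel);
}

const blocksHigh = shouldBlockSketch("high", "medium");
// blocksHigh === true
```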

Requirements

Node.js >= 18.0.0
VoltAgent >= 0.1.0 (tested with @voltagent/core v1.5.2)

Development

This package depends on @guardianclaw/core, which provides the CLAW validation patterns. When developing locally, the dependency is resolved via file:../core in the monorepo structure.

For production npm installations, the core patterns are bundled during the build process. If you're building from source, ensure you have the full monorepo cloned:

git clone https://github.com/guardian-claw/guardianclaw.git
cd guardianclaw
npm install
npm run build -w packages/core
npm run build -w packages/voltagent

Related Packages

guardianclaw: Python package
@guardianclaw/elizaos-plugin: ElizaOS integration

License

MIT License (see LICENSE for details)

Built by GuardianClaw Team
