@agentick/guardrails

v0.4.0

Published 17 hours ago

Guardrail middleware for Agentick — gate tool execution with rules and classifiers


rlindgren

agenticklabs


Install

pnpm add @agentick/guardrails

Quick Start

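import { toolGuardrail, deny, allow } from "@agentick/guardrails";

// Rules are checked in order and the first match wins.
const guardrail = toolGuardrail({
  rules: [deny("file_delete", "exec_*"), allow("file_read", "file_write")],
});

// `app` is assumed to be your existing Agentick app instance.
app.use(guardrail);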

API

`toolGuardrail(config)`

Create middleware that gates tool execution.

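toolGuardrail({
  rules?: GuardrailRule[],        // static allow/deny rules, evaluated first
  classify?: GuardrailClassifier, // runs only when no rule matched
  onDeny?: (toolName: string, reason: string) => void, // receives the tool name and denial reason
})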

The middleware intercepts only procedures whose operationName === "tool:run". All other procedures pass through untouched.
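To illustrate the `onDeny` option, here is a minimal sketch of an in-memory denial log. The `createDenyLog` helper and the `DenyEntry` type are hypothetical and not part of this package. Only the callback signature `(toolName, reason) => void` comes from the config above.

```typescript
// Hypothetical helper: collects denials in memory for later inspection.
interface DenyEntry {
  toolName: string;
  reason: string;
  at: Date;
}

function createDenyLog() {
  const entries: DenyEntry[] = [];
  // Same parameter list as the documented `onDeny` option.
  const onDeny = (toolName: string, reason: string): void => {
    entries.push({ toolName, reason, at: new Date() });
  };
  return { entries, onDeny };
}

// Assumed wiring: toolGuardrail({ rules, onDeny: denyLog.onDeny })
const denyLog = createDenyLog();
```

Keeping the handler free of side effects other than recording makes it easy to swap in a real logger or metrics client later.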

`deny(...patterns)`

Create a deny rule.

deny("file_delete", "exec_*");
// { patterns: ["file_delete", "exec_*"], action: "deny" }

`allow(...patterns)`

Create an allow rule.

allow("file_read", "search");
// { patterns: ["file_read", "search"], action: "allow" }
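Both helpers return plain rule objects, as the comments above show. A rough sketch of that shape follows. `RuleSketch` and `makeRule` are illustrative names only, and the package's real `GuardrailRule` type may carry more fields.

```typescript
// Illustrative types; the package's actual GuardrailRule may differ.
type RuleAction = "deny" | "allow";

interface RuleSketch {
  patterns: string[];
  action: RuleAction;
}

// Hypothetical constructor that mirrors the object shape shown for deny() and allow().
function makeRule(action: RuleAction, ...patterns: string[]): RuleSketch {
  return { patterns, action };
}

const denyRule = makeRule("deny", "file_delete", "exec_*");
const allowRule = makeRule("allow", "file_read", "search");
```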

Rule Patterns

Patterns support * wildcard matching:

| Pattern | Matches |
| ----------- | -------------------------------- |
| "search" | Exact match only |
| "file_*" | file_read, file_write, ... |
| "*_admin" | read_admin, write_admin, ... |
| "*" | Everything |
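One way to picture the matching in the table is to translate each pattern into an anchored regular expression. The `matchesPattern` helper below is a hypothetical sketch, not the package's implementation. It assumes `*` matches any run of characters, including an empty one, which the README does not specify.

```typescript
// Hypothetical helper, not part of the @agentick/guardrails API.
function matchesPattern(pattern: string, toolName: string): boolean {
  // Escape regex metacharacters in the literal parts, then join them with `.*`
  // so that every `*` in the pattern matches any run of characters.
  const source = pattern
    .split("*")
    .map((part) => part.replace(/[.+?^${}()|[\]\\]/g, "\\$&"))
    .join(".*");
  // Anchor both ends so "search" does not match "search_all".
  return new RegExp(`^${source}$`).test(toolName);
}
```

Anchoring is what makes a pattern without a wildcard behave as an exact match.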

Evaluation Order

1. Static rules: first match wins
   - deny → throw GuardrailDenied
   - allow → skip classifier, proceed
2. Classifier: only runs if no rule matched
   - Return { action: "deny", reason } to block
   - Return null / undefined / { action: "allow" } to proceed
3. Default: allow
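The evaluation order above can be sketched as a small decision function. Everything here is illustrative: `decide`, `firstMatchingAction`, and `globMatch` are hypothetical names, the classifier takes a simplified `(toolName, input)` signature rather than the `(call, envelope)` one the package uses, and the sketch returns a decision object where the real middleware throws GuardrailDenied.

```typescript
type Action = "allow" | "deny";
interface Rule {
  patterns: string[];
  action: Action;
}
type Decision = { action: "allow" } | { action: "deny"; reason: string };
// Simplified classifier signature (assumption, see lead-in).
type Classify = (toolName: string, input: unknown) => Promise<Decision | null | undefined>;

// Hypothetical `*` matcher: each `*` matches any run of characters.
function globMatch(pattern: string, name: string): boolean {
  const source = pattern
    .split("*")
    .map((part) => part.replace(/[.+?^${}()|[\]\\]/g, "\\$&"))
    .join(".*");
  return new RegExp(`^${source}$`).test(name);
}

// Step 1: the first rule with a matching pattern decides; later rules are ignored.
function firstMatchingAction(toolName: string, rules: Rule[]): Action | undefined {
  const hit = rules.find((rule) => rule.patterns.some((p) => globMatch(p, toolName)));
  return hit ? hit.action : undefined;
}

async function decide(
  toolName: string,
  input: unknown,
  rules: Rule[] = [],
  classify?: Classify,
): Promise<Decision> {
  const ruleAction = firstMatchingAction(toolName, rules);
  if (ruleAction === "deny") {
    return { action: "deny", reason: `"${toolName}" matched a deny rule` };
  }
  if (ruleAction === "allow") {
    return { action: "allow" }; // the classifier is skipped entirely
  }
  // Step 2: the classifier runs only when no rule matched.
  if (classify) {
    const verdict = await classify(toolName, input);
    if (verdict && verdict.action === "deny") return verdict;
  }
  // Step 3: nothing objected, so the call is allowed.
  return { action: "allow" };
}
```

Because the first match wins, a broad `allow("*")` placed before a deny rule would shadow it in this sketch, so order rules from most to least specific.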

Classifier

const guardrail = toolGuardrail({
  classify: async (call, envelope) => {
    if (call.input?.dangerous) {
      return { action: "deny", reason: "Dangerous input detected" };
    }
    return null; // allow
  },
});
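A classifier is easier to test when the decision logic lives in a pure function. The sketch below assumes a shell-style tool whose input carries a `command` string. That field, the `ToolCallSketch` type, and the blocked-fragment list are all hypothetical; only `call.input` and the `{ action: "deny", reason }` return shape come from the example above.

```typescript
// Assumed shape: the README only shows `call.input`; `name` is illustrative.
interface ToolCallSketch {
  name: string;
  input?: Record<string, unknown>;
}
type Verdict = { action: "deny"; reason: string } | null;

// Hypothetical list of substrings we never want to see in a command.
const BLOCKED_FRAGMENTS = ["rm -rf", "mkfs", "shutdown"];

// Pure check: returns a deny verdict or null (meaning "no objection").
function inspectCommand(input: Record<string, unknown> | undefined): Verdict {
  const command = input?.["command"]; // hypothetical input field
  if (typeof command !== "string") return null;
  const hit = BLOCKED_FRAGMENTS.find((fragment) => command.includes(fragment));
  return hit ? { action: "deny", reason: `Command contains "${hit}"` } : null;
}

// Async wrapper in the style of the `classify` option; the envelope argument is unused here.
const classifyShell = async (call: ToolCallSketch): Promise<Verdict> =>
  inspectCommand(call.input);
```

Substring checks like this are easy to bypass, so treat them as a first filter and keep static deny rules for tools that should never run.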

Error Handling

A denied tool call throws GuardrailDenied, which extends GuardError. Use isGuardError from @agentick/shared to detect it:

import { isGuardError } from "@agentick/shared";

try {
  await tool.run(input);
} catch (error) {
  if (isGuardError(error)) {
    // Access denied — error.code === "GUARD_DENIED"
  }
}

The model sees a tool error result with the denial reason, allowing it to try a different approach.
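As a rough illustration of how a denial could become an error result, the sketch below uses local stand-in classes so it runs without the Agentick packages. `GuardErrorSketch`, `GuardrailDeniedSketch`, `isGuardErrorSketch`, `ToolResult`, and `runGuarded` are all hypothetical. Only the `GUARD_DENIED` code and the GuardrailDenied extends GuardError relationship come from this README.

```typescript
// Local stand-ins; the real GuardError and GuardrailDenied come from the Agentick packages.
class GuardErrorSketch extends Error {
  readonly code = "GUARD_DENIED";
  constructor(message: string) {
    super(message);
    this.name = "GuardErrorSketch";
    // Keeps instanceof working even when compiled to older targets.
    Object.setPrototypeOf(this, new.target.prototype);
  }
}
class GuardrailDeniedSketch extends GuardErrorSketch {}

// Type guard playing the role of isGuardError in this sketch.
function isGuardErrorSketch(error: unknown): error is GuardErrorSketch {
  return error instanceof GuardErrorSketch;
}

type ToolResult<T> = { ok: true; value: T } | { ok: false; error: string };

// Runs a tool body and converts guard denials into an error result.
// Any other error is rethrown so real bugs are not hidden.
function runGuarded<T>(run: () => T): ToolResult<T> {
  try {
    return { ok: true, value: run() };
  } catch (error) {
    if (isGuardErrorSketch(error)) {
      return { ok: false, error: error.message };
    }
    throw error;
  }
}
```

Rethrowing everything that is not a guard error keeps denials, which the model can react to, separate from genuine failures.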

Future

- inputGuardrail: gate based on user input content
- outputGuardrail: gate based on model output content
