codex-camel

v0.1.0

Published

a month ago

`codex-cli-camel` is a Codex CLI fork with integrated **CaMeL prompt-injection defense methodology**. This implementation is based on Google DeepMind's CaMeL work and adapted to Codex CLI runtime boundaries.

0High
0Medium
0Low

nativ3ai

codex-cli-camel

codex-cli-camel is a Codex CLI fork with integrated CaMeL prompt-injection defense methodology. This implementation is based on Google DeepMind's CaMeL work and adapted to Codex CLI runtime boundaries.

What is added vs upstream

Native CaMeL guard in the core turn pipeline.
Runtime modes:
- off
- monitor (warn, continue)
- enforce (block suspicious turn)
CLI commands:
- codex camel activate --mode monitor|enforce --threshold <n>
- codex camel deactivate
- codex camel status
- codex camel scan "<payload>"
- codex camel compare
Reproducible benchmark harness and runtime comparison docs.
Companion plugin path (medium protection):
- https://github.com/nativ3ai/codex-cli-camel-plugin

CaMeL references (source methodology)

Paper (arXiv): Defeating Prompt Injections by Design
https://arxiv.org/abs/2503.18813
Google research repository:
https://github.com/google-research/camel-prompt-injection

Install (this fork)

npm (global, one line)

npm install -g codex-camel
codex --version

Build from source

git clone https://github.com/nativ3ai/codex-cli-camel.git
cd codex-cli-camel/codex-rs

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y
source "$HOME/.cargo/env"
rustup component add rustfmt clippy

cargo build -p codex-cli
cargo run -p codex-cli -- --help

Optional local binary install

cd codex-rs
cargo install --path cli --force
codex --help

Activate guard

# Persist guard in ~/.codex/config.toml
codex camel activate --mode monitor --threshold 6

# Check effective settings (env vars override config)
codex camel status

# Switch to strict enforcement
codex camel activate --mode enforce --threshold 6

# Disable
codex camel deactivate

Environment override knobs:

export CODEX_CAMEL_GUARD_MODE=monitor   # off | monitor | enforce
export CODEX_CAMEL_GUARD_THRESHOLD=6

Malicious prompt behavior

Monitor mode

suspicious input/tool-context is detected
warning is emitted
turn continues

Enforce mode

suspicious input/tool-context is detected
turn is blocked
explicit error is returned

Benchmarks and research docs

Run benchmark:

python3 benchmarks/camel_guard/benchmark.py
cat benchmarks/camel_guard/latest.json

Latest benchmark snapshot

| Metric | Value | | --- | ---: | | samples | 8 | | threshold | 6 | | accuracy | 1.00 | | benign false-positive rate | 0.00 | | malicious detection rate | 1.00 | | throughput (samples/sec) | 456074.47 |

Implementation matrix

| Implementation | Scope | Protection level | Runtime behavior | | --- | --- | --- | --- | | codex-cli-camel (this fork) | Core CLI/runtime hooks | High | monitor warn, enforce block | | codex-cli-camel-plugin | Plugin/hook layer | Medium | Hook-based scan and policy actions | | hermes-agent-camel | Hermes agent runtime | High | Runtime-integrated CaMeL guardrails |

Security model notes

This fork adds deterministic low-overhead guarding at runtime boundaries.
It is not a complete formal defense by itself.
Keep Codex approval policy and sandboxing enabled for defense-in-depth.