@roll-agent/reply-policy-tuner-agent

v0.1.4

Published

a day ago

策略 RSI 编排 Agent（Reply Policy Tuner）

0High
0Medium
0Low

steveoooon

niannianfc

reply-policy-tuner-agent

基于 Roll Agent SDK 的回复策略 RSI 编排 Agent：校验 → 预览 → 双路评估 → 有条件写入，对接 Reply Authority Service（RAS）。

npm 包： @roll-agent/reply-policy-tuner-agent

简介

本仓库实现一个通过 stdio 与上层通信的 Roll Agent，用 MCP Tool 管理租户级回复策略（读取、补丁校验、话术预览、双路评估、有条件写入）。策略变更须先经 RAS 评估；update_policy 受本地 evaluate 门禁约束，避免跳过安全评估直接落库。

本 Agent 不自动驱动完整 RSI 阶段，而是向上层编排返回 orchestration，由编排方根据 orchestration.action 分支调度。

| 范围内 | 范围外 | |--------|--------| | 策略 CRUD、校验、预览、评估、有条件写入 | 向候选人发消息、解析 BOSS DOM | | orchestration、evaluate 门禁、口语化错误 | 代替运营做「是否写入」的最终决策 | | 与 RAS 集成 | 自动串联 browser-use / 多 Agent 全流程 |

特性

12 个 MCP Tools：从 diagnostic_status 到 reset_policy
Evaluate 门禁：评估成功后须在 TTL 内（默认 15 分钟）方可 update_policy
编排信号：submit_evaluate_policy_patch 返回 orchestration.action
评估稳健性：默认 2 primary + 1 regression；超 3 条裁切；超时降为 1p+1r 重试
Judge 固定开启：Hard Gate + Fact Verification + Frozen Rubric Judge
双层确认：预览后确认进入评估；评估后唯一写入确认；高危操作 needs_confirmation
运营可读输出：operatorSummary、evaluationSummaryMarkdown、对比表

环境要求

roll CLI
可访问的 Reply Authority Service 与有效 Bearer Token（含所需 reply-policy:* scope）

安装

1. 安装 Agent

roll agent install @roll-agent/reply-policy-tuner-agent

2. 配置 `roll.config.yaml`

在 Roll 配置中为 reply-policy-tuner-agent 注入环境变量（必填项见配置）：

agents:
  env:
    reply-policy-tuner-agent:
      REPLY_AUTHORITY_URL: https://reply-authority.example.com
      REPLY_AUTHORITY_BEARER_TOKEN: 你的Token
      REPLY_AUTHORITY_TIMEOUT_MS: "120000"

3. 验证安装

roll agent list
roll run reply-policy-tuner-agent diagnostic_status --json

使用

读取策略

roll run reply-policy-tuner-agent get_policy \
  --input-json '{"tenantId":"<your-tenant-id>"}' --json

查看 Tool Schema

roll agent tools reply-policy-tuner-agent --json

多 Agent 路由

roll ask "评估并修改回复策略，先查账号再 evaluate" --json

典型改策流程（编排层负责 browser-use 与用户确认）：

get_policy → validate_patch → preview_policy_effect
  → [用户确认评估]
  → build_evaluate_cases → submit_evaluate_policy_patch
  → [按 orchestration 分支 + 用户确认写入]
  → update_policy

门禁细则、租户解析、硬阻断话术：见 SKILL.md、references/orchestration.md。

配置

环境变量通过 roll.config.yaml → agents.env.reply-policy-tuner-agent 注入（示例见安装）。

| 变量 | 必填 | 说明 | |------|:----:|------| | REPLY_AUTHORITY_URL | 是 | RAS 基础地址 | | REPLY_AUTHORITY_BEARER_TOKEN | 是 | Bearer Token | | REPLY_AUTHORITY_TIMEOUT_MS | 否 | 一般请求超时，默认 30000 ms；示例中为 120000 | | REPLY_AUTHORITY_EVALUATE_TIMEOUT_MS | 否 | 默认 60000 ms | | REPLY_POLICY_TUNER_POLICY_JSON | 否 | Tool 策略、approvalTtlMs、evaluateGateTtlMs | | REPLY_POLICY_TUNER_EVALUATE_GATE_DIR | 否 | 门禁存储目录（默认 ~/.roll-agent/...） | | REPLY_POLICY_TUNER_APPROVAL_DIR | 否 | needs_confirmation 批准目录 | | REPLY_POLICY_TUNER_PREVIEW_RECRUITER_USERNAME | 否 | 仅作后备；生产应由编排层传入 recruiterUsername |

Roll 环境声明：references/env.yaml。

常用 scope（diagnostic_status 可查）：reply-policy:read、validate、preview、write；Judge 需 reply-policy:judge。

Tools

| Tool | 说明 | |------|------| | diagnostic_status | 环境指纹、RAS 健康、auth context、Admin 租户列表 | | get_policy | 当前策略 + policyVersion + 运营摘要 | | validate_patch | 校验补丁；diff + warnings | | resolve_recruiter_binding | BOSS 账号 ↔ 租户（含权限校验） | | build_evaluate_cases | 拼装 evaluate 请求体 | | submit_evaluate_policy_patch | 双路回放 + Judge；写入门禁；返回 orchestration | | preview_policy_effect | 单条话术预览 | | format_policy_preview | 合并运营可读 Markdown | | update_policy | 局部写入（evaluate 门禁 + 可选 confirm） | | validate_policy | 整份草稿校验（边缘场景） | | reset_policy | 删除租户覆盖（confirm，不经 evaluate 门禁） |

架构

flowchart LR
  Orchestrator[上层编排]
  Tuner[reply-policy-tuner-agent]
  Browser[browser-use-agent]
  RAS[Reply Authority Service]
  SmartReply[smart-reply-agent]

  Orchestrator --> Tuner
  Orchestrator --> Browser
  Browser -->|recruiterUsername| Orchestrator
  Tuner --> RAS
  RAS --> SmartReply

| 组件 | 职责 | |------|------| | 上层编排 | 多 Agent 串联、用户确认、orchestration 分支 | | reply-policy-tuner-agent | MCP Tools、门禁存储、呈现层 | | browser-use-agent | 按需从 BOSS 获取 recruiterUsername | | RAS | 策略存储与 validate / preview / evaluate API | | smart-reply-agent | 运行时消费已发布策略 |

运行时： stdio 传输、on-demand 拉起，入口 node dist/index.js（见 package.json → rollAgent）。

目录结构

reply-policy-tuner-agent/
├── src/
│   ├── index.ts                 # Agent 入口与 Tool 注册
│   ├── tools/                   # MCP Tool 实现
│   ├── services/                # RAS 客户端、绑定解析
│   ├── presentation/            # 摘要、编排推导
│   └── evaluate-publish-gate.ts # 本地 evaluate 门禁
├── references/
│   ├── env.yaml                 # Roll 环境声明
│   └── orchestration.md         # 上层编排手册
├── SKILL.md                     # Agent Skill（门禁、运营话术）
├── dist/                        # 构建产物（npm 发布）
└── package.json

工作流

发布门禁（`update_policy` 不可绕过）

对同一 tenantId、basePolicyVersion、patch 成功执行 submit_evaluate_policy_patch
publishBlocked === false，且 hardRecommendedForPublish、factRecommendedForPublish 均为 true
门禁记录在 evaluateGateTtlMs 内有效（默认 15 分钟）
写入的 patch 与最近一次 evaluate 完全一致

`orchestration.action`

| 值 | 含义 | |----|------| | rollback_to_propose | Hard / Fact 硬阻断，须改 patch 后重评 | | decide_with_warnings | Hard/Fact 已过，Judge 或回归有告警，用户确认后可发布 | | ready_to_publish | 评估通过，展示结果后用户确认再写入 |

评估默认行为

默认 2 primary + 1 regression；首次请求前超过 3 条则裁至 2p+1r
超时：降为 1p+1r 重试一次；仍失败则报错（禁止跳过评估直接写入）

本地开发

从源码构建（需 Node.js >= 22.6.0 与 pnpm）：

git clone https://github.com/daixueyun3377/reply-policy-tuner-agent.git
cd reply-policy-tuner-agent
pnpm install
pnpm build

pnpm dev          # 运行 TS 入口（Node --experimental-strip-types）
pnpm typecheck
pnpm test
pnpm build        # 声明 + esbuild 打包 + 混淆 → dist/
pnpm clean

对接真实 RAS 的联调测试：

REPLY_POLICY_TUNER_LIVE_TEST=1 pnpm test

文档

| 文档 | 读者 | |------|------| | SKILL.md | 编排 / 子 Agent：门禁、话术、patch 方法论 | | references/orchestration.md | 上层编排：调用顺序、needs_confirmation | | references/env.yaml | Roll 环境变量 | | CHANGELOG.md | 版本记录 |

参与贡献

欢迎提交 Issue 与 Pull Request。

Fork 本仓库
创建功能分支（git checkout -b feature/your-feature）
提交变更（git commit -m 'Add some feature'）
推送分支（git push origin feature/your-feature）
打开 Pull Request

较大改动建议先在 Issues 讨论。

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme