@kognitivedev/evals
v0.2.28
Published
Provider-agnostic evaluation framework with model-graded and rule-based scorers
Maintainers
Readme
@kognitivedev/evals
Provider-agnostic evaluation framework with model-graded, rule-based, and composite scorers.
Installation
bun add @kognitivedev/evals @kognitivedev/adapter-ai-sdk @ai-sdk/openai zodQuick Start
import { EvalDatasetBuilder, modelGradedScorer, ruleBased, runEvals } from "@kognitivedev/evals";
import { createAISDKStructuredGenerationAdapter } from "@kognitivedev/adapter-ai-sdk";
import { openai } from "@ai-sdk/openai";
const judge = createAISDKStructuredGenerationAdapter({
model: openai("gpt-4o-mini"),
});
const report = await runEvals({
generateResponse: async (messages) => {
return "Hello";
},
dataset: EvalDatasetBuilder.fromArray([
{ input: [{ role: "user", content: "Hello" }] },
]),
scorers: [
ruleBased.containsKeyword("hello"),
modelGradedScorer({ runtime: judge, criteria: "Is this helpful?" }),
],
});
console.log(report.summary.avgScore);Scorers
modelGradedScorervia aStructuredGenerationAdaptervisionScorerfor multimodal judging- rule-based scorers
compositeScorer
