evaluhealth

v0.20.0

Published

3 months ago

Test your LLM-powered healthcare and life sciences apps with a TypeScript-native, Vitest-based eval runner. No API key required.

Vulnerabilities: 0 high, 0 medium, 0 low

yamz8

ai evals typescript vitest healthcare life-sciences medical-ai clinical-ai biomedical hipaa fda regulatory-compliance

Evaluhealth: the TypeScript-native, local-first tool for testing LLM-powered apps.

By Kernelius — Building AI tools for healthcare and life sciences.

What Is Evaluhealth?

Evaluhealth is a fork of Evalite designed specifically for healthcare and life sciences. It's a TypeScript-native, local-first evaluation framework for testing LLM-powered healthcare applications, medical AI systems, and life sciences research tools.

Core Features

Fully open source: no API key required.
Local-first: runs on your machine, your data never leaves your laptop.
Based on Vitest, the best TypeScript test runner around.
Terminal UI for quick prototyping.
Supports tracing and custom scorers.

Healthcare & Life Sciences Focus

Evaluhealth is tailored for developers and researchers working in healthcare and life sciences:

Medical AI Evaluation: Test clinical decision support systems, diagnostic tools, and medical chatbots with comprehensive evaluation metrics
Life Sciences Research: Evaluate LLM performance on scientific literature analysis, drug discovery pipelines, and biomedical data processing
HIPAA-Compliant Testing: Local-first architecture ensures sensitive healthcare data (PHI) never leaves your secure environment
Regulatory Readiness: Build evaluation frameworks that support FDA submissions, CE marking, and other regulatory compliance requirements
Clinical Validation: Create robust test suites for validating AI models in real-world healthcare contexts with domain-specific scorers
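
To make "domain-specific scorer" concrete, here is a minimal sketch of one as a plain TypeScript function. It is not taken from the Evaluhealth API: the name requiredTermCoverage, the ScorerInput shape, and the 0-to-1 score range are all assumptions made for illustration, and how such a function is registered with Evaluhealth is not covered here.

```typescript
// Hypothetical scorer input: the model's answer plus the terms a reviewer
// expects to see in it. This shape is an assumption, not Evaluhealth's API.
interface ScorerInput {
  output: string;
  expected: string[];
}

// Returns the fraction of expected terms found in the output (0 to 1).
// Matching is a case-insensitive substring check, which is deliberately naive.
function requiredTermCoverage({ output, expected }: ScorerInput): number {
  if (expected.length === 0) return 1; // nothing required, so nothing is missing
  const haystack = output.toLowerCase();
  const found = expected.filter((term) => haystack.includes(term.toLowerCase()));
  return found.length / expected.length;
}

// Example: two of the three expected terms appear in the answer.
const coverageScore = requiredTermCoverage({
  output: "Start metformin 500 mg and recheck HbA1c in three months.",
  expected: ["metformin", "HbA1c", "renal function"],
});
```

Substring matching will miss synonyms and abbreviations, so a real clinical scorer would likely need a terminology mapping or a reviewer-defined rubric on top of this.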

How Do I Learn More?

Evaluhealth Is Experimental

Evaluhealth is still an experimental project. I'm actively working on it and, for now, am still pushing breaking changes.

If you run into any unexpected behavior:

Delete the node_modules/.evalite folder.
Update evaluhealth to the latest version.
Rerun your evals.

If you still run into unexpected behavior after that, report an issue.

Guides

Watch Mode

You can run Evaluhealth in watch mode by running evaluhealth watch:

evaluhealth watch

This will watch for changes to your .eval.ts files and re-run the evals when they change.

[!IMPORTANT]
I strongly recommend implementing a caching layer in your LLM calls when using watch mode. This will keep your evals running fast and avoid burning through your API credits.
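
One possible shape for such a caching layer is sketched below. It is a generic wrapper, not an Evaluhealth feature: LlmCall, withCache, and fakeLlm are hypothetical names, and fakeLlm stands in for whatever function actually calls your model.

```typescript
// Hypothetical signature for "send a prompt, get text back".
type LlmCall = (prompt: string) => Promise<string>;

// Wraps an LLM call so that repeated prompts reuse the earlier result.
// The cache stores promises, so concurrent calls with the same prompt
// share a single upstream request.
function withCache(
  call: LlmCall,
  cache: Map<string, Promise<string>> = new Map(),
): LlmCall {
  return (prompt: string): Promise<string> => {
    const hit = cache.get(prompt);
    if (hit !== undefined) return hit;
    const pending = call(prompt).catch((err: unknown) => {
      cache.delete(prompt); // do not keep failed requests in the cache
      throw err;
    });
    cache.set(prompt, pending);
    return pending;
  };
}

// Stand-in for a real model call; it only counts how often it is invoked.
let upstreamCalls = 0;
const fakeLlm: LlmCall = async (prompt) => {
  upstreamCalls += 1;
  return `echo: ${prompt}`;
};

const cachedLlm = withCache(fakeLlm);
```

An in-memory Map only helps within a single process. If the watcher reloads your modules on each re-run, the cache would need to be persisted (for example to a local file) to survive between runs; the optional second parameter is there so a different store can be passed in.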

Running Specific Files

You can run specific files by passing them as arguments:

evaluhealth my-eval.eval.ts

This also works for watch mode:

evaluhealth watch my-eval.eval.ts
