npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@agentmark-ai/cli

v0.24.3

Published

Agentmark's CLI

Readme

AgentMark CLI

The command-line tool for developing, testing, and evaluating AI agents with AgentMark.

Installation

npm install -g @agentmark-ai/cli

Or use it directly with npx:

npx @agentmark-ai/cli dev

Quick Start

# Scaffold a new project
npm create agentmark@latest

# Start the dev server (API + trace UI + hot reload)
agentmark dev

# Run a prompt with its test props
agentmark run-prompt my-prompt.prompt.mdx

# Run an experiment against a dataset
agentmark run-experiment my-prompt.prompt.mdx

Commands

agentmark dev

Start the local development environment: API server, webhook server, and trace UI.

agentmark dev
agentmark dev --api-port 9418 --webhook-port 9417 --app-port 3000
agentmark dev --no-ui         # API + webhook only (CI / headless use)
agentmark dev --no-forward    # Disable trace forwarding to AgentMark Cloud

The dev server auto-detects your project language (TypeScript or Python), finds your virtual environment, and handles port conflicts automatically.

agentmark doctor

Diagnose your project setup: config, prompts, environment, and (optionally) a live smoke test against the dev server.

agentmark doctor
agentmark doctor --json             # Machine-readable report
agentmark doctor --strict           # Exit non-zero on warnings too (useful in CI)
agentmark doctor --smoke --boot     # Also run a prompt end-to-end and verify the trace

agentmark run-prompt <filepath>

Execute a single prompt and display the result.

# Run with test props from the prompt's frontmatter
agentmark run-prompt customer-support.prompt.mdx

# Run with custom props
agentmark run-prompt customer-support.prompt.mdx --props '{"customer_question": "Where is my order?"}'

# Run with props from a file
agentmark run-prompt customer-support.prompt.mdx --props-file test-data.json

Output includes the LLM response, token usage, cost, and a link to the trace in the local UI.

agentmark run-experiment <filepath>

Run a prompt against every item in its dataset, with optional evaluations.

# Run with evals (default)
agentmark run-experiment my-prompt.prompt.mdx

# Skip evals
agentmark run-experiment my-prompt.prompt.mdx --skip-eval

# Output as JSON instead of table
agentmark run-experiment my-prompt.prompt.mdx --format json

# Fail if pass rate is below 80%
agentmark run-experiment my-prompt.prompt.mdx --threshold 80

# Sample 20% of the dataset, reproducibly
agentmark run-experiment my-prompt.prompt.mdx --sample 20 --seed 42

# Run specific rows only
agentmark run-experiment my-prompt.prompt.mdx --rows 0,3-5,9

# Compare against a prior run and fail on regressions
agentmark run-experiment my-prompt.prompt.mdx --baseline-commit <ref>

Output formats: table (default), csv, json, jsonl, junit.

agentmark build

Pre-compile .prompt.mdx files into JSON for production use with the file loader.

agentmark build
agentmark build --out dist/agentmark   # default output directory

agentmark generate-types

Generate type definitions from your prompts for type-safe usage in code.

agentmark generate-types
agentmark generate-types --language python

agentmark generate-schema

Generate a JSON Schema for .prompt.mdx frontmatter, enabling IDE validation and autocomplete.

agentmark generate-schema
agentmark generate-schema --out .agentmark

agentmark pull-models

Interactively select and add LLM models from a provider to your agentmark.json.

agentmark pull-models
agentmark pull-models --provider anthropic --models <csv>   # non-interactive

agentmark login / agentmark logout

Authenticate with AgentMark Cloud.

agentmark login
agentmark logout

agentmark link

Link your project to an AgentMark Cloud app for trace forwarding.

agentmark link
agentmark link --app-id <uuid>

Documentation

Full CLI reference at docs.agentmark.co/reference/cli-commands.

License

AGPL-3.0-or-later