npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@taskproof/adapter-browser-use

v0.2.1

Published

browser-use runner adapter: drives the Python/FastAPI sidecar and emits the identical taskproof artifacts

Readme

@taskproof/adapter-browser-use

The second runner adapter — and the proof of the moat: it drives browser-use (a Python agent) through a thin FastAPI sidecar, yet emits the identical RunArtifact and grades through the same @taskproof/grader as the Claude adapter. The only difference between the two is which agent drove the browser.

How it stays uniform: the sidecar runs browser-use and reports raw run data plus probe results (final URL, network log, and a DOM-probe result per spec selector — evaluated via CDP since browser-use 0.13 dropped Playwright). The TS adapter maps that into the shared artifact and grades it. Grading logic lives in one place; the sidecar never sees the spec or artifact schema.

Running it

The sidecar is a uv project under sidecar/. One-time setup:

cd sidecar
uv sync                      # installs browser-use 0.13.1 + FastAPI (pinned)
uv run browser-use install   # fetches browser-use's Chromium (separate from pip)
export ANTHROPIC_API_KEY=…    # browser-use drives Claude as its LLM
uv run uvicorn taskproof_sidecar.app:app --port 8765

Then point the adapter at it (default http://127.0.0.1:8765, override with TASKPROOF_BROWSER_USE_URL or sidecarUrl) and run:

taskproof run tasks/*.yaml --models claude-opus-4-8,browser-use

browser-use (or browser-use:claude-sonnet-4-6) selects this adapter; the Claude model named is the LLM browser-use uses. Comparing claude-opus-4-8 vs browser-use on the same tasks is the headline comparison: same model, two different agent harnesses.

Pre-release. Live-validated 2026-06-13 against browser-use 0.13.1: a real taskproof run --models browser-use reached the target, the CDP DOM probe and the url assertion both graded correctly through the shared grader, and token cost was extracted (~$0.24 for a 3-step task). Verified API details now in runner.py: omit temperature (Opus 4.8 rejects it), read the version via importlib.metadata, get usage from agent.token_cost_service.get_usage_summary(), and set keep_alive=True so the session survives agent.run() for the post-run DOM probe.

Network capture (HAR-based): uses browser-use's HarRecordingWatchdog (record_har_path), validated live — it captures same-origin HTTPS traffic (the site's own API calls, the common network-assertion case). Remaining edge: the watchdog only enables CDP Network on the initial session, so a cross-origin navigation to a new target is missed; full coverage would need per-target Network.enable via Target.attachedToTarget. HTTP (non-TLS) isn't captured.

Budget cap (maxCostUsd / --max-cost) is NOT enforced mid-run here. Unlike the Claude adapter, which gates each turn against the cap before paying, taskproof can't stop a browser-use run partway — it runs to its own completion or maxSteps. The cap is therefore advisory for this adapter: cost may exceed it (the report shows the real figure), and the CLI warns when you pass --max-cost with a browser-use model. maxSteps is the hard bound on browser-use spend — lower it to bound cost.