npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@digital4better/data

v1.3.40

Published

Digital4Better Open Data

Readme

Digital4Better Open Data

Open datasets maintained by Digital4Better to describe the environmental footprint of digital services, cloud infrastructure, electricity systems, and AI models.

This repository is meant to be used as a data source, not as developer documentation. The main audience is analysts, sustainability teams, researchers, product teams, and anyone who needs reusable reference data in JSON or CSV.

These reference datasets are used, among other things, by fruggr, Digital4Better's platform for measuring and managing the environmental footprint of digital services.

What You Can Find Here

The repository is organized as a set of reusable data collections:

| Collection | What it covers | Main files | | --- | --- | --- | | data/ai | AI model catalog across vendors and cloud providers | models.json | | data/cloud | Cloud regions, virtual machines, CPUs, accelerators | *-regions.*, *-vms.*, cpus.*, accelerators.* | | data/country | Countries, regions, continents, and distance referentials | regions.*, countries.*, continents.*, *-distances.* | | data/energy | Environmental impacts of electricity production technologies | energy-impacts.* | | data/mix | Electricity mix by geography and time period | world-*, continent-*, country-*, subdivision-* | | data/factor | Electricity impact factors derived from energy mix data | world-*, continent-*, country-*, subdivision-* | | data/equipment | Equipment energy and embodied impact reference data | energy.*, embodied.* |

Why This Repository Exists

These datasets are used to:

  • estimate the environmental footprint of digital services
  • compare cloud infrastructure options across providers and regions
  • model electricity-related impacts by country, continent, or subdivision
  • enrich internal or public sustainability dashboards
  • document AI models and their characteristics in a structured way

Highlights

AI Models

The AI catalog in data/ai/models.json documents model families from providers such as OpenAI, Anthropic, Google, Mistral, Meta, Qwen, DeepSeek, Amazon, Cohere, and others.

This makes it useful for market mapping, observatories, governance, and cloud/AI portfolio analysis.

Main source families:

Cloud Infrastructure

The cloud referentials in data/cloud provide structured information for major providers including AWS, Azure, GCP, Oracle Cloud Infrastructure, OVHcloud, and Scaleway.

Typical use cases:

  • mapping regions and datacenter footprints
  • comparing VM families and hardware characteristics
  • linking compute infrastructure to sustainability calculations

Main source families:

Current cloud assumptions kept in the datasets:

  • aws: pue and wue come from the 2024 AWS regional CSV; ref stays at 0 because AWS public renewable matching disclosures are not used as a region-level factor in this referential
  • azure: values come from Microsoft regional fact sheets, combining still-live PDFs with previously curated factsheet values for regions whose older PDFs are no longer publicly retrievable
  • gcp: ref comes from regional CFE%; wue comes from previously derived values based on Google environmental reporting and is kept until Google publishes a clearer general regional water metric
  • oracle: use a uniform pue = 1.07 and provisional wue = 0 until OCI publishes region-level metrics
  • ovhcloud: follow FY25 KPI values for pue, wue, and ref
  • scaleway: values come from documented datacenter figures, completed where needed with the provider's impact reports

Electricity Mix And Impact Factors

The datasets in data/mix and data/factor help translate electricity consumption into environmental impacts.

They are available at several levels:

  • world
  • continent
  • country
  • subdivision

And across different time granularities:

  • yearly
  • monthly

Green-only variants are also available through files ending with -green.

Main source families:

Geography And Distances

The datasets in data/country provide geographic referentials used to map countries, continents, subdivisions, and estimated network distances.

Typical use cases:

  • geographic normalization
  • country and subdivision mapping
  • rough estimation of distances between users, countries, regions, and datacenters

Main source families:

  • ISO country and subdivision standards
  • internally maintained geographic referentials used to derive administrative mappings and distance approximations

Equipment Reference Data

The datasets in data/equipment provide reference values for embodied impacts and operational energy of common digital equipment categories.

Typical use cases:

  • footprint modeling at equipment level
  • simplified lifecycle modeling for digital services
  • comparative analysis of device or infrastructure categories

Main source families:

  • Digital4Better internal modeling inputs
  • lifecycle assessment literature and equipment reference datasets used for sustainability calculations

Formats

Most collections are published in both formats:

  • JSON for structured or nested data
  • CSV for tabular exploration, spreadsheets, and BI tools

If a collection is only available in one format, it is usually because that format is the most natural one for the data structure.

Quick Navigation

Notes On Data Quality

This repository aims to provide transparent and reusable reference data, but some values should be interpreted with care.

  • Some fields are derived from public documentation, model cards, technical reports, or literature rather than official disclosures.
  • Some collections include explicit uncertainty markers such as estimated.
  • AI and cloud catalogs evolve quickly, so historical and legacy entries may coexist with current ones.
  • Environmental factors are based on a mix of primary data, literature, and modeling assumptions.

When available, source URLs are kept directly in the data files themselves.

Related Links

License

This repository is published under the ODC Open Database License (ODbL).