npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2024 – Pkg Stats / Ryan Hefner

pii-filter

v1.0.24

Published

PII Filter

Downloads

46

Readme

PII Filter Node Module

CI npm version License: MIT

A library for detecting, parsing, and removing personally identifiable information from strings and objects.

PII replaced with placeholders

Scenarios

We hope that this software can be useful in some of the following scenarios:

  • privacy, security, fraud-detection, and data-auditing
  • anonymizing data for research, marketing and machine learning
  • accessibility and online guidance
  • word tagging and word spotting
  • designing chatbots

Languages

pii-filter currently supports the following languages and PII:

  • Dutch
    • First Names
    • Family Names
    • Pet Names
    • Medicine Names
    • Phone Numbers
    • Email Addresses
    • Dates

Installing

You can add the pii-filter npm package to your project by running:

npm install --save-dev pii-filter

Documentation

The docs can be read here.

Examples

Sanitizing strings:

import * as pf from 'pii-filter';

const pii_filter = pf.make_pii_classifier(pf.languages.nl.make_lm());
const raw_str = 'Hallo Johan, mijn 06 is 0612345678, tot morgen.';
const sanitized_str = pii_filter.sanitize_str(raw_str, true);

console.log(sanitized_str);

// output:
'Hallo {first_name}, mijn 06 is {phone_number}, tot morgen.'

Sanitizing objects:

import * as pf from 'pii-filter';

const pii_filter = pf.make_pii_classifier(pf.languages.nl.make_lm());
const obj =
{
    message: 'Wilma de Vries, 20 november 1964',
    detail: 'Werking Paracetamol bij gebruik medicatie'
};
const sanitized_obj =   pii_filter.sanitize_obj(obj, true, false);

console.dir(sanitized_obj);

// output:
{
    message: '{first_name} {family_name}, {date}',
    detail: 'Werking {medicine_name} bij gebruik medicatie'
}

Parsing PII:

import * as pf from 'pii-filter';

const pii_filter = pf.make_pii_classifier(pf.languages.nl.make_lm());
const raw_str = 'Hallo Johan, mijn e-mail is [email protected] en mijn nummer is 0612345678, tot dan.';
const results = pii_filter.classify(raw_str);

for (let pii of results.pii)
    console.dir(pii);

// output:
{
    value: 'Johan',
    type: 'first_name',
    confidence: 0.755,
    severity: 0.4539742200500001,
    start_pos: 6,
    end_pos: 11
}
{
    value: '[email protected]',
    type: 'email_address',
    confidence: 1,
    severity: 0.2,
    start_pos: 28,
    end_pos: 41
}
{
    value: '0612345678',
    type: 'phone_number',
    confidence: 0.8512500000000001,
    severity: 0.35,
    start_pos: 60,
    end_pos: 70
}  

Main repository

For more information and access to used the datasets check out the main repository.