@tricoteuses/cnen
v0.1.3
Retrieve & handle French CNEN (Conseil National d'Évaluation des Normes) documents
Tricoteuses-Cnen
Tricoteuses Cnen is free and open source software.
Installation
git clone https://git.tricoteuses.fr/logiciels/tricoteuses-cnen
cd tricoteuses-cnen/
npm install
Download data
Basic usage
Create a directory to store the data, then run the following command to scrape and download CNEN documents (délibérations, ordres du jour, relevés d'avis, rapports).
mkdir ../cnen-data/
npm run data:download ../cnen-data
This retrieves documents for the current year by default.
Available Commands
npm run data:download <dir>: Download all CNEN document types (délibérations, bilans & rapports, ODJ & avis)
npm run data:retrieve_deliberations <dir>: Retrieve délibérations only
npm run data:retrieve_bilans_rapports <dir>: Retrieve bilans & rapports only
npm run data:retrieve_odj_avis <dir>: Retrieve ordres du jour & relevés d'avis only
Common Options
dataDir (positional, mandatory): Path to the directory where CNEN data is stored
--all or -a: Retrieve all years (not just the current year)
--commit or -c: Commit data to git after download (detects changes since the last run)
--silent or -s: Disable logging
--verbose or -v: Enable verbose logging (lists each document)
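To make the option semantics concrete, here is a minimal sketch of how these arguments could be interpreted. It is not taken from the project's source: `CliOptions`, `FLAGS`, and `parseCliArgs` are hypothetical names, and rejecting unknown flags is an assumption.

```typescript
// Hypothetical option shape: field names mirror the documented flags,
// not the project's actual source code.
interface CliOptions {
  dataDir: string;
  all: boolean;
  commit: boolean;
  silent: boolean;
  verbose: boolean;
}

type BooleanFlag = "all" | "commit" | "silent" | "verbose";

// Map each documented long and short flag to its option name.
const FLAGS = new Map<string, BooleanFlag>([
  ["--all", "all"], ["-a", "all"],
  ["--commit", "commit"], ["-c", "commit"],
  ["--silent", "silent"], ["-s", "silent"],
  ["--verbose", "verbose"], ["-v", "verbose"],
]);

function parseCliArgs(argv: string[]): CliOptions {
  const options: CliOptions = {
    dataDir: "", all: false, commit: false, silent: false, verbose: false,
  };
  for (const arg of argv) {
    const flag = FLAGS.get(arg);
    if (flag !== undefined) {
      options[flag] = true;
    } else if (arg.startsWith("-")) {
      // Assumption: unknown flags are rejected rather than ignored.
      throw new Error(`Unknown option: ${arg}`);
    } else if (options.dataDir === "") {
      // The first non-flag argument is the mandatory data directory.
      options.dataDir = arg;
    } else {
      throw new Error(`Unexpected extra argument: ${arg}`);
    }
  }
  if (options.dataDir === "") {
    throw new Error("Missing mandatory dataDir argument");
  }
  return options;
}
```

For example, `parseCliArgs(["../cnen-data", "--all", "-v"])` yields an object with `dataDir` set to `"../cnen-data"` and both `all` and `verbose` set to `true`.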
Examples
# Download current year data
npm run data:download ../cnen-data
# Download all historical data with verbose output and git commit
./node_modules/.bin/tsx src/scripts/data-download.ts ../cnen-data --all --verbose --commit
# Retrieve only délibérations for all years
./node_modules/.bin/tsx src/scripts/retrieve_deliberations.ts ../cnen-data -a
# Retrieve only ODJ & avis silently
./node_modules/.bin/tsx src/scripts/retrieve_odj_avis.ts ../cnen-data -s
Data structure
Downloaded data is organized as follows:
cnen-data/
├── Avis/
│   └── 2026/
│       ├── CNENAVIS20260108.pdf
│       ├── CNENAVIS20260122.pdf
│       └── ...
├── Deliberations/
│   └── 2026/
│       ├── CNENDELIB20260122.pdf
│       └── ...
├── OrdreDuJour/
│   └── 2026/
│       ├── CNENODJ20260108.pdf
│       └── ...
├── Rapports/
│   └── 2024/
│       └── CNENRAPPORT20240101.pdf
└── manifest.json
The manifest.json file indexes all downloaded documents with their metadata (date, type, séance type, source URL, etc.). It enables incremental downloads: re-running the scraper skips documents already present in the manifest.
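The layout and the incremental behaviour can be sketched as follows. This is an illustration only. The prefix-to-directory mapping is read off the example file names above. The `ManifestEntry` shape and the helper names are hypothetical, as is the choice of the source URL as lookup key. The real manifest.json schema may differ.

```typescript
// Directory names and file name prefixes as they appear in the tree above.
const DIRECTORY_BY_PREFIX = new Map<string, string>([
  ["AVIS", "Avis"],
  ["DELIB", "Deliberations"],
  ["ODJ", "OrdreDuJour"],
  ["RAPPORT", "Rapports"],
]);

// Hypothetical manifest entry: the real schema holds more metadata
// (date, type, séance type, ...) and may use different field names.
interface ManifestEntry {
  sourceUrl: string;
  path: string;
}

// Derive "Avis/2026/CNENAVIS20260108.pdf" from "CNENAVIS20260108.pdf".
function relativePathFor(fileName: string): string {
  const match = /^CNEN([A-Z]+)(\d{4})\d{4}\.pdf$/.exec(fileName);
  if (match === null) {
    throw new Error(`Unrecognized file name: ${fileName}`);
  }
  const [, prefix = "", year = ""] = match;
  const directory = DIRECTORY_BY_PREFIX.get(prefix);
  if (directory === undefined) {
    throw new Error(`Unknown document type prefix: ${prefix}`);
  }
  return `${directory}/${year}/${fileName}`;
}

// Keep only the documents whose source URL is not yet in the manifest,
// so that a second run downloads nothing that is already indexed.
function selectNewDocuments(
  manifest: ManifestEntry[],
  candidateUrls: string[],
): string[] {
  const known = new Set(manifest.map((entry) => entry.sourceUrl));
  return candidateUrls.filter((url) => !known.has(url));
}
```

Keying the lookup on the source URL rather than on the file path is a design choice made for this sketch: it lets the skip decision happen before any file name is computed.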
Source data
Documents are scraped from the official CNEN website: https://www.cnen.dgcl.interieur.gouv.fr/
Three pages are crawled:
The site is built on Ploopi CMS and has no public API — all data is retrieved via HTML scraping.
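As a rough idea of what HTML scraping involves here, the sketch below collects PDF links from a page's markup. It is not the project's implementation. `extractPdfLinks` is a hypothetical helper, and it assumes documents are exposed as `href` attributes whose path ends in `.pdf`, which may not match how the CNEN site actually serves its files.

```typescript
// Collect the absolute URLs of PDF links found in an HTML page.
// A regular expression keeps the sketch dependency-free; a real scraper
// would more likely rely on a proper HTML parser.
function extractPdfLinks(html: string, pageUrl: string): string[] {
  const links = new Set<string>();
  const hrefPattern = /href\s*=\s*["']([^"']+)["']/gi;
  let match: RegExpExecArray | null;
  while ((match = hrefPattern.exec(html)) !== null) {
    const href = match[1] ?? "";
    let absolute: URL;
    try {
      // Resolve relative links against the URL of the page they came from.
      absolute = new URL(href, pageUrl);
    } catch (error) {
      continue; // Skip malformed links.
    }
    // Assumption: documents are recognizable by a ".pdf" path suffix.
    if (absolute.pathname.toLowerCase().endsWith(".pdf")) {
      links.add(absolute.href); // The Set removes duplicate links.
    }
  }
  return Array.from(links);
}
```

Called with the markup of a crawled page and that page's URL, the function returns each distinct PDF URL once, in the order of first appearance.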
