fr-spell

v1.0.9

Published

a month ago

Lightweight French lemmatization and conjugation. Predict lemmas and generate inflected forms of nouns, verbs, and adjectives using fast, quantized INT8 ONNX models, each under 2 MB.

cndv3996

french Le français word lemma word derivative lemmatization conjugation spelling french noun french verb french adjective

FR-SPELL

FR-SPELL is an npm package for French lemma prediction and derivative form generation. It supports:

lemma prediction from conjugated or inflected forms
noun form generation
adjective form generation
verb form generation

The package runs on ONNX Runtime with quantized INT8 models, which keeps inference fast and the model footprint small.

Version Comparison

fr-spell is the free community version. The table below compares it with the smaller fr-spell-mini variant.

| Metric | fr-spell (community) | fr-spell-mini |
|---|---|---|
| Lemma model size | 1.48 MB | 0.96 MB |
| Derive model size | 1.40 MB | 0.91 MB |
| Total model size | ~2.88 MB | ~1.87 MB |
| Lemma accuracy | 97% (97/100) | 99% (99/100) |
| Noun derive accuracy | 100% (100/100) | 99% (99/100) |
| Verb derive accuracy | 100% (100/100) | 100% (100/100) |
| Adjective derive accuracy | 100% (100/100) | 100% (100/100) |
| Lemma avg latency | 21.97 ms | 16.23 ms |
| Noun derive avg latency | 23.19 ms | 17.10 ms |
| Verb derive avg latency | 22.93 ms | 16.89 ms |
| Adjective derive avg latency | 23.22 ms | 17.12 ms |
| Pricing | Free | $4.99 |
| Payment | Not required | Purchase required |

Install

npm install onnxruntime-node
npm install fr-spell

Integrate Into Your Project

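import { FrSpell } from 'fr-spell';

// Create the predictor once and reuse it. Top-level await requires an ES module.
const predictor = await FrSpell();

// Inflected form to lemma.
const lemma = await predictor.lemma('mangeons');
// Lemma to inflected form. For nouns and adjectives, person selects gender and number.
const noun = await predictor.nounDerive('chat', 'THD_PLF');
const adje = await predictor.adjeDerive('beau', 'THD_F');
// Verbs also take a mode and a tense.
const verb = await predictor.verbDerive('manger', 'FST_PL', 'INDI', 'PRES');

console.log(lemma);
console.log(noun);
console.log(adje);
console.log(verb);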

Sample runtime output:

{ input: 'mangeons', lemma: 'manger', wordType: 'VERB', confidence: 0.9965604285, timeMs: 3.89 }
{ lemma: 'chat', wordType: 'NOUN', person: 'THD_PLF', mode: 'ALL', tense: 'ALL', output: 'chattes', confidence: 0.9997230679, timeMs: 5.06 }
{ lemma: 'beau', wordType: 'ADJE', person: 'THD_F', mode: 'ALL', tense: 'ALL', output: 'belle', confidence: 0.9999751771, timeMs: 3.08 }
{ lemma: 'manger', wordType: 'VERB', person: 'FST_PL', mode: 'INDI', tense: 'PRES', output: 'mangeons', confidence: 0.9999864523, timeMs: 4.79 }
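
Each result carries a confidence value, which a caller can use to decide whether to trust a prediction. The following is a minimal sketch only: the helper name `acceptIfConfident` and the 0.9 threshold are assumptions for illustration and are not part of fr-spell.

```javascript
// Hypothetical helper (not part of fr-spell): returns the predicted string
// when confidence meets the threshold, otherwise the caller's fallback.
function acceptIfConfident(result, fallback, threshold = 0.9) {
  // lemma() results carry `lemma` only; derive results also carry `output`.
  const predicted = result.output ?? result.lemma;
  return result.confidence >= threshold ? predicted : fallback;
}

// Objects copied from the sample runtime output above.
const lemmaResult = { input: 'mangeons', lemma: 'manger', wordType: 'VERB', confidence: 0.9965604285, timeMs: 3.89 };
const deriveResult = { lemma: 'beau', wordType: 'ADJE', person: 'THD_F', mode: 'ALL', tense: 'ALL', output: 'belle', confidence: 0.9999751771, timeMs: 3.08 };

console.log(acceptIfConfident(lemmaResult, 'mangeons')); // manger
console.log(acceptIfConfident(deriveResult, 'beau'));    // belle
```

A suitable threshold depends on the application; 0.9 is an arbitrary starting point, not a value recommended by the package.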

Browser Usage

<script src="./frspell.browser.js"></script>
<script>
	(async () => {
		const predictor = await window.FrSpell({
			modelBasePath: './models/community'
		});
		const result = await predictor.lemma('mangeons');
		console.log(result);
	})();
</script>

Prediction Parameters

Lemma prediction:

API: predictor.lemma(input)
input: string, inflected/conjugated word form, for example mangeons

Derive prediction:

Noun API: predictor.nounDerive(lemma, person)
Adjective API: predictor.adjeDerive(lemma, person)
Verb API: predictor.verbDerive(lemma, person, mode, tense)
Generic API: predictor.derive(lemma, wordType, person, mode, tense)
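
When the word type is only known at run time, a small router can pick the matching type-specific method. This is a sketch under assumptions: the function name `deriveByType` and the request-object shape are hypothetical, and it does not claim to reproduce how `predictor.derive` works internally.

```javascript
// Hypothetical router (not part of fr-spell): picks the documented
// type-specific method from a plain request object.
async function deriveByType(predictor, { lemma, wordType, person, mode, tense }) {
  switch (wordType) {
    case 'NOUN':
      return predictor.nounDerive(lemma, person);
    case 'ADJE':
      return predictor.adjeDerive(lemma, person);
    case 'VERB':
      // Only verbs take a mode and a tense.
      return predictor.verbDerive(lemma, person, mode, tense);
    default:
      throw new RangeError(`Unsupported wordType: ${wordType}`);
  }
}
```

With a predictor created by `await FrSpell()`, calling `await deriveByType(predictor, { lemma: 'manger', wordType: 'VERB', person: 'FST_PL', mode: 'INDI', tense: 'PRES' })` makes the same `verbDerive` call as the integration example.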

Allowed wordType values:

NOUN (noun)
ADJE (adjective)
VERB (verb)

Allowed person values:

FST (1st person singular)
SND (2nd person singular)
THD_M (3rd person masculine singular)
THD_F (3rd person feminine singular)
FST_PL (1st person plural)
SND_PL (2nd person plural)
THD_PLM (3rd person masculine plural)
THD_PLF (3rd person feminine plural)

Allowed mode values:

INDI (indicative)
SUBJ (subjunctive)
COND (conditional)
PART (participle)
IMPE (imperative)
INFI (infinitive)

Allowed tense values in current implementation:

PRES (present)
IMPA (imperfect)
FUTU (future)
PASS (past)

Note:

The original grammar definition file includes more tense names, but this package currently supports only PRES, IMPA, FUTU, and PASS.
nounDerive and adjeDerive take only lemma and person; mode and tense are not needed, and the result reports both as ALL.
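
Because the accepted values are fixed strings, arguments can be checked before calling the model. The sketch below copies the value lists from this section; the function name `checkVerbArgs` is hypothetical. How the package itself reacts to an unsupported value is not documented here, so this check is a precaution, not a description of its behavior.

```javascript
// Allowed values copied from the lists above.
const PERSONS = new Set(['FST', 'SND', 'THD_M', 'THD_F', 'FST_PL', 'SND_PL', 'THD_PLM', 'THD_PLF']);
const MODES = new Set(['INDI', 'SUBJ', 'COND', 'PART', 'IMPE', 'INFI']);
const TENSES = new Set(['PRES', 'IMPA', 'FUTU', 'PASS']);

// Hypothetical pre-flight check (not part of fr-spell): returns a list of
// problems, which is empty when every argument is a documented value.
function checkVerbArgs(person, mode, tense) {
  const problems = [];
  if (!PERSONS.has(person)) problems.push(`unknown person: ${person}`);
  if (!MODES.has(mode)) problems.push(`unknown mode: ${mode}`);
  if (!TENSES.has(tense)) problems.push(`unsupported tense: ${tense}`);
  return problems;
}

console.log(checkVerbArgs('FST_PL', 'INDI', 'PRES')); // []
console.log(checkVerbArgs('FST_PL', 'INDI', 'PLUS')); // [ 'unsupported tense: PLUS' ]
```

The check only confirms that each value is individually documented. It does not verify that a given combination of person, mode, and tense corresponds to a real French form.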

Benchmark Result (Latest Local Run)

Results:

lemma from conjugation: 97/100, accuracy 97.00%, average 21.97 ms
noun derive: 100/100, accuracy 100.00%, average 23.19 ms
verb derive: 100/100, accuracy 100.00%, average 22.93 ms
adjective derive: 100/100, accuracy 100.00%, average 23.22 ms
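
Figures of this kind can be reproduced locally with a small harness. The sketch below is not the author's benchmark script: the function name `benchmarkLemma` and the `{ input, expected }` case format are assumptions. It measures wall-clock time around each `predictor.lemma` call, not the `timeMs` field the package reports.

```javascript
// Hypothetical harness (not the author's benchmark script): runs
// predictor.lemma over labelled cases and reports accuracy and mean latency.
async function benchmarkLemma(predictor, cases) {
  let correct = 0;
  let totalMs = 0;
  for (const { input, expected } of cases) {
    const start = performance.now();
    const result = await predictor.lemma(input);
    totalMs += performance.now() - start;
    if (result.lemma === expected) correct += 1;
  }
  return {
    correct,
    total: cases.length,
    accuracy: (100 * correct) / cases.length, // percent
    averageMs: totalMs / cases.length,
  };
}
```

With a predictor from `await FrSpell()` and a case list such as `[{ input: 'mangeons', expected: 'manger' }]`, the returned object has the same shape as the result lines above (correct count, total, accuracy, average latency).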

Model Size

current default (community) lemma ONNX model: models/community/lemma_type_model.int8.onnx = 1.48 MB
current default (community) derive ONNX model: models/community/derive_form_model.int8.onnx = 1.40 MB
current default total ONNX model size: about 2.88 MB

Mini version note:

mini lemma ONNX model target: 0.96 MB
mini derive ONNX model target: 0.91 MB
mini total ONNX model target: about 1.87 MB
The mini model package is planned to be published soon.

Why It Is Great For Web Frontend Products

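high accuracy in the latest local benchmark: 97% for lemma prediction and 100% for noun, verb, and adjective derivation (100 test items per task)
low per-request latency (about 22 to 23 ms on average in the latest local benchmark)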
current default ONNX footprint is compact (about 2.88 MB total), with a smaller mini model package (about 1.87 MB) coming soon
ideal for backend inference powering web frontend features such as live writing assistance, grammar hints, and lemma-aware search
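
For features such as lemma-aware search, the same words tend to recur, so caching lemma lookups avoids repeated model calls. The following is a sketch under assumptions: `createLemmaCache` and `lemmatizeQuery` are hypothetical names, lowercasing the input and splitting on whitespace are simplifications, and none of this is part of fr-spell.

```javascript
// Hypothetical wrapper (not part of fr-spell): caches lemma lookups so
// repeated words in live text or search queries reach the model only once.
function createLemmaCache(predictor) {
  const cache = new Map();
  return async function lemmaOf(word) {
    const key = word.toLowerCase(); // assumption: lookups are case-insensitive
    if (!cache.has(key)) {
      // Cache the promise so concurrent lookups of one word share one call.
      const pending = predictor.lemma(key).then((r) => r.lemma);
      pending.catch(() => cache.delete(key)); // do not keep failed lookups
      cache.set(key, pending);
    }
    return cache.get(key);
  };
}

// Map each word of a query to its lemma, preserving word order.
async function lemmatizeQuery(lemmaOf, query) {
  const words = query.split(/\s+/).filter(Boolean);
  return Promise.all(words.map((w) => lemmaOf(w)));
}
```

With `const lemmaOf = createLemmaCache(await FrSpell())`, a call such as `await lemmatizeQuery(lemmaOf, 'nous mangeons')` returns one lemma per word, and a later query containing `mangeons` reuses the cached result.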
