@fidel-tools/lang-am
v0.1.6
Published
Amharic language pack for Fidel Tools
Maintainers
Readme
@fidel-tools/lang-am
The curated Amharic language pack and schema configuration for Fidel Tools.
Features
Contains a comprehensive, schema-compliant definition of Amharic linguistic rules, containing:
meta: Versioning, authors, and script information.normalization: Key-value pairs mapping Amharic homophones, labialized sequences (ቷ->ቱዋ), and gemination repetition rules.tokenization: Sentence boundaries, punctuation, and578+abbreviation expansion rules.stopwords: A cleaned, academic stopword list containing435+unique words.stemmer: Prefix and suffix rules for light stemming, alongside protected terms (e.g. proper nouns likeኢትዮጵያ).transliteration: SERA and Felig ASCII transliteration schemas.
Installation
pnpm add @fidel-tools/lang-amUsage
Pass the Amharic language pack into the @fidel-tools/core Pipeline:
import { Pipeline } from '@fidel-tools/core'
import amPack from '@fidel-tools/lang-am'
const nlp = new Pipeline(amPack)
console.log(nlp.normalize("ሐኪም ኀይሉ")) // "ሃኪም ሃይሉ"