How To AddA New Language
Adding a new language to the Github infrastructure
Languages reside within the GiellaLT organisation,
Prerequisites
You need gut to be able to add a new language
How to add a new language
Follow these instructions.
When done, also add entries in the registry and check the README file.
To get spellers via the Divvun Manager (nightly channel only in the beginning), there are two more steps:
- fill in details in the manifest.toml.in file
- set up github actions (for now, should be automatic in the future)
- create an entry in pahkat (description forthcoming)
Result
The above steps will create a new directory for the specified language, and
To start doing real work, you must do one set of preparations still:
cd lang-LANGCODE ./autogen.sh ./configure
Now you can start editing the source files, and whenever you want to make sure
Setting up the documentation page for the new language
The new language must also be added to
Adding a new language to the $GTBIG/prooftesting dir
The procedure is the same as above, but by adding a template to the command:
-
cd $GTHOME/langs
- $GTCORE/scripts/new-language.sh LANGCODE [[TEMPLATECOLL]
where
- TEMPLATECOLL (optional, usually automatically identified) is the name of the
-
prooftesting - templates for populating directories for testing proofing
-
prooftesting - templates for populating directories for testing proofing
This directory contains infrastructure for testing proofing tools for a number
- at least one speller gold-standard document for the targeted language, stored
$GTFREE/[pre]stable/goldstandard/converted/
- a speller lexicon available in the test infra for that language
- a command line speller for the lexicon(s) in the test infra
The command to set up the basic testing infrastructure for a new language is
-
cd $GTBIG/prooftesting
- $GTCORE/scripts/new-language.sh LANGCODE
Result
A new language will be added to the testing infrastructure, ready to be
cd LANGCODE ./autogen.sh ./configure
If everything is ok, at least one of the speller tests listed at the end of the