docu-lex.eng
Lexicon for the Komi analyser
The lexicon file format
The Komi lexicon files are used both for dictionary creation and for
The interplay between lexicon xml files, derived lexc files and morph files
The main komi file is kt/kom/src/kom-lex.txt.
During compilation, the entries from the xml files in the dictionary are extracted, and put
To take an example:
The file working_files/PRON-PERS_kom-lex.xml has an entry
<entry> <lemma>ме</lemma> <stem/> <contlex>PRON-PERS-SG1-NOM</contlex> <pos>PRON-PERS</pos> <article> <eng> <choice> <variant>I</variant> </choice> </eng> <fin> <choice> <variant>minä</variant> </choice> </fin> </article> </entry>
From this file, the compilation process derives a lexc file to the catalogue
LEXICON PRON-PERS ме PRON-PERS-SG1-NOM "I" ;
The file-name of the xml file (PRON-PERS) is the name of the continuation lexicon.
The lexicon files
The Komi lexicon files are found here (you may have to choose "show source code" in the browser):
-
A-SUPERL_kom-lex.xml
-
ADVROOT_kom-lex.xml
-
ADV_kom-lex.xml
-
A_kom-lex.xml
-
CONJ_kom-lex.xml
-
IDIOM_kom-lex.xml
-
INTERJ_kom-lex.xml
-
NUM_kom-lex.xml
-
N_kom-lex.xml
-
NounOBJ.txt
-
PART-ONOM_kom-lex.xml
-
PART_kom-lex.xml
-
POSTP_kom-lex.xml
-
PRON-DEM_kom-lex.xml
-
PRON-INDEF_kom-lex.xml
-
PRON-INTER_kom-lex.xml
-
PRON-PERS_kom-lex.xml
-
PRON-Q_kom-lex.xml
-
PRON-REFL_kom-lex.xml
-
PRON_kom-lex.xml
-
PROP_kom-lex.xml
-
V-INST_kom-lex.xml
-
V-REFL_kom-lex.xml
-
VNEG_kom-lex.xml
-
V_kom-lex.xml
-
examples.txt