Variation_in_lexc
How to handle variation in lexc
Non-normative forms:Err/Orth
Non-normative variant with inflection
fáláhas:fáláhass JOHTOLAT ;
The descriptive FST will inflect both fálahas and fáláhas, but the line with the tag Err/Orth is removed from the normative analyser/generator during the compilation prosess.
fáláhasat fáláhasat fáláhas+N+Pl+Nom fálahasat fálahasat fáláhas+Err/Orth+N+Pl+Nom
The normative analyser:
fáláhasat fáláhasat fáláhas+N+Pl+Nom fálahasat fálahasat fálahasat +?
Non-normative variant without inflection (hard coding)
The normative form on the left side, and so the lemma in the analysis will be a normative form and can be found e.g. in the dictionary.
fáláhas:fáláhass JOHTOLAT ;
The word itself is non-normative:Err/Lex
brillefutterála+Err/Lex:brille#futterál SOSIAL
The descriptive FST will inflect brillefutterála, but the line with the tag Err/Lex is removed from the normative analyser/generator during the compilation prosess.
brillefutterálat brillefutterálat brillefutterála+N+Pl+Nom
The normative analyser:
brillefutterálat brillefutterálat brillefutterálat +?
Lexical homonymi:how to identify the correct lemma e.g. in a dictionary
The lemmas belong to different stem-categories:Add morphogical tags
beassi:beassi BEARRI "reir" ;
Analysis:
beassi beassi beassi+N+G3+Sg+Nom beassi beassi+N+G3+Sg+Acc beassi beassi+N+G3+Sg+Gen beassi beassi+N+Sg+Nom beasi beasi beassi+N+Sg+Gen beasi beassi+N+Sg+Acc
Example from North Saami. NomAg tag for derivation Nomen Agent
vuovdi+NomAg:vuovdi ACTOR "salesman" ;
Analysis:
vuovdi vuovdi vuovdi+N+NomAg+Sg+Nom vuovdi vuovdi+N+NomAg+Sg+Acc vuovdi vuovdi+N+NomAg+Sg+Gen vuovdi vuovdi+N+Sg+Nom vuovddi vuovddi vuovdi+N+Sg+Gen vuovddi vuovdi+N+Sg+Acc
There is no reason for adding morphogical tags:Add homonymi tags
govledh+Hom1:govl TJOEHPEDH_TV "höra" ;
Analysis:
gåvla gåvla govledh+Hom1+V+TV+Ind+Prs+Sg3 govloe govloe govledh+Hom2+V+IV+Ind+Prs+Sg3
Orthograpic variants of the same lemma:tags v1, v2...
Orthograpic variants of the same lemma, for base form and at least parts of the inflection paradigm, should be under the same lemma. But we can add a variants tag as a help to recognize the correct base form for the paradigm.
Example from North Saami.
mandáhta+v2:mandáhtta GOAHTI-A ;
If the base forms are identical, but there are variants in the inflection, we don't use these tags.