kca
Free and Open source Khanty analyser giella-kca
- Authors
- Divvun and Giellatekno teams, community members
- Software version
- 2012
- Documentation license
- GNU GFDL
- SVN Revision
- $Revision
: 68217 $ - SVN Date
- $Date
: 2013-01-16 11: 31: 33 +0200 (Wed, 16 Jan 2013) $
giella-kca
This is free and open source Khanty morphology.
Morphology
Analysis symbols
These letters are hopefully are not a problem
The parts-of-speech are:
The parts of speech are further split up into:
The Usage extents are marked using following tags:
The dialect variants are expressed using the following tags:
The nominals are inflected in the following Case and Number
The possession is marked as such:
Other verb forms are
- +Symbol = independent symbols in the text stream, like £, €, ©
Question and Focus particles:
Semantics are classified with
Derivations are classified under the morphophonetic form of the suffix, the
Morphophonology
- +Symbol ! used in possessor indices
- +Symbol ! used in possessor indices
Symbols that need to be escaped on the lower side (towards twolc):
- »7
- Literal »
- «7
- Literal «
%[%>%] - Literal > %[%<%] - Literal <
And following triggers to control variation
Flag diacritics
@P.NeedNoun.ON@ | (Dis)allow compounds with verbs unless nominalised |
@D.NeedNoun.ON@ | (Dis)allow compounds with verbs unless nominalised |
@C.NeedNoun@ | (Dis)allow compounds with verbs unless nominalised |
For languages that allow compounding, the following flag diacritics are needed
@P.CmpFrst.FALSE@ | Require that words tagged as such only appear first |
@D.CmpPref.TRUE@ | Block such words from entering ENDLEX |
@P.CmpPref.FALSE@ | Block these words from making further compounds |
@D.CmpLast.TRUE@ | Block such words from entering R |
@D.CmpNone.TRUE@ | Combines with the next tag to prohibit compounding |
@U.CmpNone.FALSE@ | Combines with the prev tag to prohibit compounding |
@P.CmpOnly.TRUE@ | Sets a flag to indicate that the word has passed R |
@D.CmpOnly.FALSE@ | Disallow words coming directly from root. |
Use the following flag diacritics to control downcasing of derived proper
@U.Cap.Obl@ | Allowing downcasing of derived names: deatnulasj. |
@U.Cap.Opt@ | Allowing downcasing of derived names: deatnulasj. |
The word forms in Khanty language start from the lexeme roots of basic