disambiguator-syntax
This is the Eastern Mari disambiguation file. It chooses
The file first defines sentence delimiters, tags and sets.
Sentence delimiters
The delimiters are: "<.>" "<!>" "<?>" "<...>" "<¶>" sent
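As a rough illustration of what the delimiters do, the Python sketch below closes a sentence window after each delimiter wordform. The function name split_windows and the placeholder tokens are hypothetical; the real segmentation is done by the grammar parser, not by code like this.

```python
# Delimiter wordforms as listed above.
DELIMITERS = {"<.>", "<!>", "<?>", "<...>", "<¶>"}

def split_windows(wordforms):
    """Group wordforms into windows, closing a window after each delimiter."""
    windows, current = [], []
    for wf in wordforms:
        current.append(wf)
        if wf in DELIMITERS:
            windows.append(current)
            current = []
    if current:  # trailing material without a final delimiter
        windows.append(current)
    return windows

# Placeholder tokens, not real Mari wordforms.
tokens = ["<w1>", "<w2>", "<.>", "<w3>", "<?>"]
print(split_windows(tokens))
# [['<w1>', '<w2>', '<.>'], ['<w3>', '<?>']]
```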
The Tags section lists all the tags inherited from the fst, and defines them for
The next section, Sets, contains sets defined
Tags
Beginning and end of sentence
BOS
Clause boundary
Parts of speech tags
N
WORD is the set of all POS
Verbal tense and mood tags
Other verbal tags
Verbal person-number tags
Number tags
Sg
Case tags
Nom
Other nominal tags
Pers
Adjective comparison tags
Pos (?)
Possessive suffix tags
PxSg1
Numeral tags
Card
Particles
Punctuation marks
CLB
Derivation tags
Tags for internal testing
Sets
- CASE = all cases
- OBLCASE = all cases except Nom
- VFIN = all moods
Der/Date
Sem/Ani
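The set definitions can be read as plain set operations. A minimal Python sketch, assuming a small sample of case tags (Nom, Gen, Acc and Dat are all mentioned in this documentation, but they are not the complete inventory); the helper matches is hypothetical.

```python
# Sample case inventory for illustration only; the real CASE set is larger.
CASE = frozenset({"Nom", "Gen", "Acc", "Dat"})

# OBLCASE = all cases except Nom, i.e. a set difference.
OBLCASE = CASE - {"Nom"}

def matches(reading, tag_set):
    """A reading (a set of tags) matches a set if it carries any of its tags."""
    return not tag_set.isdisjoint(reading)

print(sorted(OBLCASE))                         # ['Acc', 'Dat', 'Gen']
print(matches({"N", "Sg", "Gen"}, OBLCASE))    # True
print(matches({"N", "Sg", "Nom"}, OBLCASE))    # False
```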
Rule section
Early, word-internal rules
- CmpTst removes CmpTst if not 1 N
- CmpTst2 selects CmpTst in all other cases
- NoFocPossNoun removes Foc/Poss if PxSg3
CC or Pcle
- teveteve1 gives CC if two теве
- onlyteve gives Pcle if two теве
- PcleNotCC gives Pcle at the beginning of a clause
- CCnotInterj
- Posna деч посна
- ikNum ик is never A
Particles
- InterrQ if question mark anywhere to the right
- Interr removes Rel if question mark to the right somewhere
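To make the "question mark to the right somewhere" condition concrete, here is a small Python model of the Interr rule. It is only a sketch of the rule's described behaviour, not the rule as written in the grammar: the cohort layout (a dict with a wordform and a list of tag sets) and the sample readings are assumptions for illustration.

```python
def remove(cohort, tag):
    """Remove readings carrying `tag`, but never delete the last reading."""
    kept = [r for r in cohort["readings"] if tag not in r]
    if kept:
        cohort["readings"] = kept

def interr(sentence):
    """Remove Rel wherever a question mark occurs somewhere to the right."""
    for i, cohort in enumerate(sentence):
        if any(c["wordform"] == "?" for c in sentence[i + 1:]):
            remove(cohort, "Rel")

# Hypothetical ambiguity between an interrogative and a relative reading.
sentence = [
    {"wordform": "mo", "readings": [{"Pron", "Interr"}, {"Pron", "Rel"}]},
    {"wordform": "?", "readings": [{"CLB"}]},
]
interr(sentence)
print(sentence[0]["readings"])  # only the Interr reading is left
```

The guard in remove mirrors the usual constraint-grammar safeguard that a rule may not strip a word of its last remaining reading.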
Verbs
Existential ulo
Infinitives
- Ind selects Ind if no Ind to the right or to the left
- 1SgAgr selects (Ind Sg1) if Pron1Sg to the right or to the left
- 1SgAgr selects (Ind Sg1) if (Pron1Sg Nom) to the right or to the left
- 2SgAgr selects (Ind Sg2) if Pron2Sg to the right or to the left
- IndAfterInf selects Ind if Inf to the left
- NotImpWhenInd
- NotImpWhenWords1
- NotImpWhenWords2
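The agreement rules above scan the whole sentence in both directions. A Python sketch of the second 1SgAgr variant follows; it assumes that Pron1Sg stands for readings tagged Pron Pers Sg1 (the set is not spelled out in this documentation), and the function names and sample cohorts are hypothetical.

```python
# Assumed expansion of (Pron1Sg Nom); the real set definition may differ.
PRON_1SG_NOM = {"Pron", "Pers", "Sg1", "Nom"}

def has_reading(cohort, tags):
    """True if some reading of the cohort carries all of `tags`."""
    return any(tags <= r for r in cohort["readings"])

def select(cohort, tags):
    """Keep only readings carrying all of `tags`; no-op if none does."""
    kept = [r for r in cohort["readings"] if tags <= r]
    if kept:
        cohort["readings"] = kept

def sg1_agr(sentence):
    """Select (Ind Sg1) if a 1Sg nominative pronoun occurs anywhere else."""
    for i, cohort in enumerate(sentence):
        others = sentence[:i] + sentence[i + 1:]
        if any(has_reading(c, PRON_1SG_NOM) for c in others):
            select(cohort, {"Ind", "Sg1"})

sentence = [
    {"wordform": "pron", "readings": [{"Pron", "Pers", "Sg1", "Nom"}]},
    {"wordform": "verb", "readings": [{"Ind", "Prt1", "Sg1"}, {"Ind", "Prt1", "Sg3"}]},
]
sg1_agr(sentence)
print(sentence[1]["readings"])  # only the Sg1 reading is left
```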
Adjectives
- RemAdjBeforeProp removes A if Prop to the left
- AdjBeforeMo selects A if Interr to the right
- AdjBeforeAN selects A if N or A to the right
- RemN removes N if N to the right
- AdjAfterNbeforeEOS selects A if N to the left and EOS to the right (predicative position)
- AdjBeforeConjAdj selects A if conjunction and A to the right
- AdjNotAdv removes Adv if N to the right
- AdjNotPron removes Pron Pers if N to the right
- AdjNotN removes N if Pron Pers anywhere to the left
- RemAdj1 removes A if no N or A follows
- RemAdj2 removes A if no N or Pron in a clause
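Several of the adjective rules look only at the neighbouring word. The Python sketch below models AdjBeforeAN under the assumption that "to the right" means the immediately following word; that reading, the helper names and the sample cohorts are all assumptions, not taken from the grammar itself.

```python
def select(cohort, tag):
    """Keep only readings carrying `tag`; no-op if no reading has it."""
    kept = [r for r in cohort["readings"] if tag in r]
    if kept:
        cohort["readings"] = kept

def next_has(sentence, i, tags):
    """True if the cohort at position i+1 has a reading with any of `tags`."""
    if i + 1 >= len(sentence):
        return False
    return any(r & tags for r in sentence[i + 1]["readings"])

def adj_before_an(sentence):
    """Select A if the next word can be N or A."""
    for i, cohort in enumerate(sentence):
        if next_has(sentence, i, {"N", "A"}):
            select(cohort, "A")

sentence = [
    {"wordform": "w1", "readings": [{"A"}, {"Adv"}]},
    {"wordform": "w2", "readings": [{"N", "Sg", "Nom"}]},
]
adj_before_an(sentence)
print(sentence[0]["readings"])  # the Adv reading has been discarded
```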
Nouns
- lym nalash "to take a name" = "to be given a title"
- RemNomIfPronLeft removes Nom if Pron Nom anywhere to the left
- RemNomIfPronRight removes Nom if Pron Nom anywhere to the right
- NomBeforeConjNom selects N Nom if conjoined with N Nom
- NafterDem selects N if Dem to the left (demonstratives tend to be sole modifiers)
- NotANoun
- NafterAbeforeEOS
- RemNafterAdv removes N if adverb to the left
Derivations
- RemDerMWN1 removes Der/MWN if N is an option
- RemDerMWN2 removes Der/MWN if N to the right
- Dersa if noun follows
- SelDerMWN select Der/MWN if no noun follows
Cases
- RemNomNif12left removes Nom with N if there is a verb with 1st or 2nd agreement to the left
- RemNomNif12right removes Nom with N if there is a verb with 1st or 2nd agreement to the right
- AccNeedsVerb prefers Nom (TODO: does this make sense? SASHA: it does but there was a typo, -1* instead of 1* in the third clause of the condition)
Proper nouns
Numerals
- IkNumN ik is num before N Sg
- IkNumAN ik is num before A N Sg
- KumNumAN kum is num before A N Sg
Pronouns
- NotImp in most тиде cases
- NotInterr if Rel
- Dem if noun follows
- уке
Conjunctions
Postpositions
- PoNeedsGen removes postposition if the word to the left is not Gen or Nom
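PoNeedsGen is a negated condition on the word immediately to the left. A Python sketch of that logic, assuming the postposition tag is Po (suggested by the rule name but not listed in this documentation) and using made-up cohorts:

```python
def po_needs_gen(sentence):
    """Remove Po readings unless the word to the left can be Gen or Nom."""
    for i, cohort in enumerate(sentence):
        left = sentence[i - 1]["readings"] if i > 0 else []
        # With no word to the left the condition "is Gen or Nom" fails too.
        left_ok = any(r & {"Gen", "Nom"} for r in left)
        if not left_ok:
            kept = [r for r in cohort["readings"] if "Po" not in r]
            if kept:  # never remove the last reading
                cohort["readings"] = kept

after_verb = [
    {"wordform": "w1", "readings": [{"Ind", "Sg3"}]},
    {"wordform": "w2", "readings": [{"Po"}, {"Adv"}]},
]
after_gen = [
    {"wordform": "w1", "readings": [{"N", "Sg", "Gen"}]},
    {"wordform": "w2", "readings": [{"Po"}, {"Adv"}]},
]
po_needs_gen(after_verb)
po_needs_gen(after_gen)
print(after_verb[1]["readings"])  # Po removed, Adv kept
print(after_gen[1]["readings"])   # both readings kept
```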
Adverbs
- molan awaiting rules for dative verbs subcategorising for mo Dat
Verbs
Finite verb or Gerundium
- RemGer removes Ger Gen if there is no verb to the right
- FinNotGer removes Ger if there is an Ind Prt2 Sg3 in the clause
- GerNotFin Ger if there is an Ind next
- GerNotFin Ger if there is a Ger da Ger VFin
First or third person
- Sg1NotSg3 removes Prt1 Sg3 when Pers Sg1 Nom in same clause
- Sg3NotSg1 removes Prt1 Sg1 when there is no Pers Sg1 Nom in same clause
- This is definitely too strong: it precludes zero Sg1 subjects
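The concern about zero subjects can be seen in a small Python model of the two rules. This is a simplified sketch: each list is treated as a single clause, and the cohorts and function names are hypothetical.

```python
PERS_SG1_NOM = {"Pers", "Sg1", "Nom"}

def remove(cohort, tags):
    """Remove readings carrying all of `tags`, never the last reading."""
    kept = [r for r in cohort["readings"] if not tags <= r]
    if kept:
        cohort["readings"] = kept

def clause_has_sg1_pronoun(clause):
    return any(PERS_SG1_NOM <= r for c in clause for r in c["readings"])

def sg1_not_sg3(clause):
    """Remove Prt1 Sg3 when a Pers Sg1 Nom is in the clause."""
    if clause_has_sg1_pronoun(clause):
        for c in clause:
            remove(c, {"Prt1", "Sg3"})

def sg3_not_sg1(clause):
    """Remove Prt1 Sg1 when no Pers Sg1 Nom is in the clause."""
    if not clause_has_sg1_pronoun(clause):
        for c in clause:
            remove(c, {"Prt1", "Sg1"})

def verb():
    # A verb form ambiguous between Prt1 Sg1 and Prt1 Sg3.
    return {"wordform": "verb",
            "readings": [{"Ind", "Prt1", "Sg1"}, {"Ind", "Prt1", "Sg3"}]}

with_pronoun = [{"wordform": "pron",
                 "readings": [{"Pron", "Pers", "Sg1", "Nom"}]}, verb()]
sg1_not_sg3(with_pronoun)

dropped_subject = [verb()]  # the Sg1 subject is simply not expressed
sg3_not_sg1(dropped_subject)

print(with_pronoun[1]["readings"])     # Sg1 reading survives
print(dropped_subject[0]["readings"])  # Sg1 reading is lost
```

In the second clause the Sg1 reading is discarded although a first-person subject may have been intended, which is the overgeneralisation noted above.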
ConNeg or not
- NoConNeg1 No ConNeg if no Neg to the left
- NoConNeg2 No ConNeg if another ConNeg to the left
да
- da1 Adv initially
- da2 CC elsewhere
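The two да rules form a position-based pair: the first covers the initial position and the second everything else. A Python sketch, assuming "initially" means the first word of the sentence and using placeholder cohorts:

```python
def select(cohort, tag):
    """Keep only readings carrying `tag`; no-op if no reading has it."""
    kept = [r for r in cohort["readings"] if tag in r]
    if kept:
        cohort["readings"] = kept

def disambiguate_da(sentence):
    for i, cohort in enumerate(sentence):
        if cohort["wordform"] != "да":
            continue
        if i == 0:
            select(cohort, "Adv")  # da1: Adv in initial position
        else:
            select(cohort, "CC")   # da2: CC elsewhere

def da():
    return {"wordform": "да", "readings": [{"Adv"}, {"CC"}]}

sentence = [da(), {"wordform": "w1", "readings": [{"N", "Sg", "Nom"}]}, da()]
disambiguate_da(sentence)
print(sentence[0]["readings"], sentence[2]["readings"])
```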
и
- iNotAbbr
Interjection
- NoExclNoInterj
Predicative
- AifVövny selects A if вӧвны somewhere to the left