131220

Questions:

jussive mood (möönev kõneviis?)? Ülle Viks does not have one, EKKR09 has. Is there one or do we really use 3rd ps imperative instead?

(verb) derivations. EKI lexicon (?) and Heli's original system apply phonological/two level rules to both sides of lexicon, so that we get entries for both (eg.) lugema (verb, to read) and loetu (noun, something that has been read) from the same lexicon entry (luG1e 28;). I had a bit of a problem explaining the motivation to Sjur. Solutions seem to be to either generate different lexicon entries -- luge: luG1e 28; and loetu 01; or use flags to mark such productive derivations, something like lugema+V+Der/tu+S (and then it is up to the user of analyze to know that the lexicon-form of that word is really loetu. As the semantic relation is quite close, it might even be easier eg to use for translation in some cases?) Sjur had a article to finish, as it is something that is relevant not only for Estonian, he promised to think about how one should deal with such problems in general in context of GT infra.

Distinguishing homonyms -- palk/palgi vs palk/palga. At the moment there is no way to distinguish some words in their lexicon form although the nominative (and thus the lexicon form) is almost the only case of homonymy. It is usual for GT infra to mark different stems with flags like +Hom1, +Hom2 etc.

Parallel forms. There are words with possible parallel forms like short and long illative, multiple forms for plural etc. Francis Tyers argued that it would be handy if generator-fst would generate just one (preferred) surface form. And if there is no single one preferred form then MT solutions (and probably everything else that generates surface forms) still needs some formalized rules to choose between forms.

(do we need a new lexicon format instead of EKI's stem database / morphological database? One possible way would be to use lexc as format but I'm not sure if that is the best idea)

Rules of compounding. plamk's way of combining machines is somewhat incompatible with gt infra. Some new way to describe compounding rules is needed. OTOH, Filosoft's rules should be partially published by now, should we try to use these somehow?