Northern Haida Disambiguation
This file gives an overview of some still ad hoc solutions for disambiguation.
Prerequisites:
- vislcg3 installed
- A text corpus. The one we have,
Making missinglist
cat test/data/haida_grammar.utf8corpus.txt | grep "#" | cut -c3- | \
  tr ' ' '\n' | tr '=' ' ' | uhdn | grep '?' | cut -f1 | sort | \
  uniq -c | sort -nr > doc/data/missing.haidagrammarcorpus
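The counting stage at the end of the missing-list command can be tried in isolation. The sketch below is only an illustration: the input is an invented stand-in for analyser output, and it assumes the usual lookup-style format (tab-separated, word form in the first field, `+?` marking words without an analysis). The function name `count_missing` is hypothetical.

```shell
# Count unanalysed word forms, most frequent first.
# Assumes TAB-separated input with the word form in field 1
# and "+?" on lines for words that got no analysis.
count_missing() {
  grep '?' |   # keep only the unanalysed words
  cut -f1 |    # keep the word form
  sort |
  uniq -c |    # count each distinct form
  sort -nr     # highest count first
}

# Invented sample, not real Haida data.
missing=$(printf 'foo\tfoo\t+?\nbar\tbar+N+Sg\nfoo\tfoo\t+?\nbaz\tbaz\t+?\n' |
  count_missing)
printf '%s\n' "$missing"
```

The result lists each missing form with its frequency, so the forms at the top of the list are the ones most worth adding to the lexicon first.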
How to analyse
Haida differs from the other languages in not having an adjusted
Note the = symbol: it marks multiword expressions (MWEs) in the text.
cat test/data/haida_grammar.utf8corpus.txt | grep "#" | cut -c3- | \
  tr ' ' '\n' | tr '=' ' ' | uhdn | lookup2cg | \
  vislcg3 -g src/syntax/disambiguation.cg3
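To see what the tokenising front end of this pipeline does before the analyser is called, its first four steps can be run on a made-up line. This is a sketch under assumptions read off the command above: example lines contain `#` and begin with two characters that are cut away. The sample words are placeholders, not Haida, and the function name `tokenise` is hypothetical.

```shell
# The first four stages of the analysis pipeline, without the analyser.
tokenise() {
  grep "#" |      # keep only the example lines
  cut -c3- |      # drop the first two characters
  tr ' ' '\n' |   # one token per line
  tr '=' ' '      # turn the MWE marker back into a space
}

# Invented input: one example line and one line that is skipped.
tokens=$(printf '%s\n' '# aa bb=cc dd' 'commentary line, skipped' | tokenise)
printf '%s\n' "$tokens"
```

Because the = is replaced only after the split on spaces, `bb=cc` reaches the analyser as the single token `bb cc` rather than as two separate words.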
Strategies for disambiguation
Look at common ambiguity patterns
To create similar statistics, use the sum-cg.pl script (run sum-cg.pl --help to see how to use it).
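For a quick first impression of how much ambiguity remains, a rough count can be made directly on CG-format output. This is not sum-cg.pl and does not reproduce its output; it is a minimal stand-in that assumes the usual CG stream layout, where a cohort line starts with `"<` and each reading is an indented line starting with a quoted lemma. The function name `count_ambiguous` and the sample cohorts are hypothetical.

```shell
# Report how many cohorts still have more than one reading.
count_ambiguous() {
  awk '
    # Close the cohort read so far and update the counters.
    function close_cohort() { if (seen) { total++; if (n > 1) amb++ } }
    /^"</      { close_cohort(); seen = 1; n = 0; next }  # new word form
    /^[ \t]+"/ { n++ }                                    # one reading
    END        { close_cohort()
                 printf "%d of %d cohorts ambiguous\n", amb, total }
  '
}

# Invented CG-style sample: "aa" has two readings, "bb" has one.
summary=$(printf '"<aa>"\n\t"aa" N\n\t"aa" V\n"<bb>"\n\t"bb" N\n' |
  count_ambiguous)
printf '%s\n' "$summary"
```

Piping the output of the vislcg3 command into such a function before and after a rule change gives a crude measure of whether the change reduced ambiguity.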