developingwork
How to evaluate the output of the FST, as a part of the developing work
Make an alias to the crk-catalog
Add this to your .bashrc or .profile:
alias crk="pushd ~/main/langs/crk"
Open a new terminal window: In any catalogue, write 'crk' ENTER.
Test that all noun lemmas are possible to generate as base form
sh test/data/testnounlemmas.sh
Look at fst-output
A certain form for all lemmas
grep ' AN' src/fst/stems/nouns.lexc |\ egrep -v '^(\!|LEX)' | tr ":" " " | cut -d " " -f1 |\ sed 's/$/+N+AN+Loc/' | dcrk | less
The paradigm for a certain lemma
cat test/data/NI-par.txt | sed 's/^/ôtênaw/' | dcrk | less
Run only one yaml-test
rm test/src/gt-norm-yamls/*
Get the yaml-file you want to test, e.g.:
svn up test/src/gt-norm-yamls/V-mato_gt-norm.yaml make check
Compare the lingvistic output of all yaml-tests for a certain PoS
rm test/src/gt-norm-yamls/*
Get the yaml-file you want to test, e.g.:
svn up test/src/gt-norm-yamls/U-all_gt-norm.yaml
make check
Make/update all yaml-tests in one for a certain PoS (and a certain pattern?)
head -11 test/src/gt-norm-yamls/V-AI-matow_gt-norm.yaml > test/src/gt-norm-yamls/U-all_gt-norm.yaml tail +11 test/src/gt-norm-yamls/V* | grep -v "==" >> test/src/gt-norm-yamls/U-all_gt-norm.yaml
This example is adding all nouns with final -y into one file:
head -11 test/src/gt-norm-yamls/N-AN-amisk_gt-norm.yaml > test/src/gt-norm-yamls/A-Ny-all_gt-norm.yaml tail +11 test/src/gt-norm-yamls/N*y_gt-norm.yaml | grep -v "==" >> test/src/gt-norm-yamls/A-Ny-all_gt-norm.yaml
Make a new yaml-file
head -12 test/src/gt-norm-yamls/N-AN-amisk_gt-norm.yaml\ > test/src/gt-norm-yamls/N-IN-otenaw_gt-norm.yaml cat test/data/NI-par.txt | sed 's/^/ôtênaw/' | dcrk |\ tr '\t' ':' | sed 's/:/: /' | grep -v '^$' |\ sed 's/^/ /' >> test/src/gt-norm-yamls/N-IN-otenaw_gt-norm.yaml
Comment: The last sed-command should give 5 whitespaces
Documentation on unix-commands