Error Classification
Diff list
- Åttjutjit tevstajt list-hábmáj:
cat texts/TextB_smj_eval1.txt |preprocess > list/smj_eval1.list
- Buohtastahttet mt-tevstav ja divoduvvam tevstav:
diff -y list/smj_mt.list list/smj_eval1.list|grep '[><|]'|tr -s '\t'|tr -s ' '|sed 's/^/ /' >> wer_analysis.csv
Diff list boados
Buore | Vuogas ållo | enap merrasáme < > merrasámij > dáfojn ulmusjlåhko | lassánimev lassán | ulmusjlågon Dav | Boados guoradallama | boahtá vuosedi | åvddån maj | guoradallamin > majt
< = gádoduvvam báhko > = laseduvvam báhko | = målssum bágov (lexical selection)
Kategorijja
1 = lexical selection 2 = difference in generation (same wordform, but different shape) 3 = difference in choice of form (different wordform selected) (ax-ay) 4 = word order changed (ab-ba) 5 = punctuation 6 = word added (0-a) 7 = word deleted (a-o)
Mierkki riekta lågujn sæmmi linjan gå rievddadus. Jus li avtan bágon moadda rievddadusá laseduvvi divna lågå dan sæmmi linnjaj:
1 Buore | Vuogas 1 ållo | enap 4,3 merrasáme < - > merrasámij 6 > dáfojn
Diff lissta ij agev vatte riekta gåvåv rievddadusájs. Vuogas le de divodasstet diff listav vaj tjielggasap vuojnná jur mij la rievdaduvvam ja gåktu:
1 Dav | Boados 4,3 guoradallama < 1 vuosedi | boahtá 6 > åvddån - > guoradallamin 3 maj | majt
Diff-lista divna báhkopáraj (aj daj ma ælla rievdaduvvam):
diff -y list/smj_mt.list list/smj_eval1.list|tr -s '\t'|tr -s ' '|sed 's/^/ /' | see