Error report

Removing files that only have ordinary linguistic errors.
Removing files that have separate Bugzilla reports.

:::/home/apache_corpus/freecorpus/converted/sme/admin/depts/other_files/:::
faktablad_nordsamiska_wordversion.doc.xml: encoding error with ŋ - má.ggain
a lot of typing errors
260965-h-2179s_2.pdf.xml:
203210-q-1066_samisk_lav.pdf.xml:
STM200220030010000SE_PDFS.pdf.xml:
HP_2009_samisk_sprak_nordsam.pdf.xml: typing and spelling errors
sami_rapport_sametinget_vedlegg4_SA.pdf.xml: strange abbreviations and foreign words
Klimamelding_St_meld_39_samisk.pdf.xml:
Reindriftsloven_NordSamisk.pdf.xml:
otp200420050021000se_pdts.pdf.xml:
132470-sa-no.pdf.xml: a lot of unrecognized compounds and derivations
STM200820090043000SE_PDFS.pdf.xml:
STM200920100022000SE_PDFS.pdf.xml:
sames001.pdf.xml:
SaNo_066.pdf.xml: typos, spelling errors and foreign words

:::regjering.no:::
130-000-ruvnnu-kvena-proeavttaide.html_id=573764.xml: Kven-language words
8.html_id=458424.xml: Norwegian-language words
1.html_id=514474.xml:
2.html_id=458416.xml:
14.html_id=514473.xml: some hyphenations, spelling errors and foreign words
3-2.html_id=452320.xml:
4.html_id=452321.xml: spelling errors
+++ a lot of very small files with few errors that make the error % high

:::fi_depts:::
yvlakiesite_saame.pdf.xml:
yvsuositsaame.pdf.xml: unrecognized proper nouns and abbreviations; some spelling errors

:::guovda:::
1_2.doc.xml:
Čoahkkinprotokolla_27.06.02.doc.xml: some encoding errors
máńggabealálašvuoda gŠibidit ILO-konvenèuvnna
spelling errors, typos
seem to be scanning errors: earniálbmogin såmi auget makkér
KS_áššebahpirat_25.03.04.doc.xml:
KS_assebahpirat_30.06.05.doc.xml:
Plánalávdegoddi_-_áššebahpirat_19.02.04.doc.xml: spelling errors and typos

:::admin/others:::
Reglement_Djupvik_havn.doc.xml:
gulaskuddanNjuolggadusat gulaskuddanNjuolggadusat +?
sajiide:Djupvik-Nordmannvik sajiide:Djupvik-Nordmannvik +?
Hj Hj +?
eftf eftf +?
njuolgadusat njuolgadusat +?
fiskarlag fiskarlag +?
Fiskerigruppa Fiskerigruppa +?
havn havn +?
båtforening båtforening +?
2004.Etáhtajođiheaddji 2004.Etáhtajođiheaddji +?
VÁLGADIKKI.doc.xml:
ALMMUHEAPMI ALMMUHEAPMI +?
SÁHTÁT SÁHTÁT +?
JIENASTIT JIENASTIT +?
ČIENALLUOVTTA ČIENALLUOVTTA +?
SKUVLA SKUVLA +?
DÁLOŠVÁKKI DÁLOŠVÁKKI +?
SERVODATVIESSU SERVODATVIESSU +?
Gáivuonorrida Gáivuonorrida +?
BIERTAVÁRI BIERTAVÁRI +?
SERVODATVIESSU SERVODATVIESSU +?
Áibanašmohkis Áibanašmohkis +?
Hoalbma Hoalbma +?
OLMMÁIVÁKKI OLMMÁIVÁKKI +?
SKUVLA SKUVLA +?
skuterløyer_2006.doc.xml: a lot of uppercase words
AVGIFTER_1.doc.xml: a few words unrecognized by the fst, repeated
Fylkersrapport06sam0000.pdf.xml: unrecognized proper nouns, web addresses and compounds