tromso-2006-08-polderland
Meeting between Polderland and Divvun (and UiTø project)
Participants
Polderland:
- Frank Nusselder (dev.)
- Inge de Mönnink (sales manager; did not participate in the meeting)
- Peter Beinema (project manager)
Divvun/Saami Parliament:
- Sjur Moshagen (project manager, speller & technical issues)
- Børre Gaup (everything-man: testing, corpus)
- Maaren Palismaa (North Saami linguist, working on Mondays and Tuesdays)
- Thomas Omma (North and Lule Saami linguist)
- Tomi Pieski (softw. eng.)
University/disambiguation project:
- Trond Trosterud (project manager, computational linguist)
- Saara Huhmarniemi (softw. eng.)
Communication channels
AIM
News
- server: news.uit.no
- group: uit.samiskspraak.giellateknologiija
The news server requires a username and password, and only allows connections
Bugzilla
Documentation
Agenda:
- presentation
- proj. schedule
- linguistic questions:
- samples of full paradigm or derivates from one stem
- technical questions:
- conversion from Divvun format to PLX(?)
- will PLX be the target format?
- finite state or "traditional" Polderland solution?
- formalities:
- publicity of shared documents
Schedule
Planned drop dates:
- alpha: 2006-11-01
- beta: 2007-04-01
- final: 2007-09-01 latest
Project issues
Coordination meetings every Tuesday morning at 9:30, unless agreed otherwise.
- Project phases
- Project progress measurement
- Testing (including acceptance testing)
- Risk management
- Lexical sample material
- Cooperative development (e.g., form of lexical material)
- Staff availability
Spelling checker:
- Can the internal spelling lexicon be based on the Polderland PLX format, or is an extension with e.g. automata necessary?
  ==> requires analysis of sample material:
  - level of agglutination
  - sound / letter changes in agglutination
  - agglutination vs. compounding
Hyphenator:
- Are lists of hyphenated words available?
- There are rule sets in the XFST formalism that will insert hyphenation points in the input string.
Mac applications:
- PowerPC vs. Intel? Both.
- XCode vs. CodeWarrior?
Linguistic issues
Basic grammar information can be found at:
Tags for derivation
+adda
+h
+meahttun
+Dimin
Output from the disambiguator
"<gulle>"
	"gullat" V TV Ind Prt Pl3 @+FMAINV
"<álggu>"
	"álgu" N Sg Gen @GP>
"<rájes>"
	"rájes" Po @ADVL
"<girkoeiseválddiide>"
	"girkoeise#váldi" N Pl Ill @ADVL
"<,>"
	"," CLB
"<bismmaide>"
	"bisma" N Pl Ill @ADVL
"<,>"