Meeting_2005-07-18

Meeting setup

  • Date: 18.07.2005
  • Time: 10.00 Norw. time
  • Place: Wherever we are :-)
  • Tools: Skype, iChat, SubEthaEdit

Agenda

  1. Opening, agenda review
  2. Reviewing the task list from a week ago
  3. Documentation - divvun.no
  4. Corpus gathering
  5. Corpus infrastructure
  6. Speller infrastructure
  7. risten.no
  8. Other issues
    1. CVS Mailing
    2. Helsinki gathering
  9. Summary, task lists
  10. Closing

1. Opening, agenda review, participants

Opened at 12.10 after some technical hurdles (tried to run the whole meeting online, that is, without a phone). Agenda accepted as is.

Present: Sjur, Tomi, Børre

Absent: Maaren, Thomas, Trond

Main secretary: Børre

2. Reviewing the task list from the last meeting

Sjur

  • Ask Anne-Britt to update the contract with the University (we are getting 3, not 2, persons there from the fall)
    • Done.
  • risten.no bugs and fixes
    • Fixed some more bugs.
  • complete the action summary after our half-year evaluation
    • STILL not done :-/
  • contact Kimmo Koskenniemi about kwic-snt and UTF-8 bug
    • not done - suggestion: postponed to the Helsinki meeting in Aug/Sep
  • follow up on:
    • voice group-chat not working to Sámediggi
    • Maaren has problems with SubEthaEdit (can't connect)
      • nothing done during last week - still waiting for a reply to my e-mail
  • add i18n bug to Forrest issue tracker
    • not done, was waiting for Børre
  • add i18n menu as feature request to Forrest issue tracker
    • still open
  • To the board:
    • write proposal for permanent maintenance organisation
    • write draft specification for the outsourced tasks
    • write agenda, question: should the memos be public?
    • Deadline for the board tasks: 3 weeks ahead of meeting (date not decided, but in September)
      • document stubs and menu entries created within the document tree
  • Follow up on CVS mailing with Steinar T-H (IT/UiT)
    • Several e-mails back and forth
  • add Divvun to the UNESCO Hall-of-Fame list
    • not done
  • Get back to Xerox on the commercial license.
    • We will hopefully get the tools this week.
  • Check with Maaren whether she has checked in her additions - if not, she should (or send them via e-mail to somebody that can do it for her now)
    • Checked, she had not, but was going to check them in on Wednesday

Tomi

  • Common Makefile for all the languages
    • still open
  • catxml default language issue
    • Done.
  • three-part compounding
    • Waiting for Trond, to have a look at it together
  • decapitalisation of proper nouns when (if?) compounded, and when derived (the oslolaš issue)
    • Waiting for Trond
  • corpus infrastructure: dtd location (both public and internal)
    • Not Done
  • corpus infrastructure: file and dir organisation
    • Not done (that is, found no articles on how others have done it)
  • write the LIST option to our test tools, as requested in the discussion group
    • Done: see the file wordlist.pl in sme/testing

Trond

  • Work on the bug list (Lule Sámi).
    • Continued
  • Work on compounds (three-part + oslolaš, with Tomi)
    • nothing done
  • Work on the corpus interface (with Lars)
    • nothing done
  • Corpus infrastructure: dtd location
    • nothing done
  • Get the new version of the New Testament
    • nothing done?
  • Check Hans-Ragnar's names.
    • nothing done?
  • add Giellatekno to the UNESCO Hall-of-Fame list
    • We don't know :-)

On vacation last week

Børre

  • On vacation, issues from last week repeated at the end of the memo

Maaren

  • On vacation, issues from last week repeated at the end of the memo

Thomas

  • On vacation, issues from last week repeated at the end of the memo

3. Documentation - divvun.no

crontab

Tomi has made a CVS update script, but it is not automated yet: the crontab entry is missing. Børre will take over this task.
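A minimal sketch of how the missing crontab entry could be added. The script path, the log path and the hourly schedule are assumptions for illustration (the memo specifies none of them), and the function names are hypothetical.

```shell
#!/bin/sh
# Sketch only: script path, log path and schedule are hypothetical.

# Print a crontab line that runs the given script at the top of every
# hour and appends its output (stdout and stderr) to the given log file.
cron_line() {
    printf '0 * * * * %s >> %s 2>&1\n' "$1" "$2"
}

# Append that line to the current user's crontab unless it is already there.
install_cron_line() {
    line=$(cron_line "$1" "$2")
    current=$(crontab -l 2>/dev/null || true)
    if printf '%s\n' "$current" | grep -qxF -- "$line"; then
        return 0
    fi
    {
        [ -n "$current" ] && printf '%s\n' "$current"
        printf '%s\n' "$line"
    } | crontab -
}

# Usage (hypothetical paths):
#   install_cron_line /usr/local/bin/cvs-update.sh /var/log/cvs-update.log
```

Keeping the line in a function makes it easy to inspect what would be installed before touching the real crontab.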

wiki + UTF-8

There's no wiki support on the official site, so no one can read our weekly meeting memos.

This works on Linux:

export LC_ALL=no_NO.UTF-8
forrest run

It is unclear whether we can do the same on Mac OS X.

It should work irrespective of the locale of the host OS, and the above is really a hack to work around a bug in Forrest/Cocoon or the JSPWiki reader (Chaperon).
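One way to test the workaround on another host, Mac OS X included, is to check whether the locale is installed and then set it for the Forrest process only. This is a sketch, not a verified fix: the function names are hypothetical, and it assumes `locale -a` lists the installed locales (the spelling varies between systems, for example `no_NO.utf8` versus `no_NO.UTF-8`, so both sides are normalised).

```shell
#!/bin/sh
# Sketch: apply the locale workaround to the Forrest process only.

# Succeed if the locale named in $1 appears in the list read from stdin
# (one name per line, as printed by `locale -a`). Case and the hyphen in
# "UTF-8" are ignored so that "no_NO.utf8" matches "no_NO.UTF-8".
has_locale() {
    want=$(printf '%s\n' "$1" | tr 'A-Z' 'a-z' | sed 's/-//g')
    tr 'A-Z' 'a-z' | sed 's/-//g' | grep -qxF -- "$want"
}

# Start Forrest under the workaround locale if the host provides it.
run_forrest() {
    if locale -a 2>/dev/null | has_locale "no_NO.UTF-8"; then
        LC_ALL=no_NO.UTF-8 forrest run
    else
        echo "locale no_NO.UTF-8 is not installed on this host" >&2
        return 1
    fi
}

# Usage:
#   run_forrest
```

Setting `LC_ALL` as a prefix to the command leaves the rest of the shell session untouched, unlike `export`.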

status.xml & RSS feed

Sjur has added support for status.xml, which implies RSS news feeds of the changes we document.

We will for the time being not use the todo section, as our weekly memos and Bugzilla provide enough feedback and support for our tasks and for what needs to be done. Eventually it could be used for more long-term goals that do not fit within Bugzilla and the meeting memos.

Regarding the changes section, have a look, and start using it. The RSS newsfeed is an excellent tool for keeping interested persons and parties up-to-date with little or no effort on their part. Documentation can be found in the Status.xml how-to.

Forrest

Forrest 0.7 is officially released. The HEAD of the trunk seems to be unstable with respect to creating war files. Be careful!

4. Corpus gathering

Add the Oslo license model to xdocs/adm/legal/. Then we will start to compare, and work out our own license text.

5. Corpus infrastructure

Directory structure

Not optimal. Tomi will continue looking for good examples of internal dir structure.

catxml

Updated to use the language of the pwd as default when extracting corpus texts.

6. Speller infrastructure

Common Makefile

We should finalize a common and language-independent Makefile as a prerequisite for the speller infrastructure, such that what we do for building spellers for North Sámi will automatically be available to other languages.

Saara Huhmarniemi is a makefile expert, but she isn't back at work yet. We may postpone the makefile makeover until she is back.

Aspell

We will soon have the unrestricted Xerox tools, and we'll then be able to generate the word list for Aspell. What we need then is the infrastructure to go from a word list to an Aspell dictionary. This should be high priority ahead of the Helsinki demo.

First version: simple word list based speller, word list generated by the Xerox tools. Makefile target should be something like "make aspell", with the final, ready to use files in a build directory.
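The commands behind such a "make aspell" target could look roughly like the sketch below. It assumes a word list with one word per line and relies on Aspell's `create master` command; the function names, file names and language code are hypothetical, and Aspell additionally needs a language data file (`<lang>.dat`) for the chosen code, which is assumed to exist already.

```shell
#!/bin/sh
# Sketch of the steps a "make aspell" target could run.

# Normalise a raw word list read from stdin:
# drop blank lines, remove duplicates, sort bytewise.
clean_wordlist() {
    grep -v '^[[:space:]]*$' | LC_ALL=C sort -u
}

# Compile a word list into an Aspell master dictionary.
#   $1  raw word list, one word per line
#   $2  build directory, relative to the current directory
#   $3  Aspell language code (its <lang>.dat file is assumed to be installed)
build_aspell_dict() {
    mkdir -p "$2" || return 1
    clean_wordlist < "$1" > "$2/words.txt" || return 1
    aspell --lang="$3" create master "./$2/$3.rws" < "$2/words.txt"
}

# Usage (hypothetical file names and language code):
#   build_aspell_dict wordlist.txt build se
```

The cleaning step is kept separate so it can be checked without Aspell installed.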

7. risten.no

Sjur will continue to spend some time on the following issues:

  • more bug fixing
  • a couple of critical features still need to be added

8. Other issues

CVS Mailing

CVS mailing works to the local accounts on cochise, but forwarding isn't working for all of us. Tomi just received a mail daemon message saying "no route to host", so e-mail to trond.trosterud@helsinki.fi, sjur.moshagen@kolumbus.fi and sjurnm@mac.com was not delivered.

Sjur will follow up on this one.

Helsinki gathering

Dates

Thomas did not want to go earlier. We extend the meeting with a technical day on Monday, and all of us meet on Tuesday. Thomas can then travel on Monday, as originally planned.

Dates now also confirmed with Børre. Thus decided:

  • technical meeting day: Monday Aug. 29
  • general meeting day: Tuesday Aug. 30
  • all travel back on Saturday Sep. 3

Demo

We will get the Xerox tools. See above under Speller Infrastructure for more details.

9. Summary, task list

Børre

  • Add crontab specification for the cvs update/export script Tomi made
  • Correct the war file/jspwiki issue
  • reopen the jspwiki + UTF-8 issue
  • Continue http://giellatekno.uit.no conversion to Forrest ...
  • Contact Svenska bibelsällskapet
  • Add issue to Forrest issue tracker about UTF-8 ihtml documents.
    • Check for existing bug reports, it might already be in there
  • Continue forrest i18n research
    • Forrest sometimes honours locale=se/smj/fi in the URL (the document gets correctly selected). The menus and tabs are translated to the language that the browser asks for.
  • add RSS newsfeed support (done by Sjur, but requires new war file)
  • discuss with Anders Kintel whether we could get the Sámi-Norwegian dictionary ahead of acceptance
  • add the Oslo contract to cvs, and begin writing a contract draft

Sjur

  • risten.no bugs and fixes
  • complete the action summary after our half-year evaluation
  • contact Kimmo Koskenniemi about kwic-snt and UTF-8 bug
    • postponed till the Helsinki meeting
  • follow up on:
    • voice group-chat not working to Sámediggi
    • Maaren has problems with SubEthaEdit (can't connect)
  • add i18n bug to Forrest issue tracker
  • add i18n menu as feature request to Forrest issue tracker
  • To the board:
    • write proposal for permanent maintenance organisation
    • write draft specification for the outsourced tasks
    • write half-yearly project report with progress and budget status
    • write agenda, question: should the memos be public?
    • Deadline for the board tasks: 3 weeks ahead of meeting (date not decided, but in September)
  • Follow up on CVS mailing with IT/UiT
  • add Divvun to the UNESCO Hall-of-Fame list
  • Get the tools from Xerox

Tomi

  • Common Makefile for all the languages
    • And for speller infrastructure
  • three-part compounding
  • decapitalisation of proper nouns when (if?) compounded, and when derived (the oslolaš issue)
  • corpus infrastructure: dtd location (both public and internal)
  • corpus infrastructure: file and dir organisation
  • Aspell: A simple wordlist based speller

On vacation

Tasks are transferred to the following weeks until they return from vacation, to keep the tasks alive and updated. New tasks may be added :-)

Maaren

  • The missing list, both the overall missing list from our xml corpus, and a file-for-file review, in order to get different terminology
  • Go through the missing list from risten.no
  • send new letter to SGL about Lule Sámi possessives

Thomas

  • write new letter to SGL about Lule Sámi possessives
  • work with Lule Sámi substantives

Trond

  • Work on the bug list (Lule Sámi).
  • Work on compounds (three-part + oslolaš, with Tomi)
  • Work on the corpus interface (with Lars)
  • Corpus infrastructure: dtd location
  • Get the new version of the New Testament
  • Check Hans-Ragnar's names.
  • add Divvun to the UNESCO Hall-of-Fame list

10. Next meeting, closing

25.07.2005 10:00

Closed at 13:45