hdn
Free and Open source Northern Haida analyser giella-hdn
- Authors
 - Divvun and Giellatekno teams, U Alberta, community members 
 - Software version
 - 2012 
 - Documentation license
 - GNU GFDL 
 - SVN Revision
 - $Revision
: 68217 $ - SVN Date
 - $Date
: 2013-01-16 11: 31: 33 +0200 (Wed, 16 Jan 2013) $ 
giella-hdn
This is free and open source Northern Haida morphology.
Northern Haida morphological analyser
This file shows the Northern Haida multichar symbols and initial lexica.
Definitions for Multichar_Symbols
Analysis symbols
(It is highly suggested to follow existing standards when adding new tags).
The parts-of-speech could perhaps also be (remove irrelevant):
These are vowel morphophonemes which will lose their accent when they are no long in closed syllables
- á2  =  
 - é2  = 
 - í2  = 
 - ú2  = 
 - s2 =
 
Haida has these tags for real, says Jordan. 
- +V          = Verbs 
 - +N          = Nouns 
 - +Prop       = Proper nouns 
 - +Neg        = Negative 
 - +3pl        = 3rd personal definite plural participant 
 - +Interr     = Interrogative 
 - +Fut        = Future 
 - +Past       = Past 
 - +Pres       = Present 
 - +Hab        = Habitual 
 - +Bias       = Biased 
 - +Cert       = Certain 
 - +Simp       = Simple 
 - +Rel        = Relative 
 - +NonFact    = Non_Factive 
 - +Imm        = Immediate 
 - +VNear      = Very near 
 - +Rflx       = Reflexive 
 - +Near       = 
 - +Indir      = Indirect evidentiality 
 - +Dir        = Direct evidentiality 
 - +Cont       = Should keep on verbing 
 - +Short      = 
 - +Res        = Resigned obligation 
 - +Long       = 
 - +Ints       = Intensive 
 - +Impv       = Imperative 
 - +Evid       = Evidential 
 - +Ctfact     = Counterfactual 
 - +Sg         = Singular 
 - +Pl         = Plural 
 - +Def        = 
 - +Indef      = 
 - +Rfx        = Used for reflexively possessed body parts and kinterms 
 - +Abs        = Non-reflexive forms of body parts and kinterms 
 - +Pron       = 
 - +Ptcl       = 
 - +Cond-Aux1  = Used before non-ablauting secondary verbs 
 - +Cond-Aux2 = Used before ablauting secondary verbs
 
Quasi-inflectional Tags
- +Ext         = Extensional suffix -dáal 
 - +Circum      = Circumambulative singular suffix -gwáang 
 - +Stand-Sg    = Singular standing quasi-infl suffix -gyaʼáng 
 - +Stand-Pl    = Plural standing quasi-infl suffix -gyaʼáang 
 - +Order       = Quasi-infl suffix -hahl "tell to V"
 - +Incep       = Quasi-infl suffix -hid "start to V"
 - +Almost      = Quasi-infl suffix -sgä "almost V"
 - +Nocturn     = Quasi-infl suffix -ʼuhla "V all night"
 - +Sit-Sg      = Quasi-infl suffix -ʼwä "V sitting (sg)"
 - +Sit-Pl      = Quasi-infl suffix -ʼwaʼáang "V sitting (pl)"
 - +Distrib = Quasi-infl suffix -agang "each"
 
Valency Tags
- +Val/0 = Environmental verbs (no subject) (0)
 - +Val/I = Impersonal Descriptive verbs  (Si)
 - +Val/A = Active Descriptive verbs  (Sa)
 - +Val/P = Passive Descriptive verbs  (Sp)
 - +Val/AO = Active Dynamic verbs  (Sa) (O)
 - +Val/AOR = Active Dynamic Reflexive verbs  (Sa) (Or)
 - +Val/PO = Passive Dynamic verbs  (Sp) (O)
 - +Val/IO = Impersonal Dynamic verbs  (Si) (O)
 - +Val/AC = Active Causative verbs  (Sa) (C)
 - +Val/ACR = Active Causative Reflexive verbs  (Sa) (Cr)
 - +Val/PC = Passive Causative verbs  (Sp) (C)
 - +Val/IC = Impersonal Causative verbs  (Si) (C)
 - +Val/ACO = Active Transcausative verbs  (Sa) (C) (O)
 - +Val/0X = Extended Environmental verbs (no subject) (0) (X)
 - +Val/IX = Extended Impersonal Descriptive verbs  (Si) (X)
 - +Val/AX = Extended Active Descriptive verbs  (Sa) (X)
 - +Val/PX = Extended Passive Descriptive verbs  (Sp) (X)
 - +Val/AOX = Extended Active Dynamic verbs  (Sa) (O) (X)
 - +Val/AORX = Extended Active Dynamic Reflexive verbs  (Sa) (Or) (X)
 - +Val/POX = Extended Passive Dynamic verbs  (Sp) (O) (X)
 - +Val/IOX = Extended Impersonal Dynamic verbs  (Si) (O) (X)
 - +Val/ACX = Extended Active Causative verbs  (Sa) (C) (X)
 - +Val/ACRX = Extended Active Causative Reflexive verbs  (Sa) (Cr) (X)
 - +Val/PCX = Extended Passive Causative verbs  (Sp) (C) (X)
 - +Val/ICX = Extended Impersonal Causative verbs  (Si) (C) (X)
 - +Val/ACOX = Extended Active Transcausative verbs (Sa) (C) (O) (X)
 
The Human Classifiers
- +CL/dla = 
 - +CL/hlga = 
 - +CL/k’u =
 
The Shape Classifiers
- +CL/cha = 
 - +CL/gáng = 
 - +CL/gi = 
 - +CL/gu = 
 - +CL/g̲a = 
 - +CL/hlga = 
 - +CL/hlgi = 
 - +CL/hlg̲a = 
 - +CL/hlk’u = 
 - +CL/hlk̲’a = 
 - +CL/hlk̲’uhl = 
 - +CL/ja = 
 - +CL/k̲ʼíi = 
 - +CL/sda = 
 - +CL/sga = 
 - +CL/sg̲a = 
 - +CL/skáa = 
 - +CL/sk’a = 
 - +CL/sk̲’a = 
 - +CL/stl’a = 
 - +CL/tíi = 
 - +CL/tl’a = 
 - +CL/ts’as = 
 - +CL/t’a = 
 - +CL/t’áw = 
 - +CL/xa =
 
The Descriptive Classifiers
- +CL/cháam  = 
 - +CL/chab  = 
 - +CL/dab  = 
 - +CL/dám  = 
 - +CL/dláam  = 
 - +CL/dlál  = 
 - +CL/gám  = 
 - +CL/gáw  = 
 - +CL/gyáam  = 
 - +CL/g̲áam  = 
 - +CL/ĝám  = 
 - +CL/hám  = 
 - +CL/hlgáam  = 
 - +CL/hlgám  = 
 - +CL/hlgi  = 
 - +CL/hlg̲áam  = 
 - +CL/hlg̲áy  = 
 - +CL/hlĝám  = 
 - +CL/hlku  = 
 - +CL/hlkuhl  = 
 - +CL/hlkʼu  = 
 - +CL/hlk̲ám  = 
 - +CL/hlk̲ʼáam  = 
 - +CL/hlk̲ʼuhl  = 
 - +CL/hlk̲ʼwáahl  = 
 - +CL/hltab  = 
 - +CL/hltám  = 
 - +CL/hltʼáam  = 
 - +CL/hltʼab  = 
 - +CL/hltʼahl  = 
 - +CL/id  = 
 - +CL/is  = 
 - +CL/ja  = 
 - +CL/jah  = 
 - +CL/jíi  = 
 - +CL/káa  = 
 - +CL/kál  = 
 - +CL/kám  = 
 - +CL/ki  = 
 - +CL/kún  = 
 - +CL/kʼu  = 
 - +CL/kʼúl  = 
 - +CL/k̲ám  = 
 - +CL/k̲áw  = 
 - +CL/k̲ʼa  = 
 - +CL/k̲ʼáam  = 
 - +CL/k̲ʼéem  = 
 - +CL/k̲ʼuhl  = 
 - +CL/k̲ʼún  = 
 - +CL/k̲ʼwáahl  = 
 - +CL/mál  = 
 - +CL/sdáam  = 
 - +CL/sdah  = 
 - +CL/sdúu  = 
 - +CL/sga  = 
 - +CL/sgab  = 
 - +CL/sgáam  = 
 - +CL/sgám  = 
 - +CL/sgíl  = 
 - +CL/sgún  = 
 - +CL/sg̲áam  = 
 - +CL/skám  = 
 - +CL/skʼáam  = 
 - +CL/skʼál  = 
 - +CL/sk̲ʼáam  = 
 - +CL/sk̲ʼihl  = 
 - +CL/smál  = 
 - +CL/stad  = 
 - +CL/stlúu  = 
 - +CL/stlʼáam  = 
 - +CL/sʼahl  = 
 - +CL/tlúu  = 
 - +CL/tlʼáam  = 
 - +CL/tlʼab  = 
 - +CL/tlʼad  = 
 - +CL/tlʼúu  = 
 - +CL/tsʼúu  = 
 - +CL/tʼáam  = 
 - +CL/tʼab  = 
 - +CL/tʼám  = 
 - +CL/xab  = 
 - +CL/xáw  = 
 - +CL/x̲a =
 
Restricted Descriptive Classifiers
- +CL/ga  = 
 - +CL/gáam  = 
 - +CL/gab  = 
 - +CL/gáng  = 
 - +CL/gu  = 
 - +CL/gúl  = 
 - +CL/hlkwáahl  = 
 - +CL/hlkʼib  = 
 - +CL/hlk̲áa  = 
 - +CL/hlk̲íl  = 
 - +CL/hltáam  = 
 - +CL/hltab  = 
 - +CL/hltʼah  = 
 - +CL/hltʼab  = 
 - +CL/jám  = 
 - +CL/jíi  = 
 - +CL/jíihl  = 
 - +CL/kám  = 
 - +CL/kún  = 
 - +CL/kʼwáa  = 
 - +CL/kʼwáahl  = 
 - +CL/k̲ab  = 
 - +CL/k̲ʼa  = 
 - +CL/k̲ʼah  = 
 - +CL/síi  = 
 - +CL/skáy  = 
 - +CL/sk̲ʼéehl  = 
 - +CL/stláam  = 
 - +CL/stlab  = 
 - +CL/stlʼúu  = 
 - +CL/sʼab  = 
 - +CL/sʼyúu  = 
 - +CL/tláa  = 
 - +CL/tlʼáahl  = 
 - +CL/tlʼán  = 
 - +CL/tlʼál  = 
 - +CL/tsʼám  = 
 - +CL/tʼab  = 
 - +CL/xáam  = 
 - +CL/xwáad  = 
 - +CL/x̲a  = 
 - +CL/x̲ab  = 
 - +CL/x̲aw  = 
 - +CL/x̲áam  = 
 - +CL/x̲ún  = 
 - +CL/x̲úl =
 
Rare Classifiers (SKIPPED, LEXICALIZE THESE)
Sound Classifiers (ALSO LEXICALIZE?)
Human Classifers (to be added)
The pre-verb classifiers
- CL/Shape+  =  
 - CL/Manner+  =  
 - CL/Human+  =  
 - CL/Human_Male+  =  
 - CL/Human_Female+  =  
 - CL/Descriptive+  =  
 - CL/Sound+  =  
 - CL/Color+ =
 
- +CL/Shape  =  
 - +CL/Manner  =  
 - +CL/Human  =  
 - +CL/Human_Male  =  
 - +CL/Human_Female  =  
 - +CL/Descriptive  =  
 - +CL/Sound  =  
 - +CL/Color =
 
Semantic Tags
Dialect Tags
Triggers
The parts-of-speech could perhaps also be (remove irrelevant):
- +A	     = 
 - +Adv     = 
 - +Pron    = 
 - +CS	     = 
 - +CC	     = 
 - +Adp     = 
 - +Po	     = 
 - +Pr	     = 
 - +Interj  = 
 - +Pcle    = 
 - +Num =
 
- +Def   = 
 - +Indef =
 
The parts of speech are further split up into:
- +Prop    = 
 - +Pers    = 
 - +Dem     = 
 - +Interr  = 
 - +Refl    = 
 - +Recipr  = 
 - +Rel     = 
 - +Indef =
 
The Usage extents are marked using the following tags: 
- +Err/Orth      = Substandard forms 
 - +Use/-Spell = Not included in speller
 
The nominals are inflected in the following Number
- +Sg    = 
 - +Pl    = 
 - +Indef  = 
 - +Def   = 
 - +Abs  = 
 - +Rfx =
 
The verbs can have the following morphological features:
- +1sg    = 
 - +2dl    = 
 - +2pl    = 
 - +2sg    = 
 - +3pl    = 
 - +3sg    = 
 - +Edl    = 
 - +Epl    = 
 - +Fut    = 
 - +FutImp    = 
 - +Hab    = 
 - +Idl    = 
 - +ImmPast    = 
 - +Inf    = 
 - +Ipl    = 
 - +Prs    = 
 - +PrsImp    = 
 - +RemPast    = 
 - +RepPast =
 
Verb prefixes
Muilti word expressions
tag for generating the MWE for abbr
The TAM flags
Verbs and prnouns 
- +1Sg     first singular 
 - +2Sg     etc 
 - +3Sg third singular
 
Verbs and pronouns 
- +1Sg     = first person singular 
 - +2Sg     = second person singular 
 - +3Sg = third person singular
 
- +1Pl     = first person plural 
 - +2Pl     = second person plural 
 - +3Pl = third person plural
 
- +ABBR  = Abbreviations 
 - +Symbol = independent symbols in the text stream, like £, €, © 
 - +ACR = Acronyms
 
Special symbols are classified with: 
- +CLB    = Clause boundary symbols 
 - +PUNCT  = Other punctuation marks 
 - +LEFT   = Left part of paired symbols 
 - +RIGHT = Right part of paired symbols
 
The verbs are syntactically split according to transitivity: 
- +TV   
 - +IV
 
Special multiword units are analysed with: 
- +Multi
 
Non-dictionary words can be recognised with: 
- +Guess
 
Composite UTF-8 characters, i.e. g, k, and x with
Flag diacritics
| @P.NeedNoun.ON@ | (Dis)allow compounds with verbs unless nominalised | 
| @D.NeedNoun.ON@ | (Dis)allow compounds with verbs unless nominalised | 
| @C.NeedNoun@ | (Dis)allow compounds with verbs unless nominalised | 
For languages that allow compounding, the following flag diacritics are needed 
| @P.CmpFrst.FALSE@ | Require that words tagged as such only appear first | 
| @D.CmpPref.TRUE@ | Block such words from entering ENDLEX | 
| @P.CmpPref.FALSE@ | Block these words from making further compounds | 
| @D.CmpLast.TRUE@ | Block such words from entering R | 
| @D.CmpNone.TRUE@ | Combines with the next tag to prohibit compounding | 
| @U.CmpNone.FALSE@ | Combines with the prev tag to prohibit compounding | 
| @P.CmpOnly.TRUE@ | Sets a flag to indicate that the word has passed R | 
| @D.CmpOnly.FALSE@ | Disallow words coming directly from root. | 
Use the following flag diacritics to control downcasing of derived proper 
| @U.Cap.Obl@ | Allowing downcasing of derived names: deatnulasj. | 
| @U.Cap.Opt@ | Allowing downcasing of derived names: deatnulasj. | 
The word forms in Northern Haida start from the lexeme roots of basic 
- VERB_ROOT ;
 
Northern Haida verb affixes
LEXICON CLASS-AA
LEXICON CLASS-AAL
LEXICON CLASS-AAL-INFL
LEXICON CLASS-AAN
LEXICON CLASS-AAN-INFL
LEXICON CLASS-AANG
LEXICON CLASS-AANG-INFL
LEXICON CLASS-AAW
LEXICON CLASS-AAW-INFL
LEXICON CLASS-AAY
LEXICON CLASS-AAY-INFL
LEXICON CLASS-AH
LEXICON CLASS-AH-INFL
LEXICON CLASS-AYD
LEXICON CLASS-AYD-INFL
LEXICON CLASS-EE
LEXICON CLASS-EE-INFL
LEXICON CLASS-EED
LEXICON CLASS-EED-INFL
LEXICON CLASS-I
LEXICON CLASS-I-INFL
LEXICON CLASS-IID
LEXICON CLASS-IID-INFL
LEXICON CLASS-U
LEXICON CLASS-U-INFL
LEXICON CLASS-AAHL
LEXICON CLASS-AAHL-STEM-2-INFL
LEXICON CLASS-AD
LEXICON CLASS-AD-STEM-1-INFL
LEXICON CLASS-AD-STEM-2-INFL
LEXICON CLASS-AL
LEXICON CLASS-AL-STEM-1-INFL
LEXICON CLASS-AL-STEM-2-INFL
LEXICON CLASS-AN
LEXICON CLASS-AN-STEM-1-INFL
LEXICON CLASS-AN-STEM-2-INFL
LEXICON CLASS-ANG
LEXICON CLASS-ANG-STEM-1-INFL
LEXICON CLASS-ANG-STEM-2-INFL
LEXICON CLASS-AW
LEXICON CLASS-AW-STEM-1-INFL
LEXICON CLASS-AW-STEM-2-INFL
LEXICON CLASS-AY
LEXICON CLASS-AY-STEM-1-INFL
LEXICON CLASS-AY-STEM-2-INFL
LEXICON CLASS-EEHL
LEXICON CLASS-EEHL-STEM-1-INFL
LEXICON CLASS-EEHL-STEM-2-INFL
LEXICON CLASS-ID
LEXICON CLASS-ID-STEM-1-INFL
LEXICON CLASS-ID-STEM-2-INFL
LEXICON CLASS-II
LEXICON CLASS-II-STEM-1-INFL
LEXICON CLASS-II-STEM-2-INFL
LEXICON CLASS-IN
LEXICON CLASS-IN-STEM-1-INFL
LEXICON CLASS-IN-STEM-2-INFL
LEXICON CLASS-ING
LEXICON CLASS-ING-STEM-1-INFL
LEXICON CLASS-ING-STEM-2-INFL
LEXICON CLASS-UD
LEXICON CLASS-UD-STEM-1-INFL
LEXICON CLASS-UD-STEM-2-INFL
LEXICON CLASS-UN
LEXICON CLASS-UN-STEM-1-INFL
LEXICON CLASS-UN-STEM-2-INFL
LEXICON CLASS-UNG
LEXICON CLASS-UNG-STEM-1-INFL
LEXICON CLASS-UNG-STEM-2-INFL
LEXICON CLASS-UU
LEXICON CLASS-UU-STEM-1-INFL
LEXICON CLASS-UU-STEM-2-INFL
LEXICON CLASS-A
LEXICON CLASS-A-STEM-1-INFL
LEXICON CLASS-A-STEM-2-INFL
LEXICON CLASS-A.A
LEXICON CLASS-A.A-STEM-1-INFL
LEXICON CLASS-A.A-STEM-2-INFL
LEXICON CLASS-AHL
LEXICON CLASS-AHL-INFL
LEXICON CLASS-AS
LEXICON CLASS-AS-STEM-1-INFL
LEXICON CLASS-AS-STEM-2-INFL
LEXICON CLASS-AS-STEM-3-INFL
LEXICON CLASS-E.E
LEXICON CLASS-E.E-STEM-1-INFL
LEXICON CLASS-E.E-STEM-2-INFL
LEXICON CLASS-E.EHL
LEXICON CLASS-E.EHL-STEM-1-INFL
LEXICON CLASS-E.EHL-STEM-2-INFL
LEXICON CLASS-IHL
LEXICON CLASS-IHL-STEM-1-INFL
LEXICON CLASS-IHL-STEM-2-INFL
LEXICON CLASS-IHL-STEM-3-INFL
LEXICON CLASS-IIHL
LEXICON CLASS-IIHL-STEM-1-INFL
LEXICON CLASS-IIHL-STEM-2-INFL
LEXICON CLASS-IIHL-STEM-3-INFL
LEXICON CLASS-IS
LEXICON CLASS-IS-STEM-1-INFL
LEXICON CLASS-IS-STEM-2-INFL
LEXICON CLASS-IS-STEM-3-INFL
LEXICON CLASS-UHL
LEXICON CLASS-UHL-STEM-1-INFL
LEXICON CLASS-UHL-STEM-2-INFL
LEXICON CLASS-UHL-STEM-3-INFL
LEXICON CLASS-US
LEXICON CLASS-US-STEM-1-INFL
LEXICON CLASS-US-STEM-2-INFL
LEXICON CLASS-US-STEM-3-INFL
LEXICON CLASS-UUHL
LEXICON CLASS-UUHL-STEM-1-INFL
LEXICON CLASS-UUHL-STEM-2-INFL
LEXICON CLASS-UUHL-STEM-3-INFL
The Northern Haida morphophonological/twolc rules file
Alphabet
- %>: 0 	 affix boundary 
 - %^WS: 0     white space dummy 
 - %^DEF: 0 white space dummy
 
Sets
- Vow = a e i o u y æ ø å   
 á é ó ú í à è ò ù ì ä ë ö ü ï â ê ô û î ã ; - Cns = b c d f g h j k l m n p q r s t v w x z ð þ ' ʼ ç ý ñ ;  
 - Sgm = Vow Cns Orth ;
 
Rules
ahl to ál, ahl to áal ahl changes to ál at the end of a stem verb when it is followed by an ending belonging to Set B, F, G or H
Destressing rule - this should be a general rule, but we have problems of getting the variables to accept 0: Vow
Northern Haida verb stems
LEXICON VERBS  

