smj-phon-phonology
Background
The file itself is located in langs/smj/src/phonology/smj-phon.twolc.
File structure
The Alphabet section
The real Lule Sámi Alphabet
All Lule Sámi letters are listed. The Lule Sámi eng sound is represented as ñ.
The 3rd degree mark º is never realized, hence declared as º: 0.
h2, g2 etc. are consonants deleted in the Nom. m3, d3 etc. (?) are consonants that undergo certain processes word-finally.
The Dummy symbols
- X1:0
- Deletes final consonants in short essive of odd syllables
- X2:0
- WeG and neutralization of g8, etc. (hivsik-hivsiga)
- X3:0
- WeG and deletion of g8, etc. (bena-bednaga)
- X4:0
- e: á and e: å in illatives and px. a: á and o: u in Px and ill of a-stem actors and o-stems
- X5:0
- e: á, e: å and o: u in odd-syllable nouns, but also for some even nouns (e.g. o: u)
- X6:0
- Deviant III-I consonant gradation (in contracted stems, guobbmu: guomoj)
- X7:0
- WeG and e: á, e: å, o: á, o: u in front of diminutives, e: å in -lasj der
- X8:0
- Stem vowel alternations in Px
- X9:0
- Stem-vowel and central consonant shortening in first part(s) of compounds
- Q1:0
- The general weak grade trigger. Stem vowel change e: i and o: u in front of j.
- Q2:0
- Vowel harmony: 2nd syll e realized as å whenever 1st syll is å.
- Q3:0
- WeG in contracted, also does not trigger Dipht simpl.
- Q4:0
- Stem vowel change e: i and o: u in front of j. Dipht. simpl. Like Q1 but strong grade.
- Q5:0
- e: á stem vowel change for word diehtet. Weak grade.
- Q6:0
- e: á stem vowel change for word diehtet. Strong grade.
- Q7:0
- e: á stem vowel change for word diehtte. Extra strong grade
- Q8:0
- Stem vowel deletion, impII of verbs.
- Q9:0
- TBW
- Y1:0
- Stem vowel deletion, imp 3sg, 3du, 2pl, 3pl of verbs
- Y2:0
- "Indicative Present Singular 3rd Final Vowel in verbs"
- Y3:0
- PrsPrc
- Y4:0
- e > u in front of dersuff, o > u and e > á in front of dersuffix -alla
- Y5:0
- e > a, i > á, o > u, e > å in verb derivation
- Y6:0
- "Consonant insertion as II-III strengthening gradation", verbs +PrsPrt and +Imprt+Du2
- Y7:0
- "Consonant insertion as II-III strengthening gradation", nouns and propernouns
- Y8:0
- "Stem vowel deletion in even-syllable verbs, imp 1du, 1pl"
- Y9:0
- "Stem vowel deletion in short passives of even-syllable verbs"
- Z1:0
- TBW "i: á in Verb Derivation guollir>guollár"
Morpheme boundaries:
- «
- Derivational prefix
- »
- Derivational suffix
- %<
- Inflectional prefix
- %>
- Inflectional suffix
- #
- Word boundary for both lexicalised and dynamic compounds
- %^
- (exceptional) soft hyphenation point
- %
- a space
- ∑
- mark before # to indicate dynamic compounds
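The example pairs later in this document write a lexical form and a surface form with the same number of symbols, where 0 marks a symbol that is not realized. As a reading aid, the following Python sketch splits such forms into symbols and removes the zeros. It is only an illustration: the tokenization rule (one letter followed by a digit 1-9 counts as a single symbol such as Q1 or g8) is an assumption of this sketch, not the way the twolc compiler reads the Alphabet section, and the function names are hypothetical.

```python
import re

# Assumption of this sketch: a multi-character symbol is one ASCII letter
# followed by a digit 1-9 (Q1, X5, g8, d9, ...). Anything else, including
# the placeholder 0, is a single-character symbol.
SYMBOL = re.compile(r"[A-Za-z][1-9]|.")


def tokenize(form):
    """Split a lexical or surface form into two-level symbols."""
    return SYMBOL.findall(form)


def spell_out(surface):
    """Drop the 0 placeholders to get the written form."""
    return "".join(sym for sym in tokenize(surface) if sym != "0")


# A pair taken from the vowel lengthening examples in this document.
lexical, surface = "gussaQ1#", "gu0sá0#"
print(tokenize(lexical))   # seven symbols, Q1 is one of them
print(tokenize(surface))   # seven symbols as well
print(spell_out(surface))  # the word boundary # is kept
```

Both forms tokenize to seven symbols, which is what allows each lexical symbol to be paired with exactly one surface symbol.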
The Sets section
These are the sets:
- Vow
- the vowels
- Cns
- the consonants
- StemCns
- consonants that may occur in stem-final position
- DelCns
- the consonants that are deleted in nominative
- Dummy
- the set of dummy symbols, which are there to trigger certain morphophonological processes
- WeG
- the dummy symbols that trigger weak grade
The Definitions section
In this section, the consonants are defined. This includes consonant clusters in the various grades and consonant alternations.
G3 vs G2
S7 | kkn: k0n | series 1 |
S8 | fºf: f0f | series 2 |
S9 | jgg: j0g | series 3 |
S4 | hkk: h0k | series 4 |
S5 | xy: zy (no zeros) | series 5 |
S6 | xx: yy (no zeros) | series 6 |
S7 | xy: zy (no zeros) | series 7 |
S8 | ----- (no cg) | series 8 |
- LowerG2
- A definition of Grade2 consonant sequences referring mostly to the surface level
- LowerG1
- A definition of Grade 1 consonant sequences
- LowerG12
- A definition of Grade 1 or 2 consonant sequences
- G32
- A definition of Grade 3 or 2 consonant sequences
- G3
- A definition of Grade 3 consonant sequences
The Rules section
Overview
The rules section has the following chapters: consonant alternations in certain positions, vowel lengthening, diphthong simplification, stem vowel alternations, consonant gradation rules
Consonant alternations in certain positions
All rules deal with word-final position.
-
*a (is not standard language)
- *b (is not standard language)
Word Final Devoicing of Certain Single Consonants d9 etc.
-
iemed9#
- iemet#
Word final weakening -tj and -ttj to -sj part 1
Word final weakening -tj and -ttj to -sj part 2
-
jågåtj
- jågåsj
-
gålºleX7tj
- gål0lå0sj
Word Final Deletion of n8 m8 g8 h8
-
loavddag8X3#
- l0åv0da00#
Word Final Neutralization of g8, h8, m8
Deleting Final h9 in Short Essive of Uneven Syllables
Deleting Final l9 in Short Essive of Uneven Syllables
Deleting Final m9 in Short Essive of Uneven Syllables
Deleting Final n9 in Short Essive of Uneven Syllables
Deleting Final r9 in Short Essive of Uneven Syllables
-
málest#
- máles0#
Vowel lengthening
The second syllable vowel a is lengthened to á whenever the stem consonants are in grade 1 and the first syllable vowel is short. A single intervocalic consonant cannot have a short vowel both before and after it.
Compulsatory lengthening in grade I even-syllables
-
gussaQ1#
-
gu0sá0#
-
skihpaQ1s#
- ski0bá0s#
Diphthong simplification
The diphthong simplification handles oa: å and æ: e. Phonologically, these are identical processes, but since the diphthong is written with two letters in the former case and with one letter in the latter, the alternations must be handled separately. This section also handles ie: æ, which is in principle the same as oa: å, but the alternation does not occur in as many contexts.
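To make the difference between the two-letter and the one-letter case concrete, the sketch below lists the symbol pairs in which the lexical and surface side differ, using two example pairs from this section. It is a minimal illustration in Python; the tokenization rule (a letter followed by a digit 1-9 is one symbol) and the function name `changes` are assumptions of the sketch.

```python
import re

# Assumption: letter + digit 1-9 is one symbol (X4, Q1, ...), anything
# else is a single character.
SYMBOL = re.compile(r"[A-Za-z][1-9]|.")


def changes(lexical, surface):
    """Return the lexical:surface pairs whose two sides differ."""
    lex = SYMBOL.findall(lexical)
    srf = SYMBOL.findall(surface)
    if len(lex) != len(srf):
        raise ValueError("lexical and surface forms differ in length")
    return [f"{l}:{s}" for l, s in zip(lex, srf) if l != s]


# oa is two letters, so the simplification shows up as two pairs.
print(changes("boalloX4j", "b0ållu0j"))
# æ is one letter, so the simplification is a single pair.
print(changes("pasiænnta>Q1", "pasien0ta>0"))
```

In the first pair the diphthong appears as o:0 followed by a:å, in the second as the single pair æ:e, which is why the two alternations need separate rules.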
oa:å Diphtong Simplification Part I
oa:å Diphtong Simplification Part II
-
toahkkeY6X5jn
- toahkki00jn
-
*toahkkeY6X5jn (is not standard language)
- *t0åhkki00jn (is not standard language)
-
boalloX4j
- b0ållu0j
-
roavggoX4j
-
roavggu0j
-
*roavggoX4j (is not standard language)
- *r0åvggu0j (is not standard language)
-
toasºsoQ1X5jn
- t0ås0su00jn
-
*toasºsoQ1X5jn (is not standard language)
- *toas0su00jn (is not standard language)
-
*moasºsoX5jn (is not standard language)
- *m0ås0su0jn (is not standard language)
-
moasºsoX5jn
- moas0su0jn
-
goarºroY6X5jn
- goar0ru00jn
-
goarroY6X5jn
- goarru00jn
-
*goarºroY6X5jn (is not standard language)
- *g0år0ru00jn (is not standard language)
-
*goarºroY2 (is not standard language)
- *g0år0ru0 (is not standard language)
-
goarroY2
- g0årru0
-
doad0jeY6
- doaddje0
-
*doad0jeY6 (is not standard language)
- *d0åddje0 (is not standard language)
-
goarºroY5d9it
- g0år0ru0dit
-
*goarºroY5d9it (is not standard language)
- *goar0ru0dit (is not standard language)
-
toab0moY6X4j
- toabbmu00j
-
toabmoX4j
- t0åbmu0j
-
*toa0mboY6X4j (is not standard language)
- *t0åbbmu00j (is not standard language)
-
toabmoX7dallat
-
t0å0mu0dallat
-
*toabmoX7dallat (is not standard language)
- *toa0mu0dallat (is not standard language)
-
oaddoY6X4j
- oaddu00j
-
boassjkoQ1X5jn
- b0å0sjku00jn
-
*boassjkoQ1X5jn (is not standard language)
- *boas0jku00jn (is not standard language)
-
boajsstoQ1X5jn
- b0åj0stu00jn
-
*boajsstoQ1X5jn (is not standard language)
- *boaj0stu00jn (is not standard language)
-
boaggoQ1X5jn
- b0åkku00jn
-
*boaggoQ1X5jn (is not standard language)
- *boakku00jn (is not standard language)
æ:e Diphthong Simplification 1
æ:e Diphthong Simplification 2
-
hærránis
- hærránis
-
hærránis#gæhttjalibme>
- hærránis#gæhttjalibme>
-
pasiænnta>Q1
- pasien0ta>0
-
patænnta>Q1
- paten0ta>0
-
kvotiænnta>Q1
- kvotien0ta>0
-
kliænnta>Q1
- klien0ta>0
-
Lævnnja>Q1
- Lev0nja>0
ie:æ Diphthong Simplification Part I
-
ielvveY9ut
- 0ælvv00ut
-
iehttseY1up
- 0æhtts00up
-
giesseQ8us
- g0ess00us
ie:æ Diphthong Simplification Part IIa
Norwegian æ is an option...
ie:ä Diphthong Simplification Part IIb
Swedish ä is an option...
ie:ä Diphthong Simplification Part IIc
And there are no other options. This rule blocks the e and allows for the æ and ä.
-
jeht0sa>Y6
- jæhttse>0
-
jeht0sa>Y6
- jähttse>0
-
gierre»X7dalla>t
- g0æ0rá»0dalla>t
-
boarkkaQ1
-
b0år0ka0
-
loavddag8X3#
- l0åv0da00#
Vowel-change oa:å for verbs part I
Vowel-change oa:å for verbs part II
-
hå0llaY2
- hoallá0
-
gå0d0naY6
- goaddne0
-
*hållaY2 (is not standard language)
- *hållá0 (is not standard language)
-
gå0ht0saY6
- goahttse0
Stem vowel alternations
This section is divided according to stem vowels: a-, e-, i-, o-, å-stems.
a-stem alternations
For a-stems, there is a: e and a: i. Each alternation is triggered by a combination of phonological content and dummy symbols.
a:e in Present Participle of even-syllable verbs
-
bassa>Y6
- basse>0
a:i in Prs Prc of even-syllable verbs
-
basºsaY6jt#
- bas0si0jt#
a-stem vowel deletion
-
giedjeg9>a#
- giedjeg>a#
e-stem alternations
For e-stems, there is e: i, e: á, e: å, e: u and e: a. Each alternation is triggered by a combination of phonological content and dummy symbols.
e:i in e-stems
-
manasseQ4j
- manassi0j
-
biesseQ1j
- bie0si0j
-
boaht0eY6j
- boahtti0j
-
gálleQ1tj
- gá0li0sj
-
gálleQ1tjav
- gá0li0tjav
-
gálleQ1tjin
- gá0li0tjin
-
gálleQ1tjihpit
- gá0li0tjihpit
-
gálleQ1tjibá
- gá0li0tjibá
-
gálleQ1tjip
- gá0li0tjip
-
gálleQ1tja
- gá0li0tja
-
gierre>Q1tja
- gie0ri>0tja
-
gierre>Q1tj
- gie0ri>0sj
The following two rules constitute a <= / => rule pair.
e:á in certain stem types 1
-
bálggeX4v
- bálggá0v
-
gálleY3m#
- gállá0m#
-
gálleQ2v#
- gá0lá0v#
-
báhkoX7tj#
- bá0gu0sj#
-
goahteX7tj#
- goa0dá0sj#
-
*goahteX7tj# (is not standard language)
- *go00dá0sj# (is not standard language)
e:á in certain stem types 2
-
bárnneX4m
- bárnná0m
-
*bárnneX4m (is not standard language)
- *bárnne0m (is not standard language)
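The two e: á rules above are described as a <= / => pair. In two-level terms, the => half says that e: á may occur only in the stated context, and the <= half says that a lexical e in that context must surface as á. The Python sketch below checks both halves on aligned pairs. The context used here (the next pair is X4:0) is a toy context chosen for illustration only; the real contexts are defined in the rule file and are not reproduced in this document. All function names are hypothetical.

```python
import re

# Assumption: letter + digit 1-9 is one symbol, anything else one character.
SYMBOL = re.compile(r"[A-Za-z][1-9]|.")


def aligned(lexical, surface):
    """Pair lexical and surface symbols one to one."""
    lex, srf = SYMBOL.findall(lexical), SYMBOL.findall(surface)
    if len(lex) != len(srf):
        raise ValueError("lexical and surface forms differ in length")
    return list(zip(lex, srf))


def in_context(pairs, i):
    """Toy context (hypothetical): the following pair is X4:0."""
    return i + 1 < len(pairs) and pairs[i + 1] == ("X4", "0")


def only_in_context(pairs):
    """The '=>' half: every e:á pair must stand in the context."""
    return all(in_context(pairs, i)
               for i, pair in enumerate(pairs) if pair == ("e", "á"))


def always_in_context(pairs):
    """The '<=' half: a lexical e in the context must surface as á."""
    return all(srf == "á"
               for i, (lex, srf) in enumerate(pairs)
               if lex == "e" and in_context(pairs, i))


def accepts(lexical, surface):
    pairs = aligned(lexical, surface)
    return only_in_context(pairs) and always_in_context(pairs)


print(accepts("bárnneX4m", "bárnná0m"))  # the documented positive pair
print(accepts("bárnneX4m", "bárnne0m"))  # the documented negative pair
```

Under this toy context the positive pair passes both halves, while the starred pair fails the <= half because e is left unchanged in front of X4:0.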
e:å in certain stem types with å as root vowel
-
gådeQ2v
-
gådå0v
-
jåhteQ2v
- jå0då0v
-
gådeY2
-
gådå0
-
jåhteY2
- jåhtå0
-
jåhteY3m
- jåhtå0m
-
låhkkeY7tj#
- låhkkå0sj#
e-stem vowel deletion
-
ielvveY9ut
- 0ælvv00ut
i-stem alternations
For i-stems, there is i: á. The alternation is triggered by a combination of phonological content and dummy symbols.
i:á in Verb Derivation
o-stem alternations
The duplicates of the three lines of the two following rules are
o:u in certain stem types 1
o:u in certain stem types 2
u:o in contracted nouns
o-stem vowel deletion
å-stem alternations
For å-stems, there is å: e and å: i and vowel deletion. Each alternation is triggered by a combination of phonological content and dummy symbols.
å:e in Present Participle of even-syllable verbs
å:i in Actor nouns of even-syllable verbs
å-stem vowel deletion
Alternations valid for several stem types
Stem vowel deletion in even-syllable verbs, imp 3sg, 3du, 2pl, 3pl
-
ielvveY1up
- 0ælvv00up
-
giessaY1up
- giess00up
-
bårråY1up
- bårr00up
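The effect of the Y1 trigger in the pairs above can be sketched as follows. This is a simplified Python illustration, not the rule itself: it assumes that the symbol directly before the trigger is the stem vowel, it ignores every other rule, and the function name is hypothetical.

```python
import re

# Assumption: letter + digit 1-9 is one symbol, anything else one character.
SYMBOL = re.compile(r"[A-Za-z][1-9]|.")


def delete_stem_vowel(lexical, trigger="Y1"):
    """Realize the trigger and the symbol before it as 0."""
    symbols = SYMBOL.findall(lexical)
    out = list(symbols)
    for i, sym in enumerate(symbols):
        if sym == trigger:
            out[i] = "0"          # the dummy symbol is never realized
            if i > 0:
                out[i - 1] = "0"  # assumed to be the stem vowel
    return "".join(out)


print(delete_stem_vowel("giessaY1up"))
print(delete_stem_vowel("bårråY1up"))
print(delete_stem_vowel("ielvveY1up"))
```

For giessaY1up and bårråY1up the sketch reproduces the surface forms listed above. For ielvveY1up it yields ielvv00up rather than the documented 0ælvv00up, because the ie: æ diphthong simplification is a separate rule that applies to the same form.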
Consonant gradation rules
The consonant gradation rules differ considerably from the corresponding rules for North Sámi.
This actually opens up for a migration to an xfst rule file
The rules are divided in two subsections, deletion rules and
Deletion rules
The b, d, g deletion rules are similar, via the optional ( b ) etc. in front of the "_" symbol, both
Note that one of the rules for t: 0 refers to #: as part of its context. As soon as clitics are
Consonant gradation b:0
Consonant gradation d:0
-
bednag8>X3
- be0na0>0
Consonant gradation g:0
Consonant gradation k:0
Consonant gradation l:0
Consonant gradation m:0
Consonant gradation n:0
Consonant gradation p:0
Consonant gradation s:0
-
russjpeQ1
- ru0sjpe0
-
*russjpeQ1 (is not standard language)
- *russjpe0 (is not standard language)
Consonant gradation ŋ:0
Consonant gradation f:0
Consonant gradation r:0
Consonant gradation v:0
Consonant gradation j:0
Consonant gradation t:0
-
oajváladtj#
- oajvála0sj#
Gradation Series 4, II-I, tj and ts
Change rules
The Cx: Cy format was kept for hk: g, hp: b, ht: d, since the left context h: 0 was unique,
The bb: pp, gg: kk, dd: tt alternations were split into three rules,
Gradation Series 4, II-I
bb:pp
-
oabbáQ1
- oappá0
gg:kk
-
vággeQ1
- vákke0
-
*vággeQ1 (is not standard language)
- *vágge0 (is not standard language)
g:k change for clitic -ge
dd:tt and dtj, dts
Gradation Series 7, III-II, ks(t), kt, ktj, kts
Exceptional II-III inverse gradation in present participles
This gradation is only for II-I syllable verbs that get III as
- bbm - bm - m
- ddn - dn - n
- ddnj- dnj- nj
- ggŋ - gŋ - ŋ
- ddj - dj - dj
- hkk - hk - g
- hpp - hp - b
- htt - ht - d
- httj- htj- tj
- htts- hts- ts
Strategy: Do insertion rule for the initial element.
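The series listed above can be read as a lookup table from which the inserted consonant follows directly. The Python sketch below copies the rows from the list; reading the columns as grade III, grade II and grade I, in that order, is an assumption of the sketch, as are the names it uses.

```python
# Rows copied from the list above. The column order III - II - I is this
# sketch's reading of the list.
SERIES = [
    ("bbm", "bm", "m"),
    ("ddn", "dn", "n"),
    ("ddnj", "dnj", "nj"),
    ("ggŋ", "gŋ", "ŋ"),
    ("ddj", "dj", "dj"),
    ("hkk", "hk", "g"),
    ("hpp", "hp", "b"),
    ("htt", "ht", "d"),
    ("httj", "htj", "tj"),
    ("htts", "hts", "ts"),
]

# Grade II sequence -> grade III sequence.
II_TO_III = {grade2: grade3 for grade3, grade2, _ in SERIES}


def inserted_consonant(grade2):
    """Return the letter that turns a grade II sequence into grade III."""
    grade3 = II_TO_III[grade2]
    for i in range(len(grade3)):
        # Removing exactly one letter from grade III must give grade II.
        if grade3[:i] + grade3[i + 1:] == grade2:
            return grade3[i]
    raise ValueError(f"{grade3} is not {grade2} plus one letter")


for grade2 in ("bm", "dj", "hts"):
    print(grade2, "->", II_TO_III[grade2], "inserts", inserted_consonant(grade2))
```

In every row the grade III sequence is exactly one letter longer than the grade II sequence, which is consistent with treating the strengthening as an insertion, as in the example pair toab0moY6X4j: toabbmu00j earlier in this document.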
Consonant insertion as II-III strengthening gradation with bm, gŋ
Consonant insertion as II-III strengthening gradation with dn/j + as I-III strengthening gradation with d
Consonant insertion as II-III strengthening gradation with hk, hp,
Consonant insertion as II-III strengthening gradation with htt(j/s)
Debugging of twol-rules
All rule conflicts have been successfully resolved. The rule file