smj

Free and Open source Lule Sami analyser giella-smj

Authors
Divvun and Giellatekno teams, community members
Software version
2012
Documentation license
GNU GFDL
SVN Revision
$Revision:68217 $
SVN Date
$Date:2013-01-16 11:31:33 +0200 (Wed, 16 Jan 2013) $

giella-smj

This is free and open source Lule Sami morphology.

Lule Sámi morphological analyser

Definitions for Multichar_Symbols

Tags for POS

  • +N Noun
  • +A Adjective
  • +Adv Adverb
  • +V Verb
  • +Pron Pronouns
  • +CS Subjunction
  • +CC Conjunction
  • +Adp Adposition
  • +Po Postposition
  • +Pr Preposition
  • +Interj Interjection
  • +Pcle Particle
  • +Num Numeral
  • +TODO = Code for items that have not been modeled yet
  • +Prop Propernouns
  • +ACR Acronym
  • +Pers Personal pronoun
  • +Dem Demonstrative pronoun
  • +Interr Interrogative pronoun
  • +Refl reflexive pronoun
  • +Recipr reciprocal pronoun
  • +Rel relative pronoun
  • +Indef indefinite pronoun
  • +Coll collective numerals
  • +Arab arabic numerals
  • +Rom remertall
  • +Err/Orth Substandard. An ungrammatical, non-normative form of normative lemma.
  • +Err/Lex No normative lemma, often ungrammatical compounds like "bajásbuollda" and "songdebutierit".
  • +Err/Hyph No normative lemma
  • +Err/SpaceCmp No normative lemma
  • +Err/Der Lemmas that break with regular derivation rules, both morphologically and semantically
  • +Err/Infl Lemmas that break with normative inflection rules, often used with morphological changes
  • +Use/Marg Marginal, but normative lemmas. Not in speller.
  • +Use/-Spell Excluded from speller
  • +Use/-PLX Excluded from PLX speller
+Use/-PMatch Do not include in fst's made for hfst-pmatch
  • +Use/SpellNoSugg Recognized, but not suggested in speller
  • +Use/Circ Circular path
  • +Use/CircN Circular number path
  • +Use/Ped Remove from pedagogical speller
  • +Use/NG Do not generate, only for Oahpa and MT. In speller.
  • +Use/MT Generate for MT only, for restricting analyses needed
  • +Use/NGminip Not for miniparadigm in VD dicts
  • +Use/NotDNorm For words without formal normalization. Divvun suggest that this shouldn't be normative.
  • +Use/DNorm For words without formal normalization. Divvun suggest that this should be normative. Included in speller.
  • +Area/SE In Sweden
  • +Area/NO In Norway
  • +Dial/N Used in the northern areas. Some might say that these words are sme-words, but they are used by lulesamis in the northern part of the dialect area. Words like "válmas"
  • +Dial/S Used in the southern areas
  • +Dial/SH Short forms

Compounding tags

The tags are of the following form:

  • +CmpNP/xxx - Normative (N), Position (P), ie. the tag describes what position the tagged word can be in in a compound
  • +CmpN/xxx - Normative (N) form ie. the tag describes what form the tagged word should use when making compounds
  • +Cmp/xxx - Descriptive compounding tags, ie. tags that describes what form a word actually is using in a compound

Normative/prescriptive compounding tags: (to govern compound behaviour for the speller, ie. what a compound SHOULD BE)

The first part of the component may be ..

  • +CmpN/Sg = Singular
  • +CmpN/SgN = Singular Nominative
  • +CmpN/SgG = Singular Genitive
  • +CmpN/PlG = Plural Genitive
  • +CmpNP/All - ... be in all positions, default, this tag does not have to be written
  • +CmpNP/First - ... only be first part in a compound or alone
  • +CmpNP/Pref - ... only first part in a compound, NEVER alone
  • +CmpNP/Last - ... only be last part in a compound or alone
  • +CmpNP/Suff - ... only last part in a compound, NEVER alone
  • +CmpNP/None - ... not take part in compounds
  • +CmpNP/Only - ... only be part of a compound, i.e. can never be used alone, but can appear in any position
  • +CmpN/SgLeft Singular to the left
  • +CmpN/SgNomLeft Singular nominative to the left
  • +CmpN/SgGenLeft Singular genitive to the left
  • +CmpN/PlGenLeft Plural genitive to the left
  • +CmpN/Def Left override
  • +CmpN/DefSgGen Overrides left tag, requires SgGen form
  • +CmpN/DefPlGen Overrides left tag, requires PlGen form
  • +Cmp/Sg Singular
  • +Cmp/SgNom Singular Nominative
  • +Cmp/SgGen Singular Genitive
  • +Cmp/PlGen Plural Genitiv
  • +Cmp/PlNom Plural Nominative
  • +Cmp/Attr Attribute
  • +Cmp Dynamic compound - this tag should always be part of a dynamic compound. It is important for Apertium, and useful in other cases as well.
  • +Cmp/SplitR This is a split compound with the other part to the right: "Arbeids- og inkluderingsdepartementet" => Arbeids- = +Cmp/SplitR
  • +Cmp/SplitL This is a split compound with the other part to the left
  • +Cmp/Sh testing ShCmp
  • +Sg Singular number
  • +Du Dual number
  • +Pl Plural number
  • +Ess Essive case
  • +Nom Nominative case
  • +Gen Genitive case
  • +Acc Accusative case
  • +Ill Illative case
  • +Loc Locative case
  • +Com Comitative case
  • +Ine Inesive case
  • +Ela Elative case
  • +Par Partitive case
  • +Abe Abessive case
  • +PxSg1 possessive suffix singular first person
  • +PxSg2 possessive suffix singular second person
  • +PxSg3 possessive suffix singular third person
  • +PxDu1 possessive suffix dual first person
  • +PxDu2 possessive suffix dual second person
  • +PxDu3 possessive suffix dual third person
  • +PxPl1 possessive suffix plural first person
  • +PxPl2 possessive suffix plural second person
  • +PxPl3 possessive suffix plural plural person
  • +Comp Comparative comparison
  • +Superl Superlative comparison
  • +Attr Attribute
  • +Card
  • +Ord CHECK THIS! In closed-sme there are +Ord entries without circ. tag
  • +Ind Indicative mood
  • +Prs Present tense
  • +Prt Past tense
  • +Pot Potensial mood
  • +Cond conditional mood
  • +Imprt Imperative mood
  • +Sg1 singular first person
  • +Sg2 singular second person
  • +Sg3 singular third person
  • +Du1 dual first person
  • +Du2 dual second person
  • +Du3 dual third person
  • +Pl1 plural first person
  • +Pl2 plural second person
  • +Pl3 plural plural person
  • +Inf infinitive
  • +Ger gerundium
  • +ConNeg the main verb form used with negation verb. Like "bårå" in "Iv bårå guolev"
  • +Neg negation verb
  • +ImprtII second imperative mood
  • +PrsPrc present participle
  • +PrfPrc past participle
  • +Sup supinum
  • +VGen verb genitive
  • +VAbess verb abessive
  • +Actio Actio
  • +ABBR
  • +Symbol = independent symbols in the text stream, like £, €, ©
  • +ACR
  • +CLB
  • +PUNCT
  • +LEFT
  • +RIGHT
  • ^GUESSNOUNROOT
  • +CLBfinal Sentence final abbreviated expression ending in full stop, so that the full stop is ambiguous
  • +TV
  • +IV Transitivity tags
  • +Multi Multiword phrase tag
  • +Guess for the name guesser
  • +NomAg Actor Noun From Verb - Nomen Agentis

Lexeme disambiguation tags

+Hom1
Homonymy
+Hom2
Homonymy

Stem variant tags

  • +v1 - variant 1
  • +v2 - variant 2
  • +v3 - variant 3
  • +v4 - variant 4
  • +v5 - variant 5

+Cmp/Hyph +Cmp/Hyph Compound with (optional) hyphen, heavily weighted +Cmp/OblHyph +Cmp/OblHyph Obligatory hyphen, not as badly weighed as +Cmp/Hyph

Question and Focus particles:

  • +Qst
  • +Clt
  • +Foc These two are only found in SMJ - do we need them?

Focus particles:

  • +Foc/ge
  • +Foc/gen
  • +Foc/ga
  • +Foc/Neg-k
  • +Foc/Pos-k

Other tags

  • +MWE multi word expressions, goes to abbr
  • +Sh Short form

Semantic tags to help disambiguation & syntactic analysis

These tags should always be located just before the POS tag.

  • +Sem/Act = Activity; cleaning, work, occupation, project, photosynthesis
  • +Sem/Adr = Webadr
  • +Sem/Amount = Amount; pile, amount of lynx, amount of food, one fifteenth, ten percent
  • +Sem/Ani = Animate; dog, reindeer, teddy bear,ndragon, lice
  • +Sem/Ani-fish = Fish
  • +Sem/Aniprod = Animal Product; sweat, reindeer skin, lice egg, blood for making dumplings, pee
  • +Sem/Body = Bodypart; ear, bone, ear canal, artificial leg, mustache, nervous system
  • +Sem/Body-abstr = Non-physical body part; Sjel, ånd, reason, soul, voice, eyesight, conscience
  • +Sem/Build = Building; house, museum, Sámi tent, nest, sandcastle
  • +Sem/Build-room = Room in a building, typically place to be
  • +Sem/Buildpart = Part of building; closet, room, door, balcony, pool, office
  • +Sem/Cat = Category; name, subjunction, suffix, password, existential sentence
  • +Sem/Clth = Clothes/Clothing; shirt, hat, theater costume, shawl, seatbelt, diaper
  • +Sem/Clth-jewl = Jewelery and similar; watch, sunglasses, ring, necklace, crown
  • +Sem/Clthpart = Part of clothes; knapp, søm, pocket (OBS! Ctain-abstr), bottom part of an article of clothing, upper part of trousers, seam, button
  • +Sem/Ctain = Container; suitcase, terrarium, closet, container, gas tank
  • +Sem/Ctain-abstr = Abstract container; bank account, fund, account, loan fund, pot, bank account
  • +Sem/Ctain-clth = Eks; lomme/pocket
  • +Sem/Curr = Currency; not money, euro, US dollar, denar, Danish crown, currency
  • +Sem/Dance = Dance; swing, rumba, ballet, belly dance, war dance
  • +Sem/Date = Date
  • +Sem/Dir = Direction; GPS course, stock exchange price, graph, tendency, starboard
  • +Sem/Domain = Domain like politics, reindeerherding (a system of actions), anthropology, punk rock, biology, linguistics, medicine
  • +Sem/Drink = Drink; tea, beer, UHT milk, soda, communion wine
  • +Sem/Dummytag = Default tag for uncategorized nouns
  • +Sem/Edu = Educational event; skiing high school skiing academy, course, music lesson, lesson, master
  • +Sem/Event = Event; wedding, meeting, competition, election, festival
  • +Sem/Feat = Feature; Árvu, age difference, homosexuality, femininity, identity, congruence
  • +Sem/Feat-measr = Measurable feature; radius, diameter, volume, circumference, perimeter, frequency
  • +Sem/Feat-phys = Physiological feature; size, color, height, shape, weight, horsepower
  • +Sem/Feat-psych = Psychological feature; authority, nature, childishness, creativity, arrogance
  • +Sem/Fem = Female name
  • +Sem/Food = Food; bread, vegetarian food, flour, tobacco, salt
  • +Sem/Food-med = Medicine; birth-control pill, asthma medicine, medicine, penicillin, vaccine
  • +Sem/Fruit = Fruit and fruit-like edibles
  • +Sem/Furn = Furniture; throne, chair, table, altar, trampoline
  • +Sem/Game = Game; biŋgo bingo, TV game, flipper, paintball, chess
  • +Sem/Geom = Geometrical object; triangle, triangle, tetrahedron, asymptote, star
  • +Sem/Group = Animal or Human Group; family, herd, group, indigenous people, delegation
  • +Sem/Hum = Human
  • +Sem/Hum-abstr = Human abstract
  • +Sem/ID = ID
  • +Sem/Ideol = Ideology; nomadism, buddhism, feminism, christianity, fanaticism
  • +Sem/Lang = Language; South Sámi, mother tongue, Maori, Yiddish, second language
  • +Sem/Mal = Male name
  • +Sem/Mat = Material for producing things; paper, steel, wood, leather, wool
  • +Sem/Measr = Measure; unit of volume, quarter meter, hour, minus degree, wátta watt
  • +Sem/Money = Has to do with money; wages, not Curr(ency), treasure, belongings, debt, food price, bill, pension
  • +Sem/Obj = Object; thing, cup, thing, toy, painting
  • +Sem/Obj-catch =
  • +Sem/Obj-clo = Cloth; carpet, flag, curtain, silk sheets, napkin
  • +Sem/Obj-cogn =
  • +Sem/Obj-el = (Electrical) machine or apparatus; player, lamp, TV, radio, oven
  • +Sem/Obj-ling = Object with something written on it
  • +Sem/Obj-play = Play object
  • +Sem/Obj-rope = flexible ropelike object; barbed wire, thread, rope, cable, dental floss
  • +Sem/Obj-surfc = Surface object; blackboard, drawing paper, canvas, board (for playing board games), (money) bill
  • +Sem/Org = Organisation; newspaper, supreme court, company, museum, administration
  • +Sem/Part = Feature, part of something; half, part, percent, rest, tenth
  • +Sem/Perc-cogn = Cloth
  • +Sem/Perc-emo = Emotional perception; fear, feeling of identity, empathy, sadness, working motivation
  • +Sem/Perc-phys = Physical perception; shoulder pain, gass smell, sleep (during the night), need of sleep, hunger
  • +Sem/Perc-psych = Psychic perception
  • +Sem/Plant = Plant; lichen, plant, cucumber, bluebell, poisonous mushroom
  • +Sem/Plantpart = Plant part; twig, leaf, root, seed, pine trunk
  • +Sem/Play = Play
  • +Sem/Plc = Place; world, nature, airport, workplace, fireplace
  • +Sem/Plc-abstr = Abstract place; bachelor level, job market, third place, Troms website, address
  • +Sem/Plc-elevate = Elevated place; mountain, peak, mountain top, scree, volcano
  • +Sem/Plc-line = Place limitations; national border, border, finish line, stop line, equator
  • +Sem/Plc-water = Water; river, lake, polar sea, sea, well
  • +Sem/Pos = Position (as in social position job); 50% position, presidency, fixed position, kindergarten place, role in society
  • +Sem/Process = Process
  • +Sem/Prod = Product
  • +Sem/Prod-audio = Audio product; yoik, roar, Beatles song, Bible psalm, blues
  • +Sem/Prod-cogn = Cognition product; thought, decision, knowledge, lack of understanding, requirement
  • +Sem/Prod-ling = Linguistic product; message, question, agreement, translation, criticism
  • +Sem/Prod-vis = Visual product; picture, film, TV series, documentary, art
  • +Sem/Rel = Relation; relation, dependency, subordination, analogy, equivalence
  • +Sem/Route = Route-like place; street, path, corridor, bridge, winter path
  • +Sem/Rule = Rule or convention; cultural tradition, abortion law, EU rule, law of cosines, fair play
  • +Sem/Semcon = Abstract semantic concept; result, objective, reason, alternative, case
  • +Sem/Sign = Sign (e.g. numbers, punctuation); ID number, ČSV letters, quotation mark, hieroglyph, symbol
  • +Sem/Sport = Sport; table tennis, judo, motor cross, ice hockey, floorball
  • +Sem/State = State; hurry, captivity, anarchy, biodiversity, chaos
  • +Sem/State-sick = Illness; allergy, cold, autism, dementia, somnambulism
  • +Sem/Substnc = Substance; Water, sand, air, smoke, carbohydrate, vitamin, dust
  • +Sem/Sur = Surname
  • +Sem/Symbol = Symbol
  • +Sem/Time = Time; áigi time, April, half an hour, Tuesday, deadline
  • +Sem/Time-clock = Time
  • +Sem/Tool = Prototypical tool for repairing things; axe, knife, fire striker, plastic hammer, wrench
  • +Sem/Tool-catch = Tool used for catching; artificial fly, fishing net for cod, fishing rod, lasso used in wintertime, mouse trap
  • +Sem/Tool-clean = Tool used for cleaning; broom, vegetable brush, toothbrush, cleaning equipment
  • +Sem/Tool-it = Tool used in IT/tool within IT; IT infrastructure, analyzer, searching function, computer program, browser
  • +Sem/Tool-measr = Tool used for measuring; barometer, hourglass, ruler, spirit level, scale
  • +Sem/Tool-music = Musical instrument; shaman drum, guitar, violin, musical instrument, jaw harp
  • +Sem/Tool-write = Writing tool; colored pen, pencil, chalk, paintbrush, paint
  • +Sem/Txt = Written document; paper, book, letter, e-mail, song
  • +Sem/Veh = Vehicle; car, boat, sled, vehicle, draft reindeer, bicycle
  • +Sem/Wpn = Weapon; rifle, bow, sword, arrow, war axe
  • +Sem/Wthr = The Weather or the state of ground; cloudy weather, wind, driving conditions, night sunlight, rain shower
  • +Sem/Year = Year

Multiple Semantic tags:

  • +Sem/Ani_Group =
  • +Sem/Act_Clth =
  • +Sem/Act_Domain =
  • +Sem/Act_Domain_Measr =
  • +Sem/Act_Event =
  • +Sem/Act_Feat =
  • +Sem/Act_Feat-psych =
  • +Sem/Act_Fruit =
  • +Sem/Act_Group =
  • +Sem/Act_Hum =
  • +Sem/Act_Hum_Obj =
  • +Sem/Act_Money =
  • +Sem/Act_Obj =
  • +Sem/Act_Obj-play =
  • +Sem/Act_Org =
  • +Sem/Act_Perc-emo =
  • +Sem/Act_Plc = A persons job is an activity, and a place as well
  • +Sem/Act_Prod-audio =
  • +Sem/Act_Prod-vis =
  • +Sem/Act_Route = Activity and Route, ie johtolat
  • +Sem/Act_Semcon =
  • +Sem/Act_State =
  • +Sem/Act_Time =
  • +Sem/Act_Tool-it =
  • +Sem/Act_Txt =
  • +Sem/Amount_Build =
  • +Sem/Amount_Semcon =
  • +Sem/Ani_Body =
  • +Sem/Ani_Body-abstr_Hum =
  • +Sem/Ani_Build =
  • +Sem/Ani_Build_Hum_Txt =
  • +Sem/Ani_Buildpart =
  • +Sem/Ani_Cat =
  • +Sem/Ani_Clth =
  • +Sem/Ani_Feat_Hum =
  • +Sem/Ani_Feat_Plant =
  • +Sem/Ani_Food =
  • +Sem/Ani_Group_Hum =
  • +Sem/Ani_Group_Prod-vis =
  • +Sem/Ani_Hum =
  • +Sem/Ani_Hum_Plc =
  • +Sem/Ani_Hum_Time =
  • +Sem/Ani_Obj =
  • +Sem/Ani_Org =
  • +Sem/Ani_Plc =
  • +Sem/Ani_Plc_Txt =
  • +Sem/Ani_State =
  • +Sem/Ani_Substnc =
  • +Sem/Ani_Time =
  • +Sem/Ani_Veh =
  • +Sem/Aniprod_Hum =
  • +Sem/Aniprod_Mat =
  • +Sem/Aniprod_Obj =
  • +Sem/Aniprod_Obj-clo =
  • +Sem/Aniprod_Perc-phys =
  • +Sem/Aniprod_Plant =
  • +Sem/Aniprod_Plc =
  • +Sem/Aniprod_Plc_Route =
  • +Sem/Aniprod_Substnc =
  • +Sem/Aniprod_Substnc_Wthr =
  • +Sem/Body-abstr_Feat-psych =
  • +Sem/Body-abstr_Prod-audio_Semcon =
  • +Sem/Body_Body-abstr =
  • +Sem/Body_Buildpart =
  • +Sem/Body_Clth =
  • +Sem/Body_Clthpart =
  • +Sem/Body_Food =
  • +Sem/Body_Fruit =
  • +Sem/Body_Group_Hum =
  • +Sem/Body_Group_Hum_Time =
  • +Sem/Body_Hum =
  • +Sem/Body_Mat =
  • +Sem/Body_Measr =
  • +Sem/Body_Obj =
  • +Sem/Body_Obj_Tool-catch =
  • +Sem/Body_Org =
  • +Sem/Body_Part =
  • +Sem/Body_Plc =
  • +Sem/Body_Plc-elevate =
  • +Sem/Body_Plc_State =
  • +Sem/Body_State =
  • +Sem/Body_Time =
  • +Sem/Build-room_Furn =
  • +Sem/Build-room_Org =
  • +Sem/Build_Build-room =
  • +Sem/Build_Buildpart =
  • +Sem/Build_Clthpart =
  • +Sem/Build_Edu_Org =
  • +Sem/Build_Event_Org =
  • +Sem/Build_Obj =
  • +Sem/Build_Org =
  • +Sem/Build_Plc =
  • +Sem/Build_Route =
  • +Sem/Build_Tool =
  • +Sem/Build_Veh =
  • +Sem/Buildpart_Ctain_Obj =
  • +Sem/Buildpart_Obj =
  • +Sem/Buildpart_Obj_Plc =
  • +Sem/Buildpart_Part =
  • +Sem/Buildpart_Plc =
  • +Sem/Buildpart_Prod-audio =
  • +Sem/Cat_Edu =
  • +Sem/Cat_Feat =
  • +Sem/Cat_Feat-psych_Plc =
  • +Sem/Cat_Group_Hum =
  • +Sem/Cat_Hum =
  • +Sem/Cat_Measr =
  • +Sem/Cat_Obj =
  • +Sem/Cat_Plantpart =
  • +Sem/Cat_Plantpart_Semcon =
  • +Sem/Cat_Txt =
  • +Sem/Clth-jewl_Curr =
  • +Sem/Clth-jewl_Fruit =
  • +Sem/Clth-jewl_Money =
  • +Sem/Clth-jewl_Obj =
  • +Sem/Clth-jewl_Org =
  • +Sem/Clth-jewl_Plant =
  • +Sem/Clth_Hum =
  • +Sem/Clth_Obj =
  • +Sem/Clth_Part =
  • +Sem/Clth_Sur =
  • +Sem/Clthpart_Plc =
  • +Sem/Ctain-abstr_Org =
  • +Sem/Ctain-clth_Plant =
  • +Sem/Ctain-clth_Veh =
  • +Sem/Ctain_Feat-phys =
  • +Sem/Ctain_Furn =
  • +Sem/Ctain_Obj-surfc =
  • +Sem/Ctain_Plc =
  • +Sem/Ctain_Txt =
  • +Sem/Ctain_Tool =
  • +Sem/Ctain_Tool-measr =
  • +Sem/Curr_Org =
  • +Sem/Dance_Org =
  • +Sem/Dance_Prod-audio =
  • +Sem/Dir_Geom =
  • +Sem/Domain_Feat =
  • +Sem/Domain_Feat-phys =
  • +Sem/Domain_Food-med =
  • +Sem/Domain_Hum =
  • +Sem/Domain_Ideol =
  • +Sem/Domain_Obj =
  • +Sem/Domain_Org =
  • +Sem/Domain_Org_Plc-abstr =
  • +Sem/Domain_Perc-emo =
  • +Sem/Domain_Prod-audio =
  • +Sem/Domain_State-sick =
  • +Sem/Domain_Txt =
  • +Sem/Domain_Edu =
  • +Sem/Drink_Plant =
  • +Sem/Drink_Plc =
  • +Sem/Drink_Plc_Substnc =
  • +Sem/Edu_Event =
  • +Sem/Edu_Geom =
  • +Sem/Edu_Geom_Plc-line =
  • +Sem/Edu_Group_Hum =
  • +Sem/Edu_Hum =
  • +Sem/Edu_Mat =
  • +Sem/Edu_Org =
  • +Sem/Edu_Txt =
  • +Sem/Event_Food =
  • +Sem/Event_Hum =
  • +Sem/Event_Org =
  • +Sem/Event_Plc =
  • +Sem/Event_Plc-elevate =
  • +Sem/Event_Time =
  • +Sem/Feat-measr_Plc =
  • +Sem/Feat-phys_Food_State =
  • +Sem/Feat-phys_Hum =
  • +Sem/Feat-phys_Obj =
  • +Sem/Feat-phys_Plc-line =
  • +Sem/Feat-phys_Tool-write =
  • +Sem/Feat-phys_Veh =
  • +Sem/Feat-phys_Wthr =
  • +Sem/Feat-psych_Hum =
  • +Sem/Feat-psych_Plc =
  • +Sem/Feat_Hum =
  • +Sem/Feat_Plant =
  • +Sem/Food_Part =
  • +Sem/Food_Perc-phys =
  • +Sem/Food_Plant =
  • +Sem/Food_Substnc =
  • +Sem/Food_Time =
  • +Sem/Fruit_Sport =
  • +Sem/Furn_Ctain-abstr =
  • +Sem/Game_Obj-play =
  • +Sem/Geom_Obj =
  • +Sem/Geom_Plc =
  • +Sem/Group_Hum =
  • +Sem/Group_Hum_Org =
  • +Sem/Group_Hum_Plc =
  • +Sem/Group_Hum_Prod-vis =
  • +Sem/Group_Org =
  • +Sem/Group_Plc =
  • +Sem/Group_Prod-vis_Txt_Veh =
  • +Sem/Group_Sign =
  • +Sem/Group_State =
  • +Sem/Group_Txt =
  • +Sem/Hum-abtsr =
  • +Sem/Hum_Lang =
  • +Sem/Hum_Lang_Plc =
  • +Sem/Hum_Lang_Time =
  • +Sem/Hum_Mat_Tool =
  • +Sem/Hum_Money =
  • +Sem/Hum_Obj =
  • +Sem/Hum_Obj_Plc =
  • +Sem/Hum_Org =
  • +Sem/Hum_Part =
  • +Sem/Hum_Plant =
  • +Sem/Hum_Plc =
  • +Sem/Hum_State =
  • +Sem/Hum_Tool =
  • +Sem/Hum_Tool-catch =
  • +Sem/Hum_Veh =
  • +Sem/Hum_Wthr =
  • +Sem/Lang_Tool =
  • +Sem/Lang_Tool-catch =
  • +Sem/Mat_Obj =
  • +Sem/Mat_Part =
  • +Sem/Mat_Plant =
  • +Sem/Mat_Plantpart =
  • +Sem/Mat_Plc =
  • +Sem/Mat_Tool =
  • +Sem/Mat_Tool-catch =
  • +Sem/Mat_Txt =
  • +Sem/Measr_Plc_Time =
  • +Sem/Measr_Sign =
  • +Sem/Measr_Time =
  • +Sem/Money_Obj =
  • +Sem/Money_Plc =
  • +Sem/Money_Txt =
  • +Sem/Obj-ling_Obj-surfc =
  • +Sem/Obj-play =
  • +Sem/Obj-play_Sport =
  • +Sem/Obj_Part_Sign =
  • +Sem/Obj_Perc-emo =
  • +Sem/Obj_Plantpart =
  • +Sem/Obj_Plc =
  • +Sem/Obj_Plc-abstr =
  • +Sem/Obj_Plc_Semcon =
  • +Sem/Obj_Process =
  • +Sem/Obj_Prod-audio =
  • +Sem/Obj_Semcon =
  • +Sem/Obj_Sign =
  • +Sem/Obj_State =
  • +Sem/Obj_Tool-music =
  • +Sem/Obj_Tool-write =
  • +Sem/Obj_Txt =
  • +Sem/Obj_Veh =
  • +Sem/Org_Play =
  • +Sem/Org_Plc =
  • +Sem/Org_Prod-audio =
  • +Sem/Org_Prod-cogn =
  • +Sem/Org_Prod-vis =
  • +Sem/Org_Rule =
  • +Sem/Org_State =
  • +Sem/Org_Txt =
  • +Sem/Org_Veh =
  • +Sem/Part_Plc =
  • +Sem/Part_Plc_Prod-audio =
  • +Sem/Part_Prod-cogn =
  • +Sem/Part_Substnc =
  • +Sem/Part_Txt =
  • +Sem/Perc-emo_Plc =
  • +Sem/Perc-emo_State =
  • +Sem/Perc-emo_Wthr =
  • +Sem/Plant_Plantpart =
  • +Sem/Plant_Plc =
  • +Sem/Plant_Time_Wthr =
  • +Sem/Plant_Tool =
  • +Sem/Plant_Tool-measr =
  • +Sem/Plc-abstr_Rel_State =
  • +Sem/Plc-abstr_Route =
  • +Sem/Plc-abstr_Rule =
  • +Sem/Plc-abstr_State =
  • +Sem/Plc-abstr_Txt =
  • +Sem/Plc_Pos =
  • +Sem/Plc_Route =
  • +Sem/Plc_State =
  • +Sem/Plc_Substnc =
  • +Sem/Plc_Substnc_Wthr =
  • +Sem/Plc_Time =
  • +Sem/Plc_Time_Wthr =
  • +Sem/Plc_Tool-catch =
  • +Sem/Plc_Txt =
  • +Sem/Plc_Wthr =
  • +Sem/Prod-audio_Prod-vis =
  • +Sem/Prod-audio_Substnc =
  • +Sem/Prod-audio_Txt =
  • +Sem/Prod-cogn_Txt =
  • +Sem/Route_State =
  • +Sem/Route_Txt =
  • +Sem/Rule_Txt =
  • +Sem/Semcon_State =
  • +Sem/Semcon_Txt =
  • +Sem/State-sick_Substnc =
  • +Sem/State_Veh =
  • +Sem/Substnc_Wthr =
  • +Sem/Time_Wthr =
  • +Sem/Tool-music =

Derivation tags

The following tags are used to describe the dynamic derivational system in Lule Sámi as encoded in this lexical description. The tags are classified according to a positional system, where each tag can be in one and only one position, and can only combine with tags from an earlier / lower position. This is done to avoid possible overgeneration in the derivational system.

+Der1 +Der2 +Der3 +Der4 +Der5
- positional tags, preceeds the actual der tag

Der#1 tags - tags in first position

  • +Der/PassL VV - long passive láhpeduvvat
  • +Der/PassS VV - Short passive láhpput
  • +Der/PassD VV - dallat passive
  • +Der/Dimin NN
  • +Der/adda VV
  • +Der/ahtja VV - only odd syll verbs take this der
  • +Der/ahttjá VV - only odd syll verbs take this der
  • +Der/Caus VV - previously Der/ahtte
  • +Der/alla VV
  • +Der/asste VV
  • +Der/d VV
  • +Der/dalla VV
  • +Der/dasste VV
  • +Der/Car NA - only even/contr, prev. Der/dibme
  • +Der/ferjak NA Adjectival -k der (from ?)
  • +Der/k NN / NA
  • +Der/l VV
  • +Der/ladda VV
  • +Der/lahtte VV
  • +Der/lasj NA - dont know, guess it Tronds, ojes, I see - is this ok?jes 2 Der: lasj Noun on 1472 Adj on 2040
  • +Der/lasj NN
  • +Der/lasste VV
  • +Der/n NA. Denominal -n adjective (similar t -k adj)
  • +Der/r VN - AA?
  • +Der/sasj NA
  • +Der/segak NA Adj. -k der from?
  • +Der/st VV
  • +Der/stahtte VV
  • +Der/stalla VV
  • +Der/stasste VV
  • +Der/tj VV
  • +Der/u/a/åd VV
  • +Der/A NA

Der#2 tags - tags in second position

  • +Der/dahtte VV
  • +Der/duhtte VV
  • +Der/ahkes VA
  • +Der/NomAct VN

Der#3 tags - tags in third position

  • +Der/duvva VV
  • +Der/InchL VV (previosuly Der/goahte)
  • +Der/mus VN
  • +Der/NomAct VN Realised in two different ways.
  • This realisation is Der3. Outcommented
  • to not define the tag twice, but kept
  • here for documentation purposes.
  • +Der/dahka VN
  • +Der/lis VA
  • +Der/NomAg VN

Der#4 tags - tags in fourth position

  • +Der/ahtes NA ! only odd

Der#5 tags - tags in fifth position

  • +Der/AAdv NA AAdv, previously +Der/at
  • +Der/vuota NA AN (tag harmonization: previosuly Der/vuohta)

Der#other tags - tags that can be in any position

There are no such tags in SMJ, but for symmetry and code coherence with SME the class is still kept.

Tags for originating language

The following tags are used to guide conversion to IPA: loan words and foreign names are usually pronounced (approximately) as in the originating (majority) language. Instead of trying to identify the correct pronunciation based on phonotactics (orthotactics actually), we tag all words that can't be correctly transcribed using the SME transcriber with source language codes. Once tagged, it is possible to split the lexical transducer in smaller ones according to langu- age, and apply different IPA conversion to each of them. The principle of tagging is that we only tag to the extent needed, and following a priority:

  1. any untagged word is pronounced with SME orthographic conventions
  2. NNO and NOB have identical pronunciation, NNO is only used if different in spelling from NOB
  3. SWE has mostly the same pronunciation as NOB, and is only used if different in spelling from NOB
  4. Occasionally even SME (the default) may be tagged, to block other languages from being specified, mainly during semi-automatic language tagging sessions All in all, we want to get as much correctly transcribed to IPA with as little work as possible. On the other hand, if more words are tagged than strictly needed, this should pose no problem as long as the IPA conversion is correct - at least some words will get the same pronunciation whether read as SME or NOB/NNO/SWE.
  • +OLang/SME - North Sámi
  • +OLang/SMA - South Sámi
  • +OLang/FIN - Finnish
  • +OLang/SWE - Swedish
  • +OLang/NOB - Norw. bokmål
  • +OLang/NNO - Norw. nynorsk
  • +OLang/ENG - English
  • +OLang/RUS - Russian
  • +OLang/UND - Undefined

Flag diacritics

We have manually optimised the structure of our lexicon using following flag diacritics to restrict morhpological combinatorics - only allow compounds with verbs if the verb is further derived into a noun again:

@P.NeedNoun.ON@ (Dis)allow compounds with verbs unless nominalised
@D.NeedNoun.ON@ (Dis)allow compounds with verbs unless nominalised
@C.NeedNoun@ (Dis)allow compounds with verbs unless nominalised
@D.ErrOrth.ON@
@C.ErrOrth@
@P.ErrOrth.ON@

For languages that allow compounding, the following flag diacritics are needed to control position-based compounding restrictions for nominals. Their use is handled automatically if combined with +CmpN/xxx tags. If not used, they will do no harm.

@P.CmpFrst.FALSE@ Require that words tagged as such only appear first
@D.CmpPref.TRUE@ Block such words from entering ENDLEX
@P.CmpPref.FALSE@ Block these words from making further compounds
@D.CmpLast.TRUE@ Block such words from entering R
@D.CmpNone.TRUE@ Combines with the next tag to prohibit compounding
@U.CmpNone.FALSE@ Combines with the prev tag to prohibit compounding
@U.CmpNone.TRUE@ Combines with the two previous ones to block compounding
@P.CmpOnly.TRUE@ Sets a flag to indicate that the word has passed R
@D.CmpOnly.FALSE@ Disallow words coming directly from root.
@U.CmpHyph.FALSE@ Flag to control hyphenated compounds like proper nouns
@U.CmpHyph.TRUE@ Flag to control hyphenated compounds like proper nouns
@C.CmpHyph@ Flag to control hyphenated compounds like proper nouns

Use the following flag diacritics to control downcasing of derived proper nouns (e.g. Finnish Pariisi -> pariisilainen). See e.g. North Sámi for how to use these flags. There exists a ready-made regex that will do the actual down-casing given the proper use of these flags.

@U.Cap.Obl@ Disallow downcasing of names when not derived: Deatnu
@U.Cap.Opt@ Allowing downcasing of derived names: deatnulasj.
@P.Px.add@ Giving possibility for Px-suffixes (all except from Nom 3.p)
@R.Px.add@ Requiring P.Px.add-flag for Px-suffixes (all except from Nom 3.p)
@P.Nom3Px.add@ Giving possibility for Px-suffixes Nom 3.p
@R.Nom3Px.add@ Requiring P.Nom3Px.add flag for Px-suffixes Nom 3.p
  • LEXICON Acronym
  • LEXICON ProperNoun

Lexicon ENDLEX

And this is the ENDLEX of everything:

 @D.CmpOnly.FALSE@@D.CmpPref.TRUE@@D.NeedNoun.ON@ # ;

The @D.CmpOnly.FALSE@ flag diacritic is ued to disallow words tagged with +CmpNP/Only to end here. The @D.NeedNoun.ON@ flag diacritic is used to block illegal compounds.

  • LEXICON UNIT As acro, but without paradigm
  • LEXICON ACRONOUN_cons is the lexicon for nouns (not +Prop) like ATV
  • LEXICON ACRONOUN_vow is the lexicon for nouns (not +Prop) like ATV

LEXICON ACRO_vow Acronyms ending in vowels and which are pronounced as a word, like NATO and UNESCO (Giellagálldo bievddegirjje 2/2016). These only get the case endings: NATO: v, NATO: n, etc.

LEXICON ACRO_cons Mostly acronyms ending in consonants, but also acronyms ending in vowels which are not pronounced as words, like LO and CV (Giellagálldo bievddegirjje 2/2016). These get case endings like a regular a-stem noun with both the vowel and case ending: LO: av, LO: an, etc.

Sublexica for Adjective

Even-syllable stems

LEXICON GIEVRRA Adjectives with attribute in WeG and -s. As 1a in Spiik. Sg Acc: gievrav, Attr: gievras.

gárttje # Even-syllable test examples:

  • gárttje: gárttje+A+Sg+Nom
  • gártjev: gárttje+A+Sg+Acc
  • gártjes: gárttje+A+Attr
  • gártjep: gárttje+A+Comp+Sg+Nom

LEXICON HÁVSSKE Adjectives with attribute -s, without WeG. As 1c in Spiik. Sg Acc: hávsskev, Attr: hávsskes.

hoallá # Even-syllable test examples:

  • hoallá: hoallá+A+Sg+Nom
  • hoalláv: hoallá+A+Sg+Acc
  • hoallás: hoallá+A+Attr
  • hoalláp: hoallá+A+Comp+Sg+Nom

LEXICON NUORRA Adjectives with attribute same as pred. As 1b in Spiik. Sg Acc: nuorav, Attr: nuorra.

visská # Even-syllable test examples:

  • visská: visská+A+Sg+Nom
  • viskáv: visská+A+Sg+Acc
  • visská: visská+A+Attr
  • viskáp: visská+A+Comp+Sg+Nom

LEXICON TJUODDJE Adjectives with attribute -is, without WeG. presently only "Tjuoddje" Sg Acc: tjuoddjev, Attr: tjuoddjis.

tjuoddje # Even-syllable test examples:

  • tjuoddje: tjuoddje+A+Sg+Nom
  • tjuoddjev: tjuoddje+A+Sg+Acc
  • tjuoddjis: tjuoddje+A+Attr
  • tjuoddjep: tjuoddje+A+Comp+Sg+Nom

LEXICON GALLJE Adjectives on -e, the attribute is in WeG and e > a. As 1d in Spiik. Sg Acc: galjev, Attr: galja.

uhttse # Even-syllable test examples:

  • uhttse: uhttse+A+Sg+Nom
  • uhtsev: uhttse+A+Sg+Acc
  • uhtsa: uhttse+A+Attr
  • uhtses: uhttse+A+Attr (Eng. # gets this attr from LEXATTR)
  • uhtsep: uhttse+A+Comp+Sg+Nom

LEXICON TJÁBBE Adjectives on -e, the attribute is in WeG and e > a. Same as GALLJE only different adv derivation. Sg Acc: tjáppev, Attr: tjáppa.

njálgge # Even-syllable test examples:

  • njálgge: njálgge+A+Sg+Nom
  • njálgev: njálgge+A+Sg+Acc
  • njálga: njálgge+A+Attr
  • njálgep: njálgge+A+Comp+Sg+Nom

Pres.participles

LEXICON SÁVADAHTTE Causative-participles. No attribute. No comparision. As 1e in Spiik. Sg Acc: sávadahttev. PrsPrc of causative verbs "uttrykker at handlingen lar seg gjøre eller er verdt å gjøre" (Kintel 1991).

vuojedahtte # Even-syllable test examples:

  • vuojedahtte: vuojedahtte+A+Sg+Nom
  • vuojedahttev: vuojedahtte+A+Sg+Acc

LEXICON JUHKKE participles with -s attributive. No comparision As 1e in Spiik. Sg Acc: juhkkev, Attr: juhkkes. Spiik: presens particip har med den attributive formen på -s betydelsen ºnågon som är duktig i, snabb til att, begiven att utföra handlingenº.

vuohttje # Even-syllable test examples:

  • vuohttje: vuohttje+A+Sg+Nom
  • vuohttjev: vuohttje+A+Sg+Acc
  • vuohttjes: vuohttje+A+Attr

LEXICON BÅRRE participles without the -s attributive. As 1e in Spiik. Sg Acc: bårrev, Attr: bårre. Spiik: presens particip har med den attributiva formen utan -s betydelsen ºdem som utför handlingenº.

ednabårre # Even-syllable test examples:

  • ednabårre: ednabårre+A+Sg+Nom
  • ednabårrev: ednabårre+A+Sg+Acc
  • ednabårre: ednabårre+A+Attr

Test data:

Loan words lexicas

Correctly assimilated loanwords, derived from real noun.

LEXICON MEKANIHKALASJ LOAN! Foreign -isk adjectives adapted in updated normative way. To smj ending -alasj, adjective is truly derived from a noun. Mekanisk-mekanihkka-mekanihkalasj, instead of mekánalasj that goes to MEKÁNALASJ_BADASS. Pred and attr are both -alasj. Attr same as pred. With comparatives.

LEXICON MEKANIHKALASJ_CMP_INFL

kapitalismalasj # Even-syllable test examples:

  • kapitalismalasj: kapitalismalasj+A+Sg+Nom
  • kapitalismalattjav: kapitalismalasj+A+Sg+Acc
  • kapitalismalasj: kapitalismalasj+A+Attr
  • kapitalismalabbo: kapitalismalasj+A+Comp+Sg+Nom

LEXICON IJJALASJ Just lik MEKANIHKALASJ only for words ending on ijjalasj/iddjalasj, so that we don't need a lot of Area and Err tags in stems file.

LEXICON IJJALASJ_CMP_INFL

LEXICON OGIJJALASJ Just like IJJALASJ only for words ending on ogijjalasj/ogiddjalasj, so that we don't need a lot of err tags in stems files. For words like "pedagogijjalasj" which also have "pedagåvgålasj" (not really a wrong derivation, but doesn't mean pedagogisk) and "pedagogalasj" err taged.

LEXICON OGIJJALASJ_CMP_INFL

LEXICON SJÅNÅLASJ_SJONAL -sjonal/sjonell and -tional/tionel loanwords. Only for words that work as nouns, so that they are REAL dervations, as nasjonal-nasjåvnnå-nasjåvnålasj. NOT for words like "rasjonell", with no real noun. Words as "rasjonell>rasjonálla-rasjonálalasj" go to lexicon ÁLLA. The fake derivation "nasjonálalasj" is err taged, so is the strange "nasjonálla/nasjunálla".

LEXICON SJÅNÅLASJ_SJONAL_CMP_INFL

konstitusjåvnålasj # Even-syllable test examples:

  • nasjåvnålasj: nasjåvnålasj+A+Sg+Nom
  • nasjåvnålattjav: nasjåvnålasj+A+Sg+Acc
  • nasjåvnålasj: nasjåvnålasj+A+Attr

LEXICON SJÅNÅLASJ_SJONELL -sjonal/sjonell and -tional/tionel loanwords. Only for words that work as nouns, so that they are REAL dervations, as nasjonal-nasjåvnnå-nasjåvnålasj. NOT for words like "rasjonell", with no real noun. Words as "rasjonell>rasjonálla-rasjonálalasj" go to lexicon ÁLLA. The fake derivation "nasjonálalasj" is err taged, so is the strange "nasjonálla/nasjunálla".

LEXICON SJÅNÅLASJ_SJONELL_CMP_INFL

konstitusjåvnålasj # Even-syllable test examples:

  • konstitusjåvnålasj: konstitusjåvnålasj+A+Sg+Nom
  • konstitusjåvnålattjav: konstitusjåvnålasj+A+Sg+Acc
  • konstitusjåvnålasj: konstitusjåvnålasj+A+Attr

Badly assimilated loanwords, some against norm others with no norm

LEXICON MEKÁNALASJ_BADASS LOAN! Wronly assimilated -lasj adjectives fro SE/NO -isk. Looks derived but isn't since there is no real noun to be derived from. Like mekanisk-mekánalasj, but "mekádna" is no real noun! Like MEKANIHKALASJ, but gives the Use/-Spell tag, so it's only for these wronly/non-derived loan adjectives.

LEXICON ARKTALASJ_CMP_INFL Foreign -isk, that are not real derivations. Same as MEKÁNALASJ_BADASS, but no +Use/-Spell tag since ther is no "right" way to assimilate these. This is a question for GG. Adapted to smj by simply adding -alasj in place of -isk. These are not real derivations, but sitation borrowed loan adjectives. Only words without a noun base, like arktisk and syntetisk. Pred and attr are both -lasj. No comparatives.

syntetalasj # Even-syllable test examples:

  • syntetalasj: syntetalasj+A+Sg+Nom
  • syntetalattjav: syntetalasj+A+Sg+Acc
  • syntetalasj: syntetalasj+A+Attr

LEXICON ORÁNSSJA Loan adjectives, not -isk. Used without the -lasj. Adjectives with attribute same as pred. So far only for oránssja.

LEXICON DEMONSTRATIJVA_LASJ_NO_NORM Loan adjectives from norwegian/swedish (Not adjectives ending on -isk). Words like demonstrativ, transitiv, dupleks, informativ, analog, privat. Gives both "demonstratijvva" and "demonstratijvalasj". There are two ways of adapting these adjectives, the adding of -lasj isn't okey, because that's a false derivation. But GG hasn't decided how these should be handled, so so far both ok. Looks like noun instead of adjective when adapted without the -lasj ending. Attr is in weak grad, used in strong grad ass pred even thou this seems a little bit odd "Værbba l transitijvva".

LEXICON DEMONSTRATIJVA_LASJ_CMP_INFL

aktijvva # Even-syllable test examples:

  • aktijva: aktijvva+A+Attr
  • aktijvalasj: aktijvva+A+Attr
  • aktijvva: aktijvva+A+Sg+Nom
  • aktijvalasj: aktijvva+A+Sg+Nom
  • aktijvalattjav: aktijvva+A+Sg+Acc
  • aktijvav: aktijvva+A+Sg+Acc

LEXICON ÁLA_LASJ_NO_NORM Same as DEMONSTRATIJVA_LASJ_NO_NORM. Only for adjectives ending on -al. Words like digital,liberal, lokal. Gives both "eksponentiálla" and "eksponentiálalasj". Different lexicon for these -al adjectives because of Err/Orth tags. OBS, "dialektal", is assimilated "dialevtalasj", and goes to lexicon MEKANIHKALASJ.

LEXICON ÁLA_LASJ_INFL_CMP

+Err/Orth: álal%> ABBO/AMOS ;

LEXICON ELLA_LASJ_NO_NORM Loanwords, same as ÁLA_LASJ_NO_NORM and DEMONSTRANTIJVA_LASJ_NO_NORM. For NO and SE adjectives ending on -ell, eksperimentell, ideell, parallell. The short form is nom parallælla, attr, parallella The long form: paralellalasj, attr parallellalasj. Different lexicon for these -ell adjectives because of err/orth tags. OBS, "individuell", is assimilated "indivijdalasj", and goes to lexicon MEKANIHKALASJ.

LEXICON ELLA_LASJ_INFL_CMP

LEXICON ÁLLA-ÆLLA

LEXICON MEKÁNALASJ_CMP_INFL Same as mekanihkalasj only without vuohta.

Inherent comparatives and superlatives lexica

LEXICON OANEP Inherent comparatives, gives comp and superl. There are two main groups of word here: Adjectives that are lexicalized in their comparative (and superlative) forms, like sisŋep, bárep. And Nouns that can be compared, like nuortap, gáttep, oarjep (some of these are compared from their noun lexicas and thus are found twice). Some entries are likely incorrect compared forms of other adjectives, like ådåp and ruvvap (more research needed).

lagáp # Even-syllable test examples:

  • lagáp: lagáp+A+Comp+Sg+Nom
  • lagámus: lagáp+A+Superl+Sg+Nom

LEXICON TJAVGGÁMUS Inherent superlatives, only gives superl. Some words are lexicalized in their superlative forms, like dájvvámus. Some are likely incorrect superlative forms, like tjábbámus (more research is needed)

dájvvámus # Even-syllable test examples:

  • dájvvámus: dájvvámus+A+Superl+Sg+Nom

4-syllable miscellanious stems

LEXICON ÁRMMOGIS Adjectives on -is, attribute same as pred. Odd-syllable comparison. As 2 in Spiik. Sg Acc: ármmogisáv, Attr: ármmogis.

bahágis # Even-syllable test examples:

  • bahágis: bahágis+A+Sg+Nom
  • bahágisáv: bahágis+A+Sg+Acc
  • bahágis: bahágis+A+Attr
  • bahágabbo: bahágis+A+Comp+Sg+Nom

LEXICON SÆHKÁLAK Adjectives on -álak, attribute same as pred. Odd-syllable comparison. So far only for "sæhkálak".

sæhkálak # Even-syllable test examples:

  • sæhkálak: sæhkálak+A+Sg+Nom
  • sähkálak: sæhkálak+A+Sg+Nom
  • sæhkálagáv: sæhkálak+A+Sg+Acc
  • sähkálagáv: sæhkálak+A+Sg+Acc
  • sæhkálak: sæhkálak+A+Attr
  • sähkálak: sæhkálak+A+Attr
  • sæhkálabbo: sæhkálak+A+Comp+Sg+Nom
  • sähkálabbo: sæhkálak+A+Comp+Sg+Nom

LEXICON ÅLLAGSJ_CMP_INFL Adjectives on -asj, attribute same as pred. No comparatives. 2 in Spiik. Sg Acc: ållagattjav, Attr: ållagasj.

belulasj # Even-syllable test examples:

  • belulasj: belulasj+A+Sg+Nom
  • belulattjav: belulasj+A+Sg+Acc
  • belulasj: belulasj+A+Attr

LEXICON DÁRBULASJ_CMP_INFL Adjectives on -asj, attribute same as pred. Odd-syllable comparison. Sg Acc: dárbulattjav, Attr: dárbulasj. Essive -attjan, -adtjan is subtaged.

dábálasj # Even-syllable test examples:

  • dábálasj: dábálasj+A+Sg+Nom
  • dábálattjav: dábálasj+A+Sg+Acc
  • dábálasj: dábálasj+A+Attr
  • dábális: dábálasj+A+Attr
  • dábálabbo: dábálasj+A+Comp+Sg+Nom

LEXICON ASIDASJ_CMP_INFL Adjectives on -asj, -is attr. Odd-syllable comparison. Sg Acc: asidattjav, Attr: asidis.

gågulasj # Even-syllable test examples:

  • gågulasj: gågulasj+A+Sg+Nom
  • gågulattjav: gågulasj+A+Sg+Acc
  • gågulis: gågulasj+A+Attr
  • gågulabbo: gågulasj+A+Comp+Sg+Nom

LEXICON UDNODIBME Adjectives on -dibme, attribute on -is. Odd-syllable comparison. Sg Acc: udnodimev, Attr: udnodis.

gælvodibme # Even-syllable test examples:

  • gælvodibme: gælvodibme+A+Sg+Nom
  • gälvodibme: gælvodibme+A+Sg+Nom
  • gælvodimev: gælvodibme+A+Sg+Acc
  • gälvodimev: gælvodibme+A+Sg+Acc
  • gælvodis: gælvodibme+A+Attr
  • gälvodis: gælvodibme+A+Attr
  • gælvodabbo: gælvodibme+A+Comp+Sg+Nom
  • gälvodabbo: gælvodibme+A+Comp+Sg+Nom

LEXICON TJALMEDIBME Like UDNODIBME but no comparatives. Sg Acc: tjalmedimev, Attr: tjalmedis.

huvsodibme # Even-syllable test examples:

  • huvsodibme: huvsodibme+A+Sg+Nom
  • huvsodimev: huvsodibme+A+Sg+Acc
  • huvsodis: huvsodibme+A+Attr

LEXICON SUOLASIEHKE -siehke. Sg Acc: suolasiegev, attr: suolasiek

hánessiehke # Even-syllable test examples:

  • hánessiehke: hánessiehke+A+Sg+Nom
  • hánessiegev: hánessiehke+A+Sg+Acc
  • hánessiek: hánessiehke+A+Attr

Odd-syllable stems

LEXICON TJIEGOS Adjectives on -s, negating adjs on -dahkes, with attr same as pred. For adjectives with -e in second syllable e>á: divtes>diktásav in StrG. As a. in Spiik. Sg Acc: tjiehkusav, Attr: tjiegos. Consonant gradation.

hánes # Odd-syllable test examples:

  • hánes: hánes+A+Sg+Nom
  • hádnásav: hánes+A+Sg+Acc
  • hánes: hánes+A+Attr
  • hádnásabbo: hánes+A+Comp+Sg+Nom

LEXICON GÅNTSAS Same as TJIEGOS but with vokal change å=oa. Attr is same as pred. Sg Acc: goanntsasav, attr: gåntsas. Consonant gradation.

måskas # Odd-syllable test examples:

  • måskas: måskas+A+Sg+Nom
  • moasskasav: måskas+A+Sg+Acc
  • måskas: måskas+A+Attr
  • moasskasabbo: måskas+A+Comp+Sg+Nom

LEXICON GALMAS Adjectives on -as, ås- and ás, with attr ending on a (-as and -ás pred) and å (-ås pred). As e. in Spiik. Sg Acc: galmmasav, attr: galmma, Consonant gradation.

njuoskas # Odd-syllable test examples:

  • njuoskas: njuoskas+A+Sg+Nom
  • njuosskasav: njuoskas+A+Sg+Acc
  • njuosska: njuoskas+A+Attr
  • njuosskasabbo: njuoskas+A+Comp+Sg+Nom

LEXICON SÁJSAS For -as adjectives with attribute -e. Presently only used for "sájsas", because this -as adj gets -e attr. Sg Acc: sájssasav, Attr: sájsse. Consonant gradation.

sájsas # Odd-syllable test examples:

  • sájsas: sájsas+A+Sg+Nom
  • sájssasav: sájsas+A+Sg+Acc
  • sájsse: sájsas+A+Attr
  • sájssasabbo: sájsas+A+Comp+Sg+Nom

LEXICON RÁDAS Presently only used for "rádas". This word has special consonant gradation d>dd. Attr same as pred. Sg Acc: ráddasav, Attr: rádas. Consonant gradation.

rádas # Odd-syllable test examples:

  • rádas: rádas+A+Sg+Nom
  • ráddasav: rádas+A+Sg+Acc
  • rádas: rádas+A+Attr
  • ráda: rádas+A+Attr (Eng. # from LEXATTR)
  • ráddasabbo: rádas+A+Comp+Sg+Nom

LEXICON LUOBES Adjectives on -es. Attribute same as pred. Does the same as TJIEGOS only e>a. Sg Acc: luohpasav, Attr: luobes. Consonant gradation.

luobes # Odd-syllable test examples:

  • luobes: luobes+A+Sg+Nom
  • luohpasav: luobes+A+Sg+Acc
  • luobes: luobes+A+Attr
  • luohpasabbo: luobes+A+Comp+Sg+Nom

LEXICON GIMES Adjectives on -es with attribute -a. Vocaal e>a in StrG. As g. in Spiik. Sg Acc: gibmasav, Attr: gibma. Consonant gradation.

gimes # Odd-syllable test examples:

  • gimes: gimes+A+Sg+Nom
  • gibmasav: gimes+A+Sg+Acc
  • gibma: gimes+A+Attr
  • gibmasabbo: gimes+A+Comp+Sg+Nom

LEXICON LINES Same as GIMES, only with e>á in StrG. Adjectives on -es with attribute -a. As g. in Spiik. Sg Acc: lidnásav, attr: lidna. Consonant gradation.

lines # Odd-syllable test examples:

  • lines: lines+A+Sg+Nom
  • lidnásav: lines+A+Sg+Acc
  • lidna: lines+A+Attr
  • lidnásabbo: lines+A+Comp+Sg+Nom

LEXICON OAMES Adjectives on -es with attribute -e. As g2. in Spiik. Sg Acc: oabmásav, Attr: oabme. Consonant gradation.

goastes # Odd-syllable test examples:

  • goastes: goastes+A+Sg+Nom
  • goasstásav: goastes+A+Sg+Acc
  • goasste: goastes+A+Attr
  • goasstásabbo: goastes+A+Comp+Sg+Nom

LEXICON BASSTEL Adjs on -et, -l, -r, sm om -k, -sj with attr -is and no consonant gradation. As b. in Spiik. Sg Acc: basstelav, Attr: basstelis. Many of these entries might be instances of derivations, like belak, deblak, and maybe also basstel, bargán.

goavrret # Odd-syllable test examples:

  • goavrret: goavrret+A+Sg+Nom
  • goavrredav: goavrret+A+Sg+Acc
  • goavrredis: goavrret+A+Attr
  • goavrredabbo: goavrret+A+Comp+Sg+Nom

LEXICON DABÁR Adjectives on -r. With attr -is like BASSTEL, but with CG. Sg Acc: dahparav, Attr: dabáris. Presently only for dabár. Consonant gradation.

sihkar # Odd-syllable test examples:

  • dabár: dabár+A+Sg+Nom
  • dahparav: dabár+A+Sg+Acc
  • dabáris: dabár+A+Attr
  • dahparabbo: dabár+A+Comp+Sg+Nom

LEXICON MUTTÁK Adjs on -ák/-ak/-ek, two attr: -is and same as pred. As c. in Spiik. Sg Acc: muttágav, Attr: muttágis and mutták. These seem to be instances of the adjectival -k derivation. Unclear whether such derivation have different attr forms or not, and thats maybe why some of these derivations are found in BASSTEL lexicon.

bárvak # Odd-syllable test examples:

  • bárvak: bárvak+A+Sg+Nom
  • bárvagav: bárvak+A+Sg+Acc
  • bárvak: bárvak+A+Attr
  • bárvagis: bárvak+A+Attr
  • bárvagabbo: bárvak+A+Comp+Sg+Nom

LEXICON SKÁRBAK Presently only for skárbak, because it has skárbbis attr not skárbagis according to Korhonen. It might be a hybrid. Consonant gradation.

skárbak # Odd-syllable test examples:

  • skárbak: skárbak+A+Sg+Nom
  • skárbagav: skárbak+A+Sg+Acc
  • skárbbis: skárbak+A+Attr
  • skárbagabbo: skárbak+A+Comp+Sg+Nom

LEXICON ALLAK Adjs on -ak, attr.on -a. Have both gasep/gaggagabbo and alep/allagabbo as comparatives. As d. in Spiik. So far only the adjectives "allak" and "gassak" go to this lexicon.

gassak # Odd-syllable test examples:

  • gassak: gassak+A+Sg+Nom
  • gassagav: gassak+A+Sg+Acc
  • gassa: gassak+A+Attr
  • gassagabbo: gassak+A+Comp+Sg+Nom
  • gasep: gassak+A+Comp+Sg+Nom

LEXICON GÅBDDÅK Adjs on -åk, attr. on -å. Has even-syllable comparison: gåbdep and gåbdemus. So far "gåbddåk" is the only word in this lexicon. As d2. in Spiik. Sg Acc: gåbddågav, Attr: gåbddå.

gåbddåk # Odd-syllable test examples:

  • gåbddåk: gåbddåk+A+Sg+Nom
  • gåbddågav: gåbddåk+A+Sg+Acc
  • gåbddå: gåbddåk+A+Attr
  • gåbdep: gåbddåk+A+Comp+Sg+Nom

LEXICON STUORAK Only for stuorak. It hase two attributes. Has even-syllable comparison: stuoráp and stuorámus.Sg Acc: stuoragav, attr: stuor and stuorra. This might be a -k derivation of adjective stuorre attr stuor(ra). The comparison is thus based on the original adjective and thus it naturally is an even syll comparison.

stuorak # Odd-syllable test examples:

  • stuorak: stuorak+A+Sg+Nom
  • stuoragav: stuorak+A+Sg+Acc
  • stuorra: stuorak+A+Attr
  • stuor: stuorak+A+Attr
  • stuoráp: stuorak+A+Comp+Sg+Nom

LEXICON RIHTSOK Lexicon for miscellanious uneven adjectives. No vowel changes. The lexicon gives no attribute, either because the adjective dosnºt have attr or because there is stemvowel change in attr that the lexicon canºt handle, or because there are strange atrributes that donºt fit to any other lexicon (these attributes are hardcoded). Sg Acc: rihtsogav (attr for words in this lexicon are hardcoded). Consonant gradation.

måjdås # Odd-syllable test examples:

  • måjdås: måjdås+A+Sg+Nom
  • måjddåsav: måjdås+A+Sg+Acc
  • måjddåsabbo: måjdås+A+Comp+Sg+Nom

LEXICON SJÆVNNJAT Lexicon for miscellanious uneven adjectives on -k, -t, -l, -s. The last vowel changes, e>á, e>å, o>u, e>i. The lexicon gives no attribute, either because the adjective dosnºt have attr, because there is stemvowel change in attr that the lexicon canºt handle or because there are strange atrributes that donºt fit to any other lexicon (these attributes are hardcoded). Sg Acc: sjævnnjadav.

rávdes # Odd-syllable test examples:

  • rávdes: rávdes+A+Sg+Nom
  • rávddásav: rávdes+A+Sg+Acc
  • rávddásabbo: rávdes+A+Comp+Sg+Nom

LEXICON SUOHKAT Adjectives on -at and -åt, with attribute III -is. No vowel change. As f. in Spiik. Sg Acc: suohkadav, attr: suohkkis,

rávvat # Odd-syllable test examples:

  • rávvat: rávvat+A+Sg+Nom
  • rávvadav: rávvat+A+Sg+Acc
  • rávvis: rávvat+A+Attr
  • rávvadabbo: rávvat+A+Comp+Sg+Nom

LEXICON LÅSSÅT As f3. in Spiik. So far the only word i this lexicon i "låssåt", because both låssis and låsså are attr and comparative is both låsep(hybrid?) and låssådabbo.

låssåt # Odd-syllable test examples:

  • låssåt: låssåt+A+Sg+Nom
  • låssådav: låssåt+A+Sg+Acc
  • låssis: låssåt+A+Attr
  • låsså: låssåt+A+Attr
  • låssådabbo: låssåt+A+Comp+Sg+Nom
  • låsep: låssåt+A+Comp+Sg+Nom

LEXICON TSIBTSA Adjectives that end on -a. Attribute is same as pred. Sg Acc: tsibttsagav, Attr: tsibtsa. Consonant gradation.

bihtja # Odd-syllable test examples:

  • bihtja: bihtja+A+Sg+Nom
  • bihttjagav: bihtja+A+Sg+Acc
  • bihtja: bihtja+A+Attr
  • bihttjagabbo: bihtja+A+Comp+Sg+Nom

LEXICON TJALMMIS Adjectives ending on -is. With two attributes; same as pred and -isis. As h. in Spiik. Sg Acc: tjalmmisav, Attr: tjalmmis (Spiik og Kintel).and tjalmmisis (Korhonen).

mujttis # Odd-syllable test examples:

  • mujttis: mujttis+A+Sg+Nom
  • mujttisav: mujttis+A+Sg+Acc
  • mujttis: mujttis+A+Attr
  • mujttisis: mujttis+A+Attr
  • mujttisabbo: mujttis+A+Comp+Sg+Nom

LEXICON IENNILS no comparatives, no attr

ieŋŋils # Odd-syllable test examples:

  • ieŋŋils: ieŋŋils+A+Sg+Nom
  • ieŋŋilsav: ieŋŋils+A+Sg+Acc

Inherent comparatives and superlatives

LEXICON NUORTTALABBO Inherent comparatives, gives both comp and superl. Most of the words are the compared forms of -el(a) words, like nuorttal, lullel.

guddnelabbo # Even-syllable test examples:

  • guddnelabbo: guddnelabbo+A+Comp+Sg+Nom
  • guddnelamos: guddnelabbo+A+Superl+Sg+Nom
  • guddnelap: guddnelabbo+A+Comp+Attr
  • guddnelup: guddnelabbo+A+Comp+Attr

LEXICON GASSKALAMOS Inherent superlatives, gives onlys superl. Words that are lexicalized in their superlative forms.

ájtodamos # Even-syllable test examples:

  • ájtodamos: ájtodamos+A+Superl+Sg+Nom

Contracted stems

LEXICON SÁDNES Attr same as pred. Sg Acc: sáddnáv, Attr: sádnes.

hávres # Contracted test examples:

  • hávres: hávres+A+Sg+Nom
  • hávrráv: hávres+A+Sg+Acc
  • hávres: hávres+A+Attr
  • hávrráp: hávres+A+Comp+Sg+Nom

LEXICON GOAVSOS Attr same as pred. Sg Acc: goaksuv, Attr: goavsos.(goavsos is so far the only word in this lexicon)

goavsos # Contracted test examples:

  • goavsos: goavsos+A+Sg+Nom
  • goaksuv: goavsos+A+Sg+Acc
  • goaksusav: goavsos+A+Sg+Acc (Eng. # From lexicon TJIEGOS)
  • goavsos: goavsos+A+Attr
  • goaksup: goavsos+A+Comp+Sg+Nom
  • goaksusabbo: goavsos+A+Comp+Sg+Nom (Eng. # from lexicon TJIEGOS)

LEXICON SUVRES Sg Acc: suvrráv, Attr: suvra.

suvres # Contracted test examples:

  • suvres: suvres+A+Sg+Nom
  • suvrráv: suvres+A+Sg+Acc
  • suvrrásav: suvres+A+Sg+Acc (Eng. # From lexicon SJÆVNNJAT)
  • suvra: suvres+A+Attr
  • suvrráp: suvres+A+Comp+Sg+Nom
  • suvrrásabbo: suvres+A+Comp+Sg+Nom (Eng. # from LINES)

Comparation

LEXICON BU/MUS comparison for even-syll adjectives. Also derivates diminutive and adverbs from the comparisions.

LEXICON ABBO/AMOS comparison for odd-syll adjectives. Also derivates diminutive and adverbs from the comparisions.

LEXICON BUStem Comparative even-syll, case and attr.

LEXICON ABBO Comparative odd-syll, get case and attr. With the dialect differences "-ubbo" and "-æbbo".

Superlative

LEXICON BUOREMUS Superlative even-syll, get attr and nom case.

LEXICON AMOS Superlative odd-syll, get case and attr. With the dialect differences "-umos" and "-æmos".

Comparative and Superlative sub-lexica

LEXICON CompSup-EVEN

LEXICON CompSup-EVENWEAKSTEM

LEXICON ATTR Sends attributes to

LEXICON ATTR_PrsPrc Attr without -vuohta derivation.

Derivation of adjectives

LEXICON DenominalAdjsV1 ! even noun stems are sent here

LEXICON DenominalAdjsV1_1 ! even noun stems without grade alternation are sent here

LEXICON DenominalAdjsV2 ! even noun stems are sent here. -asj derivation

LEXICON DenominalAdjsKINO ! unassimilated nouns are sent here

LEXICON DenominalAdjsODD ! gives derivation -ahtes

LEXICON DenominalAdjsContr

Derivations to adjectives

LEXICON AHTES ! odd syllable

LEXICON DIBME ! even and contracted

LEXICON LIS ! Handlernomen på -is?

LEXICON LÁGÁSJ

LEXICON AGAdj ! denominal derivations go here

LEXICON Ahkásasj ! lexicalized and denominal -asj derivations

LEXICON STÁVVAL ! OK& Kintel: stávval attr stávvalis, also as second compound

Sublexica for Noun

Even-syllable stems

2syll stems

LEXICON MUORRA Standard even stems with cg (note Q1). OBS: Nouns with invisible 3>2 cg (as busºsa) go to this lexicon.

#Even-syllable test examples:

  • kártta: kártta+N+Sg+Nom
  • kártajn: kártta+N+Sg+Com

LEXICON ALMME Same as MUORRA, but with special -LASJ derivation. For noun that have strong grade -lasj. "Almmelasj" instead of "almálasj".

LEXICON NOADE Even stem without cg. OBS: No nouns with invisible 3>2 cg (as busºsa) in this lexicon. OBS: Because of denominal nouns taking a weak grade stem, entries in grade 3 are given the gradation mark º in order to prevent alternation to weak grade. We should consider creating a separate denominal nouns lexicon for NOADE instead.

låda # Even-syllable test examples:

  • låda: låda+N+Sg+Nom
  • lådas: låda+N+Sg+Ela

LEXICON LINNJA Only for the loan word "linnja". Because it's a loan word, the "nnj" is pronounced "nn-j", and therefore does not behave as the regular lule sami "nj" sound and therefore it doesn't follow the rule that makes a: á in 1. grade with short vowel in first syllable (It isn't as linnja-linjáv or birás-birrasav). This word is therefore sub taged. Norwegian/Swedish words with a short "i" followed by two different consonants are assimilated to lule sami in different manners accoring to the consonants in question, but the word is always on grade III (Morén-Duolljá 2014). The correctly assimilated form linºnja is added to NOADE lexicon in stems file.

linja # Even-syllable test examples:

  • linnja: linnja+N+Sg+Nom

LEXICON SÁMEGIEL Compounds on -giella, with short -giel as middle compound (sámegielåhpadiddje)

rievsakgiella # Even-syllable test examples:

  • rievsakgiella: rievsakgiella+N+Sg+Nom

LEXICON AHKA Words like tjerastahka, with short compound form

báládahka # Even-syllable test examples:

  • báládahka: báládahka+N+Sg+Nom
  • báládak: báládahka+N+Sg+Nom
  • báládagáv: báládahka+N+Sg+Acc

LEXICON DARRHA Only for "darrha" or compounds that end on "darrha".

báktedarrha # Even-syllable test examples:

  • báktedarrha: báktedarrha+N+Sg+Nom
  • báktedarháv: báktedarrha+N+Sg+Acc

Nouns with comparatives

LEXICON GÁDDE 2 syllable stems with cg (note Q1) with comparatives

boassjo # Even-syllable test examples:

  • boassjo: boassjo+N+Sg+Nom
  • båssjujn: boassjo+N+Sg+Com
  • boassjojn: boassjo+N+Sg+Com
  • boassjop: boassjo+N+Der/A+A+Comp+Sg+Nom

LEXICON ÅLGGO Like MUORRA, but with comparatives. This lexicon was previously without sg ill/ine/elat, but these nouns can be conjugated for regular location cases. However, "adverbs" like ålggot (from outside), nuorttan (at north), oarjas (to south), etc., are more commonly used to denote location/direction (should therefore maybe consider subing the regular location case forms).

lulle # Even-syllable test examples:

  • lulle: lulle+N+Sg+Nom
  • lulev: lulle+N+Sg+Acc
  • lulep: lulle+N+Der/A+A+Comp+Sg+Nom

LEXICON MIEHTE Like MUORRA but no locative/elative/illative sg. Presently no words in this lexica except for err subed nuortto

nuortto # Even-syllable test examples:

  • nuortto: nuortto+N+Sg+Nom
  • nuortov: nuortto+N+Sg+Acc

Plural stems

LEXICON BÅVSÅ Like MUORRA, only in plural. All, except ganta, juvdá and ávta, have regular, singular stem counterparts.

båvså # Even-syllable test examples:

  • båvså: båvså+N+Pl+Nom
  • båvsåjt: båvså+N+Pl+Acc

Partially assimilated loanwords. The first part of the word is "citation borrowed" and keeps its norwegian/swedish orthography, only the last two syllables are adapted to sami.

LEXICON MUORRA_LOAN For loan words that do not fit in a loan word lexicon because of wrong short cmp, or partially assimilated loanwords without separate lexicas (medállja). This lexicon gives no short compound forms. Potential short cmps must therefore be hard coded into the FirstComponent lexicon. This also for compounded words with partially assimilated loan words. Examples of problem words: sirup>siráhppa og stetoskop>stetoskoahppa.

LEXICON ADVÆRBBA_CMP_INFL Recent loanwords on -ærbba with long and short compound-form

detransitijvvaværbba # Even-syllable test examples:

  • detransitijvvaværbba: detransitijvvaværbba+N+Sg+Nom
  • detransitijvvavärbba: detransitijvvaværbba+N+Sg+Nom
  • detransitijvvaverbas: detransitijvvaværbba+N+Sg+Ela

LEXICON KAFIEDJA_CMP_INFL Recent loanwords on -edja. Ends on -é in norwegian. Short and long cmp. "Kafea" and "kaféa" are subtaged. See comments about the -ie/-e dialtags in ALFABIEHTTA.

LEXICON DEMAGÅVGGÅ_CMP_INFL Recent loanwords ending on -og with long and short compound form. Assimilated to smj as -åvggå. The old stadarization -oga that does not follow lulesami rules is sub taged.

pedagoga # Even-syllable test examples:

  • pedagåvggå: pedagåvggå+N+Sg+Nom
  • pedagåvggå: pedagåvggå+N+Sg+Nom
  • pedagåvgås: pedagåvggå+N+Sg+Ela

LEXICON A_CMP_INFL Sub-forms. Lexicon for giving sub-variation conjugation by simply adding an -a to the norwegian/swedish word. No cg. Like "alkohola" and "agronoma". These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

LEXICON ALLEGORIJJA_CMP_INFL Recent loanwords ending on -i in NOR/SWE, with long and short compound form. Standardized as-iddja (SWE) and -ijºja (NOR). Previously often assimilated as -ija (or just -ia), but both forms are ungrammatical: Short vowels cannot preceed and follow a single intervocalic consonant. -ija is thus ungrammatical as the short a would be lenghtened to á, like "idja-ijá".

partijja # Even-syllable test examples:

  • partiddja: partijja+N+Sg+Nom
  • partijja: partijja+N+Sg+Nom
  • partidjas: partijja+N+Sg+Ela
  • partijjas: partijja+N+Sg+Ela

LEXICON APOTIEHKKA_CMP_INFL Recent loanwords on -iehkka in NOR, -æhkka in SWE. -ehkka as sub. With long and short compound-form on -k. See comments about the -ie/-e dialtags in ALFABIEHTTA.

kartotiehkka # Even-syllable test examples:

  • kartotæhkka: kartotiehkka+N+Sg+Nom
  • kartotähkka: kartotiehkka+N+Sg+Nom
  • kartotehkas: kartotiehkka+N+Sg+Ela
  • kartotehkas: kartotiehkka+N+Sg+Ela
  • kartotiehkka: kartotiehkka+N+Sg+Nom
  • kartotiehkas: kartotiehkka+N+Sg+Ela

LEXICON ANTIHKKA_CMP_INFL Recent loanwords on -hkka in Norway, both -ijkka and -hkka are used in Sweden (Antik vs antikk). With long and short compound-form on -kk/-k. The swedish forms were earlier added to stems for the Swedish version, but now added here.

dialektihkka # Even-syllable test examples:

  • dialektihkka: dialektihkka+N+Sg+Nom
  • dialektihkas: dialektihkka+N+Sg+Ela
  • dialektijkka: dialektihkka+N+Sg+Nom
  • dialektijkas: dialektihkka+N+Sg+Ela

LEXICON AREÁLLA_CMP_INFL Recent loanwords on -álla with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

gasskavokálla # Even-syllable test examples:

  • gasskavokálla: gasskavokálla+N+Sg+Nom
  • gasskavokálas: gasskavokálla+N+Sg+Ela

LEXICON TABÆLLA_CMP_INFL Recent loanwords on -älºla with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

flotælla # Even-syllable test examples:

  • flotælla: flotælla+N+Sg+Nom
  • flotälla: flotælla+N+Sg+Nom
  • flotellas: flotælla+N+Sg+Ela

LEXICON TEKSTIJLLA_CMP_INFL Recent loanwords on -ijlla with long and short compound-form. . Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

tekstijlla # Even-syllable test examples:

  • tekstijlla: tekstijlla+N+Sg+Nom
  • tekstijlas: tekstijlla+N+Sg+Ela

LEXICON ASIJLLA_CMP_INFL Recent loanwords on -ijlla, from nor and swe words ending on -yl. With long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

asijlla # Even-syllable test examples:

  • asijlla: asijlla+N+Sg+Nom
  • asijlas: asijlla+N+Sg+Ela

LEXICON ALKOHÅVLLÅ_CMP_INFL Recent loanwords on -åvllå with long and short compound-form. The old stadarization form "alkohola" is sub taged. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

parabåvllå # Even-syllable test examples:

  • parabåvllå: parabåvllå+N+Sg+Nom
  • parabåvlås: parabåvllå+N+Sg+Ela

LEXICON TELEGRÁMMA_CMP_INFL Recent loanwords on -ámºma with long and short compound-form

grámma # Even-syllable test examples:

  • grámma: grámma+N+Sg+Nom
  • grámmas: grámma+N+Sg+Ela

LEXICON SYSTIEBMA_CMP_INFL Recent loanwords on -ebma/-iebma with long and short compound-form. -em in NOR and SWE. See comments about the -ie/-e dialtags in ALFABIEHTTA. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

vokalsystiebma # Even-syllable test examples:

  • vokalsystebma: vokalsystiebma+N+Sg+Nom
  • vokalsystiebma: vokalsystiebma+N+Sg+Nom
  • vokalsystiemas: vokalsystiebma+N+Sg+Ela
  • vokalsystemas: vokalsystiebma+N+Sg+Ela

LEXICON AGRONÅVMMÅ_CMP_INFL Recent loanwords on -åvmma with long and short compound-form. The old stadarization form -oma that does not follow lulesami rules is sub taged.

agronåvmmå # Even-syllable test examples:

  • agronåvmmå: agronåvmmå+N+Sg+Nom
  • agronåvmås: agronåvmmå+N+Sg+Ela

LEXICON FANATISSMA_CMP_INFL Recent loanwords on -ssma with long and short compound-form.

kabbalissma # Even-syllable test examples:

  • kabbalissma: kabbalissma+N+Sg+Nom
  • kabbalismas: kabbalissma+N+Sg+Ela

LEXICON ORGÁDNA_CMP_INFL Recent loanwords on -ádna with long and short compound-form

guovllopládna # Even-syllable test examples:

  • guovllopládna: guovllopládna+N+Sg+Nom
  • guovlloplánav: guovllopládna+N+Sg+Acc

LEXICON HYDROGIEDNA_CMP_INFL Recent loanwords on -iedna in NOR and -edna in SWE. Both long and short compound-form. Norwegian/swedish -en. The old standardization form -ena, without cg, is subtaged. See comments about the -ie/-e dialtags in ALFABIEHTTA.

LEXICON BENSIJNNA Recent loanwords on -ijnna with long and short compound-form

LEXICON BENSIJNNA_CMP_INFL Recent loanwords on -ijnna with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

tamburijnna # Even-syllable test examples:

  • tamburijnna: tamburijnna+N+Sg+Nom
  • tamburijnas: tamburijnna+N+Sg+Ela

LEXICON MASJIJNNA_CMP_INFL Recent loanwords on -sjijnna with long and short compound-form: -SKIN

bivtasmasjijnna # Even-syllable test examples:

  • bivtasmasjijnna: bivtasmasjijnna+N+Sg+Nom
  • bivtasmasjijnas: bivtasmasjijnna+N+Sg+Ela

LEXICON TELEFÅVNNÅ_CMP_INFL Recent loanwords on -åvnnå with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

persåvnnå # Even-syllable test examples:

  • persåvnnå: persåvnnå+N+Sg+Nom
  • persåvnås: persåvnnå+N+Sg+Ela

LEXICON INSTITUSJÅVNNÅ_CMP_INFL Recent loanwords on -sjåvnnå with long and short compound-form: -TION IN SWEDISH. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

populasjåvnnå # Even-syllable test examples:

  • populasjåvnnå: populasjåvnnå+N+Sg+Nom
  • populasjåvnås: populasjåvnnå+N+Sg+Ela

LEXICON MISJÅVNNÅ_CMP_INFL Recent loanwords on -sjåvnnå with long and short compound-form: -SSION IN SWEDISH. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

sesjåvnnå # Even-syllable test examples:

  • sesjåvnnå: sesjåvnnå+N+Sg+Nom
  • sesjåvnås: sesjåvnnå+N+Sg+Ela

LEXICON PENSJÅVNNÅ_CMP_INFL Recent loanwords on -sjåvnnå with long and short compound-form: -SION IN SWEDISH. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

suspensjåvnnå # Even-syllable test examples:

  • suspensjåvnnå: suspensjåvnnå+N+Sg+Nom
  • suspensjåvnås: suspensjåvnnå+N+Sg+Ela

partisihppa # Even-syllable test examples:

  • partisihppa: partisihppa+N+Sg+Nom
  • partisihpas: partisihppa+N+Sg+Ela

LEXICON SEMINÁRRA_CMP_INFL Recent loanwords on -árra with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

hektárra # Even-syllable test examples:

  • hektárra: hektárra+N+Sg+Nom
  • hektáras: hektárra+N+Sg+Ela

LEXICON FAKTÅVRRÅ_CMP_INFL Recent loanwords on -åvrrå with long and short compound-form.

oajvvekontåvrrå # Even-syllable test examples:

  • oajvvekontåvrrå: oajvvekontåvrrå+N+Sg+Nom
  • oajvvekontåvrås: oajvvekontåvrrå+N+Sg+Ela

LEXICON KULTUVRRA_CMP_INFL Recent loanwords on -vrra with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

muvrra # Even-syllable test examples:

  • muvrra: muvrra+N+Sg+Nom
  • muvrajn: muvrra+N+Sg+Com

LEXICON INTERIERRA_CMP_INFL Recent loanwords on -ør with long and short compound-form. Standarized by Giellagálldo 05.0514 as -erra. -ørra is subtaged

observaterra # Even-syllable test examples:

  • observaterra: observaterra+N+Sg+Nom
  • observateras: observaterra+N+Sg+Ela

LEXICON SUBSTÁNSSA_CMP_INFL Recent loanwords on -ánssa with long and short compound-form. Originally -ans in SWE and NOR. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

instánssa # Even-syllable test examples:

  • instánssa: instánssa+N+Sg+Nom
  • instánsas: instánssa+N+Sg+Ela

LEXICON VALÆNSSA_CMP_INFL Recent loanwords on -ænssa with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

intelligænssa # Even-syllable test examples:

  • intelligænssa: intelligænssa+N+Sg+Nom
  • intelligänssa: intelligænssa+N+Sg+Nom
  • intelligensav: intelligænssa+N+Sg+Acc

LEXICON PARADIJSSA_CMP_INFL Recent loanwords on -ijssa with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

servijssa # Even-syllable test examples:

  • servijssa: servijssa+N+Sg+Nom
  • servijsas: servijssa+N+Sg+Ela

LEXICON ADVOKÁHTTA_CMP_INFL Recent loanwords on -áhtta with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

klimáhtta # Even-syllable test examples:

  • klimáhtta: klimáhtta+N+Sg+Nom
  • klimáhtas: klimáhtta+N+Sg+Ela

LEXICON ALFABIEHTTA_CMP_INFL Recent loanwords originally on -et both in Norway and Sweden. Assimilation differences, however, create two lule sami forms: -iehtta in NOR and -æhtta in SWE. LONG -e is assimilated in different ways in Norway and Sweden: In Norway, it becomes -ie, and in Sweden -e. Tiedja/tedja, systiebma/systebma and so on. This is especially apparent in assimilated words with long e in third grade: E becomes æ in third grade so we get "universitæhtta" in SWE, but this is very strange to people on the norwegian side of the border as they want "universitiehtta". Both -ie and -e are dialtaged in lexicons HYDROGIEDNA, APOTIEHKKA, SYSTIEBMA, KAFÉ. Previously people often wrote -ehtta in Norway, but this is incorrect as e always becomes æ in grade three.

mobilitiehtta # Even-syllable test examples:

  • mobilitæhtta: mobilitiehtta+N+Sg+Nom
  • mobilitähtta: mobilitiehtta+N+Sg+Nom
  • mobilitiehtta: mobilitiehtta+N+Sg+Nom
  • mobilitehtav: mobilitiehtta+N+Sg+Acc
  • mobilitiehtav: mobilitiehtta+N+Sg+Acc

LEXICON INTERNÆHTTA_CMP_INFL Recent loanwords on -æhtta with long and short compound-form: -ET IN SWEDISH, -ETT in norwegian. Differs from ALFABIEHTTA because -ehtta isn't used in NOR.

intranæhtta # Even-syllable test examples:

  • intranæhtta: intranæhtta+N+Sg+Nom
  • intranähtta: intranæhtta+N+Sg+Nom
  • intranehtas: intranæhtta+N+Sg+Ela

LEXICON TABLÆHTTA_CMP_INFL Recent loanwords on -æhtta with long and short compound-form. -ETT in both norwegian and in swedish.

kvartæhtta # Even-syllable test examples:

  • kvartæhtta: kvartæhtta+N+Sg+Nom
  • kvartähtta: kvartæhtta+N+Sg+Nom
  • kvartehtas: kvartæhtta+N+Sg+Ela

LEXICON INSTITUHTTA_CMP_INFL Recent loanwords on -uhtta, with long and short compound-form on -utt(NOR)/-ut(SWE). The swedish -ut also gets uvtta, as ANTIHKKA-antijkka, but instituhtta is also used in sweden, so no Area/NO tag.

minuhtta # Even-syllable test examples:

  • minuhtta: minuhtta+N+Sg+Nom
  • minuvtta: minuhtta+N+Sg+Nom
  • minuhtas: minuhtta+N+Sg+Ela
  • minuvtas: minuhtta+N+Sg+Ela

LEXICON SATELIHTTA_CMP_INFL Recent loanwords on -ihtta, with long and short compound-form on -itt(NOR)/-it(SWE). The swedish -it also gets ijtta, as ANTIHKKA-antijkka, but satelihtta is also used in sweden, so no Area/NO tag.

inuihtta # Even-syllable test examples:

  • inuihtta: inuihtta+N+Sg+Nom
  • inuijtta: inuihtta+N+Sg+Nom
  • inuihtas: inuihtta+N+Sg+Ela
  • inuijtas: inuihtta+N+Sg+Ela

LEXICON PATÆNNTA_CMP_INFL Recent loanwords on -ænnta with long and short compound-form. The -ennta form (used in "Ådå testamennta") is taged as sub (e always becomes æ in grade three).

patænnta # Even-syllable test examples:

  • patænnta: patænnta+N+Sg+Nom
  • patännta: patænnta+N+Sg+Nom
  • patentas: patænnta+N+Sg+Ela

LEXICON VARIÁNNTA_CMP_INFL Recent loanwords on -ánnta with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

praktikánnta # Even-syllable test examples:

  • praktikánnta: praktikánnta+N+Sg+Nom
  • praktikántas: praktikánnta+N+Sg+Ela

LEXICON KOLLÆKTA_CMP_INFL Recent loanwords on -ækta with long and short compound-form

subjækta # Even-syllable test examples:

  • subjækta: subjækta+N+Sg+Nom
  • subjäkta: subjækta+N+Sg+Nom
  • subjevtas: subjækta+N+Sg+Ela

LEXICON TURISSTA_CMP_INFL Recent loanwords on -ssta with long and short compound-form. Frequently typos that does not follow lulesami rules are sub taged; These forms goes against the standarization rule, but are found because of earlier standarizations rules and dictionaries.

journalissta # Even-syllable test examples:

  • journalissta: journalissta+N+Sg+Nom
  • journalistas: journalissta+N+Sg+Ela

LEXICON ADJEKTIJVVA_CMP_INFL Recent loanwords on -ijvva with long and short compound-form

datijvva # Even-syllable test examples:

  • datijvva: datijvva+N+Sg+Nom
  • datijvas: datijvva+N+Sg+Ela

Loanwords becoming odd-syll

LEXICON PRIEMIJ_CMP_INFL Assimilated loanwords. on -ie/-y, like premie and bandy. Become odd syllable loan words with cg, like "riebij". Nom: premij, gen prebmiha. Long and short essive.

priemij # Even-syllable test examples:

  • priemij: priemij+N+Sg+Nom
  • premij: priemij+N+Sg+Nom
  • priebmihis: priemij+N+Sg+Ela
  • prebmihis: priemij+N+Sg+Ela
  • priemin: priemij+N+Ess
  • premin: priemij+N+Ess
  • priebmihin: priemij+N+Ess
  • prebmihin: priemij+N+Ess

Badly assimilated loanwords

LEXICON NOADE_BADASS 2 syll stems without cg. Badly or wrongly assimilated words, ie. assimilated in a way that isn't lulesami. (Same as NOADE) Most of the words are Err/Orth tagged with a standardized lemma. Some are Err/Lex tagged, 5.9.2019: EJP/SNM: fjerna +Use/-Spell - sjølv om vi ikkje likar orda, så vil vi sjå til at dei blir skrive rett etter smj-ortografien! Dei fleste orda er uansett merka med +Err/Orth : )

balláda # Even-syllable test examples:

  • balláda balláda+N+Sg+Nom
  • balládas balláda+N+Sg+Ela

LEXICON BOAKSA_BADASS_CMP_INFL This is an Err/Sub, and all the words pointing to lexicon BOAKSA are therfore Err/Sub taged. Only for words "boaksa-båksav". Boaksa-båksa is assimilated wrongly, must be boaksa-båvsa if it's to follow lulesami rules.

LEXICON C_ILL_IJ_BADASS Badly or wrongly assimilated words. Last letter is consonant, no cg, no vowchange, with illative -ij. (Same as GAHPER) Assimilated in a way that isn't lulesami. Most of the words are Err/Orth tagged with a standardized lemma. Some are Err/Lex tagged, and some only recieve the +Use/-Spell tag from the lexicon.

sentimehter # Odd-syllable test examples:

  • sentimehter sentimehter+N+Sg+Nom
  • sentimehteris sentimehter+N+Sg+Ela
  • sentimehterij sentimehter+N+Sg+Ill

LEXICON C_ILL_AJ_BADASS Badly or wrongly assimilated words. Last letter in consonant, no cg, no vowchange, with illative -aj. Should have been assimilated to even-syll, but are used as odd-syll, and mostly just assimilated with changing to letter á. So almosed same as CELSIUS_UNASS.

kálsium # Odd-syllable test examples:

  • kálsium kálsium+N+Sg+Nom
  • kálsiumas kálsium+N+Sg+Ela
  • kálsiumaj kálsium+N+Sg+Ill

Unassimilated loanwords

LEXICON KINO_UNASS_CMP_INFL V-final unassimilated loanwords. Not lulesami. No diacritics whatsoever. Words that aren't assimilated at all. Really just norwegian words with a kind of sami inflection. Get even syllable case marking. Are part of the spell checker.

netto # Even-syllable test examples:

  • netto: netto+N+Sg+Nom
  • nettos: netto+N+Sg+Ela

LEXICON C_ILL_IJ_UNASS C-final unassimilated loanwords, gives illative- ij. Not lulesami. No diacritics whatsoever. Really just foreign words with a kind of sami inflection. Odd syllable case marking (like GAHPER). Are part of the spell checker.

sirkus # Even-syllable test examples:

  • sirkus: sirkus+N+Sg+Nom
  • sirkusij: sirkus+N+Sg+Ill
  • sirkusis: sirkus+N+Sg+Ela

LEXICON C_ILL_AJ_UNASS C-final unassimilated loanwords, gives illativ -aj. Also odd-syll words ending on letter i, as selleri. Not lulesami. No diacritics whatsoever. Really just norwegian words with a kind of sami inflection. Case marking like standard even 4 syllable stems (see proper nouns file on the case marking of foreign words with stressed last syllable). Are part of the spell checker.

aids # Even-syllable test examples:

  • aids: aids+N+Sg+Nom
  • aidsaj: aids+N+Sg+Ill
  • aidsas: aids+N+Sg+Ela
  • aidsan: aids+N+Ess
  • aidsadak: aids+N+Abe
  • aidsada: aids+N+Abe
  • aidsasj: aids+N+Der/Dimin+N+Sg+Nom

+Der4+Der/ahtes: e»g AHTES ; Only for odd-syllble stems

4syll stems

LEXICON GÅNÅGIS Standard C-final 4-syllabic stems

rahtjamus # Even-syllable test examples:

  • rahtjamus: rahtjamus+N+Sg+Nom
  • rahtjamussaj: rahtjamus+N+Sg+Ill
  • rahtjamusás: rahtjamus+N+Sg+Ela

LEXICON BERULASJ For words ending on -asj. Same as GÅNÅGIS but with strong essive and illative -adjtan and -adtjaj subtaged, same with PX "-adjtam". These forms are barely used today.

LEXICON BEDNAGASJ Like BERULASJ, but for derived nouns in diminutive. No cg, no vowchange, no short Ess. Has only one dimin derivation since these words already are dimin, ie. no double dim as for GAHPER. No abessive, not totally sure about this, I think we must use postposition dagi when it's diminutive,

stiebil # Odd-syllable test examples:

  • bednagasj: bednagasj+N+Sg+Nom
  • bednagattjas: bednagasj+N+Sg+Ela

LEXICON JIHPELIJ gen: jihpelahá

gehtsulij # Even-syllable test examples:

  • gehtsulij: gehtsulij+N+Sg+Nom
  • gehtsulaháv: gehtsulij+N+Sg+Acc

LEXICON OARJJILIJ gen: oarjjilihá

allilij # Even-syllable test examples:

  • allilij: allilij+N+Sg+Nom
  • allilihás: allilij+N+Sg+Ela

LEXICON VIESSOMUJ gen: viessumuhá

bårråmuj # Even-syllable test examples:

  • bårråmuj: bårråmuj+N+Sg+Nom
  • bårråmuháj: bårråmuj+N+Sg+Ill

4 syllable plurals

LEXICON OADÁDAGÁ Plural forms of words like tjerastahka with short compound-form

látjádagá # Even-syllable test examples:

  • látjádagá: látjádagá+N+Pl+Nom
  • látjádagájs: látjádagá+N+Pl+Ela

LEXICON BERRAHATTJA Plural stems. Like IEDNITJA, these do not have corresponding singular stems. Most stems here have the same form as the pl nom form of diminutive derivations, but (while it may have originated as a diminuitive derivation) it is not the same derivation (today) and it does not have a singular form.

gahpanisá # Even-syllable test examples:

  • gahpanisá: gahpanisá+N+Pl+Nom
  • gahpanisájda: gahpanisá+N+Pl+Ill
  • gahpanisájs: gahpanisá+N+Pl+Ela

LEXICON SIJDDALAHÁ Plurals

lullelahá # Even-syllable test examples:

  • lullelahá: lullelahá+N+Pl+Nom
  • lullelahájt: lullelahá+N+Pl+Acc

LEXICON SISSNELUHÁ plurals. presently only for sissŋeluhá

sissŋeluhá # Even-syllable test examples:

  • sissŋeluhá: sissŋeluhá+N+Pl+Nom
  • sissŋeluhájda: sissŋeluhá+N+Pl+Ill

Adjectival sublexicas. Give 4 syll adjectives inflection

LEXICON SURGULASJ-EVEN

Compound lexicas

Odd-syllable stems

without cg

LEXICON GAHPER Odd-syllable C-final noun without cg, no vowchange, no short Ess. Spiik A3

stiebil # Odd-syllable test examples:

  • stiebil: stiebil+N+Sg+Nom
  • stiebilis: stiebil+N+Sg+Ela

with cg

LEXICON ÅRES Odd-syllable C-final noun with CG, 2ndsyll vowchange. Long and short essive. Spiik A1

sjattos # Odd-syllable test examples:

  • sjattos: sjattos+N+Sg+Nom
  • sjaddusis: sjattos+N+Sg+Ela
  • sjaddusin: sjattos+N+Ess
  • sjatton: sjattos+N+Ess

LEXICON SÅHKÅR Odd-syllable C-final noun with CG and 2ndsyll vowelchange. Has only long essive. Spiik 2b

spiger # Odd-syllable test examples:

  • spiger: spiger+N+Sg+Nom
  • spihkáris: spiger+N+Sg+Ela
  • spihkárin: spiger+N+Ess

LEXICON GÁMAS Odd-syllable C-final noun with CG, no 2ndsyll vowchange (OBS: a does not change). Long and short essive. Spiik A2

sjábtjas # Odd-syllable test examples:

  • sjábtjas: sjábtjas+N+Sg+Nom
  • sjábttjasis: sjábtjas+N+Sg+Ela

LEXICON BENA Odd-syllable V-final noun with cg, no vowchange. Long and short essive. Spiik 2a

galma # Odd-syllable test examples:

  • galma: galma+N+Sg+Nom
  • galmmagis: galma+N+Sg+Ela

Irregular stems

LEXICON SÁGE gen: sáhkaha. Presently only for "ságe". Long and short essive.

ságe # Odd-syllable test examples:

  • ságe: ságe+N+Sg+Nom
  • sáhkahav: ságe+N+Sg+Acc

LEXICON BAVSEV Ends on -v and last vowel changes to i: bavsev: baksIma. Not like gierkav gierkkAma and birev birEma.

sievtev # Odd-syllable test examples:

  • sievtev: sievtev+N+Sg+Nom
  • siektimis: sievtev+N+Sg+Ela

LEXICON RÁBEV rábev: ráhpuga. Presently only for "rábev".

rábev # Odd-syllable test examples:

  • rábev: rábev+N+Sg+Nom
  • ráhpugis: rábev+N+Sg+Ela

LEXICON RITJAS ! Like GÁMAS but without stem a-lengthening for grade I (underlying long -i-). presently only for "ritjas".

ritjas # Odd-syllable test examples:

  • ritjas: ritjas+N+Sg+Nom
  • rihtjasis: ritjas+N+Sg+Ela

LEXICON SÅGAS gen: sågaska. Presently only for "sågas".

sågas # Odd-syllable test examples:

  • sågas: sågas+N+Sg+Nom
  • sågaskav: sågas+N+Sg+Acc

LEXICON SJUVÁJ Presently only for "sjuváj". sjuváj-sjuvvaga. Only this word

sjuváj # Odd-syllable test examples:

  • sjuváj: sjuváj+N+Sg+Nom
  • sjuvvagis: sjuváj+N+Sg+Ela

LEXICON BØSOJ Because of bösoj in O.Korhonen, and bæsoj-bessuga. Only for these two words. J becomes g.

LEXICON GUOVSOJVUOJOJ vuojoj: vuodjom. Presently only for "guovsojvuojoj".

guovsojvuojoj # Odd-syllable test examples:

  • guovsojvuojoj: guovsojvuojoj+N+Sg+Nom
  • guovsojvuodjomav: guovsojvuojoj+N+Sg+Acc

LEXICON BUTJES butjes-buttjása. Presently only for "butjes". This is an sub. Korhonen has this form but if you look in Grundstöm it's buttjes-budtjasa. Must be a typo in Korhonen, because ttj-tj dosn't exist in smj. This form is err subed in stems file.

LEXICON TJÅLKES tjålkes: tjoalkkas- Presently only for "tjålkes and tsålkes". This must be wrong, and it dosn't exist in Grundström. Å in 1. syll isn't possible with e in 2. syll. Must be tjoalkes-tjoalkkása or tjålkas-tjoalkkasa. This form is err subed in stems file.

tsålkes # Odd-syllable test examples:

  • *tsålkes: tsålkes+N+Sg+Nom (is not standard language)
  • *tsoalkkasav: tsålkes+N+Sg+Acc (is not standard language)

LEXICON VÁJES vájes: vádjas- Presently only for "báhkovájes". It's a sub: 2. syll e doesn't become a. Must be vájes-vádjása or vájas-vádjasa. The second is used in NT, so I belive thats the right one. This form is err subed in stems file.

Derived stems

LEXICON BADJEL Derived nouns with acc -elav, ill -elij, elat -elas, etc. These were previously categorized as adpositions and adverbs, but according to Bruce Morén-Duolljá (2014) they are actually case forms of nouns derived from certain location nouns. Derived from even strong stems (badje -> badjel). Odd syllable inflection, but only singular nominative-elative (not clear if they take comitative and essive case). With comparatives. No Px.

allel # Odd-syllable test examples:

  • allel: allel+N+Sg+Nom
  • allelis: allel+N+Sg+Ela
  • allelabbo: allel+N+Der/A+A+Comp+Sg+Nom

LEXICON BÁRNEP bárnep: bárnebu-. Comparisation of nouns. No -ahtá abesive.

iednep # Odd-syllable test examples:

  • iednep: iednep+N+Sg+Nom
  • iednebuv: iednep+N+Sg+Acc

LEXICON OAPPÁSJ Like GAHPER, but for derived nouns in diminutive, have an underived form. Doesn't get abesive -ahtá or -ahtes derivation. Oddsyll, no cg, no vowchange, no short Ess. Has only one dimin derivation since these words already are dimin, ie. not double dim as in GAHPER.

stiebil # Odd-syllable test examples:

  • oappásj: oappásj+N+Sg+Nom
  • oappátjis: oappásj+N+Sg+Ela

LEXICON FIERUN Like GAHPER, but instruments derived from verbs. Fierrot>fierun. No short essive.

stiebil # Odd-syllable test examples:

  • fierun: fierun+N+Sg+Nom
  • fierunis: fierun+N+Sg+Ela

LEXICON GUOLLÁR Like GAHPER, but actor derived from contracted verbs (ACTOR for evensyll verbs). Guollit>guollár. No short essive.

Plural odd-syll

LEXICON DÁRBBAGA Like BENA, but plural. Presently only for "dárbbaga", has singular stem counterpart.

dárbbaga # Odd-syllable test examples:

  • dárbbaga: dárbbaga+N+Pl+Nom
  • dárbbagijt: dárbbaga+N+Pl+Acc

LEXICON BÆLLJASA Like GÁMAS, but plural. These have corresponding singular stems.

jiednabælljasa # Odd-syllable test examples:

  • jiednabælljasa: jiednabælljasa+N+Pl+Nom
  • jiednabälljasa: jiednabælljasa+N+Pl+Nom
  • jiednabælljasijt: jiednabælljasa+N+Pl+Acc
  • jiednabälljasijt: jiednabælljasa+N+Pl+Acc

LEXICON IEDNITJA Odd syllable pluralforms only. These do not have a singular form.

jáhkoguojmitja # Odd-syllable test examples:

  • jáhkoguojmitja: jáhkoguojmitja+N+Pl+Nom
  • jáhkoguojmitjijt: jáhkoguojmitja+N+Pl+Acc

LEXICON SNJIERÁGA Odd syllable pluralforms only. These have corresponding singular stems.

guovlloådåsa # Odd-syllable test examples:

  • guovlloådåsa: guovlloådåsa+N+Pl+Nom
  • guovlloådåsijt: guovlloådåsa+N+Pl+Acc

LEXICON MANEBU oddsyllable plural only. presently only for "maŋebu".

maŋebu # Odd-syllable test examples:

  • maŋebu: maŋebu+N+Pl+Nom
  • maŋebujt: maŋebu+N+Pl+Acc

Contracted stems

LEXICON SUOLOJ C-final with cg II-III: ålmåj: ålmmå

njurgoj # Contracted test examples:

  • njurgoj: njurgoj+N+Sg+Nom
  • njurgguv: njurgoj+N+Sg+Acc

LEXICON ÅLMÅJ_LOAN Same as SUOLOJ, only for loan words. Follows Ráhka/Mikkelsen's Bårjås 2014. C-final with cg II-III: ålmåj: ålmmå

bistroj # Contracted test examples:

  • bistroj: bistroj+N+Sg+Nom
  • bisstruv: bistroj+N+Sg+Acc

LEXICON GUOMOJ C-final with cg I-III: guomoj: guobbmu

ænoj # Contracted test examples:

  • ænoj: ænoj+N+Sg+Nom
  • æddnuv: ænoj+N+Sg+Acc
  • änoj: ænoj+N+Sg+Nom
  • äddnuv: ænoj+N+Sg+Acc

LEXICON SARVES C-final with cg II-III. sarves: sarvvá

moarmes # Contracted test examples:

  • moarmes: moarmes+N+Sg+Nom
  • moarmmáv: moarmes+N+Sg+Acc

LEXICON SVÁLES C-final with cg I-III. sváles: svállá (lºl)

sváles # Contracted test examples:

  • sváles: sváles+N+Sg+Nom
  • sválláv: sváles+N+Sg+Acc

LEXICON GÅHKES C-final with cg II-III with vowel harmony (a/á=å). gåhkes: gåhkkå. Presently only for "gåhkes".

gåhkes # Contracted test examples:

  • gåhkes: gåhkes+N+Sg+Nom
  • gåhkkåv: gåhkes+N+Sg+Acc

LEXICON SJUOKKAJ sjuokkaj: sjuoggá. Presently only for "sjuokkaj".

sjuokkaj # Contracted test examples:

  • sjuokkaj: sjuokkaj+N+Sg+Nom
  • sjuoggáv: sjuokkaj+N+Sg+Acc

LEXICON GISTÁ gistá: gisstá. Presently only for "gistá".

gistá # Contracted test examples:

  • gistá: gistá+N+Sg+Nom
  • gisstáv: gistá+N+Sg+Acc

Contracted stems sublexica

Px lexica

Sublexica for Verb

Table of content:

  • The auxiliaries
    • Negation verb
    • Copula
    • Modals
  • Main verbs
    • Even syllable stems
      • Intransitives
      • Transitives
    • Odd syllable stems
      • Intranstives
      • Transitives
    • Contracted stems
      • Intransitives
      • Transitives
    • Assimilated loan verbs
      • Intransitives
      • Transitives
    • Main inflectional categories
      • suffix lexicas
    • Verb derivation

IV means intransitive verbs, TV means transitive verbs.

Auxiliary verbs

Negation verb

LEXICON NEG

Copula

LEXICON ÅRROT

LEXICON LIEHKET

LEXICON LULU

Modals

LEXICON GALGGAT_IV even-syllable modal verbs.

soajttet # Eveb-syllable test examples:

  • soajtáv: soajttet+V+IV+Ind+Prs+Sg1
  • soajttiv: soajttet+V+IV+Ind+Prt+Sg1
  • såjtijma: soajttet+V+IV+Ind+Prt+Pl1

LEXICON VIERTTIT_IV Contracted modal verbs.

hæhttut # Eveb-syllable test examples:

  • hæhttut: hæhttut+V+IV+Inf
  • hähttut: hæhttut+V+IV+Inf
  • hæhttuv: hæhttut+V+IV+Ind+Prs+Sg1
  • hähttuv: hæhttut+V+IV+Ind+Prs+Sg1
  • hæhttujma: hæhttut+V+IV+Ind+Prt+Pl1
  • hähttujma: hæhttut+V+IV+Ind+Prt+Pl1

Ordinary main verbs

Even-syllable stems

Intransitives

LEXICON GALSSJOT_IV Impersonal o-verbs

hærmmot # Eveb-syllable test examples:

  • hærmmu: hærmmot+V+IV+Ind+Prs+Sg3
  • härmmu: hærmmot+V+IV+Ind+Prs+Sg3
  • hærmoj: hærmmot+V+IV+Ind+Prt+Sg3
  • härmoj: hærmmot+V+IV+Ind+Prt+Sg3

LEXICON BÅRSSJOT_IV o-verbs with

hæssot # Eveb-syllable test examples:

  • hæsov: hæssot+V+IV+Ind+Prs+Sg1
  • häsov: hæssot+V+IV+Ind+Prs+Sg1
  • hessuv: hæssot+V+IV+Ind+Prt+Sg1
  • hæsojma: hæssot+V+IV+Ind+Prt+Pl1
  • häsojma: hæssot+V+IV+Ind+Prt+Pl1

LEXICON VILSSJOT_IV o-verbs as BÅRSSJOT but without derivations -stit, -stallat, -stahtte, - stasste. With dim -astit that are hardcoded

libjjot # Eveb-syllable test examples:

  • libjov: libjjot+V+IV+Ind+Prs+Sg1
  • libjjuv: libjjot+V+IV+Ind+Prt+Sg1
  • libjojma: libjjot+V+IV+Ind+Prt+Pl1

LEXICON BUOLLET_IV e-verbs

liddet # Even-syllable test examples:

  • littáv: liddet+V+IV+Ind+Prs+Sg1
  • liddiv: liddet+V+IV+Ind+Prt+Sg1
  • littijma: liddet+V+IV+Ind+Prt+Pl1

LEXICON BOAHTET_IV e-verbs like BUOLLET_IV without passive

boahtet # Even-syllable test examples:

  • boadáv: boahtet+V+IV+Ind+Prs+Sg1
  • båhtiv: boahtet+V+IV+Ind+Prt+Sg1
  • bådijma: boahtet+V+IV+Ind+Prt+Pl1

LEXICON VIEDJET_IV e-verbs GRADE II-I WITH IE DIPHT.

biehket # Eveb-syllable test examples:

  • bægáv: biehket+V+IV+Ind+Prs+Sg1
  • bägáv: biehket+V+IV+Ind+Prs+Sg1
  • biehkiv: biehket+V+IV+Ind+Prt+Sg1
  • biegijma: biehket+V+IV+Ind+Prt+Pl1

LEXICON ASSTAT_IV only for asstat, no passive

asstat # Even-syllable test examples:

  • astav: asstat+V+IV+Ind+Prs+Sg1
  • asstiv: asstat+V+IV+Ind+Prt+Sg1
  • astajma: asstat+V+IV+Ind+Prt+Pl1

LEXICON RAVGGAT_IV a- and å-verbs only Sg3 passive.

bajássjaddat # Even-syllable test examples:

  • bajássjattav: bajássjaddat+V+IV+Ind+Prs+Sg1
  • bajássjaddiv: bajássjaddat+V+IV+Ind+Prt+Sg1
  • bajássjattajma: bajássjaddat+V+IV+Ind+Prt+Pl1

LEXICON BIEGGAT_IV Impersonals

dednjat # Even-syllable test examples:

  • dædnjá: dednjat+V+IV+Ind+Prs+Sg3
  • dädnjá: dednjat+V+IV+Ind+Prs+Sg3
  • denjaj: dednjat+V+IV+Ind+Prt+Sg3

LEXICON RAVGGALASSTET_IV Like RAVGGAT for already derived words (except words ending -uššat) - no actio as first part of compounds, but reintroduced

dehpudallat # Even-syllable test examples:

  • dehpudaláv: dehpudallat+V+IV+Ind+Prs+Sg1
  • dehpudalliv: dehpudallat+V+IV+Ind+Prt+Sg1
  • dehpudalájma: dehpudallat+V+IV+Ind+Prt+Pl1

LEXICON BIEKKASTALLAT_IV Already derived impersonals

dehpudallat # Even-syllable test examples:

  • duhpárasstá: duhpárasstet+V+IV+Ind+Prs+Sg3
  • duhpárastij: duhpárasstet+V+IV+Ind+Prt+Sg3

LEXICON GUOTTEDALLAT_IV passives on -allat - no actio as first part of compounds, but reintroduced

duolmudallat # Even-syllable test examples:

  • duolmudaláv: duolmudallat+V+IV+Ind+Prs+Sg1
  • duolmudalliv: duolmudallat+V+IV+Ind+Prt+Sg1
  • duolmudalájma: duolmudallat+V+IV+Ind+Prt+Pl1

LEXICON HIEBADUVVAT_IV passives on -uvvat - no actio as first part of compounds, but reintroduced

duostoduvvat # Even-syllable test examples:

  • duostoduváv: duostoduvvat+V+IV+Ind+Prs+Sg1
  • duostoduvviv: duostoduvvat+V+IV+Ind+Prt+Sg1
  • duostoduvájma: duostoduvvat+V+IV+Ind+Prt+Pl1

Transitives LEXICON MÁHTTET_TV verbs without personal passive

jáhkket # Even-syllable test examples:

  • jáhkáv: jáhkket+V+TV+Ind+Prs+Sg1
  • jáhkkiv: jáhkket+V+TV+Ind+Prt+Sg1
  • jáhkijma: jáhkket+V+TV+Ind+Prt+Pl1

LEXICON BASSAT_TV a- and å-verbs. Three passives

jåksåt # Even-syllable test examples:

  • jåvsåv: jåksåt+V+TV+Ind+Prs+Sg1
  • jåksiv: jåksåt+V+TV+Ind+Prt+Sg1
  • jåvsåjma: jåksåt+V+TV+Ind+Prt+Pl1

LEXICON BASSALASSTET_TV Like BASSAT for already derived words (except words ending -uššat) - no actio as first part of compounds, but reintroduced. Three passives

jårgudallat # Even-syllable test examples:

  • jårgudaláv: jårgudallat+V+TV+Ind+Prs+Sg1
  • jårgudalliv: jårgudallat+V+TV+Ind+Prt+Sg1
  • jårgudalájma: jårgudallat+V+TV+Ind+Prt+Pl1

LEXICON JUHKAT_TV a-verbs like BASSAT_TV but but without derivations -stit, -stallat, -stahtte, - stasste. Dim -istit that are hardcoded. Three passives

njammat # Even-syllable test examples:

  • njamáv: njammat+V+TV+Ind+Prs+Sg1
  • njammiv: njammat+V+TV+Ind+Prt+Sg1
  • njamájma: njammat+V+TV+Ind+Prt+Pl1

LEXICON LÁHPPET_TV e-verbs. Three passives

oajttet # Even-syllable test examples:

  • oajtáv: oajttet+V+TV+Ind+Prs+Sg1
  • oajttiv: oajttet+V+TV+Ind+Prt+Sg1
  • åjtijma: oajttet+V+TV+Ind+Prt+Pl1

LEXICON JIEHKET_TV e-verbs GRADE II-I WITH IE DIPHT. Three passives

sievvet # Even-syllable test examples:

  • sæváv: sievvet+V+TV+Ind+Prs+Sg1
  • säváv: sievvet+V+TV+Ind+Prs+Sg1
  • sievviv: sievvet+V+TV+Ind+Prt+Sg1
  • sievijma: sievvet+V+TV+Ind+Prt+Pl1

LEXICON DIEHTET_TV Only this one word, unusual diphtong behavior. No passive

diehtet # Even-syllable test examples:

  • diedáv: diehtet+V+TV+Ind+Prs+Sg1
  • diehtiv: diehtet+V+TV+Ind+Prt+Sg1
  • diedijma: diehtet+V+TV+Ind+Prt+Pl1

LEXICON GÁDJOT_TV o-verbs. only duvvat passive.

sjpædtjot # Even-syllable test examples:

  • sjpættjov: sjpædtjot+V+TV+Ind+Prs+Sg1
  • sjpättjov: sjpædtjot+V+TV+Ind+Prs+Sg1
  • sjpædtjuv: sjpædtjot+V+TV+Ind+Prt+Sg1
  • sjpädtjuv: sjpædtjot+V+TV+Ind+Prt+Sg1
  • sjpættjojma: sjpædtjot+V+TV+Ind+Prt+Pl1
  • sjpättjojma: sjpædtjot+V+TV+Ind+Prt+Pl1

LEXICON JÅRGGOT_TV o-verbs with dim -astit that are hardcoded. Duvvat and dallat passive.

boarkkot # Even-syllable test examples:

  • boarkov: boarkkot+V+TV+Ind+Prs+Sg1
  • boarkkuv: boarkkot+V+TV+Ind+Prt+Sg1
  • boarkojma: boarkkot+V+TV+Ind+Prt+Pl1

Odd-syllable stems

This is just awaiting a manual classification

LEXICON BIEKKASTIT_IV Impersonals, only Sg3

LEXICON JÅRGESTIT_IV At the moment IV, we may perhaps change IV/TV.

doalvestit # Odd-syllable test examples:

  • doalvestav: doalvestit+V+IV+Ind+Prs+Sg1
  • doalvestiv: doalvestit+V+IV+Ind+Prt+Sg1
  • doalvestijma: doalvestit+V+IV+Ind+Prt+Pl1

LEXICON BEGATJIT_IV Words ending -tjit, -jdit, reciprocals on -dit, momentatives on -dit, -edit, continuatives on -ldit, -nit, essives on -hit and 5-syllables - no actio cmps, but only Sg3 passivereintroduced

duojkkuhit # Odd-syllable test examples:

  • duojkkuhav: duojkkuhit+V+IV+Ind+Prs+Sg1
  • duojkkuhiv: duojkkuhit+V+IV+Ind+Prt+Sg1
  • duojkkuhijma: duojkkuhit+V+IV+Ind+Prt+Pl1

LEXICON BALÁDIT_IV continuatives on -dit, frequentatives on -odit, reciprocals, momentatives and frequentatives ending -alit - actio cpms, only Sg3 passive

lihtudit # Odd-syllable test examples:

  • lihtudav: lihtudit+V+IV+Ind+Prs+Sg1
  • lihtudiv: lihtudit+V+IV+Ind+Prt+Sg1
  • lihtudijma: lihtudit+V+IV+Ind+Prt+Pl1

LEXICON SUOGNALIT_IV Trisyllabic Verbs ending -lit. only Sg3 passive

loavkkalit # Odd-syllable test examples:

  • loavkkalav: loavkkalit+V+IV+Ind+Prs+Sg1
  • loavkkaliv: loavkkalit+V+IV+Ind+Prt+Sg1
  • loavkkalijma: loavkkalit+V+IV+Ind+Prt+Pl1

LEXICON LASSÁNIT_IV verbs ending -nit, -sit, no passive

rievddánit # Odd-syllable test examples:

  • rievddánav: rievddánit+V+IV+Ind+Prs+Sg1
  • rievddániv: rievddánit+V+IV+Ind+Prt+Sg1
  • rievddánijma: rievddánit+V+IV+Ind+Prt+Pl1

LEXICON BÁHTARIT_IV verbs ending -rit. only Sg3 passive

sjtávttjurit # Odd-syllable test examples:

  • sjtávttjurav: sjtávttjurit+V+IV+Ind+Prs+Sg1
  • sjtávttjuriv: sjtávttjurit+V+IV+Ind+Prt+Sg1
  • sjtávttjurijma: sjtávttjurit+V+IV+Ind+Prt+Pl1

LEXICON UNNEDIT_TV All -uvvat passives.

nuoledit # Odd-syllable test examples:

  • nuoledav: nuoledit+V+TV+Ind+Prs+Sg1
  • nuolediv: nuoledit+V+TV+Ind+Prt+Sg1
  • nuoledijma: nuoledit+V+TV+Ind+Prt+Pl1

LEXICON MUJTATJIT_TV Words ending -tjit, -jdit, reciprocals on -dit, momentatives on -dit, -edit, continuatives on -ldit, -nit, essives on -hit and 5-syllables - no actio cmps, but reintroduced. All -uvvat passives

nårddådit # Odd-syllable test examples:

  • nårddådav: nårddådit+V+TV+Ind+Prs+Sg1
  • nårddådiv: nårddådit+V+TV+Ind+Prt+Sg1
  • nårddådijma: nårddådit+V+TV+Ind+Prt+Pl1

LEXICON BÅNJÅDIT_TV continuatives on -dit, frequentatives on -odit, reciprocals, momentatives and frequentatives ending -alit - actio cpms. All -uvvat passives.

tsirggalit # Odd-syllable test examples:

  • tsirggalav: tsirggalit+V+TV+Ind+Prs+Sg1
  • tsirggaliv: tsirggalit+V+TV+Ind+Prt+Sg1
  • tsirggalijma: tsirggalit+V+TV+Ind+Prt+Pl1

LEXICON VUORDDELIT_TV Trisyllabic Verbs ending -lit. All -uvvat passives

tsåggålit # Odd-syllable test examples:

  • tsåggålav: tsåggålit+V+TV+Ind+Prs+Sg1
  • tsåggåliv: tsåggålit+V+TV+Ind+Prt+Sg1
  • tsåggålijma: tsåggålit+V+TV+Ind+Prt+Pl1

Contracted stems

LEXICON SJIERRIT_IV Impersonals

boavddit # Contracted test examples:

  • boavddi: boavddit+V+IV+Ind+Prs+Sg3
  • boavddij: boavddit+V+IV+Ind+Prt+Sg3

LEXICON BASSUT_IV Passives

buvvut # Contracted test examples:

  • buvvuv: buvvut+V+IV+Ind+Prs+Sg1
  • buvvujiv: buvvut+V+IV+Ind+Prt+Sg1
  • buvvujma: buvvut+V+IV+Ind+Prt+Pl1

LEXICON OADDÁT_IV Incoative, (doarrut,jåhttåt). Only Sg3 passive. Does not make nouns via -ár derivation.

bæhkkát # Contracted test examples:

  • bæhkkáv: bæhkkát+V+IV+Ind+Prs+Sg1
  • bähkkáv: bæhkkát+V+IV+Ind+Prs+Sg1
  • bæhkkájiv: bæhkkát+V+IV+Ind+Prt+Sg1
  • bähkkájiv: bæhkkát+V+IV+Ind+Prt+Sg1
  • bæhkkájma: bæhkkát+V+IV+Ind+Prt+Pl1
  • bähkkájma: bæhkkát+V+IV+Ind+Prt+Pl1

LEXICON TJUOLLÁT_TV Incoativ. All passive. Does not make nouns via -ár derivation, (gullát, bårråt)

gajkkát # Contracted test examples:

  • gajkkáv: gajkkát+V+TV+Ind+Prs+Sg1
  • gajkkájiv: gajkkát+V+TV+Ind+Prt+Sg1
  • gajkkájma: gajkkát+V+TV+Ind+Prt+Pl1

LEXICON DULLUT_IV Does not make nouns via -ár derivation. Only Sg3 passiv.

dussut # Contracted test examples:

  • dussuv: dussut+V+IV+Ind+Prs+Sg1
  • dussujiv: dussut+V+IV+Ind+Prt+Sg1
  • dussujma: dussut+V+IV+Ind+Prt+Pl1

LEXICON STRÁFFUT_TV Does not make nouns via -ár derivation. All duvvat passives.

gáhpput # Contracted test examples:

  • gáhppuv: gáhpput+V+TV+Ind+Prs+Sg1
  • gáhppujiv: gáhpput+V+TV+Ind+Prt+Sg1
  • gáhppujma: gáhpput+V+TV+Ind+Prt+Pl1

LEXICON TSIEGGIT_TV Makes nouns via -ár derivation. All duvvat passives. asdf

gámmpit # Contracted test examples:

  • gámmpiv: gámmpit+V+TV+Ind+Prs+Sg1
  • gámmpijiv: gámmpit+V+TV+Ind+Prt+Sg1
  • gámmpijma: gámmpit+V+TV+Ind+Prt+Pl1
  • gámmpár: gámmpit+V+TV+Der/r+N+Sg+Nom

LEXICON VALLIT_TV Makes nouns via -ár derivation. Gets only passive Sg3

hinnit # Contracted test examples:

  • hinniv: hinnit+V+TV+Ind+Prs+Sg1
  • hinnijiv: hinnit+V+TV+Ind+Prt+Sg1
  • hinnijma: hinnit+V+TV+Ind+Prt+Pl1
  • hinnár: hinnit+V+TV+Der/r+N+Sg+Nom

Assimilated loan verbs

LEXICON PÁNNTIT_TV Two syllable transitive NEW loan verbs. Makes nouns via -ár derivation. All passives.

hinnit # Contracted test examples:

  • bloaggiv: bloaggit+V+TV+Ind+Prs+Sg1
  • blåggiv: bloaggit+V+TV+Ind+Prs+Sg1
  • bloaggijiv: bloaggit+V+TV+Ind+Prt+Sg1
  • blåggijiv: bloaggit+V+TV+Ind+Prt+Sg1
  • bloaggijma: bloaggit+V+TV+Ind+Prt+Pl1
  • blåggijma: bloaggit+V+TV+Ind+Prt+Pl1
  • bloaggár: bloaggit+V+TV+Der/r+N+Sg+Nom
  • blåggår: bloaggit+V+TV+Der/r+N+Sg+Nom

LEXICON HÅŊŊLIT_IV Two syllable intransitive NEW loan verbs. Only Sg3 passiv.

håŋŋlit # Contracted test examples:

  • håŋŋliv: håŋŋlit+V+IV+Ind+Prs+Sg1
  • håŋŋlijiv: håŋŋlit+V+IV+Ind+Prt+Sg1
  • håŋŋlijma: håŋŋlit+V+IV+Ind+Prt+Pl1

LEXICON TV_BADASS NEW badly assimilated two syllable transitive loan verbs. Makes nouns via -ár derivation. All passives.

LEXICON IV_BADASS NEW badly assimilated two syllable intransitive loan verbs. Makes nouns via -ár derivation. Only Sg3 passiv.

LEXICON BRILJERE_IV Intransitive loan words with more than two syllables with -rit endings. Does not make nouns via -ár derivation. Only the two last syllables are assimilated to sami. Long -e is assimilated in different ways in dialects in Norway and Sweden: In Norway it often becomes -ie, while in Sweden itºs usually -e.

LEXICON BRILJERE_IV_INFL

briljierit # Contracted test examples:

  • briljieriv: briljierit+V+IV+Ind+Prs+Sg1
  • briljeriv: briljierit+V+IV+Ind+Prs+Sg1
  • briljierijiv: briljierit+V+IV+Ind+Prt+Sg1
  • briljerijiv: briljierit+V+IV+Ind+Prt+Sg1
  • briljierijma: briljierit+V+IV+Ind+Prt+Pl1
  • briljerijma: briljierit+V+IV+Ind+Prt+Pl1

LEXICON BRILJERE_IV_NOR Separate lexicon for loan words with ortographic differences between the Norwegian and Swedish forms. Otherwise same as BRILJERE_TV. +Area/SE forms are +Use/NG tagged. Lemma form is -ierit.

LEXICON BRILJERE_IV_INFL_NOR

eksplodierit # Contracted test examples:

  • eksplodieriv: eksplodierit+V+IV+Ind+Prs+Sg1
  • eksploderiv: eksplodierit+V+IV+Ind+Prs+Sg1
  • eksplodierijiv: eksplodierit+V+IV+Ind+Prt+Sg1
  • eksploderijiv: eksplodierit+V+IV+Ind+Prt+Sg1
  • eksplodierijma: eksplodierit+V+IV+Ind+Prt+Pl1
  • eksploderijma: eksplodierit+V+IV+Ind+Prt+Pl1

LEXICON BRILJERE_IV_SW Separate lexicon for loan words with ortographic differences between the Norwegian and Swedish forms. Otherwise same as BRILJERE_TV. +Area/NO forms are +Use/NG tagged. Lemma form is -erit

LEXICON BRILJERE_IV_INFL_SW

exploderit # Contracted test examples:

  • explodieriv: exploderit+V+IV+Ind+Prs+Sg1
  • exploderiv: exploderit+V+IV+Ind+Prs+Sg1
  • explodierijiv: exploderit+V+IV+Ind+Prt+Sg1
  • exploderijiv: exploderit+V+IV+Ind+Prt+Sg1
  • explodierijma: exploderit+V+IV+Ind+Prt+Pl1
  • exploderijma: exploderit+V+IV+Ind+Prt+Pl1

LEXICON ABBONERE_TV Transitive loan words with more than two syllables with -rit endings. Duvvat passives. Does not make nouns via -ár derivation. Only the two last syllables are assimilated to sami. LONG -e is assimilated in different ways in Norway and Sweden: In Norway, it becomes -ie, and in Sweden -e.

LEXICON ABBONERE_TV_INFL

abbonierit # Contracted test examples:

  • abbonieriv: abbonierit+V+TV+Ind+Prs+Sg1
  • abboneriv: abbonierit+V+TV+Ind+Prs+Sg1
  • abbonierijiv: abbonierit+V+TV+Ind+Prt+Sg1
  • abbonerijiv: abbonierit+V+TV+Ind+Prt+Sg1
  • abbonierijma: abbonierit+V+TV+Ind+Prt+Pl1
  • abbonerijma: abbonierit+V+TV+Ind+Prt+Pl1

LEXICON ABBONERE_TV_NOR Separate lexicon for loan words with ortographic differences between the Norwegian and Swedish forms. Otherwise same as ABBONERE_TV. +Area/SE forms are +Use/NG tagged. Lemma form is -ierit

LEXICON ABBONERE_TV_INFL_NOR

akseptierit # Contracted test examples:

  • akseptieriv: akseptierit+V+IV+Ind+Prs+Sg1
  • aksepteriv: akseptierit+V+IV+Ind+Prs+Sg1
  • akseptierijiv: akseptierit+V+IV+Ind+Prt+Sg1
  • aksepterijiv: akseptierit+V+IV+Ind+Prt+Sg1
  • akseptierijma: akseptierit+V+IV+Ind+Prt+Pl1
  • aksepterijma: akseptierit+V+IV+Ind+Prt+Pl1

LEXICON ABBONERE_TV_SW Separate lexicon for loan words with ortographic differences between the Norwegian and Swedish forms. Otherwise same as ABBONERE_TV. +Area/NO forms are +Use/NG tagged. Lemma form is -erit

LEXICON ABBONERE_TV_INFL_SW

accepterit # Contracted test examples:

  • acceptieriv: accepterit+V+IV+Ind+Prs+Sg1
  • accepteriv: accepterit+V+IV+Ind+Prs+Sg1
  • acceptierijiv: accepterit+V+IV+Ind+Prt+Sg1
  • accepterijiv: accepterit+V+IV+Ind+Prt+Sg1
  • acceptierijma: accepterit+V+IV+Ind+Prt+Pl1
  • accepterijma: accepterit+V+IV+Ind+Prt+Pl1

Background

The file itself is located in langs/smj/src/phonology/smj-phon.twolc. The file is modeled upon the corresponding file for North Sámi, but has been revised and differs from it on several issues. The grammatical sources are Spiik 1989: Lulesamisk grammatik and Nystø and Johnsen 2001: Sámásta 2.

File structure

The rule file has the sections Alphabet, Sets, Definition and Rules. The rules are ordered thematically, with 3 main sections: Consonant alternations (except cg), vowel alternations, and consonant gradation.

The Alphabet section

The real Lule Sámi Alphabet

All Lule Saami letters are listed. The Lule Sámi ENG sound is represented as ñ. Lule Sámi letter repertoire is not fully standardised. In the source code we write (and you shall write!) æ; ø; ŋ, but the parser tolerates input written with the the letters ä; ö; ń, ñ (cf. the 4 rules in the file smj/src/orthography/spellrelax.regex).

The 3rd degree mark º is never realized, hence declared as º: 0. º: 0 = Gradation mark %/ = Literal /, not the TWOLC reserved symbol ':' = Apostrophe

h2, g2 etc. are consonants deleted in the Nom. m3, d3 etc. (?) are consonants that undergo certain processes word-finally. This issue should be looked into. Perhaps the two sets can be unified. The reason why there are more distinctions than for sme, is that the cns deletion process is more phonological in sme.

The Dummy symbols

The Dummy symbols are taken from the sme file for convenience, only a small part of them are actually used, they are defined in the Sets section along the way, included there as soon as they are used. The set of actually used Dummy symbols is thus the set declared in "Dummy". The Dummy symbols trigger morphophonological rules. X is used for nouns and adjectives, Y for verbs and Q for processes common to all The symbols themselves are used in the following way:

X1:0
Deletes final consonants in short essive of odd syllables
X2:0
WeG and neutralization of g8, etc. (hivsik-hivsiga)
X3:0
Weg and deletion of g8, etc. (bena-bednaga)
X4:0
e:á and e:å in illatives and px. a:á and o:u in Px and ill of a-stem actors and o-stems
X5:0
e:á, e:å and o:u in odd-syllable nouns, but also for some even nouns (o:u f.eks)
X6:0
Deviant III-I consonant gradation (in contracted stems, guobbmu:guomoj)
X7:0
WeG and e:á, e:å, o:á, o:u in front of diminutives, e:å in -lasj der
X8:0
Stem vowel alternations in Px
X9:0
Stem-vowel and central consonant shortening in first part(s) of compounds
Q1:0
The general weak grade trigger. Stem vowel change e:i and o:u in front of j. Dipht. simpl. Any environment #only# demanding WeG shall use Q1.
Q2:0
Vowel harmony:2nd syll e realized as å whenever 1st syll is å.
Q3:0
WeG in contracted, also does not trigger Dipht simpl.
Q4:0
Stem vowel change e:i and o:u in front of j. Dipht. simpl. Like Q1 but strong grade.
Q5:0
e:á stem vowel change for word diehtet. Weak grade.
Q6:0
e:á stem wovel change for word diehtet. Strong grade.
Q7:0
e:á stem vowel change for word diehtte. Extra strong grade
Q8:0
Stem vowel deletion, impII of verbs.
Q9:0
TBW
Y1:0
Stem vowel deletion, imp 3sg, 3du, 2pl, 3pl of verbs
Y2:0
"Indicative Present Singular 3rd Final Vowel in verbs"
Y3:0
PrsPrc
Y4:0
e > u in front of dersuff, o > u and e > á in front of dersuffix -alla
Y5:0
e > a, i > á, o > u, e > å in verb derivation
Y6:0
"Consonant insertion as II-III strengthening gradation", verbs +PrsPrt and +Imprt+Du2
Y7:0
"Consonant insertion as II-III strengthening gradation", nouns and propernouns
Y8:0
"Stem vowel deletion in even-syllable verbs, imp 1du, 1pl"
Y9:0
"Stem vowel deletion in short passives of even-syllable verbs
Z1:0
TBW "i:á in Verb Derivation guollir>guollár"
Z2:0
e:å, o:u in -lasj der

Morpheme boundaries:

«
Derivational prefix
»
Derivational suffix
%<
Inflectional prefx
%>
Inflectional suffix
#
Word boundary for both lexicalised and dynamic compounds
%^
(exceptional) soft hyphenation point
%
a space
mark before # to indicate dynamic comounds

The Sets section

These are the sets:

Vow
the vowels
Cns
the consonants
StemCns
consonants that may occur in stem-final position
DelCns
the consonants that are deleted in nominative
Dummy
the set of dummy symbols, they are there to trigger certain morphophonological symbols
WeG
the dummy symbols that trigger weak grade

The Definitions section

In this section, the consonants are defined. This includes consonant clusters in the various grades and consonant alternations.

G3 vs G2

The alternation patterns according to Spiik's alternations series.

S7 kkn: k0n series 1
S8 fºf: f0f series 2
S9 jgg: j0g series 3
S4 hkk: h0k series 4
S5 xy: zy (no zeros) series 5
S6 xx: yy (no zeros) series 6
S7 xy: zy (no zeros) series 7
S8 ----- (no cg) series 8
LowerG2
A definition of Grade2 consonant sequences referring mostly to the surface level
LowerG1
A definition of Grade 1 consonant sequences
LowerG12
A definition of Grade 1 or 2 consonant sequences
G32
A definition of Grade 3 or 2 consonant sequences
G3
A definition of Grade 3 consonant sequences

The Rules section

Overview

The rules section has the following chapters: Consonant alternations in certain pos, vowel lengthening, diphthong simplification, stem vowel alternations, consonant gradation rules

Consonant alternations in certain pos

All rules deal with word-final position.

  • *a (is not standard language)
  • *b (is not standard language)

Word Final Devoicing of Certain Single Consonants d9 etc.

  • iemed9#
  • iemet#

Word final weakening -tj and -ttj to -sj part 1

Word final weakening -tj and -ttj to -sj part 2

  • jågåtj
  • jågåsj
  • gålºleX7tj
  • gål0lå0sj

Word Final Deletion of n8 m8 g8 h8

  • loavddag8X3#
  • l0åv0da00#

Word Final Neutralization of g8, h8, m8

Deleting Final h9 in Short Essive of Uneven Syllables

Deleting Final l9 in Short Essive of Uneven Syllables

Deleting Final m9 in Short Essive of Uneven Syllables

Deleting Final n9 in Short Essive of Uneven Syllables

Deleting Final r9 in Short Essive of Uneven Syllables

  • málest#
  • máles0#

Vowel lengthening

The second syllable vowel a is lengthened to á whenever the stem consonants are in grade 1 and the first syllable vowel is short. Short vowels cannot preceed and follow a single intervocalic consonant.

Compulsatory lengthening in grade I even-syllables

  • gussaQ1#
  • gu0sá0#
  • skihpaQ1s#
  • ski0bá0s#

Diphtong simplification

The diphthong simplification handles oa: å and æ: e. Phonologically, these are identical processes, but since the dipthong is written by two letters in the former case and by one letter in the latter, the alternations must be handled separately. This section also handles ie: æ, these are in principle the same as oa: å, but the alternation does not occur in so many contexts.

oa:å Diphtong Simplification Part I

oa:å Diphtong Simplification Part II

  • toahkkeY6X5jn
  • toahkki00jn
  • *toahkkeY6X5jn (is not standard language)
  • *t0åhkki00jn (is not standard language)
  • boalloX4j
  • b0ållu0j
  • roavggoX4j
  • roavggu0j
  • *roavggoX4j (is not standard language)
  • *r0åvggu0j (is not standard language)
  • toasºsoQ1X5jn
  • t0ås0su00jn
  • *toasºsoQ1X5jn (is not standard language)
  • *toas0su00jn (is not standard language)
  • *moasºsoX5jn (is not standard language)
  • *m0ås0su0jn (is not standard language)
  • moasºsoX5jn
  • moas0su0jn
  • goarºroY6X5jn
  • goar0ru00jn
  • goarroY6X5jn
  • goarru00jn
  • *goarºroY6X5jn (is not standard language)
  • *g0år0ru00jn (is not standard language)
  • *goarºroY2 (is not standard language)
  • *g0år0ru0 (is not standard language)
  • goarroY2
  • g0årru0
  • doad0jeY6
  • doaddje0
  • *doad0jeY6 (is not standard language)
  • *d0åddje0 (is not standard language)
  • goarºroY5d9it
  • g0år0ru0dit
  • *goarºroY5d9it (is not standard language)
  • *goar0ru0dit (is not standard language)
  • toab0moY6X4j
  • toabbmu00j
  • toabmoX4j
  • t0åbmu0j
  • *toa0mboY6X4j (is not standard language)
  • *t0åbbmu00j (is not standard language)
  • toabmoX7dallat
  • t0å0mu0dallat
  • *toabmoX7dallat (is not standard language)
  • *toa0mu0dallat (is not standard language)
  • oaddoY6X4j
  • oaddu00j
  • boassjkoQ1X5jn
  • b0å0sjku00jn
  • *boassjkoQ1X5jn (is not standard language)
  • *boas0jku00jn (is not standard language)
  • boajsstoQ1X5jn
  • b0åj0stu00jn
  • *boajsstoQ1X5jn (is not standard language)
  • *boaj0stu00jn (is not standard language)
  • boaggoQ1X5jn
  • b0åkku00jn
  • *boaggoQ1X5jn (is not standard language)
  • *boakku00jn (is not standard language)

examples:

examples:

examples:

examples:

examples:

examples:

examples:

examples:

examples:

examples:

æ:e Diphthong Simplification 1

æ:e Diphthong Simplification 2

  • hærránis
  • hærránis
  • hærránis#gæhttjalibme>
  • hærránis#gæhttjalibme>
  • pasiænnta>Q1
  • pasien0ta>0
  • patænnta>Q1
  • paten0ta>0
  • kvotiænnta>Q1
  • kvotien0ta>0
  • kliænnta>Q1
  • klien0ta>0
  • Lævnnja>Q1
  • Lev0nja>0

ie:æ Diphthong Simplification Part I

  • ielvveY9ut
  • 0ælvv00ut
  • iehttseY1up
  • 0æhtts00up
  • giesseQ8us
  • g0ess00us

ie:æ Diphthong Simplification Part IIa Norwegian æ is an option...

ie:ä Diphthong Simplification Part IIb Swedish ä is an option...

ie:ä Diphthong Simplification Part IIc And there are no other options. This rule to block the e, and allow for the æ and ä.

  • jeht0sa>Y6
  • jæhttse>0
  • jeht0sa>Y6
  • jähttse>0
  • gierre»X7dalla>t
  • g0æ0rá»0dalla>t
  • boarkkaQ1
  • b0år0ka0
  • loavddag8X3#
  • l0åv0da00#

Vowel-change oa:å for verbs part I

Vowel-change oa:å for verbs part II

  • hå0llaY2
  • hoallá0
  • gå0d0naY6
  • goaddne0
  • *hållaY2 (is not standard language)
  • *hållá0 (is not standard language)
  • gå0ht0saY6
  • goahttse0

Stem vowel alternations

This section is divided according to stem vowels: a-, e-, o-, å-stems.

a-stem alternations

For a-stems, there is a: e and a: i. Each alternation is triggered by a combination of phonological content and dummy symbols.

a:e in Present Participle of even-syllable verbs

  • bassa>Y6
  • basse>0

a:i in Prs Prc of even-syllable verbs

  • basºsaY6jt#
  • bas0si0jt#

a-stem vowel deletion

  • giedjeg9>a#
  • giedjeg>a#

e-stem alternations

For e-stems, there is e: i, e: á, e: å, e: u and e: a. Each alternation is triggered by a combination of phonological content and dummy symbols.

e:i in e-stems

  • manasseQ4j
  • manassi0j
  • biesseQ1j
  • bie0si0j
  • boaht0eY6j
  • boahtti0j
  • gálleQ1tj
  • gá0li0sj
  • gálleQ1tjav
  • gá0li0tjav
  • gálleQ1tjin
  • gá0li0tjin
  • gálleQ1tjihpit
  • gá0li0tjihpit
  • gálleQ1tjibá
  • gá0li0tjibá
  • gálleQ1tjip
  • gá0li0tjip
  • gálleQ1tja
  • gá0li0tja
  • gierre>Q1tja
  • gie0ri>0tja
  • gierre>Q1tj
  • gie0ri>0sj

The following two rules constitute a <= / => rule pair.

e:á in certain stem types 1

  • bálggeX4v
  • bálggá0v
  • gálleY3m#
  • gállá0m#
  • gálleQ2v#
  • gá0lá0v#
  • báhkoX7tj#
  • bá0gu0sj#
  • goahteX7tj#
  • goa0dá0sj#
  • *goahteX7tj# (is not standard language)
  • *go00dá0sj# (is not standard language)

e:á in certain stem types 2

  • bárnneX4m
  • bárnná0m
  • *bárnneX4m (is not standard language)
  • *bárnne0m (is not standard language)

e:å in certain stem types with å as root vowel

  • gådeQ2v
  • gådå0v
  • jåhteQ2v
  • jå0då0v
  • gådeY2
  • gådå0
  • jåhteY2
  • jåhtå0
  • jåhteY3m
  • jåhtå0m
  • låhkkeY7tj#
  • låhkkå0sj#

e-stem vowel deletion

  • ielvveY9ut
  • 0ælvv00ut

i-stem alternations

For i-stems, there is i: á. The alternation is triggered by a combination of phonological content and dummy symbols.

i:á in Verb Derivation

o-stem alternations

The duplicates of the three lines of the two following rules are there to resolve the => conflict between the two rules.

o:u in certain stem types 1

o:u in certain stem types 2

u:o in contracted nouns

o-stem vowel deletion

For å-stems there is å:e and å:i and vowel deletion. Each alternation is triggered by a combination of phonological content and dummy symbols.

å:e in Present Participle of even-syllable verbs

å:i in Actor nouns of even-syllable verbs

å-stem vowel deletion

alternations valid for several stem types

Stem vowel deletion in even-syllable verbs, imp 3sg, 3du, 2pl, 3pl

  • ielvveY1up
  • 0ælvv00up
  • giessaY1up
  • giess00up
  • bårråY1up
  • bårr00up

Consonant gradation rules

The consonant gradation rules differ considerably from the corresponding rules for North Sámi. Instead of generalizing oversets of consonants (Cx: Cy &lt;=&gt; ...), each rule contains the alternation for one consonant only, and to the right of the &lt;=&gt; arrow is listed all the contexts where the relevant alternation appears. The disadvantage with this method is that the same context must be written several times, if e.g. both p, t and k are deleted in the same contexts, each of these contexts must be written several times, one for each consonant. The advantage is that there are no conflicts during compilation, compilation takes 10 seconds rather than 3 minutes. The earlier North-Sámi-style rule set was ordered according to CG pattern. This pattern is still visible in the new rules, via the reference S1-3 etc. (Spiik's Series 1, 3-letter pattern, etc) behind each subrule.

This actually opens up for a migration to an xfst rule file instead of the current twolc format, since what xfst really cannot do is generalize over sets (Cx: Cy etc.). This is an issue for future revisions to decide.

The rules are divided in two subsections, deletion rules and change (alternation) rules.

Deletion rules

The b, d, g deletion rules are similar, via the optional ( b ) etc. in front of the "_" symbol, both bm: m and bbm: bm alternations are covered. The contexts differ to a certain extent. For b and d, the III-I special gradation bbm: m is covered by two separate rules, and a special Dummy (X6), not part of the ordinary WeG set.

Note that one of the rules for t: 0 refers to #: as part of its context. As soon as clitics are added to the word form, this rule will thus not be triggered. Look into this when the clitics are added.

Consonant gradation b:0

Consonant gradation d:0

  • bednag8>X3
  • be0na0>0

Consonant gradation g:0

Consonant gradation k:0

Consonant gradation l:0

Consonant gradation m:0

Consonant gradation n:0

Consonant gradation p:0

Consonant gradation s:0

  • russjpeQ1
  • ru0sjpe0
  • *russjpeQ1 (is not standard language)
  • *russjpe0 (is not standard language)

Consonant gradation ŋ:0

Consonant gradation f:0

Consonant gradation r:0

Consonant gradation v:0

Consonant gradation j:0

Consonant gradation t:0

  • oajváladtj#
  • oajvála0sj#

Gradation Series 4, II-I, tj and ts

Change rules

The Cx: Cy format was kept for hk: g, hp: b, ht: d, since the left context h: 0 was unique, and no compilation conflict thus arose.

The bb: pp, gg: kk, dd: tt alternations were split into three rules, since keeping them in one Cx: Cy rule created compilation conflicts. Also, d: t contain a rule not found for the other two...

Gradation Series 4, II-I

bb:pp

  • oabbáQ1
  • oappá0

gg:kk

  • vággeQ1
  • vákke0
  • *vággeQ1 (is not standard language)
  • *vágge0 (is not standard language)

g:k change for clitic -ge

dd:tt and dtj, dts

Gradation Series 7, III-II, ks(t), kt, ktj, kts

Exceptional II-III inverse gradation in present participles

This gradation is only for II-I syllable verbs that get III as present participles.

  • bbm - bm - m
  • ddn - dn - n
  • ddnj- dnj- nj
  • ggŋ - gŋ - ŋ
  • ddj - dj - dj
  • hkk - hk - g
  • hpp - hp - b
  • htt - ht - d
  • httj- htj- tj
  • htts- hts- ts

Strategy: Do insertion rule for the initial element.

Consonant insertion as II-III strengthening gradation with bm, gŋ

Consonant insertion as II-III strengthening gradation with dn/j + as I-III strengthening gradation with d

Consonant insertion as II-III strengthening gradation with hk, hp,

Consonant insertion as II-III strengthening gradation with htt(j/s)

Debugging of twol-rules

All rule conflicts have been successfully resolved. The rule file should be kept that way. Look out for conflicts in the compilation process, and resolve them as they appear!