This is for now an ad-hoc page of links and ideas
Machine learning and language technology.
Mainstream language technology is dominated by neural networks.
Good to read
Things we have looked at
Ideas we have
Vi har språklege data (i Korp nå, men vi har mer):
- sme: over 32M tokens, nesten 3M setninger
- nob-sme: nesten 2.5M tokens, mer enn 150K setninger