Consider this sentence: Tearpmat biddjojuvvoojit neahttabáikái www.risten.no‘s dađistaga go Sámi giellalávdegoddi lea dohkkehan tearpmaid. It is presently unrecognised as a URL because the URL parser does not know about morphology: "<www>" "www" N Prop Sem/Txt ACR Sg Acc <W:0.0000000000> "www" N Prop Sem/Txt ACR Sg Gen <W:0.0000000000> "www" N Prop Sem/Txt ACR Sg Nom <W:0.0000000000> "www" N Sem/Txt Prop ACR Sg Acc <W:0.0000000000> "www" N Sem/Txt Prop ACR Sg Gen <W:0.0000000000> "www" N Sem/Txt Prop ACR Sg Nom <W:0.0000000000> "<.>" "." CLB <W:0.0000000000> "<risten>" "riestit" V TV Ind Prt Sg1 <W:0.0000000000> "ristat" V TV Ind Prt Sg1 <W:0.0000000000> "ristet" V TV Actio Gen <W:0.0000000000> "ristet" V TV Actio Nom <W:0.0000000000> "ristet" V TV Ind Prs Sg1 <W:0.0000000000> "ristet" V TV Ind Prt ConNeg <W:0.0000000000> "ristet" V TV PrfPrc <W:0.0000000000> "ristet" VV TV Der/NomAct N Sg Gen <W:0.0000000000> "ristet" VV TV Der/NomAct N Sg Nom <W:0.0000000000> "<.>" "." CLB <W:0.0000000000> :no‘s "<dađistaga>" "dađistaga" Adv <W:0.0000000000> (output from hfst-tokenise). The easiest solution is to build a separate fst with just the tags and affixes, and then concatenate it with the URL parser.
Definitely essential for both analysis and grammar checking to be able to analyze these.
*** Bug 2619 has been marked as a duplicate of this bug. ***
illativ sg norm is with one i only :i