UIT The arctic university of Norway > Giellatekno
 

Apertium Commands

Gohččumat maid dárbbašat dávjjimusat:

Jorgalit:

  • Jorgalit cealkaga:
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma
  • Jorgalit olles fiilla:
cat texts/tarina.sme.txt | apertium -d. sme-sma | less
  • Jorgalit olles fiilla hash ja nástti haga (omd. evaluerejeddjiide):
cat texts/tarina.sme.txt | apertium -d. -u sme-sma
  • Jorgalit nu ahte oainnát maid prográmma ii máhte genereret, omd:
#pyeri<adj><comp><der_avt><adv> 


cat texts/tarina.sme.txt | apertium -d. sme-sma-dgen | less

Debuggen:

  • Boađus ovdal transfernjuolggadusaid:
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-biltrans | tr ' ' '\n'
  • Boađus maŋŋel t1x-fiilla:
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-chunker
  • Davvisámi disambiguerema disambigueren:
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-disam | tr ' ' '\n'

Iskat MT-vuogádaga analysahtor-output:

MT-vuogádaga analysáhtoriin leat dušše sánit mat leat mielde dix-fiillas.

echo "lohkan" | hfst-lookup sme-smn.automorf.hfst
echo "baakoem" | hfst-lookup sma-sme.automorf.hfst

Analysahtor-output buot sániiguin, eai dušše bidix-sánit:

echo "lohkan" | hfst-lookup .deps/sme.automorf.hfst
echo "baakoem" | hfst-lookup .deps/sma.automorf.hfst

Iskat MT-systema genererema:

MT-vuogádaga genereráhtoriin leat dušše sánit mat leat mielde dix-fiillas, ja seamma PoS: in mii lea dix-fiillas.

echo "sátni<n><sg><acc>" | hfst-lookup sma-sme.autogen.hfst
echo "baakoe<n><sg><acc>" | hfst-lookup sme-sma.autogen.hfst

Jorgalanteasta:

Iskka dáid gohččumiid, ja de oainnát mo Apertium bargá cehkiid mielde. Lonut sma = smn jnv.

echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-morph
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-disam
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-biltrans   # sme-analysa
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-chunker
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-interchunk3
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma-postchunk
echo "Don galggat boahtit skuvlii." | apertium -d. sme-sma

Hukset parallealla jorgalusaid (python-jorgalusa -- tmx-fiillaide)

Input: tmx-máhppa.

  • (text-máhpa sisdoallu)
    • python check_mt-otpt.py -d tmx_data
  • (sámedikki siiddut)
    • python check_mt-otpt.py -d fi.samediggi
  • časkit buot output-fiillaid oktii ja rahpat daid fierbmelohkkis:
    • cat otpt-dir/* > testenfiillat.html
    • open testenfiillat.html

Hukset parallealla jorgalusaid (tmx haga)

Input: sme- ja smj-fiillat parallelliserejuvvon nu, ahte juohke cealkka lea seammá linnjás.

cat -n texts/samediggidiedadus_samegiela_birra_2012.sme.txt |sed 's/	/a	/g;' > xdiedahusa
cat -n texts/samediggidiedadus_samegiela_birra_2012.smj.txt |sed 's/	/b	/g;' > xdiedahusa
cat texts/samediggidiedadus_samegiela_birra_2012.sme.txt |apertium -d. sme-smj|cat -n|sed 's/	/c	/g;' > xdiedahusc
yes|head -1217|cat -n|sed 's/	y/d	/g;' > xdiedahusd
sort -n xdiedahus*|l

Generating modes:

Dás oainnát buot vejolaš gohččumiid:

apertium-gen-modes modes.xml

Missing lists

Leago missing list up to date? (ovdamearka = smn)

cat listanamma | hfst-proc sme-smn.automorf.hfst
cat dev/sme_FI.N.nocmp.missing |hfst-proc sme-smn.automorf.hfst |grep -v '/\*'|see