The FISKMO project: Resources and tools for Finnish-Swedish machine translation and cross-linguistic research

dc.contributor.authorJörg Tiedemann
dc.contributor.authorTommi Nieminen
dc.contributor.authorMikko Aulamo
dc.contributor.authorJenna Kanerva
dc.contributor.authorAkseli Leino
dc.contributor.authorFilip Ginter
dc.contributor.authorNiko Papula
dc.contributor.organizationfi=kieli- ja puheteknologia|en=Language and Speech Technology|
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.85312822902
dc.contributor.organization-code2606805
dc.converis.publication-id51180793
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/51180793
dc.date.accessioned2025-08-28T03:15:43Z
dc.date.available2025-08-28T03:15:43Z
dc.description.abstract<p>This paper presents FISKMÖ, a project that focuses on the development of resources and tools for cross-linguistic research and machine translation between Finnish and Swedish. The goal of the project is the compilation of a massive parallel corpus out of translated material collected from web sources, public and private organisations and language service providers in Finland with its two official languages. The project also aims at the development of open and freely accessible translation services for those two languages for the general purpose and for domain-specific use. We have released new data sets with over 3 million translation units, a benchmark test set for MT development, pre-trained neural MT models with high coverage and competitive performance and a self-contained MT plugin for a popular CAT tool. The latter enables offline translation without dependencies on external services making it possible to work with highly sensitive data without compromising security concerns.<br /></p>
dc.format.pagerange3808
dc.format.pagerange3815
dc.identifier.isbn978-10-95546-34-5
dc.identifier.olddbid210439
dc.identifier.oldhandle10024/193466
dc.identifier.urihttps://www.utupub.fi/handle/11111/51567
dc.identifier.urlhttps://www.aclweb.org/anthology/2020.lrec-1.470/
dc.identifier.urnURN:NBN:fi-fe2021042826550
dc.language.isoen
dc.okm.affiliatedauthorKanerva, Jenna
dc.okm.affiliatedauthorGinter, Filip
dc.okm.affiliatedauthorLeino, Akseli
dc.okm.discipline222 Other engineering and technologiesen_GB
dc.okm.discipline222 Muu tekniikkafi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryFranceen_GB
dc.publisher.countryRanskafi_FI
dc.publisher.country-codeFR
dc.publisher.placeMarseille, France
dc.relation.conferenceInternational Conference on Language Resources and Evaluation
dc.source.identifierhttps://www.utupub.fi/handle/10024/193466
dc.titleThe FISKMO project: Resources and tools for Finnish-Swedish machine translation and cross-linguistic research
dc.title.bookProceedings of the 12th Language Resources and Evaluation Conference
dc.year.issued2020

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
2020.lrec-1.470.pdf
Size:
477.22 KB
Format:
Adobe Portable Document Format
Description:
Publisher´s PDF