Distinguishing translations from non-translations and identifying (in)direct translations’ source languages

dc.contributor.authorIvaska Laura
dc.contributor.organizationfi=englannin kieli, klassilliset kielet ja monikielinen käännösviestintä|en=English, Classics and Multilingual Translation Studies|
dc.contributor.organization-code1.2.246.10.2458963.20.22758552511
dc.converis.publication-id43722199
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/43722199
dc.date.accessioned2022-10-28T12:31:46Z
dc.date.available2022-10-28T12:31:46Z
dc.description.abstract<p>The scope of this study is threefold. First, machine learning will be applied to<br />distinguish translated from non-translated Finnish texts. Then, it will attempt to<br />identify the source languages of the translated Finnish texts. Finally, the source<br />language identification will be tested with indirect translations, that is, with<br />translations made from translations. The three underlying research questions are: 1)<br />Can translated Finnish be distinguished from non-translated Finnish? 2) Can the<br />source languages of Finnish translations be identified? 3) If the answer to question<br />2 is yes, then what happens when the method is applied to indirect translations; will<br />the analysis identify the ultimate source language, the mediating language, or<br />neither?</p><p>This study is based on the hypothesis that translated language contains traces<br />of the source language (Toury 1995). The corpus of the study consists of nontranslated<br />Finnish prose, Finnish prose literature translations made from English,<br />German, French, Modern Greek, and Swedish, as well as indirect translations from<br />Modern Greek into Finnish via English, German, French, and Swedish. The<br />analyses are based on cluster analysis and support vector machines using the<br />frequencies of the most frequent lemmatized words.<br />Results show that translated and non-translated Finnish can be distinguished<br />by using machine learning techniques. Support vector machine-based source<br />language identification, however, was only partially successful, while a cluster<br />analysis suggested that there is coherence within a group of texts translated from<br />the same source language and variation between the groups of texts with different<br />source languages. Clustering was further tested with indirect translations, and the<br />results were mixed: six of the thirteen tested indirect translations clustered with<br />direct translations from the ultimate source language, two with translations from<br />their mediating languages, and five with neither.<br /></p>
dc.format.pagerange125
dc.format.pagerange138
dc.identifier.eisbn978-952-62-2321-6
dc.identifier.isbn978-952-62-2320-9
dc.identifier.issn1796-4725
dc.identifier.jour-issn1796-4725
dc.identifier.olddbid177067
dc.identifier.oldhandle10024/160161
dc.identifier.urihttps://www.utupub.fi/handle/11111/48468
dc.identifier.urlhttps://www.oulu.fi/sites/default/files/content/ProceedingsStudiaHumanioraOuluensia17.pdf
dc.identifier.urnURN:NBN:fi-fe2021042825020
dc.language.isoen
dc.okm.affiliatedauthorIvaska, Laura
dc.okm.discipline6121 Languagesen_GB
dc.okm.discipline6121 Kielitieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityDomestic publication
dc.okm.typeA4 Conference Article
dc.publisher.countryFinlanden_GB
dc.publisher.countrySuomifi_FI
dc.publisher.country-codeFI
dc.publisher.placeOulu
dc.relation.conferenceResearch Data and Humanities
dc.relation.ispartofjournalStudia Humaniora Ouluensia
dc.relation.ispartofseriesStudia humaniora ouluensia
dc.relation.volume17
dc.source.identifierhttps://www.utupub.fi/handle/10024/160161
dc.titleDistinguishing translations from non-translations and identifying (in)direct translations’ source languages
dc.title.bookProceedings of the Research Data and Humanities (RDHUM) 2019 Conference: Data, Methods and Tools
dc.year.issued2019

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
ivaska_distinguishing_translations.pdf
Size:
385.95 KB
Format:
Adobe Portable Document Format