Looking under the hood: which linguistic features contribute to the source language classification of direct and indirect translations into Finnish, and why is that?

Ivaska, Ilmari; Ivaska, Laura

dc.contributor.author	Ivaska, Ilmari
dc.contributor.author	Ivaska, Laura
dc.contributor.organization	English, Classics and Multilingual Translation Studies
dc.contributor.organization	Finnish, Finno-Ugric and Scandinavian languages
dc.contributor.organization-code	1.2.246.10.2458963.20.59108485091
dc.converis.publication-id	470940483
dc.converis.url	https://research.utu.fi/converis/portal/Publication/470940483
dc.date.accessioned	2025-08-28T03:40:44Z
dc.date.available	2025-08-28T03:40:44Z
dc.description.abstract	The study of features that affect the linguistic form of translated texts has been one of the central questions within the field of corpus-based translation studies. In the partially overlapping field of computational linguistics, previous studies have shown that source languages of individual texts can be detected automatically in direct translations and indirect translations (i.e., translations done from translations). However, computationally oriented approaches have paid limited attention to what specific linguistic features make successful classification possible. Consequently, the types of linguistic phenomena characterizing translations and the kinds of linguistic interference that can be detected in them remain underexplored. In this study, we study the linguistic features that contribute to the identification of the source language of direct translations from English, French, German, Greek, and Swedish, as well as indirect translations from Greek into Finnish, with English, French, German, and Swedish as mediating languages. Theoretically, this study builds on Halverson’s (2017) gravitational pull model to explain the mechanisms behind our findings in a theoretically sound fashion and to generate theoretically motivated, specific hypotheses to be tested by future research. The analysis makes use of keyness analysis as a supervised machine learning technique, as well as exploratory factor analysis (EFA) as an unsupervised machine learning technique. The results indicate that sentence length, sentence-initial adverbs and sentence-final specification are the linguistic features that set the different types of translations apart from each other. Furthermore, the salient features of the ultimate source language outweigh those of the mediating languages in indirect translations or the entrenched parallels between specific language pairs.
dc.format.pagerange	239
dc.identifier.eissn	1588-2519
dc.identifier.jour-issn	1585-1923
dc.identifier.olddbid	210985
dc.identifier.oldhandle	10024/194012
dc.identifier.uri	https://www.utupub.fi/handle/11111/56792
dc.identifier.url	https://doi.org/10.1556/084.2024.00912
dc.identifier.urn	URN:NBN:fi-fe2025082792802
dc.language.iso	en
dc.okm.affiliatedauthor	Ivaska, Ilmari
dc.okm.affiliatedauthor	Ivaska, Laura
dc.okm.discipline	6121 Languages	en_GB
dc.okm.discipline	6121 Kielitieteet	fi_FI
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	Akadémiai Kiadó
dc.publisher.country	Hungary	en_GB
dc.publisher.country	Unkari	fi_FI
dc.publisher.country-code	HU
dc.relation.doi	10.1556/084.2024.00912
dc.relation.ispartofjournal	Across Languages and Cultures
dc.relation.issue	2
dc.relation.volume	25
dc.source.identifier	https://www.utupub.fi/handle/10024/194012
dc.title	Looking under the hood: which linguistic features contribute to the source language classification of direct and indirect translations into Finnish, and why is that?
dc.year.issued	2024

Files

Name: ivaska-ivaska_2024_looking-under-the-hood_authors-accepted-manuscript.pdf
Size: 1.18 MB
Format: Adobe Portable Document Format

Collections

Self-archived publications