Dr. Livingstone, I presume? Polishing of foreign character identification in literary texts

dc.contributor.authorKonovalova Aleksandra
dc.contributor.authorToral Antonio
dc.contributor.authorTaivalkoski-Shilov Kristiina
dc.contributor.organizationfi=kieli- ja käännöstieteiden laitos|en=School of Languages and Translation Studies|
dc.contributor.organization-code2602100
dc.converis.publication-id176107951
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/176107951
dc.date.accessioned2022-10-28T12:42:33Z
dc.date.available2022-10-28T12:42:33Z
dc.description.abstract<p>Character identification is a key element for many narrative-related tasks. To implement it, the baseform of the name of the character (or lemma) needs to be identified, so different appearances of the same character in the narrative could be aligned. In this paper we tackle this problem in translated texts (English–Finnish translation direction), where the challenge regarding lemmatizing foreign names in an agglutinative language appears. To solve this problem, we present and compare several methods. The results show that the method based on a search for the shortest version of the name proves to be the easiest, best performing (83.4% F1), and most resource-independent.</p>
dc.format.pagerange123
dc.format.pagerange128
dc.identifier.isbn978-1-955917-73-5
dc.identifier.olddbid178391
dc.identifier.oldhandle10024/161485
dc.identifier.urihttps://www.utupub.fi/handle/11111/43311
dc.identifier.urlhttps://aclanthology.org/2022.naacl-srw.16/
dc.identifier.urnURN:NBN:fi-fe2022091258602
dc.language.isoen
dc.okm.affiliatedauthorKonovalova, Aleksandra
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline616 Other humanitiesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline616 Muut humanistiset tieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.conferenceConference of the North American Chapter of the Association for Computational Linguistics
dc.relation.doi10.18653/v1/2022.naacl-srw.16
dc.source.identifierhttps://www.utupub.fi/handle/10024/161485
dc.titleDr. Livingstone, I presume? Polishing of foreign character identification in literary texts
dc.title.bookProceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop
dc.year.issued2022

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
2022.naacl-srw.16.pdf
Size:
124.91 KB
Format:
Adobe Portable Document Format