Sentence Compression for Automatic Subtitling

dc.contributor.authorJuhani Luotolahti
dc.contributor.authorFilip Ginter
dc.contributor.organizationfi=tietojenkäsittelytiede|en=Computer Science|
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.85312822902
dc.contributor.organization-code2606803
dc.converis.publication-id2020416
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/2020416
dc.date.accessioned2022-10-28T13:42:21Z
dc.date.available2022-10-28T13:42:21Z
dc.description.abstract<p> This paper investigates sentence compression for automatic subtitle generation using supervised machine learning. We present a method for sentence compression as well as discuss generation of training data from compressed Finnish sentences, and different approaches to the problem. The method we present outperforms state-of-the-art baseline in both automatic and human &nbsp;valuation. On real data, 44.9% of the sentences produced by the compression algorithm have been judged to be useable as-is or after minor edits.</p>
dc.format.pagerange134
dc.format.pagerange143
dc.identifier.isbn978-91-7519-098-3
dc.identifier.olddbid183759
dc.identifier.oldhandle10024/166853
dc.identifier.urihttps://www.utupub.fi/handle/11111/41044
dc.identifier.urlhttps://aclweb.org/anthology/W/W15/W15-1818.pdf
dc.identifier.urnURN:NBN:fi-fe2021042714403
dc.language.isoen
dc.okm.affiliatedauthorLuotolahti, Matti
dc.okm.affiliatedauthorGinter, Filip
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryLithuaniaen_GB
dc.publisher.countryLiettuafi_FI
dc.publisher.country-codeLT
dc.relation.conferenceNordic Conference on Computational Linguistics
dc.source.identifierhttps://www.utupub.fi/handle/10024/166853
dc.titleSentence Compression for Automatic Subtitling
dc.title.bookProceedings of NoDaLiDa 2015
dc.year.issued2015

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
W15-1818.pdf
Size:
688.05 KB
Format:
Adobe Portable Document Format