Sentence Compression for Automatic Subtitling
| dc.contributor.author | Juhani Luotolahti | |
| dc.contributor.author | Filip Ginter | |
| dc.contributor.organization | fi=tietojenkäsittelytiede|en=Computer Science| | |
| dc.contributor.organization | fi=tietotekniikan laitos|en=Department of Computing| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.85312822902 | |
| dc.contributor.organization-code | 2606803 | |
| dc.converis.publication-id | 2020416 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/2020416 | |
| dc.date.accessioned | 2022-10-28T13:42:21Z | |
| dc.date.available | 2022-10-28T13:42:21Z | |
| dc.description.abstract | This paper investigates sentence compression for automatic subtitle generation using supervised machine learning. We present a method for sentence compression and discuss the generation of training data from compressed Finnish sentences, as well as different approaches to the problem. The method we present outperforms a state-of-the-art baseline in both automatic and human evaluation. On real data, 44.9% of the sentences produced by the compression algorithm have been judged to be usable as-is or after minor edits. | |
| dc.format.pagerange | 134 | |
| dc.format.pagerange | 143 | |
| dc.identifier.isbn | 978-91-7519-098-3 | |
| dc.identifier.olddbid | 183759 | |
| dc.identifier.oldhandle | 10024/166853 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/41044 | |
| dc.identifier.url | https://aclweb.org/anthology/W/W15/W15-1818.pdf | |
| dc.identifier.urn | URN:NBN:fi-fe2021042714403 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Luotolahti, Matti | |
| dc.okm.affiliatedauthor | Ginter, Filip | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | Lithuania | en_GB |
| dc.publisher.country | Liettua | fi_FI |
| dc.publisher.country-code | LT | |
| dc.relation.conference | Nordic Conference on Computational Linguistics | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/166853 | |
| dc.title | Sentence Compression for Automatic Subtitling | |
| dc.title.book | Proceedings of NoDaLiDa 2015 | |
| dc.year.issued | 2015 |