Sentence Compression for Automatic Subtitling
Juhani Luotolahti; Filip Ginter
The permanent address of this publication is:
https://urn.fi/URN:NBN:fi-fe2021042714403
Abstract
This paper investigates sentence compression for automatic subtitle generation using supervised machine learning. We present a sentence compression method and discuss how training data can be generated from compressed Finnish sentences, as well as different approaches to the problem. The method outperforms a state-of-the-art baseline in both automatic and human evaluation. On real data, 44.9% of the sentences produced by the compression algorithm were judged to be usable as-is or after minor edits.
