Finnish Paraphrase Corpus
| dc.contributor.author | Kanerva Jenna | |
| dc.contributor.author | Ginter Filip | |
| dc.contributor.author | Chang Li-Hsin | |
| dc.contributor.author | Rastas Iiro | |
| dc.contributor.author | Skantsi Valtteri | |
| dc.contributor.author | Kilpeläinen Jemina | |
| dc.contributor.author | Kupari Hanna-Mari | |
| dc.contributor.author | Saarni Jenna | |
| dc.contributor.author | Sevón Maija | |
| dc.contributor.author | Tarkka Otto | |
| dc.contributor.organization | Data Analytics (fi: data-analytiikka) | |
| dc.contributor.organization | Department of Computing (fi: tietotekniikan laitos) | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.68940835793 | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.85312822902 | |
| dc.contributor.organization-code | 2610301 | |
| dc.converis.publication-id | 53727016 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/53727016 | |
| dc.date.accessioned | 2025-08-28T02:08:14Z | |
| dc.date.available | 2025-08-28T02:08:14Z | |
| dc.description.abstract | In this paper, we introduce the first fully manually annotated paraphrase corpus for Finnish containing 53,572 paraphrase pairs harvested from alternative subtitles and news headings. Out of all paraphrase pairs in our corpus, 98% are manually classified to be paraphrases at least in their given context, if not in all contexts. Additionally, we establish a manual candidate selection method and demonstrate its feasibility in high quality paraphrase selection in terms of both cost and quality. | |
| dc.format.pagerange | 288 | |
| dc.format.pagerange | 298 | |
| dc.identifier.isbn | 978-91-7929-614-8 | |
| dc.identifier.issn | 1650-3686 | |
| dc.identifier.jour-issn | 1650-3686 | |
| dc.identifier.olddbid | 208638 | |
| dc.identifier.oldhandle | 10024/191665 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/58156 | |
| dc.identifier.url | https://ep.liu.se/en/conference-article.aspx?series=ecp&issue=178&Article_No=29 | |
| dc.identifier.urn | URN:NBN:fi-fe2021093048687 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Kanerva, Jenna | |
| dc.okm.affiliatedauthor | Ginter, Filip | |
| dc.okm.affiliatedauthor | Chang, Li-Hsin | |
| dc.okm.affiliatedauthor | Rastas, Iiro | |
| dc.okm.affiliatedauthor | Skantsi, Valtteri | |
| dc.okm.affiliatedauthor | Kilpeläinen, Jemina | |
| dc.okm.affiliatedauthor | Kupari, Hanna-Mari | |
| dc.okm.affiliatedauthor | Saarni, Jenna | |
| dc.okm.affiliatedauthor | Sevón, Maija | |
| dc.okm.affiliatedauthor | Tarkka, Otto | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | Sweden | en_GB |
| dc.publisher.country-code | SE | |
| dc.relation.conference | Nordic Conference on Computational Linguistics | |
| dc.relation.ispartofjournal | Linköping Electronic Conference Proceedings | |
| dc.relation.ispartofseries | Linköping Electronic Conference Proceedings | |
| dc.relation.volume | 178 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/191665 | |
| dc.title | Finnish Paraphrase Corpus | |
| dc.title.book | Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021) | |
| dc.year.issued | 2021 |
Files
- Name: ecp2021178029.pdf
- Size: 260.02 KB
- Format: Adobe Portable Document Format
- Description: Publisher's PDF