Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions
| dc.contributor.author | Kira Droganova | |
| dc.contributor.author | Filip Ginter | |
| dc.contributor.author | Jenna Kanerva | |
| dc.contributor.author | Daniel Zeman | |
| dc.contributor.organization | fi=kieli- ja puheteknologia|en=Language and Speech Technology| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.47465613983 | |
| dc.converis.publication-id | 37617276 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/37617276 | |
| dc.date.accessioned | 2022-10-28T14:40:54Z | |
| dc.date.available | 2022-10-28T14:40:54Z | |
| dc.description.abstract | <p>In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enrichment of training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak. These data enrichment methods draw upon self-training and tri-training, combined with a stratified sampling method mimicking the structural complexity of the original treebank. In addition, using these same methods, we also demonstrate small improvements over the CoNLL-17 parsing shared task winning system for four of the five languages, not only restricted to the elliptical constructions.<br /></p> | |
| dc.format.pagerange | 47 | |
| dc.format.pagerange | 54 | |
| dc.identifier.isbn | 978-1-948087-78-0 | |
| dc.identifier.olddbid | 189649 | |
| dc.identifier.oldhandle | 10024/172743 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/44665 | |
| dc.identifier.url | http://aclweb.org/anthology/W18-6006 | |
| dc.identifier.urn | URN:NBN:fi-fe2021042827530 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Ginter, Filip | |
| dc.okm.affiliatedauthor | Kanerva, Jenna | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.internationalcopublication | international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.relation.conference | Universal Dependencies Workshop | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/172743 | |
| dc.title | Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions | |
| dc.title.book | Proceedings of the Second Workshop on Universal Dependencies (UDW 2018) | |
| dc.year.issued | 2018 |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- W18-6006.pdf
- Size:
- 212.1 KB
- Format:
- Adobe Portable Document Format
- Description:
- Publisher's version