Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions

dc.contributor.authorKira Droganova
dc.contributor.authorFilip Ginter
dc.contributor.authorJenna Kanerva
dc.contributor.authorDaniel Zeman
dc.contributor.organizationfi=kieli- ja puheteknologia|en=Language and Speech Technology|
dc.contributor.organization-code1.2.246.10.2458963.20.47465613983
dc.converis.publication-id37617276
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/37617276
dc.date.accessioned2022-10-28T14:40:54Z
dc.date.available2022-10-28T14:40:54Z
dc.description.abstract<p>In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enrichment of training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak. These data enrichment methods draw upon self-training and tri-training, combined with a stratified sampling method mimicking the structural complexity of the original treebank. In addition, using these same methods, we also demonstrate small improvements over the CoNLL-17 parsing shared task winning system for four of the five languages, not only restricted to the elliptical constructions.<br /></p>
dc.format.pagerange47
dc.format.pagerange54
dc.identifier.isbn978-1-948087-78-0
dc.identifier.olddbid189649
dc.identifier.oldhandle10024/172743
dc.identifier.urihttps://www.utupub.fi/handle/11111/44665
dc.identifier.urlhttp://aclweb.org/anthology/W18-6006
dc.identifier.urnURN:NBN:fi-fe2021042827530
dc.language.isoen
dc.okm.affiliatedauthorGinter, Filip
dc.okm.affiliatedauthorKanerva, Jenna
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.relation.conferenceUniversal Dependencies Workshop
dc.source.identifierhttps://www.utupub.fi/handle/10024/172743
dc.titleMind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions
dc.title.bookProceedings of the Second Workshop on Universal Dependencies (UDW 2018)
dc.year.issued2018

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
W18-6006.pdf
Size:
212.1 KB
Format:
Adobe Portable Document Format
Description:
Publisher's version