On variability in the identification and labelling of disfluencies — preliminary results from 23 annotations of the same data

Trouvain, Jürgen; Crible, Ludivine; Belz, Malte; Betz, Simon; Beňuš, Štefan; Baqué, Lorraine; Cantarutti, Marina; Di Napoli, Jessica; Didirková, Ivana; Machuca, Maria; Mareková, Lucia; Niculescu, Oana; Peltonen, Pauliina; Pistono, Aurelie; Schettino, Loredana; Silber-Varod, Vered; Williams, Simon

View/Open

trouvain25_diss.pdf (676.6Kb)

doi:10.21437/DiSS.2025-12

URI

https://www.isca-archive.org/tmp/diss_2025/trouvain25_diss.html

The permanent address of this publication is:
https://urn.fi/URN:NBN:fi-fe202601216838

Abstract

This study provides a preliminary report on a large inter-annotator agreement experiment where 23 expert annotators from various research backgrounds identified and labelled disfluencies in the same speech sample. Each annotator was instructed to analyze the sample according to the framework (definitions, segmentation, labels, etc.) they typically use. The annotations were then processed and compared across three different dimensions: 1) the scope of the chosen typology and the definitions within, 2) the implementation of the typology in terms of annotation tiers and labels, and 3) the temporal alignment of the annotations. Preliminary findings reveal that there are substantial variations between annotators on various levels of annotation. The lack of a common standard becomes particularly evident in more complex segments, such as repairs.
