Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages

dc.contributor.authorOlstad Anne Marte Haug
dc.contributor.authorSmolander Anna
dc.contributor.authorStrömbergsson Sofia
dc.contributor.authorYlinen Sari
dc.contributor.authorLehtonen Minna
dc.contributor.authorKurimo Mikko
dc.contributor.authorGetman Yaroslav
dc.contributor.authorGrósz Tamás
dc.contributor.authorCao Xinwei
dc.contributor.authorSvendsen Torbjørn
dc.contributor.authorSalvi Giampiero
dc.contributor.organizationfi=logopedia|en=Speech-Language Pathology|
dc.contributor.organization-code1.2.246.10.2458963.20.46679761984
dc.converis.publication-id404724456
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/404724456
dc.date.accessioned2025-08-28T00:45:28Z
dc.date.available2025-08-28T00:45:28Z
dc.description.abstract<p>This paper reports on the experience collecting a number of corpora of Nordic languages spoken by children. The aim of the data collection is providing annotated data to develop and evaluate computer assisted pronunciation assessment systems both for non-native children learning a Nordic language (L2) and for L1 children with speech sound disorder (SSD). The paper presents the challenges encountered recording and annotating data for Finnish, Swedish and Norwegian, as well as the ethical considerations related with making this data publicly available. We hope that sharing this experience will encourage others to collect similar data for other languages. Of the different data collections, we were able to make the Norwegian corpus publicly available in the hope that it will serve as a reference in pronunciation assessment research.<br></p>
dc.format.pagerange3529
dc.format.pagerange3537
dc.identifier.eisbn978-2-493814-10-4
dc.identifier.issn2522-2686
dc.identifier.jour-issn2522-2686
dc.identifier.olddbid206342
dc.identifier.oldhandle10024/189369
dc.identifier.urihttps://www.utupub.fi/handle/11111/45515
dc.identifier.urlhttps://aclanthology.org/2024.lrec-main.313.pdf
dc.identifier.urnURN:NBN:fi-fe2025082791219
dc.language.isoen
dc.okm.affiliatedauthorLehtonen, Minna
dc.okm.discipline6121 Languagesen_GB
dc.okm.discipline6121 Kielitieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryItalyen_GB
dc.publisher.countryItaliafi_FI
dc.publisher.country-codeIT
dc.relation.conferenceLanguage Resources and Evaluation
dc.relation.ispartofjournalLREC Proceedings
dc.source.identifierhttps://www.utupub.fi/handle/10024/189369
dc.titleCollecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages
dc.title.bookProceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
dc.year.issued2024

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
2024.lrec-main.313.pdf
Size:
260.36 KB
Format:
Adobe Portable Document Format