Scoping natural language processing in Indonesian and Malay for education applications

dc.contributor.author: Maxwell-Smith Zara
dc.contributor.author: Kohler Michelle
dc.contributor.author: Suominen Hanna
dc.contributor.organization: fi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code: 1.2.246.10.2458963.20.85312822902
dc.converis.publication-id: 176230090
dc.converis.url: https://research.utu.fi/converis/portal/Publication/176230090
dc.date.accessioned: 2022-10-27T12:10:51Z
dc.date.available: 2022-10-27T12:10:51Z
dc.description.abstract: Indonesian and Malay are underrepresented in the development of natural language processing (NLP) technologies and available resources are difficult to find. A clear picture of existing work can invigorate and inform how researchers conceptualise worthwhile projects. Using an education sector project to motivate the study, we conducted a wide-ranging overview of Indonesian and Malay human language technologies and corpus work. We charted 657 included studies according to Hirschberg and Manning's 2015 description of NLP, concluding that the field was dominated by exploratory corpus work, machine reading of text gathered from the Internet, and sentiment analysis. In this paper, we identify most published authors and research hubs, and make a number of recommendations to encourage future collaboration and efficiency within NLP in Indonesian and Malay.
dc.format.pagerange: 171
dc.format.pagerange: 228
dc.identifier.isbn: 978-1-955917-23-0
dc.identifier.jour-issn: 0736-587X
dc.identifier.olddbid: 173718
dc.identifier.oldhandle: 10024/156812
dc.identifier.uri: https://www.utupub.fi/handle/11111/32919
dc.identifier.url: https://aclanthology.org/2022.acl-srw.15
dc.identifier.urn: URN:NBN:fi-fe2022102462976
dc.language.iso: en
dc.okm.affiliatedauthor: Suominen, Hanna
dc.okm.discipline: 113 Computer and information sciences [en_GB]
dc.okm.discipline: 113 Tietojenkäsittely ja informaatiotieteet [fi_FI]
dc.okm.internationalcopublication: international co-publication
dc.okm.internationality: International publication
dc.okm.type: A4 Conference Article
dc.publisher.country: United States [en_GB]
dc.publisher.country: Yhdysvallat (USA) [fi_FI]
dc.publisher.country-code: US
dc.relation.conference: Annual Meeting of the Association for Computational Linguistics
dc.relation.doi: 10.18653/v1/2022.acl-srw.15
dc.relation.ispartofjournal: Annual Meeting of the Association for Computational Linguistics
dc.source.identifier: https://www.utupub.fi/handle/10024/156812
dc.title: Scoping natural language processing in Indonesian and Malay for education applications
dc.title.book: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
dc.year.issued: 2022

Files

Name: Scoping natural language processing in Indonesian and Malay for education applications.pdf
Size: 1.68 MB
Format: Adobe Portable Document Format