TextSimplifier : A Modular, Extensible, and Context Sensitive Simplification Framework for Improved Natural Language Understanding

dc.contributor.authorSeneviratne Sandaru
dc.contributor.authorDaskalaki Elena
dc.contributor.authorSuominen Hanna
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.85312822902
dc.converis.publication-id387499458
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/387499458
dc.date.accessioned2025-08-28T01:52:14Z
dc.date.available2025-08-28T01:52:14Z
dc.description.abstractNatural language understanding is fundamental to knowledge acquisition in today's information society. However, natural language is often ambiguous with frequent occurrences of complex terms, acronyms, and abbreviations that require substitution and disambiguation, for example, by “translation” from complex to simpler text for better understanding. These tasks are usually difficult for people with limited reading skills, second language learners, and non-native speakers. Hence, the development of text simplification systems that are capable of simplifying complex text is of paramount importance. Thus, we conducted a user study to identify which components are essential in a text simplification system. Based on our findings, we proposed an improved text simplification framework, covering a broader range of aspects related to lexical simplification - from complexity identification to lexical substitution and disambiguation - while supplementing the simplified outputs with additional information for better understandability. Based on the improved framework, we developed TextSimplifier, a modularised, context-sensitive, end-to-end simplification framework, and engineered its web implementation. This system targets lexical simplification that identifies complex terms and acronyms followed by their simplification through substitution and disambiguation for better understanding of complex language.
dc.format.pagerange21
dc.format.pagerange32
dc.identifier.isbn978-954-452-086-1
dc.identifier.olddbid208192
dc.identifier.oldhandle10024/191219
dc.identifier.urihttps://www.utupub.fi/handle/11111/57623
dc.identifier.urlhttps://aclanthology.org/2023.tsar-1.3
dc.identifier.urnURN:NBN:fi-fe2025082787904
dc.language.isoen
dc.okm.affiliatedauthorSuominen, Hanna
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryBulgariaen_GB
dc.publisher.countryBulgariafi_FI
dc.publisher.country-codeBG
dc.publisher.placeShoumen
dc.relation.conferenceWorkshop on Text Simplification, Accessibility and Readability
dc.relation.doi10.26615/978-954-452-086-1_003
dc.source.identifierhttps://www.utupub.fi/handle/10024/191219
dc.titleTextSimplifier : A Modular, Extensible, and Context Sensitive Simplification Framework for Improved Natural Language Understanding
dc.title.bookTSAR 2023 : Proceedings of the 2nd edition of the Workshop on Text Simplification, Accessibility and Readability - associated with The 14th International Conference on Recent Advances in Natural Language Processing’2023
dc.year.issued2023

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
2023.tsar-1.3 (2).pdf
Size:
584.9 KB
Format:
Adobe Portable Document Format