m-Networks: Adapting the Triplet Networks for Acronym Disambiguation

Seneviratne Sandaru; Daskalaki Elena; Lenskiy Artem; Suominen Hanna

m-Networks: Adapting the Triplet Networks for Acronym Disambiguation

dc.contributor.author	Seneviratne Sandaru
dc.contributor.author	Daskalaki Elena
dc.contributor.author	Lenskiy Artem
dc.contributor.author	Suominen Hanna
dc.contributor.organization	fi=tietotekniikan laitos\|en=Department of Computing\|
dc.contributor.organization-code	1.2.246.10.2458963.20.85312822902
dc.converis.publication-id	176821003
dc.converis.url	https://research.utu.fi/converis/portal/Publication/176821003
dc.date.accessioned	2022-11-29T15:43:48Z
dc.date.available	2022-11-29T15:43:48Z
dc.description.abstract	<p>Acronym disambiguation (AD) is the process of identifying the correct expansion of the acronyms in text. AD is crucial in natural language understanding of scientific and medical documents due to the high prevalence of technical acronyms and the possible expansions. Given that natural language is often ambiguous with more than one meaning for words, identifying the correct expansion for acronyms requires learning of effective representations for words, phrases, acronyms, and abbreviations based on their context. In this paper, we proposed an approach to leverage the triplet networks and triplet loss which learns better representations of text through distance comparisons of embeddings. We tested both the triplet network-based method and the modified triplet network-based method with m networks on the AD dataset from the SDU@AAAI-21 AD task, CASI dataset, and MeDAL dataset. F scores of 87.31%, 70.67%, and 75.75% were achieved by the m network-based approach for SDU, CASI, and MeDAL datasets respectively indicating that triplet network-based methods have comparable performance but with only 12% of the number of parameters in the baseline method. This effective implementation is available at https://github.com/sandaruSen/m_networks under the MIT license.</p>
dc.format.pagerange	29
dc.identifier.isbn	978-1-955917-77-3
dc.identifier.olddbid	190091
dc.identifier.oldhandle	10024/173182
dc.identifier.uri	https://www.utupub.fi/handle/11111/32190
dc.identifier.url	https://aclanthology.org/2022.clinicalnlp-1.3/
dc.identifier.urn	URN:NBN:fi-fe2022112967663
dc.language.iso	en
dc.okm.affiliatedauthor	Suominen, Hanna
dc.okm.discipline	113 Computer and information sciences	en_GB
dc.okm.discipline	113 Tietojenkäsittely ja informaatiotieteet	fi_FI
dc.okm.internationalcopublication	international co-publication
dc.okm.internationality	International publication
dc.okm.type	A4 Conference Article
dc.publisher.country	United States	en_GB
dc.publisher.country	Yhdysvallat (USA)	fi_FI
dc.publisher.country-code	US
dc.relation.conference	Clinical Natural Language Processing Workshop
dc.source.identifier	https://www.utupub.fi/handle/10024/173182
dc.title	m-Networks: Adapting the Triplet Networks for Acronym Disambiguation
dc.title.book	Proceedings of the 4th Clinical Natural Language Processing Workshop
dc.year.issued	2022

Tiedostot

Näytetään 1 - 1 / 1

Name:: 2022.clinicalnlp-1.3.pdf
Size:: 384.25 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet