Clustering Nursing Sentences - Comparing Three Sentence Embedding Methods
| dc.contributor.author | Moen Hans | |
| dc.contributor.author | Suhonen Henry | |
| dc.contributor.author | Salanterä Sanna | |
| dc.contributor.author | Salakoski Tapio | |
| dc.contributor.author | Peltonen Laura-Maria | |
| dc.contributor.organization | fi=hoitotieteen laitos|en=Department of Nursing Science| | |
| dc.contributor.organization | fi=matemaattis-luonnontieteellinen tiedekunta|en=Faculty of Science| | |
| dc.contributor.organization | fi=matematiikan ja tilastotieteen laitos|en=Department of Mathematics and Statistics| | |
| dc.contributor.organization | fi=tyks, vsshp|en=tyks, varha| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.27201741504 | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.36798383026 | |
| dc.contributor.organization-code | 2607400 | |
| dc.converis.publication-id | 178641781 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/178641781 | |
| dc.date.accessioned | 2025-08-28T02:30:25Z | |
| dc.date.available | 2025-08-28T02:30:25Z | |
| dc.description.abstract | <p>In health sciences, high-quality text embeddings may augment qualitative data analysis of large amounts of text by enabling, e.g., searching and clustering of health information. This study aimed to evaluate three different sentence-level embedding methods in clustering sentences in nursing narratives from individual patients' hospital care episodes. Two of these embeddings are generated from language models based on the BERT framework, and the third on the Sent2Vec method. These embedding methods were used to cluster sentences from 20 patient care episodes and the results were manually evaluated. Findings suggest that the best clusters were produced by the embeddings from a BERT model fine-tuned for the proxy task of predicting subject headings for nursing text.<br></p> | |
| dc.format.pagerange | 854 | |
| dc.format.pagerange | 858 | |
| dc.identifier.eisbn | 978-1-64368-285-3 | |
| dc.identifier.isbn | 978-1-64368-284-6 | |
| dc.identifier.issn | 0926-9630 | |
| dc.identifier.olddbid | 209211 | |
| dc.identifier.oldhandle | 10024/192238 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/40784 | |
| dc.identifier.url | https://ebooks.iospress.nl/doi/10.3233/SHTI220606 | |
| dc.identifier.urn | URN:NBN:fi-fe2023022128021 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Suhonen, Henry | |
| dc.okm.affiliatedauthor | Salanterä, Sanna | |
| dc.okm.affiliatedauthor | Salakoski, Tapio | |
| dc.okm.affiliatedauthor | Peltonen, Laura-Maria | |
| dc.okm.affiliatedauthor | Dataimport, tyks, vsshp | |
| dc.okm.affiliatedauthor | Dataimport, Matematiikan ja tilastotieteen lait yht | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 316 Nursing | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.discipline | 316 Hoitotiede | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | Netherlands | en_GB |
| dc.publisher.country | Alankomaat | fi_FI |
| dc.publisher.country-code | NL | |
| dc.relation.conference | Medical Informatics Europe | |
| dc.relation.doi | 10.3233/SHTI220606 | |
| dc.relation.ispartofjournal | Medical informatics Europe | |
| dc.relation.ispartofseries | Studies in Health Technology and Informatics | |
| dc.relation.volume | 294 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/192238 | |
| dc.title | Clustering Nursing Sentences - Comparing Three Sentence Embedding Methods | |
| dc.title.book | Challenges of Trustable AI and Added-Value on Health | |
| dc.year.issued | 2022 |
Tiedostot
1 - 1 / 1