Combining supervised and unsupervised named entity recognition to detect psychosocial risk factors in occupational health checks

dc.contributor.authorUronen Leena
dc.contributor.authorSalanterä Sanna
dc.contributor.authorHakala Kai
dc.contributor.authorHartiala Jaakko
dc.contributor.authorMoen Hans
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organizationfi=hoitotieteen laitos|en=Department of Nursing Science|
dc.contributor.organizationfi=lastentautioppi|en=Paediatrics and Adolescent Medicine|
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.27201741504
dc.contributor.organization-code1.2.246.10.2458963.20.40612039509
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.contributor.organization-code2610300
dc.converis.publication-id175069684
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/175069684
dc.date.accessioned2022-10-28T13:49:20Z
dc.date.available2022-10-28T13:49:20Z
dc.description.abstract<p>Introduction: In occupational health checks the information about psychosocial risk factors, which influence work ability, is documented in free text. Early detection of psychosocial risk factors helps occupational health care to choose the right and targeted interventions to maintain work capacity. In this study the aim was to evaluate if we can automate the recognition of these psychosocial risk factors in occupational health check electronic records with natural language processing (NLP). <br></p><p>Materials and methods: We compared supervised and unsupervised named entity recognition (NER) to detect psychosocial risk factors from health checks’ documentation. Occupational health nurses have done these records. <br></p><p>Results: Both methods found over 60% of psychosocial risk factors from the records. However, the combination of BERT-NER (supervised NER) and QExp (query expansion/paraphrasing) seems to be more suitable. In both methods the most (correct) risk factors were found in the work environment and equipment category. <br></p><p>Conclusion: This study showed that it was possible to detect risk factors automatically from free-text documentation of health checks. It is possible to develop a text mining tool to automate the detection of psychosocial risk factors at an early stage<br></p>
dc.identifier.eissn1872-8243
dc.identifier.jour-issn1386-5056
dc.identifier.olddbid184547
dc.identifier.oldhandle10024/167641
dc.identifier.urihttps://www.utupub.fi/handle/11111/50485
dc.identifier.urlhttps://doi.org/10.1016/j.ijmedinf.2022.104695
dc.identifier.urnURN:NBN:fi-fe2022081154671
dc.language.isoen
dc.okm.affiliatedauthorUronen, Leena
dc.okm.affiliatedauthorSalanterä, Sanna
dc.okm.affiliatedauthorHakala, Kai
dc.okm.affiliatedauthorHartiala, Jaakko
dc.okm.affiliatedauthorMoen, Hans
dc.okm.discipline3142 Public health care science, environmental and occupational healthen_GB
dc.okm.discipline3142 Kansanterveystiede, ympäristö ja työterveysfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherElsevier
dc.publisher.countryNetherlandsen_GB
dc.publisher.countryAlankomaatfi_FI
dc.publisher.country-codeNL
dc.relation.articlenumber104695
dc.relation.doi10.1016/j.ijmedinf.2022.104695
dc.relation.ispartofjournalInternational Journal of Medical Informatics
dc.relation.volume160
dc.source.identifierhttps://www.utupub.fi/handle/10024/167641
dc.titleCombining supervised and unsupervised named entity recognition to detect psychosocial risk factors in occupational health checks
dc.year.issued2022

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
1-s2.0-S1386505622000090-main.pdf
Size:
1.7 MB
Format:
Adobe Portable Document Format