Machine Learning and Clinical Text. Supporting Health Information Flow

Suominen, Hanna

Machine Learning and Clinical Text. Supporting Health Information Flow

dc.contributor	Matemaattis-luonnontieteellinen tiedekunta / Faculty of Mathematics and Natural Sciences, Department of Information Technology	-
dc.contributor.author	Suominen, Hanna
dc.contributor.department	fi=Tulevaisuuden teknologioiden laitos\|en=Department of Future Technologies\|
dc.contributor.faculty	fi=Matemaattis-luonnontieteellinen tiedekunta\|en=Faculty of Mathematics and Natural Sciences\|	-
dc.date.accessioned	2009-11-30T10:14:49Z
dc.date.available	2009-11-30T10:14:49Z
dc.date.issued	2009-12-15
dc.description.abstract	Fluent health information flow is critical for clinical decision-making. However, a considerable part of this information is free-form text and inabilities to utilize it create risks to patient safety and cost-effective hospital administration. Methods for automated processing of clinical text are emerging. The aim in this doctoral dissertation is to study machine learning and clinical text in order to support health information flow.First, by analyzing the content of authentic patient records, the aim is to specify clinical needs in order to guide the development of machine learning applications.The contributions are a model of the ideal information flow,a model of the problems and challenges in reality, and a road map for the technology development. Second, by developing applications for practical cases,the aim is to concretize ways to support health information flow. Altogether five machine learning applications for three practical cases are described: The first two applications are binary classification and regression related to the practical case of topic labeling and relevance ranking.The third and fourth application are supervised and unsupervised multi-class classification for the practical case of topic segmentation and labeling.These four applications are tested with Finnish intensive care patient records.The fifth application is multi-label classification for the practical task of diagnosis coding. It is tested with English radiology reports.The performance of all these applications is promising. Third, the aim is to study how the quality of machine learning applications can be reliably evaluated.The associations between performance evaluation measures and methods are addressed,and a new hold-out method is introduced.This method contributes not only to processing time but also to the evaluation diversity and quality. The main conclusion is that developing machine learning applications for text requires interdisciplinary, international collaboration. Practical cases are very different, and hence the development must begin from genuine user needs and domain expertise. The technological expertise must cover linguistics,machine learning, and information systems. Finally, the methods must be evaluated both statistically and through authentic user-feedback.	en
dc.description.accessibilityfeature	ei tietoa saavutettavuudesta
dc.description.notification	Siirretty Doriasta
dc.format.content	fulltext
dc.identifier	ISBN 978-952-12-2375-4	en
dc.identifier.olddbid	53155
dc.identifier.oldhandle	10024/50510
dc.identifier.uri	https://www.utupub.fi/handle/11111/28147
dc.language.iso	eng	eng
dc.publisher	Turku Centre for Computer Science
dc.relation.ispartofseries	TUCS Dissertations
dc.relation.issn	1239-1883
dc.relation.numberinseries	125	-
dc.source.identifier	https://www.utupub.fi/handle/10024/50510
dc.title	Machine Learning and Clinical Text. Supporting Health Information Flow	en
dc.type.ontasot	fi=Artikkeliväitöskirja\|en=Doctoral dissertation (article-based)\|	en

Tiedostot

Näytetään 1 - 2 / 2

Name:: TUCS125Suominen.pdf
Size:: 9.73 MB
Format:: Adobe Portable Document Format

Lataa

Name:: Errata.pdf
Size:: 294.89 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Väitöskirjat