An experimental comparison of cross-validation techniques for estimating the area under the ROC curve

dc.contributor.authorAirola A
dc.contributor.authorPahikkala T
dc.contributor.authorWaegeman W
dc.contributor.authorDe Baets B
dc.contributor.authorSalakoski T
dc.contributor.organizationfi=kieli- ja puheteknologia|en=Language and Speech Technology|
dc.contributor.organizationfi=tietojenkäsittelytiede|en=Computer Science|
dc.contributor.organization-code1.2.246.10.2458963.20.23479734818
dc.contributor.organization-code1.2.246.10.2458963.20.47465613983
dc.converis.publication-id2930618
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/2930618
dc.date.accessioned2025-08-28T01:28:50Z
dc.date.available2025-08-28T01:28:50Z
dc.description.abstractReliable estimation of the classification performance of inferred predictive models is difficult when working with small data sets. Cross-validation is in this case a typical strategy for estimating the performance. However, many standard approaches to cross-validation suffer from extensive bias or variance when the area under the ROC curve (AUC) is used as the performance measure. This issue is explored through an extensive simulation study. Leave-pair-out cross-validation is proposed for conditional AUC-estimation, as it is almost unbiased, and its deviation variance is as low as that of the best alternative approaches. When using regularized least-squares based learners, efficient algorithms exist for calculating the leave-pair-out cross-validation estimate.<br>
dc.format.pagerange1828
dc.format.pagerange1844
dc.identifier.jour-issn0167-9473
dc.identifier.olddbid207605
dc.identifier.oldhandle10024/190632
dc.identifier.urihttps://www.utupub.fi/handle/11111/54014
dc.identifier.urnURN:NBN:fi-fe2025082787725
dc.language.isoen
dc.okm.affiliatedauthorAirola, Antti
dc.okm.affiliatedauthorPahikkala, Tapio
dc.okm.affiliatedauthorSalakoski, Tapio
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherELSEVIER SCIENCE BV
dc.publisher.countryNetherlandsen_GB
dc.publisher.countryAlankomaatfi_FI
dc.publisher.country-codeNL
dc.relation.doi10.1016/j.csda.2010.11.018
dc.relation.ispartofjournalComputational Statistics and Data Analysis
dc.relation.issue4
dc.relation.volume55
dc.source.identifierhttps://www.utupub.fi/handle/10024/190632
dc.titleAn experimental comparison of cross-validation techniques for estimating the area under the ROC curve
dc.year.issued2011

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
An_experimental_comparison_of_cross-validation_techniques_for_estimating_the_area_under_the_ROC_curve.pdf
Size:
743.82 KB
Format:
Adobe Portable Document Format