Comparative accuracy of two commercial AI algorithms for musculoskeletal trauma detection in emergency radiographs

dc.contributor.authorHuhtanen, Jarno T.
dc.contributor.authorNyman, Mikko
dc.contributor.authorSequeiros
dc.contributor.authorRoberto
dc.contributor.authorBlanco
dc.contributor.authorKoskinen, Seppo K.
dc.contributor.authorPudas, Tomi K.
dc.contributor.authorKajander, Sami
dc.contributor.authorNiemi, Pekka
dc.contributor.authorAronen, Hannu J.
dc.contributor.authorHirvonen, Jussi
dc.contributor.organizationfi=kuvantaminen ja kliininen diagnostiikka|en=Imaging and Clinical Diagnostics|
dc.contributor.organizationfi=tyks, vsshp|en=tyks, varha|
dc.contributor.organization-code1.2.246.10.2458963.20.69079168212
dc.converis.publication-id498946096
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/498946096
dc.date.accessioned2025-08-27T23:06:49Z
dc.date.available2025-08-27T23:06:49Z
dc.description.abstract<p><strong>Purpose: </strong>Missed fractures are the primary cause of interpretation errors in emergency radiology, and artificial intelligence has recently shown great promise in radiograph interpretation. This study compared the diagnostic performance of two AI algorithms, BoneView and RBfracture, in detecting traumatic abnormalities (fractures and dislocations) in MSK radiographs.</p><p><strong>Methods: </strong>AI algorithms analyzed 998 radiographs (585 normal, 413 abnormal), against the consensus of two MSK specialists. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and interobserver agreement (Cohen's Kappa) were calculated. 95% confidence intervals (CI) assessed robustness, and McNemar's tests compared sensitivity and specificity between the AI algorithms.</p><p><strong>Results: </strong>BoneView demonstrated a sensitivity of 0.893 (95% CI: 0.860-0.920), specificity of 0.885 (95% CI: 0.857-0.909), PPV of 0.846, NPV of 0.922, and accuracy of 0.889. RBfracture demonstrated a sensitivity of 0.872 (95% CI: 0.836-0.901), specificity of 0.892 (95% CI: 0.865-0.915), PPV of 0.851, NPV of 0.908, and accuracy of 0.884. No statistically significant differences were found in sensitivity (p = 0.151) or specificity (p = 0.708). Kappa was 0.81 (95% CI: 0.77-0.84), indicating almost perfect agreement between the two AI algorithms. Performance was similar in adults and children. Both AI algorithms struggled more with subtle abnormalities, which constituted 66% and 70% of false negatives but only 20% and 18% of true positives for the two AI algorithms, respectively (p < 0.001).</p><p><strong>Conclusions: </strong>BoneView and RBfracture exhibited high diagnostic performance and almost perfect agreement, with consistent results across adults and children, highlighting the potential of AI in emergency radiograph interpretation.</p>
dc.identifier.eissn1438-1435
dc.identifier.jour-issn1070-3004
dc.identifier.olddbid203413
dc.identifier.oldhandle10024/186440
dc.identifier.urihttps://www.utupub.fi/handle/11111/35040
dc.identifier.urlhttps://link.springer.com/article/10.1007/s10140-025-02353-2
dc.identifier.urnURN:NBN:fi-fe2025082790101
dc.language.isoen
dc.okm.affiliatedauthorHuhtanen, Jarno
dc.okm.affiliatedauthorNyman, Mikko
dc.okm.affiliatedauthorBlanco Sequeiros, Roberto
dc.okm.affiliatedauthorKajander, Sami
dc.okm.affiliatedauthorNiemi, Pekka
dc.okm.affiliatedauthorAronen, Hannu
dc.okm.affiliatedauthorHirvonen, Jussi
dc.okm.affiliatedauthorDataimport, tyks, vsshp
dc.okm.discipline3126 Surgery, anesthesiology, intensive care, radiologyen_GB
dc.okm.discipline3126 Kirurgia, anestesiologia, tehohoito, radiologiafi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherSPRINGER HEIDELBERG
dc.publisher.countryGermanyen_GB
dc.publisher.countrySaksafi_FI
dc.publisher.country-codeDE
dc.publisher.placeHEIDELBERG
dc.relation.doi10.1007/s10140-025-02353-2
dc.relation.ispartofjournalEmergency Radiology
dc.source.identifierhttps://www.utupub.fi/handle/10024/186440
dc.titleComparative accuracy of two commercial AI algorithms for musculoskeletal trauma detection in emergency radiographs
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Huhtanen_etal_Comparative_accuracy_2025.pdf
Size:
1.8 MB
Format:
Adobe Portable Document Format