AI-assisted assessment of the IFSO consensus on obesity management medications in the context of metabolic bariatric surgery

dc.contributor.author: Kermansaravi, Mohammad
dc.contributor.author: Salminen, Paulina
dc.contributor.author: Prager, Gerhard
dc.contributor.author: Cohen, Ricardo V.
dc.contributor.organization: fi=tyks, vsshp|en=tyks, varha|
dc.contributor.organization: fi=kirurgia|en=Surgery|
dc.contributor.organization: fi=InFLAMES Lippulaiva|en=InFLAMES Flagship|
dc.contributor.organization-code: 1.2.246.10.2458963.20.97295082107
dc.contributor.organization-code: 1.2.246.10.2458963.20.68445910604
dc.converis.publication-id: 508658198
dc.converis.url: https://research.utu.fi/converis/portal/Publication/508658198
dc.date.accessioned: 2026-04-24T20:15:57Z
dc.description.abstract: <p> <span>Artificial intelligence (AI) and large language models (LLMs), when combined with human expertise in collaborative intelligence (CI), can enhance medical decision-making, reduce bias in guideline development, and support precision care. New obesity management medications (OMMs) such as GLP-1 receptor agonists and dual incretin mimetics complement metabolic bariatric surgery (MBS) but currently lack clear integration strategies. To address this gap, IFSO released consensus guidelines in 2024. This study evaluates their robustness by comparing expert recommendations with LLM outputs, highlighting the role of AI in assessing and strengthening clinical consensus. Thirty-one IFSO consensus statements were tested across eleven advanced LLMs on June 1, 2025. Models received standardized prompts that required binary “AGREE” or “DISAGREE” outputs, supported by brief, evidence-based rationales. Individual responses were aggregated to form an overall “LLM consensus,” and mean percentage agreement was calculated against the original IFSO expert grades. Fleiss’ kappa quantified inter-model reliability beyond chance. Incorporating the AI responses shifted the consensus grade for 2 of the 31 statements. One statement originally rated A+ was downgraded to A after some LLM outputs indicated disagreement, citing nuanced evidence on pre- and post-MBS OMM use and comparative effectiveness. One statement on combining OMMs with endoscopic therapies was upgraded from C to B due to unanimous support from the LLMs. The remaining 29 statements maintained their original grades, demonstrating strong overall alignment between LLM outputs and expert consensus. Overall concordance between LLMs and experts was 93%, with substantial inter-model agreement (κ = 0.81 [95% CI 0.74–0.87]). Integrating AI, especially LLMs, into collaborative intelligence frameworks strengthens clinical consensus when evidence is limited.
This study also shows that concordance between LLM outputs and expert consensus should not be taken as evidence of objectivity; rather, it may simply reflect overlap between the published evidence base and the models’ training data or retrieval sources.</span> <br></p>
dc.identifier.eissn: 2767-3170
dc.identifier.uri: https://www.utupub.fi/handle/11111/59476
dc.identifier.url: https://doi.org/10.1371/journal.pdig.0001132
dc.identifier.urn: URN:NBN:fi-fe2026022315699
dc.language.iso: en
dc.okm.affiliatedauthor: Salminen, Paulina
dc.okm.affiliatedauthor: Dataimport, tyks, vsshp
dc.okm.discipline: 113 Computer and information sciences [en_GB]
dc.okm.discipline: 113 Tietojenkäsittely ja informaatiotieteet [fi_FI]
dc.okm.discipline: 3126 Surgery, anesthesiology, intensive care, radiology [en_GB]
dc.okm.discipline: 3126 Kirurgia, anestesiologia, tehohoito, radiologia [fi_FI]
dc.okm.internationalcopublication: international co-publication
dc.okm.internationality: International publication
dc.okm.type: A1 ScientificArticle
dc.publisher: Public Library of Science
dc.publisher.country: United States [en_GB]
dc.publisher.country: Yhdysvallat (USA) [fi_FI]
dc.publisher.country-code: US
dc.relation.articlenumber: e0001132
dc.relation.doi: 10.1371/journal.pdig.0001132
dc.relation.ispartofjournal: PLoS Digital Health
dc.relation.issue: 12
dc.relation.volume: 4
dc.title: AI-assisted assessment of the IFSO consensus on obesity management medications in the context of metabolic bariatric surgery
dc.year.issued: 2025

Files

Name: journal.pdig.0001132.pdf
Size: 283.35 KB
Format: Adobe Portable Document Format