Does ChatGPT Ignore Article Retractions and Other Reliability Concerns?

dc.contributor.author: Thelwall, Mike
dc.contributor.author: Lehtisaari, Marianna
dc.contributor.author: Katsirea, Irini
dc.contributor.author: Holmberg, Kim
dc.contributor.author: Zheng, Er‐Te
dc.contributor.organization: fi=taloussosiologia|en=Economic Sociology|
dc.contributor.organization-code: 1.2.246.10.2458963.20.82939713796
dc.converis.publication-id: 499720452
dc.converis.url: https://research.utu.fi/converis/portal/Publication/499720452
dc.date.accessioned: 2026-01-21T12:09:17Z
dc.date.available: 2026-01-21T12:09:17Z
dc.description.abstract: Large language models (LLMs) like ChatGPT seem to be increasingly used for information seeking and analysis, including to support academic literature reviews. To test whether the results might sometimes include retracted research, we identified 217 retracted or otherwise concerning academic studies with high altmetric scores and asked ChatGPT 4o-mini to evaluate their quality 30 times each. Surprisingly, none of its 6510 reports mentioned that the articles were retracted or had relevant errors, and it gave 190 relatively high scores (world leading, internationally excellent, or close). The 27 articles with the lowest scores were mostly accused of being weak, although the topic (but not the article) was described as controversial in five cases (e.g., about hydroxychloroquine for COVID-19). In a follow-up investigation, 61 claims were extracted from retracted articles from the set, and ChatGPT 4o-mini was asked 10 times whether each was true. It gave a definitive yes or a positive response two-thirds of the time, including for at least one statement that had been shown to be false over a decade ago. The results therefore emphasise, from an academic knowledge perspective, the importance of verifying information from LLMs when using them for information seeking or analysis.
dc.identifier.eissn: 1741-4857
dc.identifier.jour-issn: 0953-1513
dc.identifier.olddbid: 212171
dc.identifier.oldhandle: 10024/195189
dc.identifier.uri: https://www.utupub.fi/handle/11111/40356
dc.identifier.url: https://doi.org/10.1002/leap.2018
dc.identifier.urn: URN:NBN:fi-fe202601216605
dc.language.iso: en
dc.okm.affiliatedauthor: Lehtisaari, Marianna
dc.okm.affiliatedauthor: Holmberg, Kim
dc.okm.discipline: 113 Computer and information sciences [en_GB]
dc.okm.discipline: 5141 Sociology [en_GB]
dc.okm.discipline: 113 Tietojenkäsittely ja informaatiotieteet [fi_FI]
dc.okm.discipline: 5141 Sosiologia [fi_FI]
dc.okm.internationalcopublication: international co-publication
dc.okm.internationality: International publication
dc.okm.type: A1 ScientificArticle
dc.publisher: Wiley
dc.publisher.country: United Kingdom [en_GB]
dc.publisher.country: Britannia [fi_FI]
dc.publisher.country-code: GB
dc.relation.articlenumber: e2018
dc.relation.doi: 10.1002/leap.2018
dc.relation.ispartofjournal: Learned Publishing
dc.relation.issue: 4
dc.relation.volume: 38
dc.source.identifier: https://www.utupub.fi/handle/10024/195189
dc.title: Does ChatGPT Ignore Article Retractions and Other Reliability Concerns?
dc.year.issued: 2025

Files
Name: Learned Publishing - 2025 - Thelwall - Does ChatGPT Ignore Article Retractions and Other Reliability Concerns.pdf
Size: 397.55 KB
Format: Adobe Portable Document Format