Resolving Geogenic and Anthropogenic Sources of Soil Contamination in Central Tanzania Using Probabilistic and Machine Learning Approaches

dc.contributor.authorKazapoe, Raymond Webrah
dc.contributor.authorMvile, Benatus Norbert
dc.contributor.authorKalimenze, John Desderius
dc.contributor.authorSagoe, Samuel Dzidefo
dc.contributor.authorAwog-badek, Darwin Abaanamkadila
dc.contributor.authorFynn, Obed Fiifi
dc.contributor.authorKonate, Sory I.M.
dc.contributor.organizationfi=geologia|en=Geology |
dc.contributor.organization-code1.2.246.10.2458963.20.72020864681
dc.converis.publication-id508322840
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/508322840
dc.date.accessioned2026-01-21T14:44:22Z
dc.date.available2026-01-21T14:44:22Z
dc.description.abstract<p>Soils in mining terrains are subject to complex interactions between geological backgrounds and human activities, often resulting in elevated concentrations of Potentiality Toxic Elements (PTEs). This study applied an integrated framework combining probabilistic pollution indices, positive matrix factorization (PMF), and machine learning (Gradient Boosted Decision Trees and Artificial Neural Networks) to evaluate soil contamination in the Singida mining terrain of Tanzania. A total of 1,884 surface soil samples (0–20 cm) were analyzed for 12 PTEs. Concentrations showed strong heterogeneity, with right-skewed distributions indicating hotspot enrichment. Pb (mean 25.3 mg/kg; 70% > UCC) reflects regional background enrichment with possible localized anthropogenic enhancement, whereas Cd (0.13 mg/kg; 49% > UCC), and As (1.85 mg/kg; 5% > UCC) show stronger anthropogenic influence. Cr (62.6 mg/kg; 18% > UCC), Ni (23.4 mg/kg; 14% > UCC), and V (61.2 mg/kg; 16% > UCC) reflected lithogenic control from mafic–ultramafic lithologies. Probabilistic simulations (20,000 iterations) showed that most soils were low risk with Pollution Load Index (PLI) mean 0.60; Potential Ecological Risk Index (PERI) mean 59.5; and Nemerow Integrated Risk Index (NIRI) mean 29.5, yet ~21% of sites reached moderate to extreme risk categories. PMF resolved two dominant source factors: (i) a lithogenic Ba–Sr–Pb–Cd–Mn assemblage, and (ii) a ferromagnesian–sulphide Cu–Ni–Cr–V–Co–Zn–As assemblage. Machine learning reproduced these factor contributions with high fidelity (R2 = 0.96–0.99), enabling nonlinear sensitivity analysis and identification of dominant predictor elements rather than independent validation of the PMF solution. These findings demonstrate the effectiveness of a combinatorial approach in capturing both deterministic structure and stochastic uncertainty in soil contamination. The results highlight the need for hotspot-targeted remediation, region-specific baselines, and integration of probabilistic monitoring frameworks into environmental policy for mineralized terrains in Sub-Saharan Africa.</p>
dc.format.pagerange771
dc.format.pagerange791
dc.identifier.eissn2288-7962
dc.identifier.jour-issn1225-7281
dc.identifier.olddbid213638
dc.identifier.oldhandle10024/196656
dc.identifier.urihttps://www.utupub.fi/handle/11111/55740
dc.identifier.urlhttps://doi.org/10.9719/eeg.2025.58.6.771
dc.identifier.urnURN:NBN:fi-fe202601215788
dc.language.isoen
dc.okm.affiliatedauthorKalimenze, John
dc.okm.discipline1171 Geosciencesen_GB
dc.okm.discipline1171 Geotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherThe Korean Society of Economic and Environmental Geology
dc.publisher.countryKorea, Republic of (South Korea)en_GB
dc.publisher.countryKorean tasavalta (Etelä-Korea)fi_FI
dc.publisher.country-codeKR
dc.relation.doi10.9719/EEG.2025.58.6.771
dc.relation.ispartofjournalEconomic and Environmental Geology
dc.relation.issue6
dc.relation.volume58
dc.source.identifierhttps://www.utupub.fi/handle/10024/196656
dc.titleResolving Geogenic and Anthropogenic Sources of Soil Contamination in Central Tanzania Using Probabilistic and Machine Learning Approaches
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
kseeg058-06-771.pdf
Size:
6.06 MB
Format:
Adobe Portable Document Format