Mathematical Modelling and Predictive Analytics for Antimicrobial Resistance Dynamics: A Scoping Review

Anu Mahajan and Sumit Sharma ORCiD
Research Scientist, Nutralytics Edtech LLP, Pune, Maharashtra, India
Correspondence to: Dr. Sumit Sharma, drsumits@outlook.com

Premier Journal of Data Science

Additional information

  • Ethical approval: N/a
  • Consent: N/a
  • Funding: REMIT – Research on Economics, Management and Information Technologies
  • Conflicts of interest: N/a
  • Author contribution: Sumit Sharma – Conceptualization, Writing – original draft, review and editing
  • Guarantor: Sumit Sharma
  • Provenance and peer-review: Unsolicited and externally peer-reviewed
  • Data availability statement: This is a review paper, and we have already submitted the data in paper itself. But if data is required during the peer review process, we will make it available

Keywords: Compartmental AMR transmission models, Gram-negative priority pathogens, Machine learning resistance prediction, One Health antimicrobial modelling, Trace validation framework.

Peer Review
Received: 19 March 2026
Last revised: 08 May 2026
Accepted: 10 May 2026
Version accepted: 5
Published: 17 May 2026

Plain Language Summary Infographic
“Mathematical Modelling and Predictive Analytics for Antimicrobial Resistance Dynamics: A Scoping Review” illustrating the integration of mechanistic transmission models and machine learning approaches for understanding, predicting, and managing antimicrobial resistance (AMR), featuring compartmental transmission models, predictive analytics workflows, One Health integration across human, animal, and environmental health, and hybrid mechanistic–ML frameworks for AMR forecasting, while highlighting predictive performance metrics, WHO GLASS alignment, data sources, policy applications, surveillance systems, and key challenges related to validation, transparency, reproducibility, and implementation in global AMR control strategies.
Abstract

This scoping review is a synthesis of the evidence based on the existing systematic and scoping reviews on mathematical modelling and predictive analytics of antimicrobial resistance (AMR) dynamics. It seeks to trace the use of mechanistic transmission models and machine-learning (ML) methods to understand the emergence, spread and prediction of AMR in pathogens, hosts and settings. A total of 10 studies were included: six systematic reviews of mechanistic or transmission models, three systematic reviews of ML-based resistance prediction and one narrative review, providing a broader conceptual context on AMR modelling.

Mechanistic examinations reveal that deterministic compartmental model structures in human medical centres or community contexts prevail with a paucity of One Health consolidation, limited external validation and under-representation of WHO priority Gram-negative organisms and low and middle-income countries. ML model reviews state that patient-level resistance prediction has promising discriminative performance (summary area under the receiver operating characteristic curve (AUC) of approximately 0.78–0.82 across ML-based prediction systematic reviews) based on heterogeneous data pipelines.

However, the studies have mainly been applied in a retrospective, single-centre design, and are likely to be susceptible to bias and lack prospective impact assessment. In both fields, major gaps are associated with validation, transparency, reproducibility, coverage of pathogens and settings and correlation with policy-relevant economic and decision measures. Development of hybrid mechanistic-ML frameworks and explicit One Health-oriented modelling strategies, aligned with WHO GLASS and national AMR action plans, are required to strengthen AMR policy and clinical decision-making.

Introduction

Antimicrobial resistance (AMR) is one of the most pressing threats to global public health. It occurs when pathogens acquire resistance to antimicrobial agents, rendering the effective treatment of common infections increasingly compromised.1 According to recent estimates, bacterial AMR was directly attributable to approximately 1.27 million deaths globally in 2019, with an additional 4.95 million deaths associated with AMR; projections suggest this burden could rise substantially by mid-century.2 Addressing AMR effectively requires understanding its multifaceted determinants.3

Over the past decade, a substantial body of literature has examined AMR dynamics through mechanistic modelling.3 For instance, Niewiadomska et al. found 273 population-level AMR models (2006–2016), 66% of which were deterministic, and 76% compartmental.4 These models concentrated primarily on a few human pathogens (HIV, TB, malaria, MRSA) and frequently neglected such sectors as agriculture or the environment. This limited scope prompted calls for broader, more integrative modelling frameworks.4

In parallel, data-driven predictive approaches have emerged. These are time-series analyses and machine learning (ML) models that have been trained on surveillance or clinical data. ML algorithms (supervised and unsupervised) have been effectively used to forecast resistance patterns and inform therapeutic decision-making.5 For instance, statistical time-series models (ARIMA, SARIMA) have been applied to predict the trend of resistance using previous data, whereas algorithms such as random forests and gradient boosting have been applied to predict antibiotic susceptibility from electronic health records. Both approaches are expected to enhance our capacity to predict AMR and assess control strategies.6,7

This review-of-reviews synthesises evidence from published systematic and scoping reviews on AMR modelling and prediction (2018–2025 for formal database searches; supplemented by a preprint repository hand-search to January 2026). It aimed to: (i) characterise model types and underlying assumptions; (ii) identify common parameters and data sources; (iii) describe prediction methodologies and reported performance metrics; (iv) quantify evidence overlap using corrected covered area analysis and (v) delineate persistent research gaps and future priorities, including practical guidance for decision-makers.

Methodology

The scoping review was developed to map and synthesise existing literature on mathematical modelling and predictive analytics in analysing AMR dynamics. The methodological framework used in the review was that suggested by Arksey and O’Malley and expanded by Levac et al., and the review was presented in line with the Preferred Reporting Items of Systematic Reviews and Meta-Analyses Extension to Scoping Reviews (PRISMA-ScR). The search strategy, including full Boolean search strings and date limits, was conducted in the major electronic databases, such as PubMed, Scopus and Web of Science, to locate the literature on the topic published within the range of 2018–2025 for formal database searches, supplemented by the January 2026 preprint repository hand-search described above.

Search terms combined MeSH headings and free-text keywords for antimicrobial resistance with modelling methods (mathematical modelling, transmission models, predictive analytics, machine learning, forecasting). Formal database searches were completed on January 15, 2025 (PubMed), January 16, 2025 (Scopus) and January 17, 2025 (Web of Science). A supplementary hand-search of preprint repositories (arXiv, medRxiv) was conducted on January 20, 2026, identifying one additional preprint (Schardong et al., 2026),1 which is labelled and analysed separately from the peer-reviewed evidence base. Full Boolean search strings are provided in Supplementary File S1, and the completed PRISMA-ScR checklist in Supplementary File S2.

Grey literature was not systematically searched; this decision is acknowledged as a limitation, as relevant WHO and ECDC technical reports, unpublished national AMR action-plan modelling studies and conference abstracts may not have been captured. Eligible studies were peer-reviewed systematic or scoping reviews examining mathematical models, simulation models, or predictive analytics applied to AMR dynamics in human, animal or environmental contexts. Protocols, preprints and narrative reviews were included only where they provided essential conceptual context; such studies are labelled explicitly and analysed separately. Editorials, commentaries and studies not focused on AMR modelling were excluded.

Titles and abstracts were screened independently by two reviewers (co-authors), followed by full-text evaluation of potentially eligible articles by the same reviewers. Disagreements were resolved by discussion, with adjudication by a third reviewer where consensus could not be reached. A calibration exercise on a random 10% sample of titles and abstracts was performed prior to full screening to ensure consistent application of eligibility criteria. The essential data extracted from included reviews were the type of models, the pathogens studied, the data sources, the scale of modelling, the validation techniques and the reported results. Given that two of the included studies are a preprint (Schardong et al., 2026)1 and a protocol (Acharya et al., 2023),8 their inclusion requires explicit methodological justification.

The preprint (Schardong et al., 2026)1 was identified through the supplementary January 2026 preprint repository search and included for its contemporaneous mapping of mathematical AMR models; its findings are consistent with peer-reviewed syntheses and it is labelled and analysed separately. The protocol (Acharya et al., 2023)8 was included solely for contextual framing of One Health decision-support endpoints. A formal sensitivity analysis confirmed that excluding both studies does not alter the primary conclusions regarding dominant model types, validation deficits, pathogen coverage gaps or LMIC underrepresentation. Narrative synthesis was used to identify key modelling strategies, trends and research gaps in AMR modelling. The review protocol was not prospectively registered in PROSPERO or OSF; at the time of initiation, neither platform formally accommodated scoping reviews of reviews. This is acknowledged as a limitation.

Formal quality appraisal of included reviews using AMSTAR-29 or ROBIS10 was not performed, consistent with JBI guidance for scoping reviews of reviews,11 which distinguishes scoping overviews from full systematic overviews requiring mandatory critical appraisal. The primary aim was to map review-level evidence rather than weight conclusions by review quality; this omission is explicitly acknowledged in the Limitations section. A light-touch descriptive appraisal was nonetheless conducted across four key domains for each included review: (i) prospective protocol registration; (ii) dual independent screening; (iii) performance of sensitivity or validation analysis and (iv) conflict-of-interest declaration. Results are summarised in Table S1 (Supplementary File S4). To quantify evidence redundancy, a citation overlap analysis was performed: the corrected covered area (CCA) was calculated across all pairwise combinations of included reviews based on their primary study reference lists.

The CCA formula applied was: CCA = (A − r)/[N × r × (r − 1)/2], where A is the total number of citations across all reviews counting duplicates, r is the number of included reviews and N is the total number of unique citations. The median pairwise CCA was 0.07 (range 0.00–0.19), indicating slight-to-moderate overlap (CCA ≤ 0.10 = slight; 0.11–0.20 = moderate), consistent with the reviews addressing broadly complementary evidence bases. Slight-to-moderate overlap indicates that whilst some primary studies are shared—particularly foundational transmission-modelling datasets cited across Niewiadomska et al.4 and Brinch et al.12—the reviewed evidence bases are substantially distinct, reducing but not eliminating the risk of double-counting signals. Overlapping citations are flagged in the synthesis where relevant. A pairwise CCA matrix for all included reviews is provided in Supplementary File S5. A PRISMA-ScR-compliant list of excluded full-text records with reasons for exclusion is provided in Supplementary File S3.

Results

A total of 10 studies were included in this scoping review. Among them, nine were systematic reviews examining mathematical modelling and predictive analytics approaches for antimicrobial resistance, while one study was a narrative review included to provide additional conceptual context and support the interpretation of modelling frameworks (Figure 1).

Flowchart depicting the process of identifying, screening, and including studies in a scoping review on antimicrobial resistance (AMR) modeling. Steps include initial records identified, duplicates removed, records screened, full-text articles assessed, and final studies included.
Figure 1: PRISMA-ScR flow diagram illustrating the study selection process for the scoping review of mathematical modelling and predictive analytics studies on antimicrobial resistance.

A scoping review of dynamic models of antibacterial use and resistance conducted by Ramsay et al.13 demonstrated that 81 studies had employed dynamic simulation to analyse bacterial resistance in association with antibacterial use in human and animal populations, with more focus on aggregate compartmental models in hospital and community settings and little transparency in the assumptions and uncertainty analysis.13 In a systematic review of mathematical and simulation modelling of AMR development and spread, examining 38 models applied across human, animal and environmental contexts, Birkegård and colleagues found that none of the models met the TRACE good-practice framework and that validation and uncertainty treatment were frequently inadequate.14

Niewiadomska et al. conducted a comprehensive systematic review of population-level AMR transmission modelling of bacterial, viral, parasitic and fungal pathogens between 2006 and 2016 (including 273 modelling studies), identifying a preponderance of compartmental, deterministic models focused on MRSA, tuberculosis, HIV, influenza and malaria with limited representation of WHO priority bacterial pathogens such as carbapenem–resistant Enterobacteriaceae.4 A more specific systematic review by Leclerc et al. looked at 43 mathematical models of horizontal gene transfer (HGT) of AMR genes and found that most had concentrated on conjugation in Escherichia coli in vitro and that transformation, transduction, multiple independent resistance genes and explicit antibiotic effects were not modelled often despite their significance in AMR evolution.15

Based on these previous syntheses, Schardong et al.1 [preprint; identified via supplementary January 2026 preprint repository search] mapped 36 mathematical modelling studies.1 In a systematic review of 170 AMR transmission models published in 2010–2022, Brinch et al. evaluated them using the TRACE framework and demonstrated that only approximately one-third reported sensitivity analyses and external validation, and none reported implementation verification of the model code (TRACE criterion 5).12 Moreover, Acharya et al. [protocol] published a protocol for a scoping review examining AMR model elements for One Health decision-making.8

In a systematic review and meta-analysis on machine-learning models to predict AMR, Tang et al. identified 25 studies until the end of 2021 that utilised ML or risk scores to predict resistance (ML often ESBLs, MRSA or carbapenem resistance) and reported a pooled area under the ROC curve (AUC) of 0.82 in models that used machine-learning to predict resistance (with most models generally having higher specificity but similar sensitivity to traditional risk scores).16

Ardila and colleagues focused on ML for predicting resistance in WHO critical and high-priority bacterial pathogens using real-world antimicrobial susceptibility test data, ultimately including 21 observational cohort studies with over 688,000 patients and 1.7 million susceptibility tests and found that gradient-boosted decision trees, random forests and XGBoost consistently outperformed logistic regression, but that retrospective design, non-standardised preprocessing and lack of trial-level validation limited clinical translation.17 An updated systematic review and meta-analysis of ML-based antibiotic resistance prediction models by Lv and Wang found that ML models have good discriminative ability (summary AUC of around 0.78); however, the quality of their methods is heterogeneous, and there is a high risk of bias and publication bias among implicit studies (Figure 2).18

Fig 2 | Conceptual framework of mathematical modelling and predictive analytics approaches used to study antimicrobial resistance dynamics
Figure 2: Conceptual framework of mathematical modelling and predictive analytics approaches used to study antimicrobial resistance dynamics.
Mechanistic Models of AMR Dynamics

Model Types, Scales and Settings

Among mechanistic modelling reviews, the majority of AMR models are population-level compartmental models and are given as deterministic ordinary differential equations and typically represent hospital or community systems with homogeneous mixing and few compartments that represent the susceptible and resistant colonisation or infection states.4,13 Stochastic and individual-based (agent-based) models are less common; however, several reviews note that these approaches better accommodate heterogeneity, stochastic events and complex contact structures, especially when the population size is small, like a hospital ward.

Niewiadomska et al. present results that 76% of the 273 models they examined were compartmental, 66% deterministic and none of them was individual-based4 whereas Brinch et al. find that 78.8% of 170 models were population-based and 62.9% deterministic.12 Ramsay et al. identified that all dynamic models in their scoping review were aggregate and not individual-based, and Birkegård et al. also observed that the majority of the models represent a homogenous population with limited explicit contact network or nested host-microbiota structure representation.13

Pathogens, Hosts and One Health Representation

According to Niewiadomska et al., in the period between 2006 and 2016, over 70% of AMR transmission models involved MRSA, tuberculosis, HIV, influenza and malaria, with very few models addressing the WHO priority bacteria like carbapenem-resistant Enterobacteriaceae, Acinetobacter baumannii or Pseudomonas aeruginosa.4

Similar gaps were identified by Brinch et al., who list Mycobacterium tuberculosis (39 models) and Staphylococcus aureus (27 models) as the most frequently modelled transmission models, with common community-acquired infections like Salmonella spp. and Campylobacter spp., or Helicobacter pylori, rarely modelled.12 In reviews, most models are focused on human hosts, very few are based on animals or the environment, although inter-sectoral transmission of AMR is clear; Ramsay et al. identified only a few models based on an inter-host model outside human health care and community, and Niewiadomska et al. found that only 2% of models are focused on human–animal transmission.13

Horizontal Gene Transfer and Within-Host Processes

The specialised review of the HGT models by Leclerc et al. shows that the published literature has centred primarily on conjugation in Escherichia coli in vitro, and did not typically take into account actual exposure to antibiotics or multiple and independent AMR genes. Only 43 modelling studies of HGT followed more than one AMR gene independently, and very few followed transformation or transduction, even though all of these have been shown experimentally to have significant roles in resistance evolution in some species.15 Within-host dynamics—encompassing pharmacokinetics/ pharmacodynamics (PK/PD), fitness costs of resistance and immune responses—are represented in only a minority of the synthesised models, typically within more complex within-host or hybrid frameworks. These processes are consistently underutilised relative to their established importance in resistance emergence and persistence.16 Recent scoping work by Schardong et al. identifies growing interest in integrating PK/PD and within-host bacterial dynamics into broader AMR frameworks. However, data scarcity continues to drive simplification of these components in most modelling studies.1

Interventions Modelled and Policy Relevance

The effect of hypothetical interventions on the dynamics of AMR is often studied using dynamic models. According to Ramsay et al., 73 out of 81 included studies investigated at least one intervention, with most studies investigating changes in the overall consumption of antibiotics, switching of classes or altering the management of antibiotics, such as cycling, mixing or combination therapy, as well as infection-control measures, including improved hygiene or isolation.13

Niewiadomska et al. concluded that approximately half of AMR-specific modelled interventions focus on curtailing transmission of resistant pathogens (i.e., by infection control), with only a minority explicitly targeting de novo emergence of resistance, and only very few models taking into account alternative therapeutics, vaccines or behaviour modification in reducing antimicrobial use.4 Brinch et al. reported an increasing number of models assessing vaccines and (to a smaller degree) monoclonal antibodies as interventions against AMR, but noted that the results of models are very heterogeneous (prevalence, incidence, basic reproduction number or mortality) and are rarely combined with economic assessment, which restricts their comparability and direct policy relevance.12

Model Validation, Uncertainty and Reporting

Application of TRACE and Validation Practice

Poor validation of models and the lack of full reporting of assumptions and data sources are consistent findings across mechanistic modelling reviews. Birkegård et al. applied the TRACE framework to find that none of the 38 AMR models they reviewed met any of the good-practice criteria of problem formulation, model description, data evaluation, internal and external (out-of-sample) validation, sensitivity analysis and output corroboration, and that only a portion of them did any formal sensitivity analysis and very few of them carried out any external (out-of-sample) validation with empirical data.14

Brinch et al. generalised this analysis to 170 transmission models, discovering that only 39 reported some type of validation and performed sensitivity analysis, and none reported the implementation verification of the model code (TRACE criterion 5), such that potential programming errors and numerical artefacts remained undetected.12 Calibration to epidemiological data was also done in approximately 43% of the models and out-of-sample (external) validation in only 14% in the more general review of Niewiadomska et al., indicating that intervention projections are frequently generated from partially calibrated or unvalidated model structures.4

Treatment of Uncertainty and Heterogeneity

Deterministic compartmental models with fixed point-estimate parameters predominate and formal sensitivity analyses are inconsistently applied. Ramsay et al. reported that slightly more than half of the dynamic models in their scoping review did any sensitivity analysis,13 and Birkegård et al. also noted that most models do not consider stochasticity and heterogeneity despite the inherently noisy nature of AMR transmission processes.14 Leclerc et al. observed that the majority of HGT models presuppose constant rates of transfer and growth and have not properly addressed the variability in the parameters or patterns of antibiotic exposure that can change the prevalence and spread of resistance genes, especially in realistic in vivo settings.15 In agent-based or stochastic models, it has been pointed out that they may be useful in capturing rare events (such as extinction or invasion of resistance), and heterogeneous contact structures, but these methods are the exception rather than the rule in the reviewed literature.13

Predictive Analytics and Machine-Learning Models

Data Sources and Prediction Targets

Most systematic reviews of antimicrobial resistance (AMR) prediction models are patient-level, that is, they predict the likelihood of a certain resistance phenotype (or antimicrobial susceptibility test (AST) result) based on patient-level variables, as opposed to population-level prediction of resistance prevalence. Tang et al. compared the prediction of resistance of individual patients or isolates to selected agents, including methicillin, carbapenems or ESBL-producing status, based on clinical, microbiological and in some cases genomic data using ML algorithms with conventional risk-score methods.16

Ardila et al. concentrated on the practical healthcare environment and the WHO critical and high-priority pathogens, and reviewed 21 cohort studies, which applied large hospital AST datasets to predict resistance at the point of care using a ML model.17 Lv and Wang compiled a wider range of ML-based antibiotic resistance prediction research (the precise number of which is not specified in the abstract) on a variety of pathogens and datasets. Their combined diagnostic performance outcomes also focus on individual-level categorisation as opposed to ecological prediction of the prevalence of the resistance.18

Methodological Limitations and Risk of Bias

The systematic reviews all find common methodological shortcomings in studies of ML-based AMR prediction. Common limitations include retrospective single-centre designs, non-standardised preprocessing, selective incorporation of features, small or biased datasets, scant external (out-of-sample) validation, and reporting centred on AUC as opposed to calibration or clinical-utility measures. Tang et al. and Ardila et al. observed that few ML models have been prospectively assessed or implemented in clinical decision-support systems, and thus their practical implications on prescribing behaviour and patient outcomes are unknown.16

The risk-of-bias evaluations by Lv and Wang show that not all studies are of low risk of bias, with many studies found to be at moderate-to-high risk of bias in the areas of participant selection and analysis, indicating that the reported performance may be overestimated by the generalisable accuracy.18 Similarly, the literature review of ML to address AMR, and the reviews presented in Clinical Microbiology Reviews, propose a better study design, disclosed reporting, open data and code and prediction-model reporting criteria (TRIPOD,19 PROBAST20 and their AI-adapted extensions (TRIPOD-AI/PROBAST-AI) where ML methods are discussed).21

Cross-Cutting Gaps and Research Priorities

Pathogen and Setting Coverage

Across both mechanistic and ML modelling traditions, certain pathogens, settings and geographic regions remain markedly under-represented. High-burden LMIC settings—particularly sub-Saharan Africa and parts of South Asia are rarely the focus of modelling studies, despite carrying the greatest global AMR burden. Long-term care facilities, informal healthcare settings and the hospital–community–-agriculture–environment interface are similarly neglected, despite their centrality to a credible One Health approach. Both modelling traditions show limited coverage of WHO critical–priority Gram-negative pathogens outside acute hospital contexts, and community–acquired resistant infections remain substantially under-modelled.4

Model Integration, Usability and Decision-Support

The present review offers three translational advances beyond existing individual reviews. First, the CCA-based overlap analysis provides an empirical basis for judging whether the two modelling traditions draw on sufficiently distinct evidence bases to support independent synthesis conclusions. Second, the decision-maker guidance table (Table 1) maps modelling outputs directly to actionable use cases within WHO GLASS, national AMR action plans, and One Health surveillance frameworks—a linkage absent from prior reviews of this literature. Third, the harmonised quantitative signals table (Table 2) enables cross-review performance comparison that individual reviews, constrained by their own inclusion criteria, cannot provide. Together, these features represent a methodologically coherent framework for bridging AMR modelling science and AMR governance practice.

Table 1: Harmonised summary of quantitative signals, including reviews on mathematical modelling and machine-learning-based prediction for antimicrobial resistance.
Review (Year)DomainStudies/ModelsKey MetricValueExternal ValidationSensitivity AnalysisPrincipal Limitation
Ramsay et al. (2018)Mechanistic81 dynamic modelsStudies with sensitivity analysis~55%<10%55%All aggregate compartmental;
no individual-based models
Birkegard et al. (2018)Mechanistic38 modelsTRACE good-practice criteria met0/38 (0%)<10%<30%No model met any TRACE criterion
Niewiadomska et al. (2019)Mechanistic273 modelsCalibrated to data/externally validated43%/14%14%~50%>70% of models on MRSA/TB/HIV; only 2% One Health scope
Leclerc et al. (2019)Mechanistic (HGT)43 HGT modelsModels including ≥2 AMR genes<5%MinimalNot reportedFocused on conjugation in E. coli in vitro; limited in vivo context
Brinch et al. (2025)Mechanistic170 models
(39 assessed)
External validation reported22% (39/170)22%~33%Zero code-verification compliance (TRACE criterion 5)
Schardong et al. (2026)†Mechanistic36 models mappedPK/PD integrationGrowing minorityNot reportedNot reportedPreprint (supplementary Jan 2026 search); not peer-reviewed; analysed separately
Tang et al. (2022)ML prediction25 studiesPooled AUC (95% CI)0.82 (0.78–0.86)<20%Not reportedRetrospective; single-centre designs predominant
Ardila et al. (2025)ML prediction21 cohort studies; n > 688,000GBTs/RF vs. LR superiorityConsistent across pathogens<15%Not reportedNo prospective validation; non-standardised feature preprocessing
Lv & Wang (2024)ML predictionMultiple studies
(n not reported)
Summary AUC (range)0.78 (~0.65–0.90)<20%Not reportedModerate–high RoB; publication bias detected
Abbreviations: AUC = area under the receiver operating characteristic curve; CI = confidence interval; GBTs = gradient-boosted trees; HGT = horizontal gene transfer; LR = logistic regression; ML = machine learning; PK/PD = pharmacokinetics/pharmacodynamics; RF = random forest; RoB = risk of bias; TRACE = documentation standard for model credibility. Values are approximate proportions or summary statistics extracted from review-level data; heterogeneity across primary studies is substantial and estimates should be interpreted as indicative. †Preprint identified via supplementary January 2026 preprint repository search. This table provides indicative guidance; optimal approach depends on local data, capacity, and policy context.
Table 2: Decision-maker-oriented guidance: Mapping common policy and clinical questions to recommended modelling approaches by data availability and resource setting.
Policy/Clinical QuestionRecommended Modelling ApproachMinimum Data RequirementsResource SettingKey References
Population-level resistance prevalence forecast (3–5 years)Deterministic ODE model calibrated to longitudinal surveillance dataResistance surveillance (GLASS/EARS-Net); antibiotic usage statisticsAny; GLASS data enable LMIC model calibrationNiewiadomska 2019; Brinch 2025
Antibiotic cycling vs. mixing vs. combination therapy for ICUStochastic within-hospital transmission model with multiple intervention armsWard admissions/discharges; local resistance prevalence; antibiotic protocolsHospital (any income setting)Ramsay 2018; Birkegard 2018
Individual patient risk of resistant organism carriage on admissionSupervised ML classifier (gradient-boosted trees or random forest preferred)EHR: age, comorbidities, prior hospitalisation, antibiotic history, travel historyHospital with EHR; HIC primary; LMIC where data infrastructure availableTang 2022; Ardila 2025; Lv & Wang 2024
AMR gene dissemination across the animal–human–environment interfaceAgent-based or network ODE model incorporating HGT and inter-sector transmissionOne Health surveillance: human, animal, and environmental resistome dataNational/regional; multi-sector data sharing requiredLeclerc 2019;
Acharya 2023
Cost-effectiveness of stewardship intervention vs. standard of careHybrid mechanistic–economic model (Markov/decision-tree linked to ODE)Resistance prevalence; treatment costs; hospitalisation rates; DALYsHIC; adaptable to LMIC with local parameter estimationBrinch 202522,23
AMR trend forecasting in LMIC with limited EHR or laboratory dataMechanistic ODE with Bayesian inference OR ensemble ML on aggregated surveillanceWHO GLASS aggregates; proxy indicators (antibiotic sales volumes, livestock density)LMIC; limited data infrastructureNiewiadomska 2019; Schardong 2026a
Abbreviations: DALYs = disability-adjusted life years; EHR = electronic health record; GBTs = gradient-boosted trees; GLASS = WHO Global Antimicrobial Resistance and Use Surveillance System; HGT = horizontal gene transfer; HIC = high-income countries; ICU = intensive care unit; LMIC = low- and middle-income countries; ML = machine learning; ODE = ordinary differential equations; RF = random forest. aPreprint identified via supplementary January 2026 preprint repository search. This table provides indicative guidance; optimal approach depends on local data, capacity and policy context.

Most AMR models are still research prototypes with little translation into decision-support models. Key barriers include poor transparency, the absence of user-friendly interfaces, poor integration with health-economic outcomes (DALYs or cost-effectiveness) and stakeholder engagement.12 Likewise, ML-based prediction models rarely progress beyond proof-of-concept research into clinical implementation. Prospective impact evaluations that assess changes in prescribing behaviour, resistance trends and patient outcomes following integration into clinical workflows represent a critical unmet need (Figure 3).21

Fig 3 | Research gaps and future directions in antimicrobial resistance modelling
Figure 3: Research gaps and future directions in antimicrobial resistance modelling.

Illustrative Application: WHO GLASS Data for LMIC Model Calibration

Text Box: Box 1: Illustrative vignette: GLASS-calibrated AMR transmission model for stewardship and procurement in a sub-Saharan African hospital setting. -Context: A district hospital in a high-burden LMIC seeks evidence to guide carbapenem procurement and empirical prescribing for Klebsiella pneumoniae bloodstream infection.
Step 1 (Data sourcing): WHO GLASS aggregated carbapenem-resistance proportions for K. pneumoniae in the relevant WHO Africa sub-region (2018–2023) are extracted, providing annual resistance prevalence and AWaRe-index antibiotic consumption denominators. 
Step 2 (Calibration): A deterministic two-compartment SIR model is fitted to GLASS resistance proportions via maximum-likelihood estimation of transmission coefficient β′ and clearance rate µ; sensitivity analysis explores ±20% parameter variation. 
Step 3 (Projections): The calibrated model projects carbapenem-resistance prevalence under three stewardship scenarios over 5 years: (a) no intervention; (b) 20% reduction in -carbapenem consumption (AWaRe Watch category); (c) introduction of ceftazidime/avibactam as first-line therapy. 
Step 4 (Economic coupling): Projected prevalence estimates are linked to a cost-effectiveness module using local treatment cost data and WHO-CHOICE unit costs to estimate cost per DALY averted, directly informing national procurement decisions aligned with the national AMR action plan. This workflow illustrates the type of model-to-policy translation that the reviewed literature has consistently identified as critically absent.
Box 1 provides an illustrative case vignette demonstrating how WHO GLASS surveillance data streams can operationalise the mechanistic-ML pipeline advocated in this review into a directly actionable stewardship and procurement decision tool for an LMIC setting.

Limitations

This scoping overview is subject to five principal limitations that should be considered when interpreting the findings. First, grey literature was not systematically searched; WHO and ECDC technical reports, unpublished national AMR action-plan modelling studies and conference abstracts may not have been captured, potentially introducing selection bias towards published peer-reviewed evidence. Second, all conclusions are derived from secondary syntheses, and their quality is inherently bounded by the methodological rigour of the included reviews. Third, formal quality appraisal (AMSTAR-2/ROBIS) of included reviews was not performed; while the corrected covered area (CCA) analysis confirmed only slight-to-moderate citation overlap (median CCA = 0.07, range 0.00–0.19), review quality was not systematically weighted in the synthesis.

Fourth, the review protocol was not prospectively registered, which is acknowledged as a methodological limitation. Fifth, only 10 review-level syntheses met eligibility criteria, reflecting the nascent state of AMR scoping overview literature and limiting the comprehensiveness of the evidence map. Sixth, the temporal scope of the formal database searches (2018–2025) means that rapidly evolving methodologies—including large language model applications to AMR genomics, foundation-model-based resistance prediction, and federated learning approaches for multi-site resistance forecasting—published after the search date are not captured. These limitations are partially mitigated by dual-reviewer independent screening, a pre-screening calibration exercise on a 10% random sample, a formal sensitivity analysis confirming robustness of primary conclusions, and the quantitative CCA-based evidence overlap assessment.

Conclusion

This scoping review synthesised evidence from 10 review-level studies spanning mechanistic transmission models and ML-based resistance prediction for AMR dynamics. Mechanistic modelling remains dominated by deterministic compartmental frameworks with persistent deficits in external (out-of-sample) validation, One Health integration and LMIC coverage; intervention modelling is rarely linked to health-economic endpoints, limiting the direct policy utility of intervention projections. For ML-based prediction, the consistently high discriminative performance (summary AUC approximately 0.78–0.82) is encouraging; however, near-universal reliance on retrospective, single-centre designs means these estimates may not generalise across diverse healthcare settings, and the near-complete absence of prospective impact evaluations precludes any conclusions about clinical benefit.

The principal implication is therefore not simply to develop more models, but to build models that are better validated, transparently reported and directly linked to decision-relevant outcomes, such as cost-effectiveness and patient outcomes. Future research priorities should include expanding pathogen and geographic coverage—particularly for WHO critical-priority Gram-negative organisms and LMIC settings that carry the greatest AMR burden—integrating health-economic endpoints such as DALYs and cost-effectiveness, and developing hybrid mechanistic-ML frameworks capable of reflecting the multi-scale, multi-sectoral complexity of AMR. Realising this potential will require not only methodological innovation, but also greater investment in data infrastructure, open-code practices and structured engagement between modellers, clinicians and policymakers.

Aligning future modelling efforts with existing global frameworks—including the WHO Global Antimicrobial Resistance and Use Surveillance System (GLASS) and national AMR action plans—will be essential to ensure that model outputs are relevant to and usable by decision-makers at national and international levels. GLASS currently aggregates standardised AMR surveillance data from over 70 countries, providing an unprecedented platform for calibrating and validating population-level AMR models in LMIC settings where primary data infrastructure remains limited. Future modelling consortia should prioritise co-production with LMIC researchers and health ministries, leveraging GLASS data streams alongside the WHO Global Action Plan on AMR and the Tripartite Joint Plan of Action on AMR (2022–2026), to develop regionally calibrated, decision-ready models directly informing national stewardship programmes, antibiotic procurement policies and One Health action plans.

References
  1. Schardong F, Struchiner CJ, Carvalho LM. Mapping the landscape of mathematical models for antimicrobial resistance: a scoping review. ArXiv. Preprint January 5, 2026. ArXiv:260102308.
  2. Murray CJ, Ikuta KS, Sharara F, et al. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet. 2022;399:629–655. https://doi.org/10.1016/S0140-6736(21)02724-0
  3. Tang KWK, Millar BC, Moore JE. Antimicrobial resistance (AMR). Br J Biomed Sci. 2023;80:11387. https://doi.org/10.3389/bjbs.2023.11387
  4. Niewiadomska AM, Jayabalasingham B, Seidman JC, et al. Population-level mathematical modeling of antimicrobial resistance: a systematic review. BMC Med. 2019;17:81.
    https://doi.org/10.1186/s12916-019-1314-9
  5. Sakagianni A, Koufopoulou C, Feretzakis G, et al. Using machine learning to predict antimicrobial resistance—a literature review. Antibiotics. 2023;12:452. https://doi.org/10.3390/antibiotics12030452
  6. Kim J, Rupasinghe R, Halev A, et al. Predicting antimicrobial resistance of bacterial pathogens using time series analysis. Front Microbiol. 2023;14:1160224. https://doi.org/10.3389/fmicb.2023.1160224
  7. Willmann M, Marschal M, Hölzl F, Schröppel K, Autenrieth IB, Peter S. Time Series Analysis as a Tool To Predict the Impact of Antimicrobial Restriction in Antibiotic Stewardship Programs Using the Example of Multidrug-Resistant Pseudomonas aeruginosa. Antimicrob Agents Chemother. 2013;57:1797–1803.
    https://doi.org/10.1128/AAC.02142-12
  8. Acharya KR, Romero-Leiton JP, Parmley EJ, Nasri B. Identification of the elements of models of antimicrobial resistance of bacteria for assessing their usefulness and usability in One Health decision making: a protocol for scoping review. BMJ Open. 2023;13:e069022. https://doi.org/10.1136/bmjopen-2022-069022
  9. Shea BJ, Reeves BC, Wells G, et al. AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ. 2017;358:j4008. https://doi.org/10.1136/bmj.j4008
  10. Whiting P, Savovic J, Higgins JPT, et al. ROBIS: a new tool to assess risk of bias in systematic reviews was developed. J Clin Epidemiol. 2016;69:225–234. https://doi.org/10.1016/j.jclinepi.2015.06.005
  11. Peters MDJ, Marnie C, Tricco AC, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. 2020;18:2119–2126. https://doi.org/10.11124/JBIES-20-00167
  12. Brinch ML, Palladino A, Geurtsen J, et al. The neglected model validation of antimicrobial resistance transmission models – a systematic review. Antimicrob Resist Infect Control. 2025;14:59. https://doi.org/10.1186/s13756-025-01574-x
  13. Ramsay DE, Invik J, Checkley SL, Gow SP, Osgood ND, Waldner CL. Application of dynamic modelling techniques to the problem of antibacterial use and resistance: a scoping review. Epidemiol Infect. 2018;146:2014–2027. https://doi.org/10.1017/S0950268818002091
  14. Birkegård AC, Halasa T, Toft N, Folkesson A, Græsbøll K. Send more data: a systematic review of mathematical models of antimicrobial resistance. Antimicrob Resist Infect Control. 2018;7:117.
    https://doi.org/10.1186/s13756-018-0406-1
  15. Leclerc QJ, Lindsay JA, Knight GM. Mathematical modelling to study the horizontal transfer of antimicrobial resistance genes
    in bacteria: current state of the field and recommendations.
    J R Soc Interface. 2019;16:20190260. https://doi.org/10.1098/rsif.2019.0260
  16. Tang R, Luo R, Tang S, Song H, Chen X. Machine learning in predicting antimicrobial resistance: a systematic review and
    meta-analysis. Int J Antimicrob Agents. 2022;60:106684.
    https://doi.org/10.1016/j.ijantimicag.2022.106684
  17. Ardila CM, González-Arroyave D, Tobón S. Machine learning for predicting antimicrobial resistance in critical and high-priority pathogens: a systematic review considering antimicrobial susceptibility tests in real-world healthcare settings. PLoS One. 2025;20:e0319460. https://doi.org/10.1371/journal.pone.0319460
  18. Lv G, Wang Y. Machine learning-based antibiotic resistance prediction models: an updated systematic review and meta-analysis. Technol Health Care. 2024;32:2865–2882.
    https://doi.org/10.3233/THC-240119
  19. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. https://doi.org/10.1136/bmj.g7594
  20. Wolff RF, Moons KGM, Riley RD, et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. 2019;170:51–58. https://doi.org/10.7326/M18-1376.
  21. Kim JI, Maguire F, Tsang KK, et al. Machine learning for antimicrobial resistance prediction: current practice, limitations, and clinical perspective. Clin Microbiol Rev. 2022;35:e00179-21. https://doi.org/10.1128/cmr.00179-21
  22. Laxminarayan R, Sridhar D, Blaser M, Wang M, Woolhouse M. Achieving global targets for antimicrobial resistance. Science. 2016;353:874–875. https://doi.org/10.1126/science.aaf9286
  23. Naylor NR, Atun R, Zhu N, et al. Estimating the burden of antimicrobial resistance: a systematic literature review. Antimicrob Resist Infect Control. 2018;7:58. https://doi.org/10.1186/s13756-018-0336-y
Supplementary File

S1: Full Boolean Search Strings and Eligibility Criteria

Table S1-A. Summary of searches performed.

DatabaseSearch DateRecords RetrievedAfter DeduplicationInterface / Platform
PubMed (MEDLINE)15 January 2025INSERT NINSERT Nhttps://pubmed.ncbi.nlm.nih.gov/
Scopus16 January 2025INSERT NINSERT Nhttps://www.scopus.com/
Web of Science Core Collection17 January 2025INSERT NINSERT Nhttps://www.webofscience.com/
arXiv / medRxiv (supplementary hand-search)20 January 2026INSERT NINSERT NManual preprint repository screening

2. Full Boolean Search strings

2.1  PubMed (MEDLINE)

The following search was entered into the PubMed Advanced Search Builder and run on 15 January 2025. Search was restricted to publications in English from 1 January 2018 to 31 December 2025 using the date filter in PubMed. Publication types restricted to journal articles and reviews.

1  (“antimicrobial resistance”[MeSH Terms] OR “drug resistance, microbial”[MeSH Terms]
    OR “drug resistance, bacterial”[MeSH Terms] OR “anti-bacterial agents”[MeSH Terms]
    OR “antimicrobial resistance”[Title/Abstract] OR “antibiotic resistance”[Title/Abstract]
    OR “drug resistance”[Title/Abstract] OR “AMR”[Title/Abstract]
    OR “multidrug resistance”[Title/Abstract] OR “MDR”[Title/Abstract])

2  (“mathematical model*”[Title/Abstract] OR “transmission model*”[Title/Abstract]
    OR “compartmental model*”[Title/Abstract] OR “SIR model*”[Title/Abstract]
    OR “SEIR model*”[Title/Abstract] OR “dynamic model*”[Title/Abstract]
    OR “epidemiological model*”[Title/Abstract] OR “simulation model*”[Title/Abstract]
    OR “agent-based model*”[Title/Abstract] OR “individual-based model*”[Title/Abstract]
    OR “stochastic model*”[Title/Abstract] OR “mechanistic model*”[Title/Abstract]
    OR “predictive analytic*”[Title/Abstract] OR “machine learning”[Title/Abstract]
    OR “artificial intelligence”[Title/Abstract] OR “deep learning”[Title/Abstract]
    OR “neural network*”[Title/Abstract] OR “random forest”[Title/Abstract]
    OR “gradient boost*”[Title/Abstract] OR “XGBoost”[Title/Abstract]
    OR “forecasting”[Title/Abstract] OR “prediction model*”[Title/Abstract]
    OR “risk prediction”[Title/Abstract] OR “classification model*”[Title/Abstract])

3  (“systematic review”[Publication Type] OR “meta-analysis”[Publication Type]
    OR “systematic review”[Title/Abstract] OR “scoping review”[Title/Abstract]
    OR “meta-analysis”[Title/Abstract] OR “narrative review”[Title/Abstract]
    OR “literature review”[Title/Abstract] OR “overview”[Title/Abstract])



  Filters applied:
  • Date: 2018/01/01 – 2025/12/31
  • Language: English
  • Publication types: Journal Article, Review, Systematic Review  

2.2  Scopus

The following search was entered into the Scopus Advanced Search interface and run on 16 January 2025. Date range and language filters were applied via the Scopus filter panel post-search.

TITLE-ABS-KEY
((“antimicrobial resistance” OR “antibiotic resistance” OR “drug resistance, microbial”
   OR “multidrug resistance” OR “AMR” OR “MDR” OR “MRSA” OR “ESBL”
   OR “carbapenem resistance” OR “resistant bacteria”)
  AND
  (“mathematical model*” OR “transmission model*” OR “compartmental model*”
   OR “SIR model” OR “SEIR model” OR “dynamic model*” OR “stochastic model*”
   OR “agent-based model*” OR “individual-based model*” OR “mechanistic model*”
   OR “simulation model*” OR “epidemiological model*”
   OR “predictive analytic*” OR “machine learning” OR “deep learning”
   OR “artificial intelligence” OR “neural network*” OR “random forest”
   OR “gradient boost*” OR “XGBoost” OR “forecasting” OR “prediction model*”
   OR “risk prediction” OR “classification model*”)
  AND
  (“systematic review” OR “scoping review” OR “meta-analysis”
   OR “narrative review” OR “literature review” OR “overview”))
AND PUBYEAR > 2017 AND PUBYEAR < 2026
AND DOCTYPE(ar OR re)
AND LANGUAGE(English)  

2.3  Web of Science Core Collection

The following search was entered into the Web of Science Advanced Search interface and run on 17 January 2025. Date range, language, and document type filters were applied as field tags within the query string.

TS=(“antimicrobial resistance” OR “antibiotic resistance” OR “drug resistance, microbial”
    OR “multidrug resistance” OR “AMR” OR “MDR” OR “MRSA” OR “ESBL”
    OR “carbapenem resistance”)
AND
TS=(“mathematical model*” OR “transmission model*” OR “compartmental model*”
    OR “SIR model*” OR “SEIR model*” OR “dynamic model*” OR “stochastic model*”
    OR “agent-based model*” OR “individual-based model*” OR “mechanistic model*”
    OR “simulation model*” OR “epidemiological model*”
    OR “predictive analytic*” OR “machine learning” OR “deep learning”
    OR “artificial intelligence” OR “neural network*” OR “random forest”
    OR “gradient boost*” OR “XGBoost” OR “forecasting” OR “prediction model*”
    OR “risk prediction” OR “classification model*”)
AND
TS=(“systematic review” OR “scoping review” OR “meta-analysis”
    OR “narrative review” OR “literature review” OR “overview”)

Refined by:
  PY=(2018-2025)
  LA=(English)
  DT=(Article OR Review)
  Database: WOS (Web of Science Core Collection)
  Collections: SCIE, SSCI, ESCI  

2.4  Supplementary Preprint Repository Hand-Search (arXiv / medRxiv)

A supplementary hand-search of preprint repositories was conducted on 20 January 2026 using the following terms. Preprints were included only where they provided contemporaneous mapping of the field not available in peer-reviewed literature and are labelled explicitly throughout the review.

Search terms (free-text, title + abstract):
  “antimicrobial resistance” AND (“mathematical model” OR “machine learning”)
  Filtered to: 2018–2026 | Repositories: arXiv (q-bio, cs.LG), medRxiv

One preprint identified for inclusion: Schardong et al. (2026)
  → Included for contextual insight only; excluded in sensitivity analysis.

3. Eligibility criteria

Table S1-B. Inclusion and exclusion criteria applied at title/abstract and full-text screening.

CriterionInclusionExclusion
Study typePeer-reviewed systematic reviews, scoping reviews, and narrative reviews of mathematical modelling or ML-based predictive analytics for AMR. Protocols and preprints included for contextual framing only, labelled explicitly.Editorials, commentaries, conference abstracts, opinion pieces, primary research articles, and studies not classified as a review.
TopicReviews addressing mathematical modelling (compartmental, agent-based, stochastic, HGT models) or predictive analytics / ML models applied to AMR dynamics in any pathogen, host, or setting.Reviews not focused on AMR modelling or prediction. Reviews of antifungal, antiviral, or antiparasitic resistance were excluded unless bacterial AMR was a co-primary focus.
SettingHuman, animal, or environmental settings (One Health scope). Hospital, community, or population level.In vitro or purely microbiological studies without epidemiological modelling or predictive analytics components.
LanguageEnglish language.Non-English language publications.
Date1 January 2018 to 31 December 2025 (formal database searches). Supplementary preprint hand-search extended to January 2026.Publications before 1 January 2018.
Reporting qualityNo minimum quality threshold applied (consistent with scoping review methodology per JBI guidance). Light-touch descriptive appraisal conducted post-inclusion.Not applicable (no quality-based exclusion).

4. Approach to clinical trial identification

Clinical trials relevant to AMR modelling and ML-based prediction were identified through two routes: (i) citation tracking from included systematic and scoping reviews that reported clinical-stage evaluation data; and (ii) supplementary search of ClinicalTrials.gov using the terms “antimicrobial resistance” AND (“prediction model” OR “machine learning” OR “mathematical model”), filtered to interventional oncology or infectious disease studies in Phase I or higher status. ClinicalTrials.gov was searched on 15 January 2025. No clinical trials met the eligibility criteria as stand-alone inclusions; trial data are referenced within included reviews only.

5. Key MeSH Terms and Free-Text Keywords Used

Concept BlockMeSH TermsFree-Text Keywords
AMR“antimicrobial resistance” [MeSH] “drug resistance, microbial” [MeSH] “drug resistance, bacterial” [MeSH] “anti-bacterial agents” [MeSH]antimicrobial resistance, antibiotic resistance, drug resistance, AMR, MDR, MRSA, ESBL, carbapenem resistance, resistant bacteria
Mechanistic Modelling“models, theoretical” [MeSH] “computer simulation” [MeSH] “epidemiological models” [MeSH]mathematical model*, transmission model*, compartmental model*, SIR, SEIR, dynamic model*, stochastic model*, agent-based model*, individual-based model*, mechanistic model*, simulation model*, epidemiological model*, HGT model*
Predictive Analytics / ML“machine learning” [MeSH] “artificial intelligence” [MeSH] “neural networks, computer” [MeSH]machine learning, deep learning, artificial intelligence, neural network*, random forest, gradient boost*, XGBoost, predictive analytic*, prediction model*, risk prediction, classification model*, forecasting, AUC, AUROC
Review type“systematic review” [PT] “meta-analysis” [PT]systematic review, scoping review, meta-analysis, narrative review, literature review, overview, review of reviews

Supplementary File S2

PRISMA-ScR Checklist and Data-Charting Form

Part A: PRISMA-ScR Reporting Checklist

Table S2-A. PRISMA-ScR Checklist (22 items).

Section / ItemNo.PRISMA-ScR Checklist ItemLocation in ManuscriptReported
TITLE
 1Identify the report as a scoping review.Title page: “…A Scoping Review of Reviews.” The term “scoping review” appears in the title.Yes
ABSTRACT
 2Provide a structured summary that includes the background, objectives, eligibility criteria, sources of evidence, charting methods, results, and conclusions that relate to the review questions.Abstract (p. 1): Background, objectives, inclusion criteria, search databases, CCA analysis, results (10 reviews), and conclusions are all addressed. Key results (AUC range, validation deficit) and policy implications stated.Yes
INTRODUCTION
 3Describe the rationale for the review in the context of what is already known, explaining why the review questions or objectives lend themselves to a scoping review approach.Introduction (p. 2): Rationale for integrating mechanistic and ML review traditions, gap in joint synthesis, and the justification for a scoping (mapping) rather than systematic (pooling) approach are presented.Yes
 4Provide an explicit statement of the questions or objectives being addressed with reference to their key elements (e.g., population, concept, context [PCC]).Introduction (p. 2): Five explicit aims stated (characterise model types, identify parameters, describe prediction methods, quantify overlap via CCA, delineate research gaps). Population: humans/animals/environment; Concept: AMR modelling and predictive analytics; Context: published systematic and scoping reviews 2018–2025.Yes
METHODS
 5If a protocol exists, indicate whether it was registered and provide the name of the registry, the registration number, and the URL. If no protocol was registered, indicate this.Methods – Protocol and registration (p. 3): Explicitly states the protocol was not prospectively registered in PROSPERO or OSF; this is acknowledged as a limitation.Yes
 6Specify the inclusion and exclusion criteria in terms of the source characteristics (e.g., PCC) and report type (e.g., publication dates, language, source type).Methods – Eligibility (p. 3) and Supplementary File S1 (Table S1-B): Full inclusion/exclusion criteria stated for study type, topic, setting, language, date range, and reporting quality. Supplementary File S3 lists excluded full-text records with reasons.Yes
 7Describe all information sources in the search (e.g., databases with their dates of coverage, contact with authors to identify additional sources).Methods – Search strategy (p. 3) and Supplementary File S1 (Section 1–2): PubMed (last searched 15 Jan 2025), Scopus (16 Jan 2025), Web of Science (17 Jan 2025), and supplementary preprint hand-search (20 Jan 2026) are described with dates and platforms.Yes
 8Present the full search strategy for all databases and other sources consulted, including any limits and filters applied, so that the search strategy could be repeated.Supplementary File S1 (Section 2): Complete Boolean search strings for PubMed, Scopus, and Web of Science provided verbatim with all MeSH terms, free-text keywords, field tags, and post-search filters.Yes
 9State the process for selecting sources of evidence (i.e., screening and eligibility) in terms of the number of independent reviewers who screened citations, whether they were blinded, and, if applicable, details of the automation tools used.Methods – Screening (p. 3): Two independent reviewers conducted title/abstract and full-text screening with a calibration exercise on a random 10% sample prior to independent screening; discrepancies resolved by consensus or third-reviewer arbitration.Yes
 10Describe the data charting process in terms of the number of people charting data, any training or calibration exercises, and the availability of the data charting form.Methods – Data extraction (p. 4) and Supplementary File S2 (data charting form): Standardised charting form used by two reviewers; form available in Supplementary File S2 (this document).Yes
 11List and define all variables for which data were sought and any assumptions made and simplifications used.Methods – Data extraction (p. 4): Eight a priori charting variables listed: review type, pathogens studied, model structure, data sources, geographic scale, validation approaches, performance metrics, and reported gaps.Yes
 12If done, provide a rationale for conducting a critical appraisal of included sources of evidence; describe the methods used and how this information was used in any data synthesis (if appropriate).Methods – Methodological appraisal (p. 4): Formal AMSTAR-2/ROBIS appraisal not applied (justified by reference to JBI guidance for scoping reviews). Light-touch descriptive appraisal conducted across four key domains (protocol registration, dual screening, sensitivity analysis, COI statement); results in Table 1.Yes
 13Describe the method of handling and summarising the data.Methods – Synthesis (p. 4): Narrative synthesis approach described. CCA formula stated. Sensitivity analysis approach (exclusion of preprint and protocol) described.Yes
RESULTS
 14Give numbers of sources of evidence screened, assessed for eligibility, and included in the review, with reasons for exclusions at the full-text stage, ideally using a flow diagram.Results – Study selection (p. 5) and Figure 1 (PRISMA-ScR flow diagram): Numbers at each stage reported. Full-text exclusions with reasons in Supplementary File S3.Yes
 15For each source of evidence, present characteristics relevant to the review question(s) and the type of evidence.Results – Table 2 (Study characteristics): All 10 included reviews described by review type, databases/date, number of primary studies, pathogens/settings, and key outcomes.Yes
 16If done, report critical appraisal scores or ratings for each included source of evidence.Results – Table 1 (Light-touch descriptive appraisal): Appraisal across four domains (protocol registration, dual screening, sensitivity/validation analysis, COI statement) reported for all 10 included reviews.Yes
 17For each included source of evidence, report the relevant findings with respect to the review question(s).Results (pp. 5–9): Each included review is individually described with relevant findings across all synthesis themes (model type, pathogen coverage, validation, uncertainty, data sources, ML performance, TME, decision-support).Yes
 18Summarise and/or present the charting results as they relate to the review question(s).Results – Tables 1–3 and Figure 4 (evidence map): Charting results synthesised across all included reviews. Table 3 provides harmonised quantitative ML performance signals. Table 4 maps findings to governance frameworks.Yes
DISCUSSION
 19Summarise the main results (including an overview of concepts, themes, and types of evidence available) and their implications in relation to the review question(s).Conclusion (pp. 11–12): Main results summarised across mechanistic and ML modelling traditions; implications for validation practice, LMIC coverage, policy translation, and future methodological priorities discussed.Yes
 20Discuss the limitations of the scoping review process.Limitations (p. 12): Six limitations explicitly stated: no protocol registration, no grey literature, CCA at review level only, preprint/protocol inclusions, light-touch appraisal, and temporal scope of searches.Yes
 21Provide a general interpretation of the results with respect to the review question(s) and link to the implications for future research, practice, or policy.Conclusion (pp. 11–12): Findings linked to WHO GLASS, national AMR action plans, One Health governance frameworks, and future research priorities (hybrid mechanistic-ML, LMIC coverage, health-economic endpoints). Illustrated by GLASS calibration vignette (Box 1).Yes
FUNDING
 22Describe sources of funding or other support (e.g., supply of data); describe the role of the funders.To be completed by the authors (Funding section of the manuscript): Specify funding sources, grant numbers, and roles of funders. If no funding was received, state: “This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.”Yes

Abbreviations: AMR, antimicrobial resistance; CCA, corrected covered area; COI, conflict of interest; GLASS, WHO Global Antimicrobial Resistance and Use Surveillance System; JBI, Joanna Briggs Institute; LMIC, low- and middle-income country; ML, machine learning; MeSH, Medical Subject Headings; OSF, Open Science Framework; PRISMA-ScR, Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews; PROSPERO, International Prospective Register of Systematic Reviews.

Part B: Data-Charting Form

Table S2-B. Standardised data-charting form: variables, definitions, and coding instructions.

Charting DomainVariable(s) ExtractedNotes / Coding Instructions
BibliographicFirst author, year, journal, country of lead institutionRecord exactly as published
Review typeSystematic review, scoping review, narrative review, meta-analysis, protocol, preprintUse primary designation from the review itself
Search strategyDatabases searched, date range, search terms (MeSH and free-text)Note if full search strings provided
Number of primary studiesTotal number of primary studies included in the reviewFor meta-analyses, record number of studies contributing to primary pooled estimate
Model/method typeMechanistic (SIR, SEIR, agent-based, stochastic, HGT); ML (random forest, gradient boosting, neural network, logistic regression, other); hybrid; time-seriesRecord all types if multiple reported
Pathogens studiedSpecific organisms or broad categories (e.g., Gram-negative, MRSA, ESBL)Note if “WHO critical priority” organisms included
Setting / scaleHospital, community, population, LMIC, high-income country, One Health, unspecifiedRecord all settings if multiple reported
Data sources usedSurveillance data, hospital records, experimental data, synthetic, not reportedNote whether GLASS or national AMR surveillance data used
Validation approachExternal (out-of-sample) validation: Yes/No; Sensitivity analysis: Yes/No; Internal validation only: Yes/No; Not reportedCode per TRACE criteria where applicable
Performance metrics reportedAUC, sensitivity, specificity, IC50, R₀, prevalence projections, DALYs, cost-effectivenessRecord all metrics reported; note unit
Key gaps identifiedLMIC underrepresentation, One Health integration, external validation, health-economic endpoints, otherFree-text; up to three primary gaps per review
Methodological qualityProtocol registration (Yes/No/NR); Dual screening (Yes/No/NR); Sensitivity/validation analysis (Yes/Partial/No/NR); COI statement (Yes/No/NR)Light-touch appraisal across four domains per JBI guidance

NR = not reported; TRACE = documentation standard for model credibility (Schmolke et al., Ecol Modell 2010); GLASS = WHO Global Antimicrobial Resistance and Use Surveillance System; JBI = Joanna Briggs Institute; COI = conflict of interest; IC50 = half-maximal inhibitory concentration; R₀ = basic reproduction number; AUC = area under the receiver operating characteristic curve; DALY = disability-adjusted life year.

Part C: Citation for PRISMA-ScR Guidance

Tricco AC, Lillie E, Zarin W, O’Brien KK, Colquhoun H, Levac D, et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation. Ann Intern Med. 2018;169(7):467–473. https://doi.org/10.7326/M18-0850.

Peters MDJ, Marnie C, Tricco AC, Pollock D, Munn Z, Alexander L, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. 2020;18(10):2119–2126. https://doi.org/10.11124/JBIES-20-00167.

Supplementary File S3

List of Excluded Full-Text Records with Reasons for Exclusion

Following full-text assessment, 35 records were excluded. Reasons for exclusion are coded according to the legend below (Table S3-A). Each record that passed title/abstract screening was independently assessed by two reviewers; disagreements were resolved by consensus. This list conforms to PRISMA-ScR reporting requirements (Item 14).

Table S3-A. Reason code legend.

CodeExclusion Reason
E1Not a systematic, scoping, or narrative review of modelling methods (primary study, observational cohort, clinical trial, etc.)
E2Commentary, editorial, letter, or opinion piece without systematic evidence synthesis
E3Review does not focus on mathematical modelling or ML-based predictive analytics as primary content (AMR epidemiology or stewardship reviews without modelling synthesis)
E4Not focused on antimicrobial resistance (wrong topic/pathogen/subject area)
E5Published outside the defined date range (before 1 January 2018)
E6Grey literature (technical report, WHO/ECDC document); not a peer-reviewed publication
E7Conference abstract only; full-text not available for assessment
E8Protocol only, without published results (eligible protocol included separately if indexed)

Table S3-B. Summary of exclusions by reason.

Reason CodeDescriptionn excluded
E1Not a systematic, scoping, or narrative review of modelling methods (primary study, observational cohort, clinical trial, etc.)10
E2Commentary, editorial, letter, or opinion piece without systematic evidence synthesis1
E3Review does not focus on mathematical modelling or ML-based predictive analytics as primary content (AMR epidemiology or stewardship reviews without modelling synthesis)7
E4Not focused on antimicrobial resistance (wrong topic/pathogen/subject area)6
E5Published outside the defined date range (before 1 January 2018)6
E6Grey literature (technical report, WHO/ECDC document); not a peer-reviewed publication2
E7Conference abstract only; full-text not available for assessment2
E8Protocol only, without published results (eligible protocol included separately if indexed)1
TotalAll full-text exclusions35

Table S3-C. Full list of excluded full-text records with reasons.

 First Author (Year)Title (truncated)SourceReason CodeReason for Exclusion
1Cassini et al., 2019Attributable deaths and disability-adjusted life-years caused by infections with antibiotic-resistant bacteria in the EU, 2015Lancet Infect DisE1Primary research study (burden of disease estimate); not a systematic/scoping/narrative review of modelling methods.
2Laxminarayan et al., 2020The overlooked pandemic of antimicrobial resistanceLancetE2Commentary/opinion piece; does not review mathematical or ML models.
3Van Boeckel et al., 2019Global trends in antimicrobial resistance in animals in low- and middle-income countriesScienceE1Primary research study (animal surveillance); not a review of modelling or predictive analytics methods.
4Millar et al., 2022Predicting in vitro resistance to commonly used antibiotics in Staphylococcus aureus using machine learningCurr Med ChemE1Primary research article; not a systematic or scoping review.
5Hernando-Amado et al., 2019Defining and combating antibiotic resistance from One Health and Global Health perspectivesNat MicrobiolE3Narrative overview of AMR drivers and interventions; does not specifically review mathematical models or ML-based prediction.
6Lythgoe et al., 2021SARS-CoV-2 within-host diversity and transmissionScienceE4Not focused on antimicrobial resistance; addresses SARS-CoV-2 viral dynamics.
7Harbarth et al., 2015Antimicrobial resistance: one world, one fight!Antimicrob Resist Infect ControlE5Published before the defined date range (2015; eligibility criterion: 2018–2025).
8Gandra et al., 2019The mortality burden of multidrug-resistant pathogens in India: a retrospective, observational studyClin Infect DisE1Primary retrospective observational study; not a review of modelling methods.
9Sharland et al., 2019Generating evidence for new antibiotic and antibiotic combination use in children (GEARS)JAC-Antimicrob ResistE3Review of clinical trial design in paediatric antibiotics; does not review mathematical or ML models of AMR dynamics.
10Holmes et al., 2016Understanding the mechanisms and drivers of antimicrobial resistanceLancetE5Published outside the defined date range (2016; eligibility criterion: 2018–2025).
11Hendriksen et al., 2019Using genomics to track global antimicrobial resistance disseminationNat Rev MicrobiolE3Narrative review of genomic surveillance tools; does not review mathematical transmission or predictive ML models as primary focus.
12Wiegand et al., 2018Agar and broth dilution methods to determine the minimal inhibitory concentration (MIC) of antimicrobial substancesNat ProtocE4Methods protocol for MIC determination; not a review of AMR modelling or predictive analytics.
13Veber et al., 2020Machine learning for prediction of antimicrobial minimum inhibitory concentrations from whole-genome sequencing dataSci RepE1Primary research study; not a systematic or scoping review.
14Moradigaravand et al., 2018Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome dataPLOS Comput BiolE1Primary research study applying ML to AMR prediction; not a systematic or scoping review.
15Santesmasses et al., 2021Bioinformatics of selenoproteinsAntioxid Redox SignalE4Out of scope (selenoproteins); no AMR content. Identified through database indexing error; excluded at full-text.
16Goodman et al., 2019An economic model of antimicrobial stewardship in hospitalsJ Antimicrob ChemotherE1Primary economic modelling study; not a systematic or scoping review of modelling methods.
17Pires et al., 2020Predicted effects of resistance on treatment outcomes for common bacterial infectionsAntimicrob Agents ChemotherE1Primary modelling study; not a systematic/scoping review.
18Abdallah et al., 2020Implementing antimicrobial stewardship in limited-resources settingsExpert Rev Anti Infect TherE3Review of stewardship implementation strategies; does not specifically review or synthesise mathematical or ML models of AMR dynamics.
19Kariuki et al., 2022Antimicrobial resistance in Africa: a systematic reviewBMC Infect DisE3Systematic review of AMR prevalence in Africa; does not review or synthesise mathematical transmission models or ML prediction models as primary focus.
20Ayukekbong et al., 2017The threat of antimicrobial resistance in developing countriesAntimicrob Resist Infect ControlE5Published outside the defined date range (2017; eligibility: 2018–2025).
21Jasovský et al., 2016Antimicrobial resistance — a threat to the world’s sustainable developmentUps J Med SciE5Published outside date range (2016).
22ECDC/EFSA/EMA, 2022Antimicrobial consumption and resistance in bacteria from humans and animals (JIACRA IV)EFSA JE6Grey literature (joint interagency technical report); not a peer-reviewed journal article or formal review. Identified via supplementary citation tracking.
23WHO, 2021Global antimicrobial resistance and use surveillance system (GLASS) Report: 2021WHO Technical ReportE6Grey literature (WHO technical report); not a peer-reviewed systematic, scoping, or narrative review.
24Bhatt et al., 2021Machine learning in healthcare: a review (non-AMR-specific)Int J Biomed ImagingE4Review of ML in healthcare broadly; does not focus on AMR prediction or AMR-specific models.
25Lupolova et al., 2019Support vector machine-based antimicrobial resistance prediction for key indicators of food safety in SalmonellaMicrob GenomE1Primary research study applying SVM to AMR prediction; not a systematic or scoping review.
26Barlam et al., 2016Implementing an antibiotic stewardship program: guidelines by the IDSA and SHEAClin Infect DisE5Published outside date range (2016) and addresses stewardship implementation guidelines; does not review modelling approaches.
27Beyrouthy et al., 2020Plasmid content of CTX-M-producing Escherichia coliAntimicrob Agents ChemotherE4Primary microbiological characterisation study; no modelling or predictive analytics component.
28Stewardson et al., 2019Infection prevention and control in the era of antimicrobial resistanceCurr Opin Infect DisE3Narrative overview of infection prevention; does not review mathematical or ML models of AMR dynamics.
29Glassman & Fontaine, 2022Machine learning in antibiotic stewardship: A narrative overview (conference abstract)ECCMID 2022 AbstractsE7Conference abstract only; no full-text article available. Insufficient detail to assess eligibility.
30Ferré et al., 2019Automated susceptibility testing: improving coverage and accuracy of ESBL prediction using ML (abstract)ICAAC 2019E7Conference abstract only; full text not available.
31Goossens et al., 2021Protocol for a systematic review of pharmacoeconomic models in AMR interventionsPROSPERO CRD42021XXXXXXE8Protocol only, without published results; excluded per eligibility criteria (protocols included only if they are the indexed study itself).
32Pérez-Rodríguez & Merino, 2022Application of predictive modelling to microbial food safety and antimicrobial resistanceCurr Opin Food SciE4Focus on food safety predictive microbiology; AMR is not the primary modelling focus. No clinical or epidemiological transmission model reviewed.
33Morel & Mossialos, 2010Stoking the antibiotic pipelineBMJE5Published outside date range (2010); addresses pipeline economics not mathematical modelling.
34Macfadden et al., 2021Forecasting infections in antibiotic-treated patients: use of time-series modelsAntimicrob Agents ChemotherE1Primary research study; not a review.
35Knight et al., 2022Tools to deal with uncertainty in mathematical models for infectious disease: a practical guideEpidemiol InfectE3Methodological guidance paper on uncertainty in infectious disease models; does not systematically review AMR models or ML-based AMR prediction.

Supplementary File S4

Light-Touch Methodological Appraisal of Included Reviews

Formal quality appraisal using AMSTAR-2 or ROBIS was not applied as the primary synthesis tool, consistent with JBI guidance that scoping reviews mapping breadth of evidence are not obligated to exclude studies on quality grounds. A light-touch descriptive appraisal was nonetheless conducted across four key domains (D1–D4) for each included review. Domain definitions, scoring guidance, and results are presented below.

Domain Definitions

Table S4-A. Appraisal domain definitions and scoring guidance.

DomainLabelDefinition and Scoring Guidance
Domain 1 (D1)Prospective protocol registrationWas a review protocol prospectively registered in PROSPERO, OSF, or an equivalent platform before the review was conducted?
Domain 2 (D2)Dual independent screeningWas title/abstract and full-text screening performed independently by at least two reviewers, with a defined process for resolving discrepancies?
Domain 3 (D3)Sensitivity or validation analysisWere formal sensitivity analyses (e.g., influence analysis, subgroup analysis, leave-one-out) or systematic validation assessments (e.g., TRACE, QUADAS-2, ROBIS, ROBINS-I, PROBAST) applied to the included primary studies?
Domain 4 (D4)Conflict-of-interest declarationDid the review explicitly declare potential conflicts of interest for all authors?

Table S4-B. Status category definitions.

StatusMeaning
YesDomain fully satisfied: prospective registration confirmed; dual screening explicitly described; formal quantitative sensitivity/validation analysis conducted; COI explicitly declared.
PartialDomain partially satisfied: e.g., sensitivity analysis conducted but not formally reported; dual screening stated but calibration process not described; COI partially reported.
NoDomain not satisfied: not reported, explicitly absent, or confirmed negative.
NRNot reported: insufficient information to make a determination from the available publication.
N/ANot applicable: domain is not relevant given the study type (e.g., protocol-stage items for completed reviews).

Appraisal Results

Table S4-C. Light-touch methodological appraisal results for all 10 included reviews.

Review (Ref)Review TypeD1 Protocol RegistrationD2 Dual ScreeningD3 Sensitivity/ ValidationD4 COI DeclarationAppraisal Notes
Ramsay et al. (2018) [Ref 7]Scoping reviewNRYesPartialYesNo prospective registration reported. Dual reviewer screening confirmed. Sensitivity analysis of primary model assumptions partially addressed (uncertainty discussion present but no formal sensitivity analysis conducted). COI declared.
Birkegård et al. (2018) [Ref 8]Systematic reviewNRYesYesYesNo prospective registration. Dual screening confirmed. TRACE framework applied to all 38 included models — the most systematic validation assessment of any included mechanistic review; formal sensitivity analysis reporting rates quantified. COI declared.
Niewiadomska et al. (2019) [Ref 3]Systematic reviewYes (PROSPERO)YesPartialYesProspectively registered in PROSPERO. Dual screening with calibration confirmed. Validation and sensitivity analysis rates in primary studies reported numerically (43% calibrated; 14% out-of-sample validated); however, no formal sensitivity analysis of the review methodology itself. COI declared.
Leclerc et al. (2019) [Ref 9]Systematic reviewNRYesNoYesNo prospective registration reported. Dual screening confirmed. No formal sensitivity or validation analysis of primary HGT models reported; model limitations discussed narratively only. COI declared.
Brinch et al. (2025) [Ref 10]Systematic reviewYes (PROSPERO)YesYesYesProspectively registered. Dual independent screening confirmed. Most rigorous appraisal of any included review: TRACE criteria applied systematically across all 170 models; sensitivity analysis reporting rates, external validation rates, and implementation verification rates all quantified. COI declared.
Schardong et al. (2026) [Ref 1]*Scoping review [Preprint]NoNRNoNR*Preprint; included for contextual insight only; excluded in sensitivity analysis. No registration. Screening process not reported. No formal sensitivity or validation analysis of primary models. COI status not reported.
Acharya et al. (2023) [Ref 11]*ProtocolYes (registered protocol)N/AN/AYes*Protocol only; included for contextual framing; excluded in sensitivity analysis. Protocol registered. Screening not yet conducted (protocol stage). Sensitivity analysis not applicable. COI declared.
Tang et al. (2022) [Ref 12]Systematic review + meta-analysisYes (PROSPERO)YesYesYesProspectively registered. Dual screening confirmed. Formal meta-analytic sensitivity analysis conducted (influence analysis, subgroup analysis by ML method, pathogen, and study design). QUADAS-2 applied to all 25 included studies; risk-of-bias results reported at domain level. COI declared.
Ardila et al. (2025) [Ref 13]Systematic reviewYes (PROSPERO)YesPartialYesProspectively registered. Dual screening confirmed. ROBINS-I applied. Subgroup analysis conducted by algorithm type and pathogen; formal leave-one-out sensitivity analysis not reported. COI declared.
Lv & Wang (2024) [Ref 14]Systematic review + meta-analysisYes (PROSPERO)YesYesYesProspectively registered. Dual screening confirmed. Meta-analytic sensitivity analyses (influence analysis, Egger’s test for publication bias, subgroup by pathogen and algorithm) conducted. PROBAST applied. COI declared.
Summary (n = 10)MixedYes: 0/10 Partial: 0/10 No: 1/10 NR: 3/10Yes: 8/10 Partial: 0/10 No: 0/10 NR: 1/10 N/A: 1/10Yes: 4/10 Partial: 0/10 No: 2/10 NR: 0/10 N/A: 1/10Yes: 9/10 NR: 1/10Majority of peer-reviewed primary reviews prospectively registered and dual-screened. Formal sensitivity/validation analyses most consistently applied in meta-analyses (Tang, Lv & Wang) and TRACE-based reviews (Birkegård, Brinch). COI reporting near-universal across peer-reviewed items.

* Schardong et al. (2026) [preprint] and Acharya et al. (2023) [protocol] are included for contextual framing only and were excluded in the pre-specified sensitivity analysis. Their appraisal results reflect the inherent limitations of these study types. COI, conflict of interest; D, domain; N/A, not applicable; NR, not reported; PROBAST, Prediction model Risk Of Bias ASsessment Tool; PROSPERO, International Prospective Register of Systematic Reviews; QUADAS-2, Quality Assessment of Diagnostic Accuracy Studies 2; ROBINS-I, Risk Of Bias in Non-randomised Studies of Interventions; TRACE, documentation standard for model credibility.

Interpretation

Across the eight peer-reviewed primary reviews (excluding the preprint and protocol), five (62.5%) were prospectively registered in PROSPERO; all eight reported dual independent screening. Formal sensitivity or validation analyses were reported in five of eight (62.5%) peer-reviewed reviews, with the strongest appraisal practice observed in the two meta-analyses (Tang et al. and Lv & Wang) and the two TRACE-based mechanistic model reviews (Birkegård et al. and Brinch et al.). Conflict-of-interest declarations were present in all peer-reviewed primary reviews. These findings contextualise confidence in the synthesis: the evidence base is predominantly dual-screened and largely prospectively registered, but formal sensitivity analysis is inconsistently applied, particularly among scoping reviews of mechanistic models.

Supplementary File S5

Pairwise Corrected Covered Area (CCA) Matrix and Overlap Interpretation

1. Method

To quantify the degree of primary study citation overlap among the 10 included reviews, a corrected covered area (CCA) analysis was performed following the method of Pieper et al. (2014). The pairwise CCA between any two reviews i and j was calculated as:

Pairwise CCA(i,j)  =  S(i,j)  /  √(N_i × N_j)

where S(i,j) is the number of citations shared between reviews i and j; N_i is the total number of unique citations in review i; and N_j is the total number of unique citations in review j. The overall CCA across all reviews was also calculated using the formula described in the Methodology section of the main manuscript: CCA = (A − r) / [N × r × (r − 1) / 2], yielding a median pairwise CCA of 0.07 (range 0.00–0.19).

CCA values are interpreted as follows: ≤0.05 = slight overlap; 0.06–0.10 = moderate overlap; 0.11–0.15 = high overlap; >0.15 = very high overlap (Pieper et al., 2014). The analysis was conducted based on the primary study reference lists of the included reviews.

2. Review Abbreviations

R1 — Ramsay et al. (2018)     [Mechanistic; scoping review; 81 dynamic AMR models]

R2 — Birkegård et al. (2018)  [Mechanistic; systematic review; 38 AMR models; TRACE framework]

R3 — Niewiadomska et al. (2019) [Mechanistic; systematic review; 273 transmission models]

R4 — Leclerc et al. (2019)    [Mechanistic; systematic review; 43 HGT models]

R5 — Brinch et al. (2025)     [Mechanistic; systematic review; 170 transmission models]

R6 — Schardong et al. (2026)* [Mechanistic; scoping review (preprint); 36 models]

R7 — Acharya et al. (2023)*   [One Health protocol; included for contextual framing only]

R8 — Tang et al. (2022)       [ML; systematic review + meta-analysis; 25 prediction studies]

R9 — Ardila et al. (2025)     [ML; systematic review; 21 prediction studies]

R10 — Lv & Wang (2024)        [ML; systematic review + meta-analysis; varied prediction studies]

* Included for contextual framing only; excluded in the pre-specified sensitivity analysis.

3. Colour-Coding Key

Table S5-A. CCA colour-coding and interpretation.

CCA RangeColourInterpretation
≥ 0.16 Very high overlap — substantial shared primary evidence base; synthesis signals from this pair should be interpreted with caution to avoid double-counting.
0.11–0.15 High overlap — considerable shared citations; overlap explicitly noted in synthesis narrative.
0.06–0.10 Moderate overlap — some shared primary studies; limited risk of double-counting.
0.01–0.05 Slight overlap — small number of shared citations; double-counting risk minimal.
0.00 No overlap — distinct primary evidence bases.
 Diagonal — self-comparison (not applicable).

4. Pairwise CCA Matrix

Table S5-B. Pairwise CCA values for all 45 pairwise combinations of the 10 included reviews.

 R1R2R3R4R5R6*R7*R8R9R10
R10.120.190.070.140.110.040.010.010.01
R20.120.170.080.150.100.050.010.010.00
R30.190.170.060.180.130.060.020.010.01
R40.070.080.060.070.050.030.000.000.00
R50.140.150.180.070.140.070.020.010.01
R6*0.110.100.130.050.140.050.010.000.00
R7*0.040.050.060.030.070.050.020.010.01
R80.010.010.020.000.020.010.020.090.08
R90.010.010.010.000.010.000.010.090.11
R100.010.000.010.000.010.000.010.080.11

5. Summary Statistics

Table S5-C. Summary of pairwise CCA statistics overall and by review tradition.

StatisticAll pairwise comparisons (n = 45)Mechanistic–mechanistic pairs (R1–R7)ML–ML pairs (R8–R10)
Number of pairs45213
Minimum0.000.030.08
Maximum0.190.190.11
Range0.00–0.190.03–0.190.08–0.11
Mean0.0560.0980.093
Median0.040.080.09
InterpretationSlight-to-moderate (median 0.07)Moderate (median 0.08)Moderate (median 0.09)

6. Interpretation

Overall overlap (median CCA = 0.07; slight-to-moderate). The overall evidence base of the 10 included reviews is substantially distinct, reducing the risk of double-counting signals in the synthesis. The 45 pairwise CCA values range from 0.00 to 0.19, with 73% of pairs in the “slight” (≤0.05) or “moderate” (0.06–0.10) range.

Within-tradition mechanistic pairs (R1–R7; median CCA = 0.12; moderate-to-high). The five primary mechanistic systematic/scoping reviews (R1–R5) share a substantially overlapping foundational primary study base: Ramsay (2018), Birkegård (2018), Niewiadomska (2019), and Brinch (2025) all draw on the same corpus of AMR transmission modelling literature. The highest pairwise CCA (0.19) is observed between Niewiadomska et al. (R3) and Ramsay et al. (R1), both of which comprehensively surveyed population-level transmission models published up to 2018–2019. These overlapping pairs are explicitly flagged in the synthesis narrative, and conclusions drawn from these reviews are jointly attributed.

Within-tradition ML pairs (R8–R10; moderate overlap). The three ML prediction reviews (Tang 2022, Ardila 2025, Lv & Wang 2024) share a modest degree of citation overlap (CCA 0.08–0.11), reflecting the rapid growth of the ML-AMR prediction literature and the distinct temporal and pathogen-scope emphases of each review. These pairs are noted in the synthesis.

Cross-tradition pairs (mechanistic vs ML; CCA 0.00–0.02). Cross-tradition overlap is negligible, confirming that the mechanistic modelling and ML prediction traditions draw on substantially independent primary evidence bases. This supports the appropriateness of synthesising both traditions within a single scoping review of reviews without significant double-counting risk.

Leclerc et al. (R4 — HGT models). The HGT modelling review shows uniformly low pairwise CCA with all other reviews (range 0.00–0.08), reflecting its highly specialised focus on horizontal gene transfer models in vitro, which has minimal citation overlap with population-level transmission models or clinical ML prediction studies.

Reference: Pieper D, Antoine SL, Mathes T, Neugebauer EAM, Eikermann M. Systematic review finds overlapping reviews were not mentioned in every other overview. J Clin Epidemiol 2014;67:368–375. https://doi.org/10.1016/j.jclinepi.2013.11.007.


Premier Science
Publishing Science that inspires