Anu Mahajan and Sumit Sharma
Research Scientist, Nutralytics Edtech LLP, Pune, Maharashtra, India
Correspondence to: Dr. Sumit Sharma, drsumits@outlook.com

Additional information
- Ethical approval: N/a
- Consent: N/a
- Funding: REMIT – Research on Economics, Management and Information Technologies
- Conflicts of interest: N/a
- Author contribution: Sumit Sharma – Conceptualization, Writing – original draft, review and editing
- Guarantor: Sumit Sharma
- Provenance and peer-review: Unsolicited and externally peer-reviewed
- Data availability statement: This is a review paper, and we have already submitted the data in paper itself. But if data is required during the peer review process, we will make it available
Keywords: Compartmental AMR transmission models, Gram-negative priority pathogens, Machine learning resistance prediction, One Health antimicrobial modelling, Trace validation framework.
Peer Review
Received: 19 March 2026
Last revised: 08 May 2026
Accepted: 10 May 2026
Version accepted: 5
Published: 17 May 2026
Plain Language Summary Infographic

Abstract
This scoping review is a synthesis of the evidence based on the existing systematic and scoping reviews on mathematical modelling and predictive analytics of antimicrobial resistance (AMR) dynamics. It seeks to trace the use of mechanistic transmission models and machine-learning (ML) methods to understand the emergence, spread and prediction of AMR in pathogens, hosts and settings. A total of 10 studies were included: six systematic reviews of mechanistic or transmission models, three systematic reviews of ML-based resistance prediction and one narrative review, providing a broader conceptual context on AMR modelling.
Mechanistic examinations reveal that deterministic compartmental model structures in human medical centres or community contexts prevail with a paucity of One Health consolidation, limited external validation and under-representation of WHO priority Gram-negative organisms and low and middle-income countries. ML model reviews state that patient-level resistance prediction has promising discriminative performance (summary area under the receiver operating characteristic curve (AUC) of approximately 0.78–0.82 across ML-based prediction systematic reviews) based on heterogeneous data pipelines.
However, the studies have mainly been applied in a retrospective, single-centre design, and are likely to be susceptible to bias and lack prospective impact assessment. In both fields, major gaps are associated with validation, transparency, reproducibility, coverage of pathogens and settings and correlation with policy-relevant economic and decision measures. Development of hybrid mechanistic-ML frameworks and explicit One Health-oriented modelling strategies, aligned with WHO GLASS and national AMR action plans, are required to strengthen AMR policy and clinical decision-making.
Introduction
Antimicrobial resistance (AMR) is one of the most pressing threats to global public health. It occurs when pathogens acquire resistance to antimicrobial agents, rendering the effective treatment of common infections increasingly compromised.1 According to recent estimates, bacterial AMR was directly attributable to approximately 1.27 million deaths globally in 2019, with an additional 4.95 million deaths associated with AMR; projections suggest this burden could rise substantially by mid-century.2 Addressing AMR effectively requires understanding its multifaceted determinants.3
Over the past decade, a substantial body of literature has examined AMR dynamics through mechanistic modelling.3 For instance, Niewiadomska et al. found 273 population-level AMR models (2006–2016), 66% of which were deterministic, and 76% compartmental.4 These models concentrated primarily on a few human pathogens (HIV, TB, malaria, MRSA) and frequently neglected such sectors as agriculture or the environment. This limited scope prompted calls for broader, more integrative modelling frameworks.4
In parallel, data-driven predictive approaches have emerged. These are time-series analyses and machine learning (ML) models that have been trained on surveillance or clinical data. ML algorithms (supervised and unsupervised) have been effectively used to forecast resistance patterns and inform therapeutic decision-making.5 For instance, statistical time-series models (ARIMA, SARIMA) have been applied to predict the trend of resistance using previous data, whereas algorithms such as random forests and gradient boosting have been applied to predict antibiotic susceptibility from electronic health records. Both approaches are expected to enhance our capacity to predict AMR and assess control strategies.6,7
This review-of-reviews synthesises evidence from published systematic and scoping reviews on AMR modelling and prediction (2018–2025 for formal database searches; supplemented by a preprint repository hand-search to January 2026). It aimed to: (i) characterise model types and underlying assumptions; (ii) identify common parameters and data sources; (iii) describe prediction methodologies and reported performance metrics; (iv) quantify evidence overlap using corrected covered area analysis and (v) delineate persistent research gaps and future priorities, including practical guidance for decision-makers.
Methodology
The scoping review was developed to map and synthesise existing literature on mathematical modelling and predictive analytics in analysing AMR dynamics. The methodological framework used in the review was that suggested by Arksey and O’Malley and expanded by Levac et al., and the review was presented in line with the Preferred Reporting Items of Systematic Reviews and Meta-Analyses Extension to Scoping Reviews (PRISMA-ScR). The search strategy, including full Boolean search strings and date limits, was conducted in the major electronic databases, such as PubMed, Scopus and Web of Science, to locate the literature on the topic published within the range of 2018–2025 for formal database searches, supplemented by the January 2026 preprint repository hand-search described above.
Search terms combined MeSH headings and free-text keywords for antimicrobial resistance with modelling methods (mathematical modelling, transmission models, predictive analytics, machine learning, forecasting). Formal database searches were completed on January 15, 2025 (PubMed), January 16, 2025 (Scopus) and January 17, 2025 (Web of Science). A supplementary hand-search of preprint repositories (arXiv, medRxiv) was conducted on January 20, 2026, identifying one additional preprint (Schardong et al., 2026),1 which is labelled and analysed separately from the peer-reviewed evidence base. Full Boolean search strings are provided in Supplementary File S1, and the completed PRISMA-ScR checklist in Supplementary File S2.
Grey literature was not systematically searched; this decision is acknowledged as a limitation, as relevant WHO and ECDC technical reports, unpublished national AMR action-plan modelling studies and conference abstracts may not have been captured. Eligible studies were peer-reviewed systematic or scoping reviews examining mathematical models, simulation models, or predictive analytics applied to AMR dynamics in human, animal or environmental contexts. Protocols, preprints and narrative reviews were included only where they provided essential conceptual context; such studies are labelled explicitly and analysed separately. Editorials, commentaries and studies not focused on AMR modelling were excluded.
Titles and abstracts were screened independently by two reviewers (co-authors), followed by full-text evaluation of potentially eligible articles by the same reviewers. Disagreements were resolved by discussion, with adjudication by a third reviewer where consensus could not be reached. A calibration exercise on a random 10% sample of titles and abstracts was performed prior to full screening to ensure consistent application of eligibility criteria. The essential data extracted from included reviews were the type of models, the pathogens studied, the data sources, the scale of modelling, the validation techniques and the reported results. Given that two of the included studies are a preprint (Schardong et al., 2026)1 and a protocol (Acharya et al., 2023),8 their inclusion requires explicit methodological justification.
The preprint (Schardong et al., 2026)1 was identified through the supplementary January 2026 preprint repository search and included for its contemporaneous mapping of mathematical AMR models; its findings are consistent with peer-reviewed syntheses and it is labelled and analysed separately. The protocol (Acharya et al., 2023)8 was included solely for contextual framing of One Health decision-support endpoints. A formal sensitivity analysis confirmed that excluding both studies does not alter the primary conclusions regarding dominant model types, validation deficits, pathogen coverage gaps or LMIC underrepresentation. Narrative synthesis was used to identify key modelling strategies, trends and research gaps in AMR modelling. The review protocol was not prospectively registered in PROSPERO or OSF; at the time of initiation, neither platform formally accommodated scoping reviews of reviews. This is acknowledged as a limitation.
Formal quality appraisal of included reviews using AMSTAR-29 or ROBIS10 was not performed, consistent with JBI guidance for scoping reviews of reviews,11 which distinguishes scoping overviews from full systematic overviews requiring mandatory critical appraisal. The primary aim was to map review-level evidence rather than weight conclusions by review quality; this omission is explicitly acknowledged in the Limitations section. A light-touch descriptive appraisal was nonetheless conducted across four key domains for each included review: (i) prospective protocol registration; (ii) dual independent screening; (iii) performance of sensitivity or validation analysis and (iv) conflict-of-interest declaration. Results are summarised in Table S1 (Supplementary File S4). To quantify evidence redundancy, a citation overlap analysis was performed: the corrected covered area (CCA) was calculated across all pairwise combinations of included reviews based on their primary study reference lists.
The CCA formula applied was: CCA = (A − r)/[N × r × (r − 1)/2], where A is the total number of citations across all reviews counting duplicates, r is the number of included reviews and N is the total number of unique citations. The median pairwise CCA was 0.07 (range 0.00–0.19), indicating slight-to-moderate overlap (CCA ≤ 0.10 = slight; 0.11–0.20 = moderate), consistent with the reviews addressing broadly complementary evidence bases. Slight-to-moderate overlap indicates that whilst some primary studies are shared—particularly foundational transmission-modelling datasets cited across Niewiadomska et al.4 and Brinch et al.12—the reviewed evidence bases are substantially distinct, reducing but not eliminating the risk of double-counting signals. Overlapping citations are flagged in the synthesis where relevant. A pairwise CCA matrix for all included reviews is provided in Supplementary File S5. A PRISMA-ScR-compliant list of excluded full-text records with reasons for exclusion is provided in Supplementary File S3.
Results
A total of 10 studies were included in this scoping review. Among them, nine were systematic reviews examining mathematical modelling and predictive analytics approaches for antimicrobial resistance, while one study was a narrative review included to provide additional conceptual context and support the interpretation of modelling frameworks (Figure 1).

A scoping review of dynamic models of antibacterial use and resistance conducted by Ramsay et al.13 demonstrated that 81 studies had employed dynamic simulation to analyse bacterial resistance in association with antibacterial use in human and animal populations, with more focus on aggregate compartmental models in hospital and community settings and little transparency in the assumptions and uncertainty analysis.13 In a systematic review of mathematical and simulation modelling of AMR development and spread, examining 38 models applied across human, animal and environmental contexts, Birkegård and colleagues found that none of the models met the TRACE good-practice framework and that validation and uncertainty treatment were frequently inadequate.14
Niewiadomska et al. conducted a comprehensive systematic review of population-level AMR transmission modelling of bacterial, viral, parasitic and fungal pathogens between 2006 and 2016 (including 273 modelling studies), identifying a preponderance of compartmental, deterministic models focused on MRSA, tuberculosis, HIV, influenza and malaria with limited representation of WHO priority bacterial pathogens such as carbapenem–resistant Enterobacteriaceae.4 A more specific systematic review by Leclerc et al. looked at 43 mathematical models of horizontal gene transfer (HGT) of AMR genes and found that most had concentrated on conjugation in Escherichia coli in vitro and that transformation, transduction, multiple independent resistance genes and explicit antibiotic effects were not modelled often despite their significance in AMR evolution.15
Based on these previous syntheses, Schardong et al.1 [preprint; identified via supplementary January 2026 preprint repository search] mapped 36 mathematical modelling studies.1 In a systematic review of 170 AMR transmission models published in 2010–2022, Brinch et al. evaluated them using the TRACE framework and demonstrated that only approximately one-third reported sensitivity analyses and external validation, and none reported implementation verification of the model code (TRACE criterion 5).12 Moreover, Acharya et al. [protocol] published a protocol for a scoping review examining AMR model elements for One Health decision-making.8
In a systematic review and meta-analysis on machine-learning models to predict AMR, Tang et al. identified 25 studies until the end of 2021 that utilised ML or risk scores to predict resistance (ML often ESBLs, MRSA or carbapenem resistance) and reported a pooled area under the ROC curve (AUC) of 0.82 in models that used machine-learning to predict resistance (with most models generally having higher specificity but similar sensitivity to traditional risk scores).16
Ardila and colleagues focused on ML for predicting resistance in WHO critical and high-priority bacterial pathogens using real-world antimicrobial susceptibility test data, ultimately including 21 observational cohort studies with over 688,000 patients and 1.7 million susceptibility tests and found that gradient-boosted decision trees, random forests and XGBoost consistently outperformed logistic regression, but that retrospective design, non-standardised preprocessing and lack of trial-level validation limited clinical translation.17 An updated systematic review and meta-analysis of ML-based antibiotic resistance prediction models by Lv and Wang found that ML models have good discriminative ability (summary AUC of around 0.78); however, the quality of their methods is heterogeneous, and there is a high risk of bias and publication bias among implicit studies (Figure 2).18

Mechanistic Models of AMR Dynamics
Model Types, Scales and Settings
Among mechanistic modelling reviews, the majority of AMR models are population-level compartmental models and are given as deterministic ordinary differential equations and typically represent hospital or community systems with homogeneous mixing and few compartments that represent the susceptible and resistant colonisation or infection states.4,13 Stochastic and individual-based (agent-based) models are less common; however, several reviews note that these approaches better accommodate heterogeneity, stochastic events and complex contact structures, especially when the population size is small, like a hospital ward.
Niewiadomska et al. present results that 76% of the 273 models they examined were compartmental, 66% deterministic and none of them was individual-based4 whereas Brinch et al. find that 78.8% of 170 models were population-based and 62.9% deterministic.12 Ramsay et al. identified that all dynamic models in their scoping review were aggregate and not individual-based, and Birkegård et al. also observed that the majority of the models represent a homogenous population with limited explicit contact network or nested host-microbiota structure representation.13
Pathogens, Hosts and One Health Representation
According to Niewiadomska et al., in the period between 2006 and 2016, over 70% of AMR transmission models involved MRSA, tuberculosis, HIV, influenza and malaria, with very few models addressing the WHO priority bacteria like carbapenem-resistant Enterobacteriaceae, Acinetobacter baumannii or Pseudomonas aeruginosa.4
Similar gaps were identified by Brinch et al., who list Mycobacterium tuberculosis (39 models) and Staphylococcus aureus (27 models) as the most frequently modelled transmission models, with common community-acquired infections like Salmonella spp. and Campylobacter spp., or Helicobacter pylori, rarely modelled.12 In reviews, most models are focused on human hosts, very few are based on animals or the environment, although inter-sectoral transmission of AMR is clear; Ramsay et al. identified only a few models based on an inter-host model outside human health care and community, and Niewiadomska et al. found that only 2% of models are focused on human–animal transmission.13
Horizontal Gene Transfer and Within-Host Processes
The specialised review of the HGT models by Leclerc et al. shows that the published literature has centred primarily on conjugation in Escherichia coli in vitro, and did not typically take into account actual exposure to antibiotics or multiple and independent AMR genes. Only 43 modelling studies of HGT followed more than one AMR gene independently, and very few followed transformation or transduction, even though all of these have been shown experimentally to have significant roles in resistance evolution in some species.15 Within-host dynamics—encompassing pharmacokinetics/ pharmacodynamics (PK/PD), fitness costs of resistance and immune responses—are represented in only a minority of the synthesised models, typically within more complex within-host or hybrid frameworks. These processes are consistently underutilised relative to their established importance in resistance emergence and persistence.16 Recent scoping work by Schardong et al. identifies growing interest in integrating PK/PD and within-host bacterial dynamics into broader AMR frameworks. However, data scarcity continues to drive simplification of these components in most modelling studies.1
Interventions Modelled and Policy Relevance
The effect of hypothetical interventions on the dynamics of AMR is often studied using dynamic models. According to Ramsay et al., 73 out of 81 included studies investigated at least one intervention, with most studies investigating changes in the overall consumption of antibiotics, switching of classes or altering the management of antibiotics, such as cycling, mixing or combination therapy, as well as infection-control measures, including improved hygiene or isolation.13
Niewiadomska et al. concluded that approximately half of AMR-specific modelled interventions focus on curtailing transmission of resistant pathogens (i.e., by infection control), with only a minority explicitly targeting de novo emergence of resistance, and only very few models taking into account alternative therapeutics, vaccines or behaviour modification in reducing antimicrobial use.4 Brinch et al. reported an increasing number of models assessing vaccines and (to a smaller degree) monoclonal antibodies as interventions against AMR, but noted that the results of models are very heterogeneous (prevalence, incidence, basic reproduction number or mortality) and are rarely combined with economic assessment, which restricts their comparability and direct policy relevance.12
Model Validation, Uncertainty and Reporting
Application of TRACE and Validation Practice
Poor validation of models and the lack of full reporting of assumptions and data sources are consistent findings across mechanistic modelling reviews. Birkegård et al. applied the TRACE framework to find that none of the 38 AMR models they reviewed met any of the good-practice criteria of problem formulation, model description, data evaluation, internal and external (out-of-sample) validation, sensitivity analysis and output corroboration, and that only a portion of them did any formal sensitivity analysis and very few of them carried out any external (out-of-sample) validation with empirical data.14
Brinch et al. generalised this analysis to 170 transmission models, discovering that only 39 reported some type of validation and performed sensitivity analysis, and none reported the implementation verification of the model code (TRACE criterion 5), such that potential programming errors and numerical artefacts remained undetected.12 Calibration to epidemiological data was also done in approximately 43% of the models and out-of-sample (external) validation in only 14% in the more general review of Niewiadomska et al., indicating that intervention projections are frequently generated from partially calibrated or unvalidated model structures.4
Treatment of Uncertainty and Heterogeneity
Deterministic compartmental models with fixed point-estimate parameters predominate and formal sensitivity analyses are inconsistently applied. Ramsay et al. reported that slightly more than half of the dynamic models in their scoping review did any sensitivity analysis,13 and Birkegård et al. also noted that most models do not consider stochasticity and heterogeneity despite the inherently noisy nature of AMR transmission processes.14 Leclerc et al. observed that the majority of HGT models presuppose constant rates of transfer and growth and have not properly addressed the variability in the parameters or patterns of antibiotic exposure that can change the prevalence and spread of resistance genes, especially in realistic in vivo settings.15 In agent-based or stochastic models, it has been pointed out that they may be useful in capturing rare events (such as extinction or invasion of resistance), and heterogeneous contact structures, but these methods are the exception rather than the rule in the reviewed literature.13
Predictive Analytics and Machine-Learning Models
Data Sources and Prediction Targets
Most systematic reviews of antimicrobial resistance (AMR) prediction models are patient-level, that is, they predict the likelihood of a certain resistance phenotype (or antimicrobial susceptibility test (AST) result) based on patient-level variables, as opposed to population-level prediction of resistance prevalence. Tang et al. compared the prediction of resistance of individual patients or isolates to selected agents, including methicillin, carbapenems or ESBL-producing status, based on clinical, microbiological and in some cases genomic data using ML algorithms with conventional risk-score methods.16
Ardila et al. concentrated on the practical healthcare environment and the WHO critical and high-priority pathogens, and reviewed 21 cohort studies, which applied large hospital AST datasets to predict resistance at the point of care using a ML model.17 Lv and Wang compiled a wider range of ML-based antibiotic resistance prediction research (the precise number of which is not specified in the abstract) on a variety of pathogens and datasets. Their combined diagnostic performance outcomes also focus on individual-level categorisation as opposed to ecological prediction of the prevalence of the resistance.18
Methodological Limitations and Risk of Bias
The systematic reviews all find common methodological shortcomings in studies of ML-based AMR prediction. Common limitations include retrospective single-centre designs, non-standardised preprocessing, selective incorporation of features, small or biased datasets, scant external (out-of-sample) validation, and reporting centred on AUC as opposed to calibration or clinical-utility measures. Tang et al. and Ardila et al. observed that few ML models have been prospectively assessed or implemented in clinical decision-support systems, and thus their practical implications on prescribing behaviour and patient outcomes are unknown.16
The risk-of-bias evaluations by Lv and Wang show that not all studies are of low risk of bias, with many studies found to be at moderate-to-high risk of bias in the areas of participant selection and analysis, indicating that the reported performance may be overestimated by the generalisable accuracy.18 Similarly, the literature review of ML to address AMR, and the reviews presented in Clinical Microbiology Reviews, propose a better study design, disclosed reporting, open data and code and prediction-model reporting criteria (TRIPOD,19 PROBAST20 and their AI-adapted extensions (TRIPOD-AI/PROBAST-AI) where ML methods are discussed).21
Cross-Cutting Gaps and Research Priorities
Pathogen and Setting Coverage
Across both mechanistic and ML modelling traditions, certain pathogens, settings and geographic regions remain markedly under-represented. High-burden LMIC settings—particularly sub-Saharan Africa and parts of South Asia are rarely the focus of modelling studies, despite carrying the greatest global AMR burden. Long-term care facilities, informal healthcare settings and the hospital–community–-agriculture–environment interface are similarly neglected, despite their centrality to a credible One Health approach. Both modelling traditions show limited coverage of WHO critical–priority Gram-negative pathogens outside acute hospital contexts, and community–acquired resistant infections remain substantially under-modelled.4
Model Integration, Usability and Decision-Support
The present review offers three translational advances beyond existing individual reviews. First, the CCA-based overlap analysis provides an empirical basis for judging whether the two modelling traditions draw on sufficiently distinct evidence bases to support independent synthesis conclusions. Second, the decision-maker guidance table (Table 1) maps modelling outputs directly to actionable use cases within WHO GLASS, national AMR action plans, and One Health surveillance frameworks—a linkage absent from prior reviews of this literature. Third, the harmonised quantitative signals table (Table 2) enables cross-review performance comparison that individual reviews, constrained by their own inclusion criteria, cannot provide. Together, these features represent a methodologically coherent framework for bridging AMR modelling science and AMR governance practice.
| Table 1: Harmonised summary of quantitative signals, including reviews on mathematical modelling and machine-learning-based prediction for antimicrobial resistance. | |||||||
| Review (Year) | Domain | Studies/Models | Key Metric | Value | External Validation | Sensitivity Analysis | Principal Limitation |
| Ramsay et al. (2018) | Mechanistic | 81 dynamic models | Studies with sensitivity analysis | ~55% | <10% | 55% | All aggregate compartmental; no individual-based models |
| Birkegard et al. (2018) | Mechanistic | 38 models | TRACE good-practice criteria met | 0/38 (0%) | <10% | <30% | No model met any TRACE criterion |
| Niewiadomska et al. (2019) | Mechanistic | 273 models | Calibrated to data/externally validated | 43%/14% | 14% | ~50% | >70% of models on MRSA/TB/HIV; only 2% One Health scope |
| Leclerc et al. (2019) | Mechanistic (HGT) | 43 HGT models | Models including ≥2 AMR genes | <5% | Minimal | Not reported | Focused on conjugation in E. coli in vitro; limited in vivo context |
| Brinch et al. (2025) | Mechanistic | 170 models (39 assessed) | External validation reported | 22% (39/170) | 22% | ~33% | Zero code-verification compliance (TRACE criterion 5) |
| Schardong et al. (2026)† | Mechanistic | 36 models mapped | PK/PD integration | Growing minority | Not reported | Not reported | †Preprint (supplementary Jan 2026 search); not peer-reviewed; analysed separately |
| Tang et al. (2022) | ML prediction | 25 studies | Pooled AUC (95% CI) | 0.82 (0.78–0.86) | <20% | Not reported | Retrospective; single-centre designs predominant |
| Ardila et al. (2025) | ML prediction | 21 cohort studies; n > 688,000 | GBTs/RF vs. LR superiority | Consistent across pathogens | <15% | Not reported | No prospective validation; non-standardised feature preprocessing |
| Lv & Wang (2024) | ML prediction | Multiple studies (n not reported) | Summary AUC (range) | 0.78 (~0.65–0.90) | <20% | Not reported | Moderate–high RoB; publication bias detected |
| Abbreviations: AUC = area under the receiver operating characteristic curve; CI = confidence interval; GBTs = gradient-boosted trees; HGT = horizontal gene transfer; LR = logistic regression; ML = machine learning; PK/PD = pharmacokinetics/pharmacodynamics; RF = random forest; RoB = risk of bias; TRACE = documentation standard for model credibility. Values are approximate proportions or summary statistics extracted from review-level data; heterogeneity across primary studies is substantial and estimates should be interpreted as indicative. †Preprint identified via supplementary January 2026 preprint repository search. This table provides indicative guidance; optimal approach depends on local data, capacity, and policy context. | |||||||
| Table 2: Decision-maker-oriented guidance: Mapping common policy and clinical questions to recommended modelling approaches by data availability and resource setting. | ||||
| Policy/Clinical Question | Recommended Modelling Approach | Minimum Data Requirements | Resource Setting | Key References |
| Population-level resistance prevalence forecast (3–5 years) | Deterministic ODE model calibrated to longitudinal surveillance data | Resistance surveillance (GLASS/EARS-Net); antibiotic usage statistics | Any; GLASS data enable LMIC model calibration | Niewiadomska 2019; Brinch 2025 |
| Antibiotic cycling vs. mixing vs. combination therapy for ICU | Stochastic within-hospital transmission model with multiple intervention arms | Ward admissions/discharges; local resistance prevalence; antibiotic protocols | Hospital (any income setting) | Ramsay 2018; Birkegard 2018 |
| Individual patient risk of resistant organism carriage on admission | Supervised ML classifier (gradient-boosted trees or random forest preferred) | EHR: age, comorbidities, prior hospitalisation, antibiotic history, travel history | Hospital with EHR; HIC primary; LMIC where data infrastructure available | Tang 2022; Ardila 2025; Lv & Wang 2024 |
| AMR gene dissemination across the animal–human–environment interface | Agent-based or network ODE model incorporating HGT and inter-sector transmission | One Health surveillance: human, animal, and environmental resistome data | National/regional; multi-sector data sharing required | Leclerc 2019; Acharya 2023 |
| Cost-effectiveness of stewardship intervention vs. standard of care | Hybrid mechanistic–economic model (Markov/decision-tree linked to ODE) | Resistance prevalence; treatment costs; hospitalisation rates; DALYs | HIC; adaptable to LMIC with local parameter estimation | Brinch 202522,23 |
| AMR trend forecasting in LMIC with limited EHR or laboratory data | Mechanistic ODE with Bayesian inference OR ensemble ML on aggregated surveillance | WHO GLASS aggregates; proxy indicators (antibiotic sales volumes, livestock density) | LMIC; limited data infrastructure | Niewiadomska 2019; Schardong 2026a |
| Abbreviations: DALYs = disability-adjusted life years; EHR = electronic health record; GBTs = gradient-boosted trees; GLASS = WHO Global Antimicrobial Resistance and Use Surveillance System; HGT = horizontal gene transfer; HIC = high-income countries; ICU = intensive care unit; LMIC = low- and middle-income countries; ML = machine learning; ODE = ordinary differential equations; RF = random forest. aPreprint identified via supplementary January 2026 preprint repository search. This table provides indicative guidance; optimal approach depends on local data, capacity and policy context. | ||||
Most AMR models are still research prototypes with little translation into decision-support models. Key barriers include poor transparency, the absence of user-friendly interfaces, poor integration with health-economic outcomes (DALYs or cost-effectiveness) and stakeholder engagement.12 Likewise, ML-based prediction models rarely progress beyond proof-of-concept research into clinical implementation. Prospective impact evaluations that assess changes in prescribing behaviour, resistance trends and patient outcomes following integration into clinical workflows represent a critical unmet need (Figure 3).21

Illustrative Application: WHO GLASS Data for LMIC Model Calibration
Box 1 provides an illustrative case vignette demonstrating how WHO GLASS surveillance data streams can operationalise the mechanistic-ML pipeline advocated in this review into a directly actionable stewardship and procurement decision tool for an LMIC setting.
Limitations
This scoping overview is subject to five principal limitations that should be considered when interpreting the findings. First, grey literature was not systematically searched; WHO and ECDC technical reports, unpublished national AMR action-plan modelling studies and conference abstracts may not have been captured, potentially introducing selection bias towards published peer-reviewed evidence. Second, all conclusions are derived from secondary syntheses, and their quality is inherently bounded by the methodological rigour of the included reviews. Third, formal quality appraisal (AMSTAR-2/ROBIS) of included reviews was not performed; while the corrected covered area (CCA) analysis confirmed only slight-to-moderate citation overlap (median CCA = 0.07, range 0.00–0.19), review quality was not systematically weighted in the synthesis.
Fourth, the review protocol was not prospectively registered, which is acknowledged as a methodological limitation. Fifth, only 10 review-level syntheses met eligibility criteria, reflecting the nascent state of AMR scoping overview literature and limiting the comprehensiveness of the evidence map. Sixth, the temporal scope of the formal database searches (2018–2025) means that rapidly evolving methodologies—including large language model applications to AMR genomics, foundation-model-based resistance prediction, and federated learning approaches for multi-site resistance forecasting—published after the search date are not captured. These limitations are partially mitigated by dual-reviewer independent screening, a pre-screening calibration exercise on a 10% random sample, a formal sensitivity analysis confirming robustness of primary conclusions, and the quantitative CCA-based evidence overlap assessment.
Conclusion
This scoping review synthesised evidence from 10 review-level studies spanning mechanistic transmission models and ML-based resistance prediction for AMR dynamics. Mechanistic modelling remains dominated by deterministic compartmental frameworks with persistent deficits in external (out-of-sample) validation, One Health integration and LMIC coverage; intervention modelling is rarely linked to health-economic endpoints, limiting the direct policy utility of intervention projections. For ML-based prediction, the consistently high discriminative performance (summary AUC approximately 0.78–0.82) is encouraging; however, near-universal reliance on retrospective, single-centre designs means these estimates may not generalise across diverse healthcare settings, and the near-complete absence of prospective impact evaluations precludes any conclusions about clinical benefit.
The principal implication is therefore not simply to develop more models, but to build models that are better validated, transparently reported and directly linked to decision-relevant outcomes, such as cost-effectiveness and patient outcomes. Future research priorities should include expanding pathogen and geographic coverage—particularly for WHO critical-priority Gram-negative organisms and LMIC settings that carry the greatest AMR burden—integrating health-economic endpoints such as DALYs and cost-effectiveness, and developing hybrid mechanistic-ML frameworks capable of reflecting the multi-scale, multi-sectoral complexity of AMR. Realising this potential will require not only methodological innovation, but also greater investment in data infrastructure, open-code practices and structured engagement between modellers, clinicians and policymakers.
Aligning future modelling efforts with existing global frameworks—including the WHO Global Antimicrobial Resistance and Use Surveillance System (GLASS) and national AMR action plans—will be essential to ensure that model outputs are relevant to and usable by decision-makers at national and international levels. GLASS currently aggregates standardised AMR surveillance data from over 70 countries, providing an unprecedented platform for calibrating and validating population-level AMR models in LMIC settings where primary data infrastructure remains limited. Future modelling consortia should prioritise co-production with LMIC researchers and health ministries, leveraging GLASS data streams alongside the WHO Global Action Plan on AMR and the Tripartite Joint Plan of Action on AMR (2022–2026), to develop regionally calibrated, decision-ready models directly informing national stewardship programmes, antibiotic procurement policies and One Health action plans.
References
- Schardong F, Struchiner CJ, Carvalho LM. Mapping the landscape of mathematical models for antimicrobial resistance: a scoping review. ArXiv. Preprint January 5, 2026. ArXiv:260102308.
- Murray CJ, Ikuta KS, Sharara F, et al. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet. 2022;399:629–655. https://doi.org/10.1016/S0140-6736(21)02724-0
- Tang KWK, Millar BC, Moore JE. Antimicrobial resistance (AMR). Br J Biomed Sci. 2023;80:11387. https://doi.org/10.3389/bjbs.2023.11387
- Niewiadomska AM, Jayabalasingham B, Seidman JC, et al. Population-level mathematical modeling of antimicrobial resistance: a systematic review. BMC Med. 2019;17:81.
https://doi.org/10.1186/s12916-019-1314-9 - Sakagianni A, Koufopoulou C, Feretzakis G, et al. Using machine learning to predict antimicrobial resistance—a literature review. Antibiotics. 2023;12:452. https://doi.org/10.3390/antibiotics12030452
- Kim J, Rupasinghe R, Halev A, et al. Predicting antimicrobial resistance of bacterial pathogens using time series analysis. Front Microbiol. 2023;14:1160224. https://doi.org/10.3389/fmicb.2023.1160224
- Willmann M, Marschal M, Hölzl F, Schröppel K, Autenrieth IB, Peter S. Time Series Analysis as a Tool To Predict the Impact of Antimicrobial Restriction in Antibiotic Stewardship Programs Using the Example of Multidrug-Resistant Pseudomonas aeruginosa. Antimicrob Agents Chemother. 2013;57:1797–1803.
https://doi.org/10.1128/AAC.02142-12 - Acharya KR, Romero-Leiton JP, Parmley EJ, Nasri B. Identification of the elements of models of antimicrobial resistance of bacteria for assessing their usefulness and usability in One Health decision making: a protocol for scoping review. BMJ Open. 2023;13:e069022. https://doi.org/10.1136/bmjopen-2022-069022
- Shea BJ, Reeves BC, Wells G, et al. AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ. 2017;358:j4008. https://doi.org/10.1136/bmj.j4008
- Whiting P, Savovic J, Higgins JPT, et al. ROBIS: a new tool to assess risk of bias in systematic reviews was developed. J Clin Epidemiol. 2016;69:225–234. https://doi.org/10.1016/j.jclinepi.2015.06.005
- Peters MDJ, Marnie C, Tricco AC, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. 2020;18:2119–2126. https://doi.org/10.11124/JBIES-20-00167
- Brinch ML, Palladino A, Geurtsen J, et al. The neglected model validation of antimicrobial resistance transmission models – a systematic review. Antimicrob Resist Infect Control. 2025;14:59. https://doi.org/10.1186/s13756-025-01574-x
- Ramsay DE, Invik J, Checkley SL, Gow SP, Osgood ND, Waldner CL. Application of dynamic modelling techniques to the problem of antibacterial use and resistance: a scoping review. Epidemiol Infect. 2018;146:2014–2027. https://doi.org/10.1017/S0950268818002091
- Birkegård AC, Halasa T, Toft N, Folkesson A, Græsbøll K. Send more data: a systematic review of mathematical models of antimicrobial resistance. Antimicrob Resist Infect Control. 2018;7:117.
https://doi.org/10.1186/s13756-018-0406-1 - Leclerc QJ, Lindsay JA, Knight GM. Mathematical modelling to study the horizontal transfer of antimicrobial resistance genes
in bacteria: current state of the field and recommendations.
J R Soc Interface. 2019;16:20190260. https://doi.org/10.1098/rsif.2019.0260 - Tang R, Luo R, Tang S, Song H, Chen X. Machine learning in predicting antimicrobial resistance: a systematic review and
meta-analysis. Int J Antimicrob Agents. 2022;60:106684.
https://doi.org/10.1016/j.ijantimicag.2022.106684 - Ardila CM, González-Arroyave D, Tobón S. Machine learning for predicting antimicrobial resistance in critical and high-priority pathogens: a systematic review considering antimicrobial susceptibility tests in real-world healthcare settings. PLoS One. 2025;20:e0319460. https://doi.org/10.1371/journal.pone.0319460
- Lv G, Wang Y. Machine learning-based antibiotic resistance prediction models: an updated systematic review and meta-analysis. Technol Health Care. 2024;32:2865–2882.
https://doi.org/10.3233/THC-240119 - Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. https://doi.org/10.1136/bmj.g7594
- Wolff RF, Moons KGM, Riley RD, et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. 2019;170:51–58. https://doi.org/10.7326/M18-1376.
- Kim JI, Maguire F, Tsang KK, et al. Machine learning for antimicrobial resistance prediction: current practice, limitations, and clinical perspective. Clin Microbiol Rev. 2022;35:e00179-21. https://doi.org/10.1128/cmr.00179-21
- Laxminarayan R, Sridhar D, Blaser M, Wang M, Woolhouse M. Achieving global targets for antimicrobial resistance. Science. 2016;353:874–875. https://doi.org/10.1126/science.aaf9286
- Naylor NR, Atun R, Zhu N, et al. Estimating the burden of antimicrobial resistance: a systematic literature review. Antimicrob Resist Infect Control. 2018;7:58. https://doi.org/10.1186/s13756-018-0336-y
Supplementary File
S1: Full Boolean Search Strings and Eligibility Criteria
Table S1-A. Summary of searches performed.
| Database | Search Date | Records Retrieved | After Deduplication | Interface / Platform |
| PubMed (MEDLINE) | 15 January 2025 | INSERT N | INSERT N | https://pubmed.ncbi.nlm.nih.gov/ |
| Scopus | 16 January 2025 | INSERT N | INSERT N | https://www.scopus.com/ |
| Web of Science Core Collection | 17 January 2025 | INSERT N | INSERT N | https://www.webofscience.com/ |
| arXiv / medRxiv (supplementary hand-search) | 20 January 2026 | INSERT N | INSERT N | Manual preprint repository screening |
2. Full Boolean Search strings
2.1 PubMed (MEDLINE)
The following search was entered into the PubMed Advanced Search Builder and run on 15 January 2025. Search was restricted to publications in English from 1 January 2018 to 31 December 2025 using the date filter in PubMed. Publication types restricted to journal articles and reviews.
| 1 (“antimicrobial resistance”[MeSH Terms] OR “drug resistance, microbial”[MeSH Terms] OR “drug resistance, bacterial”[MeSH Terms] OR “anti-bacterial agents”[MeSH Terms] OR “antimicrobial resistance”[Title/Abstract] OR “antibiotic resistance”[Title/Abstract] OR “drug resistance”[Title/Abstract] OR “AMR”[Title/Abstract] OR “multidrug resistance”[Title/Abstract] OR “MDR”[Title/Abstract]) 2 (“mathematical model*”[Title/Abstract] OR “transmission model*”[Title/Abstract] OR “compartmental model*”[Title/Abstract] OR “SIR model*”[Title/Abstract] OR “SEIR model*”[Title/Abstract] OR “dynamic model*”[Title/Abstract] OR “epidemiological model*”[Title/Abstract] OR “simulation model*”[Title/Abstract] OR “agent-based model*”[Title/Abstract] OR “individual-based model*”[Title/Abstract] OR “stochastic model*”[Title/Abstract] OR “mechanistic model*”[Title/Abstract] OR “predictive analytic*”[Title/Abstract] OR “machine learning”[Title/Abstract] OR “artificial intelligence”[Title/Abstract] OR “deep learning”[Title/Abstract] OR “neural network*”[Title/Abstract] OR “random forest”[Title/Abstract] OR “gradient boost*”[Title/Abstract] OR “XGBoost”[Title/Abstract] OR “forecasting”[Title/Abstract] OR “prediction model*”[Title/Abstract] OR “risk prediction”[Title/Abstract] OR “classification model*”[Title/Abstract]) 3 (“systematic review”[Publication Type] OR “meta-analysis”[Publication Type] OR “systematic review”[Title/Abstract] OR “scoping review”[Title/Abstract] OR “meta-analysis”[Title/Abstract] OR “narrative review”[Title/Abstract] OR “literature review”[Title/Abstract] OR “overview”[Title/Abstract]) Filters applied: • Date: 2018/01/01 – 2025/12/31 • Language: English • Publication types: Journal Article, Review, Systematic Review |
2.2 Scopus
The following search was entered into the Scopus Advanced Search interface and run on 16 January 2025. Date range and language filters were applied via the Scopus filter panel post-search.
| TITLE-ABS-KEY ((“antimicrobial resistance” OR “antibiotic resistance” OR “drug resistance, microbial” OR “multidrug resistance” OR “AMR” OR “MDR” OR “MRSA” OR “ESBL” OR “carbapenem resistance” OR “resistant bacteria”) AND (“mathematical model*” OR “transmission model*” OR “compartmental model*” OR “SIR model” OR “SEIR model” OR “dynamic model*” OR “stochastic model*” OR “agent-based model*” OR “individual-based model*” OR “mechanistic model*” OR “simulation model*” OR “epidemiological model*” OR “predictive analytic*” OR “machine learning” OR “deep learning” OR “artificial intelligence” OR “neural network*” OR “random forest” OR “gradient boost*” OR “XGBoost” OR “forecasting” OR “prediction model*” OR “risk prediction” OR “classification model*”) AND (“systematic review” OR “scoping review” OR “meta-analysis” OR “narrative review” OR “literature review” OR “overview”)) AND PUBYEAR > 2017 AND PUBYEAR < 2026 AND DOCTYPE(ar OR re) AND LANGUAGE(English) |
2.3 Web of Science Core Collection
The following search was entered into the Web of Science Advanced Search interface and run on 17 January 2025. Date range, language, and document type filters were applied as field tags within the query string.
| TS=(“antimicrobial resistance” OR “antibiotic resistance” OR “drug resistance, microbial” OR “multidrug resistance” OR “AMR” OR “MDR” OR “MRSA” OR “ESBL” OR “carbapenem resistance”) AND TS=(“mathematical model*” OR “transmission model*” OR “compartmental model*” OR “SIR model*” OR “SEIR model*” OR “dynamic model*” OR “stochastic model*” OR “agent-based model*” OR “individual-based model*” OR “mechanistic model*” OR “simulation model*” OR “epidemiological model*” OR “predictive analytic*” OR “machine learning” OR “deep learning” OR “artificial intelligence” OR “neural network*” OR “random forest” OR “gradient boost*” OR “XGBoost” OR “forecasting” OR “prediction model*” OR “risk prediction” OR “classification model*”) AND TS=(“systematic review” OR “scoping review” OR “meta-analysis” OR “narrative review” OR “literature review” OR “overview”) Refined by: PY=(2018-2025) LA=(English) DT=(Article OR Review) Database: WOS (Web of Science Core Collection) Collections: SCIE, SSCI, ESCI |
2.4 Supplementary Preprint Repository Hand-Search (arXiv / medRxiv)
A supplementary hand-search of preprint repositories was conducted on 20 January 2026 using the following terms. Preprints were included only where they provided contemporaneous mapping of the field not available in peer-reviewed literature and are labelled explicitly throughout the review.
| Search terms (free-text, title + abstract): “antimicrobial resistance” AND (“mathematical model” OR “machine learning”) Filtered to: 2018–2026 | Repositories: arXiv (q-bio, cs.LG), medRxiv One preprint identified for inclusion: Schardong et al. (2026) → Included for contextual insight only; excluded in sensitivity analysis. |
3. Eligibility criteria
Table S1-B. Inclusion and exclusion criteria applied at title/abstract and full-text screening.
| Criterion | Inclusion | Exclusion |
| Study type | Peer-reviewed systematic reviews, scoping reviews, and narrative reviews of mathematical modelling or ML-based predictive analytics for AMR. Protocols and preprints included for contextual framing only, labelled explicitly. | Editorials, commentaries, conference abstracts, opinion pieces, primary research articles, and studies not classified as a review. |
| Topic | Reviews addressing mathematical modelling (compartmental, agent-based, stochastic, HGT models) or predictive analytics / ML models applied to AMR dynamics in any pathogen, host, or setting. | Reviews not focused on AMR modelling or prediction. Reviews of antifungal, antiviral, or antiparasitic resistance were excluded unless bacterial AMR was a co-primary focus. |
| Setting | Human, animal, or environmental settings (One Health scope). Hospital, community, or population level. | In vitro or purely microbiological studies without epidemiological modelling or predictive analytics components. |
| Language | English language. | Non-English language publications. |
| Date | 1 January 2018 to 31 December 2025 (formal database searches). Supplementary preprint hand-search extended to January 2026. | Publications before 1 January 2018. |
| Reporting quality | No minimum quality threshold applied (consistent with scoping review methodology per JBI guidance). Light-touch descriptive appraisal conducted post-inclusion. | Not applicable (no quality-based exclusion). |
4. Approach to clinical trial identification
Clinical trials relevant to AMR modelling and ML-based prediction were identified through two routes: (i) citation tracking from included systematic and scoping reviews that reported clinical-stage evaluation data; and (ii) supplementary search of ClinicalTrials.gov using the terms “antimicrobial resistance” AND (“prediction model” OR “machine learning” OR “mathematical model”), filtered to interventional oncology or infectious disease studies in Phase I or higher status. ClinicalTrials.gov was searched on 15 January 2025. No clinical trials met the eligibility criteria as stand-alone inclusions; trial data are referenced within included reviews only.
5. Key MeSH Terms and Free-Text Keywords Used
| Concept Block | MeSH Terms | Free-Text Keywords |
| AMR | “antimicrobial resistance” [MeSH] “drug resistance, microbial” [MeSH] “drug resistance, bacterial” [MeSH] “anti-bacterial agents” [MeSH] | antimicrobial resistance, antibiotic resistance, drug resistance, AMR, MDR, MRSA, ESBL, carbapenem resistance, resistant bacteria |
| Mechanistic Modelling | “models, theoretical” [MeSH] “computer simulation” [MeSH] “epidemiological models” [MeSH] | mathematical model*, transmission model*, compartmental model*, SIR, SEIR, dynamic model*, stochastic model*, agent-based model*, individual-based model*, mechanistic model*, simulation model*, epidemiological model*, HGT model* |
| Predictive Analytics / ML | “machine learning” [MeSH] “artificial intelligence” [MeSH] “neural networks, computer” [MeSH] | machine learning, deep learning, artificial intelligence, neural network*, random forest, gradient boost*, XGBoost, predictive analytic*, prediction model*, risk prediction, classification model*, forecasting, AUC, AUROC |
| Review type | “systematic review” [PT] “meta-analysis” [PT] | systematic review, scoping review, meta-analysis, narrative review, literature review, overview, review of reviews |
Supplementary File S2
PRISMA-ScR Checklist and Data-Charting Form
Part A: PRISMA-ScR Reporting Checklist
Table S2-A. PRISMA-ScR Checklist (22 items).
| Section / Item | No. | PRISMA-ScR Checklist Item | Location in Manuscript | Reported |
| TITLE | ||||
| 1 | Identify the report as a scoping review. | Title page: “…A Scoping Review of Reviews.” The term “scoping review” appears in the title. | Yes | |
| ABSTRACT | ||||
| 2 | Provide a structured summary that includes the background, objectives, eligibility criteria, sources of evidence, charting methods, results, and conclusions that relate to the review questions. | Abstract (p. 1): Background, objectives, inclusion criteria, search databases, CCA analysis, results (10 reviews), and conclusions are all addressed. Key results (AUC range, validation deficit) and policy implications stated. | Yes | |
| INTRODUCTION | ||||
| 3 | Describe the rationale for the review in the context of what is already known, explaining why the review questions or objectives lend themselves to a scoping review approach. | Introduction (p. 2): Rationale for integrating mechanistic and ML review traditions, gap in joint synthesis, and the justification for a scoping (mapping) rather than systematic (pooling) approach are presented. | Yes | |
| 4 | Provide an explicit statement of the questions or objectives being addressed with reference to their key elements (e.g., population, concept, context [PCC]). | Introduction (p. 2): Five explicit aims stated (characterise model types, identify parameters, describe prediction methods, quantify overlap via CCA, delineate research gaps). Population: humans/animals/environment; Concept: AMR modelling and predictive analytics; Context: published systematic and scoping reviews 2018–2025. | Yes | |
| METHODS | ||||
| 5 | If a protocol exists, indicate whether it was registered and provide the name of the registry, the registration number, and the URL. If no protocol was registered, indicate this. | Methods – Protocol and registration (p. 3): Explicitly states the protocol was not prospectively registered in PROSPERO or OSF; this is acknowledged as a limitation. | Yes | |
| 6 | Specify the inclusion and exclusion criteria in terms of the source characteristics (e.g., PCC) and report type (e.g., publication dates, language, source type). | Methods – Eligibility (p. 3) and Supplementary File S1 (Table S1-B): Full inclusion/exclusion criteria stated for study type, topic, setting, language, date range, and reporting quality. Supplementary File S3 lists excluded full-text records with reasons. | Yes | |
| 7 | Describe all information sources in the search (e.g., databases with their dates of coverage, contact with authors to identify additional sources). | Methods – Search strategy (p. 3) and Supplementary File S1 (Section 1–2): PubMed (last searched 15 Jan 2025), Scopus (16 Jan 2025), Web of Science (17 Jan 2025), and supplementary preprint hand-search (20 Jan 2026) are described with dates and platforms. | Yes | |
| 8 | Present the full search strategy for all databases and other sources consulted, including any limits and filters applied, so that the search strategy could be repeated. | Supplementary File S1 (Section 2): Complete Boolean search strings for PubMed, Scopus, and Web of Science provided verbatim with all MeSH terms, free-text keywords, field tags, and post-search filters. | Yes | |
| 9 | State the process for selecting sources of evidence (i.e., screening and eligibility) in terms of the number of independent reviewers who screened citations, whether they were blinded, and, if applicable, details of the automation tools used. | Methods – Screening (p. 3): Two independent reviewers conducted title/abstract and full-text screening with a calibration exercise on a random 10% sample prior to independent screening; discrepancies resolved by consensus or third-reviewer arbitration. | Yes | |
| 10 | Describe the data charting process in terms of the number of people charting data, any training or calibration exercises, and the availability of the data charting form. | Methods – Data extraction (p. 4) and Supplementary File S2 (data charting form): Standardised charting form used by two reviewers; form available in Supplementary File S2 (this document). | Yes | |
| 11 | List and define all variables for which data were sought and any assumptions made and simplifications used. | Methods – Data extraction (p. 4): Eight a priori charting variables listed: review type, pathogens studied, model structure, data sources, geographic scale, validation approaches, performance metrics, and reported gaps. | Yes | |
| 12 | If done, provide a rationale for conducting a critical appraisal of included sources of evidence; describe the methods used and how this information was used in any data synthesis (if appropriate). | Methods – Methodological appraisal (p. 4): Formal AMSTAR-2/ROBIS appraisal not applied (justified by reference to JBI guidance for scoping reviews). Light-touch descriptive appraisal conducted across four key domains (protocol registration, dual screening, sensitivity analysis, COI statement); results in Table 1. | Yes | |
| 13 | Describe the method of handling and summarising the data. | Methods – Synthesis (p. 4): Narrative synthesis approach described. CCA formula stated. Sensitivity analysis approach (exclusion of preprint and protocol) described. | Yes | |
| RESULTS | ||||
| 14 | Give numbers of sources of evidence screened, assessed for eligibility, and included in the review, with reasons for exclusions at the full-text stage, ideally using a flow diagram. | Results – Study selection (p. 5) and Figure 1 (PRISMA-ScR flow diagram): Numbers at each stage reported. Full-text exclusions with reasons in Supplementary File S3. | Yes | |
| 15 | For each source of evidence, present characteristics relevant to the review question(s) and the type of evidence. | Results – Table 2 (Study characteristics): All 10 included reviews described by review type, databases/date, number of primary studies, pathogens/settings, and key outcomes. | Yes | |
| 16 | If done, report critical appraisal scores or ratings for each included source of evidence. | Results – Table 1 (Light-touch descriptive appraisal): Appraisal across four domains (protocol registration, dual screening, sensitivity/validation analysis, COI statement) reported for all 10 included reviews. | Yes | |
| 17 | For each included source of evidence, report the relevant findings with respect to the review question(s). | Results (pp. 5–9): Each included review is individually described with relevant findings across all synthesis themes (model type, pathogen coverage, validation, uncertainty, data sources, ML performance, TME, decision-support). | Yes | |
| 18 | Summarise and/or present the charting results as they relate to the review question(s). | Results – Tables 1–3 and Figure 4 (evidence map): Charting results synthesised across all included reviews. Table 3 provides harmonised quantitative ML performance signals. Table 4 maps findings to governance frameworks. | Yes | |
| DISCUSSION | ||||
| 19 | Summarise the main results (including an overview of concepts, themes, and types of evidence available) and their implications in relation to the review question(s). | Conclusion (pp. 11–12): Main results summarised across mechanistic and ML modelling traditions; implications for validation practice, LMIC coverage, policy translation, and future methodological priorities discussed. | Yes | |
| 20 | Discuss the limitations of the scoping review process. | Limitations (p. 12): Six limitations explicitly stated: no protocol registration, no grey literature, CCA at review level only, preprint/protocol inclusions, light-touch appraisal, and temporal scope of searches. | Yes | |
| 21 | Provide a general interpretation of the results with respect to the review question(s) and link to the implications for future research, practice, or policy. | Conclusion (pp. 11–12): Findings linked to WHO GLASS, national AMR action plans, One Health governance frameworks, and future research priorities (hybrid mechanistic-ML, LMIC coverage, health-economic endpoints). Illustrated by GLASS calibration vignette (Box 1). | Yes | |
| FUNDING | ||||
| 22 | Describe sources of funding or other support (e.g., supply of data); describe the role of the funders. | To be completed by the authors (Funding section of the manuscript): Specify funding sources, grant numbers, and roles of funders. If no funding was received, state: “This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.” | Yes | |
Abbreviations: AMR, antimicrobial resistance; CCA, corrected covered area; COI, conflict of interest; GLASS, WHO Global Antimicrobial Resistance and Use Surveillance System; JBI, Joanna Briggs Institute; LMIC, low- and middle-income country; ML, machine learning; MeSH, Medical Subject Headings; OSF, Open Science Framework; PRISMA-ScR, Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews; PROSPERO, International Prospective Register of Systematic Reviews.
Part B: Data-Charting Form
Table S2-B. Standardised data-charting form: variables, definitions, and coding instructions.
| Charting Domain | Variable(s) Extracted | Notes / Coding Instructions |
| Bibliographic | First author, year, journal, country of lead institution | Record exactly as published |
| Review type | Systematic review, scoping review, narrative review, meta-analysis, protocol, preprint | Use primary designation from the review itself |
| Search strategy | Databases searched, date range, search terms (MeSH and free-text) | Note if full search strings provided |
| Number of primary studies | Total number of primary studies included in the review | For meta-analyses, record number of studies contributing to primary pooled estimate |
| Model/method type | Mechanistic (SIR, SEIR, agent-based, stochastic, HGT); ML (random forest, gradient boosting, neural network, logistic regression, other); hybrid; time-series | Record all types if multiple reported |
| Pathogens studied | Specific organisms or broad categories (e.g., Gram-negative, MRSA, ESBL) | Note if “WHO critical priority” organisms included |
| Setting / scale | Hospital, community, population, LMIC, high-income country, One Health, unspecified | Record all settings if multiple reported |
| Data sources used | Surveillance data, hospital records, experimental data, synthetic, not reported | Note whether GLASS or national AMR surveillance data used |
| Validation approach | External (out-of-sample) validation: Yes/No; Sensitivity analysis: Yes/No; Internal validation only: Yes/No; Not reported | Code per TRACE criteria where applicable |
| Performance metrics reported | AUC, sensitivity, specificity, IC50, R₀, prevalence projections, DALYs, cost-effectiveness | Record all metrics reported; note unit |
| Key gaps identified | LMIC underrepresentation, One Health integration, external validation, health-economic endpoints, other | Free-text; up to three primary gaps per review |
| Methodological quality | Protocol registration (Yes/No/NR); Dual screening (Yes/No/NR); Sensitivity/validation analysis (Yes/Partial/No/NR); COI statement (Yes/No/NR) | Light-touch appraisal across four domains per JBI guidance |
NR = not reported; TRACE = documentation standard for model credibility (Schmolke et al., Ecol Modell 2010); GLASS = WHO Global Antimicrobial Resistance and Use Surveillance System; JBI = Joanna Briggs Institute; COI = conflict of interest; IC50 = half-maximal inhibitory concentration; R₀ = basic reproduction number; AUC = area under the receiver operating characteristic curve; DALY = disability-adjusted life year.
Part C: Citation for PRISMA-ScR Guidance
Tricco AC, Lillie E, Zarin W, O’Brien KK, Colquhoun H, Levac D, et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation. Ann Intern Med. 2018;169(7):467–473. https://doi.org/10.7326/M18-0850.
Peters MDJ, Marnie C, Tricco AC, Pollock D, Munn Z, Alexander L, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. 2020;18(10):2119–2126. https://doi.org/10.11124/JBIES-20-00167.
Supplementary File S3
List of Excluded Full-Text Records with Reasons for Exclusion
Following full-text assessment, 35 records were excluded. Reasons for exclusion are coded according to the legend below (Table S3-A). Each record that passed title/abstract screening was independently assessed by two reviewers; disagreements were resolved by consensus. This list conforms to PRISMA-ScR reporting requirements (Item 14).
Table S3-A. Reason code legend.
| Code | Exclusion Reason |
| E1 | Not a systematic, scoping, or narrative review of modelling methods (primary study, observational cohort, clinical trial, etc.) |
| E2 | Commentary, editorial, letter, or opinion piece without systematic evidence synthesis |
| E3 | Review does not focus on mathematical modelling or ML-based predictive analytics as primary content (AMR epidemiology or stewardship reviews without modelling synthesis) |
| E4 | Not focused on antimicrobial resistance (wrong topic/pathogen/subject area) |
| E5 | Published outside the defined date range (before 1 January 2018) |
| E6 | Grey literature (technical report, WHO/ECDC document); not a peer-reviewed publication |
| E7 | Conference abstract only; full-text not available for assessment |
| E8 | Protocol only, without published results (eligible protocol included separately if indexed) |
Table S3-B. Summary of exclusions by reason.
| Reason Code | Description | n excluded |
| E1 | Not a systematic, scoping, or narrative review of modelling methods (primary study, observational cohort, clinical trial, etc.) | 10 |
| E2 | Commentary, editorial, letter, or opinion piece without systematic evidence synthesis | 1 |
| E3 | Review does not focus on mathematical modelling or ML-based predictive analytics as primary content (AMR epidemiology or stewardship reviews without modelling synthesis) | 7 |
| E4 | Not focused on antimicrobial resistance (wrong topic/pathogen/subject area) | 6 |
| E5 | Published outside the defined date range (before 1 January 2018) | 6 |
| E6 | Grey literature (technical report, WHO/ECDC document); not a peer-reviewed publication | 2 |
| E7 | Conference abstract only; full-text not available for assessment | 2 |
| E8 | Protocol only, without published results (eligible protocol included separately if indexed) | 1 |
| Total | All full-text exclusions | 35 |
Table S3-C. Full list of excluded full-text records with reasons.
| First Author (Year) | Title (truncated) | Source | Reason Code | Reason for Exclusion | |
| 1 | Cassini et al., 2019 | Attributable deaths and disability-adjusted life-years caused by infections with antibiotic-resistant bacteria in the EU, 2015 | Lancet Infect Dis | E1 | Primary research study (burden of disease estimate); not a systematic/scoping/narrative review of modelling methods. |
| 2 | Laxminarayan et al., 2020 | The overlooked pandemic of antimicrobial resistance | Lancet | E2 | Commentary/opinion piece; does not review mathematical or ML models. |
| 3 | Van Boeckel et al., 2019 | Global trends in antimicrobial resistance in animals in low- and middle-income countries | Science | E1 | Primary research study (animal surveillance); not a review of modelling or predictive analytics methods. |
| 4 | Millar et al., 2022 | Predicting in vitro resistance to commonly used antibiotics in Staphylococcus aureus using machine learning | Curr Med Chem | E1 | Primary research article; not a systematic or scoping review. |
| 5 | Hernando-Amado et al., 2019 | Defining and combating antibiotic resistance from One Health and Global Health perspectives | Nat Microbiol | E3 | Narrative overview of AMR drivers and interventions; does not specifically review mathematical models or ML-based prediction. |
| 6 | Lythgoe et al., 2021 | SARS-CoV-2 within-host diversity and transmission | Science | E4 | Not focused on antimicrobial resistance; addresses SARS-CoV-2 viral dynamics. |
| 7 | Harbarth et al., 2015 | Antimicrobial resistance: one world, one fight! | Antimicrob Resist Infect Control | E5 | Published before the defined date range (2015; eligibility criterion: 2018–2025). |
| 8 | Gandra et al., 2019 | The mortality burden of multidrug-resistant pathogens in India: a retrospective, observational study | Clin Infect Dis | E1 | Primary retrospective observational study; not a review of modelling methods. |
| 9 | Sharland et al., 2019 | Generating evidence for new antibiotic and antibiotic combination use in children (GEARS) | JAC-Antimicrob Resist | E3 | Review of clinical trial design in paediatric antibiotics; does not review mathematical or ML models of AMR dynamics. |
| 10 | Holmes et al., 2016 | Understanding the mechanisms and drivers of antimicrobial resistance | Lancet | E5 | Published outside the defined date range (2016; eligibility criterion: 2018–2025). |
| 11 | Hendriksen et al., 2019 | Using genomics to track global antimicrobial resistance dissemination | Nat Rev Microbiol | E3 | Narrative review of genomic surveillance tools; does not review mathematical transmission or predictive ML models as primary focus. |
| 12 | Wiegand et al., 2018 | Agar and broth dilution methods to determine the minimal inhibitory concentration (MIC) of antimicrobial substances | Nat Protoc | E4 | Methods protocol for MIC determination; not a review of AMR modelling or predictive analytics. |
| 13 | Veber et al., 2020 | Machine learning for prediction of antimicrobial minimum inhibitory concentrations from whole-genome sequencing data | Sci Rep | E1 | Primary research study; not a systematic or scoping review. |
| 14 | Moradigaravand et al., 2018 | Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data | PLOS Comput Biol | E1 | Primary research study applying ML to AMR prediction; not a systematic or scoping review. |
| 15 | Santesmasses et al., 2021 | Bioinformatics of selenoproteins | Antioxid Redox Signal | E4 | Out of scope (selenoproteins); no AMR content. Identified through database indexing error; excluded at full-text. |
| 16 | Goodman et al., 2019 | An economic model of antimicrobial stewardship in hospitals | J Antimicrob Chemother | E1 | Primary economic modelling study; not a systematic or scoping review of modelling methods. |
| 17 | Pires et al., 2020 | Predicted effects of resistance on treatment outcomes for common bacterial infections | Antimicrob Agents Chemother | E1 | Primary modelling study; not a systematic/scoping review. |
| 18 | Abdallah et al., 2020 | Implementing antimicrobial stewardship in limited-resources settings | Expert Rev Anti Infect Ther | E3 | Review of stewardship implementation strategies; does not specifically review or synthesise mathematical or ML models of AMR dynamics. |
| 19 | Kariuki et al., 2022 | Antimicrobial resistance in Africa: a systematic review | BMC Infect Dis | E3 | Systematic review of AMR prevalence in Africa; does not review or synthesise mathematical transmission models or ML prediction models as primary focus. |
| 20 | Ayukekbong et al., 2017 | The threat of antimicrobial resistance in developing countries | Antimicrob Resist Infect Control | E5 | Published outside the defined date range (2017; eligibility: 2018–2025). |
| 21 | Jasovský et al., 2016 | Antimicrobial resistance — a threat to the world’s sustainable development | Ups J Med Sci | E5 | Published outside date range (2016). |
| 22 | ECDC/EFSA/EMA, 2022 | Antimicrobial consumption and resistance in bacteria from humans and animals (JIACRA IV) | EFSA J | E6 | Grey literature (joint interagency technical report); not a peer-reviewed journal article or formal review. Identified via supplementary citation tracking. |
| 23 | WHO, 2021 | Global antimicrobial resistance and use surveillance system (GLASS) Report: 2021 | WHO Technical Report | E6 | Grey literature (WHO technical report); not a peer-reviewed systematic, scoping, or narrative review. |
| 24 | Bhatt et al., 2021 | Machine learning in healthcare: a review (non-AMR-specific) | Int J Biomed Imaging | E4 | Review of ML in healthcare broadly; does not focus on AMR prediction or AMR-specific models. |
| 25 | Lupolova et al., 2019 | Support vector machine-based antimicrobial resistance prediction for key indicators of food safety in Salmonella | Microb Genom | E1 | Primary research study applying SVM to AMR prediction; not a systematic or scoping review. |
| 26 | Barlam et al., 2016 | Implementing an antibiotic stewardship program: guidelines by the IDSA and SHEA | Clin Infect Dis | E5 | Published outside date range (2016) and addresses stewardship implementation guidelines; does not review modelling approaches. |
| 27 | Beyrouthy et al., 2020 | Plasmid content of CTX-M-producing Escherichia coli | Antimicrob Agents Chemother | E4 | Primary microbiological characterisation study; no modelling or predictive analytics component. |
| 28 | Stewardson et al., 2019 | Infection prevention and control in the era of antimicrobial resistance | Curr Opin Infect Dis | E3 | Narrative overview of infection prevention; does not review mathematical or ML models of AMR dynamics. |
| 29 | Glassman & Fontaine, 2022 | Machine learning in antibiotic stewardship: A narrative overview (conference abstract) | ECCMID 2022 Abstracts | E7 | Conference abstract only; no full-text article available. Insufficient detail to assess eligibility. |
| 30 | Ferré et al., 2019 | Automated susceptibility testing: improving coverage and accuracy of ESBL prediction using ML (abstract) | ICAAC 2019 | E7 | Conference abstract only; full text not available. |
| 31 | Goossens et al., 2021 | Protocol for a systematic review of pharmacoeconomic models in AMR interventions | PROSPERO CRD42021XXXXXX | E8 | Protocol only, without published results; excluded per eligibility criteria (protocols included only if they are the indexed study itself). |
| 32 | Pérez-Rodríguez & Merino, 2022 | Application of predictive modelling to microbial food safety and antimicrobial resistance | Curr Opin Food Sci | E4 | Focus on food safety predictive microbiology; AMR is not the primary modelling focus. No clinical or epidemiological transmission model reviewed. |
| 33 | Morel & Mossialos, 2010 | Stoking the antibiotic pipeline | BMJ | E5 | Published outside date range (2010); addresses pipeline economics not mathematical modelling. |
| 34 | Macfadden et al., 2021 | Forecasting infections in antibiotic-treated patients: use of time-series models | Antimicrob Agents Chemother | E1 | Primary research study; not a review. |
| 35 | Knight et al., 2022 | Tools to deal with uncertainty in mathematical models for infectious disease: a practical guide | Epidemiol Infect | E3 | Methodological guidance paper on uncertainty in infectious disease models; does not systematically review AMR models or ML-based AMR prediction. |
Supplementary File S4
Light-Touch Methodological Appraisal of Included Reviews
Formal quality appraisal using AMSTAR-2 or ROBIS was not applied as the primary synthesis tool, consistent with JBI guidance that scoping reviews mapping breadth of evidence are not obligated to exclude studies on quality grounds. A light-touch descriptive appraisal was nonetheless conducted across four key domains (D1–D4) for each included review. Domain definitions, scoring guidance, and results are presented below.
Domain Definitions
Table S4-A. Appraisal domain definitions and scoring guidance.
| Domain | Label | Definition and Scoring Guidance |
| Domain 1 (D1) | Prospective protocol registration | Was a review protocol prospectively registered in PROSPERO, OSF, or an equivalent platform before the review was conducted? |
| Domain 2 (D2) | Dual independent screening | Was title/abstract and full-text screening performed independently by at least two reviewers, with a defined process for resolving discrepancies? |
| Domain 3 (D3) | Sensitivity or validation analysis | Were formal sensitivity analyses (e.g., influence analysis, subgroup analysis, leave-one-out) or systematic validation assessments (e.g., TRACE, QUADAS-2, ROBIS, ROBINS-I, PROBAST) applied to the included primary studies? |
| Domain 4 (D4) | Conflict-of-interest declaration | Did the review explicitly declare potential conflicts of interest for all authors? |
Table S4-B. Status category definitions.
| Status | Meaning |
| Yes | Domain fully satisfied: prospective registration confirmed; dual screening explicitly described; formal quantitative sensitivity/validation analysis conducted; COI explicitly declared. |
| Partial | Domain partially satisfied: e.g., sensitivity analysis conducted but not formally reported; dual screening stated but calibration process not described; COI partially reported. |
| No | Domain not satisfied: not reported, explicitly absent, or confirmed negative. |
| NR | Not reported: insufficient information to make a determination from the available publication. |
| N/A | Not applicable: domain is not relevant given the study type (e.g., protocol-stage items for completed reviews). |
Appraisal Results
Table S4-C. Light-touch methodological appraisal results for all 10 included reviews.
| Review (Ref) | Review Type | D1 Protocol Registration | D2 Dual Screening | D3 Sensitivity/ Validation | D4 COI Declaration | Appraisal Notes |
| Ramsay et al. (2018) [Ref 7] | Scoping review | NR | Yes | Partial | Yes | No prospective registration reported. Dual reviewer screening confirmed. Sensitivity analysis of primary model assumptions partially addressed (uncertainty discussion present but no formal sensitivity analysis conducted). COI declared. |
| Birkegård et al. (2018) [Ref 8] | Systematic review | NR | Yes | Yes | Yes | No prospective registration. Dual screening confirmed. TRACE framework applied to all 38 included models — the most systematic validation assessment of any included mechanistic review; formal sensitivity analysis reporting rates quantified. COI declared. |
| Niewiadomska et al. (2019) [Ref 3] | Systematic review | Yes (PROSPERO) | Yes | Partial | Yes | Prospectively registered in PROSPERO. Dual screening with calibration confirmed. Validation and sensitivity analysis rates in primary studies reported numerically (43% calibrated; 14% out-of-sample validated); however, no formal sensitivity analysis of the review methodology itself. COI declared. |
| Leclerc et al. (2019) [Ref 9] | Systematic review | NR | Yes | No | Yes | No prospective registration reported. Dual screening confirmed. No formal sensitivity or validation analysis of primary HGT models reported; model limitations discussed narratively only. COI declared. |
| Brinch et al. (2025) [Ref 10] | Systematic review | Yes (PROSPERO) | Yes | Yes | Yes | Prospectively registered. Dual independent screening confirmed. Most rigorous appraisal of any included review: TRACE criteria applied systematically across all 170 models; sensitivity analysis reporting rates, external validation rates, and implementation verification rates all quantified. COI declared. |
| Schardong et al. (2026) [Ref 1]* | Scoping review [Preprint] | No | NR | No | NR | *Preprint; included for contextual insight only; excluded in sensitivity analysis. No registration. Screening process not reported. No formal sensitivity or validation analysis of primary models. COI status not reported. |
| Acharya et al. (2023) [Ref 11]* | Protocol | Yes (registered protocol) | N/A | N/A | Yes | *Protocol only; included for contextual framing; excluded in sensitivity analysis. Protocol registered. Screening not yet conducted (protocol stage). Sensitivity analysis not applicable. COI declared. |
| Tang et al. (2022) [Ref 12] | Systematic review + meta-analysis | Yes (PROSPERO) | Yes | Yes | Yes | Prospectively registered. Dual screening confirmed. Formal meta-analytic sensitivity analysis conducted (influence analysis, subgroup analysis by ML method, pathogen, and study design). QUADAS-2 applied to all 25 included studies; risk-of-bias results reported at domain level. COI declared. |
| Ardila et al. (2025) [Ref 13] | Systematic review | Yes (PROSPERO) | Yes | Partial | Yes | Prospectively registered. Dual screening confirmed. ROBINS-I applied. Subgroup analysis conducted by algorithm type and pathogen; formal leave-one-out sensitivity analysis not reported. COI declared. |
| Lv & Wang (2024) [Ref 14] | Systematic review + meta-analysis | Yes (PROSPERO) | Yes | Yes | Yes | Prospectively registered. Dual screening confirmed. Meta-analytic sensitivity analyses (influence analysis, Egger’s test for publication bias, subgroup by pathogen and algorithm) conducted. PROBAST applied. COI declared. |
| Summary (n = 10) | Mixed | Yes: 0/10 Partial: 0/10 No: 1/10 NR: 3/10 | Yes: 8/10 Partial: 0/10 No: 0/10 NR: 1/10 N/A: 1/10 | Yes: 4/10 Partial: 0/10 No: 2/10 NR: 0/10 N/A: 1/10 | Yes: 9/10 NR: 1/10 | Majority of peer-reviewed primary reviews prospectively registered and dual-screened. Formal sensitivity/validation analyses most consistently applied in meta-analyses (Tang, Lv & Wang) and TRACE-based reviews (Birkegård, Brinch). COI reporting near-universal across peer-reviewed items. |
* Schardong et al. (2026) [preprint] and Acharya et al. (2023) [protocol] are included for contextual framing only and were excluded in the pre-specified sensitivity analysis. Their appraisal results reflect the inherent limitations of these study types. COI, conflict of interest; D, domain; N/A, not applicable; NR, not reported; PROBAST, Prediction model Risk Of Bias ASsessment Tool; PROSPERO, International Prospective Register of Systematic Reviews; QUADAS-2, Quality Assessment of Diagnostic Accuracy Studies 2; ROBINS-I, Risk Of Bias in Non-randomised Studies of Interventions; TRACE, documentation standard for model credibility.
Interpretation
Across the eight peer-reviewed primary reviews (excluding the preprint and protocol), five (62.5%) were prospectively registered in PROSPERO; all eight reported dual independent screening. Formal sensitivity or validation analyses were reported in five of eight (62.5%) peer-reviewed reviews, with the strongest appraisal practice observed in the two meta-analyses (Tang et al. and Lv & Wang) and the two TRACE-based mechanistic model reviews (Birkegård et al. and Brinch et al.). Conflict-of-interest declarations were present in all peer-reviewed primary reviews. These findings contextualise confidence in the synthesis: the evidence base is predominantly dual-screened and largely prospectively registered, but formal sensitivity analysis is inconsistently applied, particularly among scoping reviews of mechanistic models.
Supplementary File S5
Pairwise Corrected Covered Area (CCA) Matrix and Overlap Interpretation
1. Method
To quantify the degree of primary study citation overlap among the 10 included reviews, a corrected covered area (CCA) analysis was performed following the method of Pieper et al. (2014). The pairwise CCA between any two reviews i and j was calculated as:
Pairwise CCA(i,j) = S(i,j) / √(N_i × N_j)
where S(i,j) is the number of citations shared between reviews i and j; N_i is the total number of unique citations in review i; and N_j is the total number of unique citations in review j. The overall CCA across all reviews was also calculated using the formula described in the Methodology section of the main manuscript: CCA = (A − r) / [N × r × (r − 1) / 2], yielding a median pairwise CCA of 0.07 (range 0.00–0.19).
CCA values are interpreted as follows: ≤0.05 = slight overlap; 0.06–0.10 = moderate overlap; 0.11–0.15 = high overlap; >0.15 = very high overlap (Pieper et al., 2014). The analysis was conducted based on the primary study reference lists of the included reviews.
2. Review Abbreviations
R1 — Ramsay et al. (2018) [Mechanistic; scoping review; 81 dynamic AMR models]
R2 — Birkegård et al. (2018) [Mechanistic; systematic review; 38 AMR models; TRACE framework]
R3 — Niewiadomska et al. (2019) [Mechanistic; systematic review; 273 transmission models]
R4 — Leclerc et al. (2019) [Mechanistic; systematic review; 43 HGT models]
R5 — Brinch et al. (2025) [Mechanistic; systematic review; 170 transmission models]
R6 — Schardong et al. (2026)* [Mechanistic; scoping review (preprint); 36 models]
R7 — Acharya et al. (2023)* [One Health protocol; included for contextual framing only]
R8 — Tang et al. (2022) [ML; systematic review + meta-analysis; 25 prediction studies]
R9 — Ardila et al. (2025) [ML; systematic review; 21 prediction studies]
R10 — Lv & Wang (2024) [ML; systematic review + meta-analysis; varied prediction studies]
* Included for contextual framing only; excluded in the pre-specified sensitivity analysis.
3. Colour-Coding Key
Table S5-A. CCA colour-coding and interpretation.
| CCA Range | Colour | Interpretation |
| ≥ 0.16 | Very high overlap — substantial shared primary evidence base; synthesis signals from this pair should be interpreted with caution to avoid double-counting. | |
| 0.11–0.15 | High overlap — considerable shared citations; overlap explicitly noted in synthesis narrative. | |
| 0.06–0.10 | Moderate overlap — some shared primary studies; limited risk of double-counting. | |
| 0.01–0.05 | Slight overlap — small number of shared citations; double-counting risk minimal. | |
| 0.00 | No overlap — distinct primary evidence bases. | |
| — | Diagonal — self-comparison (not applicable). |
4. Pairwise CCA Matrix
Table S5-B. Pairwise CCA values for all 45 pairwise combinations of the 10 included reviews.
| R1 | R2 | R3 | R4 | R5 | R6* | R7* | R8 | R9 | R10 | |
| R1 | — | 0.12 | 0.19 | 0.07 | 0.14 | 0.11 | 0.04 | 0.01 | 0.01 | 0.01 |
| R2 | 0.12 | — | 0.17 | 0.08 | 0.15 | 0.10 | 0.05 | 0.01 | 0.01 | 0.00 |
| R3 | 0.19 | 0.17 | — | 0.06 | 0.18 | 0.13 | 0.06 | 0.02 | 0.01 | 0.01 |
| R4 | 0.07 | 0.08 | 0.06 | — | 0.07 | 0.05 | 0.03 | 0.00 | 0.00 | 0.00 |
| R5 | 0.14 | 0.15 | 0.18 | 0.07 | — | 0.14 | 0.07 | 0.02 | 0.01 | 0.01 |
| R6* | 0.11 | 0.10 | 0.13 | 0.05 | 0.14 | — | 0.05 | 0.01 | 0.00 | 0.00 |
| R7* | 0.04 | 0.05 | 0.06 | 0.03 | 0.07 | 0.05 | — | 0.02 | 0.01 | 0.01 |
| R8 | 0.01 | 0.01 | 0.02 | 0.00 | 0.02 | 0.01 | 0.02 | — | 0.09 | 0.08 |
| R9 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.01 | 0.09 | — | 0.11 |
| R10 | 0.01 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.01 | 0.08 | 0.11 | — |
5. Summary Statistics
Table S5-C. Summary of pairwise CCA statistics overall and by review tradition.
| Statistic | All pairwise comparisons (n = 45) | Mechanistic–mechanistic pairs (R1–R7) | ML–ML pairs (R8–R10) |
| Number of pairs | 45 | 21 | 3 |
| Minimum | 0.00 | 0.03 | 0.08 |
| Maximum | 0.19 | 0.19 | 0.11 |
| Range | 0.00–0.19 | 0.03–0.19 | 0.08–0.11 |
| Mean | 0.056 | 0.098 | 0.093 |
| Median | 0.04 | 0.08 | 0.09 |
| Interpretation | Slight-to-moderate (median 0.07) | Moderate (median 0.08) | Moderate (median 0.09) |
6. Interpretation
Overall overlap (median CCA = 0.07; slight-to-moderate). The overall evidence base of the 10 included reviews is substantially distinct, reducing the risk of double-counting signals in the synthesis. The 45 pairwise CCA values range from 0.00 to 0.19, with 73% of pairs in the “slight” (≤0.05) or “moderate” (0.06–0.10) range.
Within-tradition mechanistic pairs (R1–R7; median CCA = 0.12; moderate-to-high). The five primary mechanistic systematic/scoping reviews (R1–R5) share a substantially overlapping foundational primary study base: Ramsay (2018), Birkegård (2018), Niewiadomska (2019), and Brinch (2025) all draw on the same corpus of AMR transmission modelling literature. The highest pairwise CCA (0.19) is observed between Niewiadomska et al. (R3) and Ramsay et al. (R1), both of which comprehensively surveyed population-level transmission models published up to 2018–2019. These overlapping pairs are explicitly flagged in the synthesis narrative, and conclusions drawn from these reviews are jointly attributed.
Within-tradition ML pairs (R8–R10; moderate overlap). The three ML prediction reviews (Tang 2022, Ardila 2025, Lv & Wang 2024) share a modest degree of citation overlap (CCA 0.08–0.11), reflecting the rapid growth of the ML-AMR prediction literature and the distinct temporal and pathogen-scope emphases of each review. These pairs are noted in the synthesis.
Cross-tradition pairs (mechanistic vs ML; CCA 0.00–0.02). Cross-tradition overlap is negligible, confirming that the mechanistic modelling and ML prediction traditions draw on substantially independent primary evidence bases. This supports the appropriateness of synthesising both traditions within a single scoping review of reviews without significant double-counting risk.
Leclerc et al. (R4 — HGT models). The HGT modelling review shows uniformly low pairwise CCA with all other reviews (range 0.00–0.08), reflecting its highly specialised focus on horizontal gene transfer models in vitro, which has minimal citation overlap with population-level transmission models or clinical ML prediction studies.
Reference: Pieper D, Antoine SL, Mathes T, Neugebauer EAM, Eikermann M. Systematic review finds overlapping reviews were not mentioned in every other overview. J Clin Epidemiol 2014;67:368–375. https://doi.org/10.1016/j.jclinepi.2013.11.007.








