Hostname: page-component-6bb9c88b65-kfd97 Total loading time: 0 Render date: 2025-07-25T15:02:25.269Z Has data issue: false hasContentIssue false

Looking inside the black box – hybrid epidemiology approaches to identify causal inferences

Published online by Cambridge University Press:  15 July 2025

Ahmed Elagali*
Affiliation:
Plastics and Human Health, https://ror.org/0289t9g81Minderoo Foundation, Perth, WA, Australia School of Biological Sciences, The University of Western Australia, Perth, WA, Australia Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, Australia
Katherine Drummond
Affiliation:
Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, Australia
Adriano Winterton
Affiliation:
Division of Mental and Physical Health, https://ror.org/046nvst19 Norwegian Institute of Public Health , Oslo, Norway
Yannick Mulders
Affiliation:
Plastics and Human Health, https://ror.org/0289t9g81Minderoo Foundation, Perth, WA, Australia
Christos Symeonides
Affiliation:
Plastics and Human Health, https://ror.org/0289t9g81Minderoo Foundation, Perth, WA, Australia
Bhedita Seewoo
Affiliation:
Plastics and Human Health, https://ror.org/0289t9g81Minderoo Foundation, Perth, WA, Australia School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
Gro Andersen
Affiliation:
Division of Mental and Physical Health, https://ror.org/046nvst19 Norwegian Institute of Public Health , Oslo, Norway
Anne-Louise Ponsonby
Affiliation:
Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, Australia
Sarah Dunlop
Affiliation:
Plastics and Human Health, https://ror.org/0289t9g81Minderoo Foundation, Perth, WA, Australia School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
*
Corresponding author: Ahmed Elagali; Email: aelagali@minderoo.org
Rights & Permissions [Opens in a new window]

Abstract

Plastic chemicals are numerous and ubiquitous in modern life and pose significant risks to human health. Observational epidemiological studies have been instrumental in identifying consistent and statistically significant associations between exposure to certain chemicals and adverse health outcomes. However, these studies often fail to establish causality due to the complexity of real-world chemical mixtures, confounding factors, reverse causation, and study designs that lack measures reflecting underlying genetic and cellular mechanisms indicating causal pathways to harm. Addressing these limitations requires moving beyond traditional ‘black-box’ epidemiology, which mainly focuses on the strength of associations. We propose adopting hybrid epidemiological methodologies that incorporate genetic susceptibility and molecular mechanisms to uncover biological pathways, combined with machine learning and statistical analysis of chemical mixtures, to strengthen the causal evidence linking exposure to harm. By integrating observational multi-omics data with experimental and mechanistic models, hybrid epidemiology offers a transformative path to improve causal evidence and public health interventions. In addition, machine learning and statistical methods provide a more nuanced understanding of the health effects of exposures to plastic chemical mixtures, facilitating the identification of interactions within chemical mixtures and the influence of biological pathways. This paradigm shift is critical addressing the complex challenges of plastic exposure and protecting human health.

Information

Type
Perspective
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Impact statement

This work addresses a critical gap in environmental health research: the inability of traditional observational epidemiology to fully elucidate the causal and interactive effects of plastic chemical exposures. By proposing a hybrid epidemiological framework that incorporates underpinning genetic susceptibility and molecular mechanisms, advanced statistical methods, machine learning and experimental tools, this perspective charts a path towards a more comprehensive understanding of exposure-disease relationships. This approach not only reveals the biological plausibility of the findings but also provides robust evidence for policymakers to regulate hazardous chemical mixtures effectively.

Introduction

For decades, observational epidemiological studies have formed the basis for our understanding of environmental health risks, particularly those posed by plastic chemicals (Grandjean and Landrigan, Reference Grandjean and Landrigan2006; Woodruff et al., Reference Woodruff, Zota and Schwartz2011; Landrigan et al., Reference Landrigan, Raps, Cropper, Bald, Brunner, Canonizado, Charles, Chiles, Donohue and Enck2023). These studies have provided critical information on the associations between exposure to numerous ubiquitous chemicals, in particular during early life, and various health conditions, including the aetiologies of adverse perinatal and childhood outcomes, endocrine disruption, adverse neurodevelopmental outcomes, obesity, diabetes, and cancer (Symeonides et al., Reference Symeonides, Aromataris, Mulders, Dizon, Stern, Barker, Whitehorn, Pollock, Marin and Dunlop2024, and references therein). Despite their significant contributions, observational epidemiological studies often face limitations in establishing causal relationships. Such studies typically focus on the health impacts of individual chemicals, an approach that may not fully capture the complexities of endocrine-disrupting chemicals (EDCs) such as phthalates and bisphenols. These chemicals can exert effects through shared and/or overlapping mechanisms on endocrine systems, such as binding to specific hormone receptors in target tissues or interfering with biotransformation and excretion of endogenous hormones, enabling multiple chemicals to act additively or synergistically at lower concentrations to produce outcomes that would require higher concentrations if acting alone (Kortenkamp, Reference Kortenkamp2008; Al-Gubory, Reference Al-Gubory2014; Darbre, Reference Darbre2022). The inability to account for the full exposome (Wild, Reference Wild2005), encompassing all chemical and non-chemical factors that influence human well-being throughout life, e.g., lifestyle, socio-economic status, diet, adverse life-events and psychosocial factors, can bias findings further and obscure direct causal links between chemical exposure and health outcomes later in life. In addition, the long latency periods associated with conditions such as cancer or reproductive disorders make it difficult to attribute these health impacts to specific chemical exposures many years earlier in life (Amolegbe et al., Reference Amolegbe, Carlin, Henry, Heacock, Trottier and Suk2022). Exposures to even very low concentrations during the critical developmental window in the foetal and childhood life stages could be related to a programming effect with an increased risk of disease later in life (Lanphear, Reference Lanphear2015). Furthermore, as EDCs often show non-linear exposure–response relationships, assuming only linear relationships will obscure the true impact of these chemicals. These challenges highlight the need for a hybrid epidemiological approach that combines advances in mixture analysis, machine learning and molecular methods to improve causal inference. Integrating experimental models (e.g., in vitro, in vivo, in silico and mechanistic toxicology) with epidemiological data enables testing of biological plausibility, dose–response and temporality. This alignment helps move from statistical associations to identifying mechanistic pathways linking EDC exposure to health outcomes.

The challenge faced by observational studies to establish causality closely resembles those outlined in the Bradford Hill criteria, such as demonstrating strength, consistency, biological plausibility and temporality in observed associations (Woodside III and Davis, Reference Woodside and Davis2012; Fedak et al., Reference Fedak, Bernal, Capshaw and Gross2015). Traditional epidemiology often produces small effect sizes (Symeonides et al., Reference Symeonides, Aromataris, Mulders, Dizon, Stern, Barker, Whitehorn, Pollock, Marin and Dunlop2024), which poses inconclusive and weak confidence in causal links to human harm based on Bradford Hill’s criterion of strength of association (Woodside III and Davis, Reference Woodside and Davis2012; Fedak et al., Reference Fedak, Bernal, Capshaw and Gross2015). While the consistency across observational epidemiological studies, another Bradford Hill criterion, provides some support for causality, the strength of the association is the primary factor in considering ‘cause’ (Woodside III and Davis, Reference Woodside and Davis2012; Fedak et al., Reference Fedak, Bernal, Capshaw and Gross2015). In contrast, adopting a hybrid epidemiology approach that explores the underlying biological mechanisms is likely to reveal larger effect sizes expressed as several-fold changes (Caporale et al., Reference Caporale, Leemans, Birgersson, Germain, Cheroni, Borbély, Engdahl, Lindh, Bressan and Cavallo2022; Symeonides et al., Reference Symeonides, Vacy, Thomson, Tanner, Chua, Dixit, Mansell, O’Hely, Novakovic and Herbstman2024; Elagali et al., Reference Elagali, Eisner, Tanner, Drummond, Symeonides, Love, Tang, Mansell, Burgner and Collier2025), which increases the evidence for causation. Moving beyond traditional observational studies, this perspective advocates for innovative hybrid epidemiological strategies that will equip researchers with powerful tools to uncover causal relationships and assess the health impacts of complex plastic chemical mixtures. In this perspective, we offer key recommendations to support the shift towards strengthening causal evidence, with substantial potential to drive progress in precision medicine and improve population health outcomes.

Beyond one-chemical-at-a-time: mixture analysis and machine learning methods

Traditional epidemiological approaches are sometimes limited by their focus on assessing health risks associated with single chemical exposure, which fail to capture the complexity of real-world scenarios and treat environmental exposures as isolated events. This reductionist approach has resulted in a limited understanding of the health impacts associated with chemical mixtures and presents a significant regulatory challenge. In practice, humans are exposed to numerous environmental chemicals throughout their lifespan, often concurrently, as shown by national human biomonitoring programs (Woodruff et al., Reference Woodruff, Zota and Schwartz2011; Stanfield et al., Reference Stanfield, Setzer, Hull, Sayre, Isaacs and Wambaugh2024). These exposures are frequently correlated, may show non-linear dose–response relationships and can lead to cumulative or interactive health effects. Traditional regression models struggle with such complexity due to multicollinearity, limited power to detect interactions and challenges in selecting meaningful variables from high-dimensional data. To address this, advanced statistical techniques have been developed to more accurately model chemical mixtures. These approaches differ in assumptions, interpretability, ability to account for correlation and capacity to detect non-linear or interactive effects. Advancing methods to quantify the disease risks posed by chemical mixtures could help identify modifiable exposures, enabling targeted public health interventions and prevention strategies.

In adapting this approach, epidemiologists must consider that the current research on plastic chemicals is constrained by a focus on substances already known to be hazardous or measurable using existing methods, a limitation often referred to as the ‘street-light effect’ (J. M. Braun et al., Reference Braun, Gennings, Hauser and Webster2016). With the increase in the use of non-targeted approaches to quantify chemical exposures, mixture studies can rely on these methods to prioritize which chemicals are included in mixture-based statistical approaches i.e., ‘in silico’ prioritization. To support this advancement, it is crucial to develop efficient high-throughput screening assays that target key endpoints and account for the toxicity of both individual chemicals and their mixtures. G. Braun et al. (Reference Braun, Herberth, Krauss, König, Wojtysiak, Zenclussen and Escher2024) demonstrated this approach by analysing the neurotoxic effects of hundreds of chemical mixtures extracted from the blood of pregnant women. Using unbiased extraction methods, automated target screening and high-throughput in vitro neurotoxicity assays, Braun et al. uncovered the impact of complex organic mixtures on neurite development and demonstrated how in silico prioritization can effectively narrow down chemicals of concern for neurodevelopmental endpoints.

As the study of chemical mixtures remains an evolving field of research, there is currently no established consensus on the most appropriate statistical or machine learning methods to investigate the health effects of chemical mixtures in epidemiological studies (Bobb et al., Reference Bobb, Valeri, Henn, Christiani, Wright, Mazumdar, Godleski and Coull2015; Lazarevic et al., Reference Lazarevic, Barnett, Sly and Knibbs2019; Maitre et al., Reference Maitre, Guimbaud, Warembourg, Güil-Oumrait, Petrone, Chadeau-Hyam, Vrijheid, Basagaña and Gonzalez2022; Miller and Consortium, Reference Miller and Consortium2025). In recent years, numerous methods have been introduced to address the complexities associated with multiple exposures and their interactions (Sun et al., Reference Sun, Tao, Li, Ferguson, Meeker, Park, Batterman and Mukherjee2013; Forns et al., Reference Forns, Mandal, Iszatt, Polder, Thomsen, Lyche, Stigum, Vermeulen and Eggesbø2016; Stafoggia et al., Reference Stafoggia, Breitner, Hampel and Basagaña2017). These include techniques for variable selection, shrinkage and grouping of correlated variables, such as the least absolute shrinkage and selection operator (LASSO; Sun et al., Reference Sun, Tao, Li, Ferguson, Meeker, Park, Batterman and Mukherjee2013), elastic net (Lenters et al., Reference Lenters, Portengen, Rignell-Hydbom, Jönsson, Lindh, Piersma, Toft, Bonde, Heederik and Rylander2016) and adaptive elastic net (Zou and Zhang, Reference Zou and Zhang2009). Other approaches include dimension reduction methods like principal component analysis (Yang et al., Reference Yang, Li, Li, Wang, Cao, Wu and Xu2013) and partial least squares (Sun et al., Reference Sun, Tao, Li, Ferguson, Meeker, Park, Batterman and Mukherjee2013), as well as Bayesian frameworks such as Bayesian model averaging (Bobb et al., Reference Bobb, Dominici and Peng2011). Notably, two methods specifically designed for mixture analyses in environmental epidemiology warrant attention: Weighted Quantile Sum Regression (WQSR, Carrico et al., Reference Carrico, Gennings, Wheeler and Factor-Litvak2015) and Bayesian Kernel Machine Regression (BKMR, Bobb et al., Reference Bobb, Henn, Valeri and Coull2018). WQSR and BKMR provide robust measures of the health effects of mixtures, with BKMR offering additional advantages, such as accounting for non-linearity and interaction in multivariate exposure–response relationships. Despite their strengths, all of these methods exhibit some limitations, including instability in model selection (shrinkage approaches), difficulties in interpreting latent variables (dimension reduction) and the computational intensity of Bayesian models. Furthermore, their application to large and heterogeneous exposome datasets – comprising diverse variable types such as -omics data and mixed categorical and continuous variables – remains limited.

We do not advocate for a rigid, one-size-fits-all approach; however, for complex research questions exploring the health effects of multiple exposures, a multi-staged statistical strategy is recommended. For example, an initial step could involve utilizing methods like LASSO as a pre-processing tool (Sun et al., Reference Sun, Tao, Li, Ferguson, Meeker, Park, Batterman and Mukherjee2013). This preliminary step helps refine the dataset and enhance model performance before implementing Bayesian approaches, such as: Bayesian Hierarchical Models (Gelman et al., Reference Gelman, Carlin, Stern and Rubin1995), which can handle data with nested or multilevel structures (e.g., repeated measures or multi-cohort studies); BKMR, which is particularly effective for modelling non-linear and interactive effects of correlated exposures; or Bayesian Additive Regression Trees (Chipman et al., Reference Chipman, George and McCulloch2010), a flexible ensemble method that captures complex, non-parametric relationships and has been used to identify high-order interactions in high-dimensional datasets. An alternative to multistage analysis is the use of machine learning (ML) techniques, which are increasingly applied in environmental health to manage high-dimensional exposure data and uncover patterns missed by regression models. ML is well-suited for modelling non-linear relationships and interactions in chemical mixtures. The choice of algorithm depends on the data structure and research aims. For example, neural networks are particularly suited for modelling complex, multilayered interactions in large datasets, while support vector regression (SVR) performs well in small- to medium-sized datasets with non-linear boundaries. Ensemble learning methods like gradient boosting (e.g., XGBoost, LightGBM, CatBoost) and adaptive boosting (AdaBoost) build highly predictive models and are especially effective for structured data with correlated features (see, for example, Argyri et al., Reference Argyri, Gallos, Amditis and Dionysiou2024; Guimbaud et al., Reference Guimbaud, Siskos, Sakhi, Heude, Sabidó, Borràs, Keun, Wright, Julvez and Urquiza2024, and references therein). However, a significant challenge with machine learning models lies in their lack of interpretability. For example, models like gradient boosting, which consist of numerous decision trees, make it difficult to determine how individual features contribute to a specific prediction. To address this limitation, Shapley Additive Explanations (SHAP) has emerged as a valuable tool, offering clear and detailed explanations of ML predictions (Lundberg, Reference Lundberg2017). Ultimately, the choice of method should reflect the dimensionality of the data, sample size, confounding and research objectives. As environmental health data grow in complexity, so does the relevance of machine learning for analysis and interpretation.

Uncovering mechanisms: -omics and molecular pathways in the shift towards causal inference

As research on plastic chemicals advances, there is an increasing emphasis on the need to identify robust causal evidence linking these exposures to human health outcomes. While correlations between exposure and health effects may represent genuine associations, they do not inherently demonstrate causation. A promising strategy to strengthen causal inference involves the integration of evidence from multiple approaches, each with distinct and unrelated sources of potential bias – a method referred to as triangulation of evidence. For instance, combining observational epidemiological data with experimental or quasi-experimental approaches, such as toxicological studies, in vitro systems and mechanistic models, facilitates the identification of biologically plausible pathways linking plastic exposure to health outcomes. Similarly, the integration of ‘multi-omics’ technologies, encompassing comprehensive biological domains such as genomics, transcriptomics and proteomics, offers a robust framework for uncovering novel causal mediators of disease and revealing biological insights that may remain undetected through single-omics analyses (Hasin et al., Reference Hasin, Seldin and Lusis2017; Karczewski and Snyder, Reference Karczewski and Snyder2018; Miller and Consortium, Reference Miller and Consortium2025). Since each method operates under unique assumptions and exhibits distinct strengths and limitations, consistent findings across diverse methodologies significantly reduce the likelihood of artefactual results, thereby strengthening the inference of causality (see Figure 1).

Figure 1. Unlike the traditional epidemiological approach (represented by the grey arrow), which examines the relationship between a single exposure and a disease outcome while many critical aspects of the biological pathway remain uncovered, hybrid epidemiology (depicted by the purple arrow) adopts a more comprehensive view. The hybrid epidemiology approach considers the broader exposome and examines how factors such as genetic predisposition, molecular mechanisms, environmental factors and lifestyle variations within individuals can modulate this effect. Hybrid epidemiology goes a step further by integrating multiple analytical approaches and techniques – including machine learning, multi-omics, advanced statistical methods and animal or cell models – to triangulate and identify the most parsimonious effect of exposure on disease aetiology.

The approach of uncovering the biological pathways linking exposures to outcomes has gained significant attention in Europe and is the focus of ongoing research through initiatives such as the Horizon 2020 ENDpoiNTs project (Lupu et al., Reference Lupu, Andersson, Bornehag, Demeneix, Fritsche, Gennings, Lichtensteiger, Leist, Leonards and Ponsonby2020) and the Psychiatric Disorders and Comorbidities Caused by Pollution in the Mediterranean Area (PsyCoMed) project. These projects combine scientific expertise in endocrine disruption and developmental neurotoxicity with advanced in silico and in vitro tools, innovative experimental approaches and sophisticated biostatistical analyses of human epidemiological and biomonitoring data. Their aim is to uncover both correlative and causal links between neurodevelopmental outcomes and endocrine pathways, addressing not only well-studied targets like estrogen, androgen and thyroid systems but also lesser-known pathways such as the retinoic acid system. Symeonides et al. (Reference Symeonides, Vacy, Thomson, Tanner, Chua, Dixit, Mansell, O’Hely, Novakovic and Herbstman2024) employed a multimodal approach, combining human observational studies and preclinical mouse models, to investigate the link between prenatal bisphenol A (BPA) exposure and autism spectrum disorder (ASD) in males. In the human cohort, BPA exposure was associated with increased ASD symptoms and diagnosis in males with low aromatase activity, driven by BPA-induced hypermethylation of the CYP19A1 brain promoter, with over three-fold increased risk of ASD symptoms at age 2 and six-fold increased risk of diagnosis at age 9 (Symeonides et al., Reference Symeonides, Vacy, Thomson, Tanner, Chua, Dixit, Mansell, O’Hely, Novakovic and Herbstman2024). In mouse models, BPA exposure and aromatase knockout led to ASD-like behaviours, amygdala hypoactivation and brain alterations. In vitro experiments confirmed BPA’s suppression of aromatase in neuronal cells. These findings provide insights into how prenatal BPA disrupts aromatase signalling, causing changes characteristic of ASD in males.

To inform policy effectively, broader research covering a wider range of health outcomes beyond neurodevelopment is needed (Symeonides et al., Reference Symeonides, Aromataris, Mulders, Dizon, Stern, Barker, Whitehorn, Pollock, Marin and Dunlop2024). Further, incorporating mixture analyses into triangulation methods is crucial, as interactions between individual EDCs can amplify their effects. For example, BPA has been shown to enhance oestrogen receptor expression, increasing cellular vulnerability to other EDCs (Hayes et al., Reference Hayes, Weening and Morey2016; Hamid et al., Reference Hamid, Junaid and Pei2021). This complicates the assessment of combined effects based on individual exposures (Kortenkamp, Reference Kortenkamp2008; J. M. Braun et al., Reference Braun, Gennings, Hauser and Webster2016). Misattributing health effects to a single exposure rather than to a correlated harmful exposure carries significant regulatory consequences (Caporale et al., Reference Caporale, Leemans, Birgersson, Germain, Cheroni, Borbély, Engdahl, Lindh, Bressan and Cavallo2022; Duh-Leong et al., Reference Duh-Leong, Maffini, Kassotis, Vandenberg and Trasande2023). Such poorly defined policies can increase global human exposure to plastic chemicals, resulting in considerable public health issues, disability and economic costs (the societal burden of chemical exposure is estimated at US$340 billion annually, see for example Duh-Leong et al., Reference Duh-Leong, Maffini, Kassotis, Vandenberg and Trasande2023). We therefore advocate for the integration of triangulation methods with mixture modelling to more accurately reflect real-world exposure scenarios. This combined approach enhances the identification of the most harmful chemicals within mixtures and provides insights into whether their effects are additive, antagonistic or synergistic – thus supporting more precise and actionable interventions. The applicability of this hybrid epidemiological framework is particularly relevant in areas where environmental exposures act through diverse, interacting biological pathways — including endocrine disruption, neurodevelopment and immune modulation. In this context, the eight hallmarks of environmental insults proposed by Peters et al. (Reference Peters, Nawrot and Baccarelli2021)) — namely oxidative stress and inflammation, genomic alterations and mutations, epigenetic alterations, mitochondrial dysfunction, endocrine disruption, altered intercellular communication, altered microbiome communities and impaired nervous system function — offer a valuable conceptual scaffold. Integrating multi-omics and mechanistic evidence with these hallmarks enables researchers to better identify plausible biological pathways and prioritize environmental exposures for regulatory action.

Additionally, the integration of advanced biomolecular techniques, such as genomics, proteomics and transcriptomics, plays a key role in enhancing the biological plausibility of exposure–disease associations and strengthening causal evidence. For instance, utilizing genetic data can effectively reduce confounding in epidemiological research through methods like pedigree-based analyses (e.g., sibling comparisons) and genetically informed approaches (e.g., polygenic scoring, genetic pathway score functions and Mendelian randomization). This capability to identify genotype-specific or individual responses to environmental exposures is crucial to unravelling gene–environment interactions (see, for example, Tanner et al., Reference Tanner, Thomson, Drummond, O’hely, Symeonides, Mansell, Saffery, Sly, Collier and Burgner2022; Elagali et al., Reference Elagali, Eisner, Tanner, Drummond, Symeonides, Love, Tang, Mansell, Burgner and Collier2025), as the aetiology of many diseases involves intricate interplays between genetic susceptibilities and environmental factors (Manrai et al., Reference Manrai, Cui, Bushel, Hall, Karakitsios, Mattingly, Ritchie, Schmitt, Sarigiannis and Thomas2017; Vermeulen et al., Reference Vermeulen, Schymanski, Barabási and Miller2020; Chang et al., Reference Chang, Ewald, Hui, Bayen and Xia2024). Advancing this understanding is important for the development of predictive gene–environment interaction (GxE) models, with transformative potential in robust chemical regulation and large-scale public health monitoring (Motsinger-Reif et al., Reference Motsinger-Reif, Reif, Akhtari, House, Campbell, Messier, Fargo, Bowen, Nadadur and Schmitt2024). Further, recent advances in single-cell omics and human organoid modelling provide powerful opportunities to validate and contextualize associations identified through epidemiological studies (Cuomo et al., Reference Cuomo, Nathan, Raychaudhuri, MacArthur and Powell2023; Farbehi et al., Reference Farbehi, Neavin, Cuomo, Studer, MacArthur and Powell2024; Caporale et al., Reference Caporale, Castaldi, Rigoli, Cheroni, Valenti, Stucchi, Lessi, Bulgheresi, Trattaro and Pezzali2025). For example, single-cell expression quantitative trait loci (eQTL) mapping reveals how genetic variation influences gene expression in specific cell types or states, helping identify where environmental exposures exert their effects. Likewise, organoid systems derived from human stem cells offer controlled platforms to model tissue-specific responses to plastic chemicals and test hypotheses about developmental disruption, endocrine signalling or neurotoxicity (Caporale et al., Reference Caporale, Castaldi, Rigoli, Cheroni, Valenti, Stucchi, Lessi, Bulgheresi, Trattaro and Pezzali2025). These tools strengthen causal inference by enabling experimental testing of epidemiologically derived hypotheses in human-relevant models and are a key complement to the hybrid epidemiology framework. By incorporating these technologies, researchers can bridge the gap between population-level associations and cell-level mechanisms, ultimately producing more translatable, policy-relevant insights (Caporale et al., Reference Caporale, Leemans, Birgersson, Germain, Cheroni, Borbély, Engdahl, Lindh, Bressan and Cavallo2022).

Limitations

While hybrid epidemiological approaches offer promising pathways for strengthening causal inference, several methodological and practical challenges remain. Data availability and accessibility are key limitations, particularly for high-resolution multi-omics and single-cell datasets, which are often derived from small cohorts and may be restricted due to privacy, consent or regulatory concerns. Even when accessible, harmonizing data across studies is difficult due to differences in sample processing, measurement platforms and annotation standards, hindering integrative and meta-analytic analyses.

Modelling the health effects of chemical mixtures also presents significant challenges. Although methods like WQSR, BKMR and elastic net are useful, they often assume additive effects, despite the likelihood of synergistic or antagonistic interactions. Accurately identifying and quantifying such interactions typically requires large sample sizes, which in turn lead to substantial financial costs – a major limiting factor. These financial constraints are compounded by the need for substantial biospecimen volumes (e.g., blood, urine or plasma), which can be especially difficult to obtain from vulnerable populations such as neonates or young children. As the field progresses, overcoming data and modelling limitations will be essential to bridge existing gaps in our understanding of how environmental exposures affect human health.

Conclusion

Although observational epidemiological studies have provided important information on the health impacts of exposure to plastic chemicals, it is time to move beyond simple ‘black-box’ associations. Hybrid epidemiology, powered by examining mixture analyses, including underpinning -omics and molecular mechanisms, as well as machine learning, offers the opportunity to examine causal evidence, the underlying biological pathways and understand the complex interactions between multiple chemical exposures. This shift is crucial for making informed decisions that protect human health in the face of increasing environmental challenges. By accounting for the totality of exposures and integrating mechanistic evidence, we can more effectively understand and prevent disease, ultimately improving population health, quality of life and economic outcomes such as Gross Domestic Product (GDP, Landrigan, Reference Landrigan2017).

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/plc.2025.10011.

Data availability statement

No data were used for the research described in this article.

Acknowledgements

All authors made a substantial contribution to the conception, preparation, writing and editing of the manuscript. All authors reviewed the manuscript before submission.

Competing interests

The authors declare no competing interests.

Financial statement

This research is funded by Minderoo Foundation (Australia), an independent not-for-profit philanthropic organization. Neither the foundation, nor its benefactors, had any influence in the planning and conduction of this work.

References

Amolegbe, SM, Carlin, DJ, Henry, HF, Heacock, ML, Trottier, BA and Suk, WA (2022) Understanding exposures and latent disease risk within the national institute of environmental health sciences superfund research program. Experimental Biology and Medicine 247(7), 529537.10.1177/15353702221079620CrossRefGoogle ScholarPubMed
Argyri, KD, Gallos, IK, Amditis, A and Dionysiou, DD (2024) Exposomics and cardiovascular diseases: a scoping review of machine learning approaches. medRxiv, 2024–07.10.1101/2024.07.19.24310695CrossRefGoogle Scholar
Bobb, JF, Henn, BC, Valeri, L and Coull, BA (2018) Statistical software for analyzing the health effects of multiple concurrent exposures via bayesian kernel machine regression. Environmental Health 17, 110.10.1186/s12940-018-0413-yCrossRefGoogle ScholarPubMed
Bobb, JF, Dominici, F and Peng, RD (2011) A bayesian model averaging approach for estimating the relative risk of mortality associated with heat waves in 105 us cities. Biometrics 67(4), 16051616.10.1111/j.1541-0420.2011.01583.xCrossRefGoogle Scholar
Bobb, JF, Valeri, L, Henn, BC, Christiani, DC, Wright, RO, Mazumdar, M, Godleski, JJ and Coull, BA (2015) Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics 16(3), 493508.10.1093/biostatistics/kxu058CrossRefGoogle ScholarPubMed
Braun, G, Herberth, G, Krauss, M, König, M, Wojtysiak, N, Zenclussen, AC and Escher, BI (2024) Neurotoxic mixture effects of chemicals extracted from blood of pregnant women. Science 386(6719), 301309.10.1126/science.adq0336CrossRefGoogle ScholarPubMed
Braun, JM, Gennings, C, Hauser, R and Webster, TF (2016) What can epidemiological studies tell us about the impact of chemical mixtures on human health? Environmental Health Perspectives 124(1), A6A9.10.1289/ehp.1510569CrossRefGoogle ScholarPubMed
Caporale, N, Castaldi, D, Rigoli, MT, Cheroni, C, Valenti, A, Stucchi, S, Lessi, M, Bulgheresi, D, Trattaro, S, Pezzali, M, et al. (2025) Multiplexing cortical brain organoids for the longitudinal dissection of developmental traits at single-cell resolution. Nature Methods 22(2), 358370.10.1038/s41592-024-02555-5CrossRefGoogle ScholarPubMed
Caporale, N, Leemans, M, Birgersson, L, Germain, P-L, Cheroni, C, Borbély, G, Engdahl, E, Lindh, C, Bressan, RB, Cavallo, F, et al. (2022) From cohorts to molecules: Adverse impacts of endocrine disrupting mixtures. Science 375(6582), eabe8244.10.1126/science.abe8244CrossRefGoogle ScholarPubMed
Carrico, C, Gennings, C, Wheeler, DC and Factor-Litvak, P (2015) Characterization of weighted quantile sum regression for highly correlated data in a risk analysis setting. Journal of Agricultural, Biological, and Environmental Statistics 20, 100120.10.1007/s13253-014-0180-3CrossRefGoogle Scholar
Chang, L, Ewald, J, Hui, F, Bayen, S and Xia, J (2024) A data-centric perspective on exposomics data analysis. Exposome 4(1), osae005.10.1093/exposome/osae005CrossRefGoogle Scholar
Chipman, HA, George, EI and McCulloch, RE (2010) Bart: bayesian additive regression trees. Annals of Applied Statistics 4 (1), 266298. https://projecteuclid.org/journals/annals-of-applied-statistics/volume-4/issue-1/BART-Bayesian-additive-regression-trees/10.1214/09-AOAS285.10.1214/09-AOAS285CrossRefGoogle Scholar
Cuomo, ASE, Nathan, A, Raychaudhuri, S, MacArthur, DG and Powell, JE (2023) Single-cell genomics meets human genetics. Nature Reviews Genetics 24(8), 535549.10.1038/s41576-023-00599-5CrossRefGoogle ScholarPubMed
Darbre, PD (2022) How could endocrine disrupters affect human health? In Endocrine Disruption and Human Health. Cambridge, Massachusetts: Academic Press, pp. 3156. https://doi.org/10.1016/B978-0-12-821985-0.00002-5Google Scholar
Duh-Leong, C, Maffini, MV, Kassotis, CD, Vandenberg, LN and Trasande, L (2023) The regulation of endocrine-disrupting chemicals to minimize their impact on health. Nature Reviews Endocrinology 19(10), 600614.10.1038/s41574-023-00872-xCrossRefGoogle ScholarPubMed
Elagali, A, Eisner, A, Tanner, S, Drummond, K, Symeonides, C, Love, C, Tang, MLK, Mansell, T, Burgner, D, Collier, F, et al. (2025) A pathway-based genetic score for inflammation: An indicator of vulnerability to phthalate-induced adverse neurodevelopment outcomes. International Journal of Hygiene and Environmental Health 264, 114514.10.1016/j.ijheh.2024.114514CrossRefGoogle ScholarPubMed
Farbehi, N, Neavin, DR, Cuomo, ASE, Studer, L, MacArthur, DG and Powell, JE (2024) Integrating population genetics, stem cell biology and cellular genomics to study complex human diseases. Nature Genetics 56(5), 758766.10.1038/s41588-024-01731-9CrossRefGoogle ScholarPubMed
Fedak, KM, Bernal, A, Capshaw, ZA and Gross, S (2015) Applying the Bradford hill criteria in the 21st century: How data integration has changed causal inference in molecular epidemiology. Emerging Themes in Epidemiology 12, 19.10.1186/s12982-015-0037-4CrossRefGoogle Scholar
Forns, J, Mandal, S, Iszatt, N, Polder, A, Thomsen, C, Lyche, JL, Stigum, H, Vermeulen, R and Eggesbø, M (2016) Novel application of statistical methods for analysis of multiple toxicants identifies ddt as a risk factor for early child behavioral problems. Environmental Research 151, 91100.10.1016/j.envres.2016.07.014CrossRefGoogle ScholarPubMed
Gelman, A, Carlin, JB, Stern, HS and Rubin, DB (1995) Bayesian Data Analysis. Boca Raton, Florida: Chapman/Hall/CRC.10.1201/9780429258411CrossRefGoogle Scholar
Grandjean, P and Landrigan, PJ (2006) Developmental neurotoxicity of industrial chemicals. The Lancet 368(9553), 21672178.10.1016/S0140-6736(06)69665-7CrossRefGoogle ScholarPubMed
Al-Gubory, KH (2014) Environmental pollutants and lifestyle factors induce oxidative stress and poor prenatal development. Reproductive Biomedicine Online 29(1), 1731.10.1016/j.rbmo.2014.03.002CrossRefGoogle ScholarPubMed
Guimbaud, J-B, Siskos, AP, Sakhi, AK, Heude, B, Sabidó, E, Borràs, E, Keun, H, Wright, J, Julvez, J, Urquiza, J, et al. (2024) Machine learning-based health environmental-clinical risk scores in european children. Communications Medicine 4(1), 98.10.1038/s43856-024-00513-yCrossRefGoogle ScholarPubMed
Hamid, N, Junaid, M and Pei, D-S (2021) Combined toxicity of endocrine-disrupting chemicals: A review. Ecotoxicology and Environmental Safety 215, 112136.10.1016/j.ecoenv.2021.112136CrossRefGoogle ScholarPubMed
Hasin, Y, Seldin, M and Lusis, A (2017) Multi-omics approaches to disease. Genome Biology 18, 115.10.1186/s13059-017-1215-1CrossRefGoogle ScholarPubMed
Hayes, L, Weening, A and Morey, LM (2016) Differential effects of estradiol and bisphenol a on set8 and sirt1 expression in ovarian cancer cells. Dose-Response 14(2), 1559325816640682.10.1177/1559325816640682CrossRefGoogle ScholarPubMed
Karczewski, KJ and Snyder, MP (2018) Integrative omics for health and disease. Nature Reviews Genetics 19(5), 299310.10.1038/nrg.2018.4CrossRefGoogle ScholarPubMed
Kortenkamp, A (2008) Low dose mixture effects of endocrine disrupters: Implications for risk assessment and epidemiology. International Journal of Andrology 31(2), 233240.10.1111/j.1365-2605.2007.00862.xCrossRefGoogle ScholarPubMed
Landrigan, PJ (2017) Air pollution and health. The Lancet Public Health 2(1), e4e5.10.1016/S2468-2667(16)30023-8CrossRefGoogle ScholarPubMed
Landrigan, PJ, Raps, H, Cropper, M, Bald, C, Brunner, M, Canonizado, EM, Charles, D, Chiles, TC, Donohue, MJ, Enck, J, et al. (2023) The minderoo-Monaco commission on plastics and human health. Annals of Global Health 89(1), 1215.Google ScholarPubMed
Lanphear, BP (2015) The impact of toxins on the developing brain. Annual Review of Public Health 36(1), 211230.10.1146/annurev-publhealth-031912-114413CrossRefGoogle ScholarPubMed
Lazarevic, N, Barnett, AG, Sly, PD and Knibbs, LD (2019) Statistical methodology in studies of prenatal exposure to mixtures of endocrine-disrupting chemicals: A review of existing approaches and new alternatives. Environmental Health Perspectives 127(2), 026001.10.1289/EHP2207CrossRefGoogle ScholarPubMed
Lenters, V, Portengen, L, Rignell-Hydbom, A, Jönsson, BAG, Lindh, CH, Piersma, AH, Toft, G, Bonde, JP, Heederik, D, Rylander, L, et al. (2016) Prenatal phthalate, perfluoroalkyl acid, and organochlorine exposures and term birth weight in three birth cohorts: Multi-pollutant models based on elastic net regression. Environmental Health Perspectives 124(3), 365372.10.1289/ehp.1408933CrossRefGoogle ScholarPubMed
Lundberg, S (2017) A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 30. https://proceedings.neurips.cc/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.htmlGoogle Scholar
Lupu, D, Andersson, P, Bornehag, C-G, Demeneix, B, Fritsche, E, Gennings, C, Lichtensteiger, W, Leist, M, Leonards, PEG, Ponsonby, A-L, et al. (2020) The endpoints project: Novel testing strategies for endocrine disruptors linked to developmental neurotoxicity. International Journal of Molecular Sciences 21(11), 3978.10.3390/ijms21113978CrossRefGoogle ScholarPubMed
Maitre, L, Guimbaud, J-B, Warembourg, C, Güil-Oumrait, N, Petrone, PM, Chadeau-Hyam, M, Vrijheid, M, Basagaña, X, Gonzalez, JR, Exposome Data Challenge Participant Consortium, Consortium EDCP, et al. (2022) State-of-the-art methods for exposure-health studies: Results from the exposome data challenge event. Environment International 168, 107422.10.1016/j.envint.2022.107422CrossRefGoogle ScholarPubMed
Manrai, AK, Cui, Y, Bushel, PR, Hall, M, Karakitsios, S, Mattingly, CJ, Ritchie, M, Schmitt, C, Sarigiannis, DA, Thomas, DC, et al. (2017) Informatics and data analytics to support exposome-based discovery for public health. Annual Review of Public Health 38(1), 279294.10.1146/annurev-publhealth-082516-012737CrossRefGoogle ScholarPubMed
Miller, GW and Consortium, BE (2025) Integrating exposomics into biomedicine. Science 388(6745), 356358.10.1126/science.adr0544CrossRefGoogle ScholarPubMed
Motsinger-Reif, AA, Reif, DM, Akhtari, FS, House, JS, Campbell, CR, Messier, KP, Fargo, DC, Bowen, TA, Nadadur, SS, Schmitt, CP, et al. (2024) Gene-environment interactions within a precision environmental health framework. Cell Genomics 4(7), 100591. https://doi.org/10.1016/j.xgen.2024.100591CrossRefGoogle ScholarPubMed
Peters, A, Nawrot, TS and Baccarelli, AA (2021) Hallmarks of environmental insults. Cell 184(6), 14551468.10.1016/j.cell.2021.01.043CrossRefGoogle ScholarPubMed
Stafoggia, M, Breitner, S, Hampel, R and Basagaña, X (2017) Statistical approaches to address multi-pollutant mixtures and multiple exposures: The state of the science. Current Environmental Health Reports 4, 481490.10.1007/s40572-017-0162-zCrossRefGoogle ScholarPubMed
Stanfield, Z, Setzer, RW, Hull, V, Sayre, RR, Isaacs, KK and Wambaugh, JF (2024) Characterizing chemical exposure trends from nhanes urinary biomonitoring data. Environmental Health Perspectives 132(1), 017009.10.1289/EHP12188CrossRefGoogle ScholarPubMed
Sun, Z, Tao, Y, Li, S, Ferguson, KK, Meeker, JD, Park, SK, Batterman, SA and Mukherjee, B (2013) Statistical strategies for constructing health risk models with multiple pollutants and their interactions: Possible choices and comparisons. Environmental Health 12, 119.10.1186/1476-069X-12-85CrossRefGoogle ScholarPubMed
Symeonides, C, Aromataris, E, Mulders, Y, Dizon, J, Stern, C, Barker, TH, Whitehorn, A, Pollock, D, Marin, T and Dunlop, S (2024) An umbrella review of meta- analyses evaluating associations between human health and exposure to major classes of plastic-associated chemicals. Annals of Global Health 90(1), 154.10.5334/aogh.4459CrossRefGoogle ScholarPubMed
Symeonides, C, Vacy, K, Thomson, S, Tanner, S, Chua, HK, Dixit, S, Mansell, T, O’Hely, M, Novakovic, B, Herbstman, JB, et al. (2024) Male autism spectrum disorder is linked to brain aromatase disruption by prenatal bpa in multimodal investigations and 10hda ameliorates the related mouse phenotype. Nature Communications 15(1), 6367.10.1038/s41467-024-48897-8CrossRefGoogle ScholarPubMed
Tanner, S, Thomson, S, Drummond, K, O’hely, M, Symeonides, C, Mansell, T, Saffery, R, Sly, PD, Collier, F, Burgner, D, et al. (2022) A pathway-based genetic score for oxidative stress: An indicator of host vulnerability to phthalate-associated adverse neurodevelopment. Antioxidants 11(4), 659.10.3390/antiox11040659CrossRefGoogle ScholarPubMed
Vermeulen, R, Schymanski, EL, Barabási, A-L and Miller, GW (2020) The exposome and health: Where chemistry meets biology. Science 367(6476), 392396.10.1126/science.aay3164CrossRefGoogle ScholarPubMed
Wild, CP (2005) Complementing the genome with an “exposome”: The outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiology Biomarkers & Prevention 14(8), 18471850.10.1158/1055-9965.EPI-05-0456CrossRefGoogle ScholarPubMed
Woodruff, TJ, Zota, AR and Schwartz, JM (2011) Environmental chemicals in pregnant women in the United States: Nhanes 2003–2004. Environmental Health Perspectives 119(6), 878885.10.1289/ehp.1002727CrossRefGoogle ScholarPubMed
Woodside, FC III and Davis, AG (2012) The Bradford hill criteria: The forgotten predicate. Thomas Jefferson Law Review 35, 103.Google Scholar
Yang, Y, Li, R, Li, W, Wang, M, Cao, Y, Wu, Z and Xu, Q (2013) The association between ambient air pollution and daily mortality in Beijing after the 2008 olympics: A time series study. PLoS One 8(10), e76759.10.1371/journal.pone.0076759CrossRefGoogle ScholarPubMed
Zou, H and Zhang, HH (2009) On the adaptive elastic-net with a diverging number of parameters. Annals of Statistics 37(4), 1733.10.1214/08-AOS625CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. Unlike the traditional epidemiological approach (represented by the grey arrow), which examines the relationship between a single exposure and a disease outcome while many critical aspects of the biological pathway remain uncovered, hybrid epidemiology (depicted by the purple arrow) adopts a more comprehensive view. The hybrid epidemiology approach considers the broader exposome and examines how factors such as genetic predisposition, molecular mechanisms, environmental factors and lifestyle variations within individuals can modulate this effect. Hybrid epidemiology goes a step further by integrating multiple analytical approaches and techniques – including machine learning, multi-omics, advanced statistical methods and animal or cell models – to triangulate and identify the most parsimonious effect of exposure on disease aetiology.

Author comment: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR1

Comments

No accompanying comment.

Review: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

General Comments

This perspective article is well-written and makes a positive contribution to the field.

The article recommends a shift toward hybrid epidemiological approaches that incorporate genetic and molecular data to explore underlying biological mechanisms and move beyond traditional “black-box” approaches. It also recommends combing this with machine learning and statistical techniques to analyse chemical mixtures, enabling a deeper understanding of their combined health effects and the biological pathways they influence.

The authors effectively convey their messages by seamlessly guiding the reader through epidemiology, statistics, and basic sciences. The recommendations flow logically as a natural outcome of the preceding discussion.

Below are a few comments that may improve the quality of the paper, primarily focusing on making it more accessible and broadening its readership.

(1)

Pages 5 (from line 51) to page 6 (up to line 49): From “In recent years, numerous methods have been introduced….” to “ …….. omics data and mixed categorical and continuous variables—remains limited”

- This section provides a helpful overview of statistical methods used in mixture analysis. It might be beneficial to briefly expand on the background (specifically, the challenges these methods are designed to address) before describing how each method tackles them. Providing this context would be especially valuable in a ‘Perspective’ paper, as it helps engage a broader audience, including readers without advanced statistical expertise.

(2)

Page 6, lines 56-59: “This preliminary step helps refine the dataset and enhance model performance before implementing Bayesian approaches such as Bayesian Hierarchical Models (Gelman et al. 1995), BKMR, or Bayesian Additive Regression Trees (Chipman, George, and McCulloch 2010)”.

- To make it more informative for readers, it could be useful to briefly explain each approach and how it relates to the topic (for example, in what contexts each method is most suitable). This would help strengthen the connection between the methodology and its application and make the discussion more accessible.

(3)

Page 7, lines 3-25: “An alternative to this multistage analysis is the application of machine learning techniques, such as neural networks ….” to the end of the paragraph

- Machine learning is mentioned several times in the manuscript and included as a keyword. Since this section introduces machine learning for the first time, it offers an opportunity to engage the reader. Consider rewriting this part to make the introduction more engaging. It would be helpful to explain how machine learning can enhance causal inference and to provide context for the techniques mentioned, so readers can better understand their relevance and application.

(4)

Page 7, lines 39-44: “This approach, extensively employed across various disciplines, including theoretical physics, astrophysics, and mathematics (Tegmark 2015; Abbott et al. 2016; Lawlor, Tilling, and Davey Smith 2016), leverages complementary methodologies to enhance causal understanding”.

- Triangulation is a well-established concept in epidemiology, so this quoted part may be a bit redundant. Authors might consider removing it to keep the text more concise and focused.

(5)

While the term “black-box” epidemiology is conceptually engaging, it might be clearer for readers if a term like “classical” or “traditional” epidemiology were used instead.

(6)

I wonder if the phrase “establishing causality” might be a little overused here. In a practical context like this paper, it might be more helpful for readers if we focus on how credible it is to suggest that the exposure caused the outcome, rather than framing it as definitively establishing causality.

(7)

The authors make a strong case for the shift toward new approaches. It may be helpful to briefly acknowledge some of the key challenges or limitations that could arise in implementing this shift, to provide a more balanced ‘perspective’.

(8)

I suggest having a look at the figure and its various components. I’m not sure it clearly conveys the intended messages in its current form.

Minor comments

(9)

I am not sure if GDP needs to be spelled out.

Review: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

This manuscript presents a perspective on the limitations of traditional observational epidemiology in studying the health impacts of plastic chemicals and proposes a shift towards hybrid epidemiological methodologies. The authors argue that relying solely on associations between exposures and outcomes (“black box” epidemiology) is insufficient to establish causality due to the complexity of chemical mixtures, confounding factors, and the absence of underlying mechanistic data. They advocate for integrating genetic susceptibility, molecular mechanisms, multiomics data, machine learning, and advanced statistical analyses to strengthen causal inference and improve public health interventions leveraging a triangulation of evidence.

This represents a crucial and timely issue in environmental health research. The proposed hybrid epidemiological framework has significant scientific value and potential impact. While this is not a completely novel concept/idea, this perspective offers a useful proposal to move beyond simple associations and delve into the underlying biological pathways altered by complex chemical mixtures.

There are however several aspects that should be better addressed to strengthen the relevance, clarity and rigour of the manuscript, as suggested in the comments below.

While the manuscript provides a general overview of the hybrid epidemiological framework, it could benefit from a more specific definition of applicability domains. For example, the authors could refer to the seminal review by Peters et al 1, and explain how the hallmarks of environmental insults can be leveraged in their integrative framework.

“These challenges emphasize the critical need for a hybrid epidemiological research approach that incorporates advancements in chemical mixture analysis, machine learning, and molecular epidemiology techniques to strengthen causal inferences.”

The need to integrate experimental testing with clinical data to provide the causal link should be better emphasised in the above passage and overall in the manuscript.

While the manuscript covers a wide range of topics, some sections could benefit from more in-depth discussion. For example, when presenting the need to integrate epidemiological associations with molecular mechanisms, the authors could better discuss recent technological advances, particularly in the areas of single cell omics and human organoid modelling, that are enabling novel and more powerful ways to link epidemiological data to molecular mechanisms (as for example discussed in 2–4).

The manuscript acknowledges some limitations of the proposed methodologies (e.g., interpretability of machine learning models), however a more in-depth discussion of potential challenges and limitations would be beneficial, in particular including issues and challenges related to data availability, data sharing, and data harmonization across heterogeneous sources. Also for single chemical vs mixtures effects there are several challenges related to the modelling of additive vs synergistic effects that should be better presented and discussed.

The recent perspective in Science about integrating exposomics into biomedicine 5 seems an interesting reference for this manuscript.

As a more theoretical thought/suggestion: a recent paper proposes to use mathematical frameworks from information theory to quantitatively understand information flow in cell biology 6. Are similar principles implied when the “complementary methodologies in theoretical physics, astrophysics, and mathematics to enhance causal understanding” are mentioned to explain the “triangulation of evidence” approach proposed here? If yes it would be interesting to discuss these converging development of analytical approaches from epidemiology to cell biology.

1. Peters, A., Nawrot, T. S. & Baccarelli, A. A. Hallmarks of environmental insults. Cell 184, 1455–1468 (2021).

2. Farbehi, N. et al. Integrating population genetics, stem cell biology and cellular genomics to study complex human diseases. Nat. Genet. 56, 758–766 (2024).

3. Cuomo, A. S. E., Nathan, A., Raychaudhuri, S., MacArthur, D. G. & Powell, J. E. Single-cell genomics meets human genetics. Nat. Rev. Genet. 24, 535–549 (2023).

4. Caporale, N. et al. Multiplexing cortical brain organoids for the longitudinal dissection of developmental traits at single-cell resolution. Nat. Methods 22, 358–370 (2025).

5. Miller, G. W. & Banbury Exposomics Consortium. Integrating exposomics into biomedicine. Science 388, 356–358 (2025).

6. Quake, S. R. The cellular dogma. Cell 187, 6421–6423 (2024).

Recommendation: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR4

Comments

The Cambridge Prisms Plastics editorial team have worked very hard to secure appropriate available reviewers for this manuscript. It is unfortunate this has meant that the authors have had to wait so long for this review outcome. We are very happy that we are now in the position to provide this recommendation.

The manuscript is well-written. Both reviewers are complementary of the manuscript and share the opinion that the article will make a positive contribution to the field. Reviewer 2 describes this as a “a crucial and timely issue in environmental health research. The proposed hybrid epidemiological framework has significant scientific value and potential impact [offering]… a useful proposal to move beyond simple associations and delve into the underlying biological pathways altered by complex chemical mixtures.”

Both reviewers offer recommendations to strengthen some sections of the manuscript including providing more in-depth discussion and context in areas, as well as specific definitions. One reviewer provides a list of potentially helpful publications to assist the authors in deepening the theoretical analysis. This manuscript will be significantly strengthened if the authorial team systematically consider and respond to the recommendations of both reviewers.

Decision: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR5

Comments

No accompanying comment.

Author comment: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R1/PR6

Comments

No accompanying comment.

Recommendation: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R1/PR7

Comments

I am wholly satisfied that the authors have comprehensively, systematically, and appropriately responded to all the reviewers’ comments and that this manuscript is now ready for publication.

Decision: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R1/PR8

Comments

No accompanying comment.