Skip to main content

TECHNOLOGY AND CODE article

Front. Public Health, 24 July 2024
Sec. Digital Public Health

The Chilean exposome-based system for ecosystems (CHiESS): a framework for national data integration and analytics platform

Patricia MatusPatricia MatusAlejandro Sepúlveda-PealozaAlejandro Sepúlveda-PeñalozaKeneth PageKeneth PageClaudia RodríguezClaudia RodríguezMarcela CrcamoMarcela CárcamoFrancisco BustamanteFrancisco BustamanteMarcela GarridoMarcela GarridoCinthya Urquidi
Cinthya Urquidi*
  • Department of Epidemiology and Health Studies, Medicine Faculty, Universidad de los Andes, Santiago, Chile

The double burden of diseases and scarce resources in developing countries highlight the need to change the conceptualization of health problems and translational research. Contrary to the traditional paradigm focused on genetics, the exposome paradigm proposed in 2005 that complements the genome is an innovative theory. It involves a holistic approach to understanding the complexity of the interactions between the human being’s environment throughout their life and health. This paper outlines a scalable framework for exposome research, integrating diverse data sources for comprehensive public health surveillance and policy support. The Chilean exposome-based system for ecosystems (CHiESS) project proposes a conceptual model based on the ecological and One Health approaches, and the development of a technological dynamic platform for exposome research, which leverages available administrative data routinely collected by national agencies, in clinical records, and by biobanks. CHiESS considers a multilevel exposure for exposome operationalization, including the ecosystem, community, population, and individual levels. CHiESS will include four consecutive stages for development into an informatic platform: (1) environmental data integration and harmonization system, (2) clinical and omics data integration, (3) advanced analytical algorithm development, and (4) visualization interface development and targeted population-based cohort recruitment. The CHiESS platform aims to integrate and harmonize available secondary administrative data and provide a complete geospatial mapping of the external exposome. Additionally, it aims to analyze complex interactions between environmental stressors of the ecosystem and molecular processes of the human being and their effect on human health. Moreover, by identifying exposome-based hotspots, CHiESS allows the targeted and efficient recruitment of population-based cohorts for translational research and impact evaluation. Utilizing advanced technologies such as Artificial Intelligence (AI), Internet of Things (IoT), and blockchain, this framework enhances data security, real-time monitoring, and predictive analytics. The CHiESS model is adaptable for international use, promoting global health collaboration and supporting sustainable development goals.

1 Introduction

The persistently high burden of non-communicable diseases (NCDs) worldwide (1, 2) highlights the shortcomings of preventive interventions in real-world settings. After birth, individuals live in a complex ecosystem where environmental factors influence their risk of developing diseases and the effectiveness of interventions evaluated in controlled experiments (3, 4). While a small proportion of diseases are genetic in origin, the majority are attributable to environmental factors (57). Similarly, predictive models in precision medicine exhibit significant unexplained variability, which is attributed to contextual factors such as social determinants and lifestyles (8, 9).

Studies have often focused on individual factors or specific environmental exposures. In 2005, Wild proposed the novel exposome paradigm to complement the human genome in cancer research, which consider the totality of exposures throughout a person’s life and their complex interactions, including chemical, biological, psychosocial, and lifestyle factors (10). Subsequently, Wild stated that the exposome approach comprises the external (general and specific) and the internal exposome. The external exposome includes all external environmental or contextual factors affecting an individual, such as pollutants, lifestyle, and socioeconomic status. The internal exposome encompasses biological responses and processes (the omics) within the body, like metabolism, immune response, and the microbiome (11). Thus, exposome studies offer a more detailed and comprehensive understanding of the complex interactions between environmental exposures and their influence on internal biological processes and vice versa, contributing to the development of more effective strategies for disease prevention and public health interventions.

Complementing the exposome theory with the One Health approach offers a powerful novel paradigm for advancing public health. The One Health approach recognizes that human health is closely linked to the health of animals and their shared environment. By emphasizing the interconnectedness of human, animal, and environmental health, both the One Health approach and the exposome concept benefit from interdisciplinary collaboration and integrated data systems. This synergy enables comprehensive research and public health interventions, enhances disease surveillance, informs public health policies, and supports the development of innovative strategies to tackle complex health challenges across species and ecosystems. Understanding disturbances in one species can provide critical insights into promoting health in others, leading to a more holistic and effective approach to public health (12, 13).

Several exposome-based initiatives have emerged, including the HELIX, HERCULES, EXPOsOMIC, and the European Human Exposome Network (EHEN) (1419). Most of these projects focus on European populations with diverse geographical scopes, targeting specific diseases or health outcomes, such as early-life development, cardiovascular and metabolic diseases, chronic respiratory diseases, or major chronic diseases. These initiatives typically involve interdisciplinary collaboration to develop tools for exposome research by integrating data from previously recruited birth cohorts. Additionally, these projects are addressing challenges such as data privacy, diverse data types, temporal variation, and geographic data scales through the development of technological solutions and robust protocols.

Although mega-cohort studies, including biobanks, provide the framework for exposomic research, they require substantial investment. Latin American (LATAM) and other developing countries have different socioeconomic contexts and scarce research resources. Additionally, assembling population cohorts is expensive. Conversely, administrative data that is routinely collected provide insights into societal trends, resource allocation, and program effectiveness, and they are valuable for policy and decision-making across various sectors. Moreover, the Pan American Health Organization coordinated public health efforts to strengthen epidemiological surveillance in LATAM countries, providing a platform for member countries to share epidemiological data and information.

Most countries face a double burden of diseases, which refers to the coexistence of communicable (infectious) and NCD within a population, community, or individual. This phenomenon is particularly relevant in transitional or developing societies that are undergoing rapid social, economic, and demographic changes (20). Chile, a LATAM country, has demonstrated a sustained rapid economic growth over the last few decades. However, a socioeconomic gap persists, with income inequality being a notable concern. The epidemiological transition in Chile reflects a shift from a predominance of infectious diseases to an increasing burden of NCD, which is often associated with lifestyle changes. The coexistence of communicable diseases and NCDs highlights the complex health landscape during Chile’s economic evolution (21).

The purpose of this communication is to describe the conceptual model of the Chilean exposome-based system for ecosystems CHiESS project, which is complemented by the ecological and One Health approaches, and the technological development of a platform that allows the integration of existing administrative data, operationalization of exposome, and evaluation of the exposome impact on health outcomes.

2 CHiESS description

2.1 Aim of CHiESS

The primary purpose of the CHiESS project is to develop an intelligent technological platform that allows exposome operationalization and to investigate its effects on human health using administrative data. This platform will also serve as a foundation for integration with primary omics data. Thus, we aim to better understand the etiology of the disease and the translation of scientific evidence of both individual and population interventions to real-world settings.

The specific objectives of the CHiESS project are as follows:

1. To build a dynamic and automated system that allows the integration, analysis, and visualization of a wide range of environmental stressors and individual data.

2. To operationalize and map the external and internal exposomes for identifying and monitoring novel hotspots.

3. To study the exposome contribution to the disease’s etiology and the impact of preventive interventions.

The protocol was approved by the Scientific Ethics Committee of the Universidad de los Andes, Chile.

2.2 Administrative data sources

Leveraging available administrative data is a differentiating focus of CHiESS. Administrative data sources encompass a wide range of information that governmental and organizational entities collect for administrative purposes. These sources include the following (Table 1).

Table 1
www.frontiersin.org

Table 1. Description of potential data sources for CHiESS.

2.3 General research questions and conceptual model of CHiESS

The CHiESS project raises the following three general research questions, which address priority current diseases according to biological plausibility and the current state of knowledge:

a. What environmental exposure patterns, temporal accumulation of patterns, interaction of patterns, or mechanisms contribute to the diseases?

b. Can omics signals or biomarkers of the health effects of multiple and prolonged exposures to environmental stressors be identified?

c. What individual- or population-based interventions could be targeted using the exposome paradigm?

Consequently, the integration and analysis of CHiESS data are founded on specific hypotheses that will be derived from the previously formulated research questions, focusing on certain diseases such as diabetes, cancer, and obesity, among others. Furthermore, data integration and analytics within CHiESS adhere to an agnostic approach, which is a hypothesis-free method aimed at uncovering new disease etiologies through omics and non-omics technologies, devoid of specific preconceptions or predetermined conclusions.

Based on the ecological theory, which examines patterns and processes in nature at different levels of organization (from individual units to larger systems), and considering the geospatial and administrative division of Chile, the CHiESS conceptual model consists of four levels of exposure: (1) ecosystem, (2) community, (3) population, and (4) individual (Figure 1). The ecological model is justified as it allows for a comprehensive understanding of environmental and biological interactions at different scales, which is essential for investigating the multifactorial nature of health outcomes. Additionally, the ecological approach permits the conceptualization and operationalization of shared and non-shared exposures, where shared exposures pertain to the general external exposome in specific geographic areas, and non-shared exposures relate to individual lifestyles or specific external exposome. Furthermore, the One Health approach, which recognizes the interconnectedness of human, animal, and environmental health, is integral to CHiESS, providing a holistic framework to address complex health issues.

Figure 1
www.frontiersin.org

Figure 1. Conceptual model.

2.3.1 Ecosystem

Continental Chile is divided into 16 regions, each with distinct climates, soils, economic activities (e.g., mining and agriculture industry), and biodiversity due to the country’s extensive geographical length. For instance, the northern regions are characterized by deserts, arid areas, and high peaks of the Andes mountains, while the southern regions, feature humid jungles to glacial areas. The central regions are predominantly urbanized and engage in intensive agriculture. Thus, the ecosystem encompasses the biotic organisms and abiotic factors of each region, including political, economic, and climate, denoting shared exposures of the general external exposome. This level can be extended to a broader area range, such as two or more countries or WHO regional areas (Figure 1).

2.3.2 Community

Chile is further divided into 346 communes, the smallest unit of local administration, managed by municipalities. At this level, a community includes all populations of different species coexisting in a location, such as mammals, birds, and other species that can spread diseases. The community level in CHiESS, includes geographical characteristics, species distribution, and behavior (including zoonotic vectors and domestic and productive animals), as well as social, educational, or chemical exposures at the commune level, denoting the shared exposures of the general external exposome, but within a smaller areas and including other species (Figure 1).

2.3.3 Population

According to the ecological organization theory, this level examines all individuals of the human species in a specific area at a given time. Moreover, given the variations in population density across communes, different ecosystems emerge. The population level in CHiESS focuses on human-specific exposures and behaviors (in contrast to the communal level, which includes other species and geographic factors), constructing aggregated indicators of exposure to environmental contaminants, lifestyles, psychosocial, economic, and health-related factors, denoting the specific external exposome of unshared exposures but at the population level (Figure 1).

2.3.4 Individual

This level focuses on a single organism and its physiology, behavior, and adaptations. At the individual level, exposures assessed at the population level will be evaluated on a personal scale, denoting unshared individual exposures of the specific external exposome (Figure 1).

At the individual level, CHiESS includes the internal exposome represented by omics data, which will allow for the identification of biomarkers and omics signals associated with prolonged and multiple exposures to environmental stressors. By linking these biomarkers with specific exposure patterns identified at higher levels of the model (ecosystem, community, and population), CHiESS can elucidate the biological pathways affected by environmental stressors. This approach enables the identification of molecular signatures that indicate the health effects of complex exposure scenarios, providing insights into the mechanisms of disease development.

2.3.5 CHiESS outcomes

The project’s purpose is to provide evidence for both local public policies and precision medicine concerning NCD, re-emerging and emerging infectious diseases, and for countries confronting the double burden of disease. The CHiESS addresses outcomes for both NCD and communicable diseases, considering their etiology and transmission. Health effects will also be assessed at various levels, with outcomes defined at both the individual and population levels (green arrows in Figure 1).

The www.frontiersin.org symbol in Figure 1 denotes the complex interaction between the multilevel exposures within each level and between levels. The www.frontiersin.org symbol corresponds to the recursiveness of health effects (i.e., how individual outcomes contribute to population indicators and vice versa).

CHiESS acknowledges the cumulative and dynamic nature of exposures by investigating how exposures accumulate and interact across different life stages, from prenatal development through childhood, adolescence, adulthood, and into old age, illustrated by the arrow at the top indicating time in Figure 1. Additionally, the accumulative exposure over time can occur at the four levels of CHiESS. For example, considering the economic evolution of Chile and within its region (ecosystem), changes in biodiversity or social factors within the commune (community), shifts in social dynamics of the population (population), or changes in individual lifestyle choices (individual).

By mixing the ecological and One Health approaches to the exposome, CHiESS aims to capture the complex interplay between environmental, biological, and social factors influencing health. This comprehensive approach is critical for developing targeted interventions and informing public health strategies responsive to the unique contexts of different regions and communities. Thus, the CHiESS conceptual model can be applicable to other countries. Additionally, the ecological approach allows for the conceptualization and further operationalization of shared exposures, which pertain to the general external exposome, and non-shared exposures, which relate to individual lifestyles.

The CHiESS model systematically integrates data across four levels of exposure: ecosystem, community, population, and individual. This multilevel approach allows for the comprehensive analysis of environmental exposure patterns and their temporal accumulation. By examining interactions between various exposures (e.g., socioeconomic, chemical, and biological), CHiESS aims to identify mechanisms that contribute to diseases. The model’s ecological basis facilitates the exploration of how different exposures interact within and across levels to influence health outcomes. For example, by analyzing data at the community level, the model can reveal how local environmental factors, such as pollution and social stressors, combine and accumulate over time to impact health.

Finally, the CHiESS model provides a framework for defining environmental health priorities, guiding both cohort recruitment and the development of targeted interventions. By characterizing exposures at multiple levels (ecosystem, community, population, and individual), CHiESS can identify high-risk areas and populations for targeted interventions. For instance, at the population level, aggregated indicators of exposure can inform public health strategies, while at the individual level, personalized interventions can be designed based on specific omics profiles. The model’s ability to integrate and analyze diverse data types allows for the identification of intervention points that can mitigate the impact of harmful exposures, ultimately contributing to precision public health.

2.4 CHiESS technological framework

The leading architecture and data interactions in CHiESS for the platform are based on five layers that evolve in a dynamic and robust environment: (1) diverse data sources, (2) data wrangling, (3) data warehousing, (4) discover engine, and (5) data dashboard (Figure 2).

Figure 2
www.frontiersin.org

Figure 2. The technological framework of the platform.

The CHiESS platform architecture seamlessly integrates external exposome information with individual omics and healthcare provider data. This first layer manages a diverse array of external data sources from government agencies, epidemiological surveillance systems, educational institutions, social services, employment and labor departments, transportation authorities, environmental agencies, and statistical agencies. The data may include relevant information such as air quality indices, pollution levels, socioeconomic indicators, lifestyle information, and zoonotic vectors. This layer also interfaces seamlessly with individual omics data from healthcare systems, clinical data, electronic health records, lifestyle assessments, and other original studies. Additionally, it incorporates genomics, epigenomics, transcriptomics, metabolomics, and proteomics to provide a complete view of an individual’s biological makeup. The collection of this data will be challenging because the information should be reconciled cohesively.

The second layer addresses the data heterogeneity with a robust data wrangling and preparation process that harmonizes the diverse datasets. This ensures consistency and quality of the integrated datasets, addresses potential discrepancies, and enhances the overall reliability of the exposome studies. The architecture employs sophisticated data integration and transformation techniques, ensuring that disparate data types are harmonized and normalized to facilitate meaningful correlations and analyses across dimensions. This transformation process involves careful consideration of temporal, spatial dimensions, standardization, and normalization, as well as data quality checks to eliminate potential biases.

A secure cloud-based data warehousing layer efficiently stores the pre-processed data to ensure an efficient and protected environment for sensitive information. This layer serves as a dynamic interface that facilitates efficient storage and allows scalable and flexible data management. Researchers, stakeholders, and policymakers benefit from this central repository that streamlines collaborative efforts and ensures data integrity.

In the final dual-layer, an engine for advanced algorithms operates in tandem with a visualization tool to provide a comprehensive platform for analysis and interpretation. This final engine layer extracts patterns and associations from the integrated data via two types of analyses (hypothesis-based and agnostic algorithms). Simultaneously, the visualization tool translates complex findings into interpretable insights, which facilitates comprehension and decision-making. The platform capitalizes on advanced techniques to transform complex data into interpretable insights. A diverse array of visualization methods is employed to present findings in a meaningful, accessible, and effective manner. Graphical representations, such as interactive charts, heatmaps, and network diagrams, offer intuitive insights into patterns and relationships within the integrated datasets. Geographic visualizations overlaying the exposome data on maps facilitate the identification of spatial trends, which is crucial for understanding regional variations in environmental exposures and health outcomes. Furthermore, it incorporates dynamic views that enable historical and real-time exploration of data trends. Thus, stakeholders can focus on specific variables or timeframes which facilitate targeted analyses. This interactive approach enhances user experience and promotes collaboration by allowing diverse stakeholders to contribute their expertise to the exposome studies.

3 Project stages development

CHiESS will undergo four consecutive stages in its development into an informatics platform.

3.1 Stage one: environmental data integration and harmonization system

The first stage aims to develop the initial data integration and harmonization components. This stage also involves the development of a data integration application and the processing of queries to allow geospatial operationalization of the general external exposome at the ecosystem (regions) and community (commune) levels. Validation at this stage will consist of determining data inconsistency, integrity, and post-transformation accuracy according to data governance policies and privacy regulations.

3.2 Stage two: health and omics data integration

The second stage focuses on developing complementary informatics components for integrating and harmonizing data from clinical records and biobanks. This will operationalize the specific external exposome at the population level and the internal exposome at the individual level. Validation at this stage will consist of determining data integrity and post-transformation accuracy in addition to confirming the concordance of systems results with well-known geospatial exposure distribution and omics-biomarkers outcomes.

3.3 Stage three: development of advanced analytical algorithms

The third stage involves developing hypothesis-based or supervised and unsupervised agnostic algorithms complemented by causal diagrams. The conceptual model of CHiESS encompasses the operationalization of exposures at different levels of observation and aims to evaluate trends and complex interactions of multiple exposures. Therefore, data-driven analytics will be based on the exposome-wide association study (ExWASs) approach, which considers mixed models for nested or correlated data, weighting for complex survey samples when data comes from national surveys, different types of variable selection such as LASSO or Ridge regressions (22), and mixture analysis such as Weighted Quantile Sum or Bayesian approaches or Gaussian process regression (23). Additionally, causal machine-learning models (24) will complement hypothesis-based association analyses.

The conceptual model of CHiESS also encompasses the operationalization of the external exposome in specific geographic areas and spatiotemporal data linkage. Thus, geospatial techniques to account for spatial autocorrelation and to model the spatial distribution of exposures and health outcomes will be considered, as well as techniques for small-area analysis such as Fay-Herriot and Spatial Fay-Herriot models (25).

3.4 Stage four: visualization interface development and population-based cohort recruitment

The fourth stage is focused on developing and implementing visualization and monitoring interfaces for mapping and surveillance hotspots for final users and stakeholders. Moreover, the identification of hotspots at this stage will help determine policy recommendations, plan interventions, and recruit a hotspot-targeted population-based cohort for the measurement of environmental stressors and omics, as well as a final external validation.

4 Main challenges

Undoubtedly, CHiESS is an ambitious and large-scale project that presents many challenges, some of which have been identified previously by other initiatives, and others will likely continue to emerge (2630).

The main challenges are the high dimension integration and mining data with time and spatial dependence structure, dealing with heterogeneity data, and the development of wearable measurement sensors. Fortunately, with the advancements in bioinformatics, big data analytics, artificial intelligence, and machine learning, among others, these challenges can be overcome.

Another challenge for CHiESS is incorporating causal structures in the integrated data, ensuring advanced analytics are appropriately tailored to a robust causal design that tests hypotheses, and understanding the role of confounders. Therefore, epidemiological and clinical reasoning is essential during data integration and analysis. Since CHiESS is initially based on administrative data integration, validating exposure and outcome data measurements is another challenge. The application of known exposure or outcomes biomarkers, complemented by validated questionnaire-based approaches, may overcome this limitation because altered mRNA, proteins, or metabolite levels will reflect specific environmental exposures.

Notwithstanding the aforementioned limitations, matching omics measurements with functionalities (e.g., biomarker development, differentiating exposures from biological responses, investigating mixtures and interactions between agents, and understanding mechanisms for biological plausibility and etiology of diseases) is undoubtedly another challenge for CHiESS due to its cost. Hence, the prospective cohort study design best suits the exposure biomarker approach, providing opportunities for repeat sampling to enable a broader timeframe of exposure assessment and avoid reverse causation by collecting samples before the disease onset. Thus, CHiESS provides criteria for defining environmental health priorities, which will guide priority cohort recruitment as well as research and tailoring omics measurement and target intervention at the population level. A geocoding process to a geospatial dimension is necessary to integrate existing or new data. One of the advantages in Chile is the existence of a unique identifier number for individuals, which facilitates the integration of individual data while adhering to confidentiality standards. On the other hand, due to the multilevel and geospatial conceptual mode of CHiESS, we plan to calculate the geospatial diameter of a commune and match it with individual addresses and thus their exact coordinates, but we will need to define an analysis radius to cross-reference information within a commune. This could lead to issues, for example, where a person lives on the border of a commune and exhibits symptoms of pollution from a neighboring commune that falls within the defined perimeter.

Because CHiESS is based on the One Health ecosystem approach, another challenge is understanding the particularities of each health problem and establishing connections with general processes such as globalization, territorial expansion, migration, job insecurity, vulnerability of populations, environmental degradation, and urbanization (31). Thus, a multidisciplinary team composed of epidemiologists, sociologists, clinicians, environmental health experts, agricultural scientists, ecologists, veterinarians, microbiologists, biostatisticians, and geospatial analysts is essential for the formulation of specific yet holistic hypotheses and data analysis. On the other hand, the technological development will need to be addressed by data scientists to develop algorithms and models for integrating and analyzing diverse datasets, software engineers to develop and maintain the platform’s software infrastructure, computer scientists or informatics specialists to manage the hardware and network infrastructure required for data storage and processing, and a project Manager to coordinate the efforts of the multidisciplinary team and ensure timely progress. Fortunately, innovation and technological development from academia, along with interdisciplinary collaboration and partnerships with industry, make this feasible.

The major challenge of a technology platform integrating health data is ensuring the protection of sensitive information. This includes safeguarding data from authorized access, preventing breaches or leaks, and complying with privacy regulations such as HIPAA (Health Insurance Portability and Accountability Act) in the United States or GDPR (General Data Protection Regulation) in the European Union. In Chile, Law 19.628, also known as the Chilean Data Protection Law (Ley de Protección de Datos Personales), regulates the protection of personal data. It establishes principles and rules for the processing of personal data, including the rights of individuals regarding their data and the obligations of data controllers and processors. Compliance with Law 19.628 is crucial for any technology platform handling health data in Chile, as it sets requirements for data security, consent, notification of data breaches, and other aspects of data protection. Hence, CHiESS contemplates compliance with national and international laws, and it will need to prioritize to ensure the integrity and reliability of the data amidst evolving cybersecurity as part of its implementation.

Finally, other ethical challenges in data integration could arise from the potential mishandling of data, privacy concerns, security risks, data governance, transparency and accountability, legal and regulatory compliance, and the impact on individuals and society. Thus, applying technical measures, legal frameworks, and ethical guidelines will ensure that data integration is conducted responsibly and transparently.

5 Discussion

The complexity of human exposome needs to be investigated progressively. To our knowledge, CHiESS is an innovative exposome-based project in LATAM that contextualizes the human exposome and introduces the exposome paradigm in translational research, which can be scaled and contribute to the exposomic research network worldwide. Unlike other initiatives, the CHiESS model is based on the One Health ecosystem approach to include the patterns of other species that contribute to the spread of communicable diseases. Furthermore, CHiESS focuses on the geospatial operationalization of the general external exposome based on administrative and routinely collected data.

The CHiESS project aims to address complex interactions that have been understudied in longstanding health issues, as well as to explore new causal pathways in disease etiology. For example, these may include the complex interactions between socioeconomic status and lifestyle factors in relation to obesity, or the evaluation of the effect of environmental contaminants or toxins, potentially acting through hypothesized endocrine disruptor mechanisms, on the early origins of obesity. Our team has also been working on topics such as suicide in adolescents, anticipating peaks in respiratory illnesses based on environmental factors, and vaccine effectiveness for a better understanding of the impact of interventions in real-world settings, thereby translating evidence to policymakers. Thus, a differentiating factor of CHiESS from other initiatives is the expectation that it can contribute locally to epidemiological surveillance and hotspot identification for policymaker targeting, and as a scalable technological platform for other countries.

The novel CHiESS technological platform involves developing automated and semi-automated systems for data integration, advanced analytics, visualization, and monitoring, which is similar to other platforms (32, 33). This is intended to facilitate meaningful exposome research and empower researchers and stakeholders to derive significant conclusions from integrated data and advanced analytics. We will use Python and R Studio for data analysis and modeling. For the architecture, we will use EC2 to deploy applications on WildFly, MySQL RDS for data, and another development layer on Amazon. Nevertheless, it is essential to carry out a proof-of-concept study to validate the feasibility and effectiveness of data integration and harmonization before it can be fully developed.

In conclusion, the CHiESS is a novel project that will provide an informatics and intelligent platform that will integrate administrative and original target omics data for exposome studies in Latin America, since most Latin America and developing countries are facing the double burden of diseases, contributing to the exposome research network worldwide. Thus, the CHiESS conceptual model is based on the one-health ecosystem approach. It includes environmental stressors and other species’ patterns that contribute to the transmission of infectious diseases and the etiology of NCD. CHiESS will serve as a basis for targeted, evidence-based interventions, efficient and tailored population-based cohort recruitment, and omics measurements.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

PM: Writing – review & editing, Methodology, Conceptualization. AS-P: Writing – review & editing, Software, Methodology, Data curation, Conceptualization. KP: Writing – review & editing, Software, Methodology, Data curation, Conceptualization. CR: Writing – review & editing, Investigation. MC: Writing – review & editing. FB: Writing – review & editing, Supervision. MG: Writing – review & editing, Supervision, Resources. CU: Writing – review & editing, Writing – original draft, Validation, Supervision, Methodology, Funding acquisition, Conceptualization.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article. The authors declare that they received funding from the Fondo de Apoyo a la Innovación (FAIN) 2023 associative modality competition, an initiative promoted by the Innovation Directorate of the Universidad de los Andes.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Institute for Health Metrics and Evaluation (IHME) . GBD compare data visualization. Seattle, WA: IHME, University of Washington (2020).

Google Scholar

2. Ong, KL, Stafford, LK, McLaughlin, SA, Boyko, EJ, Vollset, SE, Smith, AE, et al. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the global burden of disease study 2021. Lancet. (2023) 402:203–34. doi: 10.1016/S0140-6736(23)01301-6

PubMed Abstract | Crossref Full Text | Google Scholar

3. Rutter, H, Savona, N, Glonti, K, Bibby, J, Cummins, S, Finegood, DT, et al. The need for a complex systems model of evidence for public health. Lancet. (2017) 390:2602–4. doi: 10.1016/S0140-6736(17)31267-9

Crossref Full Text | Google Scholar

4. Rod, NH, Broadbent, A, Rod, MH, Russo, F, Arah, OA, and Stronks, K. Complexity in epidemiology and public health. Addressing complex health problems through a mix of epidemiologic methods and data. Epidemiology. (2023) 34:505–14. doi: 10.1097/EDE.0000000000001612

PubMed Abstract | Crossref Full Text | Google Scholar

5. Collins, FS, and McKusick, VA. Implications of the human genome project for medical science. JAMA. (2001) 285:540–4. doi: 10.1001/jama.285.5.540

Crossref Full Text | Google Scholar

6. Mokdad, AH, Marks, JS, Stroup, DF, and Gerberding, JL. Actual causes of death in the United States, 2000. JAMA. (2004) 291:1238–45. doi: 10.1001/jama.291.10.1238

Crossref Full Text | Google Scholar

7. Rappaport, SM . Genetic factors are not the major causes of chronic diseases. PLoS One. (2016) 11:e0154387. doi: 10.1371/journal.pone.0154387

PubMed Abstract | Crossref Full Text | Google Scholar

8. Ashley, EA . Towards precision medicine. Nat Rev Genet. (2016) 17:507–22. doi: 10.1038/nrg.2016.86

Crossref Full Text | Google Scholar

9. Sandhu, C, Qureshi, A, and Emili, A. Panomics for precision medicine. Trends Mol Med. (2018) 24:85–101. doi: 10.1016/j.molmed.2017.11.001

PubMed Abstract | Crossref Full Text | Google Scholar

10. Wild, CP . Complementing the genome with an “Exposome”: the outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiol Biomarkers Prev. (2005) 14:1847–50. doi: 10.1158/1055-9965.EPI-05-0456

PubMed Abstract | Crossref Full Text | Google Scholar

11. Wild, CP . The exposome: from concept to utility. Int J Epidemiol. (2012) 41:24–32. doi: 10.1093/ije/dyr236

Crossref Full Text | Google Scholar

12. Lerner, H, and Berg, C. A comparison of three holistic approaches to health: one health, EcoHealth, and planetary health. Frontiers in veterinary. Science. (2017) 4:4. doi: 10.3389/fvets.2017.00163

Crossref Full Text | Google Scholar

13. Weatherly, C, Carag, J, Zohdy, S, and Morrison, M. The mental health impacts of human-ecosystem-animal relationships: a systematic scoping review of eco-, planetary, and one health approaches. One Health. (2023) 17:100621. doi: 10.1016/j.onehlt.2023.100621

PubMed Abstract | Crossref Full Text | Google Scholar

14. Vrijheid, M, Slama, R, Robinson, O, Chatzi, L, Coen, M, van den Hazel, P, et al. The human early-life exposome (HELIX): project rationale and design. Environ Health Perspect. (2014) 122:535–44. doi: 10.1289/ehp.1307204

PubMed Abstract | Crossref Full Text | Google Scholar

15. Niedzwiecki, MM, and Miller, GW. HERCULES: an academic center to support Exposome research In: S Dagnino and A Macherone, editors. Unraveling the exposome: a practical view. Cham: Springer International Publishing (2019). 339–48.

Google Scholar

16. Pronk, A, Loh, M, Kuijpers, E, Albin, M, Selander, J, Godderis, L, et al. Applying the exposome concept to working life health: the EU EPHOR project. Environ. Epidemiol. (2022) 6:e185. doi: 10.1097/EE9.0000000000000185

PubMed Abstract | Crossref Full Text | Google Scholar

17. Vlaanderen, J, de Hoogh, K, Hoek, G, Peters, A, Probst-Hensch, N, Scalbert, A, et al. Developing the building blocks to elucidate the impact of the urban exposome on cardiometabolic-pulmonary disease: the EU EXPANSE project. Environ. Epidemiol. (2021) 5:e162. doi: 10.1097/EE9.0000000000000162

PubMed Abstract | Crossref Full Text | Google Scholar

18. Vrijheid, M, Basagaña, X, Gonzalez, JR, Jaddoe, VWV, Jensen, G, Keun, HC, et al. Advancing tools for human early lifecourse exposome research and translation (ATHLETE): project overview. Environ Epidemiol. (2021) 5:e166. doi: 10.1097/EE9.0000000000000166

PubMed Abstract | Crossref Full Text | Google Scholar

19. Vineis, P, Chadeau-Hyam, M, Gmuender, H, Gulliver, J, Herceg, Z, Kleinjans, J, et al. The exposome in practice: design of the EXPOsOMICS project. Int J Hyg Environ Health. (2017) 220:142–51. doi: 10.1016/j.ijheh.2016.08.001

Crossref Full Text | Google Scholar

20. Bygbjerg, IC . Double burden of non-communicable and infectious diseases in developing countries. Science. (2012) 337:1499–501. doi: 10.1126/science.1223466

Crossref Full Text | Google Scholar

21. Albala, C, and Vio, F. Epidemiological transition in Latin America: the case of Chile. Public Health. (1995) 109:431–42. doi: 10.1016/S0033-3506(95)80048-4

PubMed Abstract | Crossref Full Text | Google Scholar

22. Chung, MK, House, JS, Akhtari, FS, Makris, KC, Langston, MA, Islam, KT, et al. Decoding the exposome: data science methodologies and implications in exposome-wide association studies (ExWASs). Exp Dermatol. (2024) 4:osae001. doi: 10.1093/exposome/osae001

Crossref Full Text | Google Scholar

23. McGee, G, Wilson, A, Webster, TF, and Coull, BA. Bayesian multiple index models for environmental mixtures. Biometrics. (2023) 79:462–74. doi: 10.1111/biom.13569

PubMed Abstract | Crossref Full Text | Google Scholar

24. Feuerriegel, S, Frauen, D, Melnychuk, V, Schweisthal, J, Hess, K, Curth, A, et al. Causal machine learning for predicting treatment outcomes. Nat Med. (2024) 30:958–68. doi: 10.1038/s41591-024-02902-1

Crossref Full Text | Google Scholar

25. Sepúlveda-Peñaloza, A, Cumsille, F, Garrido, M, Matus, P, Vera-Concha, G, and Urquidi, C. Geographical disparities in obesity prevalence: small-area analysis of the Chilean National Health Surveys. BMC Public Health. (2022) 22:1443. doi: 10.1186/s12889-022-13841-2

PubMed Abstract | Crossref Full Text | Google Scholar

26. Turner, MC, Vineis, P, Seleiro, E, Dijmarescu, M, Balshaw, D, Bertollini, R, et al. EXPOsOMICS: final policy workshop and stakeholder consultation. BMC Public Health. (2018) 18:260. doi: 10.1186/s12889-018-5160-z

PubMed Abstract | Crossref Full Text | Google Scholar

27. Maitre, L, Guimbaud, JB, Warembourg, C, Güil-Oumrait, N, Petrone, PM, Chadeau-Hyam, M, et al. State-of-the-art methods for exposure-health studies: results from the exposome data challenge event. Environ Int. (2022) 168:107422. doi: 10.1016/j.envint.2022.107422

PubMed Abstract | Crossref Full Text | Google Scholar

28. Hu, H, Liu, X, Zheng, Y, He, X, Hart, J, James, P, et al. Methodological challenges in spatial and contextual Exposome-health studies. Crit Rev Environ Sci Technol. (2023) 53:827–46. doi: 10.1080/10643389.2022.2093595

PubMed Abstract | Crossref Full Text | Google Scholar

29. Guloksuz, S, van Os, J, and Rutten, BPF. The Exposome paradigm and the complexities of environmental research in psychiatry. JAMA Psychiatry. (2018) 75:985–6. doi: 10.1001/jamapsychiatry.2018.1211

PubMed Abstract | Crossref Full Text | Google Scholar

30. Manrai, AK, Cui, Y, Bushel, PR, Hall, M, Karakitsios, S, Mattingly, CJ, et al. Informatics and data analytics to support Exposome-based discovery for public health. Annu Rev Public Health. (2017) 38:279–94. doi: 10.1146/annurev-publhealth-082516-012737

PubMed Abstract | Crossref Full Text | Google Scholar

31. Barcellos, C, Buzai, GD, and Santana, P. Geografía de la salud: bases y actualidad. Salud Colectiva. (2018) 14:1–4. doi: 10.18294/sc.2018.1763

PubMed Abstract | Crossref Full Text | Google Scholar

32. Brook, JR, Setton, EM, Seed, E, Shooshtari, M, Doiron, D, Awadalla, P, et al. The Canadian urban environmental Health Research consortium – a protocol for building a national environmental exposure data platform for integrated analyses of urban form and health. BMC Public Health. (2018) 18:114. doi: 10.1186/s12889-017-5001-5

PubMed Abstract | Crossref Full Text | Google Scholar

33. Merino Martinez, R, Müller, H, Negru, S, Ormenisan, A, Arroyo Mühr, LS, Zhang, X, et al. Human exposome assessment platform. Environ Epidemiol. (2021) 5:e182. doi: 10.1097/EE9.0000000000000182

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: exposome, data integration, ecosystems, systems of system, public health

Citation: Matus P, Sepúlveda-Peñaloza A, Page K, Rodríguez C, Cárcamo M, Bustamante F, Garrido M and Urquidi C (2024) The Chilean exposome-based system for ecosystems (CHiESS): a framework for national data integration and analytics platform. Front. Public Health. 12:1407514. doi: 10.3389/fpubh.2024.1407514

Received: 09 April 2024; Accepted: 10 July 2024;
Published: 24 July 2024.

Edited by:

Gilles Guillot, CSL Behring AG, Switzerland

Reviewed by:

Lauren A. Eaves, University of North Carolina at Chapel Hill, United States
Ilse Bente Ottenbros, Netherlands Organisation for Applied Scientific Research (TNO), Netherlands

Copyright © 2024 Matus, Sepúlveda-Peñaloza, Page, Rodríguez, Cárcamo, Bustamante, Garrido and Urquidi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Cinthya Urquidi, curquidi@uandes.cl

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.