Building a Healthcare Data Warehouse: Considerations, Opportunities, and Challenges

Knezevic Ivanovski, Tamara; Honap, Sailish; Choukour, Myriam; Matic, Rade; Peyrin-Biroulet, Laurent

doi:10.3389/fdgth.2025.1691142

REVIEW article

Front. Digit. Health

Sec. Health Technology Implementation

This article is part of the Research TopicDigital Health Past, Present, and FutureView all 39 articles

Building a Healthcare Data Warehouse: Considerations, Opportunities, and Challenges

Provisionally accepted

Tamara Knezevic Ivanovski^1*

Sailish Honap²

Myriam Choukour³

Rade Matic⁴

Laurent Peyrin-Biroulet^3*

¹Klinicko-bolnicki centar Zvezdara, Belgrade, Serbia
²St George's University Hospitals NHS Foundation Trust, London, United Kingdom
³Centre Hospitalier Regional Universitaire de Nancy, Nancy, France
⁴Belgrade Academy for Business and Arts Applied Studies,, Belgrade, Serbia

The final, formatted version of the article will be published soon.

The increasing digitalization of health systems is accelerating the transition towards a new era of data-driven, evidence-based care. This has profound implications for clinical practice, performance evaluation, policy making and biomedical research. At the heart of this transformation lies a healthcare data warehouse (DW), which functions as a critical infrastructure for aggregating, standardizing, and analyzing diverse clinical and administrative data. When well-designed and implemented, DWs provide clinicians with timely access to comprehensive, longitudinal patient data, enabling more informed decision-making, enhancing care quality, and improving outcomes. For researchers, these repositories offer opportunities for population-level analytics, predictive modeling, and large-scale health service research, enabling insights into disease patterns, healthcare utilization, and system inefficiencies. Centralizing clinical and administrative data in a DW allows for more frequent, nuanced analyses, increasing the precision and responsiveness of care. However, developing an effective DW requires careful consideration of system architecture, data governance, and interoperability. These foundational components support the robust ETL/ELT frameworks that ensure data quality, consistency, and readiness for analysis across diverse and evolving data streams. Beyond supporting individual patient care, DWs act as essential drivers of scalable research, operational efficiency, and evidence-based health policy. Their successful implementation marks a pivotal step toward achieving personalized, high-quality, and cost-effective healthcare in the digital transformation age. This paper reviews the existing literature to outline the process of building and implementing a data warehouse, introducing real-world disease-specific applications. BiotherDW connects theoretical frameworks with practical healthcare applications by demonstrating how traditional data warehouse design can be adapted for national-scale digital health infrastructures.

Keywords: Data warehourse, Inflamatory bowel disease, ETL/ELT, Data integration and interoperability, Artificial intelligence in healthcare

Received: 22 Aug 2025; Accepted: 19 Nov 2025.

Copyright: © 2025 Knezevic Ivanovski, Honap, Choukour, Matic and Peyrin-Biroulet. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Tamara Knezevic Ivanovski, tamara6788@gmail.com
Laurent Peyrin-Biroulet, peyrinbiroulet@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.