Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health

Sec. Digital Public Health

Data Science Without Borders: Bridging the Divide in Data Science Capacity Across African Health Institutions

Provisionally accepted
Agnes  KiraggaAgnes Kiragga1*Samuel  IddiSamuel Iddi1Abel  Wilson WalekhwaAbel Wilson Walekhwa2Miranda  BarasaMiranda Barasa1Steve  CyguSteve Cygu1Rachel  OdhiamboRachel Odhiambo1Moctar  GningueMoctar Gningue3Aminata  MboupAminata Mboup3Anicet  OnanaAnicet Onana4Bethlehem  AdnewBethlehem Adnew5Akililu  AlemuAkililu Alemu5Simon  HodsonSimon Hodson6Jay  GreenfieldJay Greenfield7Jim  ToddJim Todd8Tathagata  BhattacharjeeTathagata Bhattacharjee8Malvika  SharanMalvika Sharan9Raphael  SonabeRaphael Sonabe9Damazo  KagengyeDamazo Kagengye1Bertrand  Hugo MbatchouBertrand Hugo Mbatchou10Alemseged  AbdissaAlemseged Abdissa5Moussa  SarrMoussa Sarr3Joyce  Nakatumba-NabendeJoyce Nakatumba-Nabende11Moses  BamuturaMoses Bamutura12Bekure  TamiratBekure Tamirat13Nebiyu  DerebeNebiyu Derebe13Elvis  TemfackElvis Temfack13
  • 1African Population and Health Research Center (APHRC), Nairobi, Kenya
  • 2University of Cambridge Department of Veterinary Medicine, Cambridge, United Kingdom
  • 3Institut de Recherche en Sante de Surveillance Epidemiologique et de Formation, Diamniadio, Senegal
  • 4Douala General Hospital, Cameroon, Doula, Cameroon
  • 5Armauer Hansen Research Institute, Addis Ababa, Ethiopia
  • 6Committee on Data (CODATA), France, France
  • 7Committee on Data, France, France
  • 8London School of Hygiene and Tropical Medicine, London, United Kingdom
  • 9OSPO NOW, London, United Kingdom
  • 10Universite de Douala, Douala, Cameroon
  • 11Makerere University, Department of Computer Science, Center for Artificial Intelligence Lab, Kampala, Uganda, Kampala, Uganda
  • 12Africa centers for Disease Prevention and Control, Addis Ababa, Ethiopia
  • 13Africa Centres for Disease Control and Prevention, Addis Ababa, Ethiopia

The final, formatted version of the article will be published soon.

Background: Effective public health data science in Africa requires a clear understanding of institutional capacities across multiple dimensions. This study conducted a multidimensional baseline assessment of three African health institutions to evaluate health data availability, training needs, data governance, and infrastructure, with the goal of informing data science capacity-building for health challenges. Methods: This assessment forms part of the Data Science Without Borders Project—a three-year, multi-country initiative implemented at the Institute for Health Research, Epidemiological Surveillance and Training (IRESSEF, Senegal), the Armauer Hansen Research Institute (AHRI, Ethiopia), and Douala General Hospital (DGH, Cameroon). A structured needs assessment survey was designed to evaluate: (1) health data availability across 16 dataset categories; (2) training needs across seven domains; (3) data governance considerations; and (4) infrastructure capabilities, including computing resources and connectivity. An integrated analysis identified patterns, gaps, and opportunities across institutions. Results: Institutions exhibited distinct yet complementary strengths. IRESSEF possessed rich datasets—especially in genomics, maternal health, and geographic health disparities—moderate infrastructure (8 GB RAM, 67% service capacity), and high training needs in data analytics (4.7/5.0) and data governance (4.0/5.0). AHRI showed strong computational capacity (512 GB RAM, 64 CPU cores), specialized surveillance data (9.9%), and moderate training needs (3.0/5.0). DGH demonstrated expertise in infectious disease research (3.3%), moderate infrastructure (32 GB RAM), and substantial potential to expand research using electronic health records. Shared priorities included strengthening analytical skills (average 4.3/5.0), advancing artificial intelligence and machine learning use (IRESSEF: 5.0, AHRI: 4.0, DGH: 5.0), and developing data governance frameworks to enhance cross-institutional data sharing. Conclusions: The findings highlight the need for tailored, phased capacity building that leverages institutional strengths and complementarities rather than a one-size-fits-all approach. We recommend: (1) establishing robust data governance frameworks; (2) implementing customized training aligned with institutional priorities; and (3) addressing infrastructure gaps to enable sustainable data-driven research. This model offers a scalable foundation for advancing African-led health data science, AI adoption, and continental collaboration.

Keywords: Data governance, Federated Infrastructure, Capacity Building, African Health Informatics, Cross-institutional collaboration

Received: 30 Aug 2025; Accepted: 07 Nov 2025.

Copyright: © 2025 Kiragga, Iddi, Wilson Walekhwa, Barasa, Cygu, Odhiambo, Gningue, Mboup, Onana, Adnew, Alemu, Hodson, Greenfield, Todd, Bhattacharjee, Sharan, Sonabe, Kagengye, Mbatchou, Abdissa, Sarr, Nakatumba-Nabende, Bamutura, Tamirat, Derebe and Temfack. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Agnes Kiragga, akiragga@aphrc.org

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.