AUTHOR=Beyer Stefan , Tanjga Nikola , Kleinoscheg Gabriel , Hayn Dieter , Donsa Klaus , Kreiner Karl , Schreier Günter TITLE=Preparing for the European Health Data Space: an open-source compiler for fast, transparent, and portable health data transformations JOURNAL=Frontiers in Medicine VOLUME=Volume 12 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2025.1661091 DOI=10.3389/fmed.2025.1661091 ISSN=2296-858X ABSTRACT=IntroductionHealthcare systems generate vast amounts of data in diverse and often incompatible formats. Efficient conversion between these formats is essential to ensure interoperability and enable secondary data use, particularly in the context of the European Health Data Space (EHDS) and the proposed Austrian Health Data Donation Space (AHDDS). While standards such as HL7 FHIR aim to facilitate interoperability, inconsistencies in implementation persist. Electronic health record (EHR) providers, including Austria’s ELGA, continue to face challenges in this area. The FHIR mapping language (FML) offers a promising solution for format translation, but current tools for executing FML mappings are limited, especially in terms of processing speed. To address this gap, there is a pressing need for a compiler that translates FML mappings into efficient, executable code.Materials and methodsWe developed the Mapping Language Compiler for Health Data (MaLaC-HD), which compiles FML code into Python. To assess performance, we benchmarked the compiler using a large ELGA document on a typical end-user device, comparing execution speed with existing FML tools. Baseline overhead was measured using an empty mapping. Conformance was manually evaluated by comparing the output of a wide range of example mappings and input data against the Java reference implementation. Additionally, we analyzed the structure and correctness of the generated Python code to assess functional completeness.ResultsAfter adjusting for overhead, MaLaC-HD achieved execution speeds nearly 100 times faster than existing tools. The output closely matched that of the reference implementation, with only minor discrepancies. The generated Python code met all functional requirements and demonstrated the compiler’s ability to support complex transformations. MaLaC-HD is publicly available under the LGPL license.ConclusionMaLaC-HD can serve a wide array of use cases and has the potential to integrate with existing platforms for secondary data use to support large-scale health data research across Europe and beyond. MaLaC-HD could provide the EHR community with a powerful, efficient tool for accelerating data transformation, an essential capability for the success of the EHDS initiative.