ORIGINAL RESEARCH article
Front. High Perform. Comput.
Sec. Architecture and Systems
Towards Energy-Efficiency: CNTD MERIC Approach for Energy-Aware MPI Applications
- KA
Kashaf Ad Dooja 1
- OY
Osman Yasal 2
- OV
Ondřej Vysocký 2
- RL
Riha Lubomir 2
- DC
Daniele Cesarini 3
- AB
Andrea Bartolini 1
1. University of Bologna, Bologna, Italy
2. Vysoka skola banska-Technicka univerzita Ostrava, Ostrava, Czechia
3. Cineca, Casalecchio di Reno, Italy
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
Energy efficiency is a major challenge in High-Performance Computing (HPC) systems, impairing their scale, performance, and sustainability. However, despite technological and research progress, there is still a lack of software methods to measure and assess the energy efficiency of computing codes at scale. This is also exacerbated by the emergence of newer ISAs in the HPC computing spectrum with non-unified interfaces for power and energy monitoring. In this work, we present CNTD MERIC, which integrates two state-of-the-art energy monitoring and optimization libraries for HPC systems, COUNTDOWN and MERIC. COUNTDOWN is an energy-aware runtime system for MPI applications. MERIC is a platform-agnostic runtime system and energy measurement library that optimizes energy efficiency by adjusting hardware configurations. CNTD MERIC combines the benefits of the two approaches with low overhead, resulting in a portable power management runtime system for MPI applications. We tested the newly developed integrated library in both ARM and x86 compute nodes in the production environment of the IT4Innovations supercomputing center (IT4I). Our results show that CNTD MERIC achieves similar performance to the original COUNTDOWN and MERIC in terms of energy optimization and power/energy measurement, with negligible overheads within -5% to +3% with respect to the original COUNTDOWN configurations. We also implemented CNTD MERIC for multi-architecture (x86 and ARM) comparison between Intel Sapphire Rapids and A64FX processors, and the results show that A64FX achieves significantly lower execution time, reduced energy-to-solution and lower average power consumption (110–132 W vs. 400–590 W), which confirms its efficiency for energy-efficient HPC systems.
Summary
Keywords
A64FX, energy/power saving, HPC, MPI, Power management, profiling
Received
12 July 2025
Accepted
02 March 2026
Copyright
© 2026 Ad Dooja, Yasal, Vysocký, Lubomir, Cesarini and Bartolini. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kashaf Ad Dooja
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.