- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, United States
Introduction: Single Flux Quantum (SFQ) superconducting technology offers major performance and energy advantages over CMOS as Dennard scaling wanes. Yet, SFQ CPUs face key challenges: the Josephson Junction (JJ) budget limits manufacturability, and the lack of dense on-chip memory restricts scalability. Control and memory structures dominate JJ usage, motivating new architectures that exploit SFQ strengths while mitigating its limitations.
Methods: We introduce Icy-Hot, a hybrid CPU architecture that splits computation across two cryogenic zones. The 4 K Icy Zone, implemented with SFQ logic, performs high-speed execution, while the 77 K Hot Zone, built with CMOS, handles fetch, decode, and control. Compiler-inserted metadata and compact SFQ memory structures—a Hot-Driven Register File and a shift-register-based Dependency Buffer—enable decoupled pipeline execution and reduced cross-zone communication.
Results: Cycle-accurate simulations and SFQ circuit synthesis demonstrate that Icy-Hot achieves a 38% total power improvement over a 77 K CMOS baseline while using only ≈220,000 JJs, an 8× reduction from naïve SFQ cores. Loop-intensive workloads with high operand reuse achieve up to 4.5× performance speedup due to efficient local execution in the Icy Zone.
Discussion: By co-optimizing cryogenic CMOS and SFQ circuits, Icy-Hot demonstrates a practical path toward scalable, energy-efficient superconducting processors. The design captures SFQ’s intrinsic efficiency while addressing memory and control bottlenecks, marking a foundational step toward general-purpose SFQ CPU architectures.
1 Introduction
The decline of Dennard scaling (Mudge, 2015) has led to increasingly unsustainable power consumption, motivating the exploration of post-CMOS technologies. Among emerging alternatives, Single Flux Quantum (SFQ) technology, based on Josephson Junctions (JJs), has garnered significant interest as a next-generation solution. SFQ circuits, operating at cryogenic temperatures, exhibit superconductivity and zero resistance. This results in ultra-fast computation speeds (tens of GHz) (Likharev and Semenov, 1991; Nagaoka et al., 2019) and extremely low switching energy (on the order of 10⁻¹⁹ J per switching event).
While JJ-level efficiency is promising, realizing these benefits at the architectural level requires careful CPU design exploration. Prior work has established several foundational building blocks for SFQ CPUs (Zha et al., 2022; Katam et al., 2017; Dorojevets and Chen, 2015; Fujiwara et al., 2004; Tang et al., 2015; Ando et al., 2016; Kundu et al., 2019; Kirichenko et al., 2019; Qu et al., 2020), including SFQ-specific ALUs (Dorojevets et al., 2012; Filippov et al., 2011; Kundu et al., 2019; Ando et al., 2016; Tang et al., 2015), register files, and branch predictors (Fujiwara et al., 2003; Zha et al., 2022; Zha et al., 2023). A bit-serial SFQ CPU prototype has also been demonstrated (Ando et al., 2016).
One of the greatest challenges in scaling SFQ CPUs is the limited number of JJs available. Currently, MIT Lincoln Lab can fabricate SFQ chips with around 20,000 JJs, with projections of up to 100,000 by 2026 on a single chip.
To assess the architectural impact of JJ constraints, we synthesized an in-order CPU (with no on-chip cache) using open-source SFQ cell libraries (Fourie et al., 2019; Schindler et al., 2021). As described in Section 4, our synthesis showed that the CPU consumed nearly two million JJs, with 34% dedicated to decoding and dependency logic and about 50% spent on path balancing. Control structures, such as scoreboards for dependency tracking, are particularly expensive, due to SFQ’s lack of dense memory structures. All storage must be constructed with flip-flop-like cells, incurring additional path balancing and consuming excessive JJs.
Consequently, designs that require large on-chip memory—such as caches, large register files, or complex branch predictors—are difficult to implement. While ongoing research aims to build high-capacity SFQ memory (Ryazanov et al., 2012; Vernik et al., 2013), practical solutions remain out of reach. This motivates a new design approach that minimizes reliance on SFQ-based memory and control.
We propose Icy-Hot, a hybrid CPU architecture that splits computation across two temperature zones. The 77K “Hot Zone” implements memory and control using CMOS, while the 4K “Icy Zone” houses JJ-efficient SFQ ALUs. By isolating only the execution stage in the Icy Zone, the design maximizes SFQ efficiency while reducing JJ use for memory and control.
To support this thermal split, we introduce lightweight SFQ memory structures in the Icy Zone: a small Hot-Driven Register File cache (HDRFC) for frequently accessed data, and a larger shift-register-based Dependency Buffer for sparsely used operands (Ando et al., 2016; Kundu et al., 2019; Kawaguchi and Takagi, 2022). Instructions are decoded in the Hot Zone and transmitted as coarse-grained blocks (e.g., loops) to the Icy Zone for high-speed execution.
Since the Icy Zone can run significantly faster than the Hot Zone, decoupling minimizes communication overhead. Decoded instructions and operands are stored in compact shift registers. Random-access memory is avoided due to its high JJ cost.
1.1 The context and limitations of SFQ CPUs
Skeptics may question the feasibility of CPUs without complex predictors and large memory—features common in modern out-of-order designs. Yet, CMOS CPUs also began with simpler in-order architectures. Likewise, SFQ designs must evolve through incremental progress. Ongoing work in SFQ library design (Fourie et al., 2019), place-and-route (Katam et al., 2017), and major government investments (NSF, 2019) suggest that SFQ CPUs are a practical research direction, particularly for scaling quantum systems.
Our proposed Icy-Hot CPU is built using these tools, allowing exploration of a JJ-limited architecture with modest speculation support. Though not yet competitive with out-of-order CMOS CPUs in raw performance, our design demonstrates the viability of a hybrid CMOS-SFQ approach. Prior work on SFQ accelerators (Ishida et al., 2020; Zokaee and Jiang, 2021) has not addressed JJ limitations. In contrast, our work explicitly targets a JJ-constrained, general-purpose CPU architecture.
This paper makes the following contributions:
• We present the Icy-Hot architecture, which decouples execution from control and memory logic to optimize JJ usage and overall energy efficiency.
• We address challenges in control overhead, memory density, and thermal zone communication by leveraging compiler hints and shift-register-based memory structures in the Icy Zone.
• Our evaluation shows that the Icy-Hot design achieves a 36.9% power improvement over conventional CPU designs.
2 Background
2.1 Hardware components
2.1.1 SFQ logic technology
Single-flux-quantum (SFQ) technology uses a magnetic pulse, called a single flux quantum or fluxon, to transfer signals between logic gates. The fluxon is stored in a superconducting loop that consists of Josephson junctions (JJs); the presence of a fluxon is interpreted as "1", while its absence represents "0" within the circuit. Most SFQ logic gates perform their logic function only when an external clock is applied, transferring the output pulse, if any, to the next gate.
Figure 1b shows an exemplary SFQ logic circuit, which uses gate-level clocking in contrast to the stage-level clocking of the traditional CMOS pipeline shown on the left. Each gate in the circuit receives an explicit clock, which activates the gate to perform its function. Notice the presence of path-balancing DFFs in the circuit. Without them, the inputs to the last XNOR gate may traverse different numbers of gates and hence arrive at different times, causing the SFQ gates to generate wrong outputs. The added D flip-flops (DFFs) within the SFQ circuit guarantee that all input pulses to a gate arrive in the same clock period.
Figure 1. Comparison between (a) CMOS logic design and (b) SFQ logic design. SFQ logic requires path-balancing DFFs to utilize gate-level clocking.
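To make the path-balancing overhead concrete, the sketch below (our own illustrative example; the netlist, gate names, and levels are assumptions, not a circuit from this work) counts how many DFFs a naive balancing pass must insert so that every input of a gate crosses the same number of clocked stages.

```python
# Illustrative sketch: count path-balancing DFFs for a small gate-level-
# pipelined netlist. Every gate is one clocked stage, so a fanin arriving
# from a shallower logic level needs one DFF per missing stage.

def path_balancing_dffs(netlist, primary_inputs):
    """netlist: {gate: [fanin names]}; returns (levels, total DFFs inserted)."""
    level = {p: 0 for p in primary_inputs}

    def depth(node):
        if node not in level:
            level[node] = 1 + max(depth(f) for f in netlist[node])
        return level[node]

    for gate in netlist:
        depth(gate)

    dffs = sum(level[g] - 1 - level[f]
               for g, fanins in netlist.items() for f in fanins)
    return level, dffs

# Hypothetical three-gate circuit in the spirit of Figure 1b: the shorter
# paths into "or1" and "xnor1" must be padded (3 DFFs in this example).
netlist = {"and1": ["a", "b"], "or1": ["and1", "c"], "xnor1": ["or1", "d"]}
levels, dffs = path_balancing_dffs(netlist, primary_inputs=["a", "b", "c", "d"])
print(levels, "path-balancing DFFs:", dffs)
```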
Figure 2 shows a cryogenic cooler used in SFQ technology (Razmkhah and Bozbey, 2019). The cooler contains a superconducting chip carrier operating at 4.2 K. Data is transferred to the 4.2 K chip carrier through superconducting cables; the physical properties of the cables, such as their length, determine the transmission latency. Bandwidth is also constrained, since only a limited number of cables can traverse the thermal zones. In our design, we assume 64 input and 32 output wires entering and exiting the Icy zone.
2.1.2 Shift register
Random-access SFQ memories have low memory density since memory cells are primarily implemented as flip-flops. Random access also requires peripheral logic, such as a decoder (to select the memory cell for a given address) and a merger (to output data). Splitter and merger cells consume JJs and increase access latency: reaching a 32-entry register file requires a five-level splitter tree, which adds five cycles of access latency. Prior work (Fujiwara et al., 2003) demonstrates decoder implementations using 39 JJs and 620 JJs for one-to-two and one-to-four decoders, respectively. Furthermore, prior work (Tanaka et al., 2016) achieves 25 JJs/bit for NDRO-based RAM, while more recent work (Zha et al., 2022) that uses the higher-density HC-DRO memory cell reaches 15 JJs/bit. The JJ count per bit drops significantly when shift registers are used as a standalone memory structure: (Yuh and Mukhanov, 1992) proposes a design with five JJs/bit, and the most efficient shift-register memory design (Mukhanov, 1993) realizes a one-bit memory cell with only four JJs.
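As a concrete comparison using the per-bit figures cited above, the back-of-envelope arithmetic below (our own calculation; peripheral decoder and merger JJs are deliberately ignored) contrasts the JJ cost of a 32-entry by 32-bit operand store built from the three cell types.

```python
# Back-of-envelope JJ cost for a 32-entry x 32-bit operand store using the
# per-bit figures cited above (peripheral decoder/merger JJs not included).
bits = 32 * 32
jj_per_bit = {
    "NDRO RAM (Tanaka et al., 2016)": 25,
    "HC-DRO RAM (Zha et al., 2022)": 15,
    "shift register (Mukhanov, 1993)": 4,
}
for name, jj in jj_per_bit.items():
    print(f"{name}: {bits * jj:,} JJs")
# Roughly 4,100 JJs for the shift register versus ~25,600 for NDRO RAM,
# which is why Icy-Hot favors shift registers for Icy-zone operand storage.
```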
To exploit the JJ efficiency of shift registers over RAM, Icy-Hot relies on shift registers as the primary operand storage structure in the Icy zone. Our proposed design uses a parallel shift register design that allows for higher accessing efficiency and bandwidth, as has been demonstrated before (Xu et al., 2021).
A shift register is built from a column of D flip-flops, each holding one bit of data. When given a clock pulse, the entire column transfers all of its data to the next logic gate, eliminating the shift latency of a conventional serial design. The internal feedback loop in the shift register is controlled by non-destructive readout (NDRO) cells available in the open-source SFQ cell library. The maximum operating frequency of this parallel register has been demonstrated to be 34 GHz (Xu et al., 2021).
A 32×32 parallel shift register has an estimated access time of 910 ps (including decode time, clock delay, and reading time).
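The behavioral sketch below (a simplified model of our own, not the circuit netlist) captures the operation described above: one clock pulse advances every word, and the NDRO-controlled feedback loop recirculates the word read from the front, making readout non-destructive.

```python
from collections import deque

class ParallelShiftRegister:
    """Behavioral sketch of a bit-parallel shift register: one clock pulse
    moves every word one position; the NDRO feedback loop re-inserts the
    word read from the front so readout is non-destructive."""

    def __init__(self, depth, width=32):
        self.width = width
        self.entries = deque(maxlen=depth)

    def push(self, word):
        self.entries.appendleft(word & ((1 << self.width) - 1))

    def rotate_read(self):
        """One clock: read the front word and recirculate it to the back."""
        word = self.entries.popleft()
        self.entries.append(word)
        return word

reg = ParallelShiftRegister(depth=4)
for v in (0x11, 0x22, 0x33, 0x44):
    reg.push(v)
print([hex(reg.rotate_read()) for _ in range(4)])  # full non-destructive rotation
```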
2.1.3 Register file design
SFQ memory design conventionally uses a destructive readout (DRO) cell or a non-destructive readout (NDRO) cell to store a single pulse. Recently, a high-density High-Capacity Destructive Readout (HC-DRO) cell (Katam et al., 2020) has been proposed, along with a corresponding register file design, HiPerRF (Zha et al., 2022). Compared to the single bit stored in conventional cells, an HC-DRO cell can store up to three pulses, representing 2 bits per cell. HiPerRF adds NDRO-like capability, in which the cell retains its bits after readout through a custom-designed output port. Even with these capabilities, the achievable register file size is limiting when building a CPU with a large physical register file. Hence, our design may use a HiPerRF-like design to build a tiny register file cache within the Icy zone.
2.1.4 α-cell multiplexer
Since multiplexers are used extensively in a CPU, for example in the data-forwarding logic of the execution units, the Icy-Hot design uses a novel α-cell-based multiplexer (Karamuftuoglu and Pedram, 2023) to reduce the JJ count of these structures.
3 Related works
3.1 RSFQ microprocessors
Several previous works have aimed to design rapid single-flux-quantum (RSFQ) microprocessors. (Dorojevets and Chen, 2015; Fujiwara et al., 2004; Zha et al., 2022) aim to build register files and storage within a microprocessor design. (Kirichenko et al., 2019; Qu et al., 2020) implement 8-bit bit-parallel arithmetic logic units (ALUs), and (Dorojevets et al., 2012; Filippov et al., 2011) propose 8-bit asynchronous ALU designs. (Tang et al., 2015) proposes a 4-bit bit-slice ALU targeting a 32-bit microprocessor operating at 50 GHz. However, these works focus on the execution units of a microprocessor with a limited ISA and data length. (Ando et al., 2016) demonstrates an 8-bit bit-serial microprocessor with a custom-designed register file and ALU; the authors fabricated and tested the microprocessor system with a 256-bit memory.
(Dorojevets et al., 2010) explored microarchitectures for 32-bit RSFQ CPUs. They showed a cell-level design and simulation results for a 32-bit datapath using asynchronous hybrid wave-pipelining techniques. However, this work focused only on the datapath design: their goal was a datapath that could keep up with the high frequency of SFQ technology, and they did not take the CPU control path into consideration. As for memory, they only considered the interface needed so that their memory structure could keep up with the datapath; the size limit of SFQ memory was outside their exploration scope.
In CryoCore, Byun et al. (2020) demonstrate a 77 K-optimized CMOS CPU design that reduces both the size and number of cooling-unfriendly microarchitecture units. CryoCore uses voltage scaling to achieve both power efficiency and high clock frequency. However, that work is fundamentally a CMOS-based design exploration and differs from SFQ technology. While the paper demonstrates a power benefit, achieving 5.5 W per core compared to an optimized high-performance core (i7-6700) running at 77 K, SFQ-based microarchitecture designs achieve orders of magnitude higher power efficiency than CMOS designs (Mukhanov, 2011; Scott Holmes, 2020). In addition, the different physical properties of SFQ and CMOS enable different microarchitectural design choices, such as gate-level pipelining, which make it difficult to compare CryoCore and our proposed design quantitatively.
3.2 Decoupled access/execute computer architectures
In Decoupled Access/Execute Computer Architectures (Smith, 1982), the author introduces the idea of separating memory access and execution into two independent instruction streams managed by distinct processors. This decoupling enables asynchronous communication through queues, allowing the architecture to reduce the effects of memory latency and improve performance without relying on complex out-of-order issue logic.
This foundational idea has since been extended to various CMOS-based architectures, including CGRAs and GPUs, and has also inspired compiler frameworks that target such decoupled execution architectures. For CGRAs, Mage demonstrates a tailored DAE architecture for static control programs by offloading address generation to a dedicated AGU, improving area efficiency and reducing PE complexity (Naclerio et al., 2025). Similarly, the Stream-Dataflow architecture extends DAE to a CGRA substrate using explicit stream commands and a dataflow execution model to enable high parallelism with efficient memory coordination (Nowatzki et al., 2017). In the GPU domain, decoupled access/execute has been applied by restructuring fragment processors to separate memory and compute paths, allowing memory access to proceed independently and reducing stalls from latency-sensitive operations. This approach helps increase throughput in mobile GPUs, where energy and bandwidth constraints are significant (Arnau et al., 2012).
On the compiler side, frameworks such as those discussed in the DAE compilers work develop static and profiling-based transformations to extract and schedule decoupled memory and compute phases, targeting energy and performance improvements even on conventional CMOS platforms. These efforts demonstrate how compiler analysis can automate DAE transformations across different hardware targets, making the abstraction more practical and portable (Szafarczyk et al., 2025).
Our design also builds upon the core principle of decoupling access and execution found in DAE architectures, where separate instruction streams improve performance by overlapping memory and computation. However, unlike the CMOS-based DAE model, our Icy-Hot architecture must operate across a cryogenic and room-temperature boundary, fundamentally altering how data is accessed, stored, and transmitted. These constraints necessitate a different implementation strategy, relying on compact shift-register-based operand storage and minimizing communication across thermal zones to manage latency and bandwidth limitations.
4 Motivation
SFQ technology is considered a viable post-Dennard/post-Moore solution. However, the JJ-count constraints of current fabrication technology mean that realizing an SFQ CPU requires significant architectural innovation. To quantify these JJ limitations, we synthesized a partial 32-bit in-order RISC-V core (Ultraembedded, 2013) using the open-source SFQ cell libraries and the qPalace synthesis tool (Fourie et al., 2019). To make synthesis feasible, the instruction and data caches, as well as the multiplier and divider units, were removed; all of these proved to be JJ-intensive. Table 1 quantifies the total JJ count of each unit, showing that even a simple in-order CPU exceeds the fabrication budget. The results also show that around 40%–50% of the JJs are used for path balancing. We surmise that synthesizing the full 32-bit core with caches would use around 10× more JJs than the 1.8M used in our synthesis.
This data motivates us to pursue the Icy-Hot design, where the compute engine is decoupled from the memory- and control-intensive structures and uses only a few hundred thousand JJs. We measured the contribution of execution units to overall power consumption using the default Xeon out-of-order configuration template in Gem5 (Binkert et al., 2011). We ran several Parsec benchmarks (Bienia et al., 2008) (Swaptions, Canneal, Fluidanimate, and Freqmine) and fed the resulting execution statistics to McPAT (Li et al., 2009). Our results showed that, on average, 48.5% of the total power is consumed by the execution units while running these workloads. Note that the execution units would account for an even larger fraction of power in in-order processors. Thus, an ideal Icy-Hot system can eliminate much of the execution-unit power consumption.
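For illustration, the fraction reported above is obtained as shown below once per-component power numbers are available; the component breakdown in this sketch is hypothetical and is not the actual McPAT output of our Gem5 runs.

```python
# Illustrative only: how the execution-unit fraction of total core power is
# derived from per-component power. The numbers below are placeholders,
# not the actual McPAT output for our Gem5 runs.
component_power_w = {  # hypothetical per-component runtime power (watts)
    "execution_units": 9.7,
    "fetch_decode": 3.1,
    "load_store": 4.2,
    "caches_and_other": 3.0,
}
total = sum(component_power_w.values())
frac = component_power_w["execution_units"] / total
print(f"execution units: {frac:.1%} of {total:.1f} W")
```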
5 Icy-Hot CPU design
Figure 4 presents an overview of the Icy-Hot design. The left part of the figure, labeled Hot zone (at the top left corner), is where the instruction fetch, decode, dispatch, and commit operations are performed. External memory hierarchy, including a large cache structure, DRAM, etc., is also placed in the Hot zone. The Hot zone may be built using cryogenic CMOS, as has been demonstrated by recent works (Byun et al., 2020). The right half of the figure, labeled Icy zone (at the top right corner), is where the SFQ execution engines and the supporting operand storage structures are placed. The Icy zone executes all instructions and stores any state within these storage structures.
While the design principles for Icy-Hot apply to any general-purpose applications, our current design is targeted toward supporting loop-intensive applications. Such applications allow the execution of a loop body in the Icy zone without needing to interface with the Hot zone frequently. Loop codes are also amenable to efficient data preloading and static data dependence analysis, which we exploit in our design, as detailed later.
The microarchitecture blocks that are newly designed for the Icy-Hot CPU are highlighted in the figure using numbered boxes. We will describe the functions of these new blocks as we go through the design details in this section. We first describe the general operation of the two zones and then describe how memory and branch instructions are handled in more depth at the end of this section.
5.1 Hot zone
The fetch and decode stages of the core operate in the Hot zone. Every instruction, except for memory read instructions (such as load and mov), is fetched and decoded in the Hot zone and executed in the Icy zone. To efficiently transfer the decoded instruction, control bits, and instruction operands, the Hot zone uses the Transmission Buffer (labeled (1) in the figure).
5.1.1 Transmission Buffer
The Transmission Buffer is a map-like data structure used to ensure that dependencies between instructions are identified and the correct source value is obtained. The source operands of an instruction are available in one of three operand storage structures in the Icy zone: (1) the shift-register operand storage (labeled (2) in the figure), (2) the Hot-Driven Register File (labeled (3)), or (3) the Dependency Buffer (labeled (4)). The actual location of the source values is encoded into the instruction's opcode as metadata by the compiler. In our current implementation, we emulate this compiler functionality by marking the source operand location information through a post-binary analysis with the PIN tool (Luk et al., 2005). If the operand value is already available in the large register file in the Hot zone, that value is read, attached to the decoded instruction bits, and placed in the Transmission Buffer. If the operand value is produced by a prior parent instruction, then that operand will be available in the Hot-Driven Register File or the Dependency Buffer.
All decoded instructions enter the Transmission Buffer. The source operands are marked with 2-bit metadata indicating where the operands will be present in the Icy zone. If a source operand is already available in the Hot zone, its value is read and stored alongside the instruction opcode, and the operand's metadata is set to "00" to indicate that the data value is carried within the instruction packet. If the source operand is produced and available in the Hot-Driven Register File, the operand's metadata is set to "01". If the source operand is produced and available in the Dependency Buffer, the operand's metadata is set to "10". The destination register of each instruction is also tagged with metadata indicating where the data will eventually be placed within the Icy zone. Determining where an instruction deposits a register value that a dependent instruction will later use is done by a post-binary analysis process in our implementation, and it can also be easily implemented in a compiler during binary generation. The policies that govern operand placement are described later in this section.
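The sketch below illustrates the 2-bit source-operand metadata just described; the function and variable names are ours, and the placement sets stand in for the output of the post-binary analysis.

```python
# Sketch of the 2-bit source-operand metadata described above. Encoding:
#   "00" value shipped inside the instruction packet (read in the Hot zone)
#   "01" value will be in the Hot-Driven Register File
#   "10" value will be in the Dependency Buffer
IN_PACKET, IN_HDRF, IN_DEP_BUF = 0b00, 0b01, 0b10

def tag_sources(instr, producers_in_hdrf, producers_in_dep_buf, hot_regfile):
    """Return (metadata, inline value or None) for each source register.
    The placement sets come from the post-binary (compiler) analysis."""
    tagged = []
    for reg in instr["sources"]:
        if reg in producers_in_hdrf:
            tagged.append((IN_HDRF, None))
        elif reg in producers_in_dep_buf:
            tagged.append((IN_DEP_BUF, None))
        else:  # value already architecturally available in the Hot zone
            tagged.append((IN_PACKET, hot_regfile[reg]))
    return tagged

add = {"op": "add", "sources": ["x5", "x6"], "dest": "x7"}
print(tag_sources(add, producers_in_hdrf={"x5"},
                  producers_in_dep_buf=set(), hot_regfile={"x6": 42}))
```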
To reduce data movement between the Icy and Hot zones, the Transmission Buffer stores a block of instructions, such as a loop body (up to the Transmission Buffer capacity), and sends the entire block of decoded instructions to the Icy zone. The Icy zone then executes the loop code without needing to communicate with the Hot zone until the loop's end condition. This decoupling enables the Hot zone to keep pace with the Icy zone. For non-loop code, the decoupling may not be as effective; hence, as explained earlier, the Icy-Hot design benefits loop-oriented applications.
5.2 Icy zone
The execution units operate in the Icy zone. They read the decoded instructions and their operands from the shift-register buffers. Recall that these values are moved into these buffers from the Transmission Buffer in the Hot zone.
5.2.1 Operand and dependency buffers
Shift register operand storage (labeled (2) in the figure) is used to store the opcode, source, and destination values. The execution unit reads the instruction opcode and operand meta bits from the front of the opcode buffer. If the operand metadata indicates that the value is already available in the operand buffer, then the value is fed directly for execution. If the operand metadata indicates that the value is available in the Hot-Driven Register File, then the value is read from that register file.
If the operand data is available in the Dependency Buffer, then the process of reaching the data needed is more complex. The Dependency Buffer is a circular shift register design. Each shift register entry has a 5-bit register number and the corresponding register value that was produced by a prior parent instruction. Since the shift register can only be accessed from the front, the source operand is located by circulating through the entries until the source register number is matched. Hence, the reading of a source operand from the Dependency Buffer can take a variable number of cycles depending on where the source is present within the shift register queue.
While accessing data from a variable-delay shift register may seem inefficient, it is important to note that random-access register files are typically accessed through a tree of NDROC cells (Non-Destructive ReadOut with Complementary output). NDROC cells were proposed in prior work (Tanaka et al., 2016; Ando et al., 2016) and were later used to build an SFQ-specific register file (Zha et al., 2022). Each level in the NDROC tree requires a cycle to traverse, so each access to the Hot-Driven Register File also incurs multiple cycles. In fact, accessing a single register from the random-access register file may take longer than accessing the front few elements of a shift register. We performed a detailed simulation analysis to identify how far a shift register has to be rotated to reach the desired data: across all our benchmarks, on average, the data was available within the top three entries of the shift register. Note that a 32-entry register file would consume five cycles to access a register through the NDROC tree. These are unique constraints imposed by gate-level SFQ designs. Hence, even though the shift-register-based Dependency Buffer may appear to take multiple cycles to rotate through its values, its average access latency is similar to the multi-cycle register file access.
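The latency argument above can be summarized with a small model (our own illustrative sketch, not measured data): the register file cost grows with the depth of the NDROC splitter tree, roughly the base-2 logarithm of the entry count, while the Dependency Buffer cost is the number of rotations needed to reach the matching register tag.

```python
import math

def regfile_access_cycles(num_entries):
    """NDROC splitter-tree depth: one cycle per tree level."""
    return math.ceil(math.log2(num_entries))

def dep_buffer_access_cycles(buffer_regs, wanted_reg):
    """Cycles to rotate the circular Dependency Buffer until the tag matches."""
    return buffer_regs.index(wanted_reg) + 1

print(regfile_access_cycles(32))                                  # 5 cycles for a 32-entry RF
print(dep_buffer_access_cycles(["x7", "x3", "x9", "x1"], "x3"))   # 2 cycles
# When the operand usually sits in the top few entries (top-3 on average in
# our benchmarks), the rotation is no slower than the tree-based register file.
```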
5.2.2 Handling Suzuki stack delay with the Output Buffer
The Output Buffer is another parallel shift register that stores values that need to be sent to the Hot zone. The Output Buffer is necessary because of a frequency limitation on the Suzuki stack (Mustafa and Köse, 2022), which is used to transmit data out of the 4 K zone. In particular, transmitting data from the Icy zone to the Hot zone requires a Suzuki stack, which amplifies the pulse voltages to levels compatible with CMOS operation. This process is slow; hence, the Output Buffer supplies values to the Suzuki stack so that computation in the Icy zone does not need to stall. Only values to be transmitted to the Hot zone enter the Output Buffer. If a value is not going to be transmitted to the Hot zone, it is sent for immediate reuse by the ALU, the Dependency Buffer, or the Hot-Driven Register File. The destination of the output value is determined by metadata generated by the compiler based on the distance of the data dependency.
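A minimal sketch of this result-routing policy is shown below; the metadata labels, buffer names, and the Suzuki-stack drain interval are our own placeholders rather than values from the implemented design.

```python
from collections import deque

# Destination metadata (our labels): where the compiler says the result is
# next needed. Only TO_HOT values pay the Suzuki-stack transmission cost.
TO_FORWARD, TO_DEP_BUF, TO_HDRF, TO_HOT = range(4)

output_buffer = deque()          # results waiting for the Suzuki stack
SUZUKI_INTERVAL = 10             # assumed Icy cycles between transmissions

def route_result(dest_meta, value, dep_buffer, hdrf, dest_reg):
    if dest_meta == TO_HOT:
        output_buffer.append(value)          # drained every SUZUKI_INTERVAL cycles
    elif dest_meta == TO_DEP_BUF:
        dep_buffer.appendleft((dest_reg, value))
    elif dest_meta == TO_HDRF:
        hdrf[dest_reg] = value
    # TO_FORWARD: value is consumed immediately by the next ALU operation

dep_buf, hdrf = deque(), {}
route_result(TO_HDRF, 123, dep_buf, hdrf, "x7")
route_result(TO_HOT, 456, dep_buf, hdrf, "x8")
print(hdrf, list(output_buffer))
```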
5.3 Handling memory operations
In this design, we assume that the necessary data (that fits within our SFQ memory structures) has already been transferred onto the superconducting fabric prior to execution in the Icy zone. This assumption is grounded in the use of established data movement strategies, such as prefetching and preloading, which proactively bring data closer to the point of computation. Prefetching techniques, especially stride-based schemes, anticipate upcoming memory accesses by analyzing access patterns in loop-heavy workloads and load the corresponding data into the Hot zone memory hierarchy. Subsequently, the preloaded data is explicitly pushed into the Icy zone, ensuring its availability before execution begins. By relying on this upfront data movement, the Icy zone can operate with minimal stalls and reduced dependency on real-time memory access, which is critical given the latency and bandwidth limitations of the cryogenic interconnect between thermal zones. However, due to the limited storage capacity within the Icy zone, some data will need to be transmitted close to its execution, resulting in processor stalls that are accounted for in our simulation.
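As a simplified illustration of this preloading strategy (the stride detector and capacity check below are our own sketch, not the mechanism used in the evaluated system), a constant-stride load stream can be detected and the next iterations' data pushed into the Icy zone before the loop block is dispatched.

```python
# Simplified sketch: detect a constant stride in a loop's address stream and
# plan which addresses to preload into the Icy-zone buffers before the block
# is dispatched (capacity permitting).
def detect_stride(addresses):
    deltas = {b - a for a, b in zip(addresses, addresses[1:])}
    return deltas.pop() if len(deltas) == 1 else None

def preload_plan(last_addr, stride, iterations, icy_capacity_words):
    n = min(iterations, icy_capacity_words)
    return [last_addr + stride * (i + 1) for i in range(n)]

history = [0x1000, 0x1008, 0x1010, 0x1018]       # observed load addresses
stride = detect_stride(history)                  # 8 bytes
print(preload_plan(history[-1], stride, iterations=6, icy_capacity_words=4))
# Data beyond the Icy-zone capacity is shipped later and may stall execution,
# as accounted for in our simulation.
```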
6 Methodology
6.1 Icy-Hot simulator
We implemented all the Icy-Hot functionality in a cycle-accurate simulator based on the PIN binary instrumentation tool (Luk et al., 2005). The simulator is driven by instruction traces obtained from the instrumented binary. The front end of our simulator attaches the compiler-generated metadata to the source and destination registers. For instance, the simulator considers a loop block of instructions to determine which destination registers should be mapped to the Hot-Driven Register File and which should be mapped to the Dependency Buffer. In summary, our simulator's front end emulates the compiled-code behavior, including the metadata information required for Icy zone execution.
The code is then executed on the Icy-Hot design. The Execute stage of the Icy zone ALU has multiple gate-level pipeline stages, and the ALU pipeline depth is a simulation parameter. Since SFQ processors can run at a much higher frequency than CMOS (Scott Holmes, 2020), we assume the Icy zone runs at ten times the frequency of the Hot zone. Therefore, when simulating the Icy-Hot design, we execute the Fetch, Decode, Memory, and Writeback stages only every ten cycles, while the Execute stage runs every cycle.
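A minimal sketch of this 10:1 clocking assumption is shown below; the stage functions are placeholders for the simulator's actual logic.

```python
# Minimal sketch of the simulator's 10:1 clock-ratio assumption: the Icy-zone
# Execute stage ticks every cycle, the Hot-zone stages once every 10 Icy
# cycles. Stage functions are placeholders for the real simulator logic.
ICY_PER_HOT = 10

def run(n_icy_cycles, stages):
    for cycle in range(n_icy_cycles):
        stages["execute"](cycle)                 # Icy zone, every cycle
        if cycle % ICY_PER_HOT == 0:             # Hot zone, every 10th cycle
            for name in ("fetch", "decode", "memory", "writeback"):
                stages[name](cycle)

trace = []
stages = {s: (lambda c, s=s: trace.append((c, s)))
          for s in ("execute", "fetch", "decode", "memory", "writeback")}
run(21, stages)
print(sum(1 for _, s in trace if s == "execute"),   # 21 Execute ticks
      sum(1 for _, s in trace if s == "fetch"))      # 3 Hot-zone ticks
```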
The simulation parameters (summarized in Table 2), such as the ALU pipeline depth, are based on physics- and device-level characterization data for SFQ ALU designs from the GitHub repository (Schindler, 2013), which is validated as part of an IARPA superconducting design tools initiative. The library provides the usual CMOS-like characterization data, such as power consumption, the number of JJs used, and path-balancing costs in terms of JJ counts. This tool was also used to determine the simulated frequency of the Icy zone: our 32-bit integer adder achieved post-route timing closure at 8.5 GHz. Given that the tool's assumptions are intentionally conservative, we view this 8.5 GHz as a lower-bound estimate and, as such, extrapolate the Icy zone to be 10× faster than the Hot zone (assuming the Hot zone runs at 1 GHz). Similarly, the number of bits that can be transmitted between the two thermal zones is based on the capabilities of the Suzuki stack (Mustafa and Köse, 2022), which amplifies and transmits the SFQ pulses to CMOS voltages for use in the Hot zone. Note that the clock cycles listed are measured in Icy zone cycles. Thus, our simulator accounts for the communication overhead between the two zones, the impact of the multi-stage gate-level pipeline used in the ALUs, and the cost of accessing data in the Icy zone memory structures.
The simulator marks dependent instructions within an instruction window and keeps track of which data structure their values are stored in, as described in our microarchitecture. The access latencies of the various structures, such as the Hot-Driven Register File access time in cycles, are provided as input parameters. The access latency for each Dependency Buffer entry and the cycle penalty of shifting through Dependency Buffer values to obtain the correct value are also computed in the simulator.
All the latencies of various microarchitecture blocks are derived from the synthesized blocks available in the qPalace tool (Fourie et al., 2019; Schindler et al., 2021). Thus, all the design blocks, including the ALUs, Suzuki stacks, and wiring, are accurately modeled and provided as simulation parameters to the simulator. The power consumption is also modeled based on the JJ counts, wiring costs, and other circuit-level hardware implementation details.
7 Evaluation
7.1 Power benefit
Table 3 demonstrates that the total static power used by the Icy-Hot design is 27,083
To put this power consumption into perspective, we modeled the power consumption of a Xeon processor using a combination of McPAT (Li et al., 2009) and Gem5 (Binkert et al., 2011). McPAT is an integrated power, area, and timing modeling framework with pre-existing microarchitectural configurations; it can be paired with performance simulators, such as Gem5, to obtain an accurate power model for a given application on a specified core. We used Gem5 to obtain performance statistics for an x86 in-order core similar to Tulsa (Gilbert et al., 2006), and these statistics were then used to generate the power consumption. The Icy-Hot design has an overall power improvement of about 36.9% compared to this in-order core. Since the Icy-Hot design still runs its non-execution pipeline stages at 77 K, the power improvement is bounded by what can be achieved by moving the execution units into SFQ while staying within the JJ limit. As more JJs become available, the fraction of CPU functionality that can be brought into the Icy zone will increase, leading to greater power efficiency over time.
7.2 Hardware evaluation on JJ usage
The JJ counts for the hardware evaluation are obtained using the RSFQ cell library (Schindler, 2013) and synthesis results from the qPalace tool (Schindler et al., 2021; Fourie et al., 2019). Chip area is not an appropriate metric because the JJ count is the limiting factor in SFQ designs. Also, since our paper aims at architectural innovation for future SFQ technology (Holmes et al., 2018; Scott Holmes, 2020), we prioritize JJ count over chip area in our hardware evaluation. Prior works have also presented JJ counts as hardware estimates (Zha et al., 2022; Zha et al., 2023).
Table 4 shows the JJ count of individual components of the proposed Icy-Hot design. Buffers include bit-parallel shift registers used for the Input Buffer (opcode, source register value, and destination register ID), the Output Buffer, and the Dependency buffer.
The total number of JJs used for the design is 221,790, which is well within projected future JJ fabrication limits (Scott Holmes, 2020). Compared to the naive in-order core synthesis shown in Table 1, the total number of JJs used has decreased by roughly 8× (from about 1.8M to about 220K).
7.3 Overall speedups
We compare the performance of Icy-Hot against a baseline where all stages reside in the Hot zone. Figure 5 shows the speedup over the baseline for each benchmark; on average, performance improves by nearly 30%. Blackscholes and Libquantum show limited CPI improvement. Blackscholes uses the Black-Scholes partial differential equation to calculate the prices of a portfolio of European options. The program iterates through all the derivatives for each option and computes the price. The runtime scales linearly with the input set size, as the options are stored in an initialization array that must be loaded before computation occurs. Therefore, this program's performance is limited by the amount of memory available in the Icy zone. Though this program is an ideal loop-oriented target application, its lack of speedup is due to limited SFQ memory.
Canneal, on the other hand, is a cache-aware simulated annealing algorithm that pseudorandomly picks pairs of elements and tries to swap them. The algorithm is optimized for data reuse and only discards one element during each iteration, which reduces cache misses. Since the need for new data is minimal and the new data point can be preloaded, Icy-Hot can perform several swaps at a time compared to the baseline. Similarly, Streamcluster finds a predetermined number of medians for a stream of input points such that each point is assigned to its nearest center. This algorithm uses static partitioning of data so that data accesses are predictable, enabling our design to preload data and minimize loads.
7.4 Design space exploration: Memory structures
The Hot-Driven Register File and the Dependency buffer are critical components in our design as they continually allow the computation in the Icy zone to execute with minimal delays. We evaluate the performance benefit obtained by changing the sizing of these data structures. Our goal in this evaluation is to determine which data structure has the most positive impact, the ideal sizing of each data structure, and which is critical during different applications.
7.4.1 Hot-Driven Register File vs. dependency buffer
Table 5 shows the speedup over the baseline for varying sizes of the Icy zone memory structures. We also evaluate a design with zero entries in the Dependency Buffer and zero entries in the Hot-Driven Register File. This is the worst-case scenario, in which instruction sources must be fetched from the Hot zone; the resulting large transmission delay motivates the need for some memory in the Icy zone, even if CMOS-sized caches cannot be replicated. We varied the sizes of both the Hot-Driven Register File and the Dependency Buffer to evaluate which structure leads to better performance. The Dependency Buffer provides the most benefit, even after accounting for shifting delays.
When the Dependency Buffer and Register File are of equal size (16 or 32 entries each), performance improves significantly compared to the design without any Icy zone memory structures. However, with 64 entries in each structure, the improvement is only marginally better than the 32-entry design: an increase of around 15,000 JJs for a speedup gain of only 0.06.
The limited additional speedup is attributed to the fact that, as more registers are used, the access latency to each register increases. Because of the splitter trees used to build a register file, the access latency is proportional to the logarithm of the number of entries.
7.5 Benchmark characteristics
The Icy-Hot design is beneficial for most programs and shows a significantly higher benefit when a program operates on available data within the Icy zone. Based on our results, we note that in such programs, data is repeatedly manipulated as part of the computation kernels. Such applications show a greater benefit as data can be preloaded into the Icy zone once, and the data gets reused–decreasing the communication overhead between the two zones.
7.5.1 Parsec benchmarks
For the smaller, loop-based Parsec benchmarks, the performance benefit did not vary significantly with changes to the sizes of the Hot-Driven Register File or Dependency Buffer. These applications do not spill and fill registers often, which avoids unneeded memory accesses. However, for benchmarks that require larger input data sets, such as Blackscholes, increasing the sizes of the Dependency Buffer and Register File has a positive impact up to a threshold. As we increased the sizes from (16, 16) to (32, 32), Blackscholes showed a speedup of 2.34x. Though this speedup is significant, a Dependency Buffer of 128 entries with a 32-entry Register File (128, 32) showed a speedup of only 0.99x over (32, 32), showing the impact of the longer access latency of deep shift registers and register files.
7.5.2 Other benchmarks
We have chosen Atax from Polybench (Pouchet and Grauer-Gray, 2012) and Libquantum from the SPEC (SPEC, 2000) benchmarks. Atax focuses on matrix computation, representing a simple linear algebra application, while Libquantum simulates Shor's factorization algorithm for quantum computers. These benchmarks show modest performance improvements of 1.17x and 1.3x for Atax and Libquantum, respectively, as depicted in Figure 5. The improvement is small compared to the Parsec benchmarks because data reuse is minimal. Instead of benefiting from the Icy zone memory structures, these applications suffer from transmission delay, resulting in minimal performance improvement even with 10x the frequency in the Icy-Hot execution stage.
7.6 Future work
This work exposes two critical components that, if improved by the broader community, would significantly increase the performance of our system.
7.6.1 Co-packaged CryoCMOS to Nb integration
Our simulations assume a 10-cycle transmission delay between the Hot and Icy zones. This delay dominates runtime for reuse-poor workloads. Recent demonstrations of superconducting–CMOS co-integration (Numata et al., 2024) suggest that the physical separation and interface penalty could be reduced substantially. A conservative reduction of the hop from 10 to 3 Icy cycles yields an additional 1.5 to 2x speedup in our models for communication-bound kernels.
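A back-of-envelope, Amdahl-style estimate (our own illustrative model, not simulator output) shows how such a hop reduction translates into speedup for kernels that spend much of their time on cross-zone traffic.

```python
# Back-of-envelope estimate (illustrative, not simulator output): if a
# communication-bound kernel spends fraction f of its Icy cycles waiting on
# cross-zone hops, shrinking the hop from 10 to 3 cycles bounds the speedup.
def hop_speedup(comm_fraction, old_hop=10, new_hop=3):
    compute = 1.0 - comm_fraction
    return 1.0 / (compute + comm_fraction * new_hop / old_hop)

for f in (0.4, 0.6, 0.7):
    print(f"comm fraction {f:.0%}: {hop_speedup(f):.2f}x")
# Kernels that spend roughly 60-70% of their time on cross-zone traffic land
# in the 1.5-2x range quoted above.
```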
7.6.2 Hybrid Icy-zone memory to eliminate most cross-domain traffic
An equally promising path is expanding "Icy-resident" storage so that active data rarely traverses the Hot/Icy boundary. Our current register-file and shift-register hierarchy already shows that benchmarks with high operand reuse benefit markedly when data remains in the Icy zone, whereas low-reuse codes are limited. Introducing a hybrid memory, such as multi-banked SFQ shift registers combined with compact NDRO or CryoCMOS SRAM banks (Damsteegt et al., 2024; Parihar et al., 2023), would allow a much larger working set to reside locally. Because access latency in the existing data buffer scales only modestly with size, these benefits would come at moderate JJ cost.
7.6.3 Broader architectural scaling
Both of these improvements also expand the fraction of the CPU that can operate efficiently at 4 K. As the hot control and fetch logic progressively migrates into the superconducting region, the aggregate power advantage will grow beyond the current 37%–39% once cooling overhead is normalized. With these advancements, more than just the Execution stage can operate within the Icy region, allowing for some of the cost of control transmission to be mitigated as well.
These directions correspond closely to ongoing experimental progress in CryoCMOS and hybrid superconducting memory technology, and thus represent realistic near-term paths to move the IcyHot architecture from a proof-of-concept toward a high-impact computing platform.
8 Conclusion
With the slowdown of Moore's Law and Dennard scaling, and the evident increase in research activity on single flux quantum (SFQ) devices and logic design tools, now is the time to explore SFQ-based CPU designs. However, building a full-fledged CPU is currently limited by the lack of large SFQ memory and by path-balancing flip-flops. One may ask why build SFQ CPUs at all without large memory and complex predictors. The answer is that SFQ device-level power efficiency is two to three orders of magnitude better than CMOS; hence, it is imperative to capture even a fraction of that efficiency while building new SFQ-centric CPUs. CMOS CPUs faced similar problems in their early incarnations, which led first to in-order CPUs with limited memory. In the same vein, we believe the Icy-Hot design is a stepping stone on the path to eventually building complex designs.
In this work, we propose Icy-Hot, an SFQ CPU design paradigm that decouples the compute engine from the rest of the pipeline, a stepping stone in overcoming the memory capacity limitation that challenges the current SFQ CPU design. By decoupling the Execution stage and running it in the Icy zone, we are able to capitalize on the processing speed and power advantages of superconducting logic while alleviating the burden of providing control logic and memory from the superconducting design. To evaluate the Icy-Hot design, a cycle-accurate simulator that implements the Icy zone and Hot zone functionality and transmission line functionality has been developed. We test the various components of our design and evaluate the overall benefits and individual benefits provided by each data structure.
We execute a range of benchmarks using this simulator to show that our design, even with the cost of data transmission, achieves about a 36.9% improvement in power efficiency (even when accounting for cooling costs) and provides nearly 30% performance improvement over baseline designs. This demonstrates that SFQ CPUs can be built with limited JJs by relying on a hybrid design, achieving a power improvement without sacrificing performance.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
TR: Writing – review and editing, Writing – original draft. JL: Writing – review and editing, Writing – original draft. HZ: Writing – review and editing. MQ: Data curation, Writing – original draft. MA: Writing – review and editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work has been funded by the National Science Foundation (NSF) under the Expedition: DISCoVER (Design and Integration of Superconducting Computation for Ventures beyond Exascale Realization) project grant number 2124453.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Ando, Y., Sato, R., Tanaka, M., Takagi, K., Takagi, N., and Fujimaki, A. (2016). Design and demonstration of an 8-bit bit-serial rsfq microprocessor: core e4. IEEE Trans. Appl. Supercond. 26 (5), 1–5. doi:10.1109/tasc.2016.2565609
Arnau, J.-M., Parcerisa, J.-M., and Xekalakis, P. (2012). Boosting Mobile gpu performance with a decoupled access/execute fragment processor. ACM SIGARCH Comput. Archit. News 40 (3), 84–93. doi:10.1109/isca.2012.6237008
Bienia, C., Kumar, S., Singh, J. P., and Li, K. (2008). “The parsec benchmark suite: characterization and architectural implications,” in Proceedings of the 17th international conference on Parallel architectures and compilation techniques, 72–81.
Binkert, N., Beckmann, B., Black, G., Reinhardt, S. K., Saidi, A., Basu, A., et al. (2011). The gem5 simulator. ACM SIGARCH Comput. Archit. news 39 (2), 1–7. doi:10.1145/2024716.2024718
Byun, I., Min, D., Lee, G.-h., Na, S., and Kim, J. (2020). “Cryocore: a fast and dense processor architecture for cryogenic computing,” in 2020 ACM/IEEE 47th annual international symposium on computer Architecture (ISCA) (IEEE), 335–348.
Damsteegt, R. A., Overwater, R. W., Babaie, M., and Sebastiano, F. (2024). A benchmark of cryo-cmos embedded sram/drams in 40-nm cmos. IEEE J. Solid-State Circuits 59 (7), 2042–2054. doi:10.1109/jssc.2024.3385696
Dorojevets, M., Ayala, C. L., Yoshikawa, N., and Fujimaki, A. (2012). 8-bit asynchronous sparse-tree superconductor rsfq arithmetic-logic unit with a rich set of operations. IEEE Trans. Appl. Supercond. 23 (1), 1700104. doi:10.1109/tasc.2012.2229334
Dorojevets, M., and Chen, Z. (2015). “Fast pipelined storage for high-performance energy-efficient computing with superconductor technology,” in 2015 12th international conference and expo on emerging technologies for a smarter world (CEWIT) (IEEE), 1–6.
Dorojevets, M., Ayala, C. L., and Kasperek, A. K. (2010). Data-flow microarchitecture for wide datapath rsfq processors: design study. IEEE Trans. Appl. Supercond. 21 (3), 787–791. doi:10.1109/tasc.2010.2087410
Filippov, T., Dorojevets, M., Sahu, A., Kirichenko, A., Ayala, C., and Mukhanov, O. (2011). 8-bit asynchronous wave-pipelined rsfq arithmetic-logic unit. IEEE Trans. Appl. Supercond. 21 (3), 847–851. doi:10.1109/tasc.2010.2103918
Fourie, C. J., Jackman, K., Botha, M. M., Razmkhah, S., Febvre, P., Ayala, C. L., et al. (2019). Coldflux superconducting eda and tcad tools project: overview and progress. IEEE Trans. Appl. Supercond. 29 (5), 1–7. doi:10.1109/tasc.2019.2892115
Fujiwara, K., Hoshina, H., Yamashiro, Y., and Yoshikawa, N. (2003). Design and component test of sfq shift register memories. IEEE Trans. Appl. Supercond. 13 (2), 555–558. doi:10.1109/tasc.2003.813945
Fujiwara, K., Yamashiro, Y., Yoshikawa, N., Hashimoto, Y., Yorozu, S., Terai, H., et al. (2004). High-speed test of sfq-shift register files using ptl wiring. Phys. C. Supercond. 412, 1586–1590. doi:10.1016/j.physc.2004.01.173
Gilbert, J. D., Hunt, S. H., Gunadi, D., and Srinivas, G. (2006). The tulsa processor: a dual core large shared-cache.
Holmes, D. S., Debenedictis, E., Fagaly, R., Febvre, P., Gupta, D., Herr, A., et al. (2018). “Superconductor electronics technology roadmap for irds 2018,” in Applied Superconductivity Conference (ASC 2018).
Ishida, K., Byun, I., Nagaoka, I., Fukumitsu, K., Tanaka, M., Kawakami, S., et al. (2020). “Supernpu: an extremely fast neural processing unit using superconducting logic devices,” in 2020 53rd annual IEEE/ACM international symposium on microarchitecture (MICRO) IEEE, 58–72.
Karamuftuoglu, M. A., and Pedram, M. (2023). α-soma: single flux quantum threshold cell for spiking neural network implementations. IEEE Trans. Appl. Supercond. doi:10.1109/TASC.2023.3264703
Katam, N., Shahsavani, S. N., Lin, T. R., Pasandi, G., Shafaei, A., and Pedram, M. (2017). Sport lab sfq logic circuit benchmark suite. Los Angeles, CA: University Southern California, Technical Report.
Katam, N. K., Zha, H., Pedram, M., and Annavaram, M. (2020). Multi fluxon storage and its implications for microprocessor design. J. Phys. Conf. Ser. 1559 (1), 012004. doi:10.1088/1742-6596/1559/1/012004
Kawaguchi, T., and Takagi, N. (2022). 32-bit alu with clockless gates for rsfq bit-parallel processor. IEICE Trans. Electron. 105 (6), 245–250. doi:10.1587/transele.2021sep0005
Kirichenko, A. F., Vernik, I. V., Kamkar, M. Y., Walter, J., Miller, M., Albu, L. R., et al. (2019). Ersfq 8-bit parallel arithmetic logic unit. IEEE Trans. Appl. Supercond. 29 (5), 1–7. doi:10.1109/tasc.2019.2904484
Kundu, S., Datta, G., Beerel, P. A., and Pedram, M. (2019). “Qbsa: logic design of a 32-bit block-skewed rsfq arithmetic logic unit,” in 2019 IEEE International Superconductive Electronics Conference (ISEC) (IEEE), 1–3.
Li, S., Ahn, J. H., Strong, R. D., Brockman, J. B., Tullsen, D. M., and Jouppi, N. P. (2009). “Mcpat: an integrated power, area, and timing modeling framework for multicore and manycore architectures,” in Proceedings of the 42nd annual ieee/acm international symposium on microarchitecture, 469–480.
Likharev, K. K., and Semenov, V. K. (1991). Rsfq logic/memory family: a new josephson-junction technology for sub-terahertz-clock-frequency digital systems. IEEE Trans. Appl. Supercond. 1 (1), 3–28. doi:10.1109/77.80745
Luk, C.-K., Cohn, R., Muth, R., Patil, H., Klauser, A., Lowney, G., et al. (2005). Pin: building customized program analysis tools with dynamic instrumentation. Acm sigplan Not. 40 (6), 190–200. doi:10.1145/1064978.1065034
Mudge, T. (2015). The specialization trend in computer hardware: technical perspective. Commun. ACM 58 (4), 84. doi:10.1145/2735839
Mukhanov, O. A. (1993). Rsfq 1024-bit shift register for acquisition memory. IEEE Trans. Appl. Supercond. 3 (4), 3102–3113. doi:10.1109/77.251810
Mukhanov, O. A. (2011). Energy-efficient single flux quantum technology. IEEE Trans. Appl. Supercond. 21 (3), 760–769. doi:10.1109/tasc.2010.2096792
Mustafa, Y., and Köse, S. (2022). Optimization of suzuki stack circuit to reduce power dissipation. IEEE Trans. Appl. Supercond. 32 (8), 1–7. doi:10.1109/tasc.2022.3192202
Naclerio, A., Riente, F., Turvani, G., Vacca, M., Zamboni, M., and Graziano, M. (2025). “Mage: a decoupled access-execute cgra tailored for static control applications,” in 2025 IEEE international symposium on circuits and systems (ISCAS) (IEEE), 1–5.
Nagaoka, I., Tanaka, M., Inoue, K., and Fujimaki, A. (2019). “29.3 a 48ghz 5.6 mw gate-level-pipelined multiplier using single-flux quantum logic,” in 2019 IEEE international solid-state circuits Conference-(ISSCC) (IEEE), 460–462.
Nowatzki, T., Gangadhar, V., Ardalani, N., and Sankaralingam, K. (2017). “Stream-dataflow acceleration,” in 2017 ACM/IEEE 44th annual international symposium on computer Architecture (SCA), 416–429.
NSF (2019). Discover expedition. Available online at: https://www.nsf.gov/news/special_reports/announcements/042222.jsp.
Numata, H., Iguchi, N., Tanaka, M., Okamoto, K., Miura, S., Uchida, K., et al. (2024). Superconducting nb interconnects for cryo-cmos and superconducting digital logic applications. Jpn. J. Appl. Phys. 63, 04SP73. doi:10.35848/1347-4065/ad37c1
Parihar, S. S., Thomann, S., Pahwa, G., Chauhan, Y. S., and Amrouch, H. (2023). Cryogenic in-memory computing for quantum processors using commercial 5-nm finfets. IEEE Open J. Circuits Syst. 4, 258–270. doi:10.1109/ojcas.2023.3309478
Qu, P.-Y., Tang, G.-M., Yang, J.-H., Ye, X.-C., Fan, D.-R., Zhang, Z.-M., et al. (2020). Design of an 8-bit bit-parallel rsfq microprocessor. IEEE Trans. Appl. Supercond. 30 (7), 1–6. doi:10.1109/tasc.2020.3017527
Razmkhah, S., and Bozbey, A. (2019). Heat flux capacity measurement and improvement for the test of superconducting logic circuits in closed-cycle cryostats. Turkish J. Electr. Eng. Comput. Sci. 27 (5), 3912–3922. doi:10.3906/elk-1903-164
Ryazanov, V. V., Bol’ginov, V. V., Sobanin, D. S., Vernik, I. V., Tolpygo, S. K., Kadin, A. M., et al. (2012). Magnetic josephson junction technology for digital and memory applications. Phys. Procedia 36, 35–41. doi:10.1016/j.phpro.2012.06.126
Schindler, L. (2013). Rsfq cell library. Available online at: https://github.com/sunmagnetics/RSFQlib.
Schindler, L., Delport, J. A., and Fourie, C. J. (2021). The coldflux rsfq cell library for mit-ll sfq5ee fabrication process. IEEE Trans. Appl. Supercond. 32 (2), 1–7. doi:10.1109/tasc.2021.3135905
Scott Holmes, I. C. o. S. (2020). “Superconductor electronics technology roadmap for irds,” in Applied superconductivity conference.
Smith, J. E. (1982). Decoupled access/execute computer architectures. ACM SIGARCH Comput. Archit. News 10 (3), 112–119. doi:10.1145/1067649.801719
SPEC (2000). Spec cpu 2006. Available online at: https://www.spec.org/cpu2006/.
Szafarczyk, R., Nabi, S. W., and Vanderbauwhede, W. (2025). “Compiler support for speculation in decoupled access/execute architectures,” in Proceedings of the 34th ACM SIGPLAN international conference on compiler construction, 192–204.
Tanaka, M., Sato, R., Hatanaka, Y., and Fujimaki, A. (2016). High-density shift-register-based rapid single-flux-quantum memory system for bit-serial microprocessors. IEEE Trans. Appl. Supercond. 26 (5), 1–5. doi:10.1109/tasc.2016.2555905
Tang, G.-M., Takata, K., Tanaka, M., Fujimaki, A., Takagi, K., and Takagi, N. (2015). 4-bit bit-slice arithmetic logic unit for 32-bit rsfq microprocessors. IEEE Trans. Appl. Supercond. 26 (1), 1–6.
Ultraembedded (2013). Risc-v core. Available online at: https://github.com/ultraembedded/riscvreadme.
Vernik, I. V., Bol’ginov, V. V., Bakurskiy, S. V., Golubov, A. A., Kupriyanov, M. Y., Ryazanov, V. V., et al. (2013). Magnetic josephson junctions with superconducting interlayer for cryogenic memory. IEEE Trans. Appl. Supercond. 23 (3), 1701208. doi:10.1109/tasc.2012.2233270
Xu, W., Ying, L., Lin, Q., Ren, J., and Wang, Z. (2021). Design and implementation of bit-parallel rsfq shift register memories. Supercond. Sci. Technol. 34 (8), 085002. doi:10.1088/1361-6668/ac086e
Yuh, P., and Mukhanov, O. A. (1992). Design and testing of rapid single flux quantum shift registers with magnetically coupled readout gates. IEEE Trans. Appl. Supercond. 2 (4), 214–221. doi:10.1109/77.182733
Zha, H., Katam, N. K., Pedram, M., and Annavaram, M. (2022). “Hiperrf: a dual-bit dense storage sfq register file,” in 2022 IEEE international symposium on high-performance computer architecture (HPCA) (IEEE), 415–428.
Zha, H., Tannu, S., and Annavaram, M. (2023). “Superbp: design space exploration of perceptron-based branch predictors for superconducting cpus,” in 2023 IEEE international symposium on microarchitecture (MICRO) (IEEE).
Keywords: superconducting, architecture, CPU (central processing unit), thermal zone model, design, optimization
Citation: Renduchintala T, Lee J, Zha H, Qi M and Annavaram M (2025) Icy-hot: decoupled compute paradigm towards a general-purpose superconducting CPU design. Front. Mater. 12:1618454. doi: 10.3389/fmats.2025.1618454
Received: 26 April 2025; Accepted: 20 October 2025;
Published: 20 November 2025.
Edited by:
Mark Law, University of Florida, United States
Reviewed by:
Timothy Sherwood, University of California, Santa Barbara, United States
John Shalf, E. O. Lawrence Berkeley National Laboratory Computing Sciences Research, United States
Copyright © 2025 Renduchintala, Lee, Zha, Qi and Annavaram. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tara Renduchintala, trenduch@usc.edu
Michael Qi