Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Machine Learning and Artificial Intelligence

Volume 8 - 2025 | doi: 10.3389/frai.2025.1605539

Graph neural networks with configuration cross-attention for tensor compilers

Provisionally accepted
Dmitrii  KhizbullinDmitrii Khizbullin1*Eduardo  Rocha De AndradeEduardo Rocha De Andrade2Thanh  Hau NguyenThanh Hau Nguyen2Matheus  Pedroza FerreiraMatheus Pedroza Ferreira2David  Robert PughDavid Robert Pugh1
  • 1King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
  • 2Sprout AI, London, United Kingdom

The final, formatted version of the article will be published soon.

With the recent popularity of neural networks comes the need for efficient serving of inference workloads. A neural network inference workload can be represented as a computational graph with nodes as operators transforming multidimensional tensors. The tensors can be transposed and/or tiled in a combinatorially large number of ways, some configurations leading to accelerated inference. We propose TGraph, a neural graph architecture that allows screening for fast configurations of the target computational graph, thus representing an artificial intelligence (AI) tensor compiler in contrast to traditional heuristic-based compilers. The proposed solution improves mean Kendall's τ across layout collections of TpuGraphs from 29.8% of the reliable baseline to 67.4% of TGraph. We estimate the potential CO 2 emission reduction associated with our work to be equivalent to over 50% of the total household emissions in the areas hosting AI-oriented data centers.

Keywords: GNN, Graph neural network, tensor compilation, attention mechanism, Ranking loss function

Received: 03 Apr 2025; Accepted: 17 Jul 2025.

Copyright: © 2025 Khizbullin, De Andrade, Nguyen, Ferreira and Pugh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Dmitrii Khizbullin, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.