<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1202409</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2023.1202409</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Deciphering tissue heterogeneity from spatially resolved transcriptomics by the autoencoder-assisted graph convolutional neural network</article-title>
<alt-title alt-title-type="left-running-head">Li et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2023.1202409">10.3389/fgene.2023.1202409</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Li</surname>
<given-names>Xinxing</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2315235/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Huang</surname>
<given-names>Wendong</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2315236/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Xu</surname>
<given-names>Xuan</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1880622/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Hong-Yu</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/853800/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Shi</surname>
<given-names>Qianqian</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/660872/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Hubei Key Laboratory of Agricultural Bioinformatics</institution>, <institution>College of Informatics</institution>, <institution>Huazhong Agricultural University</institution>, <addr-line>Wuhan</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/552766/overview">Tao Huang</ext-link>, Chinese Academy of Sciences (CAS), China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2283271/overview">Chengming Zhang</ext-link>, The University of Tokyo, Japan</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1168243/overview">Xiaoping Liu</ext-link>, University of Chinese Academy of Science, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Qianqian Shi, <email>qqshi@mail.hzau.edu.cn</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>05</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1202409</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>04</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>05</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Li, Huang, Xu, Zhang and Shi.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Li, Huang, Xu, Zhang and Shi</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Spatially resolved transcriptomics (SRT) provides an unprecedented opportunity to investigate the complex and heterogeneous tissue organization. However, it is challenging for a single model to learn an effective representation within and across spatial contexts. To solve the issue, we develop a novel ensemble model, AE-GCN (<bold>a</bold>uto<bold>e</bold>ncoder-assisted <bold>g</bold>raph <bold>c</bold>onvolutional neural <bold>n</bold>etwork), which combines the autoencoder (AE) and graph convolutional neural network (GCN), to identify accurate and fine-grained spatial domains. AE-GCN transfers the AE-specific representations to the corresponding GCN-specific layers and unifies these two types of deep neural networks for spatial clustering via the clustering-aware contrastive mechanism. In this way, AE-GCN accommodates the strengths of both AE and GCN for learning an effective representation. We validate the effectiveness of AE-GCN on spatial domain identification and data denoising using multiple SRT datasets generated from ST, 10x Visium, and Slide-seqV2 platforms. Particularly, in cancer datasets, AE-GCN identifies disease-related spatial domains, which reveal more heterogeneity than histological annotations, and facilitates the discovery of novel differentially expressed genes of high prognostic relevance. These results demonstrate the capacity of AE-GCN to unveil complex spatial patterns from SRT data.</p>
</abstract>
<kwd-group>
<kwd>spatially resolved transcriptomics</kwd>
<kwd>spatial domain identification</kwd>
<kwd>spatial information</kwd>
<kwd>graph convolutional neural network</kwd>
<kwd>autoencoder</kwd>
</kwd-group>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>RNA</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Spatially resolved transcriptomics (SRT) technologies, such as spatial transcriptomics (ST) (<xref ref-type="bibr" rid="B21">St&#xe5;hl et al., 2016</xref>), 10x Visium, and Slide-seqV2 (<xref ref-type="bibr" rid="B22">Stickels et al., 2021</xref>), can measure the transcript localization and abundance in the dissected tissue area, enabling novel insights into tissue development and tumor heterogeneity (<xref ref-type="bibr" rid="B1">Atta and Fan, 2021</xref>; <xref ref-type="bibr" rid="B16">Nasab et al., 2022</xref>). Their generated data (i.e., gene expression in tissue locations [spots] and spatial locational information) can be used to decipher the spatially functional regions and cellular architectures in tissues (<xref ref-type="bibr" rid="B14">Maniatis et al., 2021</xref>; <xref ref-type="bibr" rid="B15">Marx, 2021</xref>; <xref ref-type="bibr" rid="B28">Zeng et al., 2022</xref>). However, due to technical limitations (<xref ref-type="bibr" rid="B27">Xu et al., 2022</xref>), modeling and integrating the available SRT modalities for accurate spatial domain identification still remain challenging.</p>
<p>Currently, the spatial domain detection methods could be mainly divided into two categories: non-spatial and spatial clustering methods. Some non-spatial methods originally developed for single-cell RNA-sequencing (scRNA-seq) studies, e.g., Seurat (<xref ref-type="bibr" rid="B2">Butler et al., 2018</xref>) and Scanpy (<xref ref-type="bibr" rid="B26">Wolf et al., 2018</xref>), are also applied in SRT studies. They only utilize the expression profiles to cluster spots while often obtaining domains lacking in spatial continuity to some extent. To address such issues, spatial clustering approaches generally incorporate the additional spatial information into their models. For example, with the spatial prior, BayesSpace (<xref ref-type="bibr" rid="B29">Zhao et al., 2021</xref>) and HMRF (<xref ref-type="bibr" rid="B5">Dries et al., 2021</xref>) use the Markov random field model (or its variant) to encourage the spatially neighboring spots to belong to the same domain. SpaGCN (<xref ref-type="bibr" rid="B9">Hu et al., 2021</xref>) and SEDR (<xref ref-type="bibr" rid="B7">Fu et al., 2021</xref>) enable spatial clustering by learning the low-dimensional representation with graph constraints that represent the spatial dependency. STAGATE (<xref ref-type="bibr" rid="B4">Dong and Zhang, 2022</xref>) identifies spatial domains by adaptively learning the similarity of neighboring spots via attention mechanisms. Modeling the spatial dependency of gene expression fairly facilitates the discovery of spatial domains with spatial coherence.</p>
<p>Though these methods have provided useful information on the usage of expression profiles and spatial information, they usually depend on single models, which center on either expression data itself or spatially neighboring structure, thus probably resulting in the preferred usage of the focused data type. For example, the non-spatial clustering methods only models the gene expression itself, while the spatial clustering methods often take spatial neighbors prior as a hard constraint to ensure spatial clustering continuity, which may lead to over-smoothing of expression (<xref ref-type="bibr" rid="B10">Huang et al., 2018</xref>) and missing subtle spatial regions with a handful of spots. Thus, the rational combination of these different kinds of models can fairly generate more useful representations, enabling better spatial domain detection in SRT studies.</p>
<p>Here, we develop a novel combined model, AE-GCN (<bold>a</bold>uto<bold>e</bold>ncoder-assisted <bold>g</bold>raph <bold>c</bold>onvolutional neural <bold>n</bold>etwork), which combines the autoencoder (AE) and graph convolutional neural network (GCN), for accurate and fine-grained spatial domain identification. Specifically, AE-GCN relies on AE for learning expression data-based representations and GCN for spatial graph-constrained learning. AE-GCN orderly transfers the AE-specific representations to GCN-specific layers and unifies these two types of neural networks for spatial clustering via a clustering-aware contrastive mechanism. In this way, AE-GCN combines the advantages of the two models and takes full integration of the expression data and spatial information during the representation learning process.</p>
<p>We demonstrate the effectiveness of AE-GCN on spatial domain identification and data denoising using SRT datasets generated from ST, 10x Visium, and Slide-seqV2 platforms. In particular, it is validated in two cancer samples that AE-GCN can refine the spatial functional regions and discover novel cancer-associated genes. These results show that AE-GCN is capable of unveiling complex tissue architecture from SRT data.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Materials and methods</title>
<sec id="s2-1">
<title>Overview of AE-GCN</title>
<p>AE-GCN is an integrative scheme that incorporates the AE and GCN learning processes, enabling tasks of spatial domain detection and data denoising (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Schematic overview of AE-GCN and its potential applications. Given gene expression and spatial coordinates as input, AE-GCN first builds the spatially neighboring network <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and enhances expression <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. AE-GCN uses AE to learn representations from the enhanced expression and employs GCN to learn the representations of each layer from the spatially neighboring network <inline-formula id="inf3">
<mml:math id="m3">
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. Then, AE-GCN transfers the AE-specific representations from the encoder to the corresponding GCN-specific layer and learns the combined representation <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. To ensure effective training of the combined deep learning model for clustering, AE-GCN proposes a clustering-aware contrastive module based on the distribution of the representation <inline-formula id="inf5">
<mml:math id="m5">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. When AE-GCN reaches convergence, the latent combined representation <inline-formula id="inf6">
<mml:math id="m6">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> enables AE-GCN to identify spatial domains for different platforms, i.e., ST, 10x Visium, and Slide-seqV2. The reconstructed expression data <inline-formula id="inf7">
<mml:math id="m7">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> serves to denoise expression profiles.</p>
</caption>
<graphic xlink:href="fgene-14-1202409-g001.tif"/>
</fig>
<p>Given the original expression <inline-formula id="inf8">
<mml:math id="m8">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mn>0</mml:mn>
</mml:msup>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> (where <inline-formula id="inf9">
<mml:math id="m9">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf10">
<mml:math id="m10">
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, respectively, denote the number of genes and spots) and spatial coordinates, the spatially neighboring network <inline-formula id="inf11">
<mml:math id="m11">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and the enhanced expression data <inline-formula id="inf12">
<mml:math id="m12">
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> are computed as the input of the combined learning process (see Methods). On the enhanced expression data <inline-formula id="inf13">
<mml:math id="m13">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, AE-GCN employs AE to learn the low-dimensional representation (i.e., AE-specific representation <inline-formula id="inf14">
<mml:math id="m14">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf15">
<mml:math id="m15">
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the number of total layers in AE) in each layer. With the spatially neighboring network <inline-formula id="inf16">
<mml:math id="m16">
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, AE-GCN utilizes GCN to learn the graph-constrained representation in each layer (i.e., GCN-specific representation <inline-formula id="inf17">
<mml:math id="m17">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>B</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf18">
<mml:math id="m18">
<mml:mrow>
<mml:mi>B</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the number of layers in GCN or the encoder of AE). Then, AE-GCN transfers the AE-specific representations from the encoder to the corresponding GCN layer, thus generating the combined representation <inline-formula id="inf19">
<mml:math id="m19">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. Additionally, AE-GCN proposes a clustering-aware contrastive module to make the combined representation appropriate for spatial clustering.</p>
<p>When the learning process reaches convergence, the low-dimensional representation (i.e., <inline-formula id="inf20">
<mml:math id="m20">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>) of the last layer and the reconstructed expression data (i.e., <inline-formula id="inf21">
<mml:math id="m21">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>) can be used for downstream analytical tasks. The optimal representation enables AE-GCN to identify spatial domains interoperating with the Leiden method (<xref ref-type="bibr" rid="B24">Traag et al., 2019</xref>). The reconstructed expression data serve as the denoised profile, which overcomes the sparsity of the original data to improve differentially expressed gene identification (see Methods).</p>
</sec>
<sec id="s2-2">
<title>Spatially neighboring network construction and expression augmentation</title>
<sec id="s2-2-1">
<title>Spatially neighboring network construction</title>
<p>Assume that there are original expression matrix <inline-formula id="inf22">
<mml:math id="m22">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mn>0</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and spatial locations in the SRT dataset. We first use spatial coordinates and Euclidean distance to calculate the distance between spots and then select the <italic>k</italic>-nearest spatial neighbors of each spot to participate in the subsequent process. In this work, we set <italic>k</italic> &#x3d; 10 for ST and 10x Visium datasets and <italic>k</italic> &#x3d; 30 for Slide-seqV2 datasets. Then, we perform principal component analysis (PCA) based on gene expression and select the top <inline-formula id="inf23">
<mml:math id="m23">
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> PCs (i.e., <inline-formula id="inf24">
<mml:math id="m24">
<mml:mrow>
<mml:mi>U</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, default to 15) to calculate the similarity matrix <inline-formula id="inf25">
<mml:math id="m25">
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> between the center spot and its spatial neighbors using cosine metric:<disp-formula id="e1">
<mml:math id="m26">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>D</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>exp</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>e</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>t</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>U</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>Then, the weighted adjacency matrix <inline-formula id="inf26">
<mml:math id="m27">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is obtained by normalizing the similarity matrix <inline-formula id="inf27">
<mml:math id="m28">
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>:<disp-formula id="e2">
<mml:math id="m29">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
</mml:mstyle>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
</sec>
<sec id="s2-2-2">
<title>Spatial expression augmentation</title>
<p>Limited by the transcript capture rate of SRT technologies, expression data are often sparse and noisy. AE-GCN generates the enhanced expression data <inline-formula id="inf28">
<mml:math id="m30">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> by borrowing the shared information from spatial neighborhood, which can correct low-quality measurements and strengthen local similarity:<disp-formula id="e3">
<mml:math id="m31">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mn>0</mml:mn>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mn>0</mml:mn>
</mml:msup>
<mml:mi>A</mml:mi>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>where the tunable parameter <inline-formula id="inf29">
<mml:math id="m32">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is flexibly set, and it controls the extent to aggregating expression information from neighboring spots.</p>
</sec>
</sec>
<sec id="s2-3">
<title>AE component</title>
<p>We employ AE to learn the useful representations from the expression data itself and assume that there are <inline-formula id="inf30">
<mml:math id="m33">
<mml:mrow>
<mml:mi>B</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> layers in the encoder and <inline-formula id="inf31">
<mml:math id="m34">
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>B</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula> layers in the decoder. Specifically, the learned <inline-formula id="inf32">
<mml:math id="m35">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>th layer representation, <inline-formula id="inf33">
<mml:math id="m36">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, can be obtained as follows:<disp-formula id="e4">
<mml:math id="m37">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3d5;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2b;</mml:mo>
<mml:msup>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>B</mml:mi>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>
<disp-formula id="e5">
<mml:math id="m38">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2b;</mml:mo>
<mml:msup>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>B</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>L</mml:mi>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>where <inline-formula id="inf34">
<mml:math id="m39">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3d5;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf35">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c8;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are the activation functions of the <inline-formula id="inf36">
<mml:math id="m41">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>th layer in the respective encoder and decoder. <inline-formula id="inf37">
<mml:math id="m42">
<mml:mrow>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf38">
<mml:math id="m43">
<mml:mrow>
<mml:msup>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> are the weight matrix and reconstruction error in the <inline-formula id="inf39">
<mml:math id="m44">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>th layer, respectively. For convenience, we denote the enhanced expression data <inline-formula id="inf40">
<mml:math id="m45">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> as <inline-formula id="inf41">
<mml:math id="m46">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>The output (i.e., <inline-formula id="inf42">
<mml:math id="m47">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>) of the decoder part is obtained through the reconstruction of the input data (i.e., <inline-formula id="inf43">
<mml:math id="m48">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>) by minimizing the following loss function:<disp-formula id="e6">
<mml:math id="m49">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mfenced open="&#x2016;" close="&#x2016;" separators="|">
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>F</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:math>
<label>(6)</label>
</disp-formula>
</p>
</sec>
<sec id="s2-4">
<title>GCN component</title>
<p>AE-specific representations, e.g., <inline-formula id="inf44">
<mml:math id="m50">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf45">
<mml:math id="m51">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf46">
<mml:math id="m52">
<mml:mrow>
<mml:mo>&#x22ef;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf47">
<mml:math id="m53">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, can denoise data itself and extract valuable information from the data itself, which can effectively reflect expression variation but cannot guarantee the spatial smoothness of the identified domains. GCN can model the spatial structural dependency between spots, which is beneficial to improving the spatial smoothness of the identified domains. Thus, we then transfer AE-specific representations in the encoder into GCN-specific representations and use the GCN module to propagate these AE-specific representations for capturing a more complete and powerful representation. Thus, the GCN-learnable representations can accommodate two different kinds of information: gene expression values and spatial neighborhood structure. The representation learned by the <inline-formula id="inf48">
<mml:math id="m54">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>th layer of GCN, <inline-formula id="inf49">
<mml:math id="m55">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, can be obtained as follows:<disp-formula id="e7">
<mml:math id="m56">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3bc;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:msub>
<mml:mi>&#x3d5;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mover accent="true">
<mml:mi>A</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>&#x3bc;</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mover accent="true">
<mml:mi>A</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:math>
<label>(7)</label>
</disp-formula>where <inline-formula id="inf50">
<mml:math id="m57">
<mml:mrow>
<mml:mi>I</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> denotes the identity diagonal matrix. <inline-formula id="inf51">
<mml:math id="m58">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>A</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>I</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf52">
<mml:math id="m59">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:msub>
<mml:mover accent="true">
<mml:mi>A</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. <inline-formula id="inf53">
<mml:math id="m60">
<mml:mrow>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mover accent="true">
<mml:mi>A</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is the normalized adjacency matrix. <inline-formula id="inf54">
<mml:math id="m61">
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the balance coefficient and is often uniformly set to 0.5. Note that GCN and AE share weights.</p>
<p>Note that we denote the representation (i.e., <inline-formula id="inf55">
<mml:math id="m62">
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>B</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>) of the last GCN layer as <inline-formula id="inf56">
<mml:math id="m63">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. The input of the first layer GCN can be obtained from the enhanced expression data <inline-formula id="inf57">
<mml:math id="m64">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>:<disp-formula id="e8">
<mml:math id="m65">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>H</mml:mi>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3d5;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msup>
<mml:mi>X</mml:mi>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mover accent="true">
<mml:mi>A</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:msup>
<mml:mover accent="true">
<mml:mi>D</mml:mi>
<mml:mo>&#x223c;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(8)</label>
</disp-formula>
</p>
</sec>
<sec id="s2-5">
<title>Clustering-aware contrastive component</title>
<p>Although we have incorporated the encoder of AE into the neural network architecture of GCN to obtain the combined representation, this representation cannot be directly applied to the clustering problem. Herein, we propose a clustering-aware contrastive module to unify these two different deep learning models for effective spatial clustering.</p>
<p>Specifically, we use student&#x2019;s <italic>t</italic>-distribution to measure the probability of assigning the spot <inline-formula id="inf58">
<mml:math id="m66">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> to cluster <inline-formula id="inf59">
<mml:math id="m67">
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> based on the combined latent representation <inline-formula id="inf60">
<mml:math id="m68">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> as follows:<disp-formula id="e9">
<mml:math id="m69">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="&#x2016;" close="&#x2016;" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>/</mml:mo>
<mml:mi>&#x3c1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mrow>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="&#x2016;" close="&#x2016;" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>/</mml:mo>
<mml:mi>&#x3c1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(9)</label>
</disp-formula>where <inline-formula id="inf61">
<mml:math id="m70">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the cluster center by <italic>K</italic>-means on learned representations. <inline-formula id="inf62">
<mml:math id="m71">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the <inline-formula id="inf63">
<mml:math id="m72">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>th column of <inline-formula id="inf64">
<mml:math id="m73">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. We regard <inline-formula id="inf65">
<mml:math id="m74">
<mml:mrow>
<mml:mi>Q</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="|">
<mml:mrow>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> as the distribution of the assignments of all samples. <inline-formula id="inf66">
<mml:math id="m75">
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the degree of freedom of student&#x2019;s <italic>t</italic>-distribution.</p>
<p>To optimize the AE-GCN-learnable representation from the high-confidence assignment, we make data representation closer to cluster centers for improving the cluster cohesion. Hence, we calculate the target distribution <inline-formula id="inf67">
<mml:math id="m76">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> as follows:<disp-formula id="e10">
<mml:math id="m77">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:msub>
<mml:msup>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:msub>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:math>
<label>(10)</label>
</disp-formula>where <inline-formula id="inf68">
<mml:math id="m78">
<mml:mrow>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the soft cluster frequency. Each assignment in <inline-formula id="inf69">
<mml:math id="m79">
<mml:mrow>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is squared and normalized to produce the target distribution <inline-formula id="inf70">
<mml:math id="m80">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, which makes the data representation surround the cluster centers closer and helps AE-GCN learn a better representation for the clustering task. By minimizing the KL (Kullback&#x2013;Leibler) divergence loss between <inline-formula id="inf71">
<mml:math id="m81">
<mml:mrow>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf72">
<mml:math id="m82">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> distributions, the target distribution <italic>P</italic> can help the AE-GCN learn a better representation for the clustering task, i.e., making the data representation surround the cluster centers closer, thus leading to the following loss function:<disp-formula id="e11">
<mml:math id="m83">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>K</mml:mi>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="|">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mfenced open="&#x2016;" close="" separators="|">
<mml:mrow>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">log</mml:mi>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(11)</label>
</disp-formula>
</p>
<p>This design is regarded as a clustering-aware contrastive mechanism, where the <inline-formula id="inf73">
<mml:math id="m84">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> distribution supervises the updating of the distribution <inline-formula id="inf74">
<mml:math id="m85">
<mml:mrow>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and the target distribution <inline-formula id="inf75">
<mml:math id="m86">
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is calculated by the distribution <inline-formula id="inf76">
<mml:math id="m87">
<mml:mrow>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> in turn. Using this mechanism, AE-GCN can directly concentrate two different objectives: clustering objective and data reconstruction objective, in one loss function. Thus, the overall loss function of AE-GCN is<disp-formula id="e12">
<mml:math id="m88">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="italic">&#x3b2;</mml:mi>
<mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:math>
<label>(12)</label>
</disp-formula>where <inline-formula id="inf77">
<mml:math id="m89">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> denotes the tunable parameter <inline-formula id="inf78">
<mml:math id="m90">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>&#x3e;</mml:mi>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and can be flexibly set, which balances data reconstruction and clustering optimization.</p>
</sec>
<sec id="s2-6">
<title>Data collection and general preprocessing</title>
<p>The top 3,000 highly variable genes (HVGs) for 13 10x Visium datasets, one ST dataset, and one Slide-seqV2 dataset are selected using scanpy.pp.highly_variable_genes() from the Scanpy Python package. The log-transformation of the expression profiles is performed using scanpy.pp.log1p() on the original gene expression data.</p>
</sec>
<sec id="s2-7">
<title>Spatial domain detection and gene expression denoising</title>
<p>AE-GCN uses the combined latent representation <inline-formula id="inf79">
<mml:math id="m91">
<mml:mrow>
<mml:mi>Y</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> to detect spatial domains by Leiden (<xref ref-type="bibr" rid="B24">Traag et al., 2019</xref>) algorithms implemented as scanpy.tl.leiden(). The parameter &#x201c;resolution&#x201d; can be adjusted to match the number of the manual annotations.</p>
<p>For the enhanced expression matrix <inline-formula id="inf80">
<mml:math id="m92">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, AE-GCN aggregates the shared information between each spot and its surrounding neighbors by incorporating prior spatial information into gene expression, which is used to adjust expression values in each spot and enrich spatial local signals. For the reconstructed expression data <inline-formula id="inf81">
<mml:math id="m93">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, AE-GCN uses AE and GCN to reconstruct the enhanced expression matrix <inline-formula id="inf82">
<mml:math id="m94">
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. By minimizing the reconstruction error, the reconstructed data <inline-formula id="inf83">
<mml:math id="m95">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> can reflect both the spatial local signals and expression measurement global signals. Thus, AE-GCN uses the reconstructed expression data <inline-formula id="inf84">
<mml:math id="m96">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> as the denoised profiles.</p>
</sec>
<sec id="s2-8">
<title>Performance evaluation</title>
<p>We use adjusted Rand index (ARI) (<xref ref-type="bibr" rid="B11">Hubert and Arabie, 1985</xref>) and cluster purity (i.e., Eq. <xref ref-type="disp-formula" rid="e13">13</xref>) (<xref ref-type="bibr" rid="B29">Zhao et al., 2021</xref>) to quantify the accuracy of the identified spatial domain and the reference annotations from original publications.<disp-formula id="e13">
<mml:math id="m97">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi mathvariant="normal">c</mml:mi>
<mml:mi mathvariant="normal">l</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="normal">p</mml:mi>
<mml:mi mathvariant="normal">u</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mrow>
<mml:munder>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:munder>
<mml:mi>max</mml:mi>
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>G</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mfenced open="|" close="|" separators="|">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x2229;</mml:mo>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:math>
<label>(13)</label>
</disp-formula>where <inline-formula id="inf85">
<mml:math id="m98">
<mml:mrow>
<mml:mi>C</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is denoted as the set of the spatial cluster set and <inline-formula id="inf86">
<mml:math id="m99">
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is regarded as the set of annotated groups. Due to cancer slices with rough annotations (e.g., IDC and PDAC cancer data), cluster purity is specifically used to evaluate the clustering performance on SRT cancer datasets (<xref ref-type="bibr" rid="B29">Zhao et al., 2021</xref>).</p>
</sec>
<sec id="s2-9">
<title>Survival analysis</title>
<p>We use bulk expression data with patient survival information to evaluate the prognostic significance of genes via the Kaplan&#x2013;Meier plotter (<xref ref-type="bibr" rid="B30">Zwyea et al., 2021</xref>) in the IDC and PDAC cancer studies.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Benchmarking AE-GCN against state-of-the-art methods</title>
<p>We evaluated the ability of AE-GCN to detect spatial domains using 12 human dorsolateral prefrontal cortex (DLPFC) slices generated using 10x Visium. The DLPFC dataset obtained from spatialLIBD (<xref ref-type="bibr" rid="B17">Pardo et al., 2022</xref>) is manually annotated as the layered regions by gene markers and cytoarchitecture. The annotations can be considered as the ground truth for benchmarking. Based on this dataset, we compared AE-GCN with the existing state-of-the-art methods, including six spatial clustering methods [i.e., BayesSpace (<xref ref-type="bibr" rid="B29">Zhao et al., 2021</xref>), Giotto (<xref ref-type="bibr" rid="B5">Dries et al., 2021</xref>), SEDR (<xref ref-type="bibr" rid="B7">Fu et al., 2021</xref>), SpaGCN (<xref ref-type="bibr" rid="B9">Hu et al., 2021</xref>), stLearn (<xref ref-type="bibr" rid="B18">Pham et al., 2020</xref>), and STAGATE (<xref ref-type="bibr" rid="B4">Dong and Zhang, 2022</xref>)] and three non-spatial algorithms [i.e., variational autoencoder (VAE) (<xref ref-type="bibr" rid="B12">Kingma and Welling, 2019</xref>), Leiden implemented in Scanpy (<xref ref-type="bibr" rid="B26">Wolf et al., 2018</xref>), and Louvain implemented in Seurat (<xref ref-type="bibr" rid="B2">Butler et al., 2018</xref>)]. The adjusted Rand index (ARI) is used to quantify the similarity between the manual labels and identified clusters, which ranges from 0 for poor consistency to 1 for identical clusters.</p>
<p>Generally, most of the spatial clustering methods performed better than non-spatial algorithms (Wilcoxon signed-rank test <inline-formula id="inf87">
<mml:math id="m100">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, <xref ref-type="fig" rid="F2">Figure 2A</xref>), which showed that the integration of spatial information is necessary to improve the spatial clustering performance. Strikingly, AE-GCN had the highest mean ARI (mean ARI &#x3d; 0.561) and substantially performed better than the competing methods over the slices (Wilcoxon signed-rank test <inline-formula id="inf88">
<mml:math id="m101">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>8</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, <xref ref-type="fig" rid="F2">Figure 2A</xref>). Taking slice 151673 as an example (<xref ref-type="fig" rid="F2">Figure 2B</xref>), we found AE-GCN (ARI &#x3d; 0.623), STAGATE (ARI &#x3d; 0.588), BayesSpace (ARI &#x3d; 0.556), and SEDR (ARI &#x3d; 0.515) delineated the layered regions (<xref ref-type="fig" rid="F2">Figure 2C</xref>). Notably, the partitions from AE-GCN (termed as a deep learning model-combined method) exhibited clearer and less noisy outcomes than those from single model-based methods (e.g., GCN-based SpaGCN and the VAE model).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Benchmarking AE-GCN against state-of-the-art spatial domain detection methods. <bold>(A)</bold> Spatial clustering performance is compared using ARI on 12 manually annotated DLPFC datasets from spatialLIBD. The bold line represents the mean ARI value of each approach on all the datasets. <bold>(B)</bold> Slice 151673 with the manual annotation. <bold>(C)</bold> Comparative illustration of the identified spatial domain on slice 151673. The identified spatial domains of each method are distinguished by colors without strict correspondence.</p>
</caption>
<graphic xlink:href="fgene-14-1202409-g002.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>AE-GCN reveals fine-grained anatomical regions on mouse hippocampus Slide-seqV2 data</title>
<p>To illustrate the effectiveness of AE-GCN on high-resolution SRT platforms, we applied AE-GCN to a mouse hippocampus Slide-seqV2 dataset (<italic>n</italic> &#x3d; 41,786 spots). Slide-seqV2 can measure gene expression at near-cellular resolution (<xref ref-type="bibr" rid="B22">Stickels et al., 2021</xref>) but has lower number of transcripts per location/spot and higher dropouts than the 10x Visium platform. Thus, it poses more challenges for accurately distinguishing tissue structures from the data of high sparsity. To better validate the performance of AE-GCN, we also compared it with other domain detection methods and used the corresponding anatomical diagram from the Allen Mouse Brain Atlas (<xref ref-type="bibr" rid="B23">Sunkin et al., 2012</xref>) as the illustrative reference (<xref ref-type="fig" rid="F3">Figure 3A</xref>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>AE-GCN reveals the finer-grained anatomical regions on mouse hippocampus Slide-seqV2 data. <bold>(A)</bold> Corresponding anatomical diagram from the Allen Mouse Brain Atlas and spatial domains identified by each competed method. <bold>(B&#x2013;D)</bold> CA2 and ventricle and habenula regions (at the top) from AE-GCN partitions are, respectively, validated by the known gene markers (i.e., <italic>Pcp4</italic>, <italic>Enpp2</italic>, and <italic>Gabbr2</italic>) from gene expression (at the middle) and ISH images (at the bottom). The ISH images of <italic>Pcp4</italic>, <italic>Enpp2</italic>, and <italic>Gabbr2</italic> are also obtained from the Allen Mouse Brain Atlas.</p>
</caption>
<graphic xlink:href="fgene-14-1202409-g003.tif"/>
</fig>
<p>Comparing with the reference, we found that AE-GCN and STAGATE can identify the spatially coherent domains compared to other involved methods. However, AE-GCN performed better to detect the fine-grained structures, such as the cornu ammonis 2 (CA2, AE-GCN domain 16), ventricle (AE-GCN domain 12), and habenula (AE-GCN domain 11) sections (<xref ref-type="fig" rid="F3">Figure 3A</xref>). These sections are delineated with sharper boundaries and higher concordance with the anatomical annotation. We further isolated the focused regions and provided validations from other perspectives (<xref ref-type="fig" rid="F3">Figures 3B&#x2013;D</xref>). For the CA2 section, which is only detected by AE-GCN, the domain location showed good alignment with the marker gene expression (i.e., <italic>Pcp4</italic> (<xref ref-type="bibr" rid="B20">San Antonio et al., 2014</xref>)) and independent <italic>in situ</italic> hybridization (ISH) image (<xref ref-type="fig" rid="F3">Figure 3B</xref>). For ventricle and habenula sections, AE-GCN domains are closer to the shapes of their respective marker expression (<italic>Enpp2</italic> for ventricle (<xref ref-type="bibr" rid="B13">Koike et al., 2006</xref>) and <italic>Gabbr2</italic> for habenula (<xref ref-type="bibr" rid="B3">De Beaurepaire, 2018</xref>)) or stained regions and match the anatomical shape well (<xref ref-type="fig" rid="F3">Figures 3C, D</xref>). Thus, for higher-resolution SRT data, AE-GCN is capable of effectively unveiling the fine-grained anatomical functional regions.</p>
</sec>
<sec id="s3-3">
<title>AE-GCN accurately discerns tumor regions on human pancreatic ductal adenocarcinoma data</title>
<p>To illustrate the effectiveness of AE-GCN on cancer tissue, we applied AE-GCN to the human pancreatic ductal adenocarcinoma (PDAC) ST dataset (<italic>n</italic> &#x3d; 428 spots). The histopathological image and annotations were taken as references (<xref ref-type="fig" rid="F4">Figures 4A, B</xref>). We assessed these spatial domain identification methods using cluster purity (see Methods) as the quantitative measure on cancer datasets with rough annotation information. AE-GCN achieved the highest cluster purity (purity &#x3d; 0.756) and detected more spatially enriched functional regions in tumor tissue than other compared methods (<xref ref-type="fig" rid="F4">Figure 4C</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>AE-GCN identifies tumor regions on human PDAC ST data. The H&#x26;E-stained image <bold>(A)</bold> and the corresponding manual annotation <bold>(B)</bold> are shown as references. <bold>(C)</bold> The identified spatial domains using all the compared methods are distinguished by different colors without strict correspondence. Cluster purity is used to compare the similarity between identified domains and the reference annotation. <bold>(D)</bold> The change in gene differential expression in each domain before and after data denoising. log<sub>2</sub>(FC): the logarithmic value of the gene expression fold change with base 2. <bold>(E)</bold> Spatial expression visualization of selected DEGs (i.e., <italic>S100P</italic> and <italic>TNS4</italic>) before and after data denoising. <bold>(F)</bold> Kaplan&#x2013;Meier survival curves show the clinical relevance of the identified DEGs (i.e., <italic>S100P</italic> and <italic>TNS4</italic>).</p>
</caption>
<graphic xlink:href="fgene-14-1202409-g004.tif"/>
</fig>
<p>Next, we examined whether AE-GCN could provide more insights into the underlying tumor heterogeneity, as data sparsity could hinder other downstream analytical tasks, for example, the identification of differentially expressed genes (DEGs). In this manner, we used the AE-GCN-reconstructed data to denoise the low-quality measurements and evaluated the effectiveness in recovering gene spatial expression patterns. Based on the denoised data, we selected the top 50 DEGs of each domain from the reconstructed data <inline-formula id="inf89">
<mml:math id="m102">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and compared the log fold change (LFC) of these DEGs before and after denoising (<xref ref-type="fig" rid="F4">Figure 4D</xref>). Overall, the comparison highlights the significant improvement of biological specificity brought by AE-GCN denoising across the identified domains (Wilcoxon signed-rank test <inline-formula id="inf90">
<mml:math id="m103">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>14</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, <xref ref-type="fig" rid="F4">Figure 4D</xref>). In particular, we found that some DEG expression (e.g., <italic>S100P</italic> and <italic>TNS4</italic>) appeared more spatially smoothed on spots <italic>in situ</italic> (<xref ref-type="fig" rid="F4">Figure 4E</xref>). These two DEGs were validated to be the potential prognostic risk factors for PDAC (<xref ref-type="fig" rid="F4">Figure 4F</xref>). For example, <italic>S100P</italic> is ever reported to be involved in the aggressive properties of cancer cells and associated with poor prognosis (<xref ref-type="bibr" rid="B25">Wang et al., 2012</xref>) (<xref ref-type="fig" rid="F4">Figure 4F</xref>); <italic>TNS4</italic> is associated with cancer cell motility and migration, whose high expression can indicate poor prognosis (<xref ref-type="bibr" rid="B19">Sakashita et al., 2008</xref>). These results indicate that AE-GCN has the potential to provide the in-depth biological insights into the underlying tumor heterogeneity from the perspectives of spatial domain detection and gene expression pattern recovery.</p>
</sec>
<sec id="s3-4">
<title>AE-GCN reveals more intratumor heterogeneity on invasive ductal carcinoma data</title>
<p>To illustrate the generalization ability of AE-GCN on cancer tissues, we next tested AE-GCN using the invasive ductal carcinoma (IDC) Visium dataset (<italic>n</italic> &#x3d; 4,727 spots). The histopathological annotations from the original paper (<xref ref-type="bibr" rid="B29">Zhao et al., 2021</xref>) were taken as the reference (<xref ref-type="fig" rid="F5">Figures 5A, B</xref>). We found that the identified domains of AE-GCN were highly consistent with the manual annotations (purity &#x3d; 0.865, <xref ref-type="fig" rid="F5">Figure 5C</xref>). Compared with the domains captured by other methods, the clustering partitions from AE-GCN showed clear spatial separations with few scatter points and high regional continuity.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>AE-GCN provides more biological insights into intratumor heterogeneity on the IDC 10x Visium dataset. The fluorescent image <bold>(A)</bold> and the corresponding manual annotation <bold>(B)</bold> are shown as references. Each spot is colored due to the annotation label in <bold>(B)</bold>. <bold>(C)</bold> The spatial domains obtained by all involved methods are distinguished using different colors without strict correspondence. Cluster purity is used to compare the similarities between identified outcomes and reference annotation. <bold>(D)</bold> The change gene FC before and after data denoising. <bold>(E)</bold> Spatial expression visualization of the selected domain-specific genes (i.e., <italic>SLC7A5</italic> and <italic>RDH16</italic>) before and after data denoising. <bold>(F)</bold> Kaplan&#x2013;Meier survival curves show the clinical relevance of the newly identified DEGs (i.e., <italic>SLC7A5</italic> and <italic>RDH16</italic>).</p>
</caption>
<graphic xlink:href="fgene-14-1202409-g005.tif"/>
</fig>
<p>Then, for functional gene identification, we identified the top 50 DEGs of each cluster from the denoised data <inline-formula id="inf91">
<mml:math id="m104">
<mml:mrow>
<mml:msup>
<mml:mi>X</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. Similarly, based on the comparison before and after denoising, we found that AE-GCN significantly improves the LFCs of gene expression, revealing more biological specificity across domains, which may suggest the detection of new disease-associated genes (<xref ref-type="fig" rid="F5">Figure 5D</xref>). For example, <italic>SLC7A5</italic> and <italic>RDH16</italic> are two newly found DEGs after denoising, whose spatial expression patterns are greatly enhanced after denoising (<xref ref-type="fig" rid="F5">Figure 5E</xref>). Moreover, the two novel DEGs were shown to be the potential prognostic risk genes for breast cancer via survival analysis of independent clinical data (<xref ref-type="fig" rid="F5">Figure 5F</xref>). Their biological functions in tumors indicate the prognostic relevance from previous studies. For example, <italic>SLC7A5</italic> is reported to involve in tumor cell metabolism and promotes cell proliferation (<xref ref-type="bibr" rid="B6">El Ansari et al., 2018</xref>). <italic>RDH16</italic> affects retinol metabolism to participate indirectly in breast cancer occurrence and progression (<xref ref-type="bibr" rid="B8">Gao et al., 2020</xref>). The application, along with the PDAC case, demonstrates that AE-GCN can unveil cancer heterogeneity from SRT data, enabling the discovery of novel spatial patterns of both samples and genes.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Spatially resolved transcriptomics technologies measure gene expression on each spot while preserving spatial context, which can support computational methods to identify functional regions of tissue and further resolve organizational heterogeneity. The combined modeling of gene expression and spatial information enables the improved identification accuracy of spatial domains, especially for complex spatial architecture, e.g., tumor microenvironments. In this paper, AE-GCN combines the autoencoder and graph convolutional neural network to achieve effective latent representations from expression data itself and spot neighboring structure. The superiority of AE-GCN is shown not only on the accurate and fine-grained identification of spatial domains for multiple SRT platforms but also on the recovery or identification of gene spatial expression patterns. In particular, the application on cancer slices (i.e., IDC and PDAC) demonstrates that AE-GCN reveals more functional regions and novel cancer prognostic genes for interpreting cancer heterogeneity, suggesting that AE-GCN has great capability of unveiling tissue heterogeneity from SRT data.</p>
<p>The effectively combined modeling is key to the superiority of AE-GCN in the SRT study. Generally, AE models learn the representations from expression data itself, while GCN models learn the structured representations from the sample graph structure by providing an approximate second-order graph regularization, which may suffer from over-smoothing issues. AE-GCN combines the characteristics of these two deep learning methods and integrates them to learn effective representations so that AE is used to weaken the problem of overfitting while simultaneously learning the structured representations in GCN. Additionally, the proposed clustering-aware contrastive module in AE-GCN further promotes the combined model from processes independent of clustering targets to the model that achieves effective spatial clustering. Thus, AE-GCN can not only effectively use the information of the expression data itself but also reasonably regularize the learned information from expression data by spatial structure between spots, which has better advantages than the spatial domain detection methods based on a single-model design in SRT studies.</p>
<p>Currently, AE-GCN only models gene expression and spatial information from SRT data and cannot utilize histological images which are also provided by several SRT technologies, e.g., 10x Visium. Although some methods have used histological images in spatial domain detection, histological images are mainly used to enhance the quality of expression data and lack of modeling image data separately, e.g., stLearn (<xref ref-type="bibr" rid="B18">Pham et al., 2020</xref>). Compared with expression data and spatial information, histological image data are one type of modalities more suitable for deep learning modeling. The future work to extend AE-GCN is to integrate deep learning models for each multi-modal data characteristic (i.e., gene expression, histological images, and spatial information) to improve the performance of current methods in SRT research.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The available web resources include human DLPFC datasets (available in spatialLIBD package), mouse hippocampus Slide-seqV2 dataset (<ext-link ext-link-type="uri" xlink:href="https://singlecell.broadinstitute.org">https://singlecell.broadinstitute.org</ext-link>), human IDC 10x Visium dataset (<ext-link ext-link-type="uri" xlink:href="https://www.10xgenomics.com/resources/datasets">https://www.10xgenomics.com/resources/datasets</ext-link>) and human PDAC ST dataset (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE111672">https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc&#x003D;GSE111672</ext-link>). Python source code of AE-GCN is available at <ext-link ext-link-type="uri" xlink:href="https://github.com/zccqq/AE-GCN">https://github.com/zccqq/AE-GCN</ext-link>.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>QS conceived and designed the framework and the experiments. XL and WH performed the experiments. WH and XX developed the Python package and documentation website of the framework. QS analyzed the data and wrote the paper. QS and HZ revised the manuscript. All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was supported by the National Natural Science Foundation of China (Grant No. 61802141).</p>
</sec>
<ack>
<p>The authors thank the reviewers for useful suggestions.</p>
</ack>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Atta</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Computational challenges and opportunities in spatially resolved transcriptomic data analysis</article-title>. <source>Nat. Commun.</source> <volume>12</volume> (<issue>1</issue>), <fpage>5283</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-021-25557-9</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Butler</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Hoffman</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Smibert</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Papalexi</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Satija</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Integrating single-cell transcriptomic data across different conditions, technologies, and species</article-title>. <source>Nat. Biotechnol.</source> <volume>36</volume> (<issue>5</issue>), <fpage>411</fpage>&#x2013;<lpage>420</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.4096</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>De Beaurepaire</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>A review of the potential mechanisms of action of baclofen in alcohol use disorder</article-title>. <source>Front. Psychiatry</source> <volume>9</volume>, <fpage>506</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyt.2018.00506</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dong</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder</article-title>. <source>Nat. Commun.</source> <volume>13</volume> (<issue>1</issue>), <fpage>1739</fpage>&#x2013;<lpage>1812</lpage>. <pub-id pub-id-type="doi">10.1038/s41467-022-29439-6</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dries</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Eng</surname>
<given-names>C.-H. .L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Giotto: A toolbox for integrative analysis and visualization of spatial expression data</article-title>. <source>Genome Biol.</source> <volume>22</volume> (<issue>1</issue>), <fpage>78</fpage>&#x2013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1186/s13059-021-02286-2</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>El Ansari</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Craze</surname>
<given-names>M. .L.</given-names>
</name>
<name>
<surname>Miligy</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Diez-Rodriguez</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Nolan</surname>
<given-names>C. .C.</given-names>
</name>
<name>
<surname>Ellis</surname>
<given-names>I. .O.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>The amino acid transporter SLC7A5 confers a poor prognosis in the highly proliferative breast cancer subtypes and is a key therapeutic target in luminal B tumours</article-title>. <source>Breast Cancer Res.</source> <volume>20</volume>, <fpage>21</fpage>&#x2013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1186/s13058-018-0946-6</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Fu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Hang</surname>
<given-names>X. .U.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <source>Unsupervised spatial embedded deep representation of spatial transcriptomics</source>. <publisher-name>bioRxiv</publisher-name>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhuang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Development of a risk scoring system for evaluating the prognosis of patients with Her2-positive breast cancer</article-title>. <source>Cancer Cell Int.</source> <volume>20</volume> (<issue>1</issue>), <fpage>121</fpage>&#x2013;<lpage>212</lpage>. <pub-id pub-id-type="doi">10.1186/s12935-020-01175-1</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Coleman</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Schroeder</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Irwin</surname>
<given-names>D. .J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network</article-title>. <source>Nat. methods</source> <volume>18</volume> (<issue>11</issue>), <fpage>1342</fpage>&#x2013;<lpage>1351</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-021-01255-8</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Torre</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Dueck</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Shaffer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bonasio</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Saver: Gene expression recovery for single-cell RNA sequencing</article-title>. <source>Nat. methods</source> <volume>15</volume> (<issue>7</issue>), <fpage>539</fpage>&#x2013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-018-0033-z</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hubert</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Arabie</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>1985</year>). <article-title>Comparing partitions</article-title>. <source>J. Classif.</source> <volume>2</volume> (<issue>1</issue>), <fpage>193</fpage>&#x2013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1007/bf01908075</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kingma</surname>
<given-names>D. .P.</given-names>
</name>
<name>
<surname>Welling</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>An introduction to variational autoencoders</article-title>. <source>Found. Trends&#xae; Mach. Learn.</source> <volume>12</volume> (<issue>4</issue>), <fpage>307</fpage>&#x2013;<lpage>392</lpage>. <pub-id pub-id-type="doi">10.1561/2200000056</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Koike</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Keino&#x2010;Masu</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ohto</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Masu</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>The N&#x2010;terminal hydrophobic sequence of autotaxin (ENPP2) functions as a signal peptide</article-title>. <source>Genes cells</source> <volume>11</volume> (<issue>2</issue>), <fpage>133</fpage>&#x2013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2443.2006.00924.x</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maniatis</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Petrescu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Phatnani</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Spatially resolved transcriptomics and its applications in cancer</article-title>. <source>Curr. Opin. Genet. Dev.</source> <volume>66</volume>, <fpage>70</fpage>&#x2013;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.1016/j.gde.2020.12.002</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marx</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Method of the year: Spatially resolved transcriptomics</article-title>. <source>Nat. methods</source> <volume>18</volume> (<issue>1</issue>), <fpage>9</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-020-01033-y</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Nasab</surname>
<given-names>R. .Z.</given-names>
</name>
<name>
<surname>Ghamsari</surname>
<given-names>M. .R. .E.</given-names>
</name>
<name>
<surname>Argha</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Macphillamy</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Beheshti</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Alizadehsani</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <source>Deep learning in spatially resolved transcriptomics: A comprehensive technical view</source>. <comment>arXiv preprint arXiv:221004453 2022</comment>.</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pardo</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Spangler</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Weber</surname>
<given-names>L. .M.</given-names>
</name>
<name>
<surname>Page</surname>
<given-names>S. .C.</given-names>
</name>
<name>
<surname>Hicks</surname>
<given-names>S. .C.</given-names>
</name>
<name>
<surname>Jaffe</surname>
<given-names>A. .E.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>spatialLIBD: an R/Bioconductor package to visualize spatially-resolved transcriptomics data</article-title>. <source>BMC genomics</source> <volume>23</volume> (<issue>1</issue>), <fpage>434</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-022-08601-w</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pham</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Grice</surname>
<given-names>L. .F.</given-names>
</name>
<name>
<surname>Lam</surname>
<given-names>P. .Y.</given-names>
</name>
<name>
<surname>Raghubar</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues</article-title>. <source>bioRxiv</source>.</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sakashita</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Mimori</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Tanaka</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Kamohara</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Inoue</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sawada</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Prognostic relevance of Tensin4 expression in human gastric cancer</article-title>. <source>Ann. Surg. Oncol.</source> <volume>15</volume>, <fpage>2606</fpage>&#x2013;<lpage>2613</lpage>. <pub-id pub-id-type="doi">10.1245/s10434-008-9989-8</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>San Antonio</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Liban</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ikrar</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tsyganovskiy</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Distinct physiological and developmental properties of hippocampal CA2 subfield revealed by using anti&#x2010;Purkinje cell protein 4 (PCP4) immunostaining</article-title>. <source>J. Comp. Neurology</source> <volume>522</volume> (<issue>6</issue>), <fpage>1333</fpage>&#x2013;<lpage>1354</lpage>. <pub-id pub-id-type="doi">10.1002/cne.23486</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>St&#xe5;hl</surname>
<given-names>P. .L.</given-names>
</name>
<name>
<surname>Salm&#xe9;n</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Vickovic</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lundmark</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Navarro</surname>
<given-names>J. .F.</given-names>
</name>
<name>
<surname>Magnusson</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Visualization and analysis of gene expression in tissue sections by spatial transcriptomics</article-title>. <source>Science</source> <volume>353</volume> (<issue>6294</issue>), <fpage>78</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1126/science.aaf2403</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stickels</surname>
<given-names>R. .R.</given-names>
</name>
<name>
<surname>Murray</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Marshall</surname>
<given-names>J. .L.</given-names>
</name>
<name>
<surname>Di Bella</surname>
<given-names>D. .J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2</article-title>. <source>Nat. Biotechnol.</source> <volume>39</volume> (<issue>3</issue>), <fpage>313</fpage>&#x2013;<lpage>319</lpage>. <pub-id pub-id-type="doi">10.1038/s41587-020-0739-1</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sunkin</surname>
<given-names>S. .M.</given-names>
</name>
<name>
<surname>Ng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Lau</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Dolbeare</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Gilbert</surname>
<given-names>T. .L.</given-names>
</name>
<name>
<surname>Thompson</surname>
<given-names>C. .L.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Allen Brain Atlas: An integrated spatio-temporal portal for exploring the central nervous system</article-title>. <source>Nucleic acids Res.</source> <volume>41</volume> (<issue>D1</issue>), <fpage>D996-D1008</fpage>&#x2013;<lpage>D1008</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks1042</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Traag</surname>
<given-names>V. .A.</given-names>
</name>
<name>
<surname>Waltman</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Van Eck</surname>
<given-names>N. .J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>From Louvain to leiden: Guaranteeing well-connected communities</article-title>. <source>Sci. Rep.</source> <volume>9</volume> (<issue>1</issue>), <fpage>5233</fpage>&#x2013;<lpage>5312</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-41695-z</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.-N.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>G.-L.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>H.-Z.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>H.-Y.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>S100P, a potential novel prognostic marker in colorectal cancer</article-title>. <source>Oncol. Rep.</source> <volume>28</volume> (<issue>1</issue>), <fpage>303</fpage>&#x2013;<lpage>310</lpage>. <pub-id pub-id-type="doi">10.3892/or.2012.1794</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wolf</surname>
<given-names>F. .A.</given-names>
</name>
<name>
<surname>Angerer</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Theis</surname>
<given-names>F. .J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Scanpy: Large-scale single-cell gene expression data analysis</article-title>. <source>Genome Biol.</source> <volume>19</volume> (<issue>1</issue>), <fpage>15</fpage>&#x2013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1186/s13059-017-1382-0</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>DeepST: Identifying spatial domains in spatial transcriptomics by deep learning</article-title>. <source>Nucleic Acids Res.</source> <volume>50</volume> (<issue>22</issue>), <fpage>e131</fpage>&#x2013;<lpage>e</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkac901</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zeng</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Statistical and machine learning methods for spatially resolved transcriptomics data analysis</article-title>. <source>Genome Biol.</source> <volume>23</volume> (<issue>1</issue>), <fpage>83</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1186/s13059-022-02653-7</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Stone</surname>
<given-names>M. .R.</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Guenthoer</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Smythe</surname>
<given-names>K. .S.</given-names>
</name>
<name>
<surname>Pulliam</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Spatial transcriptomics at subspot resolution with BayesSpace</article-title>. <source>Nat. Biotechnol.</source> <volume>39</volume> (<issue>11</issue>), <fpage>1375</fpage>&#x2013;<lpage>1384</lpage>. <pub-id pub-id-type="doi">10.1038/s41587-021-00935-2</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Zwyea</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Naji</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Almansouri</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). &#x201c;<article-title>Kaplan-Meier plotter data analysis model in early prognosis of pancreatic cancer</article-title>,&#x201d; in <conf-name>Journal of Physics: Conference Series</conf-name> (<publisher-name>IOP Publishing</publisher-name>).</citation>
</ref>
</ref-list>
</back>
</article>