<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Chem.</journal-id>
<journal-title>Frontiers in Chemistry</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Chem.</abbrev-journal-title>
<issn pub-type="epub">2296-2646</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">707382</article-id>
<article-id pub-id-type="doi">10.3389/fchem.2021.707382</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Chemistry</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>GlycanGUI: Automated Glycan Annotation and Quantification Using Glucose Unit Index</article-title>
<alt-title alt-title-type="left-running-head">Zhang et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">GlycanGUI Automatic Annotation</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Rui</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1332463/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Peng</surname>
<given-names>Wenjing</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1222520/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gautam</surname>
<given-names>Sakshi</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Huang</surname>
<given-names>Yifan</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Mechref</surname>
<given-names>Yehia</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/603461/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Tang</surname>
<given-names>Haixu</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/34558/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Department of Computer Science, Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington, <addr-line>Bloomington</addr-line>, <addr-line>IN</addr-line>, <country>United&#x20;States</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>Department of Chemistry and Biochemistry, Texas Tech University, <addr-line>Lubbock</addr-line>, <addr-line>TX</addr-line>, <country>United&#x20;States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/846382/overview">Ganglong Yang,</ext-link> Jiangnan University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1338330/overview">Shunji Natsuka</ext-link>, Niigata University, Japan</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/709385/overview">Shiwei Sun</ext-link>, Institute of Computing Technology, Chinese Academy of Sciences (CAS), China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Haixu Tang, <email>hatang@indiana.edu</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Analytical Chemistry, a section of the journal Frontiers in Chemistry</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>15</day>
<month>06</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>9</volume>
<elocation-id>707382</elocation-id>
<history>
<date date-type="received">
<day>09</day>
<month>05</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>03</day>
<month>06</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Zhang, Peng, Gautam, Huang, Mechref and Tang.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Zhang, Peng, Gautam, Huang, Mechref and Tang</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>The retention time provides critical information for glycan annotation and quantification from the Liquid Chromatography Mass Spectrometry (LC-MS) data. However, the variation of the precise retention time of glycans is highly dependent on the experimental conditions such as the specific separating columns, MS instruments and/or the buffer used. This variation hampers the exploitation of retention time for the glycan annotation from LC-MS data, especially when inter-laboratory data are compared. To incorporate the retention time of glycan across experiments, Glucose Unit Index (GUI) can be computed using the dextrin ladder as internal standard. The retention time of glycans are then calibrated with respect to glucose units derived from dextrin ladders. Despite the successful application of the GUI approach, the manual calibration process is quite tedious and often error prone. In this work, we present a standalone software tool GlycanGUI, with a graphic user interface to automatically carry out the GUI-based glycan annotation/quantification and subsequent data analysis. When tested on experimental data, GlycanGUI reported accurate GUI values compared with manual calibration, and thus is ready to be used for automated glycan annotation and quantification using&#x20;GUI.</p>
</abstract>
<kwd-group>
<kwd>GUI</kwd>
<kwd>glycan</kwd>
<kwd>annotation</kwd>
<kwd>quantification</kwd>
<kwd>mass spectrometry</kwd>
</kwd-group>
<contract-num rid="cn001">5R01GM112490</contract-num>
<contract-sponsor id="cn001">National Institutes of Health<named-content content-type="fundref-id">10.13039/100000002</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Glycosylation is a post-translational modification that plays critical roles in important biological processes such as immune response, cellular differentiation/adhesion and host-pathogen interactions (<xref ref-type="bibr" rid="B21">Varki et&#x20;al., 2009</xref>). The aberrant alteration of glycan structure is implicit with malfunction of cells and possesses potential significance for the medical diagnosis of complex human diseases such as cancer (<xref ref-type="bibr" rid="B12">Ohtsubo and Marth, 2006</xref>; <xref ref-type="bibr" rid="B15">Pinho and Reis, 2015</xref>; <xref ref-type="bibr" rid="B18">Stowell et&#x20;al., 2015</xref>). The Liquid Chromatography coupled Mass Spectrometry (LC-MS) is one of the most widely used techniques for glycan analysis due to its high sensitivity and throughput (<xref ref-type="bibr" rid="B13">Pabst and Altmann, 2011</xref>). For a chromatography, the elution order of glycans is consistent and reproducible among experiments. Hence, the retention time is often used to assess the separation of glycans and to assist glycan identification (<xref ref-type="bibr" rid="B13">Pabst and Altmann, 2011</xref>). However, the precise retention time of a specific glycan may vary widely depending on the experimental conditions such as the separating columns, MS instruments and/or the buffer used. This variation hampers the interpretation of LC-MS/MS data, especially when inter-laboratory data are compared without any calibrations.</p>
<p>To address this issue, the Glucose Unit Index (GUI) can be utilized to normalize the retention time that eliminates the variation (<xref ref-type="bibr" rid="B11">Mellis and Baenziger, 1981</xref>). This method employs the dextrin ladder as the internal standard and measures the retention time of glycans with respect to the glucose units derived from the dextrin ladder. The retention time is assigned to a glucose unit (GU) value, which serves as a calibration of different experiments. The GUI approach was proposed for glycomics (<xref ref-type="bibr" rid="B6">Campbell et&#x20;al., 2008</xref>; <xref ref-type="bibr" rid="B17">Stockmann et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B1">Abrahams et&#x20;al., 2018</xref>) and has been extensively used for the normalization of retention time of glycans (<xref ref-type="bibr" rid="B5">Ashwood et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B8">Fabini et&#x20;al., 2001</xref>; <xref ref-type="bibr" rid="B19">Van den Steen et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B16">Royle et&#x20;al., 2008</xref>). In our previous study, we examined the use of permethylated dextrin for the annotation of permethylated N-glycans and their isomers derived from standard glycoproteins and human blood serum (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>). The calibrated GUI was proved to be reproducible across inter- and intra-laboratory analyses (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>).</p>
<p>Despite the successful application of the GUI approach, the calibration process is quite tedious, which requires manual data processing such as peak finding and curve fitting (<xref ref-type="bibr" rid="B4">Ashwood et&#x20;al., 2019</xref>). In this paper, we present a standalone software tool GlycanGUI to automate the whole calibration process. GlycanGUI offers a graphic user interface for users to choose logarithmic or polynomial fitting that will be used in the automatic calibration of the retention time of glycan ions. Furthermore, GlycanGUI implemented the computational procedures improved from those previously developed in GlycoHybridSeq (<xref ref-type="bibr" rid="B23">Zhang et&#x20;al., 2021</xref>) for automated glycan annotation and label-free quantification based on the total peak area of glycan ions with various charges and abducts. Therefore, GlycanGUI is ready to be used for large-scale comparative analyses of glycan abundances across many glycomic samples.</p>
</sec>
<sec id="s2">
<title>2 Method</title>
<sec id="s2-1">
<title>2.1 Experimental Data</title>
<p>The data were obtained from the dextrin spiked human blood serum using LC-MS according to the previously published experimental protocols (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>). Briefly, 1&#xa0;<italic>&#x3bc;</italic>g of dextrin was spiked as an internal standard in initial 10&#xa0;<italic>&#x3bc;</italic>l serum before permethylation. For each injection, the released N-glycans from 1&#xa0;<italic>&#x3bc;</italic>l of the initialserum sample were resuspended in 6&#xa0;<italic>&#x3bc;</italic>l of 20% ACN and 0.1%FA. For C18 columns, solution A was 98% water, 2% ACN, and 0.1% FA and solution B was 100% ACN and 0.1% FA. The gradient started at 20% solution B and increased to 42% in 11&#xa0;min. After 48&#xa0;min, it reached 55% and increased to 90% in 1&#xa0;min. The organic phase remained at 90% for 54&#xa0;min and decreased to 20% for 6&#xa0;min. The LC-MS data were acquired by using LTQ Orbitrap Velos (Thermo Scientific) instrument.</p>
</sec>
<sec id="s2-2">
<title>2.2 GlycoGUI Software</title>
<p>The software GlycoGUI was implemented in C&#x23; using the WPF framework for graphic user interface. After GlycoGUI reads the input LC-MS data (in thermo. raw format) using the MSFileReader library, a user can calibrate the retention time of any annotated glycans into the corresponding GUI value. The abundances (peak areas) of the glycans are computed over retention time and specifically for the detected retention range of major isomers. The source code of GlycanGUI can be found on Github at <ext-link ext-link-type="uri" xlink:href="https://github.com/ruizhang84/GlycanGUIApp">https://github.com/ruizhang84/GlycanGUIApp</ext-link>.</p>
</sec>
<sec id="s2-3">
<title>2.3 Glucose Units Identification</title>
<p>The ions corresponding to <italic>Glucose Units</italic> are extracted directly from the full MS spectra based on their expected mass-to-charge-ratio (m/z). Due to the instrumental noise and overlapping peaks, multiple putative glucose units (of different m/z) may be extracted at a specific retention time. To correctly label glucose units and avoid mis-identified glucose units, a dynamic programming algorithm is implemented to obtain the most likely sequence of glucose units. Briefly, a score is computed recursively for all glucose units at each retention time where a putative glucose unit (i.e.,&#x20;dextrin) is observed, which is based on the intensity of the peak matching the dextrin:<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>where <italic>i</italic> is the index for the observed retention time, <italic>u</italic> is the index of the glucose units. This equation is derived based on the fact that the higher glucose units (dextrin) always elute at a latter time than the lower units, as well as the assumption that the true peaks of the glucose units are likely more intensive than those false peaks. After identifying peaks corresponding to the glucose units in each experimental MS spectrum, the retention time with highest peak intensity for each glucose unit is used for curve fitting.</p>
</sec>
<sec id="s2-4">
<title>2.4 Curve Fitting</title>
<p>To calibrate the retention time of ion species into a glucose unit index (GUI), we adopt a polynomial regression,<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x2b;</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>where for an ion species ion <italic>i</italic>, <inline-formula id="inf1">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the (target) glucose unit index, <inline-formula id="inf2">
<mml:math id="m4">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is its retention time, and <inline-formula id="inf3">
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is <italic>j</italic>th coefficient in the polynomial regression. For a logarithmic fitting, the regression can be solved similarly by converting <inline-formula id="inf4">
<mml:math id="m6">
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> into <inline-formula id="inf5">
<mml:math id="m7">
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. The linear regression can be expressed in terms of matrix multiplication for a total of <italic>m</italic> glucose units (i.e.,&#x20;dextrin ladders),<disp-formula id="e3">
<mml:math id="m8">
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>3</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mn>1</mml:mn>
<mml:mi>m</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>3</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mn>2</mml:mn>
<mml:mi>m</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
</mml:mtd>
<mml:mtd>
</mml:mtd>
<mml:mtd>
</mml:mtd>
<mml:mtd>
</mml:mtd>
<mml:mtd>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi>n</mml:mi>
<mml:mn>3</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>m</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>m</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>or simply in the vector form as,<disp-formula id="e4">
<mml:math id="m9">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x2192;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi mathvariant="bold-italic">X</mml:mi>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2192;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>
</p>
<p>Using the least square estimation, the coefficients are computed as,<disp-formula id="e5">
<mml:math id="m10">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x2192;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">X</mml:mi>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mi mathvariant="bold-italic">X</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="bold-italic">X</mml:mi>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#x2192;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>where <inline-formula id="inf6">
<mml:math id="m11">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">X</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is the inverse of the matrix <inline-formula id="inf7">
<mml:math id="m12">
<mml:mi mathvariant="bold-italic">X</mml:mi>
</mml:math>
</inline-formula>, which can be computed by using the Gauss-Jordan method (<xref ref-type="bibr" rid="B2">Althoen and Mclaughlin, 1987</xref>). Once the coefficients are estimated using the elution time and the (target) GUI values of the dextrin ladders that are injected and observed in a LC-MS experiment, they can be used in the polynomial regression for calibrating the elution time of other ion species of interests (e.g., those annotated as glycans; see below), into GUI values.</p>
</sec>
<sec id="s2-5">
<title>2.5 Glycan Annotation and Label-free Quantification</title>
<p>We adopted the algorithm used for glycan annotation similar to one implemented in GlycoHybridSeq (<xref ref-type="bibr" rid="B23">Zhang et&#x20;al., 2021</xref>). Briefly, the potential N-glycans are pre-computed with up to a certain maximum number of monosaccharide residues (by default, &#x23;HexNAc<inline-formula id="inf8">
<mml:math id="m13">
<mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>12</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, &#x23;Hex<inline-formula id="inf9">
<mml:math id="m14">
<mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>12</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, &#x23;Fuc<inline-formula id="inf10">
<mml:math id="m15">
<mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and &#x23;NeuAc<inline-formula id="inf11">
<mml:math id="m16">
<mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) according to the biosynthesis rule (<xref ref-type="bibr" rid="B23">Zhang et&#x20;al., 2021</xref>). The theoretical isotopic distribution of these N-glycans are subsequently derived using the BRAIN algorithm (<xref ref-type="bibr" rid="B7">Dittwald et&#x20;al., 2013</xref>) based on their chemical formulas. To search for a particular N-glycan, the accurate mass value corresponding to its most abundant isotopic ion is searched against the peaks in a full MS spectrum with a given mass tolerance. The most abundant isotopic ion is expected to be most intensive and thus is most likely to be observed in an experimental spectrum if the N-glycan is indeed present in the sample. We adopted the same spectrum pre-processing procedure including charge derivation and peak picking as that used in GlycoHybridSeq. To speed up searching of peaks within a given mass tolerance, a bucket search algorithm is employed as employed in GlycoHybridSeq. After the most abundant isotopic ion is matched, we extend the peak matching of other isotopic ions to generate a matched isotopic envelope, which is then scored against the theoretical isotopic distribution as Pearson Coefficient (r)<disp-formula id="e6">
<mml:math id="m17">
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>t</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mstyle>
<mml:mtext>&#x200b;</mml:mtext>
</mml:msup>
<mml:msup>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>o</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mstyle>
<mml:mtext>&#x200b;</mml:mtext>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:msup>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>t</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
<mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>o</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mtext>&#x200b;</mml:mtext>
</mml:msup>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(6)</label>
</disp-formula>where <inline-formula id="inf12">
<mml:math id="m18">
<mml:mrow>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf13">
<mml:math id="m19">
<mml:mrow>
<mml:msub>
<mml:mi>o</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are the intensities of the matched theoretical and observed isotopic ions, and <inline-formula id="inf14">
<mml:math id="m20">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>t</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> ad <inline-formula id="inf15">
<mml:math id="m21">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>o</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> are the average intensities of all peaks in the isotopic envelope. Pearson Coefficient measures the similarity of two isotopic clusters, which offer similar measures such as Cosine Similarity, but are invariant to linear transformation of data (<xref ref-type="bibr" rid="B20">Van Dongen and Enright, 2012</xref>). A user-defined cutoff is used (default 0.9) to reduce likely false N-glycan annotation.</p>
<p>To assess the abundance of a glycan, the extracted-ion chromatogram (XIC) of glycan ion is generated by GlycanGUI by searching theoretical m/z over all full-MS spectra (as described above) and measuring the corresponding peak area at each retention time. The peak area is computed from the combined intensity of top three isotope peaks, which is adopted from the concept of top three-isotopes quantification (3TIQ) algorithm (<xref ref-type="bibr" rid="B14">Park et&#x20;al., 2016</xref>). It is reported that considering top three isotope peaks as in 3TIQ algorithm offers more sensitive results with better signal-to-noise ratio (<xref ref-type="bibr" rid="B14">Park et&#x20;al., 2016</xref>). To detect individual XIC of glycan isomers (that have the same composition and thus theoretical m/z), the XIC peak detection algorithm reported by Aoshima et&#x20;al. <xref ref-type="bibr" rid="B3">Aoshima et&#x20;al. (2014)</xref> is adopted with customized modification to suit for glycan detection. Briefly, the point of highest intensity (i.e.,&#x20;apex) is first located by searching the local maximum over XIC peaks. The leftmost and rightmost neighbor of the apex are then tracked starting from the center of apex, which are defined as the peak boundary higher than (or equal to) a given cutoff (by default 50%) of apex intensity. The left and right bounds of XIC are discovered by extending the leftmost and rightmost neighbor using a local minimum algorithm. The peak area of the detected glycan isomers were summed<disp-formula id="e7">
<mml:math id="m22">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>a</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:munder>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>l</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>f</mml:mi>
<mml:mi>t</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>h</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:munder>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(7)</label>
</disp-formula>where <inline-formula id="inf16">
<mml:math id="m23">
<mml:mrow>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the peak area over the detected range within left and right bound in&#x20;XIC.</p>
</sec>
</sec>
<sec sec-type="results|discussion" id="s3">
<title>3 Results and Discussion</title>
<sec id="s3-1">
<title>3.1 GUI Calibration</title>
<p>GlycanGUI provides a graphic user interface that takes mass spectra data containing dextran ladder as the internal standard (as shown in <xref ref-type="sec" rid="s9">Supplementary Figure S1</xref>). It determines Glucose Unit (GU) values by fitting to 3rd order polynomial (by default) or logarithmic function (<xref ref-type="sec" rid="s9">Supplementary Figure S1</xref>), which allows for calibrating the retention time to the corresponding Glucose Unit Index (GUI). This calibration process involves the determination of retention time of dextrin ladders with different number of GUs (i.e.,&#x20;GU 2&#x2013;12) followed by the polynomial (or logarithmic) curve fitting.</p>
<p>To assign the peaks of the dextrin ladders in LC-MS data, each peak in a full MS spectrum was first compared against the theoretical mass-to-charge-ratios of dextrin ladders with putative ion charges (by default up to &#x2b;3) and abducts (by default the proton) using a user-defined mass tolerance. For each dextrin of a specific GU, only the most abundant isotopic ion among all matched peaks in the spectrum is recorded, along with its GU, retention time and the maximum intensity of matched peaks. However, sometimes multiple dextrin ladders with different number of GUs may be extracted from the same spectrum (<xref ref-type="fig" rid="F1">Figure&#x20;1</xref> for an example). We employed the dynamic programming algorithm as described in the Methods section to resolve the ambiguity. As shown in <xref ref-type="fig" rid="F2">Figure&#x20;2</xref>, the retention time of the extracted peaks are consistent with the manual assignment in the previous study (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>) (<xref ref-type="sec" rid="s9">Supplementary Figure S2</xref>), indicating GlycoGUI assigned the internal standards (dextrin ladders) accurately. Notably, for dextrins with large GUI (e.g., for <inline-formula id="inf17">
<mml:math id="m24">
<mml:mrow>
<mml:mi>G</mml:mi>
<mml:mi>U</mml:mi>
<mml:mi>I</mml:mi>
<mml:mo>&#x3e;</mml:mo>
<mml:mn>10</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>), their peak intensities become so low that are hard to be distinguished from the baseline peaks, which is difficult for manual assignment. On the other hand, the automated peak assignment by GlycoGUI can avoid such potential errors in manual assignment.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>The peak assignment of dextrin ladders in a full MS spectrum.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g001.tif"/>
</fig>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>The elution profile of dextrin ladders extracted by using peak matching and the dynamic programming algorithm.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g002.tif"/>
</fig>
<p>To calibrate the retention time into GUI, the value of GUs (i.e.,&#x20;GUIs) are regressed against the retention times of the dextrin ladders (corresponding to ithe most abundant isotopic peak) using a polynomial function as described in Method. <xref ref-type="fig" rid="F3">Figure&#x20;3</xref> shows the standard plot of the GUI against the retention times for the dextrin ladders observed in a human serum sample. The coefficient of determination (<inline-formula id="inf18">
<mml:math id="m25">
<mml:mrow>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>) is greater than 0.99 from the curve fitting, indicating the high correlation between the retention time and GUI. Alternatively, a logarithmic function can also be used for fitting the standard curve of GUIs against the retention time (as shown in <xref ref-type="sec" rid="s9">Supplementary Figure S3</xref>). We implemented both curve fitting functionalities, since our previous study showed the polynomial fitting gave the best fitting results (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>), while Ashwood et&#x20;al. concluded that the logarithmic fitting best described the retention profile of the dextran ladders (<xref ref-type="bibr" rid="B4">Ashwood et&#x20;al., 2019</xref>). In our experiments, both methods received high correlation (<inline-formula id="inf19">
<mml:math id="m26">
<mml:mrow>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x3e;</mml:mo>
<mml:mn>0.99</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>), which implies that the calibration can achieve satisfactory results by using either method, as long as the same method is consistently used for the calibrations of all samples being compared. For the results presented below, the polynomial fitting is used unless otherwise stated.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>The curve fitting of GUI as a polynomial function of the retention time on dextrin ladders with different number of GUs.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g003.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>3.2 Glycan Annotation</title>
<p>To facilitate glycan identifications, glycanGUI provides a graphic user interface that allows user to annotate glycans with GUI (as shown in <xref ref-type="sec" rid="s9">Supplementary Figure S4</xref>). With reported GUI of annotated glycans, the annotated glycans can be easily verified according to calibrated retention time (<xref ref-type="bibr" rid="B4">Ashwood et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>). Moreover, the validation of annotated glycans can be conducted with GUI library of known glycans (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>) to filter putative false glycan annotations.</p>
<p>To evaluate the glycan annotation and GUI calibration by GlycanGUI, the GUI values reported by GlycanGUI for the annotated N-glycans in human serum glycomic data were compared with our manual calibrations reported previously (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>). As shown in <xref ref-type="fig" rid="F4">Figure&#x20;4</xref>, the calibrated GUI values by GlycanGUI is highly correlated with the manually calibrated values (<inline-formula id="inf20">
<mml:math id="m27">
<mml:mrow>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.97</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>), indicating the automated glycan annotation and GUI calibration by GlycanGUI achieved satisfactory results. Notably, a total of 275&#x20;N-glycans were annotated by GlycanGUI, while only 51 of them were manually annotated (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>); the above comparison is performed on the N-glycans that were annotated by both methods. Given the 1,046 full MS spectra acquired in this LC-MS study, the full annotation of all N-glycans involves tremendous manual efforts. Hence, GlycanGUI serves as a reliable software tool to automate the data processing that will drastically reduces the time for data analyses.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>The comparison of GUI values calibrated by using GlycanGUI and by manual calibration for the N-glycans in human serum glycomic&#x20;data.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g004.tif"/>
</fig>
<p>In addition to the GUI values, GlycanGUI performs label-free quantification of the glycans with calibrated GUI values based on the peak areas in the extracted ion chromatograms (XIC). As shown in <xref ref-type="fig" rid="F5">Figure&#x20;5</xref>, the XIC reported by GlycanGUI is highly correlated (<inline-formula id="inf21">
<mml:math id="m28">
<mml:mrow>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.9995</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>) to that obtained manually using Xcalibur (see <xref ref-type="sec" rid="s9">Supplementary Figure S6</xref>), indicating quantitative measurement by GlycanGUI is reliable. Moreover, GlycanGUI can automatically distinguish the glycan isomers based on the peaks in the XIC of the precursor ions with the same m/z (within the mass tolerance). The peak areas and the corresponding GUI values of the isomers are reported, respectively, as shown in <xref ref-type="fig" rid="F6">Figure&#x20;6</xref>. A full mass chromatography of all the N-glycans automatically identified from human serum spiked dextrin is shown as in <xref ref-type="fig" rid="F7">Figure&#x20;7</xref>. A large portion of N-glycans (60.4%) elutes during 30&#x2013;40&#xa0;min, while several major elution peaks are observed over 20&#x2013;50&#xa0;min. It is worth to mention that identified N-glycans may requires human validation, especially when interference are present. To improve glycan identification, tandem mass spectrometry (MS/MS) can be performed along with full MS so that N-glycans can be identified according to ion fragments. This improvement will be pursued in our future endeavour.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>The comparison of extracted ion chromatograms reported by GlycanGUI, and by manual extraction using Xcalibur, respectively. The intensity is normalized to the highest intensity.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>The calibration and quantification of glycan isomers distinguished by different peaks in the extracted ion chromatogram of the precursor ions with the same m/z.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g006.tif"/>
</fig>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>The full mass chromatography of the total N-glycans from human serum spiked dextrin.</p>
</caption>
<graphic xlink:href="fchem-09-707382-g007.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="conclusion" id="s4">
<title>4 Conclusion</title>
<p>Glycan annotation and label-free quantification from LC-MS data often involve time consuming manual efforts. Here, we report a open-source software tool GlycanGUI for automated calibration of the elution time of glycans into GUI values and the quantification of the corresponding ions based on peak intensities. The source code of GlycanGUI is released on Github at <ext-link ext-link-type="uri" xlink:href="https://github.com/ruizhang84/GlycanGUIApp">https://github.com/ruizhang84/GlycanGUIApp</ext-link>. We note that although GlycanGUI has been tested extensively on glycomic datasets, the glycan annotation results should be used with caution. To produce the more reliable results for glycan annotation, GlycanGUI is preferred to be used in combination with a library of glycan GUIs (<xref ref-type="bibr" rid="B9">Gautam et&#x20;al., 2020</xref>) or with glycan annotation software tools (<xref ref-type="bibr" rid="B22">Yu et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B10">Hu et&#x20;al., 2015</xref>). In the future, we plan to implement glycan identification algorithm in GlycanGUI, for example, by exploiting the fragment ion patterns in tandem mass spectra.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s9">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>RZ and HT designed the study; RZ, SG, WP and YH investigated the study; RZ and HT drafted the manuscript; RZ, SG, WP, YH, YM, and HT reviewed and edited the manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was supported by NIH (5R01GM112490 and 1R01GM130091-01A1).</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
<p>The reviewer (SS) declared a past co-authorship with one of the authors (YM) to the handling Editor.</p>
</sec>
<sec id="s9">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fchem.2021.707382/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fchem.2021.707382/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abrahams</surname>
<given-names>J.&#x20;L.</given-names>
</name>
<name>
<surname>Campbell</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Packer</surname>
<given-names>N. H.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Building a Pgc-Lc-Ms N-Glycan Retention Library and Elution Mapping Resource</article-title>. <source>Glycoconj&#x20;J.</source> <volume>35</volume>, <fpage>15</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1007/s10719-017-9793-4</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Althoen</surname>
<given-names>S. C.</given-names>
</name>
<name>
<surname>Mclaughlin</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>Gauss-jordan Reduction: A Brief History</article-title>. <source>The Am. Math. monthly</source> <volume>94</volume>, <fpage>130</fpage>&#x2013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.2307/2322413</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aoshima</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Takahashi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ikawa</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kimura</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Fukuda</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Tanaka</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>A Simple Peak Detection and Label-free Quantitation Algorithm for Chromatography-Mass Spectrometry</article-title>. <source>BMC bioinformatics</source> <volume>15</volume>, <fpage>1</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1186/s12859-014-0376-0</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ashwood</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Pratt</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>MacLean</surname>
<given-names>B. X.</given-names>
</name>
<name>
<surname>Gundry</surname>
<given-names>R. L.</given-names>
</name>
<name>
<surname>Packer</surname>
<given-names>N. H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Standardization of Pgc-Lc-Ms-Based Glycomics for Sample Specific Glycotyping</article-title>. <source>Analyst</source> <volume>144</volume>, <fpage>3601</fpage>&#x2013;<lpage>3612</lpage>. <pub-id pub-id-type="doi">10.1039/c9an00486f</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ashwood</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Waas</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Weerasekera</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Gundry</surname>
<given-names>R. L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Reference Glycan Structure Libraries of Primary Human Cardiomyocytes and Pluripotent Stem Cell-Derived Cardiomyocytes Reveal Cell-type and Culture Stage-specific Glycan Phenotypes</article-title>. <source>J.&#x20;Mol. Cell Cardiol.</source> <volume>139</volume>, <fpage>33</fpage>&#x2013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1016/j.yjmcc.2019.12.012</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Campbell</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Royle</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Radcliffe</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Dwek</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Rudd</surname>
<given-names>P. M.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Glycobase and Autogu: Tools for Hplc-Based Glycan Analysis</article-title>. <source>Bioinformatics</source> <volume>24</volume>, <fpage>1214</fpage>&#x2013;<lpage>1216</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btn090</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dittwald</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Claesen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Burzykowski</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Valkenborg</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gambin</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Brain: a Universal Tool for High-Throughput Calculations of the Isotopic Distribution for Mass Spectrometry</article-title>. <source>Anal. Chem.</source> <volume>85</volume>, <fpage>1991</fpage>&#x2013;<lpage>1994</lpage>. <pub-id pub-id-type="doi">10.1021/ac303439m</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fabini</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Freilinger</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Altmann</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>I. B. H.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Identification of Core &#x3b1;1,3-Fucosylated Glycans and Cloning of the Requisite Fucosyltransferase cDNA from <italic>Drosophila melanogaster</italic>
</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>276</volume>, <fpage>28058</fpage>&#x2013;<lpage>28067</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m100573200</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gautam</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Cho</surname>
<given-names>B. G.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Banazadeh</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Glucose Unit index (Gui) of Permethylated Glycans for Effective Identification of Glycans and Glycan Isomers</article-title>. <source>Analyst</source> <volume>145</volume>, <fpage>6656</fpage>&#x2013;<lpage>6667</lpage>. <pub-id pub-id-type="doi">10.1039/d0an00314j</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>C.-Y.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Mechref</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Automated Annotation and Quantitation of Glycans by Liquid Chromatography/electrospray Ionization Mass Spectrometric Analysis Using the Multiglycan-Esi Computational Tool</article-title>. <source>Rapid Commun. Mass. Spectrom.</source> <volume>29</volume>, <fpage>135</fpage>&#x2013;<lpage>142</lpage>. <pub-id pub-id-type="doi">10.1002/rcm.7093</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mellis</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Baenziger</surname>
<given-names>J.&#x20;U.</given-names>
</name>
</person-group> (<year>1981</year>). <article-title>Separation of Neutral Oligosaccharides by High-Performance Liquid Chromatography</article-title>. <source>Anal. Biochem.</source> <volume>114</volume>, <fpage>276</fpage>&#x2013;<lpage>280</lpage>. <pub-id pub-id-type="doi">10.1016/0003-2697(81)90480-2</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ohtsubo</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Marth</surname>
<given-names>J.&#x20;D.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Glycosylation in Cellular Mechanisms of Health and Disease</article-title>. <source>Cell</source> <volume>126</volume>, <fpage>855</fpage>&#x2013;<lpage>867</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2006.08.019</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pabst</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Altmann</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Glycan Analysis by Modern Instrumental Methods</article-title>. <source>Proteomics</source> <volume>11</volume>, <fpage>631</fpage>&#x2013;<lpage>643</lpage>. <pub-id pub-id-type="doi">10.1002/pmic.201000517</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Park</surname>
<given-names>G. W.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J.&#x20;Y.</given-names>
</name>
<name>
<surname>Hwang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J.&#x20;Y.</given-names>
</name>
<name>
<surname>Ahn</surname>
<given-names>Y. H.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>H. K.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Integrated Glycoproteome Analyzer (I-gpa) for Automated Identification and Quantitation of Site-specific N-Glycosylation</article-title>. <source>Sci. Rep.</source> <volume>6</volume>, <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1038/srep21175</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pinho</surname>
<given-names>S. S.</given-names>
</name>
<name>
<surname>Reis</surname>
<given-names>C. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Glycosylation in Cancer: Mechanisms and Clinical Implications</article-title>. <source>Nat. Rev. Cancer</source> <volume>15</volume>, <fpage>540</fpage>&#x2013;<lpage>555</lpage>. <pub-id pub-id-type="doi">10.1038/nrc3982</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Royle</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Campbell</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Radcliffe</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>White</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Harvey</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Abrahams</surname>
<given-names>J.&#x20;L.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Hplc-based Analysis of Serum N-Glycans on a 96-well Plate Platform with Dedicated Database Software</article-title>. <source>Anal. Biochem.</source> <volume>376</volume>, <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1016/j.ab.2007.12.012</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>St&#xf6;ckmann</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Adamczyk</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Hayes</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Rudd</surname>
<given-names>P. M.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Automated, High-Throughput Igg-Antibody Glycoprofiling Platform</article-title>. <source>Anal. Chem.</source> <volume>85</volume>, <fpage>8841</fpage>&#x2013;<lpage>8849</lpage>. <pub-id pub-id-type="doi">10.1021/ac402068r</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stowell</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Ju</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Cummings</surname>
<given-names>R. D.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Protein Glycosylation in Cancer</article-title>. <source>Annu. Rev. Pathol. Mech. Dis.</source> <volume>10</volume>, <fpage>473</fpage>&#x2013;<lpage>510</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-pathol-012414-040438</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van den Steen</surname>
<given-names>P. E.</given-names>
</name>
<name>
<surname>Van Aelst</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Hvidberg</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Piccard</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Fiten</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Jacobsen</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>The Hemopexin and O-Glycosylated Domains Tune Gelatinase B/mmp-9 Bioavailability via Inhibition and Binding to Cargo Receptors</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>281</volume>, <fpage>18626</fpage>&#x2013;<lpage>18637</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m512308200</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Van Dongen</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Enright</surname>
<given-names>A. J.</given-names>
</name>
</person-group> (<year>2012</year>). <source>Metric Distances Derived from Cosine Similarity and pearson and spearman Correlations</source>.<comment>arXiv preprint arXiv:1208.3145</comment>
</citation>
</ref>
<ref id="B21">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Varki</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Cummings</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Esko</surname>
<given-names>J.&#x20;D.</given-names>
</name>
<name>
<surname>Stanley</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Hart</surname>
<given-names>G. W.</given-names>
</name>
<name>
<surname>Aebi</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2009</year>). <source>Essentials of glycobiology</source>. <comment>3rd Edn</comment>, <publisher-loc>Cold Spring Harbor, NY</publisher-loc>: <publisher-name>Cold Spring Harbor Laboratory Press</publisher-name>. <comment>Chapter 9</comment> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>C.-Y.</given-names>
</name>
<name>
<surname>Mayampurath</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Mechref</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Automated Annotation and Quantification of Glycans Using Liquid Chromatography-Mass Spectrometry</article-title>. <source>Bioinformatics</source> <volume>29</volume>, <fpage>1706</fpage>&#x2013;<lpage>1707</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt190</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lubman</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Mechref</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Glycohybridseq: Automated Identification of N-Linked Glycopeptides Using Electron Transfer/high-Energy Collision Dissociation (Ethcd)</article-title>. <source>J.&#x20;Proteome Res.</source> <volume>20</volume>, <fpage>3345</fpage>&#x2013;<lpage>3352</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.1c00245</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>