- 1Radiation Oncology Key Laboratory of Sichuan Province, Department of Radiology, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital and Institute, Sichuan Cancer Center, Affiliated Cancer Hospital of University of Electronic Science and Technology of China, Chengdu, China
- 2Department of Nephrology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- 3Department of Andrology and Sichuan Human Sperm Bank, West China Second University Hospital, Sichuan University, Chengdu, China
Protein glycosylation plays a pivotal role in human semen, influencing various processes, such as spermatogenesis, maturation, sperm motility, capacitation, and fertilization. Despite its importance, the specific details regarding N/O-glycosylation within human semen proteins have largely remained unknown. To address this challenge, an integrated glycoproteomic platform (termed GlycoIP) was developed, enabling the simultaneous analysis of both intact N- and O-glycopeptides directly from human semen samples. Characterizing these intact glycopeptides is particularly challenging, yet it provides invaluable insights into the structure and function of both glycans and their attachment sites. In this study, our platform enabled the identification of 1,833 unique N-glycopeptides and 720 unique O-glycopeptides. This approach revealed extensive and precise site-specific N/O-glycosylation data, highlighting 438 potential O-glycosylation sites from 148 distinct O-glycoproteins. Notably, we conducted site-specific N/O-glycosylation profiling on several unique glycoproteins, including sperm equatorial segment protein 1 (SPESP1), which is located on human sperm, and plasma serine protease inhibitor (SERPINA5), which is presented in both sperm and seminal plasma. In summary, this platform provides a promising approach for comprehensive profiling of protein site-specific N/O-glycosylation within a single experiment. This advancement paves the way for further functional studies on glycoproteins and their roles in male infertility, enhancing our understanding of this complex field.
Introduction
Male reproductive issues, particularly those stemming from diminished semen quality, have emerged as a pressing health concern worldwide (Skakkebaek et al., 2016). Human semen is composed of sperm (S) and seminal plasma (SP), both of which are rich in secreted glycoproteins essential for male fertility (Yang et al., 2015). Among these glycoproteins, several play critical roles in human fertility, such as the sperm surface glycoprotein CD52 (Diekman et al., 1999). These glycoproteins are instrumental in various stages of male reproductive processes, including spermatogenesis, maturation, extracellular quality control, capacitation, sperm-egg recognition, and ultimately fertilization (Luo et al., 2023). Abnormal glycosylation may result in compromised semen functionality and lead to infertility. Protein glycosylation can be classified mainly into two types: N-glycosylation (N-glycans are specifically attached to asparagine (N) residues in an N-X-S/T/C motif, X≠P, proline (P), cysteine (C)) and O-glycosylation (with serine (S) or threonine (T)) (Zhang et al., 2018). The presence of O-glycosites near N-glycosites suggests a potential “O-Follow-N” rule (Tian et al., 2021). While this is not the main focus of our research, this observation may provide insights into the coordinated regulation of N/O-glycosylation events in proteins. Understanding these processes could be significant for deciphering glycosylation patterns in human semen. Exploring human semen glycoproteins could be crucial in revealing the structures and functions of these proteins, as it also contributes to our understanding of specific diseases and helps identify potential therapeutic targets (Liu et al., 2022). For example, glycodelin is a key glycoprotein in reproduction, exists in four distinct glycoforms, each serving a specific function. This means that the glycan structures attached to the protein are crucial in determining its role in fertilization processes (Seppälä et al., 2007). Research underscores the importance of these glycoproteins, indicating that both protein expression and glycan structures are likely essential to these events (Ka et al., 2019). Furthermore, previous studies have shown that N-glycoproteins are vital for protecting spermatozoa from immune attack in the female genital tract, as well as for their recognition and interaction with the egg (Wang et al., 2013). However, our understanding of the specific N/O-glycosylation patterns within these glycoproteins in human semen is still quite limited. This is primarily due to the challenges associated with accurately characterizing intact N/O-glycopeptides directly from human semen samples.
In recent years, the field of glycoproteomics has experienced significant advancements because of the development of cutting-edge mass spectrometers with ultra-high resolution and sophisticated software (Zhao et al., 2023; Cao et al., 2021). The methodology of glycoproteomics consists of several essential steps: selecting and processing clinical samples, conducting LC-MS/MS analysis, performing bioinformatics analysis, and verifying the results (Li et al., 2023; Zhao et al., 2024). Great progress has been achieved at each step. For example, precise analysis software has been developed to interpret glycoproteomic data. These software tools can be categorized based on variations in their core algorithms, primarily into two types: peptide-first searching tools (Byonic, MSFraggerGlyco, etc.) and glycan-first searching tools (pGlyco series) (Lee et al., 2016; Polasky et al., 2020; Liu et al., 2017). A thorough review has been carried out on their advancements, principles, and unique characteristics (Cao, 2024). Specifically, Li et al. employed the advanced Orbitrap Fusion Lumos mass spectrometer alongside StrucGP software, revealing that sialylated N-glycans could be crucial for the process of semen liquefaction (Li et al., 2023). This discovery opens new avenues for understanding the biochemical foundations of fertility. Similarly, Luo et al. made significant strides by compiling an extensive database on human semen O-glycosylation. They presented glycoproteomics based on two complementary fragmentation methods (GlycoTCFM), highlighting the power of modern analytical techniques to uncover the intricate details of N/O-glycosylation patterns (Luo et al., 2023). Additionally, an optimized combination of electron-transfer/higher-energy collisional dissociation (EThcD) and the stepped collision energy/higher-energy collisional dissociation (sceHCD) method (EThcD-sceHCD) developed by our team has been demonstrated to significantly increase the accuracy and depth of intact glycopeptide identification. It also complements the common fragmentation mode sceHCD and enhances the analytical capability of intact N/O-glycopeptides in both simple glycoproteins and complex clinical samples (Zhang et al., 2021a; Zhang et al., 2021b; Luo et al., 2022a; Mao et al., 2022; Zeng et al., 2022; Riley et al., 2020; Mao et al., 2021). These findings suggest that the analysis of intact N/O-glycopeptides could be key to unraveling the mysteries of male reproductive disorders (Cheng et al., 2024).
The aim of this study was to develop a comprehensive platform for the simultaneous profiling of intact N/O-glycopeptides from human semen. The advanced platform offers accurate and in-depth site-specific information on N/O-glycosylation. Detailed insights into the N/O-glycosylation patterns in human semen is essential for understanding their effects on semen quality and male fertility. This knowledge could be crucial for evaluating the role of glycosylation in reproductive health and may open new avenues for research on male infertility.
Materials and methods
Materials and chemicals
All electrophoresis reagents were purchased from Bio-Rad (Richmond, CA). Dithiothreitol (DTT), iodoacetamide (IAA), trifluoroacetic acid (TFA), formic acid (FA), and ammonium bicarbonate (NH4CO3) were obtained from Sigma (St. Louis, United States). Acetonitrile (ACN) was acquired from Merck (Darmstadt, Germany). Sequencing grade trypsin was purchased from Enzyme and Spectrum (Beijing, China). Peptide N-glycosidase F (PNGase F) was purchased from Sigma (St. Louis, United States). O-Glycoprotease (IMPa) was purchased from New England Biolabs (Ipswich, United States). Zwitterionic hydrophilic interaction chromatography (ZIC-HILIC) beads were purchased from Fresh Bioscience (Shanghai, China). The C8 extraction disks were obtained from 3 M Empore (St. Paul, United States). The Bradford protein assay kit was purchased from Thermo Fisher Scientific (Rockford, United States). Centrifugal filters (30 kDa) were purchased from Merck Millipore (Carrigtwohill, Ireland). All other chemicals and reagents were purchased from Sigma-Aldrich (St. Louis, MO, United States) or Merck (Darmstadt, Germany).
Human semen collection
Human semen was collected from healthy donors at West China Second University Hospital of Sichuan University, Sichuan Province, China. Written informed consent was obtained from all volunteers. The human semen examination results from 40 volunteers are shown in Supplementary Table S1. The 40 semen samples were pooled and centrifuged at 4°C (2,000×g for 20 min) to separate the sperm (bottom pellets) from the seminal plasma (upper supernatant). The S and SP were collected in 10 mL tubes. This study abided by the Declaration of Helsinki principles and was approved by the Ethics Committee of West China Second University Hospital of Sichuan University.
Protein extraction, SDS-PAGE and digestion
The obtained S and SP were processed using our optimized approach (Luo et al., 2022b). To evaluate the extraction and quantification of proteins from S and SP, these proteins were loaded onto 12% sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS−PAGE) gels. Electrophoresis was performed at 120 V for 1 h at 4°C. The lanes were dyed using a BluPower Fast Staining Coomassie (Zoonbio, Nanjing, China). The gel was washed with deionized water. Proteins were subjected to proteolysis using a filter-aided sample preparation (FASP) protocol. Briefly, 1 mg of protein was placed in a 30-kDa filter. Following centrifugation at 13,000 × g for 15 min at 25°C, 200 μL of 8 M urea buffer containing 20 mM DTT was added, and the reduction reaction was allowed to proceed for 4 h at 37°C. Next, 50 mM IAA was added, and the mixture was incubated in the dark for 1 h at room temperature. The urea buffer was then exchanged for 50 mM NH4CO3 via centrifugation at 13,000 × g for 15 min. Finally, trypsin (20 μg) was added to each filter tube, and the proteins were digested overnight at 37°C. Peptide mixtures (Fraction 1, F1) were obtained by FASP and centrifugation at 13,000×g for 15 min. The obtained peptide concentration was measured through a colorimetric assay that quantifies the absorbance at 480 nm. Next, we used a SpeedVac to dry the peptides thoroughly. After vacuum drying, 500 μg of peptide was resuspended in 70 μL of binding buffer (80% ACN/0.2% TFA) and mixed with ZIC-HILIC (Mao et al., 2020). Following washing with binding buffer, we secured Fraction 2 (F2). Fraction 3 (F3) was subsequently acquired by elution with 0.1% TFA. Next, we processed half of F2 with 4 U of IMPa, resulting in Fraction 4 (F4). Simultaneously, treating half of F3 with 2 U of PNGase F led to the capture of Fraction 5 (F5). Additionally, by digesting half of F5 with 1 U of IMPa, we were able to produce Fraction 6 (F6).
EThcD-sceHCD-MS/MS and sceHCD-MS/MS analysis
All the fractions were analyzed via an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher). They were separated on a column (ReproSil-Pur C18-AQ, 1.9 μm, 100 μm inner diameter, length 25 cm; Dr Maisch) over a 78-min gradient (buffer A, 0.1% FA in water; buffer B, 0.1% FA in 80% ACN) at a flow rate of 400 nL/min (0–8 min, 5–12% B; 8–58 min, 12–22% B; 58–70 min, 22–32% B; 70–71 min, 32–90% B; and 71–78 min, 90% B) and detected in the data-dependent acquisition mode. The detailed parameters of EThcD-sceHCD-MS/MS and sceHCD-MS/MS have been described previously (Luo et al., 2023).
Data analysis
The raw data files were searched against the human UniProt database (21,007 human protein entries) using Byonic software (version 5.3.5, Protein Metrics, Inc.). The mass tolerances for the precursors and fragment ions were set at ±6 ppm and ±20 ppm, respectively. Two missed cleavage sites were allowed for trypsin digestion. Thermo Scan headers were used for fragmentation. The fixed modification was carbamidomethyl (C), and the variable modifications included oxidation (M), acetylation (protein N-term) and deamidation (N). In addition, the 182 human N-glycan compositions were specified as N-glycosylation. The 6 most common human O-glycans (HexNAc(1), HexNAc(2), HexNAc(1)Hex(1), HexNAc(2)Hex(1), HexNAc(1)Hex(1)NeuAc(1), and HexNAc(1)Hex(1)NeuAc(2)) were specified as O-glycosylation. HexNAc [H+]/204.087 m/z was chosen for MS/MS filtering to speed up the search. Stricter quality control methods for intact glycopeptide identification were implemented, requiring a score of no less than 300. Five monosaccharide core structures should be contained in each N-glycan. The identification of N-glycosites requires the presence of consensus glycosylation motif (N-X-S/T/C, X≠P). Data preprocessing, statistical analysis, and GO enrichment performed via our in-house Wukong platform and R packages. Model building based on the AlphaFold-predicted structure (AF-Q6UW49-F1) of sperm equatorial segment protein 1 (SPESP1) was performed using PyMOL (version 2.6.0), GlycoWorkbench (version 2.1) and GlycoSHIELD (Damerell et al., 2015; Jumper et al., 2021; Tsai et al., 2024).
Results and discussion
Experimental design for simultaneous profiling of intact N/O-glycopeptides in human semen
To achieve the simultaneous profiling of intact N/O-glycopeptides from human semen, we developed an innovative glycoproteomic integration platform (termed GlycoIP). As shown in Figure 1, the semen mixture from 40 healthy men (Supplementary Table S1) was separated by centrifugation into two components: sperm (S) and seminal plasma (SP). This separation is crucial for the in-depth identification and functional interpretation of N/O-glycosylation in human semen proteins, given the significant differences in protein composition and abundance between S and SP(32). The findings were further validated by SDS−PAGE analysis (Supplementary Figure S1). Using our optimized method for protein extraction and trypsin digestion, we obtained peptide mixture Fraction 1 (F1), which includes both high-abundance naked peptides and low-abundance N/O-glycosylated peptides (Luo et al., 2022b). ZIC-HILIC was subsequently used to enrich intact N/O-glycopeptides because it can enrich more intact glycopeptides compared to traditional HILIC (Mao et al., 2020). ZIC-HILIC contains a zwitterionic group, which enhances hydrophilic interactions with glycopeptides more effectively than traditional HILIC. This process yields two components with distinct hydrophilicities (Wang et al., 2023). F2 comprises hydrophobic peptides along with some O-glycopeptides, while F3 contains hydrophilic peptides and the majority of N/O-glycopeptides, which can be eluted sequentially from the ZIC-HILIC gradient. In comparison to N-glycosylation, O-glycosylation is less abundant and poses greater challenges for accurate identification (Zhang et al., 2018; Zhang et al., 2021c). This difficulty arises because N-glycosylation occurs co-translationally and is more widespread in proteins, whereas O-glycosylation is often post-translational and may be more restricted to specific proteins. To enable the accurate and sensitive analysis of intact O-glycopeptides, two strategies were employed. The first strategy involved the use of immunomodulating metalloprotease (IMPa) from Pseudomonas aeruginosa to digest intact O-glycopeptides that contain mucin-type O-glycans (F4 and F6). It enables accurate identification of O-glycan structures at each O-glycosite (Vainauskas et al., 2022; Suttapitugsakul et al., 2023; Helms et al., 2023; Noach et al., 2017; Kida et al., 2016; Riley and Bertozzi, 2022). Using this method, nearly 100 O-glycoproteins have been identified in mouse brains (Suttapitugsakul et al., 2023). However, IMPa might face a challenge where sialylation could interfere with its cleavage efficiency. The presence of sialic acid residues on glycans can obstruct the ability of IMPa to reach and break the glycosidic bonds effectively. The second strategy involves the use of peptide N-glycosidase F (PNGase F) for de-N-glycosylation, which helps to reduce sample complexity before O-glycopeptide analysis and ensures the identification of N-glycosites (F5) (Huang et al., 2015). Finally, six fractions can be obtained from each S or SP sample. The collected intact N/O-glycopeptide fractions were subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis with both sceHCD and EThcD-sceHCD. Consequently, two complementary fragmentation methods (TCFM) were proposed for the in-depth and accurate identification of O-glycosylation in this study (Luo et al., 2023). Byonic was selected for its versatility in handling various fragmentation modes and its efficacy in analyzing N/O-glycosylation. Additionally, during a community evaluation of glycoproteomic informatics solutions, Byonic demonstrated exceptional performance in the analysis of intact N/O-glycopeptide data (Kawahara et al., 2021). In this study, the semen samples from all volunteers were pooled together and subsequently separated into two distinct components: S and SP. Each component was further divided into six fractions for analysis, with each fraction being examined using two different modes. As a result, a total of 24 data files were generated via the platform, allowing for the simultaneous profiling of intact N/O-glycopeptides from human semen in a single experiment.

Figure 1. Illustration of the glycoproteomic integration platform (GlycoIP) designed for the simultaneous profiling of intact N/O-glycopeptides in human semen.
Intact N-glycopeptide analysis of human semen N-glycoproteins
Human semen contains a thick glycocalyx that is critical for sperm survival (Tecle and Gagneux, 2015). One important mechanism for regulating spermatogenesis and other functions is N-glycosylation (Lan et al., 2020). In this study, a total of 1,833 unique intact N-glycopeptides with high confidence were identified in the F3 of human semen (Supplementary Table S2). Compared with those in S (781 intact N-glycopeptides and 88 N-glycans), more intact N-glycopeptides (1,431) and N-glycans (106) were identified in SP. Furthermore, 379 intact N-glycopeptides and 85 N-glycans overlapped. (Figures 2A,B). These results suggest that human S and SP have significantly different N-glycosylation patterns and that these N-glycosylations may perform different functions, such as supporting sperm function, modulating maternal immune responses, and promoting successful fertilization (Pilch and Mann, 2006; Pang et al., 2009). Our team showed that EThcD-sceHCD complements sceHCD in terms of the number of intact N-glycopeptides identified from immunoglobulin G (IgG), human immunodeficiency virus type 1 envelope glycoprotein gp120 (HIV-1 gp120), hepatoblastoma cells line (HepG2), urine, plasma and tissue (Zhang et al., 2021a; Zhang et al., 2021b; Luo et al., 2022a; Mao et al., 2022; Zeng et al., 2022). In this study, EThcD-sceHCD and sceHCD were comparable and complementary in terms of the number of intact N-glycopeptides identified from human semen (Figure 2C). The analysis of N-glycans in S and SP revealed both high levels of fucosylation and sialylation. While there were some differences in the composition of the top five N-glycans between S and SP, HexNAc (Yang et al., 2015)Hex (Zhang et al., 2018) emerged as the most abundant N-glycan in both compositions (Figure 2D). Furthermore, distinct variations in N-glycan types were observed between S and SP. In S, the percentages of high-mannose, hybrid and complex N-glycan were 39.0%, 9.6%, and 51.4%, respectively. In contrast, the percentages of high-mannose, hybrid, and complex N-glycans in SP were 23.1%, 14.4%, and 62.5%, respectively (Figure 2D). These differences may reflect the functional differences of glycans in S and SP. Notably, glycans in SP are widely recognized for their critical role as immunomodulatory agents, as supported by the established theory of the “human fetoembryonic defense system” (Clark, 2014). These glycans contribute to immune tolerance through a mechanism that involves unique glycosylation characteristics seldom found in other bodily fluids. Specifically, SP contains specialized glycoproteins abundant in immune-related glycopeptides, which potentially serve as binding sites for endogenous lectins present on immune cell surfaces (Szczykutowicz et al., 2021).

Figure 2. Analysis of intact N-glycopeptides in human semen. (A) Comparison of the number of identified intact N-glycopeptides among S, SP, and semen. (B) Comparison of the number of identified N-glycans among human S, SP, and semen. (C) Analysis of the number of intact N-glycopeptides identified by EThcD-sceHCD and sceHCD in human S, SP, and semen. (D) Distribution of different types (high-mannose, hybrid, and complex) and quantities of N-glycans in human S, SP, and semen.
For F5 of human S and SP, it is possible to analyze N-glycosites and N-glycoproteins. A total of 1,163 N-glycosites from 449 N-glycoproteins were identified in human semen (Supplementary Table S3). Notably, SP contained a significantly higher number of N-glycoproteins (346) and N-glycosites (723) compared to S (Figure 3A). The conserved motif N-X-S/T (X≠P) was significantly enriched, with the N-X-T motif occurring approximately 1.3 times more frequently than the N-X-S motif in both S and SP (Figure 3B). Beyond identifying these glycosites, we can also assess the occupancy of each glycosite by comparing the deamidated peptide versions to their non-amidated counterparts within the observed glycopeptides within these fractions. This comparison provides valuable insights into site occupancy, as well as micro- and macroheterogeneity. Gene Ontology (GO) analysis revealed differences in biological processes (BP) between S and SP N-glycoproteins, including spermatogenesis and proteolysis (Figure 3C). In recent years, the N-glycoproteomes of human S and SP have been investigated by LC−MS/MS (Zhang et al., 2022). For example, the GO annotations of 372 N-glycoproteins in human SP were located primarily in the extracellular region and associated with biological adhesion (Saraswat et al., 2016). Similarly, 297 N-glycoproteins in human S are located mainly in the membrane and extracellular regions and are associated with cell recognition and fertilization (Wang et al., 2013). Notably, both deamidation and deglycosylation can result in the same mass variation. Therefore, it is of great significance to conduct the identification of N-glycosites based on motif and UniProt database. This aligns with prior research on N-glycoproteins’ localization and function, hinting at potential variations in the development of biological synthesis pathways and the roles of N-glycosylation in human S and SP proteins (Pang et al., 2009; Saraswat et al., 2016).

Figure 3. Analysis of N-glycosites and N-glycoproteins in human semen. (A) Comparison of identified N-glycoproteins and N-glycosites between human S and SP. (B) Identified N-glycosylated sequence motifs in human semen. (C) Gene Ontology (GO) analysis of biological processes (BP), cellular components (CC) and molecular functions (MF) of N-glycoproteins identified from S and SP.
In-depth characterization of site-specific O-glycosylation of human semen
O-glycosylation has been widely reported in human samples. To date, only a few dozen O-glycoproteins, primarily modified by core 1 O-glycans, have been reported in human semen (Luo et al., 2023). In this study, we identified a total of 720 unique intact O-glycopeptides and 438 O-glycosites from 148 distinct O-glycoproteins present in human semen (Supplementary Table S4). Representative spectra of these intact O-glycopeptides from the 148 distinct O-glycoproteins were shown in Supplementary Table S5. To our knowledge, this is the largest number of intact O-glycopeptides, O-glycosites, and O-glycoproteins reported in O-glycoproteomic studies of human semen (Luo et al., 2023; Lan et al., 2020).
In comparison to a recent study, the depth of O-glycosylation identification has nearly doubled, with most of these identifications being reported for the first time (Figures 4A–C) (Luo et al., 2023). The discrepancy between the two research findings may stem from differences in sample preparation methods and analytical techniques employed for identification. Notably, the distinction in O-glycosylation between S and SP is significant, with 104 O-glycosites identified from 28 O-glycoproteins (Figures 4D,E). GO analysis revealed that these O-glycoproteins are involved in distinct biological processes. Specifically, the O-glycoproteins in S were involved in signal transduction and spermatogenesis (Supplementary Figure S2), whereas those in SP were involved in proteolysis (Supplementary Figure S3). Furthermore, the analysis of O-glycans revealed that core 1 type O-glycans are the most prevalent in both S and SP, with varying percentages (Figures 4F,G). Additionally, S had more core 2 type O-glycans (HexNAc (Yang et al., 2015)Hex (Skakkebaek et al., 2016)) and fewer Tn antigens (HexNAc (Skakkebaek et al., 2016)) than did SP (Figures 4F,G). By integrating the site-specific N/O-glycosylation information, we can gain a comprehensive and systematic understanding of protein glycosylation. However, the currently identified O-glycosylation may be biased due to the limitations of existing sample processing techniques (such as trypsin and IMPa), enrichment techniques (like ZIC-HILIC), mass spectrometry analysis methods (including EThcD-sceHCD and sceHCD), and software algorithms (such as Byonic) (Ahmad Izaham et al., 2021). To further improve the characterization of site-specific O-glycosylation in human semen, future studies could focus on developing new O-glycosidases capable of cleaving O-glycopeptides that contain sialic acids for sample processing. Additionally, utilizing more comprehensive O-glycan databases or open search strategies could aid in the discovery of new O-glycans. Furthermore, the development of more sensitive analytical techniques and software, along with conducting additional validation experiments, will be crucial.

Figure 4. Site-specific O-glycosylation analysis of human S and SP O-glycoproteins. (A–C) Comparison of identified intact O-glycopeptides (A), O-glycoproteins (B), and O-glycosites (C) in human semen between this study and the previous study (Luo et al., 2023). (D,E) Comparison of identified O-glycoproteins (D) and O-glycosites (E) between human S and SP. (F,G) Identified O-glycans in human S (F) and SP (G).
Distinctive glycoproteins in human sperm
To vividly represent site-specific N/O-glycosylation within a protein structure, several tools have been developed. For example, model building was conducted using the AlphaFold-predicted structures (AF-Q6UW49-F1 and AF-P05154-F1) of human sperm equatorial segment protein 1 (SPESP1) and plasma serine protease inhibitor (SERPINA5). This process utilized PyMOL (version 2.6.0), GlycoWorkbench (version 2.1), and GlycoSHIELD (Damerell et al., 2015; Jumper et al., 2021; Tsai et al., 2024). SPESP1 is a glycoprotein that is specifically and highly expressed in the testis, localized in the acrosome during the postmeiotic stages of spermiogenesis, which includes round and elongating spermatids as well as ejaculated spermatozoa. It plays an important role in the fertilization ability of sperm and serves as a suitable target for cancer immunotherapy (Wolkowicz et al., 2003; Kosaka et al., 2021). Our current analysis indicates that the protein is expressed as both an N-glycoprotein and an O-glycoprotein, primarily in S, while it was not detected in SP within the sensitivity limits of our analysis (Supplementary Table S5). Given the greater complexity and less common occurrence of glycosylation in SP, these proteins are likely present in such low quantities that they fall below our detection threshold. Our site-specific N/O-glycosylation analysis revealed that the N-glycosite (N128) can be decorated with five N-glycan compositions (HexNAc(2)Hex(3), HexNAc(5)Hex(3), HexNAc(5)Hex(4), HexNAc(5)Hex(6), and HexNAc(6)Hex(5)). Meanwhile, the O-glycosite (S123) can be decorated with two O-glycan compositions (HexNAc(1)Hex(1), HexNAc(2)Hex(1)), whereas the O-glycosite (S130) can be decorated with three O-glycan compositions (HexNAc(1)Hex(1),HexNAc(2), HexNAc(1)) (Figure 5).

Figure 5. Site-specific N/O-glycosylation analysis of the distinctive protein sperm equatorial segment protein 1 (SPESP1) from human sperm. The AlphaFold-predicted structure of human SPESP1 was generated using PyMOL. Three glycosites (S123, N128 and S130) were identified and decorated with glycan cartoons, depicting the deduced glycan structures on the basis of their compositions.
SERPINA5 is reported to be a glycoprotein with four glycosites (T39, N249, N262, and N338) (Sun et al., 2010). It can inactivate several serine proteases involved in the reproductive system and indirectly protect components of the male genital tract from degradation by excessive release of acrosin (He et al., 1999; Jerabek et al., 2001). It also plays a role in controlling sperm motility and fertilization, as well as regulating the degradation of semenogelin during the transfer of spermatozoa from the male reproductive tract to the female tract (Cao et al., 2022; Panner Selvam et al., 2019). Our findings revealed that it is an N/O-glycosylated protein with fiver glycosites (T39, S43, N249, N262, and N338), and it is expressed in both human S and SP (Supplementary Table S5). Site-specific N/O-glycosylation analysis has shown that the reported O-glycosite T39 can be decorated with one O-glycan (HexNAc(1)Hex(1)NeuAc(2)) in sperm. In contrast, it can be decorated with two O-glycans (HexNAc(1)Hex(1)NeuAc(2) and HexNAc(1)Hex(1)NeuAc(1)) in seminal plasma. Notably, our GlycoIP-based research has led to the discovery of a new O-glycosite S43. This finding underscores the effectiveness of our method, demonstrating its ability to not only pinpoint known glycosites but also to uncover new ones. The O-glycosite S43 can be decorated with one O-glycan composition (HexNAc(1)Hex(1)NeuAc(2)), and it can be decorated with two O-glycan compositions (HexNAc(1)Hex(1),HexNAc(2)) (Figure 6). The detailed site-specific N-glycosylation information of SERPINA5 revealed that glycosylation patterns of the same protein can be differ between human S and SP. Additionally, we have included the glycan databases utilized in this study (Supplementary Table S6). These novel findings and comprehensive glycosylation data may contribute to a better understanding of glycoprotein structure and function. Additionally, our research revealed a variety of intriguing glycoproteins in S and SP, including leucine-rich repeat-containing protein 37A2, forkhead-associated domain-containing protein 1, agrin, and fibronectin. With advancements in analytical techniques, we are poised to identify an increasing number of essential seminal glycoproteins. Investigating the functions of these glycoproteins and their relationships with semen examination parameters—such as concentration, motility, morphology, genetic damage, mitochondrial function, and the acrosome reaction—will be a critical focus of future studies.

Figure 6. Site-specific N/O-glycosylation analysis of plasma serine protease inhibitor (SERPINA5) expressed in human S and SP. The AlphaFold-predicted structure of human SERPINA5 was generated using PyMOL. Three N-glycosites (N249, N262 and N338) and two O-glycosites (T39 and S43) were identified and decorated with different glycan cartoons, depicting the deduced glycan structures on the basis of their compositions.
Conclusion
In this study, we introduced an innovative platform (termed GlycoIP) designed for the comprehensive analysis of intact N/O-glycopeptides in human semen. The platform provides more accurate and site-specific N/O-glycosylation information compared to previous methods. Our findings revealed a significant number of glycopeptides and glycosites: 1,833 unique intact N-glycopeptides and 1,163 N-glycosites from 449 N-glycoproteins, as well as 720 unique intact O-glycopeptides and 438 potential O-glycosites from 148 O-glycoproteins in human semen. GlycoIP facilitates the development of a detailed, site-specific N/O-glycosylation map for each glycoprotein, providing valuable insights into the complex relationship between N/O-glycosylation patterns and male infertility. This approach not only elucidates the structural complexity of N/O-glycosylation but also paves the way for a better understanding of their roles in reproductive health.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Ethics statement
The studies involving humans abided by the Declaration of Helsinki principles and were approved by the Ethics Committee of West China Second University Hospital of Sichuan University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
GY: Data curation, Methodology, Writing – original draft. FC: Formal Analysis, Methodology, Resources, Writing – original draft. KW: Investigation, Methodology, Writing – original draft. QC: Formal Analysis, Methodology, Resources, Writing – original draft. YZ: Conceptualization, Funding acquisition, Methodology, Visualization, Writing – original draft, Writing – review and editing. LF: Conceptualization, Validation, Writing – review and editing, Writing – original draft.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the National Natural Science Foundation of China (92478101) and the National Key R&D Program of China (2022YFF0608401).
Acknowledgments
We thank the volunteers for their semen samples.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2025.1569561/full#supplementary-material
References
Ahmad Izaham, A. R., Ang, C. S., Nie, S., Bird, L. E., Williamson, N. A., and Scott, N. E. (2021). What are we missing by using hydrophilic enrichment? Improving bacterial glycoproteome coverage using total proteome and FAIMS analyses. J. Proteome Res. 20 (1), 599–612. doi:10.1021/acs.jproteome.0c00565
Cao, S., Qian, Z., Wu, R., Sun, S., Jing, J., Zhang, G., et al. (2022). SERPINA5 protein in cumulus-oocyte complexes increases the fertilisation ability of mouse sperm. Reprod. Sci. 29 (8), 2350–2362. doi:10.1007/s43032-022-00867-5
Cao, W. (2024). Advancing mass spectrometry-based glycoproteomic software tools for comprehensive site-specific glycoproteome analysis. Curr. Opin. Chem. Biol. 80, 102442. doi:10.1016/j.cbpa.2024.102442
Cao, W., Liu, M., Kong, S., Wu, M., Zhang, Y., and Yang, P. (2021). Recent advances in software tools for more generic and precise intact glycopeptide analysis. Mol. Cell Proteomics 20, 100060. doi:10.1074/mcp.R120.002090
Cheng, Q., Luo, M., Xu, Z., Li, F., and Zhang, Y. (2024). Developing glycoproteomics reveals the role of posttranslational glycosylation in the physiological and pathological processes of male reproduction. iMetaOmics 1 (1), e10. doi:10.1002/imo2.10
Clark, G. F. (2014). The role of glycans in immune evasion: the human fetoembryonic defence system hypothesis revisited. Mol. Hum. Reprod. 20 (3), 185–199. doi:10.1093/molehr/gat064
Damerell, D., Ceroni, A., Maass, K., Ranzinger, R., Dell, A., and Haslam, S. M. (2015). Annotation of glycomics MS and MS/MS spectra using the GlycoWorkbench software tool. Methods Mol. Biol. 1273, 3–15. doi:10.1007/978-1-4939-2343-4_1
Diekman, A. B., Norton, E. J., Klotz, K. L., Westbrook, V. A., and Herr, J. C. (1999). Evidence for a unique N-linked glycan associated with human infertility on sperm CD52: a candidate contraceptive vaccinogen. Immunol. Rev. 171, 203–211. doi:10.1111/j.1600-065x.1999.tb01350.x
He, S., Lin, Y. L., and Liu, Y. X. (1999). Functionally inactive protein C inhibitor in seminal plasma may be associated with infertility. Mol. Hum. Reprod. 5 (6), 513–519. doi:10.1093/molehr/5.6.513
Helms, A., Escobar, E. E., Vainauskas, S., Taron, C. H., and Brodbelt, J. S. (2023). Ultraviolet photodissociation permits comprehensive characterization of O-glycopeptides cleaved with O-glycoprotease IMPa. Anal. Chem. 95 (24), 9280–9287. doi:10.1021/acs.analchem.3c01111
Huang, J., Wan, H., Yao, Y., Li, J., Cheng, K., Mao, J., et al. (2015). Highly efficient release of glycopeptides from hydrazide beads by hydroxylamine assisted PNGase F deglycosylation for N-glycoproteome analysis. Anal. Chem. 87 (20), 10199–10204. doi:10.1021/acs.analchem.5b02669
Jerabek, I., Zechmeister-Machhart, M., Binder, B. R., and Geiger, M. (2001). Binding of retinoic acid by the inhibitory serpin protein C inhibitor. Eur. J. Biochem. 268 (22), 5989–5996. doi:10.1046/j.0014-2956.2001.02560.x
Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596 (7873), 583–589. doi:10.1038/s41586-021-03819-2
Ka, U. A. A., Ferens-Sieczkowska, M. A., Olejnik, B., Ko Odziejczyk, J., Zimmer, M., and Kratz, E. M. (2019). The content of immunomodulatory glycoepitopes in seminal plasma glycoproteins of fertile and infertile men. Reprod. Fertil. Dev. 31 (3), 579–589. doi:10.1071/rd18124
Kawahara, R., Chernykh, A., Alagesan, K., Bern, M., Cao, W., Chalkley, R. J., et al. (2021). Community evaluation of glycoproteomics informatics solutions reveals high-performance search strategies for serum glycopeptide analysis. Nat. Methods 18 (11), 1304–1316. doi:10.1038/s41592-021-01309-x
Kida, Y., Taira, J., and Kuwano, K. (2016). EprS, an autotransporter serine protease, plays an important role in various pathogenic phenotypes of Pseudomonas aeruginosa. Microbiol. Read. 162 (2), 318–329. doi:10.1099/mic.0.000228
Kosaka, A., Yajima, Y., Hatayama, M., Ikuta, K., Sasaki, T., Hirai, N., et al. (2021). A stealth antigen SPESP1, which is epigenetically silenced in tumors, is a suitable target for cancer immunotherapy. Cancer Sci. 112 (7), 2705–2713. doi:10.1111/cas.14973
Lan, R., Xin, M., Hao, Z., You, S., Xu, Y., Wu, J., et al. (2020). Biological functions and large-scale profiling of protein glycosylation in human semen. J. Proteome Res. 19 (10), 3877–3889. doi:10.1021/acs.jproteome.9b00795
Lee, L. Y., Moh, E. S., Parker, B. L., Bern, M., Packer, N. H., and Thaysen-Andersen, M. (2016). Toward automated N-glycopeptide identification in glycoproteomics. J. Proteome Res. 15 (10), 3904–3915. doi:10.1021/acs.jproteome.6b00438
Li, C., Dan, W., Li, P., Xin, M., Lan, R., Zhu, B., et al. (2023). Site-specific N-glycan changes during semen liquefaction. Anal. Biochem. 680, 115318. doi:10.1016/j.ab.2023.115318
Liu, K., Meng, T., Chen, Q., Hou, G., Wang, X., Hu, S., et al. (2022). Diurnal rhythm of human semen quality: analysis of large-scale human sperm bank data and timing-controlled laboratory study. Hum. Reprod. 37 (8), 1727–1738. doi:10.1093/humrep/deac135
Liu, M. Q., Zeng, W. F., Fang, P., Cao, W. Q., Liu, C., Yan, G. Q., et al. (2017). pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification. Nat. Commun. 8 (1), 438. doi:10.1038/s41467-017-00535-2
Luo, M., Mao, Y., Zeng, W., Zheng, S., Li, H., Hu, J., et al. (2022a). Site-specific N-glycosylation characterization of micro monoclonal immunoglobulins based on EThcD-sceHCD-MS/MS. Front. Immunol. 13, 1013990. doi:10.3389/fimmu.2022.1013990
Luo, M., Su, T., Cheng, Q., Zhang, X., Cai, F., Yin, Z., et al. (2023). GlycoTCFM: glycoproteomics based on two complementary fragmentation methods reveals distinctive O-glycosylation in human sperm and seminal plasma. J. Proteome Res. 22 (12), 3833–3842. doi:10.1021/acs.jproteome.3c00489
Luo, M., Su, T., Wang, S., Chen, J., Lin, T., Cheng, Q., et al. (2022b). Proteomic landscape of human spermatozoa: optimized extraction method and application. Cells 11 (24), 4064. doi:10.3390/cells11244064
Mao, Y., Su, T., Lin, T., Yang, H., Zhao, Y., Zhang, Y., et al. (2022). Comprehensive plasma N-glycoproteome profiling based on EThcD-sceHCD-MS/MS. Front. Chem. 10, 920009. doi:10.3389/fchem.2022.920009
Mao, Y., Wang, S., Zhao, Y., Konstantinidi, A., Sun, L., Ye, Z., et al. (2021). Systematic evaluation of fragmentation methods for unlabeled and isobaric mass tag-labeled O-glycopeptides. Anal. Chem. 93 (32), 11167–11175. doi:10.1021/acs.analchem.1c01696
Mao, Y., Zhao, Y., Zhang, Y., and Yang, H. (2020). In-depth characterization and comparison of the N-glycosylated proteome of two-dimensional- and three-dimensional-cultured breast cancer cells and xenografted tumors. PLoS One 15 (12), e0243789. doi:10.1371/journal.pone.0243789
Noach, I., Ficko-Blean, E., Pluvinage, B., Stuart, C., Jenkins, M. L., Brochu, D., et al. (2017). Recognition of protein-linked glycans as a determinant of peptidase activity. Proc. Natl. Acad. Sci. U. S. A. 114 (5), E679–e688. doi:10.1073/pnas.1615141114
Pang, P. C., Tissot, B., Drobnis, E. Z., Morris, H. R., Dell, A., and Clark, G. F. (2009). Analysis of the human seminal plasma glycome reveals the presence of immunomodulatory carbohydrate functional groups. J. Proteome Res. 8 (11), 4906–4915. doi:10.1021/pr9001756
Panner Selvam, M. K., Agarwal, A., Pushparaj, P. N., Baskaran, S., and Bendou, H. (2019). Sperm proteome analysis and identification of fertility-associated biomarkers in unexplained male infertility. Genes (Basel) 10 (7), 522. doi:10.3390/genes10070522
Pilch, B., and Mann, M. (2006). Large-scale and high-confidence proteomic analysis of human seminal plasma. Genome Biol. 7 (5), R40. doi:10.1186/gb-2006-7-5-r40
Polasky, D. A., Yu, F., Teo, G. C., and Nesvizhskii, A. I. (2020). Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat. Methods 17 (11), 1125–1132. doi:10.1038/s41592-020-0967-9
Riley, N. M., and Bertozzi, C. R. (2022). Deciphering O-glycoprotease substrate preferences with O-Pair Search. Mol. Omics 18 (10), 908–922. doi:10.1039/d2mo00244b
Riley, N. M., Malaker, S. A., Driessen, M. D., and Bertozzi, C. R. (2020). Optimal dissociation methods differ for N- and O-glycopeptides. J. Proteome Res. 19 (8), 3286–3301. doi:10.1021/acs.jproteome.0c00218
Saraswat, M., Joenväärä, S., Tomar, A. K., Singh, S., Yadav, S., and Renkonen, R. (2016). N-glycoproteomics of human seminal plasma glycoproteins. J. Proteome Res. 15 (3), 991–1001. doi:10.1021/acs.jproteome.5b01069
Seppälä, M., Koistinen, H., Koistinen, R., Chiu, P. C., and Yeung, W. S. (2007). Glycosylation related actions of glycodelin: gamete, cumulus cell, immune cell and clinical associations. Hum. Reprod. Update 13 (3), 275–287. doi:10.1093/humupd/dmm004
Skakkebaek, N. E., Rajpert-De Meyts, E., Buck Louis, G. M., Toppari, J., Andersson, A. M., Eisenberg, M. L., et al. (2016). Male reproductive disorders and fertility trends: influences of environment and genetic susceptibility. Physiol. Rev. 96 (1), 55–97. doi:10.1152/physrev.00017.2015
Sun, W., Parry, S., Ubhayasekera, W., Engström, A., Dell, A., and Schedin-Weiss, S. (2010). Further insight into the roles of the glycans attached to human blood protein C inhibitor. Biochem. Biophys. Res. Commun. 403 (2), 198–202. doi:10.1016/j.bbrc.2010.11.005
Suttapitugsakul, S., Matsumoto, Y., Aryal, R. P., and Cummings, R. D. (2023). Large-Scale and site-specific mapping of the murine brain O-glycoproteome with IMPa. Anal. Chem. 95 (36), 13423–13430. doi:10.1021/acs.analchem.3c00408
Szczykutowicz, J., Tkaczuk-Włach, J., and Ferens-Sieczkowska, M. (2021). Glycoproteins presenting galactose and N-acetylgalactosamine in human seminal plasma as potential players involved in immune modulation in the fertilization process. Int. J. Mol. Sci. 22 (14), 7331. doi:10.3390/ijms22147331
Tecle, E., and Gagneux, P. (2015). Sugar-coated sperm: unraveling the functions of the mammalian sperm glycocalyx. Mol. Reprod. Dev. 82 (9), 635–650. doi:10.1002/mrd.22500
Tian, W., Li, D., Zhang, N., Bai, G., Yuan, K., Xiao, H., et al. (2021). O-glycosylation pattern of the SARS-CoV-2 spike protein reveals an “O-Follow-N” rule. Cell Res. 31 (10), 1123–1125. doi:10.1038/s41422-021-00545-2
Tsai, Y. X., Chang, N. E., Reuter, K., Chang, H. T., Yang, T. J., von Bülow, S., et al. (2024). Rapid simulation of glycoprotein structures by grafting and steric exclusion of glycan conformer libraries. Cell 187 (5), 1296–1311.e26. doi:10.1016/j.cell.2024.01.034
Vainauskas, S., Guntz, H., McLeod, E., McClung, C., Ruse, C., Shi, X., et al. (2022). A broad-specificity O-glycoprotease that enables improved analysis of glycoproteins and glycopeptides containing intact complex O-glycans. Anal. Chem. 94 (2), 1060–1069. doi:10.1021/acs.analchem.1c04055
Wang, G., Wu, Y., Zhou, T., Guo, Y., Zheng, B., Wang, J., et al. (2013). Mapping of the N-linked glycoproteome of human spermatozoa. J. Proteome Res. 12 (12), 5750–5759. doi:10.1021/pr400753f
Wang, Z., Fang, Z., Liu, L., Zhu, H., Wang, Y., Zhao, C., et al. (2023). Development of an integrated platform for the simultaneous enrichment and characterization of N- and O-linked intact glycopeptides. Anal. Chem. 95 (19), 7448–7457. doi:10.1021/acs.analchem.2c04305
Wolkowicz, M. J., Shetty, J., Westbrook, A., Klotz, K., Jayes, F., Mandal, A., et al. (2003). Equatorial segment protein defines a discrete acrosomal subcompartment persisting throughout acrosomal biogenesis. Biol. Reprod. 69 (3), 735–745. doi:10.1095/biolreprod.103.016675
Yang, X., Liu, F., Yan, Y., Zhou, T., Guo, Y., Sun, G., et al. (2015). Proteomic analysis of N-glycosylation of human seminal plasma. Proteomics 15 (7), 1255–1258. doi:10.1002/pmic.201400203
Zeng, W., Zheng, S., Su, T., Cheng, J., Mao, Y., Zhong, Y., et al. (2022). Comparative N-glycoproteomics analysis of clinical samples via different mass spectrometry dissociation methods. Front. Chem. 10, 839470. doi:10.3389/fchem.2022.839470
Zhang, H., Situ, C., and Guo, X. (2022). Recent progress of proteomic analysis on spermatogenesis. Biol. Reprod. 107 (1), 109–117. doi:10.1093/biolre/ioac065
Zhang, Y., Xie, X., Zhao, X., Tian, F., Lv, J., Ying, W., et al. (2018). Systems analysis of singly and multiply O-glycosylated peptides in the human serum glycoproteome via EThcD and HCD mass spectrometry. J. Proteomics 170, 14–27. doi:10.1016/j.jprot.2017.09.014
Zhang, Y., Zhao, W., Mao, Y., Chen, Y., Zheng, S., Cao, W., et al. (2021c). O-glycosylation landscapes of SARS-CoV-2 spike proteins. Front. Chem. 9, 689521. doi:10.3389/fchem.2021.689521
Zhang, Y., Zheng, S., Mao, Y., Cao, W., Zhao, L., Wu, C., et al. (2021a). Systems analysis of plasma IgG intact N-glycopeptides from patients with chronic kidney diseases via EThcD-sceHCD-MS/MS. Analyst 146 (23), 7274–7283. doi:10.1039/d1an01657a
Zhang, Y., Zheng, S. S., Zhao, W. J., Mao, Y. H., Cao, W., Zeng, W. J., et al. (2021b). Sequential analysis of the N/O-glycosylation of heavily glycosylated HIV-1 gp120 using EThcD-sceHCD-MS/MS. Front. Immunol. 12, 755568. doi:10.3389/fimmu.2021.755568
Zhao, Y., Xue, Q., Wang, M., Meng, B., Jiang, Y., Zhai, R., et al. (2023). Evolution of mass spectrometry instruments and techniques for blood proteomics. J. Proteome Res. 22 (4), 1009–1023. doi:10.1021/acs.jproteome.3c00102
Keywords: sperm, N-glycosylation, O-glycosylation, LC-MS/MS, glycoproteomics
Citation: Yan G, Cai F, Wu K, Cheng Q, Zhang Y and Fan L (2025) GlycoIP: an integrated platform for simultaneous and site-specific N/O-glycosylation analysis of human semen. Front. Chem. 13:1569561. doi: 10.3389/fchem.2025.1569561
Received: 01 February 2025; Accepted: 01 May 2025;
Published: 19 May 2025.
Edited by:
Halil Ibrahim Ulusoy, Cumhuriyet University, TürkiyeReviewed by:
Guinevere S. M. Lageveen-Kammeijer, University of Groningen, NetherlandsPodjana Wattananit, Mahidol University, Thailand
Copyright © 2025 Yan, Cai, Wu, Cheng, Zhang and Fan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yong Zhang, bmFua2FpMTk4OUBmb3htYWlsLmNvbQ==, emhhbmd5b25nMDgwOUB3Y2hzY3UuZWR1LmNu; Lin Fan, ZmFubGluQHNjc3pseXkub3JnLmNu
†These authors have contributed equally to this work