DATA REPORT article

Front. Genet., 12 May 2023

Sec. Livestock Genomics

Volume 14 - 2023 | https://doi.org/10.3389/fgene.2023.1182002

The draft genome of the tropical sea cucumber Stichopus monotuberculatus (Echinodermata, Stichopodidae) reveals critical genes in fucosylated chondroitin sulfates biosynthetic pathway

  • SZ

    Shengping Zhong 1,2† *

  • XM

    Xiaowan Ma 3

  • YJ

    Yan Jiang 2

  • XL

    Xujia Liu 2,4

  • MZ

    Mengqing Zeng 2

  • LZ

    Longyan Zhao 1

  • LH

    Lianghua Huang 1

  • GH

    Guoqiang Huang 1

  • YZ

    Yongzhen Zhao 5

  • YQ

    Ying Qiao 3*

  • XC

    Xiuli Chen 5*

  • 1. Guangxi Key Laboratory of Marine Drugs, Institute of Marine Drugs, Guangxi University of Chinese Medicine, Nanning, China

  • 2. Guangxi Engineering Technology Research Center for Marine Aquaculture, Guangxi Institute of Oceanology Co., Ltd., Beihai, China

  • 3. Key Laboratory of Tropical Marine Ecosystem and Bioresource, Fourth Institute of Oceanography, Ministry of Natural Resources, Beihai, China

  • 4. Guangxi Key Laboratory of Marine Environmental Science, Guangxi Academy of Marine Sciences, Guangxi Academy of Sciences, Nanning, China

  • 5. Guangxi Key Laboratory of Aquatic Genetic Breeding and Healthy Aquaculture, Guangxi Academy of Fishery Sciences, Nanning, China

Introduction

Holothuroids, or sea cucumbers, are a diverse echinoderm group of economically significant marine benthic invertebrates (; ). Due to its large bode size and thick body wall, Stichopus monotuberculatus, a tropical holothuroid has become a popular tonic food in Indo-west pacific counties, where it is known as its delectable taste and high nutritional value (). Because the body walls of holothuroids contain abundant and diverse biomedical compounds, such as sulfated polysaccharides, saponins, and peptides () they have become the subject of increasing attention in pharmaceutical research as an important source of medicinal compounds ().

Marine sulfated polysaccharides usually occur in echinoderms and brown algae, which are abundant and considered to be promising sources of medicinal compounds with significant pharmacological effects (; ). The marine sulfated polysaccharides are typically composed of sulfate groups and α-L-fucose. They are structurally diverse and the positions of their fucosylated and sulfonated features vary by marine species (; ). Sulfated fucans (SF) and fucosylated chondroitin sulfates (FCS) are the two most common types of sulfated polysaccharides occurring in marine animals, with SF occurring in echinoderms and brown algae () and FCS being exclusive to echinoderms (). Fucosylated chondroitin sulfates, also called fucosylated glycosaminoglycan (FG), is typically composed of a chondroitin sulfate backbone attached to α-L-fucose branches, both of which are commonly sulfonated (; ). The structural features of FCS and the pattern of sulfonation of the chondroitin sulfate backbone and the α-L-fucose branches varies across sea cucumber species (). It be important in maintaining the body wall integrity of sea cucumbers by preventing degradation by marine microorganisms (). Studies of brown algae have detected duplication and expansion events at SF biosynthesis associated genes, such as the mannuronan C-5-epimerases (MC5Es) () and carbohydrate sulfotransferases (CHSTs), which increase the rigidity of the cell wall and provide genomic adaptions to different environments (). A previous study on the structural diversity of holothurian FCS and the position and number of the fucosylated branches attached to the chondroitin sulfate backbone confirmed the species-specific patterns across holothuroids (). These species-specific FCS structures may have significant impacts on various biological activities such as antithrombotic, antihyperlipidemic, and anticoagulant processes and wound healing (). Studies of brown algae genomes have identified genes coding for key enzymes for sulfated polysaccharides (). Moreover, the genes coding for downstream enzymes, such as sulfotransferases and fucosyltransferases have been revealed expanded in brown algae genome (). However, despite the biological, ecological and economic importance of holothurian FCS, the key genes associated with its biosynthetic pathways and their evolution remain unknown.

Here in this report, in order to decipher the key genes associated FCS biosynthetic pathways in tropical sea cucumbers, S. monotuberculatus genome have been assembled and annotated. The genome comprised 168 contigs, with an overall length of 839.56 Mb. The contig N50 length was 11.51 Mb. In addition, our investigations of the evolution of the gene families revealed 1,011 significantly expanded genes and 408 genes exclusive to S. monotuberculatus. The gene families coding for key enzymes associated with FCS biosynthetic pathways, including fucosyltransferases and sulfotransferases, were also significantly expanded. The accessibility of first genomic sequences and annotations of S. monotuberculatus will provide a genomic approach to investigate the structural diversity of holothurian FCS, as well as novel perspectives into evolutionary adaptation of critical genes in holothurian FCS biosynthesis pathways.

Materials and methods

DNA and RNA sequencing

An individual S. monotuberculatus (body weight 5.92 g) was collected from Guangxi Province, China (21.484,179 N, 109.720,886 E), and kept in an aerated water tank before tissue sampling. The intestine, tentacles, muscle, and tube-feet were dissected out and flash-frozen using liquid nitrogen. Genomic DNA was isolated from a muscle sample using a QIAamp DNA Mini Kit (QIAGEN, Hilden, Germany). A Nanopore 20 kb insert library was constructed using approximately 1 µg of genomic DNA and Blue Pippin electrophoresis (Sage Science, Inc., Beverly, CA, United States) was used to select DNA fractions larger than 20 kb. A DNA sequencing library for S. monotuberculatus was prepared by Grandomics Biosciences Co., Ltd (Wuhan, China) using an SQK-LSK109 ligation kit (Oxford NanoporeTechnologies, Oxford, UK). In addition, a 350 bp insert size paired-end DNA sequencing library with 2 × 150 bp lengths was prepared using a MGISEQ-2000 platform (Wuhan, China) in accordance with the manufacturer’s instructions. Finally, S. monotuberculatus RNA was isolated from the intestine, tentacle, and tube-feet samples using an RNAiso kit (TaKaRa, Tokyo, Japan) and then a MGISEQ-2000 system was used to prepare three S. monotuberculatus paired-end RNA sequencing libraries with lengths of 2 × 150 bp.

Genome assembly

The 2 × 150 bp paired-end DNA sequences were used to evaluate the S. monotuberculatus genome length and polymorphisms. After using fastp (v. 0.23.2) () to trim the DNA sequences, Jellyfish (v. 2.3.0) (), and Genomescope (v, 2.0) () were used to conduct a k-mer analysis with a length setting of 19. The correct-then-assemble strategy of NextDenovo (v. 2.5.0) (github.com/Nextomics/NextDenovo) was used to assemble the preliminarily S. monotuberculatus genome using Nanopore sequencing data. NextPolish (v. 1.4.1) () was used to fix the contig errors in the preliminarily S. monotuberculatus genome using S. monotuberculatus paired-end DNA sequences. Purge_Dups (v. 1.2.6) () was then used to remove redundant sequences. Finally, the completeness of the S. monotuberculatus genome was assessed using the metazoa_odb10 data in the BUSCO software (v. 5.4.4) ().

Prediction of transposable elements, gene structure, and functional annotation

Initially, we used EDTA (v. 2.1.0) () to apply an ab initio strategy to evaluate the transposable elements (TEs) in the S. monotuberculatus genome. A homology-based strategy was then applied using the predicted TEs library and RepeatMasker (v. 4.1.2) (www.repeatmasker.org) to further detect any remaining comprehensive TEs sequences. The protein-coding gene structures were annotated using a combination of transcript-based, homology-based, and ab initio prediction strategies. For the transcript-based annotation, the RNA sequencing data from the intestine, tentacles, and tube-feet samples were used to assembly de novo transcript sequences using Trinity (v. 2.14) (). Afterwards, the genome guided-transcripts were assembled using StringTie (v. 2.2.1) (). Finally, the transcript-based predicted proteins in the S. monotuberculatus genome were obtain by aligning all transcript sequences to the S. monotuberculatus genomic sequences using PASA (v. 2.5.2) () with default parameters. For the homology-based method, the protein-coding genes in the S. monotuberculatus genome were validated using GeMoMa (v. 1.9) () with default settings obtained from the echinoderm protein data in Genbank, including those for Strongylocentrotus purpuratus (GCF_000002235.5), Patiria miniata (GCF_015706575.1), Lytechinus variegatus (GCF_018143015.1), Apostichopus japonicus (GCF_002754855.1), Holothuria glaberrima (GCF_009936505.2), Asterias rubens (GCF_902459465.1), Anneissia japonica (GCF_011630105.1), and Chiridota heheva (GCF_020152595.1). For the ab initio method, the RNA sequencing data of S. monotuberculatus and the echinoderm protein data were applied to predict the coding genes using BRAKER2 (v. 2.1.6) () in the GeneMark-ETP mode. Finally, the predictions of the three strategies were merged and evaluated using EvidenceModeler (v. 1.1.1) (), and the evaluated results were functionally annotated using DIAMOND (v. 2.1.3) () with default parameters to search several protein databases, and well as the Gene Ontology (GO), Swiss-Prot, UniProtKB-TremBL, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases, using an E-value limit for homologous annotation of 1e-5. The completeness of the prediction of the protein-coding genes in the S. monotuberculatus genome was further validated using BUSCO (v. 5.4.4) with the metazoa_odb10 data. The identification of transfer RNAs (tRNAs) and ribosomal RNAs (rRNAs) in the S. monotuberculatus genome was accomplished using tRNAscan-SE (v. 2.0.6) () and RNAmmer (v. 1.2) (), respectively. The identification of microRNAs and small nuclear RNAs (snRNAs) was performed using Infernal (v. 1.1.2) () to search the Rfam database with default parameters.

Phylogenomics and gene family investigation

To investigate the comparative phylogenomics and gene families of S. monotuberculatus, OrthoFinder (v. 2.5.4) () was used to verify the orthologous gene clusters of S. monotuberculatus and 14 related species including Acanthaste planci (GCF_001949145.1), Hemicentrotus pulcherrimus (GCF_003118195.1), Holothuria scabra (GCF_026123075.1), Pisaster ochraceus (GCF_010994315.2), Plazaster borealis (GCF_021014325.1), Saccoglossus kowalevskii (GCF_000003605.2), S. purpuratus, P. miniata, L. variegatus, Anneissia japonicus, H. glaberrima, A. rubens, Apostichopus japonica, and C. heheva. The protein sequences of the 14 related species were retrieved from Genbank and Echinobase (www.echinobase.org), and the longest protein sequence was chosen to identify the orthologous protein. To investigate the evolutionary status of S. monotuberculatus, the single-copy protein sequences were first obtained from the orthologous OrthoFinder results and then MUSCLE (v. 3.8.31) was used to align those sequences. The phylogenetic tree was constructed from the continuous super protein sequences combined from all of the aligned single-copy sequences using RAxML-avx (v. 8.2.9) () under LG4M amino acid substitution model with 1,000 bootstrap replicates. The evolutionary times of H. scabra and A. japonicus were retrieved from the TimeTree website (www.timetree.org) and calibrated using the r8s software (v. 1.71) to evaluate the evolutionary time of S. monotuberculatus and the 14 closely related species. In order to evaluate the evolutionary history of the S. monotuberculatus gene families, the likelihood analysis method was performed using CAFE (v. 5.0) () with p < 0.05. In addition, functional enrichment analyses, including KEGG and GO, were employed using TBtools () to evaluate the biological functions of the S. monotuberculatus species-specific and expanded gene families.

Results and discussions

Genome assembly

Both the Nanopore and MGI platforms were used to assemble the S. monotuberculatus genome. The former produced 51.53 Gb sequences with an average length of 22,615 bp, while the latter produced 64.72 Gb sequences with a Q20 quality level of 97.96%. The genomic features of the S. monotuberculatus genome were estimated using the MGI platform data in Genomescope to perform k-mer-based estimations. The results showed a predicted size of the S. monotuberculatus genome of 784.77 Mb, as well as a repetitive sequence ratio of 31.63% and a high heterozygous rate of 1.59% (Figure 1B; Supplementary Table S2). The heterozygous rate of the S. monotuberculatus genome was close to that of A. japonicus, while its genome size was slightly smaller (). Due to the high heterozygosity of the S. monotuberculatus genome, the long reads data were used in NextDenovo to assemble its complex regions. After a first round of assembly, 292 contigs with 10.52 Mb in contig N50 were achieved. To resolve the redundancy of heterozygous regions, Purge_Dups was applied after correcting contigs errors using NextPolish. Eventually, 168 contigs with 839.56 Mb in size, 11.51 Mb in contig N50, and a longest contig of 52.05 Mb were established in the S. monotuberculatus genome (Supplementary Table S1). This is the largest reported contig N50 for any holothurian genome to date (). Based on the metazoa_odb10 lineages, the BUSCO analysis with genome mode was performed to assess the genome integrity of S. monotuberculatus. The result showed that it covered 97.9% BUSCOs, including 94.2% complete and 3.7% fragmented, while only 2.1% were classified as missing (Supplementary Table S3). Our assembled genome has the greatest level of completeness of any reported holothurian genome to date, including H. scabra (), H. glaberrima (), C. heheva (), and A. japonicus (). In addition, 95.98% of the short read data can be confidently mapped to the S. monotuberculatus final genome using BWA v. 0.7.17. These results indicate that the genome completeness and integrity are high in the final assembled S. monotuberculatus genome.

FIGURE 1

Genome annotation

The integrated results of the predicted TEs analysis showed that a total length of 298.36 Mb (35.54%) of the S. monotuberculatus genome could be identified as TEs, which is the highest among the published Stichopodidae genomes, including A. japonicus (26.68%) and Parastichopus parvimensis (25.02%). However, the proportion of TEs in the S. monotuberculatus genome was much smaller than in other deep-sea holothurian genomes, such as C. heheva (56.40%) () and Paelopatides sp. Yap (73.93%) (). The S. monotuberculatus genome was found to be richest in terminal inverted repeats, which made up 19.41% of the genome sequences, while long terminal repeats (6.88%) and tandem repeats (5.67%) were the next two richest types (Supplementary Table S4). Using EvidenceModeler to combine protein-coding gene predictions, the S. monotuberculatus genome was confirmed to have 36,422 protein-coding genes (Supplementary Table S1). The protein-coding gene number of S. monotuberculatus was a bit higher than that of A. japonicus (30,305), but the protein-coding gene length of S. monotuberculatus (12,278 bp on average, Supplementary Table S1) was a bit longer than that of A. japonicus (7,787 bp on average) (). Compared to S. monotuberculatus and A. japonicus, H. scabra genome has a much longer protein-coding gene length (25,967 bp on average) but much less protein-coding gene number (16,642) (). In the comparison study of three A. japonicus genome assemblies, Jo’s assembly has the lowest protein-coding gene number (21,771) and the shortest protein-coding gene length (5,388 bp on average). However, as the quality of the assembly has improved, the gene annotation completeness has also increased significantly, resulting in the longest protein-coding gene length (8,918 bp on average) and the highest protein-coding gene number around thirty-thousand (). The improvement of S. monotuberculatus assembly quality may also result in an increase in the number of predicted protein-coding genes. The BUSCO evaluation based on metazoa_odb10 lineages showed that the verified genes covered 99.1% of metazoa_odb10 genes, including 98.6% classified as complete and 0.5% classified as fragmented (Supplementary Table S3). The completeness of the gene predictions in the S. monotuberculatus genome is the greatest of any reported holothurian genome to date, indicating that the quality of the gene structure prediction is sufficiently high for downstream analyses, such as comparative phylogenomics analysis. In addition, 79.31% of the S. monotuberculatus genes were functionally annotated in the various protein databases, including UniProtKB-TremBL (79.08%), Swiss-Prot (49.96%), KEGG (38.25%), and GO (36.82%) (Supplementary Table S5). Moreover, a total length of 0.34 Mb in the S. monotuberculatus genome was predicted to be non-coding RNAs, including 1,101 tRNAs, 1,452 microRNAs, 181 rRNAs, and 208 snRNAs (Supplementary Table S6).

Phylogenomics and gene family investigation

The orthologous genes shared between S. monotuberculatus and other Echinodermata species were identified using OrthoFinder. A total of 30,979 genes (85.1%) were identified as orthologous gene clusters in the S. monotuberculatus protein-coding gene set, of which 2,793 genes (7.7%) were found to be species-specific orthologous genes. In addition, all Echinodermata share 6,191 orthogroups, along with 422 single-copy orthogroups (Supplementary Table S7). RAxML used 73,344 amino acid sites of these single-copy orthogroups to build the Echinodermata phylogenetic tree with the hemichordate S. kowalevskii rooted as an outgroup species. The phylogenetic results showed that S. monotuberculatus initially clustered with A. japonicus, and that Stichopodidae and Holothuriidae were clustered as sister groups to form the Aspidochirotida clade. Apodida (C. heheva) and Aspidochirotida were clustered together within the Holothuroidea, with Apodida being the basal taxon of Holothuroidea (Supplementary Figure S1). Our phylogenetic results indicated that Stichopodidae and Holothuriidae have a close sister relationship within Aspidochirotida, and that Apodida occupies the basal branch in the class Holothuroidea. These results were consistent with the phylogenetic analyses of Holothuroidea inferred by morphological () and phylogenomic analyses ().

An evolutionary investigation of gene families was conducted comparing S. monotuberculatus with other Echinodermata species. The results showed that 693 gene families in the S. monotuberculatus genome were unique and that 857 gene families had undergone significant expansion, whereas only 167 gene families had undergone significant contraction (Figure 2A). The structural features of FCS from holothuroids are species-specific, and mainly due to the different positions of the sulfonated branches on the chondroitin sulfate backbone and to the different fucosylated branch chains across the holothuroids (). Studies of S. monotuberculatus have shown that these species-specific FCS structural features improve anticoagulant activity (). In this study, gene families coding for key enzymes involved in FCS biosynthesis, including carbohydrate sulfotransferases (CHSTs, EC 2.8.2) (Supplementary Figure S2) which were identified as significantly expanded gene families and fucosyltransferases (FUTs, EC 2.4.1) (Supplementary Figure S3) which were identified as species-specific to S. monotuberculatus. In previous studies on brown algae, the sulfotransferases and fucosyltransferases genes were also expanded independently, and the copies of these genes varied among brown algae genomes, which may be important in the efficient biosynthesis of SF (). Compared with other Echinodermata species, 13 copies of the carbohydrate sulfotransferase 11 (CHST11, EC 2.8.2.5) gene, 10 copies of the carbohydrate sulfotransferase 13 (CHST13, EC 2.8.2.5) gene, and 22 copies of the carbohydrate sulfotransferase (CHST15, EC 2.8.2.33) gene were identified as significantly expanded in the S. monotuberculatus genome. Meanwhile, The KEGG enrichment analyses of significantly expanded S. monotuberculatus gene families showed that these genes were significantly enriched in 62 pathways, as well as those involved in FCS biosynthesis, such as Glycosaminoglycan biosynthesis - chondroitin sulfate, Pentose and glucuronate interconversions, Glycan biosynthesis and metabolism, and Glycosaminoglycan binding proteins (Figure 2B). Significant expansions of gene families coding for key enzymes associated with FCS biosynthesis in the S. monotuberculatus genome may play crucial roles in the biosynthesis of complex and diverse FCS and may be important for maintaining body wall integrity in S. monotuberculatus. Furthermore, eight copies of the fucosyltransferase 4 (FUT4, EC 2.4.1.-) gene, nine copies of the fucosyltransferase 7 (FUT7, EC 2.4.1.-) gene, and one copy of the fucosyltransferase 3 (FUT3, EC 2.4.1.65) gene were identified as unique to the S. monotuberculatus genome. In previous studies on brown algae, N. decipiens genome also has unique fucosyltransferase and extracellular matrix genes which may facilitate sulfated fucan biosynthesis in Nemacystus decipiens (). Compared to other species of Aspidochirotida, these unique fucosyltransferase genes may be critical for S. monotuberculatus producing FCS more efficiently to maintain body wall integrity. GO enrichment analyses also showed that these unique genes were significantly enriched in the Glycosaminoglycan biosynthesis pathway, such as the polysaccharide binding (GO:0030247), proteoglycan binding (GO:0043394), fucosyltransferase activity (GO:0008417), and α-1,3-fucosyltransferase activity (GO:0046920) pathways (Figure 2C). In summary, the species specificity of key fucosyltransferase genes may play an important role in the biosynthesis of unique FCS in S. monotuberculatus, indicating the complex genomic adaptations for FCS biosynthesis in sea cucumbers.

FIGURE 2

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the https://figshare.com/, https://doi.org/10.6084/m9.figshare.22177898.v1; https://www.ncbi.nlm.nih.gov/, PRJNA938157, SRR23615351, SRR23610520-SRR23610521 and SRR23604910-SRR23604915.

Ethics statement

The studies involving animals were reviewed and approved by Committee on Animal Research and Ethics of Guangxi University of Chinese Medicine.

Author contributions

SZ, XM, and YJ: conceived study. SZ, XM, and YQ: bioinformatics analysis. XL, MZ and LZ: collected samples. LH, GH, and YZ: investigation samples. SZ, YQ, and XC: wrote manuscript. All authors approved the final submission. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by National Natural Science Foundation of China (31960225), China Postdoctoral Science Foundation (2021M701798), Guangxi University of Chinese Medicine “GuiPai Traditional Chinese Medicine inheritance and innovation team” Project (2022A007), Development Program of High-level Talent Team under Qihuang Project of Guangxi University of Chinese Medicine (2021004), Guangxi Natural Science Foundation (2021GXNSFAA196074, 2021GXNSFAA220030, 2022GXNSFBA035166, 2020GXNSFBA297126, AD19245135, and AD19245161), Fund of Hainan Provincial Key Laboratory of Tropical Maricultural Technologies (TMTOF202204), the Open Fund of Guangxi Key Laboratory of Aquatic Genetic Breeding and Healthy Aquaculture (GXKEYLA2022-03) and PhD research startup foundation of Guangxi University of Chinese Medicine (2019BS018).

Conflict of interest

Guangxi Institute of Oceanology Co., Ltd. was the employer of authors Shengping Zhong, Yan Jiang, Xujia Liu, and Mengqing Zeng.

The remaining authors state that no business or financial relationships that may be considered as a potential conflict of interest existed during the study.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2023.1182002/full#supplementary-material

References

  • 1

    BrůnaT.HoffK. J.LomsadzeA.StankeM.BorodovskyM. (2021). BRAKER2: Automatic eukaryotic genome annotation with GeneMark-ep+ and AUGUSTUS supported by a protein database. NAR Genomics Bioinforma.3 (1), lqaa108. 10.1093/nargab/lqaa108

  • 2

    BuchfinkB.ReuterK.DrostH.-G. (2021). Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods18 (4), 366368. 10.1038/s41592-021-01101-x

  • 3

    CampbellM. A.HaasB. J.HamiltonJ. P.MountS. M.BuellC. R. (2006). Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics7 (1), 327. 10.1186/1471-2164-7-327

  • 4

    CarvalhalF.CristeloR. R.ResendeD. I. S. P.PintoM. M. M.SousaE.Correia-da-SilvaM. (2019). Antithrombotics from the sea: Polysaccharides and beyond. Mar. Drugs17 (3), 170. 10.3390/md17030170

  • 5

    ChenS.ZhouY.ChenY.GuJ. (2018). fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics34 (17), i884i890. 10.1093/bioinformatics/bty560

  • 6

    ChenC.ChenH.ZhangY.ThomasH. R.FrankM. H.HeY.et al (2020). TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant13 (8), 11941202. 10.1016/j.molp.2020.06.009

  • 7

    ChenM.SunS.XuQ.GaoF.WangH.WangA. (2022). Influence of water temperature and flow velocity on locomotion behavior in tropical commercially important sea cucumber Stichopus monotuberculatus. Front. Mar. Sci.9, 931430. 10.3389/fmars.2022.931430

  • 8

    EmmsD. M.KellyS. (2019). OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol.20 (1), 238. 10.1186/s13059-019-1832-y

  • 9

    GrabherrM. G.HaasB. J.YassourM.LevinJ. Z.ThompsonD. A.AmitI.et al (2011). Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol.29 (7), 644652. 10.1038/nbt.1883

  • 10

    GuanD.McCarthyS. A.WoodJ.HoweK.WangY.DurbinR. (2020). Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics36 (9), 28962898. 10.1093/bioinformatics/btaa025

  • 11

    HaasB. J.SalzbergS. L.ZhuW.PerteaM.AllenJ. E.OrvisJ.et al (2008). Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol.9 (1), R7. 10.1186/gb-2008-9-1-r7

  • 12

    HanQ.KeesingJ. K.LiuD. (2016). A review of sea cucumber aquaculture, ranching, and stock enhancement in China. Rev. Fish. Sci. Aquac.24 (4), 326341. 10.1080/23308249.2016.1193472

  • 13

    HelbertW. (2017). Marine polysaccharide sulfatases. Front. Mar. Sci.4. 10.3389/fmars.2017.00006

  • 14

    HoffK. J. (2019). MakeHub: Fully automated generation of UCSC genome browser assembly hubs. Genomics, Proteomics Bioinforma.17 (5), 546549. 10.1016/j.gpb.2019.05.003

  • 15

    HuJ.FanJ.SunZ.LiuS. (2020). NextPolish: A fast and efficient genome polishing tool for long-read assembly. Bioinformatics36 (7), 22532255. 10.1093/bioinformatics/btz891

  • 16

    HuangY.GilnaP.LiW. (2009). Identification of ribosomal RNA genes in metagenomic fragments. Bioinformatics25 (10), 13381340. 10.1093/bioinformatics/btp161

  • 17

    KerrA. M.KimJ. (2008). Phylogeny of Holothuroidea (Echinodermata) inferred from morphology. Zoological J. Linn. Soc.133 (1), 6381. 10.1111/j.1096-3642.2001.tb00623.x

  • 18

    KhotimchenkoY. (2018). Pharmacological potential of sea cucumbers. Int. J. Mol. Sci.19 (5), 1342. 10.3390/ijms19051342

  • 19

    LiY.WangR.XunX.WangJ.BaoL.ThimmappaR.et al (2018). Sea cucumber genome provides insights into saponin biosynthesis and aestivation regulation. Cell. Discov.4 (1), 29. 10.1038/s41421-018-0030-5

  • 20

    LoweT. M.ChanP. P. (2016). tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res.44 (W1), W54W57. 10.1093/nar/gkw413

  • 21

    LuC.ShaoZ.ZhangP.DuanD. (2020). Genome-wide analysis of the Saccharina japonica sulfotransferase genes and their transcriptional profiles during whole developmental periods and under abiotic stresses. BMC Plant Biol.20 (1), 271. 10.1186/s12870-020-02422-3

  • 22

    LuoH.HuangG.LiJ.YangQ.ZhuJ.ZhangB.et al (2022). De novo genome assembly and annotation of Holothuria scabra (Jaeger, 1833) from nanopore sequencing reads. Genes. and Genomics44 (12), 14871498. 10.1007/s13258-022-01322-0

  • 23

    ManniM.BerkeleyM. R.SeppeyM.SimãoF. A.ZdobnovE. M. (2021). BUSCO update: Novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol.38 (10), 46474654. 10.1093/molbev/msab199

  • 24

    MarçaisG.KingsfordC. (2011). A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics27 (6), 764770. 10.1093/bioinformatics/btr011

  • 25

    Medina-FelicianoJ. G.PirroS.García-ArrarásJ. E.MashanovV.RyanJ. F. (2021). Draft genome of the sea cucumber Holothuria glaberrima, a model for the study of regeneration. Front. Mar. Sci.8, 603410. 10.3389/fmars.2021.603410

  • 26

    MendesF. K.VanderpoolD.FultonB.HahnM. W. (2020). CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics36 (22-23), 55165518. 10.1093/bioinformatics/btaa1022

  • 27

    MillerA. K.KerrA. M.PaulayG.ReichM.WilsonN. G.CarvajalJ. I.et al (2017). Molecular phylogeny of extant Holothuroidea (Echinodermata). Mol. Phylogenetics Evol.111, 110131. 10.1016/j.ympev.2017.02.014

  • 28

    MouJ.LiQ.QiX.YangJ. (2018). Structural comparison, antioxidant and anti-inflammatory properties of fucosylated chondroitin sulfate of three edible sea cucumbers. Carbohydr. Polym.185, 4147. 10.1016/j.carbpol.2018.01.017

  • 29

    MyronP.SiddiqueeS.Al AzadS. (2014). Fucosylated chondroitin sulfate diversity in sea cucumbers: A review. Carbohydr. Polym.112, 173178. 10.1016/j.carbpol.2014.05.091

  • 30

    NawrockiE. P.KolbeD. L.EddyS. R. (2009). Infernal 1.0: Inference of RNA alignments. Bioinformatics25 (10), 13351337. 10.1093/bioinformatics/btp157

  • 31

    NishitsujiK.ArimotoA.IwaiK.SudoY.HisataK.FujieM.et al (2016). A draft genome of the Brown alga, Cladosiphon okamuranus, S-strain: A platform for future studies of ‘mozuku’ biology. DNA Res.23 (6), 561570. 10.1093/dnares/dsw039

  • 32

    NishitsujiK.ArimotoA.HigaY.MekaruM.KawamitsuM.SatohN.et al (2019). Draft genome of the Brown alga, Nemacystus decipiens, Onna-1 strain: Fusion of genes involved in the sulfated fucan biosynthesis pathway. Sci. Rep.9 (1), 4607. 10.1038/s41598-019-40955-2

  • 33

    OuS.SuW.LiaoY.ChouguleK.AgdaJ. R. A.HellingaA. J.et al (2019). Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol.20 (1), 275. 10.1186/s13059-019-1905-y

  • 34

    PominV. H. (2014). Holothurian fucosylated chondroitin sulfate. Mar. Drugs12 (1), 232254. 10.3390/md12010232

  • 35

    Ranallo-BenavidezT. R.JaronK. S.SchatzM. C. (2020). GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun.11 (1), 1432. 10.1038/s41467-020-14998-3

  • 36

    ShaoG.HeT.MuY.MuP.AoJ.LinX.et al (2022). The genome of a hadal sea cucumber reveals novel adaptive strategies to deep-sea environments. iScience25 (12), 105545. 10.1016/j.isci.2022.105545

  • 37

    ShumateA.WongB.PerteaG.PerteaM. (2022). Improved transcriptome assembly using a hybrid of long and short reads with StringTie. PLOS Comput. Biol.18 (6), e1009730. 10.1371/journal.pcbi.1009730

  • 38

    StamatakisA. (2014). RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics30 (9), 13121313. 10.1093/bioinformatics/btu033

  • 39

    ThimmappaR.WangS.ZhengM.MisraR. C.HuangA. C.SaalbachG.et al (2022). Biosynthesis of saponin defensive compounds in sea cucumbers. Nat. Chem. Biol.18 (7), 774781. 10.1038/s41589-022-01054-y

  • 40

    WuN.ZhangY.YeX.HuY.DingT.ChenS. (2016). Sulfation pattern of fucose branches affects the anti-hyperlipidemic activities of fucosylated chondroitin sulfate. Carbohydr. Polym.147, 17. 10.1016/j.carbpol.2016.03.013

  • 41

    XuH.ZhouQ.LiuB.ChenF.WangM. (2022). Holothurian fucosylated chondroitin sulfates and their potential benefits for human health: Structures and biological activities. Carbohydr. Polym.275, 118691. 10.1016/j.carbpol.2021.118691

  • 42

    YeN.ZhangX.MiaoM.FanX.ZhengY.XuD.et al (2015). Saccharina genomes provide novel insight into kelp biology. Nat. Commun.6 (1), 6986. 10.1038/ncomms7986

  • 43

    YuanQ.LiH.WangQ.SunS.FangZ.TangH.et al (2022). Deaminative-cleaved S. monotuberculatus fucosylated glycosaminoglycan: Structural elucidation and anticoagulant activity. Carbohydr. Polym.298, 120072. 10.1016/j.carbpol.2022.120072

  • 44

    ZhangX.SunL.YuanJ.SunY.GaoY.ZhangL.et al (2017). The sea cucumber genome provides insights into morphological evolution and visceral regeneration. PLOS Biol.15 (10), e2003790. 10.1371/journal.pbio.2003790

  • 45

    ZhangL.HeJ.TanP.GongZ.QianS.MiaoY.et al (2022). The genome of an apodid holothuroid (Chiridota heheva) provides insights into its adaptation to a deep-sea reducing environment. Commun. Biol.5 (1), 224. 10.1038/s42003-022-03176-4

Summary

Keywords

Stichopus monotuberculatus, fucosylated chondroitin sulfates, biosynthetic pathway, genomic adaptation, Echinodermata

Citation

Zhong S, Ma X, Jiang Y, Liu X, Zeng M, Zhao L, Huang L, Huang G, Zhao Y, Qiao Y and Chen X (2023) The draft genome of the tropical sea cucumber Stichopus monotuberculatus (Echinodermata, Stichopodidae) reveals critical genes in fucosylated chondroitin sulfates biosynthetic pathway. Front. Genet. 14:1182002. doi: 10.3389/fgene.2023.1182002

Received

08 March 2023

Accepted

03 May 2023

Published

12 May 2023

Volume

14 - 2023

Edited by

Mehar S. Khatkar, The University of Sydney, Australia

Reviewed by

Abigail Elizur, University of the Sunshine Coast, Australia

Jie Gong, Nantong University, China

Updates

Copyright

*Correspondence: Shengping Zhong, ; Ying Qiao, ; Xiuli Chen,

†These authors have contributed equally to this work

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics