ORIGINAL RESEARCH article

Front. Genet., 30 September 2021

Sec. Livestock Genomics

Volume 12 - 2021 | https://doi.org/10.3389/fgene.2021.730255

Whole-Genome Sequencing and Genome-Wide Studies of Spiny Head Croaker (Collichthys lucidus) Reveals Potential Insights for Well-Developed Otoliths in the Family Sciaenidae

  • 1. East China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Shanghai, China

  • 2. Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Ministry of Education, Shanghai Ocean University, Shanghai, China

  • 3. BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China

  • 4. Shenzhen Key Lab of Marine Genomics, Guangdong Provincial Key Lab of Molecular Breeding in Marine Economic Animals, BGI Academy of Marine Sciences, BGI Marine, BGI, Shenzhen, China

Article metrics

View details

2,9k

Views

954

Downloads

Abstract

Spiny head croaker (Collichthys lucidus), belonging to the family Sciaenidae, is a small economic fish with a main distribution in the coastal waters of Northwestern Pacific. Here, we constructed a nonredundant chromosome-level genome assembly of spiny head croaker and also made genome-wide investigations on genome evolution and gene families related to otolith development. A primary genome assembly of 811.23 Mb, with a contig N50 of 74.92 kb, was generated by a combination of 49.12-Gb Illumina clean reads and 35.24 Gb of PacBio long reads. Contigs of this draft assembly were further anchored into chromosomes by integration with additional 185.33-Gb Hi-C data, resulting in a high-quality chromosome-level genome assembly of 817.24 Mb, with an improved scaffold N50 of 26.58 Mb. Based on our phylogenetic analysis, we observed that C. lucidus is much closer to Larimichthys crocea than Miichthys miiuy. We also predicted that many gene families were significantly expanded (p-value <0.05) in spiny head croaker; among them, some are associated with “calcium signaling pathway” and potential “inner ear functions.” In addition, we identified some otolith-related genes (such as otol1a that encodes Otolin-1a) with critical deletions or mutations, suggesting possible molecular mechanisms for well-developed otoliths in the family Sciaenidae.

Introduction

Spiny head croaker (Collichthys lucidus), belonging to the family Sciaenidae, is a small economic fish with a main distribution in the coastal waters of Northwestern Pacific (Cheng et al., 2012a), from Philippines, China to Japan. With excellent properties and good meat quality, spiny head croaker has been favored by Chinese consumers with a high market value, whereas it has been overfished in the Yangtze estuary area of China (Hu et al., 2015). Furthermore, Sciaenidae fishes are well known for their well-developed otoliths (Xu et al., 2016), which are acellular crystalline mineral deposits in the inner ears of various teleost fishes (Pracheil et al., 2019).

Otoliths, mainly composed of calcium carbonate and organic matrix, play vital roles in sound sensing, balance, linear acceleration, and gravity in bony fishes (Schulz-Mirbach et al., 2019). Moreover, otoliths are widely applied in fisheries sciences, such as evaluation of fish populations or population migration patterns, and are essential for paleoichthyological and archeological studies (Barrett, 2019; Barnett et al., 2020; Heimbrand et al., 2020). Several critical genes, including Otoconin-90 (Oc90), secreted protein acidic and rich in cysteine (SPARC), SPARC-like1 (SPARCL1), otopetrin-1 (otop1), otolin-1 (otol1), and Otolithmatrixprotein-1 (OMP-1), have been identified to be related to otolith growth and formation in various vertebrates including bony fishes (Hurle et al., 2003; Murayama et al., 2005; Kang et al., 2008; Petko et al., 2008; Xu et al., 2016b; Hołubowicz et al., 2017). Nevertheless, a systematic screening of otolith-related genes in Sciaenidae species has not been reported yet.

With the rapid development of genome sequencing technology and genome-based bioinformatics methods, studies on aquatic genomes and related applications, such as molecular breeding, drug development, new biomaterials, and DNA barcoding technology, have been accumulated (Zhang et al., 2019; Houston et al., 2020; Li et al., 2020; Long et al., 2020). Whole-genome sequencing (WGS) of about 90 fishes has been published around the world by far (Bian et al., 2019; You et al., 2020). At present, genome studies on the Sciaenidae family have focused on the popular large yellow croaker (Larimichthys crocea) and its economic traits (Wu et al., 2014; Ao et al., 2015; Gui et al., 2018; Mu et al., 2018; Chen et al., 2019), while there are only few reports on other Sciaenidae species such as spiny head croaker.

Mitochondrial genome maps of spiny head croaker and candidate genes related to its sex determination have been examined by mitochondrial genome sequencing (Cheng et al., 2012b), chromosome assembly (Cai et al., 2019), and RNA sequencing (Song et al., 2020). However, the genetic basis of well-developed otoliths in Sciaenidae species is still unknown. It is therefore necessary to explore genomic resources to gain insights into otolith development mechanisms and to accelerate genome-assisted improvements in biodiversity protection, breeding, and disease prevention of Sciaenidae species.

In the past decades, Illumina short-read sequencing technology has been generally employed to assemble various fish genomes. However, with cost reduction of both short- and long-read sequencing technologies, more and more recent WGS projects have introduced PacBio long-read sequences in order to assemble high repetitive regions and improve assembly quality (You et al., 2020). Here, we produced a nonredundant chromosome-level assembly of the spiny head croaker by combination of Illumina short reads, PacBio Single Molecule Real-Time (SMRT) long reads, high-throughput chromosome conformation capture (Hi-C) data, and transcriptome sequences. Moreover, we performed comparative genomics studies on candidate genes related to otolith development to figure out potential mechanisms for the well-developed otoliths in the family Sciaenidae.

Materials and Method

Sample Collection and Sequencing

We extracted genomic DNAs from pooled muscle tissues of a wild female spiny head croaker and sequenced by using an Illumina HiSeq 2500 sequencing platform (San Diego, CA, United States) and a PacBio Sequel sequencing platform (Menlo Park, CA, United States). The construction of DNA libraries (insert sizes of 500 and 800 bp for Illumina, and 20 kb for PacBio) and subsequent sequencing were performed according to the standard protocols. In total, 52.48 Gb of raw Illumina data and five SMRT cells produced using the P6 polymerase/C4 chemistry, producing 35.24 Gb of PacBio long reads, were generated. After filtering by SOAPnuke (v.1.5.6; Chen et al., 2017), we obtained 49.12 Gb of Illumina clean data and 35.24 Gb of PacBio data for subsequent assembly.

To acquire a chromosome-level assembly of the genome, genomic DNAs were fixed with formaldehyde and were sheared by a restriction enzyme (MboI) to build a Hi-C library, and then sequenced by an Illumina HiSeq X Ten platform. A total of 185.33 Gb of 150 PE Hi-C data were generated. All sequenced data generated in this study were deposited in the CNGB Nucleotide Sequence Archive under Program no. CNP0001197.

We also extracted muscle RNA for transcriptome sequencing by using a HiSeq 2500 platform. Furthermore, to obtain the full-length transcript, the mixed RNA sample from 13 tissues was transcribed to generate full-length cDNA, and the SMRT bell library was constructed using the SMRT bell Template Prep Kit. The libraries were then prepared for sequencing on the PacBio Sequel sequencing platform.

Genome Assembly and Chromosome Assembly

Firstly, we employed Kmerfreq (https://github.com/fanagislab/kmerfreq) to estimate the genome size with 17-bp k-mers and applied GenomeScope (v1.0; Vurture et al., 2017) to estimate genome heterozygosity. Subsequently, a hybrid genome assembly pipeline was employed to obtain genome assembly. Short Illumina reads were first assembled by using Platanus with “-m 300 -k 27 -s 3” (Kajitani et al., 2014), and DBG2OLC (Ye et al., 2016) was performed to combine Platanus-generated contigs with PacBio reads to generate a hybrid contig assembly with default parameters. Pilon (v.1.225; (Walker et al., 2014) was employed to polish the hybrid assembly. After then, redundancies of the primary assembly were removed by Redundans (Pryszcz and Gabaldón, 2016) with “--identity 0.85 --overlap 0.36.”

We performed quality control of Hi-C raw reads and obtained valid Hi-C-connected reads by Juicer (v.1.5; Durand et al., 2016). A 3D de novo assembly (3D-DNA, v.180922; Dudchenko, et al., 2017) pipeline (Dudchenko et al., 2017) was applied to anchor primary contigs into chromosome-level scaffolds. Completeness of the genome assembly was evaluated using by BUSCO v3.0 (Simão et al., 2015) with “-l actinopterygii_odb9 -m genome -c 3 -sp zebrafish.”

De Novo Assembly of Transcriptomes

We de novo assembled the RNA-seq reads using the Trinity assembler (v2.9.0; Haas et al., 2013) and TGI clustering tool (TGICL; Pertea et al., 2003). The PacBio ISO-Seq3 pipeline (https://github.com/PacificBiosciences/IsoSeq) was used to obtain full-length non-chimeric (FLNC) transcripts via ccs, classify, cluster, and polish stage. FLNC was aligned with genome by minimap2.

Gene Prediction and Annotation

Repetitive elements in the spiny head croaker genome were identified through a combination of homolog-based and de novo approaches. For the homolog-based method, RepeatMasker (v.4.0.7) (Smit et al., 2019) and RepeatProteinMask (v.4.0.7; Smit et al., 2019) were used to detect repeats by aligning against the Repbase database (v 21.0; Bao et al., 2015). For the de novo method, LTRharvest (Ellinghaus et al., 2008) was applied to predict full long terminal repeat (LTR) retrotransposons. RepeatModeler (v1.0.11; Smit et al., 2019) was employed to build transposable element (TE) consensus sequences as a de novo TE library, and TRF (v.4.09; Benson, 1999) was used to obtain tandem repetitive sequences. RepeatMasker was then used to discover and identify repetitive sequences with the combined library of the de novo TEs.

Based on the repeat masked genome, we employed de novo, homology-based, and transcriptome-based prediction methods to annotate protein-coding genes in the assembled genome. Protein sequences of zebrafish (Danio rerio), three-spined stickleback (Gasterosteus aculeatus), Atlantic cod (Gadus morhua), channel catfish (Ictalurus punctatus), spotted gar (Lepisosteus oculatus), Nile tilapia (Oreochromis niloticus), fugu (Takifugu rubripes), downloaded from Ensembl (Hunt et al., 2018), and large yellow croaker (L. crocea, GCF_000972845.2) from NCBI were aligned to the spiny head croaker genome by tBLASTn (Kent, 2002) with “-e 1e-5.” Subsequently, GeneWise (Birney et al., 2004) was used to predict gene structures from BLAST hits. Augustus (v3.3.1; Stanke et al., 2006) was performed to predict de novo genes. We obtain 3,000 intact gene models generated from the homolog-based method randomly to train the parameters of AUGUSTUS then used AUGUSTUS to perform de novo prediction based on the repeat-masked genome with the training parameters. These gene sets that were predicted by different methods were integrated into a nonredundant gene set through the pipelines described in previous research (Xiong et al., 2016). After that, the combined gene set was modified with transcriptome data through PASA (v2.3.3; Haas et al., 2003).

Gene functional annotation was performed based on consensus of sequence and domain. The protein sequences were aligned to NCBI Non-Redundant Protein Sequence (NR) databases, Kyoto Encyclopedia of Genes and Genomes (KEGG v89.0; Kanehisa and Goto, 2000), SwissProt, and TrEMBL (Uniprot release 2020-06) (Boeckmann et al., 2003) by BLASTp with “-e 1e-5.” The domains were searched and predicted by using InterProScan v5.11–55.0 (Zdobnov and Apweiler, 2001) (Jones et al., 2014) with publicly available databases including PANTHER (Thomas et al., 2003), Pfam (Bateman et al., 2004), PRINTS (Attwood et al., 2000), ProDom (Servant et al., 2002), PROSITE profiles (Sigrist et al., 2010), and SMART (Letunic et al., 2012). Gene ontology (GO) terms (Ashburner et al., 2000) for each gene were predicted from the InterPro descriptions.

Genome Evolution and Gene Family Analysis

In order to identify gene families in spiny head croaker, we collected protein sequences of the same species used for homologous annotation as well as miiuy croaker (Miichthys miiuy) and performed the TreeFam methodology (Li et al., 2006) to obtain gene families of these species. We then used RaxML (Stamatakis, 2006) to construct the phylogenetic tree by using the single copy orthologous gene families with the GTRGAMMA model.

To identify the synteny between spiny head croaker and large yellow croaker, BLASTp was used to calculate pairwise similarities (e value < 1e-5), and MCScanX package with default parameters was then used for classification. Then, JCVI was performed to generate visualization.

A MCMCtree program in PAML (v4.9e; Yang, 2007) was performed to estimate the divergence time between various species in the phylogenetic tree with the REV substitution model. Three calibration time points based on the TimeTree database (http://www.timetree.org) were used as references (T. rubripes-G. aculeatus: 99–127 MYA; G. morhua-T. rubripes: 141–166 MYA; L. oculatus-D. rerio: 295–334 MYA), including spotted gar, zebrafish, Atlantic cod, fugu, and three-spined stickleback.

CAFÉ (v3.0; Han et al., 2013) was used to analyze gene family expansion and contraction under a maximum likelihood framework; single-copy orthologous gene families and estimated divergence time between different species were used as input files. To identify possible positive selected genes (PSGs), we first conducted multiple-sequence alignments based on the protein sequences of single-copy gene families by PRANK (Löytynoja, 2014), then the non-synonymous substitution rate (Ka) and synonymous substitution rate (Ks) were calculated by the codeml in PAML (v4.9e) with the branch-site model (cleandata = 1) and spiny head croaker was chosen as foreground species. Only the results with p-values <0.05 and false discovery rate (FDR) < 0.05 were considered as positive selected genes. Based on whole-genome annotation results and the official classification, we use the phyper in R (v3.5.2) to perform KEGG pathway enrichment analysis.

Phylogenetic Analysis of Otolith Related Genes

We downloaded the protein sequences of Otolin-1a, Otolin-1b, Otopetrin-1, Otoconin-90, OMP-1, SPARC, SPARCL1, and SPARCL2 from zebrafish and fugu and whole-genome sequences of four fishes (D. rerio, L. crocea, T. rubripes, and Dicentrarchus labrax) from NCBI and Ensembl. Nucleotide sequences of these genes were aligned from these four species and spiny head croaker (from the present study) by using BLAST, and filtered with identity, then related protein coding sequences were predicted by Exonerate (v.2.2.0) (Slater and Birney, 2005) or GeneWise firstly. Secondly, we converted coding sequences (CDS) to protein sequences and used PRANK (Löytynoja, 2014) to perform multiple-sequence alignments. RaxML (Stamatakis, 2006) was employed to construct a gene family phylogenetic tree with the PROTGAMMAAUTO model, and the genes from spotted gar as outgroup. We also predicted whether these protein-coding genes in Sciaenidae were involved in positive selection by using the codeml in PAML (v4.9e) with the branch-site model (cleandata = 1) and choosing the branch of spiny head croaker and large yellow croaker as foreground species, and searched the domains of these protein sequences by using NCBI Batch CD-Search and generated visualizations via EvolView (Subramanian et al., 2019).

According to the results of multiple-sequence alignments and PSG analysis, we selected those amino acid sites with inconsistency between the family Sciaenidae and other species. Potential functional effects of these residual substitutions were evaluated by PolyPhen-2 (Adzhubei et al., 2013; Li et al., 2018) and PROVEAN (Choi and Chan, 2015).

Results

Genome Sequencing and Assembly

We sequenced the genome of a wild female spiny head croaker by using an Illumina HiSeq sequencing platform as well as a PacBio Sequel sequencing platform. After data filtering, we obtained a total of 49.12-Gb Illumina clean reads by SOAPnuke and 35.24-Gb PacBio long reads, representing approximately 60-fold and 43-fold coverage of the spiny head croaker genome, respectively. An entire genome size of 811.25 Mb was estimated by the routine Kmerfreq method (with K = 17; https://github.com/fanagislab/kmerfreq). Employing a hybrid assembly method, we obtained a redundant assembly of 994.29 Mb and then used Redundans (Kajitani et al., 2014; Ye, et al., 2016) to reduce the redundant sequences. About 18.4% sequences in hybrid assembly were removed. We obtained a draft genome of 811.23 Mb with a contig N50 of 74.92 kb (Table 1, Supplementary Table S1). The mapping ratio with genome sequencing was 97.89% for the chromosome version (Supplementary Table S2).

TABLE 1

ParameterContigScaffold
Total length (bp)811,227,110817,240,112
Total number12,22010,973
Gap (bp)06,013,002
N50 (bp)74,89126,576,940
N90 (bp)38,24523,883
Maximum length (bp)428,34333,752,147
Hi-C anchored length (bp)0643,229,385
Hi-C anchored rate078.71%
GC content41.60%41.30%
Evaluation of BUSCO94.00%94.20%

Statistics of the genome assembly of spiny head croaker.

A total of 185.33 Gb Hi-C data were analyzed by Juicer, and contigs in the draft assembly were subsequently anchored into chromosomes by a 3D-DNA pipelin, resulting in a polished genome assembly of 817.24 Mb, with an improved scaffold N50 of 26.58 Mb (Supplementary Figure S1, Table 1). The final assembly consists of 24 chromosomes (ranging from 24.92 to 33.75 Mb in length) and covers 643.23 Mb which accounts for 78.71% of the whole genome (Supplementary Table S3).

We determined that approximately 94.2% of complete reference genes (82.3% single-copy and 11.9% duplicated) were detectable in the final assembly according to BUSCO values (Supplementary Table S4).

Gene Prediction and Annotation

In total, approximately 31.69% of the spiny head croaker assembled sequences were annotated as repetitive elements, which is higher than that for large yellow croaker (Ao et al., 2015; Mu et al., 2018). The repetitive sequences include 161.45 Mb of DNA transposons (∼19.90%), 76.60 Mb of long interspersed elements (LINEs, ∼9.44%), 73.46 Mb of long terminal repeats (LTRs, ∼9.06%), and other TEs (Table 2, Figure 1).

TABLE 2

ParameterRepBase TEsTE proteinsDe novoCombined TEs
Length (bp)% in genomeLength (bp)% in genomeLength (bp)% in genomeLength (bp)% in genome
DNA54,209,8636.6824524,882,4930.601865138,693,32317.09673161,446,48419.90152
LINE24,524,8583.0231814,324,1101.76573451,652,4256.36719776,599,2979.442399
SINE4,234,5080.521988002,415,5070.297766,410,2320.79019
LTR19,821,5822.4434079,298,4721.14622354,523,3166.72109173,457,4489.055103
Other9,2970.00114600009,2970.001146
Unknown000040,564,7625.0004240,564,7625.00042
Total94,358,16711.63153528,439,8833.505786219,148,46727.01444257,046,83831.68617

Repetitive elements in the assembled genome of spiny head croaker.

FIGURE 1

Using the repeat-masked genome assembly, we predicted a total of 29,509 genes after integration of de novo, homology-based, and transcriptome-based annotations (Table 3). The total annotated gene number is similar to that in a previous published genome of the spiny head croaker (Figure 1, Supplementary Figure S2). Based on functional annotation, we predicted 29,432 (∼99.74%) protein-coding genes with at least one assignment from Swiss-Prot, TrEMBL, InterProScan, Nr database, and KEGGor GO databases.

TABLE 3

EvidenceMethod/speciesNumbersAverage gene length (bp)Average CDS length (bp)Average exon length (bp)Average exon per geneAverage intron length (bp)
De novoAUGUSTUS40,8245,815.971,092.66170.206.42871.46
HomologDanio rerio26,9937,176.521,404.30175.128.02822.34
Gadus morhua26,9176,874.981,290.37166.717.74828.53
Gasterosteus aculeatus27,8057,206.741,366.09166.068.23808.25
Ictalurus punctatus27,5277,096.901,368.55175.287.81841.41
Larimichthys crocea31,4657,875.631,539.67173.398.88804.08
Lepisosteus oculatus25,5017,310.531,396.47174.298.01843.40
Oreochromis niloticus30,8497,664.181,543.67180.708.54811.43
Takifugu rubripes28,3817,038.131,340.72172.837.76843.14
Transcriptome16,1894,348.241,295.77160.298.082,609.32
Total29,5096,425.151,336.49174.597.65764.65

Predicted protein-coding genes in the genome of spiny head croaker.

Genome Evolution and Gene Family Analysis

To determine the phylogenetic relationship of spiny head croaker with other species, we compared its assembly with other nine representative fish genomes. We identified a total of 19,627 gene families (16,005 in spiny head croaker) and 3,955 single-copy orthologues from TreeFam. After construction of a phylogenetic tree by using the single-copy orthologous gene families, we observed that spiny head croaker is much closer to large yellow croaker (Figure 2, Supplementary Figure S3), and Cichlidae (such as Nile tilapia) is closely related to Sciaenidae (such as spiny head croaker, large yellow croaker, and miiuy croaker). According to the results of MCMCtree, we estimated that spiny head croaker and large yellow croaker diverged around 13 (5.8∼28.6) million years ago (Mya), and their ancestor diverged from tilapia around 81 (67.8∼97.4) Mya (Figure 2).

FIGURE 2

Based on the phylogenetic tree and species divergence time analysis, we employed CAFÉ to analyze the expansion and contraction of gene families. A total of 1,028 significantly expanded (p < 0.05) and 230 significantly contracted (p < 0.05) gene families were predicted in spiny head croaker (Figure 2). Interestingly, many expanded gene families were enriched in several important KEGG pathways (Figure 3A), such as “calcium signaling pathway” (p = 1.30e−39), “circadian entrainment” (p = 1.39e−32), “intestinal immune network for IgA production” (p = 4.40e−47), and “NOD-like receptor signaling pathway” (p = 5.68e−15).

FIGURE 3

According to the analysis of positive selection with single-copy gene families, 421 positive selected genes (PSGs; p < 0.05) were identified in spiny head croaker (Supplementary Table S5) by PAML (Yang, 2007). These PSGs were enriched in several interesting KEGG pathways, such as “amino sugar and nucleotide sugar metabolism,” “longevity regulating pathway,” and “fat digestion and absorption” (Figure 3B).

Phylogenetic Analysis of Otolith-Related Genes

We examined several critical otolith-related genes, including otol1a, otol1b, otop1, oc90, omp1, sparc, sparcl1, and sparcl2, to find genetic evidence for the well-developed otoliths in the family Sciaenidae. All sequences were derived from five representative fishes, including zebrafish, fugu, and three Perciformes species (large yellow croaker, spiny head croaker, and European sea bass), and the sequences of zebrafish and fugu (download from NCBI and Ensembl) were used as the queries (Supplementary Table S6).

Each of these genes was a single copy in these examined species. However, localization and multiple-sequence alignment displayed that the gene previously annotated as otol1 in the family Sciaenidae was more similar to zebrafish otol1b; another gene annotated as inner ear-specific collagen showed a higher sequence similarity to zebrafish otol1a. Domains of otolith-related genes were searched by NCBI Batch CD-Search, and our results proved that the examined domains of these genes were highly conserved in various species (see more details in Figure 4). Phylogenetic trees of these otolith-related genes were constructed, and their topological structures were consistent with the species tree. For example, large yellow croaker and spiny head croaker were clustered as sister groups (Figure 4), indicating that these genes had a closer relationship in these two croaker species than in other vertebrates.

FIGURE 4

From the results of multiple-sequence alignment, we found that, in large yellow croaker and spiny head croaker, some nucleotide variances led to amino acid changes in some genes when compared with other fishes (see Figures 58, Supplementary Figures S4–S11). Interestingly, we observed that 16 amino acid residues in SPARC of the family Sciaenidae are different from those in other fishes, although the sequences of the calcium-binding region in SPARC showed high conservation in most species; however, the position 274 of large yellow croaker and spiny head croaker is Asp (D) instead of Gln (Q) or Ser (S) (see more details in Figure 6). In Oc90, several residues were changed at PLA2L domains (Figure 7).

FIGURE 5

FIGURE 6

FIGURE 7

Based on the best branch-site model, we propose that two critical otolith-related genes (SPARCL1 and OC90) in the family Sciaenidae were positively selected (Supplementary Table S7). Moreover, based on the multiple alignments, Some of the amino acid substitution sites in these genes were predicted to have a possible effect on the proteins by PolyPhen-2 analysis (Adzhubei et al., 2013) and by PROVEAN analysis (Choi and Chan, 2015), such as position 319 of Otol1a, positions 65 and 204 of SPARC, and the positive selected site in SPARCL1. More importantly, two deletion sites in Otol1a (positions 320 and 321) were predicted to have a possible effect on the protein by PROVEAN (Figure 8). Through PAML, we further predicted whether these genes were under positive selection or not.

FIGURE 8

Discussion

Due to the high heterozygosity rate of some species, heterozygous regions are probably assembled repeatedly, resulting in a redundant genome with a larger size. According to GenomeScope, heterozygosity of spiny head croaker is about 1%. In the present study, we chose a hybrid assembly strategy with a combination of Illumina short reads and PacBio long reads and then employed Redundans (Pryszcz and Gabaldón, 2016) to remove the redundant region so as to obtain a final genome as much as possible to be equal to the estimated haploid genome size. Using this improved version of genome assembly, we reconstructed 24 chromosomes with additional Hi-C reads. The genome size of spiny head croaker (817.24 Mb) is bigger than that of large yellow croaker (708.47 Mb; Chen et al., 2019) and miiuy croaker (636.22 Mb; Xu et al., 2016a). According to our results, especially the annotation of repetitive sequences in this study, we speculate that the higher proportion of repetitive elements may be the major reason for the larger genome of spiny head croaker than its close relatives.

In our present study, a total of 29,509 genes were annotated from the spiny head croaker genome, which are more than those from a previous study (Cai et al., 2019). The number is higher than that of large yellow croaker (23,172 genes; Chen et al., 2019), miiuy croaker (21,960 genes; Xu et al., 2016a), and leopard coral grouper (25,248 genes; Zhou et al., 2020), whereas it is similar to Nile tilapia (29,249 genes; Conte et al., 2017). Moreover, our current work investigated gene families in spiny head croaker based on an integration of PacBio long-read sequencing and Hi-C technology. We identified 3,955 single-copy orthologues and observed that spiny head croaker is much close to large yellow croaker, which is consistent with previous mitochondrial genome studies (Cheng et al., 2012b). Further analysis of gene families showed that some gene families have significantly expanded in the spiny head croaker. Many gene families were enriched in several important pathways, such as “calcium signaling pathway” and “NOD-like receptor signaling pathway,” which may be related to some biological characteristics and basic physiological activities of this economically important fish.

Interestingly, in the “calcium signaling pathway,” we predicted that the calcium-binding protein (CaBP) gene family and parvalbumin gene family were significantly expanded in the spiny head croaker. Previous studies reported that CaBPs and parvalbumin are early markers of non-mitotic regenerating hair cells in bullfrog vestibular otolith (Steyger et al., 1997). CaBPs, located at the neuroretina, inner ear, and notochord, could modulate calcium levels and distribution, and thereby they were regarded as important regulators of essential neuronal target proteins (Haynes et al., 2012; Di Donato et al., 2013). Otolith-specific CaBPs were also detected in zebrafish (Söllner et al., 2003) and rainbow trout (Poznar et al., 2017). It was reported that the circadian rhythm of hair cells for secreting these CaBPs is likely to be a vital factor to cause the daily increase of otoliths (Suga and Nakahara, 2012).

Most of the genome-based studies of fishes have focused on growth traits, innate immunity, and/or sex determination (Cai et al., 2019; Zhou et al., 2019). However, no genome study related to the otolith growth and development has been reported yet. One of the main structural proteins in the organic matrix is Otolin-1a, also named as inner ear-specific collagen in some reports, containing calcium-binding sites; its C1q-like domain forms a stable trimer in calcium-containing solutions, suggesting that it participates in the correct arrangement of otolith to the inner ear sensory epithelium and may act as a nucleating agent for crystallization and stabilization of the otolith matrix (Murayama et al., 2005; Hołubowicz et al., 2017). Interestingly, we found that in the third collagen-like domain of Otol1a, the 319th amino acid is substituted by Ala (A), and both positions of 320 and 321 were lost in the family Sciaenidae.

SPARC, a major bone protein with an essential role for fish otolith normal growth and development (Kang et al., 2008), is multifunctional. It is able to bind both collagens and calcium. SPARCL1 and SPARCL2 were derived from SPARC through whole-genome duplication (WGD). When oc90 is absent, both sparc and sparcl1 mRNA levels were significantly upregulated to compensate for the lack of Oc90 and promoted biomineralization of murine otoconia (Xu et al., 2010). In our present study, the sequences of calcium-binding region in SPARC of various fishes showed high conservation in most sites; however, the position 274 of large yellow croaker and spiny head croaker is Asp (D), which might have a higher calcium-binding affinity at a high pH condition (Tang and Skibsted, 2016) and act as a crystal nucleation center whether directly binding with inorganic crystals or interacting with crystal binding proteins; in fact, these two modes are involved in the interaction between bone matrix protein and hydroxyapatite (or apatite; Xu et al., 2010). The PolyPhen-2 and PROVEAN results of amino acid substitution sites 65 and 204 showed possible functional changes of SPARC from the family Sciaenidae. All these data suggest that SPARC in family Sciaenidae plays an important role in calcium and collagen binding capacity, which may relate with formation of well-developed otoliths. While SPARCL1 in the family Sciaenidae was detected as a positive selected gene, whether its function in fish otoliths is similar to that in mice remains unclear. Verification by more studies is required.

Oc90 is a matrix protein of otolith with two PLA2L domains. Although these two domains do not possess enzymatic activity, they contain potential glycosylation sites, retain calcium-binding capacity, and provide a rigid structure for potential CaCO3 deposition (Petko et al., 2008). It is necessary for the early events of otolith biomineralization to play an important role in recruiting other proteins to form the organic matrix (Yang et al., 2011). In the Oc90 of spiny head croaker and large yellow croaker, several sites were substituted at PLA2L domains, which may lead to some changes in the calcium-binding capacity of this protein.

According to the substitutions or deletions of the abovementioned candidate gene sites that were identified at the genomic level, we speculate that these changes may have a relationship with otolith formation, resulting in the interesting status of well-developed otoliths in the family Sciaenidae.

Conclusion

In this study, a high-quality chromosome-level genome assembly of spiny head croaker was constructed. Some amino acid substitutions or deletions in several otolith-related genes (such as substitutions in Oc90, Otol1a, SPARC, and deletions in Otol1a) were identified. These changes may be critical for well-developed otoliths in the family Sciaenidae. Our genome resources will provide genetic assistance for in-depth studies on detailed molecular mechanisms of the formation and development of well-developed otoliths in various Sciaenidae species.

Statements

Data availability statement

The original contributions presented in the study are publicly available in the CNGB Nucleotide Sequence Archive using accession number CNP0001197.

Ethics statement

The animal study was reviewed and approved by All animal study protocols were approved by the Ethics Committee of the Chinese Academy of Fishery Sciences.

Author contributions

WS and XY designed and conceived the study. WG and CZ performed the experiments, wrote the manuscript, and performed the data analyses. XL, CB, and QS revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Basic Research Fund for State-Level Nonprofit Research Institutes of ESCFRI, CAFS (No. Dong2019M02), Key Project of Zhejiang Province of China (2020C02015), Grant Plan for Demonstration City Project for Marine Economic Development in Shenzhen (No. 86), Natural Science Foundation for Fundamental Research in Shenzhen (No. JCYJ20190812105801661), and Shenzhen Science and Technology Program for International Cooperation (No. GJHZ20190819152407214).

Conflict of interest

Authors CZ, XL, CB, QS, and XY were employed by the company BGI.

The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.730255/full#supplementary-material

References

  • 1

    AdzhubeiI.JordanD. M.SunyaevS. R. (2013). Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2. Curr. Protoc. Hum. Genet.76 (1), 720. 10.1002/0471142905.hg0720s76

  • 2

    AoJ.MuY.XiangL.-X.FanD.FengM.ZhangS.et al (2015). Genome Sequencing of the Perciform Fish Larimichthys Crocea Provides Insights into Molecular and Genetic Mechanisms of Stress Adaptation. Plos Genet.11 (4), e1005118. 10.1371/journal.pgen.1005118

  • 3

    AshburnerM.BallC. A.BlakeJ. A.BotsteinD.ButlerH.CherryJ. M.et al (2000). Gene Ontology: Tool for the Unification of Biology. Nat. Genet.25 (1), 2529. 10.1038/75556

  • 4

    AttwoodT. K.CroningM. D. R.FlowerD. R.LewisA. P.MabeyJ. E.ScordisP.et al (2000). PRINTS-S: the Database Formerly Known as PRINTS. Nucleic Acids Res.28, 225227. 10.1093/nar/28.1.225

  • 5

    BaoW.KojimaK. K.KohanyO. (2015). Repbase Update, a Database of Repetitive Elements in Eukaryotic Genomes. Mobile Dna6 (1), 11. 10.1186/s13100-015-0041-9

  • 6

    BarnettB. K.ChantonJ. P.AhrensR.ThorntonL.PattersonW. F.3rd (2020). Life History of Northern Gulf of Mexico Warsaw Grouper Hyporthodus Nigritus Inferred from Otolith Radiocarbon Analysis. PLoS ONE15 (1), e0228254. 10.1371/journal.pone.0228254

  • 7

    BarrettJ. H. (2019). An Environmental (Pre)history of European Fishing: Past and Future Archaeological Contributions to Sustainable Fisheries. J. Fish. Biol.94 (6), 10331044. 10.1111/jfb.13929

  • 8

    BatemanA.CoinL.DurbinR.FinnR. D.HollichV.Griffiths-JonesS.et al (2004). The Pfam Protein Families Database. Nucleic Acids Res.32, 138D141D. 10.1093/nar/gkh121

  • 9

    BensonG. (1999). Tandem Repeats Finder: a Program to Analyze DNA Sequences. Nucleic Acids Res.27 (2), 573580. 10.1093/nar/27.2.573

  • 10

    BianC.HuangY.LiJ.YouX.YiY.GeW.et al (2019). Divergence, Evolution and Adaptation in ray-finned Fish Genomes. Sci. China Life Sci.62, 10031018. 10.1007/s11427-018-9499-5

  • 11

    BirneyE.ClampM.DurbinR. (2004). GeneWise and Genomewise. Genome Res.14 (5), 988995. 10.1101/gr.1865504

  • 12

    BoeckmannB.BairochA.ApweilerR.BlatterM.-C.EstreicherA.GasteigerE.et al (2003). The SWISS-PROT Protein Knowledgebase and its Supplement TrEMBL in 2003. Nucleic Acids Res.31 (1), 365370. 10.1093/nar/gkg095

  • 13

    CaiM.ZouY.XiaoS.LiW.HanZ.HanF.et al (2019). Chromosome Assembly of Collichthys Lucidus, a Fish of Sciaenidae with a Multiple Sex Chromosome System. Sci. Data6 (1), 132. 10.1038/s41597-019-0139-x

  • 14

    ChenB.ZhouZ.KeQ.WuY.BaiH.PuF.et al (2019). The Sequencing and De Novo Assembly of the Larimichthys Crocea Genome Using PacBio and Hi-C Technologies. Sci. Data6 (1), 188. 10.1038/s41597-019-0194-3

  • 15

    ChenY.ChenY.ShiC.HuangZ.ZhangY.LiS.et al (2017). SOAPnuke: a MapReduce Acceleration-Supported Software for Integrated Quality Control and Preprocessing of High-Throughput Sequencing Data. Gigascience7 (1), gix120. 10.1093/gigascience/gix120

  • 16

    ChengJ.MaG.-q.MiaoZ.-q.ShuiB.-n.GaoT.-x. (2012a). Complete Mitochondrial Genome Sequence of the Spinyhead Croaker Collichthys Lucidus (Perciformes, Sciaenidae) with Phylogenetic Considerations. Mol. Biol. Rep.39 (4), 42494259. 10.1007/s11033-011-1211-6

  • 17

    ChengJ.MaG.-q.SongN.GaoT.-x. (2012b). Complete Mitochondrial Genome Sequence of Bighead Croaker Collichthys Niveatus (Perciformes, Sciaenidae): a Mitogenomic Perspective on the Phylogenetic Relationships of Pseudosciaeniae. Gene491 (2), 210223. 10.1016/j.gene.2011.09.020

  • 18

    ChoiY.ChanA. P. (2015). PROVEAN Web Server: a Tool to Predict the Functional Effect of Amino Acid Substitutions and Indels. Bioinformatics31 (16), 27452747. 10.1093/bioinformatics/btv195

  • 19

    ConteM. A.GammerdingerW. J.BartieK. L.PenmanD. J.KocherT. D. (2017). A High Quality Assembly of the Nile Tilapia (Oreochromis niloticus) Genome Reveals the Structure of Two Sex Determination Regions. BMC genomics18 (1), 341. 10.1186/s12864-017-3723-5

  • 20

    Di DonatoV.AuerT. O.DuroureK.Del BeneF. (2013). Characterization of the Calcium Binding Protein Family in Zebrafish. PLoS One8 (1), e53299. 10.1371/journal.pone.0053299

  • 21

    DudchenkoO.BatraS. S.OmerA. D.NyquistS. K.HoegerM.DurandN. C.et al (2017). De Novo assembly of the Aedes aegypti Genome Using Hi-C Yields Chromosome-Length Scaffolds. Science356 (6333), 9295. 10.1126/science.aal3327

  • 22

    DurandN. C.ShamimM. S.MacholI.RaoS. S. P.HuntleyM. H.LanderE. S.et al (2016). Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. Cel Syst.3 (1), 9598. 10.1016/j.cels.2016.07.002

  • 23

    EllinghausD.KurtzS.WillhoeftU. (2008). LTRharvest, an Efficient and Flexible Software for De Novo Detection of LTR Retrotransposons. BMC bioinformatics9 (1), 18. 10.1186/1471-2105-9-18

  • 24

    GuiJ. F.TangQ. S.LiZ. J.LiuJ. S.De SilvaS. S. (2018). Aquaculture in China: Success Stories and Modern Trends. John Wiley & Sons. Hoboken, NJ. USA. 10.1002/9781119120759

  • 25

    HaasB. J.DelcherA. L.MountS. M.WortmanJ. R.SmithR. K.JrHannickL. I.et al (2003). Improving the Arabidopsis Genome Annotation Using Maximal Transcript Alignment Assemblies. Nucleic Acids Res.31 (19), 56545666. 10.1093/nar/gkg770

  • 26

    HaasB. J.PapanicolaouA.YassourM.GrabherrM.BloodP. D.BowdenJ.et al (2013). De Novo transcript Sequence Reconstruction from RNA-Seq Using the Trinity Platform for Reference Generation and Analysis. Nat. Protoc.8 (8), 14941512. 10.1038/nprot.2013.084

  • 27

    HanM. V.ThomasG. W. C.Lugo-MartinezJ.HahnM. W. (2013). Estimating Gene Gain and Loss Rates in the Presence of Error in Genome Assembly and Annotation Using CAFE 3. Mol. Biol. Evol.30, 19871997. 10.1093/molbev/mst100

  • 28

    HaynesL. P.McCueH. V.BurgoyneR. D. (2012). Evolution and Functional Diversity of the Calcium Binding Proteins (CaBPs). Front. Mol. Neurosci.5, 9. 10.3389/fnmol.2012.00009

  • 29

    HeimbrandY.LimburgK. E.HüssyK.CasiniM.SjöbergR.Palmén BrattA. M.et al (2020). Seeking the True Time: Exploring Otolith Chemistry as an Age‐determination Tool. J. Fish. Biol.97, 552565. 10.1111/jfb.14422

  • 30

    HoustonR. D.BeanT. P.MacqueenD. J.GundappaM. K.JinY. H.JenkinsT. L.et al (2020). Harnessing Genomics to Fast-Track Genetic Improvement in Aquaculture. Nat. Rev. Genet.21 (7), 389409. 10.1038/s41576-020-0227-y

  • 31

    HołubowiczR.WojtasM.TaubeM.KozakM.OżyharA.DobryszyckiP. (2017). Effect of Calcium Ions on Structure and Stability of the C1q-like Domain of Otolin-1 from Human and Zebrafish. Febs J.284 (24), 42784297. 10.1111/febs.14308

  • 32

    HuY.ZhangT.YangG.ZhaoF.HouJ. L.ZhangL. Z.et al (2015). Assessment of Resource Situation of Collichthys Lucidus in Coastal Waters of the Yangtze Estuary. Ying Yong Sheng Tai Xue Bao26 (9), 28672873.

  • 33

    HuntS. E.McLarenW.GilL.ThormannA.SchuilenburgH.SheppardD.et al (2018). Ensembl Variation Resources). Ensembl Variation Resources. Database2018, bay119. 10.1093/database/bay119

  • 34

    HurleB.IgnatovaE.MassironiS. M.MashimoT.RiosX.ThalmannI.et al (2003). Non-syndromic Vestibular Disorder with Otoconial Agenesis in Tilted/mergulhador Mice Caused by Mutations in Otopetrin 1. Hum. Mol. Genet.12 (7), 777789. 10.1093/hmg/ddg087

  • 35

    JonesP.BinnsD.ChangH. Y.FraserM.LiW.McAnullaC.et al (2014). InterProScan 5: Genome-Scale Protein Function Classification. Bioinform.30 (9), 12361240. 10.1093/bioinformatics/btu031

  • 36

    KajitaniR.ToshimotoK.NoguchiH.ToyodaA.OguraY.OkunoM.et al (2014). Efficient De Novo Assembly of Highly Heterozygous Genomes from Whole-Genome Shotgun Short Reads. Genome Res.24 (8), 13841395. 10.1101/gr.170720.113

  • 37

    KanehisaM.GotoS. (2000). KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res.28 (1), 2730. 10.1093/nar/28.1.27

  • 38

    KangY.-J.StevensonA. K.YauP. M.KollmarR. (2008). Sparc Protein Is Required for normal Growth of Zebrafish Otoliths. J. Assoc. Res. Otolaryngol.9 (4), 436451. 10.1007/s10162-008-0137-8

  • 39

    KentW. J. (2002). BLAT---The BLAST-like Alignment Tool. Genome Res.12 (4), 656664. 10.1101/gr.22920210.1101/gr.229202

  • 40

    LetunicI.DoerksT.BorkP. (2012). SMART 7: Recent Updates to the Protein Domain Annotation Resource. Nucleic Acids Res.40, D302D305. 10.1093/nar/gkr931

  • 41

    LiF.BianL.GeJ.HanF.LiuZ.LiX.et al (2020). Chromosome‐level Genome Assembly of the East Asian Common octopus (Octopus Sinensis) Using PacBio Sequencing and Hi‐C Technology. Mol. Ecol. Resour.20, 15721582. 10.1111/1755-0998.13216

  • 42

    LiH.CoghlanA.RuanJ.CoinL. J.HericheJ.-K.OsmotherlyL.et al (2006). TreeFam: a Curated Database of Phylogenetic Trees of Animal Gene Families. Nucleic Acids Res.34 (Suppl. l_1), D572D580. 10.1093/nar/gkj118

  • 43

    LiJ.-T.GaoY.-D.XieL.DengC.ShiP.GuanM.-L.et al (2018). Comparative Genomic Investigation of High-Elevation Adaptation in Ectothermic Snakes. Proc. Natl. Acad. Sci. USA115 (33), 84068411. 10.1073/pnas.1805348115

  • 44

    LongL.AssarafY. G.LeiZ.-N.PengH.YangL.ChenZ.-S.et al (2020). Genetic Biomarkers of Drug Resistance: A Compass of Prognosis and Targeted Therapy in Acute Myeloid Leukemia. Drug Resist. Updates52, 100703. 10.1016/j.drup.2020.100703

  • 45

    LöytynojaA. (2014). Phylogeny-aware Alignment with PRANK. Mult. Seq. alignment Methods, 1079, 155170. 10.1007/978-1-62703-646-7_10

  • 46

    MuY.HuoJ.GuanY.FanD.XiaoX.WeiJ.et al (2018). An Improved Genome Assembly for Larimichthys Crocea Reveals Hepcidin Gene Expansion with Diversified Regulation and Function. Commun. Biol.1 (1), 112. 10.1038/s42003-018-0207-3

  • 47

    MurayamaE.HerbomelP.KawakamiA.TakedaH.NagasawaH. (2005). Otolith Matrix Proteins OMP-1 and Otolin-1 Are Necessary for normal Otolith Growth and Their Correct Anchoring onto the Sensory Maculae. Mech. Dev.122 (6), 791803. 10.1016/j.mod.2005.03.002

  • 48

    PerteaG.HuangX.LiangF.AntonescuV.SultanaR.KaramychevaS.et al (2003). TIGR Gene Indices Clustering Tools (TGICL): a Software System for Fast Clustering of Large EST Datasets. Bioinformatics19 (5), 651652. 10.1093/bioinformatics/btg034

  • 49

    PetkoJ. A.MillimakiB. B.CanfieldV. A.RileyB. B.LevensonR. (2008). Otoc1: A Novel Otoconin-90 Ortholog Required for Otolith Mineralization in Zebrafish. Devel Neurobio68 (2), 209222. 10.1002/dneu.20587

  • 50

    PoznarM.HołubowiczR.WojtasM.GapińskiJ.BanachowiczE.PatkowskiA.et al (2017). Structural Properties of the Intrinsically Disordered, Multiple Calcium Ion-Binding Otolith Matrix Macromolecule-64 (OMM-64). Biochim. Biophys. Acta (Bba) - Proteins Proteomics1865 (11 Pt A), 13581371. 10.1016/j.bbapap.2017.08.019

  • 51

    PracheilB. M.GeorgeR.ChakoumakosB. C. (2019). Significance of Otolith Calcium Carbonate crystal Structure Diversity to Microchemistry Studies. Rev. Fish. Biol. Fish.29, 569588. 10.1007/s11160-019-09561-3

  • 52

    PryszczL. P.GabaldónT. (2016). Redundans: an Assembly Pipeline for Highly Heterozygous Genomes. Nucleic Acids Res.44 (12), e113. 10.1093/nar/gkw294

  • 53

    Schulz-MirbachT.LadichF.PlathM.HessM. (2019). Enigmatic Ear Stones: what We Know about the Functional Role and Evolution of Fish Otoliths. Biol. Rev.94 (2), 457482. 10.1111/brv.12463

  • 54

    ServantF.BruC.CarrèreS.CourcelleE.GouzyJ.PeyrucD.et al (2002). ProDom: Automated Clustering of Homologous Domains. Brief. Bioinform.3, 246251. 10.1093/bib/3.3.246

  • 55

    SigristC. J. A.CeruttiL.De CastroE.Langendijk-GenevauxP. S.BulliardV.BairochA.et al (2010). PROSITE, a Protein Domain Database for Functional Characterization and Annotation. Nucleic Acids Res.38, D161D166. 10.1093/nar/gkp885

  • 56

    SimãoF. A.WaterhouseR. M.IoannidisP.KriventsevaE. V.ZdobnovE. M. (2015). BUSCO: Assessing Genome Assembly and Annotation Completeness with Single-Copy Orthologs. Bioinformatics31 (19), 32103212. 10.1093/bioinformatics/btv351

  • 57

    SlaterG.BirneyE. (2005). Automated Generation of Heuristics for Biological Sequence Comparison. BMC bioinformatics6 (1), 31. 10.1186/1471-2105-6-31

  • 58

    SmitA.HubleyR.GreenP. (2019). RepeatMasker Open-4.0. 2013-2015, Available at: http://repeatmasker.org/.

  • 59

    SöllnerC.BurghammerM.Busch-NentwichE.BergerJ.SchwarzH.RiekelC.et al (2003). Control of crystal Size and Lattice Formation by Starmaker in Otolith Biomineralization. Science302 (5643), 282286. 10.1126/science.1088443

  • 60

    SongW.ZhangY.ZhangX.GuiJ. (2020). De Novo transcriptome Assembly of Four Organs of Collichthys Lucidus and Identification of Genes Involved in Sex Determination and Reproduction. PLoS ONE15 (3), e0230580. 10.1371/journal.pone.0230580

  • 61

    StamatakisA. (2006). RAxML-VI-HPC: Maximum Likelihood-Based Phylogenetic Analyses with Thousands of Taxa and Mixed Models. Bioinformatics22 (21), 26882690. 10.1093/bioinformatics/btl446

  • 62

    StankeM.KellerO.GunduzI.HayesA.WaackS.MorgensternB. (2006). AUGUSTUS: Ab Initio Prediction of Alternative Transcripts. Nucleic Acids Res.34 (Suppl. l_2), W435W439. 10.1093/nar/gkl200

  • 63

    SteygerP. S.BurtonM.HawkinsJ. R.SchuffN. R.BairdR. A. (1997). Calbindin and Parvalbumin Are Early Markers of Non-mitotically Regenerating Hair Cells in the Bullfrog Vestibular Otolith Organs. Int. J. Dev. Neurosci.15 (4-5), 417432. 10.1016/s0736-5748(96)00101-3

  • 64

    SubramanianB.GaoS.LercherM. J.HuS.ChenW.-H. (2019). Evolview V3: a Webserver for Visualization, Annotation, and Management of Phylogenetic Trees. Nucleic Acids Res.47 (W1), W270W275. 10.1093/nar/gkz357

  • 65

    SugaS.NakaharaH. (2012). Mechanisms and Phylogeny of Mineralization in Biological Systems: Biomineralization’90. Springer Science & Business , Media, Berlin, Germany.

  • 66

    TangN.SkibstedL. H. (2016). Calcium Binding to Amino Acids and Small glycine Peptides in Aqueous Solution: toward Peptide Design for Better Calcium Bioavailability. J. Agric. Food Chem.64 (21), 43764389. 10.1021/acs.jafc.6b01534

  • 67

    ThomasP. D.CampbellM. J.KejariwalA.MiH.KarlakB.DavermanR.et al (2003). PANTHER: a Library of Protein Families and Subfamilies Indexed by Function. Genome Res.13, 21292141. 10.1101/gr.772403

  • 68

    VurtureG. W.SedlazeckF. J.NattestadM.UnderwoodC. J.FangH.GurtowskiJ.et al (2017). GenomeScope: Fast Reference-free Genome Profiling from Short Reads. Bioinformatics33 (14), 22022204. 10.1093/bioinformatics/btx153

  • 69

    WalkerB. J.AbeelT.SheaT.PriestM.AbouellielA.SakthikumarS.et al (2014). Pilon: an Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PloS one9 (11), e112963. 10.1371/journal.pone.0112963

  • 70

    WuC.ZhangD.KanM.LvZ.ZhuA.SuY.et al (2014). The Draft Genome of the Large Yellow Croaker Reveals Well-Developed Innate Immunity. Nat. Commun.5 (1), 17. 10.1038/ncomms6227

  • 71

    XiongZ.LiF.LiQ.ZhouL.GambleT.ZhengJ.et al (2016). Draft Genome of the Leopard Gecko, Eublepharis macularius. GigaSci5 (1), 47. 10.1186/s13742-016-0151-4

  • 72

    XuT.XuG.CheR.WangR.WangY.LiJ.et al (2016a). The Genome of the Miiuy Croaker Reveals Well-Developed Innate Immune and Sensory Systems. Sci. Rep.6, 21902. 10.1038/srep21902

  • 73

    XuY.ZhangH.YangH.ZhaoX.LovasS.LundbergY. Y. W. (2010). Expression, Functional, and Structural Analysis of Proteins Critical for Otoconia Development. Dev. Dyn.239 (10), 26592673. 10.1002/dvdy.22405

  • 74

    XuY.ZhangY.LundbergY. W. (2016b). Spatiotemporal Differences in Otoconial Gene Expression. Genesis54 (12), 613625. 10.1002/dvg.22990

  • 75

    YangH.ZhaoX.XuY.WangL.HeQ.LundbergY. W. (2011). Matrix Recruitment and Calcium Sequestration for Spatial Specific Otoconia Development. PLoS ONE6 (5), e20498. 10.1371/journal.pone.0020498

  • 76

    YangZ. (2007). PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol. Biol. Evol.24 (8), 15861591. 10.1093/molbev/msm088

  • 77

    YeC.HillC. M.WuS.RuanJ.MaZ. (2016). DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies. Sci. Rep.6, 31900. 10.1038/srep31900

  • 78

    YouX.ShanX.ShiQ. (2020). Research Advances in the Genomics and Applications for Molecular Breeding of Aquaculture Animals, Aquaculture526, 735357. 10.1016/j.aquaculture.2020.735357

  • 79

    ZdobnovE. M.ApweilerR. (2001). InterProScan - an Integration Platform for the Signature-Recognition Methods in InterPro. Bioinformatics17 (9), 847848. 10.1093/bioinformatics/17.9.847

  • 80

    ZhangX.ZhouL.GuiJ. (2019). Biotechnological Innovation in Genetic Breeding and Sustainable green Development in Chinese Aquaculture. Sci. Sin.-Vitae49 (11), 14091429. 10.1360/SSV-2019-0142

  • 81

    ZhouQ.GaoH.ZhangY.FanG.XuH.ZhaiJ.et al (2019). A Chromosome‐level Genome Assembly of the Giant Grouper (Epinephelus Lanceolatus) Provides Insights into its Innate Immunity and Rapid Growth. Mol. Ecol. Resour.19 (5), 13221332. 10.1111/1755-0998.13048

  • 82

    ZhouQ.GuoX.HuangY.GaoH.XuH.LiuS.et al (2020). De Novo sequencing and Chromosomal‐scale Genome Assembly of Leopard Coral Grouper, Plectropomus leopardus. Mol. Ecol. Resour.20, 14031413. 10.1111/1755-0998.13207

Summary

Keywords

spiny head croaker, whole genome sequencing, chromosome-level genome assembly, otolith development, Sciaenidae

Citation

Gan W, Zhao C, Liu X, Bian C, Shi Q, You X and Song W (2021) Whole-Genome Sequencing and Genome-Wide Studies of Spiny Head Croaker (Collichthys lucidus) Reveals Potential Insights for Well-Developed Otoliths in the Family Sciaenidae. Front. Genet. 12:730255. doi: 10.3389/fgene.2021.730255

Received

24 June 2021

Accepted

06 September 2021

Published

30 September 2021

Volume

12 - 2021

Edited by

Liang Guo, Chinese Academy of Fishery Sciences (CAFS), China

Reviewed by

Manu Kumar Gundappa, University of Edinburgh, United Kingdom

Changxu Tian, Guangdong Ocean University, China

Huaping Zhu, Chinese Academy of Fishery Sciences, China

Updates

Copyright

*Correspondence: Xinxin You, ; Wei Song,

†These authors have contributed equally to this work

This article was submitted to Livestock Genomics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics