Identifying Candidate Genes Involved in the Regulation of Early Growth Using Full-Length Transcriptome and RNA-Seq Analyses of Frontal and Parietal Bones and Vertebral Bones in Bighead Carp (Hypophthalmichthys nobilis)

Luo, Weiwei; Zhou, Ying; Wang, Junru; Yu, Xiaomu; Tong, Jingou

doi:10.3389/fgene.2020.603454

ORIGINAL RESEARCH article

Front. Genet., 15 January 2021

Sec. Livestock Genomics

Volume 11 - 2020 | https://doi.org/10.3389/fgene.2020.603454

Identifying Candidate Genes Involved in the Regulation of Early Growth Using Full-Length Transcriptome and RNA-Seq Analyses of Frontal and Parietal Bones and Vertebral Bones in Bighead Carp (Hypophthalmichthys nobilis)

Weiwei Luo^1,2

Ying Zhou^1,2

Junru Wang^1,2

Xiaomu Yu^1*

Jingou Tong^1*

¹State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, The Innovation Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
²College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China

Growth, one of the most important traits monitored in domestic animals, is essentially associated with bone development. To date, no large-scale transcriptome studies investigating bone development in bighead carp have been reported. In this study, we applied Isoform-sequencing technology to uncover the entire transcriptomic landscape of the bighead carp (Hypophthalmichthys nobilis) in early growth stage, and obtained 63,873 non-redundant transcripts, 20,907 long non-coding RNAs, and 1,579 transcription factors. A total of 381 alternative splicing events were seen in the frontal and parietal bones with another 784 events simultaneously observed in the vertebral bones. Coupling this to RNA sequencing (RNA-seq) data, we identified 27 differentially expressed unigenes (DEGs) in the frontal and parietal bones and 45 DEGs in the vertebral bones in the fast-growing group of fish, when compared to the slow-growing group of fish. Finally, 15 key pathways and 20 key DEGs were identified and found to be involved in regulation of early growth such as energy metabolism, immune function, and cytoskeleton function and important cellular pathways such as the arginine and proline metabolic pathway (p4ha1), FoxO signaling pathway (sgk1), cell adhesion molecules (b2m, ptprc, and mhcII), and peroxisome proliferator-activated receptor signaling pathway (scd). We established a novel full-length transcriptome resource and combined it with RNA-seq to elucidate the mechanism of genetic regulation of differential growth in bighead carp. The key DEGs identified in this study could fuel further studies investigating associations between growth and bone development and serve as a source of potential candidate genes for marker-assisted breeding programs.

Introduction

Growth is one of the most economically important traits monitored in domestic animals and therefore is the main objective of most genetic selection programs. Growth rates of aquatic animals, as well as of most organisms, are regulated by multiple genetic and environmental factors (Cleveland et al., 2020). Over the past few decades, the genetic basis of growth in aquatic species has been widely studied. Genetic linkage analyses for genes regulating growth in aquatic animals have been performed using DNA-based molecular markers such as amplified fragment length polymorphism and simple sequence repeat (SSR) (Liu and Cordes, 2004; Yue, 2014). With the development of next-generation sequencing technology, quantitative trait loci (QTL) mapping and genome-wide association study have allowed for the identification of genetic markers associated with the traits of interest. Additionally, transcriptome analyses can be used to identify biomarkers and candidate genes regulating growth. A total of 92 differentially expressed genes (DEGs) were identified between fast-growth and slow-growth families of blunt snout bream (Megalobrama amblycephala) in transcriptomes of the liver and gill (Li et al., 2015). Cleveland et al. (2020) compared liver and muscle transcriptomes of a rainbow trout (Oncorhynchus mykiss) line, selectively bred for fast growth, to that of a contemporary randomly mated control line and identified 145 and 36 DEGs in the liver and white muscle transcriptomes, respectively. Our laboratory has also identified 173 DEGs in the hypothalamus–pituitary and 204 DEGs in the liver of bighead carp with differential growth rates (Fu B. et al., 2019). However, these studies have mainly focused on gene expression patterns in the muscle, liver, and brain tissues. Growth, however, is essentially associated with skeleton and bone development and leads to consequent increases in body weight (BW) and body length (BL). The frontal and parietal bones are important components of the skull in vertebrates, serving to protect the brain and permit skull expansion during development (Quarto et al., 2009; Topczewska et al., 2016). The presence of vertebral bone discriminates vertebrate organisms from invertebrates. It supports the BW of the animals and generates force in the muscles for locomotion and physiological activities (Ibaraki et al., 2015). To our knowledge, no large-scale transcriptomic study investigating the differential expression of genes, regulating differential growth rates, in the bones of bighead carp has been previously reported.

Bighead carp (Hypophthalmichthys nobilis), one of the most important fish in Asia, improves the water quality for human consumption and thus has been introduced in many non-Asian countries (Szabo et al., 2019). Principally produced in China, the global production of bighead carp reached 3.1 million tons in 2017 (FAO). The size of the head in aquaculture fish species is important not only for understanding evolution and biological adaptation but also for predicting the economic value, as it directly affects filet yield (Geng et al., 2016). For most aquaculture fish species, smaller head and uniform body conformations provide a greater proportion of carcass yield; thus, selection for smaller head and streamline body conformation is of great value in aquaculture (Rutten et al., 2005). However, the bigger head of bighead carp harvest fish is more popular among Chinese consumers and is used as a source of nutrition in soups (fish head and tofu soup) or delicious dishes (steamed fish head with diced hot red peppers and so on). Therefore, to meet the demands of markets and farmers, selection for fast-growing and big-headed varieties of bighead carp is prevalent among Chinese breeding programs.

Transcriptomic analyses, widely used in genetic research, uncover the type and expression levels of transcripts and identify physiological and biochemical processes that regulate transcription (Jia et al., 2018). However, owing to technical limitation, obtaining the full-length transcripts and quantifying the isoforms are not possible with RNA sequencing (RNA-seq) (Steijger et al., 2013). Thus, single-molecule real-time (SMRT) sequencing (Pacific Biosciences of California, Inc., Menlo Park, CA, United States) has been developed (Korlach et al., 2010), and it provides a third-generation sequencing platform that is widely used to sequence genomes by generating kilobase-sized sequencing reads (Chaisson et al., 2015; Gong et al., 2018). Moreover, recent studies have reduced the high error rate (up to 15%) of SMRT sequencing by self-correction, via circular-consensus sequencing reads (Li et al., 2014) and validation with high-throughput and high-accuracy short-read data (Au et al., 2013). SMRT sequencing comprehensively analyses splice isoforms of each gene and improves the annotation of existing model organisms by producing full-length transcripts (Tilgner et al., 2014; Wang et al., 2016; Mehjabin et al., 2019). In this study, we applied SMRT sequencing to uncover post-transcriptional regulatory events in bighead carp and coupled it with bone RNA-seq to investigate the genetic regulation of early growth at the transcript level. This is the first study to report the application of SMRT sequencing in bone tissues and provides a comprehensive view of transcriptome complexity in bighead carp at the early growth stage. Furthermore, we identified a set of differentially expressed unigenes (DEGs) from bone development and body growth-related signaling pathways, which potentially services as valuable resources for future molecular breeding studies.

Results

Morphological Measurement

In total, six bighead carp individuals in juvenile stage, including three fast-growing [big group, (BG)] and three slow-growing [small group (SG)] fish, were selected for transcriptome analysis in this study. The BL, head length (HL), head height (HH), head width (HW), and BW were measured (Table 1). The ratios of HL and BL (HL/BL) from the BG and SG were 0.3239 and 0.3533, respectively. The average BLs for BG and SG were 21.30 and 16.13 cm, respectively. BL from BG was significantly larger than that from SG (p < 0.05). Other phenotypic parameters from BG were significantly larger than those from SG, including HL, HH, HW, and BW (p < 0.05).

TABLE 1

Table 1. Phenotype differences of bighead carp between big and small groups.

Full-Length Transcripts From the Bone Tissues

Two full-length transcriptomes of bighead carp with different growth rates were generated using the PacBio Sequel platform on the pooled RNA from each of three frontal and parietal bones and vertebrae in BG and SG, respectively. F01 represented the full-length transcriptome from each of three frontal and parietal bones and vertebrae in BG; F02 represented the full-length transcriptome from each of three frontal and parietal bones and vertebrae in SG. A total of 27.06 and 30.11 G subread bases were generated by two SMRT cells from the PacBio library of BG and SG, respectively. Under the conditions of full passes of ≥3 and quality of >0.9, 374,738, and 402,340 circular consensus sequence (CCS) reads were identified in F01 and F02, with the mean read lengths of 1,521 and 1,772 bp, respectively. CCS reads of F01 and F02 comprised 225,336 full-length non-chimeric (FLNC) reads and 295,486 FLNC reads, respectively. Based on the iterative clustering for error correction (ICE) Quiver and arrow polishing algorithms, we produced 72,625 and 93,838 polished full-length consensus transcripts from F01 and F02 with the mean lengths of 1,496 and 1,741 bp, respectively. Percentages of polished high-quality isoforms from F01 and F02 were 96.13 and 96.50%, respectively. After correction using short reads produced by Illumina (San Diego, CA, United States), short-read RNA-seq, and subsequently removing redundancies using the CD-Hit program, the consensus transcripts were finally clustered into a total of 63,873 non-redundant transcripts for subsequent analysis, including 36,584 and 44,852 non-redundant transcripts from F01 and F02, respectively.

Sequencing and Quality Assessment of Short Reads

Twelve Illumina RNA-seq libraries constructed from frontal and parietal bones and vertebrae of bighead carp with different growth rates (BG and SG) were sequenced to correct the polished CCS and to quantify full-length transcripts obtained from PacBio Iso-Seq. After trimming process and screening with the high-quality reads, a total of 92.04 G clean reads were produced from all samples. An average of 80.67, 61.25, 78.78, and 86.66 million clean reads were obtained from the frontal and parietal bone cDNA libraries of BG (BF group), frontal and parietal bone cDNA libraries of SG (SF group), vertebra cDNA libraries of BG (BV group), and vertebra cDNA libraries of SG (SV group), respectively. In total, 24.16, 18.33, 23.60, and 25.94 Gb of clean bases were generated in the BF, SF, BV, and SV groups, respectively. The Q30 value of each sample was up to 93.03%, and the GC distribution of each sample ranged from 47.22 to 50.64% (Supplementary Table 1). The results showed that the sequencing quality was high, and the data could be used for subsequent analyses.

Efficient Gene Annotation of Full-Length Transcripts

To obtain a comprehensive functional annotation from the full-length bone transcriptome of bighead carp, all the 63,873 non-redundant transcripts were assigned to align with eight different databases, including NCBI non-redundant protein sequences (NR), euKaryotic orthologous groups (KOG), Cluster of Orthologous Groups of proteins (COG), Protein family (Pfam), Swiss-Prot (a manually annotated and reviewed protein sequence database), the Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology, Gene Ontology (GO), and eggNOG. A total of 68.49% of the non-redundant transcripts (43,747 of 63,873 transcripts) were successfully annotated with significant hits (E value < 1E^–5) from these databases. The statistics of the full-length transcript annotations are listed in Table 2. The remaining unannotated transcripts (20,126 transcripts) might represent novel bighead carp species-specific genes.

TABLE 2

Table 2. Summary of annotations on non-redundant transcripts from bone tissues against public databases.

For Nr annotation, 89.69% of homologous hits were assigned to six fish species, respectively, including Danio rerio, Astyanax mexicanus, O. mykiss, Esox lucius, Cyprinus carpio, and Ctenopharyngodon idella. GO annotations generated 58 level 2 GO terms (Figure 1). Among them, the three most abundant terms under the biological process category were cellular process (20,428 transcripts), single-organism process (17,294 transcripts), and biological regulation (14,248 transcripts). Within the cellular component (CC) category, cell (20,686 transcripts), cell part (20,659 transcripts), and organelle (13,936 transcripts) were the most abundant terms. Of the 16 terms in the molecular function (MF) category, binding (19,616 transcripts), catalytic activity (11,868 transcripts), and transporter activity (1,940 transcripts) had the highest number of transcripts. Three putative growth-related GO terms, including growth (involving 565 transcripts), immune system process (involving 1,931 transcripts), and metabolic process (involving 13,193 transcripts), were successfully annotated.

FIGURE 1

Figure 1. GO classification of non-redundant transcripts in bone tissues of bighead carp.

A total of 69.48% annotated transcripts (30,394 of 43,747 transcripts) were annotated in the KOG database, which can be assigned to 25 categories (Supplementary Figure 1). The largest number of functional categories was signal transduction mechanisms reaching 6,286 transcripts. The following four functional categories were general function prediction only (5,947 transcripts), post-translational modification, protein turnover, chaperones (2,882 transcripts), transcription (2,325 transcripts), and function unknown (2,240 transcripts), respectively.

In the KEGG classification, a total of 279 pathways annotated from 25,955 non-redundant transcripts were extracted from the bone transcriptome of bighead carp. The results showed that endocytosis (880 transcripts), mitogen-activated protein kinase (MAPK) signaling pathway (753 transcripts), focal adhesion (723 transcripts), regulation of actin cytoskeleton (709 transcripts), and herpes simplex infection (489 transcripts) were the top five pathways with the most abundant unigenes. Notably, we paid attention to 25 KEGG pathways (Figure 2), which may be closely associated with differential growth and involved in the physiological functions of immune, metabolism, and cytoskeleton of bighead carp, such as insulin signaling pathway, cytokine–cytokine receptor interaction, peroxisome proliferator-activated receptor (PPAR) signaling pathway, transforming growth factor β (TGF-β) signaling pathway, and regulation of actin cytoskeleton. Among these 25 KEGG pathways, endocytosis (880 unigenes), MAPK signaling pathway (753 unigenes), and regulation of actin cytoskeleton (709 unigenes) were the top three pathways with the most abundant unigenes, which provided some reference value for studying the early growth regulation and bone development in bighead carp.

FIGURE 2

Figure 2. Pathways associated with growth.

Prediction of Gene Families and Coding Sequences

We utilized the Coding Genome Reconstruction Tool (COGENT) to further partition these non-redundant transcripts into putative gene families and reconstruct each family into one or several full-length unique transcript models. Of the 63,873 non-redundant transcripts, COGENT constructed 5,474 gene families (Supplementary Table 2). A total of 44,895 ORFs (open reading frames) were identified with TransDecoder, including 34,093 complete ORFs. The length distribution of the complete coding protein sequences is shown in Supplementary Figure 2.

De novo Detection of Alternative Splicing Events and Long Non-coding RNA Prediction

A total of 381 and 784 pairs of FL transcripts from F01 and F02 that might represent alternative splicing (AS) events were detected (Supplementary Table 3). Additionally, because no reference genome was available for SMRT sequencing of transcriptome in bighead carp, we could not determine the types of AS events.

The numbers of long non-coding RNA (lncRNA) predicted from FL transcripts by Coding Potential Assessment Tool (CPAT), Coding–Non-coding Index (CNCI), Coding Potential Calculator (CPC), and Pfam were 26,254, 27,302, 22,045, and 27,426, respectively. The intersection of these four results yielded 20,907 lncRNAs (Figure 3).

FIGURE 3

Figure 3. Venn graph of lncRNA transcripts from CPAT, CNCI, CPC, and Pfam analyses.

Identification of Transcription Factors and SSRs

In our Iso-Seq, a total of 1,579 non-redundant transcription factors (TF) transcripts were identified, and their detailed information is shown in Supplementary Table 4. Based on the Animal TFDB 2.0 database classification, these TFs belong to more than 56 families, and a large number of TFs were dominant in zf-C2H2, miscellaneous, and TF_bZIP (Supplementary Figure 3). It was the first time to extensively identify TFs using transcriptome dataset in bighead carp, which provided a useful foundation for TFs studies in the future.

Simple sequence repeats, also known as microsatellite DNAs, have a tandem repeat motif of 1–6 bp in length. The characters of high polymorphism (mainly due to the differences in the number of tandem motifs), stability, and reliability for SSR enable it to be an ideal molecular marker that is widely used in genetic map construction, QTL mapping, and genetic diversity assessment. We searched for SSRs in the 50,694 bighead carp transcripts longer than 500 bp, and a total of 53,508 SSRs were identified, with 13,270 transcripts containing more than one SSR. Most of the SSRs identified were mononucleotide repeats (31,758, 59.35%), followed by the dinucleotide repeats (15,540, 29.04%), trinucleotide repeats (4,989, 9.32%), tetranucleotide repeats (1,009, 1.89%), hexanucleotide repeats (199, 0.37%), and pentanucleotide repeats (13, 0.02%). Among the 53,508 SSRs, 9,085 SSRs presented in compound formation. All SSRs and their primers are listed in Supplementary Table 5.

Identification of DEGs From Bone Tissues

To capture transcript-level expression differences related to different growth rates of bighead carp, the Illumina RNA-seq data of bone tissues, including the frontal and parietal bone and vertebra tissues, were aligned to the refined full-length transcripts of the Iso-Seq database for quantification. The DEGs of the frontal and parietal bone and vertebra tissues between the big and SGs were explored by using DEseq2 with the criteria of | log2 ratio| ≥ 1 and FDR (false discovery rate) < 0.01. Supplementary Table 6 was used as an input file to run DESeq2. In total, the numbers of DEGs from the frontal and parietal bone and vertebra tissues between BG and SG were as follows: 27 from the frontal and parietal bone tissues (15 up- and 12 down-regulated in BG) and 45 from vertebra tissues (24 up- and 21 down-regulated in BG). When we compared DEGs from bone tissues, 12 DEGs were shared in the frontal and parietal bones and vertebral bones between BG and SG, which were mainly associated with metabolic process, biological regulation, and immune system process, such as Golgi apparatus protein 1 (glg1), serine/threonine-protein kinase Sgk1(sgk1), and β2-microglobulin (b2m).

To elucidate the biological events of the DEGs from the frontal and parietal bone and vertebra tissues, which would be mainly involved in different growth rates of bighead carp at early growth stage, GO term enrichment analyses were conducted. First, GO terms were obtained on the basis of numbers of the DEGs assigned to each GO term (Figure 4). The result was as follows: Under the CC category in both the frontal and parietal bone and vertebra tissues, cell (GO: 0005623) and cell part (GO: 0044464) had the most abundant GO function items. Within the biological process (BP) category in both the frontal and parietal bone and vertebra tissues, higher percentages of genes were commonly clustered into cellular process (GO: 0009987), biological regulation (GO: 0065007), and metabolic process (GO: 0008152). In the MF category in both the frontal and parietal bone and vertebra tissues, most genes were assigned to binding (GO: 0005488), catalytic activity (GO: 0003824), and MF regulator (GO: 0003674). Second, GO terms of the DEGs were also identified according to top three p values of CC, BP, and MF categories, respectively. Under the CC category, the top three GO terms in both the frontal and parietal bone and vertebra tissues were extracellular exosome [GO: 0070062, p value = 7.4E-05 (frontal and parietal bone)/p value = 7.7E-05(vertebra)], endoplasmic reticulum membrane [GO: 0005789, p value = 0.00026 (frontal and parietal bone)/p value = 0.00027 (vertebra)], and mitochondrial membrane part [GO: 0044455, p value = 0.00044 (frontal and parietal bone)/p value = 0.00044 (vertebra)]. Within the BP category in both the frontal and parietal bone and vertebra tissues, translation [GO: 0006412, p value = 1.8E-13 [frontal and parietal bone)/p value = 1.9E-13(vertebra)], antigen processing and presentation [GO: 0019882, p value = 0.00026 (frontal and parietal bone)/p value = 0.00012 (vertebra)], and immune response [GO: 0006955, p value = 0.00478 (frontal and parietal bone)/p value = 0.00374 (vertebra)] were the top three GO terms. Protein serine/threonine kinase activity [GO: 0004674, p value = 0.00126 (frontal and parietal bone)/p value = 0.00128 (vertebra)], carbohydrate binding [GO: 0030246, p value = 0.00354 (frontal and parietal bone)/p value = 0.00356 (vertebra)], and monosaccharide binding [GO: 0048029, p value = 0.01207 (frontal and parietal bone)/p value = 0.01209 (vertebra)] were the top three GO terms of MF category in both the frontal and parietal bone and vertebra tissues. These common GO terms of DEGs from these two tissues might suggest the frontal and parietal bone, and vertebra tissues played roles in differential growth of bighead carp cooperatively.

FIGURE 4

Figure 4. Distribution of GO classifications of DEGs in bone tissues of bighead carp. (A) Distribution of GO classifications of DEGs in the frontal and parietal bones. (B) Distribution of GO classifications of DEGs in the vertebral bones. Blue, green, and orange colors represented molecular function, cellular component, and biological process categories, respectively.

Kyoto Encyclopedia of Genes and Genomes pathway–based analyses help to identify the biological pathways that are related to DEGs. Pathway enrichment analyses identified the enriched pathways from frontal and parietal bone and vertebra tissues, respectively (Figure 5). Among these pathways, metabolic pathways were the most frequently represented pathways in bone tissues of bighead carp with different growth rates, followed by organismal systems, environmental information processing, and genetic information processing pathways. These observations disclosed the vital implications of energy metabolism, cytoskeleton, and immune regulation in bone tissues between BG and SG of bighead carp, such as citrate cycle (TCA cycle) [ko00020, q = 1 (frontal and parietal bone)/q = 1(vertebra)], regulation of actin cytoskeleton [ko04810, q = 1 (vertebra)], and cell adhesion molecules (CAMs) [ko04514, q = 0.00761 (frontal and parietal bone)/q = 0.12807(vertebra)] pathways. These enriched pathways were not all significant because the number of DEGs was too few, which were considered as potential pathways associated with differential growth of bighead carp at early growth stage.

FIGURE 5

Figure 5. Pathway enrichment analysis of DEGs in bone tissues of bighead carp. (A) Enriched pathways from frontal and parietal bones between big and small groups of bighead carp. (B) Enriched pathways from vertebral bones between big and small groups of bighead carp.

Key DEGs Related to Differential Growth

In total, 15 key pathways associated with the early growth regulation of bighead carp were identified from frontal and parietal bone and vertebra tissues (Table 3), which played important roles in the physiological functions of energy metabolism, cytoskeleton, and immune regulation. Twenty key DEGs (Figure 6) were selected by three strategies simultaneously, which were considered to be related to the differential growth of bighead carp: (i) 10 important DEGs that showed high participation frequencies in the 15 key pathways; (ii) another seven important DEGs that appeared more than two times in biological regulation, immune system process, and metabolic process of GO term enrichment analyses; (iii) three important DEGs that were commonly identified in frontal and parietal bone and vertebra tissues. Then, these 20 key DEGs were used to construct the correlation network by GeneMANIA (Figure 6).

TABLE 3

Table 3. Key pathways related to early growth regulation in bone tissues of bighead carp.

FIGURE 6

Figure 6. Key genes related to early growth regulation in bone tissues of bighead carp. (A) Key genes related to early growth regulation in the frontal and parietal bones. (B) Key genes related to early growth regulation in the vertebral bones. The FPKM data of genes was used for heatmap construction. (C) The correlation network of key genes. The round nodes filled with diagonal lines indicate key genes related to early growth regulation in bone tissues. The round nodes filled with pure color indicate genes that interact with key genes. The interactions among different genes were represented by different colorful lines. Orange lines indicate predicted interactions. Purple lines indicate co-expression interactions. Gene abbreviations: pleckstrin homology domain-containing family J member 1 (plekhj1), SURP and G-patch domain-containing protein 1 (sugp1), prolyl 4-hydroxylase subunit alpha-1 (p4ha1), novel MHCII β chain protein (novel mhc II), protein tyrosine phosphatase type IVA (ptp4a), β2-microglobulin (b2m), serine/threonine-protein kinase Sgk1 (sgk1), receptor-type tyrosine-protein phosphatase C (ptprc), uncharacterized protein LOC101883013 (LOC101883013), myc target protein 1(myct1), golgi apparatus protein 1 (glg1), sodium/potassium-transporting ATPase subunit β (atp1b), stearoyl-CoA desaturase (scd), dopamine β-hydroxylase (dbh), succinate dehydrogenase [ubiquinone] cytochrome b small subunit B (sdhdb), MHC class II β precursor (mhcII), actin filament–associated protein 1-like 2 (afap1l2), acyl-CoA–binding protein (acbp), ubiquitin conjugation factor E4 B (ube4b), zinc finger protein 574 (znf574).

Validation of DEGs by Quantitative Real-Time Polymerase Chain Reaction

Quantitative real-time polymerase chain reaction (qRT-PCR) was performed on 12 selected genes and the internal control gene β-actin (Supplementary Table 7), to validate the DEGs identified in bone tissues of bighead carp by RNA-seq. Fold changes from qRT-PCR were compared with the RNA-seq expression profiles (Figure 7). The expression patterns revealed by qRT-PCR analysis were similar to those revealed by RNA-seq for the same gene. Thus, RNA-seq could provide reliable data for mRNA differential expression analyses.

FIGURE 7

Figure 7. qRT-PCR validation of differently expressed genes in bone tissues of bighead carp from differential growth groups. (A) qRT-PCR validation of differently expressed genes of frontal and parietal bones between big and small groups of bighead carp. (B) qRT-PCR validation of differently expressed genes of vertebral bones between big and small groups of bighead carp. Gene expressions were expressed as mean normalized ratios (n = 3, ± SE). β-Actin was used as reference gene. Positive values denote up-regulation in big group compared to small group of bighead carp. Gene abbreviations: SURP and G-patch domain-containing protein 1 (sugp1), sodium/potassium-transporting ATPase subunit β (atp1b), receptor-type tyrosine-protein phosphatase C (ptprc), protein tyrosine phosphatase type IVA (ptp4a), stearoyl-CoA desaturase (scd), pleckstrin homology domain-containing family J member 1 (plekhj1), actin filament–associated protein 1-like 2 (afap1l2), microfibril-associated glycoprotein 4-like (msg4l), nuclear receptor co-activator 5 (ncoa5), golgi apparatus protein 1 (glg1), G-protein coupled receptor 126(gpr126), serine/threonine-protein kinase Sgk1(sgk1).

Discussion

Transcriptome reconstruction and annotation have significantly improved with the advent of new sequencing technologies. These techniques play an important role in gene discovery, genome annotation, and detection of genomic signatures, particularly in species without a reference genome (Li et al., 2017). In recent years, traditional short-read RNA-seq has been commonly used to investigate RNA expression patterns in several tissues such as that of the brain, liver, and muscle (Zhou et al., 2019; Cleveland et al., 2020). However, short-read RNA-seq has certain limitations in regard to precise reconstruction and reliable sequencing of isoforms due to the complexity of AS mechanisms in eukaryotes (Tilgner et al., 2013; Yi et al., 2018). In contrast, SMRT sequencing is a superior strategy, directly generating a comprehensive transcriptome with accurate sequencing of AS isoforms and novel genes, and its advantages have been extensively documented in previous studies. In this study, the full-length transcriptome of bighead carp was performed using the Iso-Seq technique.

Previously reported transcriptomic studies of the bighead carp were performed using Illumina sequencing platforms (Fu B. et al., 2019; Li et al., 2019). In comparison, our study produced a comprehensive transcriptome with several features. First, accurate full-length transcripts (63,872 transcripts) were generated, serving as a valuable resource for various gene structures and sequences, which can be directly used in gene-function studies without using additional gene clones. Second, a total of 20,907 lncRNAs were identified, which could be useful for investigations of potential lncRNA functions in the bighead carp. Third, the full-length transcripts could serve as a reference for genome assembly and gene annotation of bighead carp. Finally, reliable ORFs identified in this study are essential for the identification of orthologous genes, aiding in the better understanding of culture and breeding techniques used for bighead carp.

A total of 279 pathways were annotated from 25,955 non-redundant transcripts extracted from the bone transcriptome of the bighead carp, using the KEGG classification system. Notably, we focused on 25 KEGG pathways (Figure 2), which are potentially associated with regulation of early growth and are involved in physiological functions of the immune system, metabolism, and cytoskeleton of bighead carp. These KEGG pathways included the insulin signaling pathway, cytokine–cytokine receptor interaction, PPAR signaling pathway, TGF-β signaling pathway, Wnt signaling pathway, and regulation of actin cytoskeleton and were also the focus of previous reports on growth and bone development (Nie et al., 2019; Cleveland et al., 2020), suggesting that they play important roles in growth modulation and bone development of bighead carp at early growth stage.

To identify growth-related genes in the bighead carp, the DEGs in the big and the SGs were identified and functionally analyzed. In total, 15 key enriched pathways (Table 3) and 20 key DEGs (Figure 6), of the frontal and parietal bones and the vertebra, were screened and found to be mainly involved in the physiological functions of metabolism, cytoskeleton, and immunization. Arginine and proline metabolism, biosynthesis of unsaturated fatty acids, PPAR signaling pathway, citrate cycle, oxidative phosphorylation pathway, and carbon and fatty acid–metabolizing pathways are responsible for metabolizing amino acids, lipids, and carbohydrates, producing energy for vital functions of an organism (Jiao et al., 2020). Most of these metabolic pathways are also found in reports of growth-related transcriptomes in fish, such as the PPAR signaling pathway (Lu et al., 2019), carbon-metabolizing pathways (Lu et al., 2020), and oxidative phosphorylation pathway (Fu J. et al., 2019). The eight other aforementioned pathways also play an important role in growth, by regulating the actin cytoskeleton, cell adhesion molecules, and FoxO signaling pathway (Pang et al., 2018; Lu et al., 2019).

Previous studies have shown that energy metabolism can influence the growth of an organism (Lu et al., 2019). In this study, DEGs, potentially influencing amino acid, lipid, and carbohydrate metabolism of bighead carp, were identified using transcriptome analysis. For instance, acyl-CoA–binding protein (acbp), SURP, G-patch domain-containing protein 1 (sugp1), and stearoyl-CoA desaturase (scd) are important lipogenic enzymes, and a change in their protein activities can alter the rates of biosynthesis of fatty acids (Igal, 2016; Kim et al., 2016; Zhou et al., 2020). Succinate dehydrogenase [ubiquinone] cytochrome b small subunit B (sdhdb) plays an important role in lipid transport and metabolism, as well as in carbon metabolism (Oyedotun and Lemire, 1999). Prolyl 4-hydroxylase subunit alpha-1 (p4ha1) and ubiquitination factor E4B (ube4b) are involved in protein synthesis and degradation (Chakravarthi et al., 2014; Lu et al., 2019). Moreover, several immune-related DEGs were identified in the bone tissue transcriptome of bighead carp, such as β2-microglobulin (b2m), MHC class II β precursor, receptor-type tyrosine-protein phosphatase C (ptprc), and dopamine β-hydroxylase (dbh) (Kountikov et al., 2004; Das et al., 2011; Cheng et al., 2017). It has been previously shown that these immune system-related genes are essential for maintaining normal growth and physiological functions (Cleveland et al., 2020). In addition to the genes of energy metabolism and immune function, cytoskeletal genes have been identified as candidate growth-related genes in fish (Salem et al., 2012). In this study, we found a differential expression of cytoskeletal genes such as myc target protein 1 homolog (myct1), Golgi apparatus protein 1 (glg1), zinc finger protein 574 (znf574), and actin filament–associated protein 1-like 2 (afap1l2) in the two groups with different growth rates (Mourelatos et al., 1995; Berg et al., 2010; Tie et al., 2016; Wu et al., 2016), suggesting that they play important roles in cell growth, proliferation, apoptosis, and transformation. The protein tyrosine phosphatase 4A (PTP4A) family, consisting of PTP4A1/PRL1, PTP4A2/PRL2, and PTP4A3/PRL3, has been implicated in cell proliferation and tumorigenesis (Kobayashi et al., 2014). Ptp4a is a critical promoter of TGF-β signaling pathway in primary dermal fibroblasts (Sacchetti et al., 2017) and might also regulate the growth of the bighead carp. Serine/threonine-protein kinase (Sgk1) is a serum glucocorticoid kinase that is involved in the regulation of fat storage, body size, and development in Caenorhabditis elegans (Jones et al., 2009). Sgk1 has also been identified as a growth-related gene in the Arctic charr (Salvelinus alpinus), at a specific developmental stage, and is correlated to the size of the organism (Beck et al., 2019). Although reports regarding DEGs pleckstrin homology domain-containing family J member 1 (plekhj1) and uncharacterized protein (LOC101883013) are rare, we consider them as novel candidate growth-related genes that need further functional investigation and verification.

Among the aforementioned DEGs (Figure 6), several DEGs have been identified as candidate genes regulating growth in previous transcriptomic and genomic studies, including scd, ube4b, atp1b, myct1, b2m, and mhcII (Xu et al., 2013; Wu et al., 2016; Lu et al., 2019; Zhou et al., 2019). These reports provide additional support to our findings of these genes being significantly involved in the modulation of differential growth in different domestic animals. However, compared with other transcriptomes of bone tissues in fish, certain well-known pathways and genes involved in bone formation and differentiation have not been identified in DEG analyses of frontal and parietal bones and vertebral bones in bighead carp, such as calcium, MAPK, TGF-β, and osteoclast pathways (Lu et al., 2019; Nie et al., 2019). For our study, this may be because the bighead carp used were 6 months old, and bone formation and differentiation might have already been completed. Bone formation and differentiation in fish usually occur during early growth and development, approximately 30 days post-fertilization or even earlier, such as in Epinephelus lanceolatus (Lv et al., 2019), M. amblycephala (Nie et al., 2019), and Cynoglossus semilaevis (Ma et al., 2019). Although well-known pathways and genes involved in bone formation and differentiation were not identified in this study, we found several growth-related DEGs that have been previously reported to regulate bone development, such as p4ha1 (Zou et al., 2017) and sgk1 (Beck et al., 2019), suggesting that differential growth of bighead carp is associated with bone development, to an extent.

Materials and Methods

Ethics Statement

All experimental protocols involved in fish in this study were conducted in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the Institute of Hydrobiology, the Chinese Academy of Sciences, China. All efforts were made to minimize suffering of the fish.

Fish Sample Collection

Fish samples of bighead carp at early growth stage used in this study originated from one full-sib family, which were cultured at one pond of the Zhangdu Lake Fish Farm (Wuhan, China). In November 2018, BL, HL, HH, HW, and BW of 6-month samples were measured after being anesthetized with MS-222 (tricaine methanesulfonate; Sigma). The ratios of HL and BL (HL/BL) were calculated. Three samples with larger BW were clustered in fast-growing group (BG). Another three samples with smaller BW were clustered in slow-growing group (SG). The frontal and parietal bones were separated from the skull, removed skins, washed with pure water, cut into pieces, mixed together, and then immediately placed in liquid nitrogen and stored at −80°C refrigerator before total RNA extraction. The vertebrae were separated from the body, removed muscles, also washed with pure water, cut into pieces, mixed, and then immediately placed in liquid nitrogen and stored at −80°C refrigerator before total RNA extraction. The layout plan of the study design is shown in Figure 8.

FIGURE 8

Figure 8. The layout plan of the study design. BG is the abbreviation of big bighead carp, and SG means small bighead carp.

RNA Extraction and Quality Evaluation

Total RNA was extracted from each bone tissue (the frontal and parietal bones or vertebral bones) using RNAiso reagent (Takara, Tokyo, Japan). To prevent genomic DNA contamination, RNA samples were treated to digest DNA using RNase-free DNase I during extraction of total RNA. RNA degradation and contamination were verified by ethidium bromide staining of 28s and 18s ribosomal RNA on a 1% agarose gel. RNA purity and concentration were checked using a Nanodrop 2000 spectrophotometer (Thermo Scientific). RNA integrity was assessed using an Agilent RNA 6000 Nano reagents part I Kit in an Agilent 2100 Bioanalyzer System (Agilent Technologies, Santa Clara, CA, United States). The RNA quality criteria for the RNA samples were RIN ≥ 8.0 (RNA Integrity Number), and 1.8 < OD260/280 < 2.2. The qualified RNAs were used for further PacBio and Illumina library construction, respectively.

PacBio Library Construction and Sequencing

To construct the library for PacBio sequencing, the qualified RNAs of bone tissues from BG and SG, including the frontal and parietal bone and vertebra tissues, were mixed in equal amounts, respectively. The mixed RNA sample from BG or SG was reverse-transcribed for mRNA using the SMARTer^TM PCR cDNA Synthesis Kit. PCR amplification was performed using the KAPA HiFi HotStart PCR Kit. Then, the PCR product for the SMRTbell library was constructed using the SMRTbell template pre kit. The concentration of the SMRTbell library was measured using a Qubit 3.0 fluorometer with a Qubit^TM 1× dsDNA HS Assay kit (Invitrogen, Carlsbad, CA, United States). The quantified criteria of library quality were concentration >10 ng/μL with dispersive but continuous distribution in the range of 1–6 k bp. A total of 2.5 ng of the library was sequenced for each SMRT cell using the binding kit 2.1 from the PacBio Sequel platform, producing 20 h of movies. In the sequencing process, two SMRT cells were prepared on the PacBio RSII platform, including one SMRT cell from mixed bone tissues in BG and the other SMRT cell from bone tissues in SG.

Illumina RNA-Seq and de novo Assembly

The Illumina library for each tissue sample was constructed using the TruSeq RNA Sample Prep Kit (Illumina) following the manufacturer’s instructions. Briefly, the polyA mRNA was fragmented using divalent cations at elevated temperature. The RNA fragments were reverse transcribed into first-strand cDNA using reverse transcriptase and random primers, followed by second-strand cDNA synthesis, end repair, and ligation of the adapters. The ligated fragments were purified and enriched through PCR to generate the final cDNA library. Finally, 12 transcriptomic libraries (six libraries from each group) were sequenced on Illumina HiSeq × Ten platform to obtain 150 bp pair-end reads. Raw RNA-seq reads in fastq format were first filtered through in-house perl scripts to filter out the low-quality reads. Reads with a Q30 percentage greater than 85% were retained as high-quality reads; the rest of the reads as low-quality reads were filtered out. Then clean reads were obtained by removing reads containing sequencing adapters, ploy-N, and low quality. The clean paired-end reads from each library were merged together and then de novo assembled by using Trinity 2.8.4 software with the default parameters. The clean short reads were then mapped to the PacBio reference sequence using Tophat2 tools.

PacBio Iso-Seq Data Processing and Error Correction

According to PacBio’s protocol, the raw polymerase reads were first processed using SMRTlink 5.0 software. Briefly, after removing the SMRTbell^TM adapter and the low-quality data, postfilter polymerase reads were obtained. The CCS was generated from the subreads BAM files, also known as the reads of insert (ROIs). All the ROIs whose the number of full passes was >1 were further classified into full-length (FL) and non–full-length (nFL) transcript sequences based on whether the 5′ primer, 3′ primer, and poly A tail could be simultaneously observed. We employed a three-step strategy for error correction to improve the accuracy of the full-length transcripts produced by the PacBio Iso-Seq platform. First, the circle sequencing with >1 pass provided an opportunity for ROI self-correction. Second, full-length, non-chimeric (FLNC) reads were subjected to non-redundant and clustering treatments by the ICE Quiver algorithm and to arrow polishing with the nFL sequence, producing high-quality and polished full-length consensus sequences. Finally, these polished consensus sequences were further subjected to correction and redundancy removal with Illumina short reads using the Proovread tool and the CD-HIT program with a–c 0.99 parameter cutoff, respectively. The above three corrections resulted in non-redundant, non-chimeric, full-length transcripts (isoform level) with high accuracy for subsequent analyses.

Functional Annotation

For comprehensive functional annotation, the transcripts were searched against seven databases using BLAST software (version 2.2.26): NCBI NR, COG, Pfam, KOG, Swiss-Prot (A manually annotated and reviewed protein sequence database), GO, and eggNOG using an e value of 1e^–5. The KEGG orthology results were obtained by comparing with KEGG database using KOBAS2.0 (Xie et al., 2011). After predicting the amino acid sequence of transcripts, the software HMMER (Eddy, 1998) was used to compare them with Pfam database to get the annotation information of transcripts.

Gene Family and Coding Sequence Prediction

Coding Genome Reconstruction Tool v1.3¹ used K-mer similarity profiles to partition full-length coding sequences into gene families, after which it reconstructed subreads containing the full coding region. To predict the ORFs in transcripts, we used the TransDecoder v2.0.1 program² to define putative coding sequences (CDSs). The predicted CDSs were searched and confirmed by BLASTX (E value < 1 × 10^–5) against the protein databases of NR, Swiss-Prot, and KEGG. Those transcripts containing complete ORFs were designated as full-length transcripts.

Prediction of LncRNA and AS

Transcripts with a length of more than 200 bp and having more than two exons were selected as lncRNA candidates. The lncRNAs were predicted by four computational approaches, including CNCI (v2), CPC (v1), CPAT (v1.2), and Pfam (v1.5) with default parameters. These approaches can effectively distinguish protein-coding and non-coding transcripts. Transcripts were removed that did not pass any of these analyses; the intersection of the four results was then selected as lncRNAs.

Based on the BLAST method (Altschul et al., 1997), all the transcripts were used for pairwise alignment. BLAST alignments were considered products of candidate AS events (Liu et al., 2017), which met three criteria simultaneously: (i) the length of two transcripts was both greater than 1,000 bp, and there were two high-scoring segment pairs in the alignment; (ii) the AS gap between two aligned transcripts was greater than 100 bp and at least 100 bp from the 3′ end and 5′ end; and (iii) a 5-bp overlap could be allowed.

Detection of TF and Microsatellite Markers

Transcription factors-related transcript sequences were predicted using the BLAST method with the AnimalTFDB database (Zhang et al., 2015).

Microsatellite markers (also known as SSRs) were identified using MISA³ with parameters as default. SSR detection was only conducted on non-redundant transcripts that were larger than 500 bp in size. The minimum repeat time for core repeat motifs was set as follows: 10 for mononucleotide, six for dinucleotides, and five for trinucleotides, tetranucleotides, pentanucleotides, and hexanucleotides. Based on the structural organization of the repeat motifs, SSRs were classified into perfect and complicated (compound or interrupted) SSRs.

Screening Differentially Expressed Unigenes and GO and KEGG Enrichment Analyses

The expression levels of all the unigenes in 12 samples were assayed based on the Illumina short reads dataset, and reference sequences were the full-length transcripts yielded from PacBio Iso-Seq. The transcripts were quantified using RSEM software. Relative gene expression levels of each unigene were determined by FPKM (fragments per kilobase of transcript per million mapped reads). DEGs of bone tissues (the frontal and parietal bones and vertebral bones) in BG and SG were screened using DESeq2 package. Supplementary Table 6 was used as an input file to run DESeq2. The FDR (<0.01) adjusted by Benjamini–Hochberg method was adopted for screening DEGs. DEGs were defined as by parameters of FDR < 0.01 and the absolute value of the log2 ratio ≥ 1. DEGs were also employed for the enrichment analyses of GO and KEGG pathways in order to determine the potential functions and metabolic pathways.

Validation of Differentially Expressed Unigenes by qRT-PCR

In order to examine the reliability of the RNA-seq results, 12 DEGs randomly selected and the internal control gene β-actin were used for validation by qRT-PCR. Total RNA from 12 samples (each of the frontal and parietal bones and vertebrae from BG and SG) was extracted individually using TRIzol Reagent (Invitrogen) according to the manufacturer’s instruction. The cDNA was synthesized from 1 μ g of total RNA for each sample using PrimeScript^TM RT reagent Kit (TaKaRa, Dalian, China). The selected DEGs and specific primer sequences used for qRT-PCR are listed in Supplementary Table 7, qRT-PCR was performed on a StepOne^TM Real-Time PCR System (Applied Biosystems, Foster City, CA, United States). The qRT-PCR reaction solution consisted of 6.5 μL Power SYBR Green PCR Master Mix (Applied Biosystems), 0.2 μM of each forward and reverse primer, 1.2 μL diluted cDNA, and 4.5 μL sterile distilled water. The PCR reaction condition was performed at 95°C for 10 min, followed by 40 cycles of 95°C for 15 s, 58°C for 30 s, and 72°C for 45 s. RNA samples in bone tissues of bighead carp from big and SGs were run in three times’ biological replicates and three technical replicates for qRT-PCR. The expression level of each DEG was normalized to that of the reference gene β-actin by using the 2^–ΔΔCT value method (Livak and Schmittgen, 2001) to validate the results of RNA-seq.

Conclusion

In summary, we obtained full-length transcriptome sequences of the frontal and parietal bones and vertebral bones by PacBio Iso-Seq in bighead carp at early growth stage known to exhibit different growth rates. Coupling the RNA-seq data with the Iso-Seq results of the big and the SGs, 27 and 45 DEGs were identified from skull bones and vertebral bones, respectively. A total of 15 pathways and 20 DEGs potentially regulate differential growth in bighead carp and were found to be mainly involved in physiological functions of energy metabolism, immune function, and cytoskeleton function, such as arginine and proline metabolism (p4ha1), fatty acid metabolism (scd), oxidative phosphorylation (sdhdb), FoxO signaling (sgk1), and cell adhesion molecules (b2m, mhcII, ptprc, and glg1). Our study represents the first step in establishing a full-length transcriptome resource of the head and vertebral bones in bighead carp at early growth stage and sheds light on the genetic association between growth and bone development.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, BioProject ID PRJNA661719.

Ethics Statement

The animal study was reviewed and approved by the Guide for the Care and Use of Laboratory Animals of the Institute of Hydrobiology, the Chinese Academy of Sciences, China.

Author Contributions

JT conceived and designed the experiments and modified the manuscript. WL, YZ, JW, and XY performed the experiments. WL analyzed the data and wrote the manuscript. All authors read and approved the final manuscript.

Funding

This research was supported by the National Key Research and Development Program (No. 2018YFD009102), Special Fund for Strategic Pilot Technology of Chinese Academy of Sciences (XDA24030505), and State Key Laboratory of Freshwater Ecology and Biotechnology (2019FBZ04).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors thank Wuhan Zhangdu Lake Fish Farm for their generous supports for providing hatching and fish culture facilities. The authors also thank Mr. Q. S. Liu, Y. Z Feng for their assistance in sample preparation.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.603454/full#supplementary-material

Supplementary Figure 1 | KOG classification of non-redundant transcripts in bone tissues of bighead carp.

Supplementary Figure 2 | The length distribution of the complete coding protein sequences in bone tissues of bighead carp.

Supplementary Figure 3 | Type distribution of transcription factors in bone tissues of bighead carp.

Supplementary Table 1 | Summary statistics for RNA-seq information of the bighead carp frontal and parietal bone and vertebra transcriptomes.

Supplementary Table 2 | Putative gene families of non-redundant transcripts in bone tissues of bighead carp.

Supplementary Table 3 | Alternative splicing of non-redundant transcripts in bone tissues of bighead carp.

Supplementary Table 4 | Detailed information of transcription factors in bone tissues of bighead carp.

Supplementary Table 5 | Information of SSRs and their primers in bone tissues of bighead carp.

Supplementary Table 6 | An input file to run DESeq2 for screening DEGs of bighead carp.

Supplementary Table 7 | Primer sequences of the DEGs for qRT-PCR in bone tissues of bighead carp.

Footnotes

References

Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J. H., Zhang, Z., Miller, W., et al. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402. doi: 10.1093/nar/25.17.3389

PubMed Abstract | CrossRef Full Text | Google Scholar

Au, K. F., Sebastiano, V., Afshar, P. T., Durruthy, J. D., Lee, L., Williams, B. A., et al. (2013). Characterization of the human ESC transcriptome by hybrid sequencing. Proc. Natl. Acad. Sci. U.S.A. 110, E4821–E4830. doi: 10.1073/pnas.1320101110

PubMed Abstract | CrossRef Full Text | Google Scholar

Beck, S. V., Rasanen, K., Ahi, E. P., Kristjansson, B. K., Skulason, S., Jonsson, Z. O., et al. (2019). Gene expression in the phenotypically plastic Arctic charr (Salvelinus alpinus): a focus on growth and ossification at early stages of development. Evol. Dev. 21, 16–30. doi: 10.1111/ede.12275

PubMed Abstract | CrossRef Full Text | Google Scholar

Berg, M., Agesen, T. H., Thiis-Evensen, E., Merok, M. A., Teixeira, M. R., Vatn, M. H., et al. (2010). Distinct high-resolution genome profiles of early onset and late onset colorectal cancer integrated with gene expression data identify candidate susceptibility loci. Mol. Cancer 9:100. doi: 10.1186/1476-4598-9-100

PubMed Abstract | CrossRef Full Text | Google Scholar

Chaisson, M. J. P., Huddleston, J., Dennis, M. Y., Sudmant, P. H., Malig, M., Hormozdiari, F., et al. (2015). Resolving the complexity of the human genome using single-molecule sequencing. Nature 517, 608–U163. doi: 10.1038/nature13907

PubMed Abstract | CrossRef Full Text | Google Scholar

Chakravarthi, B. V. S. K., Pathi, S. S., Goswami, M. T., Cieslik, M., Zheng, H., Nallasivam, S., et al. (2014). The miR-124-prolyl hydroxylase P4HA1-MMP1 axis plays a critical role in prostate cancer progression. Oncotarget 5, 6654–6669. doi: 10.18632/oncotarget.2208

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, W., Ka, Y. W., and Chang, C. C. (2017). Involvement of dopamine beta-hydroxylase in the neuroendocrine-immune regulatory network of white shrimp, Litopenaeus vannamei. Fish Shellfish Immun. 68, 92–101. doi: 10.1016/j.fsi.2017.07.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Cleveland, B. M., Gao, G., and Leeds, T. D. (2020). Transcriptomic response to selective breeding for fast growth in rainbow trout (Oncorhynchus mykiss). Mar. Biotechnol. 22, 539–550. doi: 10.1007/s10126-020-09974-3

CrossRef Full Text | Google Scholar

Das, A., Sahoo, P. K., Mohanty, B. R., and Jena, J. K. (2011). Pathophysiology of experimental Aeromonas hydrophila infection in puntius sarana: early changes in blood and aspects of the innate immune-related gene expression in survivors. Vet. Immunol. Immunop. 142, 207–218. doi: 10.1016/j.vetimm.2011.05.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Eddy, S. R. (1998). Profile hidden Markov models. Bioinformatics 14, 755–763. doi: 10.1093/bioinformatics/14.9.755

PubMed Abstract | CrossRef Full Text | Google Scholar

Fu, B., Yu, X., Tong, J., Pang, M., Zhou, Y., Liu, Q., et al. (2019). Comparative transcriptomic analysis of hypothalamus-pituitary-liver axis in bighead carp (Hypophthalmichthys nobilis) with differential growth rate. BMC Genomics 20:328. doi: 10.1186/s12864-019-5691-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Fu, J., Zhu, W., Wang, L., Luo, M., Song, F., and Dong, Z. (2019). Dynamic transcriptome sequencing and analysis during early development in the bighead carp (Hypophthalmichthys nobilis). BMC Genomics 20:781. doi: 10.1186/s12864-019-6181-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Geng, X., Liu, S. K., Yao, J., Bao, L. S., Zhang, J. R., Li, C., et al. (2016). A genome-wide association study identifies multiple regions associated with head size in catfish. G3 Genes Genom. Genet. 6, 3389–3398. doi: 10.1534/g3.116.032201

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, G., Dan, C., Xiao, S., Guo, W., Huang, P., Xiong, Y., et al. (2018). Chromosomal-level assembly of yellow catfish genome using third-generation DNA sequencing and Hi-C analysis. Gigascience 7:giy120. doi: 10.1093/gigascience/giy120

PubMed Abstract | CrossRef Full Text | Google Scholar

Ibaraki, H., Wu, X. M., Uji, S., Yokoi, H., Sakai, Y., and Suzuki, T. (2015). Transcriptome analysis of vertebral bone in the flounder, Paralichthys olivaceus (Teleostei, Pleuronectiformes), using Illumina sequencing. Mar. Genom. 24, 269–276. doi: 10.1016/j.margen.2015.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Igal, R. A. (2016). Stearoyl CoA desaturase-1: new insights into a central regulator of cancer metabolism. BBA Mol. Cell Biol. L. 1861, 1865–1880. doi: 10.1016/j.bbalip.2016.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Jia, D., Wang, Y., Liu, Y., Hu, J., Guo, Y., Gao, L., et al. (2018). SMRT sequencing of full-length transcriptome of flea beetle Agasicles hygrophila (Selman and Vogt). Sci. Rep. 8:2197. doi: 10.1038/s41598-018-20181-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiao, S., Nie, M., Song, H., Xu, D., and You, F. (2020). Physiological responses to cold and starvation stresses in the liver of yellow drum (Nibea albiflora) revealed by LC-MS metabolomics. Sci. Total Environ. 715:136940. doi: 10.1016/j.scitotenv.2020.136940

PubMed Abstract | CrossRef Full Text | Google Scholar

Jones, K. T., Greer, E. R., Pearce, D., and Ashrafi, K. (2009). Rictor/TORC2 regulates Caenorhabditis elegans fat storage, body size, and development through sgk-1. PLoS Biol. 7:e1000060. doi: 10.1371/journal.pbio.1000060

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, M. J., Yu, C. Y., Theusch, E., Naidoo, D., Stevens, K., Kuang, Y. L., et al. (2016). SUGP1 is a novel regulator of cholesterol metabolism. Hum. Mol. Genet. 25, 3106–3116. doi: 10.1093/hmg/ddw151

PubMed Abstract | CrossRef Full Text | Google Scholar

Kobayashi, M., Bai, Y., Dong, Y., Yu, H., Chen, S., Gao, R., et al. (2014). PRL2/PTP4A2 phosphatase is important for hematopoietic stem cell self-renewal. Stem Cells 32, 1956–1967. doi: 10.1002/stem.1672

PubMed Abstract | CrossRef Full Text | Google Scholar

Korlach, J., Bjornson, K. P., Chaudhuri, B. P., Cicero, R. L., Flusberg, B. A., Gray, J. J., et al. (2010). “Real-time DNA sequencing from single polymerase molecules,” in Methods in Enzymology, Vol 472: Single Molecule Tools, Pt A: Fluorescence Based Approaches, ed. N. G. Walter, (Cambridge, MA: Academic Press), 431–455.

Google Scholar

Kountikov, E., Wilson, M., Miller, N., Clem, W., and Bengten, E. (2004). Organization and expression of thirteen alternatively spliced exons in catfish CD45 homologs. Dev. Comp. Immunol. 28, 1023–1035. doi: 10.1016/j.dci.2004.04.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, F. G., Chen, J., Jiang, X. Y., and Zou, S. M. (2015). Transcriptome analysis of blunt snout bream (Megalobrama amblycephala) reveals putative differential expression genes related to growth and hypoxia. PLoS One 10:e0142801. doi: 10.1371/journal.pone.0142801

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, G., Zhao, Y., Guo, S., Liu, B., Chen, Y., Sun, X., et al. (2019). Comparative analysis of spleen transcriptome detects differences in evolutionary adaptation of immune defense functions in bighead carp and silver carp. Fish Shellfish Immun. 84, 148–157. doi: 10.1016/j.fsi.2018.09.077

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Harata-Lee, Y., Denton, M. D., Feng, Q., Rathjen, J. R., Qu, Z., et al. (2017). Long read reference genome-free reconstruction of a full-length transcriptome from Astragalus membranaceus reveals transcript variants involved in bioactive compound biosynthesis. Cell Discov. 3:17031. doi: 10.1038/celldisc.2017.31

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Q., Li, Y., Song, J., Xu, H., Xu, J., Zhu, Y., et al. (2014). High-accuracy de novo assembly and SNP detection of chloroplast genomes using a SMRT circular consensus sequencing strategy. New Phytol. 204, 1041–1049. doi: 10.1111/nph.12966

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, X., Mei, W., Soltis, P. S., Soltis, D. E., and Barbazuk, W. B. (2017). Detecting alternatively spliced transcript isoforms from single-molecule long-read sequences without a reference genome. Mol. Ecol. Resour. 17, 1243–1256. doi: 10.1111/1755-0998.12670

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Z. J., and Cordes, J. F. (2004). DNA marker technologies and their applications in aquaculture genetics (vol 238, pg 1, 2004). Aquaculture 242, 735–736. doi: 10.1016/j.aquaculture.2004.08.022

CrossRef Full Text | Google Scholar

Livak, K. J., and Schmittgen, T. D. (2001). Analysis of relative gene expression data using real-time quantitative PCR and the 2-ΔΔCT method. Methods 25, 402–408. doi: 10.1006/meth.2001.1262

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, X., Chen, H. M., Qian, X. Q., and Gui, J. F. (2020). Transcriptome analysis of grass carp (Ctenopharyngodon idella) between fast- and slow-growing fish. Comp. Biochem. Phys. D 35, 100688–100688. doi: 10.1016/j.cbd.2020.100688

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, X., Wen, H., Li, Q., Wang, G., Li, P., Chen, J., et al. (2019). Comparative analysis of growth performance and liver transcriptome response of juvenile Ancherythroculter nigrocauda fed diets with different protein levels. Comp. Biochem. Phys. D 31:100592. doi: 10.1016/j.cbd.2019.05.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Lv, X., Wang, Y., Xiao, Z., Liu, Q., Xu, S., Zhai, J., et al. (2019). Skeletal ontogeny and deformity during the early fry culture process for Epinephelus lanceolatus. Aquaculture 508, 113–126. doi: 10.1016/j.aquaculture.2019.04.059

CrossRef Full Text | Google Scholar

Ma, Q., Liu, S. F., Wang, X. X., Xiang, Z. L., and Zhuang, Z. M. (2019). Skeletal development of the chondrocranium in the tongue sole Cynoglossus semilaevis (Pleuronectiformes: Cynoglossidae). J. Fish Biol. 94, 223–230. doi: 10.1111/jfb.13870

PubMed Abstract | CrossRef Full Text | Google Scholar

Mehjabin, R., Xiong, L., Huang, R., Yang, C., Chen, G., He, L., et al. (2019). Full-length transcriptome sequencing and the discovery of new transcripts in the unfertilized eggs of zebrafish (Danio rerio). G3 Genes Genom. Genet. 9, 1831–1838. doi: 10.1534/g3.119.200997

PubMed Abstract | CrossRef Full Text | Google Scholar

Mourelatos, Z., Gonatas, J. O., Nycum, L. M., Gonatas, N. K., and Biegel, J. A. (1995). Assignment of the glg1 gene for mg-160, a fibroblast growth-factor and e-selectin binding membrane sialoglycoprotein of the golgi-apparatus, to chromosome 16q22-q23 by fluorescence in-situ hybridization. Genomics 28, 354–355. doi: 10.1006/geno.1995.1156

PubMed Abstract | CrossRef Full Text | Google Scholar

Nie, C. H., Wan, S. M., Liu, Y. L., Liu, H., Wang, W. M., and Gao, Z. X. (2019). Development of teleost intermuscular bones undergoing intramembranous ossification based on histological-transcriptomic-proteomic data. Int. J. Mol. Sci. 20:4698. doi: 10.3390/ijms20194698

PubMed Abstract | CrossRef Full Text | Google Scholar

Oyedotun, K. S., and Lemire, B. D. (1999). The Saccharomyces cerevisiae succinate-ubiquinone oxidoreductase - Identification of Sdh3p amino acid residues involved in ubiquinone binding. J. Biol. Chem. 274, 23956–23962. doi: 10.1074/jbc.274.34.23956

PubMed Abstract | CrossRef Full Text | Google Scholar

Pang, M., Luo, W., Fu, B., Yu, X., Zhou, Y., and Tong, J. (2018). Transcriptomic profiles of brain provide insights into molecular mechanism of feed conversion efficiency in crucian carp (Carassius auratus). Int. J. Mol. Sci. 19:858. doi: 10.3390/ijms19030858

PubMed Abstract | CrossRef Full Text | Google Scholar

Quarto, N., Behr, B., Li, S., and Longaker, M. T. (2009). Differential FGF ligands and FGF receptors expression pattern in frontal and parietal calvarial bones. Cells Tissues Organs 190, 158–169. doi: 10.1159/000202789

PubMed Abstract | CrossRef Full Text | Google Scholar

Rutten, M. J. M., Bovenhuis, H., and Komen, H. (2005). Genetic parameters for fillet traits and body measurements in Nile tilapia (Oreochromis niloticus L.). Aquaculture 246, 125–132. doi: 10.1016/j.aquaculture.2005.01.006

CrossRef Full Text | Google Scholar

Sacchetti, C., Bai, Y., Stanford, S. M., Di Benedetto, P., Cipriani, P., Santelli, E., et al. (2017). PTP4A1 promotes TGF beta signaling and fibrosis in systemic sclerosis. Nat. Commun. 8:1060. doi: 10.1038/s41467-017-01168-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Salem, M., Vallejo, R. L., Leeds, T. D., Palti, Y., Liu, S., Sabbagh, A., et al. (2012). RNA-seq identifies SNP markers for growth traits in rainbow trout. PLoS One 7:e0036264. doi: 10.1371/journal.pone.0036264

PubMed Abstract | CrossRef Full Text | Google Scholar

Steijger, T., Abril, J. F., Engstrom, P. G., Kokocinski, F., Hubbard, T. J., Guigo, R., et al. (2013). Assessment of transcript reconstruction methods for RNA-seq. Nat. Methods 10, 1177–1184. doi: 10.1038/NMETH.2714

PubMed Abstract | CrossRef Full Text | Google Scholar

Szabo, T., Urbanyi, B., Muller, T., Szabo, R., and Horvath, L. (2019). Assessment of induced breeding of major Chinese carps at a large-scale hatchery in Hungary. Aquac. Rep. 14:100193. doi: 10.1016/j.aqrep.2019.100193

CrossRef Full Text | Google Scholar

Tie, S. R., McCarthy, D. J., Kendrick, T. S., Louw, A., Le, C., Satiaputra, J., et al. (2016). Regulation of sarcoma cell migration, invasion and invadopodia formation by AFAP1L1 through a phosphotyrosine-dependent pathway. Oncogene 35, 2098–2111. doi: 10.1038/onc.2015.272

PubMed Abstract | CrossRef Full Text | Google Scholar

Tilgner, H., Grubert, F., Sharon, D., and Snyder, M. P. (2014). Defining a personal, allele-specific, and single-molecule long-read transcriptome. Proc. Natl. Acad. Sci. U.S.A. 111, 9869–9874. doi: 10.1073/pnas.1400447111

PubMed Abstract | CrossRef Full Text | Google Scholar

Tilgner, H., Raha, D., Habegger, L., Mohiuddin, M., Gerstein, M., and Snyder, M. (2013). Accurate identification and analysis of human mRNA isoforms using deep long read sequencing. G3 Genes Genom. Genet. 3, 387–397. doi: 10.1534/g3.112.004812

PubMed Abstract | CrossRef Full Text | Google Scholar

Topczewska, J. M., Shoela, R. A., Tomaszewski, J. P., Mirmira, R. B., and Gosain, A. K. (2016). The morphogenesis of cranial sutures in Zebrafish. PLoS One 11:e0165775. doi: 10.1371/journal.pone.0165775

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, B., Tseng, E., Regulski, M., Clark, T. A., Hon, T., Jiao, Y., et al. (2016). Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing. Nat. Commun. 7:11708. doi: 10.1038/ncomms11708

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, S., Gui, J., Yin, X., Pan, Q., Liu, X., and Chu, L. (2016). Transmembrane domain is crucial to the subcellular localization and function of Myc target 1. J. Cell. Mol. Med. 20, 471–481. doi: 10.1111/jcmm.12747

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, C., Mao, X., Huang, J., Ding, Y., Wu, J., Dong, S., et al. (2011). KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 39, W316–W322. doi: 10.1093/nar/gkr483

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, Q., Feng, C. Y., Hori, T. S., Plouffe, D. A., Buchanan, J. T., and Rise, M. L. (2013). Family-specific differences in growth rate and hepatic gene expression in juvenile triploid growth hormone (GH) transgenic Atlantic salmon (Salmo salar). Comp. Biochem. Phys. D 8, 317–333. doi: 10.1016/j.cbd.2013.09.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Yi, S., Zhou, X., Li, J., Zhang, M., and Luo, S. (2018). Full-length transcriptome of Misgurnus anguillicaudatus provides insights into evolution of genus Misgurnus. Sci. Rep. 8:11699. doi: 10.1038/s41598-018-29991-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Yue, G. H. (2014). Recent advances of genome mapping and marker-assisted selection in aquaculture. Fish Fish. 15, 376–396. doi: 10.1111/faf.12020

CrossRef Full Text | Google Scholar

Zhang, H. M., Liu, T., Liu, C. J., Song, S., Zhang, X., Liu, W., et al. (2015). AnimalTFDB 2.0: a resource for expression, prediction and functional study of animal transcription factors. Nucleic Acids Res. 43, D76–D81. doi: 10.1093/nar/gku887

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, B., Fu, Y., Zhang, H., Wang, X., Jin, G., Xu, J., et al. (2020). Functional characterization of acyl-CoA binding protein in Neospora caninum. Parasite. Vector. 13:85. doi: 10.1186/s13071-020-3967-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Y., Luo, W., Yu, X., Liu, Q., and Tong, J. (2019). Brain and intestine transcriptome analyses and identification of genes involved in feed conversion efficiency of Yellow River carp (Cyprinus carpio haematopterus). Comp. Biochem. Phys. D 29, 221–227. doi: 10.1016/j.cbd.2018.12.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Zou, Y., Donkervoort, S., Salo, A. M., Foley, A. R., Barnes, A. M., Hu, Y., et al. (2017). P4HA1 mutations cause a unique congenital disorder of connective tissue involving tendon, bone, muscle and the eye. Hum. Mol. Genet. 26, 2207–2217. doi: 10.1093/hmg/ddx110

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: bighead carp, differential growth, Iso-Seq, full-length transcript, differentially expressed unigene

Citation: Luo W, Zhou Y, Wang J, Yu X and Tong J (2021) Identifying Candidate Genes Involved in the Regulation of Early Growth Using Full-Length Transcriptome and RNA-Seq Analyses of Frontal and Parietal Bones and Vertebral Bones in Bighead Carp (Hypophthalmichthys nobilis). Front. Genet. 11:603454. doi: 10.3389/fgene.2020.603454

Received: 07 September 2020; Accepted: 14 December 2020;
Published: 15 January 2021.

Edited by:

Peng Xu, Xiamen University, China

Reviewed by:

Chuanju Dong, Henan Normal University, China
You-Yi Kuang, Heilongjiang River Fisheries Research Institute, Chinese Academy of Fishery Sciences, China
Zexia Gao, Huazhong Agricultural University, China

Copyright © 2021 Luo, Zhou, Wang, Yu and Tong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jingou Tong, amd0b25nQGloYi5hYy5jbg==; Xiaomu Yu, eG15dUBpaGIuYWMuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.