ORIGINAL RESEARCH article
Identification and Functional Prediction of Long Intergenic Non-coding RNAs Related to Subcutaneous Adipose Development in Pigs
- 1Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of the Ministry of Education and Key Laboratory of Swine Genetics and Breeding of the Ministry of Agriculture, Huazhong Agricultural University, Wuhan, China
- 2The Cooperative Innovation Center for Sustainable Pig Production, Wuhan, China
An increasing number of studies have shown that long intergenic non-coding RNAs (lincRNAs) are a very important class of non-coding RNAs that plays a vital role in many biological processes. Adipose tissue is an important place for storing energy, but few studies on lincRNAs were related to pig subcutaneous fat development. Here, we used published RNA-seq data from subcutaneous adipose tissue of Italian Large White pigs and identified 252 putative lincRNAs, wherein 34 were unannotated. These lincRNAs had relatively shorter length, lower number of exons, and lower expression level compared with protein-coding transcripts. Gene ontology and pathway analysis indicated that the adjacent genes of lincRNAs were involved in lipid metabolism. In addition, differentially expressed lincRNAs (DELs) between low and high backfat thickness pigs were identified. Through the detection of quantitative trait locus (QTL), DELs were mainly located in QTLs related to adipose development. Based on the expression correlation of DEL genes and their differentially expressed potential target genes, we constructed a co-expression network and a potential pathway of DEL’s effect on lipid metabolism. Our study identified and analyzed lincRNAs in subcutaneous adipose tissue, and results suggested that lincRNAs may be involved in the regulation of subcutaneous fat development. Our findings provided new insights into the biological function of porcine lincRNAs.
Pigs are very similar to humans in terms of metabolic characteristics and organ development, so pigs are an ideal animal model for human disease research (Lunney, 2007; Schook et al., 2015). For example, atherosclerosis and diabetes are examples of the impact of pigs as biomedical model (Lunney, 2007). These diseases have become the focus of research because of the growing problem of obesity (Lunney, 2007). To delay disease progression and prevent disease occurrence, pigs are being extensively studied as animal models of these diseases to determine disease pathologies, such as the role of genetic background (Lunney, 2007). Meanwhile, pigs provide essential nutrients to humans, such as protein. With the improvement of human living standards, the demand for lean meat rate of pigs is also increasing. The 10th rib BFT and days to 100 kg are the economically important traits in pigs and are commonly used to predict carcass lean meat content and growth rate in pig breeding programs, respectively (Guo et al., 2017; Jiang et al., 2018). Furthermore, a previous study indicated that a low amount of subcutaneous fat or backfat deposition means good growth performance (Zambonelli et al., 2016).
LincRNAs are a class of non-coding RNAs (ncRNAs) that are transcribed between protein-coding gene regions and are more than 200 nucleotides (nt) in length (Han and Chen, 2013). In recent years, due to the development of sequencing technology, a large number of lincRNAs have been identified in many plants and animals (Guttman et al., 2010; Pauli et al., 2012; Lv et al., 2014; Hao et al., 2015; Zhou et al., 2015; Wang J. et al., 2016). Some lincRNAs had been identified in humans and mice, but most lincRNAs are still unknown (Chen X. et al., 2018). Moreover, studies have shown that lincRNAs play important roles in some biological processes, such as mouse embryonic development (Lv et al., 2013; Zhang et al., 2014), immune response (Carpenter et al., 2013a,b; Heward and Lindsay, 2014), human fat deposition (Liu et al., 2014; Ding et al., 2018; Gao et al., 2018), and human and pig muscle growth and development (Zhao et al., 2015; Gao et al., 2017; Lim et al., 2018).
In animals, adipose tissues play an important role and is the main energy storage site (Wang S. et al., 2016). Excessive accumulation of human adipose tissue can lead to obesity, further causing some chronic diseases, such as hypertension, diabetes, and cancer (Hanson et al., 2014; Rao et al., 2014). Studying the mechanism of adipose development cannot only improve the breeding efficiency of pigs but also help overcome obesity and related diseases. Previous studies found that some protein-coding genes, such as CTRP6 (Wu et al., 2017), NR1H3 (Zhang et al., 2016), ubiquitin D (Zhao et al., 2018), and MAT2B (Zhao et al., 2016), play an important role in regulating pig fat deposition. However, few studies are available on the mechanism of action of lincRNAs in pig fat deposition, and most functions of the lincRNAs in pig subcutaneous fat development are still unknown.
In this study, we used the published RNA-seq data from a previous study to assemble the transcriptome of subcutaneous adipose tissue in ILW pigs (Zambonelli et al., 2016). Zambonelli et al. (2016) measured the EBV in millimeter by using the BFT by the best linear unbiased prediction multiple-trait animal model program (Henderson and Quaas, 1976). The fixed effects of batch in test, gender, weight at slaughter, inbreeding coefficient, and the random effects of animal were all included in the model (Zambonelli et al., 2016). EBV was used for animal breeding selection and increased genetic gain in breeding programs (Kasinathan et al., 2015; Shin et al., 2017). ILW pigs were divided into two groups, namely, fat and lean samples, according to the difference of BFT EBV (Zambonelli et al., 2016). Based on the expression correlation of DEL genes and its neighboring protein-coding genes or DEPTGs, we investigated the role of lincRNAs in subcutaneous fat deposition of pigs. In summary, our study suggested that lincRNA plays an important role in pig subcutaneous fat development. This work enriches our knowledge on lincRNAs in pig and provides a valuable resource for future genetic and genomic studies.
Materials and Methods
Ethics Statement and the Datasets Source
All experiments in our study were performed according to the guidelines of the Key Lab of Agriculture Animal Genetics, Breeding, and Reproduction of Ministry of Education, Animal Care and Use Committee, Wuhan, China. In this study, the formula and amount of the ration were similar in animals used for RNA-seq (Zambonelli et al., 2016). Samples were taken from the subcutaneous adipose tissue of ILW pigs at an average age of 8 months (Zambonelli et al., 2016). 20 RNA-seq datasets were downloaded from the NCBI GEO databases with the accession numbers provided by Zambonelli et al. (2016) (Table 1, GEO accession GSE68007). All RNA-seq datasets were divided into two groups with 10 replicates in each group according to the difference of BFT EBV (Zambonelli et al., 2016).
Publicly Available Annotations
The pig gene annotations used in this article were downloaded from the Ensembl database at http://ftp.ensemblorg.ebi.ac.uk/pub/release-93/gtf/sus_scrofa/. Referring to previous studies, and combining ALDB1 with NONCODE database2, we obtained pig lincRNA annotations (Li et al., 2015; Tang et al., 2017; Zou et al., 2017a,b). In addition, the non-redundant reference sequence database was downloaded from https://ftp.ncbi.nih.gov/blast/db/.
RNA-Seq Reads Mapping and Transcriptome Assembly
All RNA-seq reads were mapped to the pig reference genome (Sus scrofa 11.13) using the default parameters of the HISAT2 version 2.0.1 (Pertea et al., 2016; Keel and Snelling, 2018; Kim et al., 2018). Then, we set the “-G” option of StringTie version 1.2.2 for transcript assembly, and obtained 20 samples of the GTF files respectively (Pertea et al., 2015, 2016). Afterward, the 20 GTF files were merged into a non-redundant transcriptome using the merge tool in the StringTie package (Pertea et al., 2016). The putative lincRNAs were then obtained by filtering the unique transcriptome from the lincRNA detection pipelines (Zou et al., 2017b).
LincRNAs Identification Pipeline
Referring to our laboratory’s previous research methods (Zou et al., 2017b), the unique transcriptome assembled by the merging tool was gradually screened to obtain candidate lincRNAs. The method is as follows: (1) Retained those transcripts with ‘u’ category categorized by using gffcompare, which indicated intergenic transcripts. (2) The transcripts with length ≥ 200 bp and exon number ≥ 2 were retained. Because putative long non-coding RNAs (lncRNAs) were defined as transcripts that are more than 200 bp (Pauli et al., 2012; Ulitsky and Bartel, 2013). (3) Calculated the coding potential of transcripts by the CPC tool (Kong et al., 2007), retaining the transcripts that have a CPC score less than 0 which means that there was no coding potential. (4) Filtered out the transcripts which containing any known protein coding domain. Firstly, the sequences of transcript were translated into six possible protein sequences by using Transeq4. Then, the corresponding transcripts which had a significant hit in the Pfam database5 were discarded using HMMER (E-value < 1e-5) (Prakash et al., 2017). (5) BLASTX program (Mount, 2007) was used to filter out any transcripts that have similarities to known proteins in the NCBI NR and UniRef90 databases (E-value < 1e-5). (6) The FPKM values for the 20 samples were estimated using the “-B” option of StringTie version 1.2.2 (Pertea et al., 2016). Transcripts with FPKM values greater than 0 in at least one sample were retained.
Comparisons Between LincRNAs and Protein-Coding Transcripts
Detailed information on 45,788 protein-coding transcripts were extracted from the pig reference genome annotation file (Sus scrofa 11.13). Detailed information on 252 putative lincRNAs were extracted from the unique transcriptome file. Then, we compared the transcript length, exon length and exon number between lincRNAs and protein-coding transcripts.
Analysis of Differentially Expressed Genes and DEPTGs
We used the htseq-count tool to count how many aligned reads of each gene overlap with their exons (Anders et al., 2015). Then, through the DESeq2 tool (a count-based technique) in the R packages (version 3.4.3) to perform differential expression analysis of gene-level between fat and lean group using these counts (Love et al., 2014). The gene with an FDR-adjusted P-value less than 0.05 will be considered as a differentially expressed gene between the two groups (Benjamini et al., 2001). If the expression level between each pair of lincRNA and the protein-coding gene was significantly correlated (P-value < 0.05), we regarded a protein-coding gene as a PTG of lincRNA. Meanwhile, if the potential target protein-coding gene was differentially expressed, we regarded it as a DEPTG of lincRNA.
Gene Ontology (GO) and Pathway Analysis of Adjacent Genes of LincRNAs
The position information of lincRNAs were extracted from the unique transcriptome file. Then, the neighboring genes (<100 kb) of lincRNAs were obtained by BEDTools version 2.17.0 (Quinlan and Hall, 2010). Due to the limited annotation of the pig genome, we need to use Ensembl to convert the genes into human homolog genes (Kersey et al., 2014), and then used DAVID to perform GO enrichment and KEGG pathway analysis (Huang da et al., 2009). The P-value less than 0.05 was considered statistically significant.
QTL Mapping Analysis of DELs
Firstly, the position information of DELs were obtained from the unique transcriptome file. Then, we downloaded the pig QTL database6 from the animal QTL database. After that, we used BEDTools version 2.17.0 tool for QTL mapping of DELs (Quinlan and Hall, 2010).
Correlation Analysis of DEL Genes and Their Adjacent Genes
The position information of the DEL genes were obtained from the unique transcriptome file. Then, we identified the neighboring protein-coding genes (<100 kb) of each DEL gene using the BEDTools version 2.17.0 tool (Quinlan and Hall, 2010). The Pearson correlation coefficient were calculated based on the expression level of each pair of DEL gene and protein-coding gene (Salleh et al., 2015).
Correlation Validation Between LincRNA Genes and Their PTGs Through the Real-Time Quantitative Polymerase Chain Reaction (RT-qPCR)
We used 11 large white pigs with the BFT ranging from 18 to 22 mm for RT-qPCR verification. Total RNA was extracted from adipose tissue by using Trizol reagent (Invitrogen, Life Technologies, Foster City, CA, United States) from the PSCs according to the manufacturer’s instructions. The purity and concentration of total RNA were measured by a micro-spectrophotometer (Thermo, NanoDrop 2000, United States) at 260 and 280 nm. Ratios of absorption (260/280 nm) of all samples were between 1.8 and 2.0. cDNA synthesis for lincRNA genes and PTGs detection was performed using RevertAid First Strand cDNA Synthesis Kit (Thermo, Wuhan, Cat#k1622). We used approximately 1 μg of total RNA for cDNA synthesis. RT-qPCR for lincRNA genes and PTGs detection was performed with SYBR Green (CWBIO, Beijing, China, CW0957) in Roche LightCyler 480 system (Roche, Mannheinm, Germany) according to the manufacturer’s instructions. Seven pairs of RT-qPCR primers were designed using the Primer 5 program (Supplementary Table S1). The 18S rRNA was used as an endogenous control gene. The RT-qPCR data were analyzed using the 2-ΔΔCT method.
Identification of LincRNAs
We used the published RNA-seq data from ILW pigs to identify lincRNAs (Zambonelli et al., 2016). These RNA-seq data were divided into two groups according to the BFT EBV (Zambonelli et al., 2016). The numbers of raw reads per sample were between 161.88 and 249.33 million. After removing adapter and low quality reads by using fastp software (Chen S. et al., 2018), approximately 152.81 to 236.66 million reads were mapped to the whole genome of Sus scrofa (11.1) (Table 1). Then, we reconstructed the transcriptome of each sample separately. Next, the transcripts of all samples were merged into a unique transcriptome. At this point, we obtained 76,822 transcripts, of which 1,149 were intergenic transcripts. We identified 252 putative lincRNAs from 1,149 transcripts based on the pipeline shown in Figure 1A, in which 34 did not overlap with the currently annotated coding or non-coding transcripts (Figure 1B). These putative lincRNAs were distributed in all chromosomes (Figure 1C).
Figure 1. (A) The pipeline for the identification of putative lincRNAs in this study. The frames in the direction of the arrow show the filtering process and the number of screened transcripts. (B) The number of known and novel lincRNAs. (C) The chromosome distribution of putative lincRNAs.
Characterization of LincRNAs
Previous studies found many differences between lincRNAs and protein-coding transcripts (Derrien et al., 2012; Liu et al., 2017). Based on the assembled transcriptome, we compared the characteristics of novel lincRNAs, known lincRNAs, and protein-coding transcripts. A total of 45,788 protein-coding transcripts corresponding to 22,342 genes in the pig annotation in Ensembl database and 12,103 known lincRNA transcripts encoded by 7,381 lincRNA genes in the pig lincRNA annotation in ALDB were observed (Li et al., 2015). The average transcript length of the protein-coding transcripts (3,285 bp) was longer than the novel lincRNA transcripts (1,011 bp) and the known lincRNA transcripts (1,891 bp) (Figure 2A). At the same time, the average exon length of the protein-coding transcripts (283 bp) was shorter than those of novel lincRNA transcripts (378 bp) and the known lincRNA transcripts (628 bp) (Figure 2B). In addition, the novel lincRNA transcripts (2.7) and the known lincRNA transcripts (3.0) had similar average exon number, but all were less than the protein-coding transcripts (11.6) (Figure 2C). In addition, the average expression level of protein-coding transcripts in the samples (3.18 FPKM) was higher than those of the novel lincRNA transcripts (0.51 FPKM) and the known lincRNA transcripts (0.90 FPKM) (Figure 2D). Our results are consistent with previous reports that the pig lncRNA genes have shorter transcript lengths, longer exon length, fewer exon number, and lower expression level than protein-coding genes (Zhou et al., 2014; Li et al., 2016; Tang et al., 2017).
Figure 2. Comparison of the characteristics of protein-coding genes, novel lincRNA genes and known lincRNA genes. (A) Comparison of transcript length. (B) Comparison of exon length. (C) Comparison of exon number. (D) Comparison of the expression level.
Differential Expression Analysis of LincRNA Genes and Protein-Coding Genes
To investigate the function of lincRNAs, we performed differential expression analysis on fat and lean samples based on expression levels. In both groups, we identified five DEL genes. Compared with lean pigs, two lincRNA genes (MSTRG.11102 and MSTRG.1952) were upregulated and three (MSTRG.12872, MSTRG.752, and MSTRG.9812) were downregulated in fat pigs (Figure 3A). In addition, we identified 43 differentially expressed protein-coding genes with 27 upregulations and 16 downregulations in fat pigs (Figure 3B).
Figure 3. (A) Heat map of five DEL genes between fat and lean samples. (B) Heat map of 43 differentially expressed protein-coding genes between fat and lean samples. Red represents an increase in expression, while green reduces expression.
Functional Enrichment Analysis of Adjacent Genes of LincRNAs
To explore the function of putative lincRNAs, we performed functional enrichment analysis of genes near the lincRNAs (<100 kb) with DAVID (Supplementary Table S2). The DAVID results showed that the adjacent genes of lincRNAs significantly participated in 25 biological processes (apoptotic process, amine metabolic process, negative regulation of cell proliferation, and protein K48-linked ubiquitination) and six pathways (phenylalanine metabolism, cell adhesion molecules, and glycosaminoglycan biosynthesis-chondroitin sulfate/dermatan sulfate) (P < 0.05). We found that the neighboring genes of lincRNAs were involved in lipid metabolisms, such as glycerophospholipid metabolism (Figure 4).
Functional Prediction of DELs
To further investigate the function of lincRNAs, we performed QTL mapping analysis on DELs (Supplementary Table S3). The result showed that 10 DELs were located in 140 QTLs, wherein 28 QTLs were associated with fat deposition (Figure 5A). Through the distribution of DELs in QTLs, we found that all 10 DELs were located in QTLs related to fat deposition, and most of the DELs were located in QTLs associated with backfat (Figure 5B). For example, eight DELs were located in the QTLs of the backfat weight QTL and shoulder subcutaneous fat thickness QTL. Seven DELs were located in the average BFT QTL and backfat at the last lumbar QTL. Simultaneously, we found that 140 QTLs were distributed on 1, 3, 5, 6, and 11 chromosomes (Figure 5C). In summary, we found that DELs mainly located in QTLs related to fat depositions, such as shoulder subcutaneous fat thickness, backfat weight, and average BFT.
Figure 5. Quantitative trait locus mapping analysis of DELs. (A) The number distribution of different QTLs. (B) The quantitative distribution of DELs which were located in QTLs associated with fat deposition. (C) The chromosome distribution of all QTLs.
Correlation Analysis Between DEL Genes and Its Adjacent Genes
Previous studies indicated that lincRNAs can affect the expression of their neighboring genes (Fatica and Bozzoni, 2014; Taylor et al., 2015; Engreitz et al., 2016). According to the expression level, we calculated the Pearson correlation coefficient of DEL genes and their nearby protein-coding genes (<100 kb), including three DEL genes and six protein-coding genes (Table 2). The results showed that the expression levels of three DEL genes were significantly positively correlated with five adjacent protein-coding genes and significantly negatively correlated with one adjacent protein-coding gene. Compared with lean pigs, MSTRG.11102 and MSTRG.1952 were upregulated in fat pigs, whereas MSTRG.9812 was downregulated. The expression of five adjacent protein-coding genes was upregulated, and one was downregulated by three DEL genes. Then, the function of each DEL gene was speculated according to the functions of their nearby protein-coding genes. Among them, the expression level of DEL gene (MSTRG.1952) was significantly positively correlated with the nearby protein-coding gene MEDAG (also known as MEDA-4). The expression level of DEL gene (MSTRG.9812) was significantly negatively correlated with the nearby protein-coding gene SPTBN1.
Expression Regulation Analysis of DEL Genes and Their DEPTGs
To investigate the function of the DEL genes, we analyzed the expression regulation relationship between DEL genes and their DEPTGs (Supplementary Table S4) and constructed the co-expression network by using Cytoscape_3.6.1 (Figure 6A). Based on the expression level, we found that five DEL genes and 36 DEPTGs were significantly correlated (Figure 6A), among which 32 of 36 DEPTGs were regulated by two or more DEL genes. Thus, we speculated that the regulatory mechanisms among DEL genes and their DEPTGs were complicated. In fat pigs, the expression levels of MSTRG.752, MSTRG.9812, MSTRG.12872, and RRAD were downregulated, and the expression levels of MSTRG.11102 and SHC3 were upregulated. Furthermore, the expression level of RRAD was significantly positively correlated with MSTRG.752 and MSTRG.9812. The expression level of SHC3 was significantly negatively correlated with MSTRG.752, MSTRG.9812, and MSTRG.12872 and significantly positively correlated with MSTRG.11102.
Figure 6. Expression regulation analysis of DEL genes and their DEPTGs. (A) Co-expression network of DELs and DEPTGs.DELs are indicated in red hexagons;DEPTGs are indicated in blue circles; light green edge represent DELs downregulate DEPTGs; dark green edge represent DELs upregulate DEPTGs. (B) Potential pathways for DELs and DEPTGs. red indicates that the genes are upregulated in fat pigs, green indicates downregulation in fat pigs, and yellow indicates potential pathways for DELs and DEPTGs; quadrilaterals represent DELs, irregular polygons represent DEPTGs, and dash lines indicate predict interactions.
Correlation Verification Between LincRNA Genes and Their PTGs
In the data analysis results of 20 samples (10 fat samples and 10 lean samples), we randomly selected three lincRNA genes (Supplementary Table S5) and their PTGs with significant positive correlation based on the expression level (MSTRG.4365 vs. TKT; MSTRG.10113 vs. STOML2; MSTRG.9843 vs. ELOVL6 and DBI). The correlation coefficients were all greater than 0.80, and the P-value were less than 0.01. To verify these results, we conducted the RT-qPCR experiment in 11 samples and performed linear regression analysis on the results. The four pairs of lincRNA genes and their PTGs were significantly positive correlated based on the expression level, with correlation coefficients greater than 0.65 and P- value less than 0.05. The experimental results of RT-qPCR showed that the results of the two datasets have good consistency, thereby further improving our research reliability (Figure 7).
Figure 7. Linear regression of lincRNA and PTG expression. The r0 and p0 represent the Pearson correlation coefficient and P-value of each pair of lincRNA and PTG in 20 samples, respectively;The r and p represent verification in 11 samples. (A) MSTRG.4365 vs. TKT; (B) MSTRG.10113 vs. STOML2; (C) MSTRG.9843 vs. ELOVL6; (D) MSTRG.9843 vs. DBI.
A large number of lincRNAs are present in the mammalian genomes (Carninci et al., 2005; Cabili et al., 2011). A small number of lincRNAs have been revealed in pigs; these lincRNAs play a key role in biological processes (Li et al., 2016; Che et al., 2018). In Zou’s study, some lincRNAs in longissimus dorsi muscle of Laiwu pigs may be involved in intramuscular fat-related biological processes, such as oxidative metabolism, lipid metabolism, and adipogenesis (Zou et al., 2018). Zou et al. (2018) provided a valuable resource for further study of lipid metabolism in pigs. However, a large number of lincRNAs are still unknown. In particular, many lincRNAs associated with subcutaneous fat development in pigs have not been identified. In this study, based on the published RNA-seq data, we performed a comprehensive identification and analysis of lincRNAs in subcutaneous adipose tissue of ILW pigs. We identified 252 candidate lincRNAs by using our laboratory’s previous research methods, wherein 34 of which were novel lincRNAs (Zou et al., 2017b), further enriching the annotation of lincRNAs and providing new insights into the study of lincRNAs in pigs.
In this study, lincRNAs have shorter lengths, fewer exons, and lower expression levels than protein-coding transcripts, consistent with previous reports in pigs and increases the reliability of our study (Zhou et al., 2014; Li et al., 2016; Tang et al., 2017). Our research is difficult because lincRNAs are largely unknown. LincRNA has several ways to regulate gene expression, such as cis- and trans-regulation (Yap et al., 2010; Trimarchi et al., 2014; Carmona et al., 2018). At present, cis-regulation is an important method for studying lincRNA function. We mainly looked at neighboring genes as PTGs in this work. Previous studies investigated the function of lincRNAs by genes that are adjacent to the lincRNAs (<100 kb) or have expression correlation with lincRNAs (Hong et al., 2018; Zhang et al., 2018). In this article, we identified 252 candidate lincRNAs between fat and lean samples. According to GO and pathway analysis, we found that the adjacent genes of lincRNAs (<100 kb) were involved in glycosaminoglycan biosynthetic process, glycerophospholipid metabolism, and glycosaminoglycan catabolic process. Among them, glycerophospholipid metabolism is closely related to fat deposition (Ecker and Liebisch, 2014; Jiang et al., 2018). Thus, we speculate that lincRNAs may have a regulatory role in subcutaneous fat development. The QTL mapping analysis of DELs further improved our speculation.
To determine the regulatory mechanisms of lincRNAs, we summarized the expression relationships of five DEL genes and their adjacent protein-coding genes (<100 kb). The expression level of lincRNA gene (MSTRG.1952) and mesenteric estrogen-dependent adipose gene (MEDAG, also known as MEDA-4) were significantly upregulated in fat pigs compared with lean pigs. MEDAG has previously been identified as a novel adipogenic gene in mouse and human that promotes lipid accumulation in adipocytes (Zhang et al., 2012). From this finding, we hypothesized that MSTRG.1952 may promote fat deposition by positively regulating the MEDAG expression in fat pigs. At the same time, high expression of lincRNA gene (MSTRG.1952) significantly upregulated the expression of arachidonate 5-lipoxygenase-activating protein (ALOX5AP) in fat pigs. Previous studies confirmed that ALOX5AP overexpression in mouse adipose tissue leads to lipoxin A4 (LXA4) production and diet-induced obesity (Mehrabian et al., 2005; Horrillo et al., 2010; Elias et al., 2016). In addition, obese subjects have significantly higher ALOX5AP expression in subcutaneous adipose tissue than lean subjects (Kaaman et al., 2006). Therefore, we hypothesized that MSTRG.1952 promotes the subcutaneous fat deposition by positively regulating the ALOX5AP expression in fat pigs.
Based on the expression level, we analyzed the expression regulation relationship between DEL genes and their DEPTGs. The results showed that five DEL genes and 36 DEPTGs were significantly correlated. According to previous studies (Pelicci et al., 2002; Ilany et al., 2006) and KEGG pathway database7, we found that some DEPTGs may be involved in lipid metabolism, including Ras-related glycolysis inhibitor and calcium channel regulator (RRAD) and SHC adaptor protein 3 (SHC3). Therefore, we constructed a potential pathway map of key genes and related DELs affecting lipid metabolism between fat and lean pigs (Figure 6B). Previous reports indicated a certain correlation between the expression level of RAD (also known as RRAD) and obesity (Ilany et al., 2006). The expression levels of lincRNA genes (MSTRG.752 and MSTRG.9812) and RRAD were significantly downregulated in fat pigs compared with lean pigs. RRAD overexpression can promote the increase in lipoprotein lipase (LPL) protein level, and LPL hydrolyzes triglycerides in lipoproteins into free fatty acids and monoacylglycerol molecule, thereby decreasing the triglyceride levels and affecting lipid metabolism (Ilany et al., 2006). Therefore, we speculated that MSTRG.752 and MSTRG.9812 promote fat deposition by reducing the RRAD expression in fat pigs. In addition, compared with lean pigs, the expression level of lincRNA gene (MSTRG.11102) and SHC3 were significantly upregulated in fat pigs, and the expression levels of lincRNA genes (MSTRG.752, MSTRG.9812, and MSTRG.12872) were significantly downregulated. Literature showed that Rai (SHC3) overexpression can induce Ras activation (Pelicci et al., 2002). Based on the KEGG pathway database7, we found that Ras may further affect glycerophospholipid metabolism. Thus, we speculated that MSTRG.752, MSTRG.9812, MSTRG.12872, and MSTRG.11102 promote fat deposition by increasing the SHC3 expression in fat pigs. Collectively, we speculated that four DEL genes may affect subcutaneous fat development by regulating the expression of RRAD and SHC3.
In this study, we identified and analyzed the lincRNAs in the subcutaneous adipose tissue of pigs and found that some lincRNAs may affect subcutaneous fat deposition, especially DEL genes, thereby resulting in the difference in BFT between fat and lean pigs. However, the specific regulation mechanism of lincRNAs on subcutaneous fat development is still unclear, and further studies are needed. Given that pig lincRNAs are largely unknown, our research provided valuable resources. This work also provided an ideal candidate for future studies of the function of lincRNAs in the subcutaneous fat development.
Publicly available datasets were analyzed in this study. This data can be found here: https://www.ncbi.nlm.nih.gov/sra?LinkName=bioproject_sra_all&from_uid=281483.
CL conceived and designed this study. GS, LC, and GC analyzed the data with the help of CZ and CF. GS wrote the paper with the help of CZ and CF. RT-qPCR experiment was completed with the help of JL and ML. All authors reviewed the final manuscript.
The work was supported by National Natural Science Foundation of China (NSFC, 31872322), the Fundamental Research Funds for the Central Universities (2662017PY030), National High Technology Research and Development Plan (863, 2011AA100302), National Natural Science Foundation of China (NSFC, 31601859), and China Postdoctoral Science Foundation Funded Project (2016M592344).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2019.00160/full#supplementary-material
TABLE S1 | The information of seven pairs of RT-qPCR primers.
TABLE S2 | Gene ontology and pathway analysis of adjacent genes of lincRNAs.
TABLE S3 | Quantitative trait locus mapping analysis on DELs.
TABLE S4 | The expression regulation relationship between DEL genes and their DEPTGs.
TABLE S5 | Sequence information of lincRNAs used in RT-qPCR.
ALDB, the domestic-animal lncRNA database; BFT, backfat thickness; CPC, coding potential calculator; DAVID, the database for annotation, visualization and integrated discovery; DELs, differentially expressed lincRNAs; DEPTGs, differentially expressed potential target genes; EBV, estimated breeding value; FPKM, fragments per kilobase of transcript per million mapped reads; GEO, Gene Expression Omnibus; GTF, gene transfer format; GO, gene ontology; ILW, Italian large white; KEGG, kyoto encyclopedia of genes and genomes; LincRNAs, long intergenic non-coding RNAs; LncRNA, long non-coding RNA; NCBI, National Center for Biotechnology Information; NR, non-redundant; PTG, potential target gene; QTL, quantitative trait locus; RT-qPCR, real-time quantitative polymerase chain reaction.
- ^ http://res.xaut.edu.cn/aldb/download.jsp
- ^ http://www.noncode.org/download.php
- ^ http://ftp.ensemblorg.ebi.ac.uk/pub/release-93/gtf/sus_scrofa/
- ^ https://www.ebi.ac.uk/Tools/st/emboss_transeq/
- ^ http://pfam.xfam.org/search
- ^ https://www.animalgenome.org/cgi-bin/QTLdb/SS/index
- ^ https://www.genome.jp/kegg/pathway.html
Benjamini, Y., Drai, D., Elmer, G., Kafkafi, N., and Golani, I. (2001). Controlling the false discovery rate in behavior genetics research. Behav. Brain Res. 125, 279–284. doi: 10.1016/S0166-4328(01)00297-2
Cabili, M. N., Trapnell, C., Goff, L., Koziol, M., Tazon-Vega, B., Regev, A., et al. (2011). Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 25, 1915–1927. doi: 10.1101/gad.17446611
Carmona, S., Lin, B., Chou, T., Arroyo, K., and Sun, S. (2018). LncRNA Jpx induces Xist expression in mice using both trans and cis mechanisms. PLoS Genet. 14:e1007378. doi: 10.1371/journal.pgen.1007378
Carpenter, S., Aiello, D., Atianand, M. K., Ricci, E. P., Gandhi, P., Hall, L. L., et al. (2013a). A long noncoding rna mediates both activation and repression of immune response genes. Science 341, 789–792. doi: 10.1126/science.1240925
Carpenter, S., Atianand, M., Aiello, D., Ricci, E., Gandhi, P., Hall, L. L., et al. (2013b). LincRNA-Cox2 is a long noncoding RNA induced by TLRs that mediates both activation and repression of immune response genes. Cytokine 63, 251–251. doi: 10.1016/j.cyto.2013.06.037
Derrien, T., Johnson, R., Bussotti, G., Tanzer, A., Djebali, S., Tilgner, H., et al. (2012). The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 22, 1775–1789. doi: 10.1101/gr.132159.111
Ding, C. M., Lim, Y. C., Chia, S. Y., Walet, A. C. E., Xu, S. H., Lo, K. A., et al. (2018). De novo reconstruction of human adipose transcriptome reveals conserved lncRNAs as regulators of brown adipogenesis. Nat. Commun. 9:1329. doi: 10.1038/s41467-018-03754-3
Ecker, J., and Liebisch, G. (2014). Application of stable isotopes to investigate the metabolism of fatty acids, glycerophospholipid and sphingolipid species. Prog. Lipid Res. 54, 14–31. doi: 10.1016/j.plipres.2014.01.002
Elias, I., Ferre, T., Vila, L., Munoz, S., Casellas, A., Garcia, M., et al. (2016). ALOX5AP overexpression in adipose tissue leads to LXA4 production and protection against diet-induced obesity and insulin resistance. Diabetes 65, 2139–2150. doi: 10.2337/db16-0040
Engreitz, J. M., Haines, J. E., Perez, E. M., Munson, G., Chen, J., Kane, M., et al. (2016). Local regulation of gene expression by lncRNA promoters, transcription and splicing. Nature 539, 452–455. doi: 10.1038/nature20149
Gao, H., Kerr, A., Jiao, H., Hon, C. C., Ryden, M., Dahlman, I., et al. (2018). Long non-coding RNAs associated with metabolic traits in human white adipose tissue. Ebiomedicine 30, 248–260. doi: 10.1016/j.ebiom.2018.03.010
Gao, P. F., Guo, X. H., Du, M., Cao, G. Q., Yang, Q. C., Pu, Z. D., et al. (2017). LncRNA profiling of skeletal muscles in large white pigs and mashen pigs during development. J. Anim. Sci. 95, 4239–4250. doi: 10.2527/jas2016.1297
Guo, Y. M., Qiu, H. Q., Xiao, S. J., Wu, Z. F., Yang, M., Yang, J., et al. (2017). A genome-wide association study identifies genomic loci associated with backfat thickness, carcass weight, and body weight in two commercial pig populations. J. Appl. Genet. 58, 499–508. doi: 10.1007/s13353-017-0405-6
Guttman, M., Garber, M., Levin, J. Z., Donaghey, J., Robinson, J., Adiconis, X., et al. (2010). Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat. Biotechnol. 28, 503–510. doi: 10.1038/nbt.1633
Hanson, C., Rutten, E. P., Wouters, E. F., and Rennard, S. (2014). Influence of diet and obesity on COPD development and outcomes. Int. J. Chron. Obstruct. Pulmon. Dis. 9, 723–733. doi: 10.2147/COPD.S50111
Hao, Z. Q., Fan, C. Y., Cheng, T., Su, Y., Wei, Q., and Li, G. L. (2015). Genome-wide identification, characterization and evolutionary analysis of long intergenic noncoding rnas in cucumber. Plos One 10:e0121800. doi: 10.1371/journal.pone.0121800
Horrillo, R., Gonzalez-Periz, A., Martinez-Clemente, M., Lopez-Parra, M., Ferre, N., Titos, E., et al. (2010). 5-lipoxygenase activating protein signals adipose tissue inflammation and lipid dysfunction in experimental obesity. J. Immunol. 184, 3978–3987. doi: 10.4049/jimmunol.0901355
Huang da, W., Sherman, B. T., Zheng, X., Yang, J., Imamichi, T., Stephens, R., et al. (2009). Extracting biological meaning from large gene lists with DAVID. Curr. Protoc. Bioinform. 27, 13.11.1–13.11.13. doi: 10.1002/0471250953.bi1311s27
Ilany, J., Bilan, P. J., Kapur, S., Caldwell, J. S., Patti, M. E., Marette, A., et al. (2006). Overexpression of Rad in muscle worsens diet-induced insulin resistance and glucose intolerance and lowers plasma triglyceride level. Proc. Natl. Acad. Sci. U S A. 103, 4481–4486. doi: 10.1073/pnas.0511246103
Jiang, Y., Tang, S., Wang, C., Wang, Y., Qin, Y., Wang, Y., et al. (2018). A genome-wide association study of growth and fatness traits in two pig populations with different genetic backgrounds. J. Anim. Sci. 96, 806–816. doi: 10.1093/jas/skx038
Kaaman, M., Ryden, M., Axelsson, T., Nordstrom, E., Sicard, A., Bouloumie, A., et al. (2006). ALOX5AP expression, but not gene haplotypes, is associated with obesity and insulin resistance. Int. J. Obes. 30, 447–452. doi: 10.1038/sj.ijo.0803147
Kasinathan, P., Wei, H., Xiang, T. H., Molina, J. A., Metzger, J., Broek, D., et al. (2015). Acceleration of genetic gain in cattle by reduction of generation interval. Sci. Rep. 5:8674. doi: 10.1038/srep08674
Keel, B. N., and Snelling, W. M. (2018). Comparison of burrows-wheeler transform-based mapping algorithms used in high-throughput whole-genome sequencing: application to illumina data for livestock genomes. Front. Genet. 9:35. doi: 10.3389/fgene.2018.00035
Kersey, P. J., Allen, J. E., Christensen, M., Davis, P., Falin, L. J., Grabmueller, C., et al. (2014). Ensembl genomes 2013: scaling up access to genome-wide data. Nucl. Acids Res. 42, D546–D552. doi: 10.1093/nar/gkt979
Kim, T., Seo, H. D., Hennighausen, L., Lee, D., and Kang, K. (2018). Octopus-toolkit: a workflow to automate mining of public epigenomic and transcriptomic next-generation sequencing data. Nucl. Acids Res. 46:e53. doi: 10.1093/nar/gky083
Kong, L., Zhang, Y., Ye, Z. Q., Liu, X. Q., Zhao, S. Q., Wei, L., et al. (2007). CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucl. Acids Res. 35, W345–W349. doi: 10.1093/nar/gkm391
Li, J., Gao, Z., Wang, X., Liu, H., Zhang, Y., and Liu, Z. (2016). Identification and functional analysis of long intergenic noncoding RNA genes in porcine pre-implantation embryonic development. Sci. Rep. 6:38333. doi: 10.1038/srep38333
Lim, Y. H., Kwon, D. H., Kim, J., Park, W. J., Kook, H., and Kim, Y. K. (2018). Identification of long noncoding RNAs involved in muscle differentiation. PLoS One 13:e0193898. doi: 10.1371/journal.pone.0193898
Liu, S. L., Wang, Z. B., Chen, D., Zhang, B. W., Tian, R. R., Wu, J., et al. (2017). Annotation and cluster analysis of spatiotemporal- and sex-related lncRNA expression in rhesus macaque brain. Genome Res. 27, 1608–1620. doi: 10.1101/gr.217463.116
Liu, Y. C., Ferguson, J. F., Xue, C. Y., Ballantyne, R. L., Silverman, I. M., Gosai, S. J., et al. (2014). Tissue-specific RNA-Seq in human evoked inflammation identifies blood and adipose lincrna signatures of cardiometabolic diseases. Arterioscl. Thromb. Vascul. Biol. 34, 902–912. doi: 10.1161/Atvbaha.113.303123
Lv, J., Cui, W., Liu, H. B., He, H. J., Xiu, Y. C., Guo, J., et al. (2013). Identification and characterization of long non-coding rnas related to mouse embryonic brain development from available transcriptomic data. PLoS One 8:e71152. doi: 10.1371/journal.pone.0071152
Lv, J., Huang, Z. J., Liu, H., Liu, H. B., Cui, W., Li, B., et al. (2014). Identification and characterization of long intergenic non-coding RNAs related to mouse liver development. Mol. Genet. Genomics 289, 1225–1235. doi: 10.1007/s00438-014-0882-9
Mehrabian, M., Allayee, H., Stockton, J., Lum, P. Y., Drake, T. A., Castellani, L. W., et al. (2005). Integrating genotypic and expression data in a segregating mouse population to identify 5-lipoxygenase as a susceptibility gene for obesity and bone traits. Nat. Genet. 37, 1224–1233. doi: 10.1038/ng1619
Pauli, A., Valen, E., Lin, M. F., Garber, M., Vastenhouw, N. L., Levin, J. Z., et al. (2012). Systematic identification of long noncoding RNAs expressed during zebrafish embryogenesis. Genome Res. 22, 577–591. doi: 10.1101/gr.133009.111
Pelicci, G., Troglio, F., Bodini, A., Melillo, R. M., Pettirossi, V., Coda, L., et al. (2002). The neuron-specific rai (ShcC) adaptor protein inhibits apoptosis by coupling ret to the phosphatidylinositol 3-kinase/Akt signaling pathway. Mol. Cell Biol. 22, 7351–7363. doi: 10.1128/MCB.22.20.7351-7363.2002
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T., and Salzberg, S. L. (2016). Transcript-level expression analysis of RNA-seq experiments with HISAT. StringTie Ballg. Nat. Protoc. 11, 1650–1667. doi: 10.1038/nprot.2016.095
Pertea, M., Pertea, G. M., Antonescu, C. M., Chang, T. C., Mendell, J. T., and Salzberg, S. L. (2015). StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295. doi: 10.1038/nbt.3122
Salleh, F. H. M., Arif, S. M., Zainudin, S., and Firdaus-Raih, M. (2015). Reconstructing gene regulatory networks from knock-out data using gaussian noise model and pearson correlation coefficient. Comput. Biol. Chem. 59, 3–14. doi: 10.1016/j.compbiolchem.2015.04.012
Schook, L. B., Collares, T. V., Darfour-Oduro, K. A., De, A. K., Rund, L. A., Schachtschneider, K. M., et al. (2015). Unraveling the swine genome: implications for human health. Annu. Rev. Anim. Biosci. 3, 219–244. doi: 10.1146/annurev-animal-022114-110815
Shin, D., Lee, C., Park, K. D., Kim, H., and Cho, K. H. (2017). Genome-association analysis of korean holstein milk traits using genomic estimated breeding value. Asian-Aus. J. Anim. Sci. 30, 309–319. doi: 10.5713/ajas.15.0608
Tang, Z. L., Wu, Y., Yang, Y. L., Yang, Y. C. T., Wang, Z. S., Yuan, J. P., et al. (2017). Comprehensive analysis of long non-coding RNAs highlights their spatio-temporal expression patterns and evolutional conservation in Sus scrofa. Sci. Rep. 7:43166. doi: 10.1038/srep43166
Trimarchi, T., Bilal, E., Ntziachristos, P., Fabbri, G., Dalla-Favera, R., Tsirigos, A., et al. (2014). Genome-wide mapping and characterization of notch-regulated long noncoding RNAs in acute leukemia. Cell 158, 593–606. doi: 10.1016/j.cell.2014.05.049
Wang, J., Fu, L. Y., Koganti, P. P., Wang, L., Hand, J. M., Ma, H., et al. (2016). Identification and functional prediction of large intergenic noncoding rnas (lincrnas) in rainbow trout (Oncorhynchus mykiss). Mar. Biotechnol. 18, 271–282. doi: 10.1007/s10126-016-9689-5
Wu, W., Zhang, J., Zhao, C., Sun, Y., Pang, W., and Yang, G. (2017). CTRP6 regulates porcine adipocyte proliferation and differentiation by the adipoR1/MAPK signaling pathway. J. Agric Food Chem. 65, 5512–5522. doi: 10.1021/acs.jafc.7b00594
Yap, K. L., Li, S., Munoz-Cabello, A. M., Raguz, S., Zeng, L., Mujtaba, S., et al. (2010). Molecular interplay of the noncoding RNA ANRIL and methylated histone H3 lysine 27 by polycomb CBX7 in transcriptional silencing of INK4a. Mol. Cell 38, 662–674. doi: 10.1016/j.molcel.2010.03.021
Zambonelli, P., Gaffo, E., Zappaterra, M., Bortoluzzi, S., and Davoli, R. (2016). Transcriptional profiling of subcutaneous adipose tissue in italian large white pigs divergent for backfat thickness. Anim. Genet. 47, 306–323. doi: 10.1111/age.12413
Zhang, G., Chen, D., Zhang, T., Duan, A., Zhang, J., and He, C. (2018). Transcriptomic and functional analyses unveil the role of long non-coding RNAs in anthocyanin biosynthesis during sea buckthorn fruit ripening. DNA Res. 25, 465–476. doi: 10.1093/dnares/dsy017
Zhang, H., Chen, X., and Sairam, M. R. (2012). Novel genes of visceral adiposity: identification of mouse and human mesenteric estrogen-dependent adipose (MEDA)-4 gene and its adipogenic function. Endocrinology 153, 2665–2676. doi: 10.1210/en.2011-2008
Zhang, K. S., Huang, K. F., Luo, Y. P., and Li, S. G. (2014). Identification and functional analysis of long non-coding RNAs in mouse cleavage stage embryonic development based on single cell transcriptome data. BMC Genomics 15:845. doi: 10.1186/1471-2164-15-845
Zhao, C., Chen, X., Wu, W., Wang, W., Pang, W., and Yang, G. (2016). MAT2B promotes adipogenesis by modulating SAMe levels and activating AKT/ERK pathway during porcine intramuscular preadipocyte differentiation. Exp. Cell Res. 344, 11–21. doi: 10.1016/j.yexcr.2016.02.019
Zhao, C., Yao, X., Chen, X., Wu, W., Xi, F., Yang, G., et al. (2018). Knockdown of ubiquitin D inhibits adipogenesis during the differentiation of porcine intramuscular and subcutaneous preadipocytes. Cell Prolif. 51:e12401. doi: 10.1111/cpr.12401
Zhao, W. M., Mu, Y. L., Ma, L., Wang, C., Tang, Z. L., Yang, S. L., et al. (2015). Systematic identification and characterization of long intergenic non-coding RNAs in fetal porcine skeletal muscle development. Sci. Rep. 5:8957. doi: 10.1038/srep08957
Zhou, Z. Y., Li, A. M., Adeola, A. C., Liu, Y. H., Irwin, D. M., Xie, H. B., et al. (2014). Genome-wide identification of long intergenic noncoding RNA genes and their potential association with domestication in pigs. Genome Biol. Evol. 6, 1387–1392. doi: 10.1093/gbe/evu113
Zhou, Z. Y., Li, A. M., Wang, L. G., Irwin, D. M., Liu, Y. H., Xu, D., et al. (2015). DNA methylation signatures of long intergenic noncoding RNAs in porcine adipose and muscle tissues. Sci. Rep. 5:15435. doi: 10.1038/srep15435
Zou, C., Li, J., Luo, W., Li, L., Hu, A., Fu, Y., et al. (2017a). Transcriptome analysis reveals long intergenic non-coding RNAs involved in skeletal muscle growth and development in pig. Sci. Rep. 7:8704. doi: 10.1038/s41598-017-07998-9
Zou, C., Li, S., Deng, L. L., Guan, Y., Chen, D., Yuan, X. K., et al. (2017b). Transcriptome analysis reveals long intergenic noncoding RNAs contributed to growth and meat quality differences between yorkshire and wannanhua pig. Genes 8:203. doi: 10.3390/genes8080203
Keywords: lincRNA, RNA-seq, subcutaneous fat development, backfat thickness, pig
Citation: Shi G, Chen L, Chen G, Zou C, Li J, Li M, Fang C and Li C (2019) Identification and Functional Prediction of Long Intergenic Non-coding RNAs Related to Subcutaneous Adipose Development in Pigs. Front. Genet. 10:160. doi: 10.3389/fgene.2019.00160
Received: 20 November 2018; Accepted: 14 February 2019;
Published: 04 March 2019.
Edited by:Enrique Medina-Acosta, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Brazil
Reviewed by:Bao Yuan, Jilin University, China
Lei Zhou, Guangxi University, China
Carrie Finno, University of California, Davis, United States
Copyright © 2019 Shi, Chen, Chen, Zou, Li, Li, Fang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Changchun Li, firstname.lastname@example.org