RNA Sequencing (RNA-Seq) Reveals Extremely Low Levels of Reticulocyte-Derived Globin Gene Transcripts in Peripheral Blood From Horses (Equus caballus) and Cattle (Bos taurus)

Correia, Carolina N.; McLoughlin, Kirsten E.; Nalpas, Nicolas C.; Magee, David A.; Browne, John A.; Rue-Albrecht, Kevin; Gordon, Stephen V.; MacHugh, David E.

doi:10.3389/fgene.2018.00278

ORIGINAL RESEARCH article

Front. Genet., 14 August 2018

Sec. Livestock Genomics

Volume 9 - 2018 | https://doi.org/10.3389/fgene.2018.00278

RNA Sequencing (RNA-Seq) Reveals Extremely Low Levels of Reticulocyte-Derived Globin Gene Transcripts in Peripheral Blood From Horses (Equus caballus) and Cattle (Bos taurus)

1. Animal Genomics Laboratory, UCD School of Agriculture and Food Science, UCD College of Health and Agricultural Sciences University College Dublin, Dublin, Ireland
2. UCD School of Veterinary Medicine, UCD College of Health and Agricultural Sciences University College Dublin, Dublin, Ireland
3. UCD Conway Institute of Biomolecular and Biomedical Research University College Dublin, Dublin, Ireland

Article metrics

View details

Citations

9,1k

Views

4,5k

Downloads

Abstract

RNA-seq has emerged as an important technology for measuring gene expression in peripheral blood samples collected from humans and other vertebrate species. In particular, transcriptomics analyses of whole blood can be used to study immunobiology and develop novel biomarkers of infectious disease. However, an obstacle to these methods in many mammalian species is the presence of reticulocyte-derived globin mRNAs in large quantities, which can complicate RNA-seq library sequencing and impede detection of other mRNA transcripts. A range of supplementary procedures for targeted depletion of globin transcripts have, therefore, been developed to alleviate this problem. Here, we use comparative analyses of RNA-seq data sets generated from human, porcine, equine, and bovine peripheral blood to systematically assess the impact of globin mRNA on routine transcriptome profiling of whole blood in cattle and horses. The results of these analyses demonstrate that total RNA isolated from equine and bovine peripheral blood contains very low levels of globin mRNA transcripts, thereby negating the need for globin depletion and greatly simplifying blood-based transcriptomic studies in these two domestic species.

Introduction

It is increasingly recognised that new technological approaches are urgently required for infectious disease diagnosis, surveillance, and management in burgeoning domestic animal populations as livestock production intensifies across the globe (Thornton, 2010; Nabarro and Wannous, 2014; Animal Task Force, 2016). In this regard, new strategies have emerged that leverage peripheral blood gene expression to study host immunobiology and to identify panels of RNA transcript biomarkers that can be used as specific biosignatures of infection by particular pathogens for both animal and human infectious disease (Ramilo and Mejias, 2009; Mejias and Ramilo, 2014; Chaussabel, 2015; Ko et al., 2015; Holcomb et al., 2017). For example, we and others have applied this approach to bovine tuberculosis (BTB) caused by infection with Mycobacterium bovis (Meade et al., 2007; Killick et al., 2011; Blanco et al., 2012; Churbanov and Milligan, 2012; McLoughlin et al., 2014; Cheng et al., 2015). It is also important to note that peripheral blood transcriptomics using technologies such as microarrays or RNA-sequencing (RNA-seq) can be used to monitor changes in the physiological status of domestic animals due to reproductive status, diet and nutrition or stress (O'Loughlin et al., 2012; Takahashi et al., 2012; Song et al., 2013; Kolli et al., 2014; Shen et al., 2014; de Greeff et al., 2016; Elgendy et al., 2016; Jégou et al., 2016).

During the last 15 years, a major hindrance to whole blood transcriptomics studies has emerged, which is the presence of large quantities of globin mRNA transcripts in peripheral blood from many mammalian species (Wu et al., 2003; Fan and Hegde, 2005; Liu et al., 2006). This is a consequence of abundant α globin and β globin mRNA transcripts in circulating reticulocytes, which in humans, may account for more than 95% of the total cellular mRNA content in these immature erythrocytes (Debey et al., 2004). Reticulocytes, in turn, account for 1–4% of the erythrocytes in healthy adult humans, which corresponds to between 5 × 10⁷ and 2 × 10⁸ cells per ml compared to 7 × 10⁶ cells per ml for leukocytes (Greer et al., 2013). Hence, globin transcripts can account for a substantial proportion of total detectable mRNAs in peripheral blood samples collected from humans and many other mammals (Bruder et al., 2010; Winn et al., 2010; Schwochow et al., 2012; Choi et al., 2014; Shin et al., 2014; Bowyer et al., 2015; Huang et al., 2016; Morey et al., 2016). In particular, for humans, more than 70% of peripheral blood mRNA transcripts are derived from the haemoglobin subunit alpha 1, subunit alpha 2 and subunit beta genes (HBA1, HBA2, and HBB) (Wu et al., 2003; Field et al., 2007; Mastrokolias et al., 2012).

The emergence of massively parallel transcriptome profiling for clinical applications in human peripheral blood—initially with gene expression microarrays, but more recently using RNA-seq—has prompted development of methods for the systematic reduction of globin mRNAs in total RNA samples purified from peripheral blood samples, including: oligonucleotides that bind to globin mRNA molecules with subsequent digestion of the RNA strand of the RNA:DNA hybrid (Wu et al., 2003); peptide nucleic acid (PNA) oligonucleotides that are complementary to globin mRNAs and block reverse transcription of these targets (Liu et al., 2006); the GLOBINclear™ system, which uses biotinylated oligonucleotides that hybridise with globin transcripts followed by capture and separation using streptavidin-coated magnetic beads (Field et al., 2007); and the recently introduced GlobinLock method that uses a pair of modified oligonucleotides complementary to the 3′ portion of globin transcripts and that block enzymatic extension (Krjutškov et al., 2016).

In the present study we use RNA-seq data generated from globin-depleted and non-depleted total RNA purified from human and porcine peripheral blood, in conjunction with non-depleted total RNA isolated from equine and bovine peripheral blood, for a comparative investigation of the impact of reticulocyte-derived globin mRNA transcripts on routine transcriptome profiling of blood in domestic cattle and horses. The primary objective of the present study to test the hypothesis that both cattle and horses exhibit significantly lower quantities of haemoglobin gene transcripts compared to humans and pigs.

Materials and methods

Data sources

RNA-seq data sets from human peripheral whole blood samples used for assessment of globin depletion and with parallel non-depleted controls (Shin et al., 2014) were obtained from the NCBI Gene Expression Omnibus (GEO) database (accession number GSE53655). A comparable RNA-seq data set from globin-depleted and non-depleted porcine peripheral whole blood was obtained directly from the study authors (Choi et al., 2014). A published RNA-seq data set (Ropka-Molik et al., 2017) from equine non-depleted peripheral whole blood was obtained from the NCBI GEO database (accession number GSE83404). Finally, bovine RNA-seq data from peripheral whole blood were generated by us as described below and can be obtained from the European Nucleotide Archive (ENA) database (PRJEB27764). A summary overview of the methodology used for the current study is shown in Figure 1.

Figure 1

Human, porcine and equine sample collection, globin depletion, and RNA-seq libraries

Detailed information concerning ethics approval, sample collection, total RNA extraction, and RNA-seq library preparation and sequencing for the human, porcine, and equine data sets is provided in the original publications (Choi et al., 2014; Shin et al., 2014; Ropka-Molik et al., 2017). Supplementary Table 1 provides summary information on the human, porcine and equine samples and RNA-seq libraries.

In brief, for the human samples, peripheral blood from six healthy subjects (three females and three males) was collected into PAXgene blood RNA tubes (PreAnalytiX/Qiagen Ltd., Manchester, UK). Total RNA, including small RNAs, was purified from the collected blood samples using the PAXgene Blood miRNA Kit (PreAnalytiX/Qiagen Ltd.) as described by Shin et al. (2014). Human HBA1, HBA2 and HBB mRNA transcripts were depleted from a subset of the total RNA samples using the GLOBINclear kit (Invitrogen™/Thermo Fisher Scientific, Loughborough, UK). RNA-seq data was then generated using 24 paired-end (PE) RNA-seq libraries (12 undepleted and 12 globin-depleted) generated from the six biological replicates and six identical technical replicates created from pooled total RNA across all six donor samples. The multiplexing and sequencing was then performed such that data for the 12 samples in each treatment group (undepleted and globin depleted) was generated from two separate lanes of a single flow cell twice, for a total of four sequencing lanes (Shin et al., 2014).

Porcine peripheral blood samples were collected from 12 healthy crossbred pigs [Duroc × (Landrace × Yorkshire)] using Tempus™ blood RNA tubes (Applied Biosystems™/Thermo Fisher Scientific, Warrington, UK) and total RNA was purified using the MagMAX™ for Stabilized Blood Tubes RNA Isolation Kit (Invitrogen™/Thermo Fisher Scientific) (Choi et al., 2014). Porcine HBA and HBB mRNA transcripts were subsequently depleted from a subset of the total RNA samples using a modified RNase H globin depletion method with custom porcine-specific antisense oligonucleotides for HBA and HBB. RNA-seq data was then generated from 24 PE RNA-seq libraries (12 undepleted and 12 globin-depleted).

Equine peripheral blood samples were collected using Tempus™ blood RNA tubes from 12 healthy Arabian horses (five females and seven males) at three different time points during flat racing training (Ropka-Molik et al., 2017). In addition, peripheral blood samples were collected from six healthy untrained Arabian horses (two females and four males). Total RNA was purified using the MagMAX™ for Stabilized Blood Tubes RNA Isolation Kit and 37 of the 42 total RNA samples were used to generate single-end (SE) libraries for RNA-seq data generation. Globin depletion for the equine samples was not performed prior to RNA-seq library preparation (Katarzyna Ropka-Molik, pers. comm.).

Bovine peripheral blood collection and RNA extraction

Approximately 3 ml of peripheral blood from 10 age-matched healthy male Holstein-Friesian calves were collected into Tempus™ blood RNA tubes. The Tempus™ Spin RNA Isolation Kit (Applied Biosystems™/Thermo Fisher Scientific) was used to perform total RNA extraction and purification, following the manufacturer's instructions. RNA quantity and quality checking were performed using a NanoDrop™ 1,000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and an Agilent 2,100 Bioanalyzer using an RNA 6,000 Nano LabChip kit (Agilent Technologies Ltd., Cork, Ireland). The majority of samples displayed a 260/280 ratio >1.8 and an RNA integrity number (RIN) >8.0 (Supplementary Table 2). Globin mRNA depletion was not performed on the total RNA samples purified from bovine peripheral blood samples.

Bovine RNA-seq library generation and sequencing

Individually barcoded strand-specific RNA-seq libraries were prepared with 1 μg of total RNA from each sample. Two rounds of poly(A)⁺ RNA purification were performed for all RNA samples using the Dynabeads® mRNA DIRECT™ Micro Kit (Thermo Fisher Scientific) according to the manufacturer's instructions. The purified poly(A)⁺ RNA was then used to generate strand-specific RNA-seq libraries using the ScriptSeq™ v2 RNA-Seq Library Preparation Kit, the ScriptSeq™ Index PCR Primers (Sets 1 to 4) and the FailSafe™ PCR enzyme system (all sourced from Epicentre®/Illumina® Inc., Madison, WI, USA), according to the manufacturer's instructions.

RNA-seq libraries were purified using the Agencourt® AMPure® XP system (Beckman Coulter Genomics, Danvers, MA, USA) according to the manufacturer's instructions for double size selection (0.75 × followed by 1.0 × ratio). RNA-seq libraries were quantified using a Qubit® fluorometer and Qubit® dsDNA HS Assay Kit (Invitrogen™/Thermo Fisher Scientific), while library quality checks were performed using an Agilent 2,100 Bioanalyzer and High Sensitivity DNA Kit (Agilent Technologies Ltd.). Individually barcoded RNA-seq libraries were pooled in equimolar quantities and the quantity and quality of the final pooled libraries (three pools in total) were assessed as described above. Cluster generation and high-throughput sequencing of three pooled RNA-seq libraries were performed using an Illumina® HiSeq™ 2,000 Sequencing System at the MSU Research Technology Support Facility (RTSF) Genomics Core (https://rtsf.natsci.msu.edu/genomics; Michigan State University, MI, USA). Each of the three pooled libraries were sequenced independently on five lanes split across multiple Illumina® flow cells. The pooled libraries were sequenced as PE 2 × 100 nucleotide reads using Illumina® version 5.0 sequencing kits.

Deconvolution (filtering and segregation of sequence reads based on the unique RNA-seq library barcode index sequences; Supplementary Table 2) was performed by the MSU RTSF Genomics Core using a pipeline that simultaneously demultiplexed and converted pooled sequence reads into discrete FASTQ files for each RNA-seq sample with no barcode index mismatches permitted. The RNA-seq FASTQ sequence read data for the bovine samples were obtained from the MSU RTSF Genomics Core FTP server.

RNA-seq data quality control and filtering/trimming of reads

Bioinformatics procedures and analyses were performed as described below for the human, porcine, equine, and bovine samples, except were specifically indicated. All of the bioinformatics workflow scripts were developed using GNU bash (version 4.3.48) (Free Software Foundation, 2013), Python (version 3.5.2) (Python Software Foundation, 2017), and R (version 3.4.0) (R Core Team, 2017). The scripts and further information are available at a public GitHub repository (https://github.com/carolcorreia/Globin_RNA-sequencing). Computational analyses were performed on a 32-core Linux Compute Server (4 × AMD Opteron™ 6220 processors at 3.0 GHz with 8 cores each), with 256 GB of RAM, 24 TB of hard disk drive storage, and with Ubuntu Linux OS (version 14.04.4 LTS). Deconvoluted FASTQ files (generated from SE equine RNA-seq libraries and PE RNA-seq libraries for the other species) were quality-checked with FastQC (version 0.11.5) (Andrews, 2016).

Using the ngsShoRT software package (version 2.2) (Chen et al., 2014), filtering/trimming consisted of: (1) removal of SE or PE reads with adapter sequences (with up to three mismatches); (2) removal of SE or PE reads of poor quality (i.e., at least one of the reads containing ≥25% bases with a Phred quality score below 20); (3) for porcine samples only, 10 bases were trimmed at the 3′ end of all reads; (4) removal of SE or PE reads that did not meet the required minimum length (70 nucleotides for human and equine, 80 nucleotides for porcine and 100 nucleotides for bovine). Filtered/trimmed FASTQ files were then re-evaluated using FastQC. Filtered FASTQ files were transferred to a 36-core/64-thread Compute Server (2 × Intel® Xeon® CPU E5-2697 v4 at 2.30 GHz with 18 cores each), with 512 GB of RAM, 96 TB SAS storage (12 × 8 TB at 7200 rpm), 480 GB SSD storage, and with Ubuntu Linux OS (version 16.04.2 LTS).

Transcript quantification

The Salmon software package (version 0.8.2) (Patro et al., 2017) was used in quasi-mapping-mode for transcript quantification. Sequence-specific and fragment-level GC bias correction was enabled and transcript abundance was quantified in transcripts per million (TPM) for each filtered library (multiple lanes from the same library were processed together) was estimated after mapping of SE or PE reads to their respective reference transcriptomes. As summarised in Table 1, the NCBI RefSeq database is currently the only one to contain haemoglobin gene annotations for all species analysed. Hence, NCBI RefSeq reference transcript models were used for the human, porcine, equine, and bovine data sets. Detailed information about these reference transcriptomes is provided in Supplementary Table 3.

Table 1

	Ensembl	NCBI RefSeq	UCSC table browser
Homo sapiens
Annotation release	Human release 92 (April 2018)^a	NCBI Homo sapiens Annotation Release 109 (March 2018)^b	hg38.refGene annotation track (last updated on May 2018)^c^,^d^,^e
Genome assembly used to derive annotation	GRCh38.p12, GCA_000001405.27, December 2017	GRCh38.p12, GCF_000001405.38, December 2017	GRCh38, GCF_000001405.15, December 2013
HBA1	Annotated with gene ID ENSG00000206172	Annotated with Entrez Gene ID 3039	Annotated with Entrez Gene ID 3039
HBA2	Annotated with gene ID ENSG00000188536	Annotated with Entrez Gene ID 3040	Annotated with Entrez Gene ID 3040
HBB	Annotated with gene ID ENSG00000244734	Annotated with Entrez Gene ID 3043	Annotated with Entrez Gene ID 3043
Sus scrofa
Annotation release	Pig release 92 (April 2018)^a	NCBI Sus scrofa Annotation Release 106 (May 2017)^b	susScr3.refGene annotation track (last updated on May 2018)^d^,^e
Genome assembly used to derive annotation	Sscrofa11.1, GCA_000003025.6, February 2017	Sscrofa11.1, GCF_000003025.6, February 2017	Sscrofa11.1, GCF_000003025.6, February 2017
LOC110259958 (HBA)	Absent from current annotation release, accessible via online search with ID 110259958.1	Annotated with Entrez Gene ID 110259958	Absent
LOC100737768 (HBA)	Absent from current annotation release, accessible via online search with ID 100737768.1	Annotated with Entrez Gene ID 100737768	Absent
HBB	Annotated with gene ID ENSSSCG00000014725	Annotated with Entrez Gene ID 407066	Annotated with Entrez Gene ID 407066
Equus caballus
Annotation release	Horse release 92 (April 2018)^a	NCBI Equus caballus Annotation Release 103 (January 2018)^b	equCab2.refGene annotation track (last updated on May 2018)^d^,^e
Genome assembly used to derive annotation	EquCab 2, GCA_000002305.1, September 2007	EquCab3, GCF_002863925.1, May 2018	EquCab 2, GCA_000002305.1, September 2007
HBA (also known as HBA1)	Absent	Annotated with Entrez Gene ID 100036557	Annotated with Entrez Gene ID 100036557
HBA2	Absent	Annotated with Entrez Gene ID 100036558	Annotated with Entrez Gene ID 100036558
HBB	Annotated with gene ID ENSECAG00000010020	Annotated with Entrez Gene ID 100054109	Annotated with Entrez Gene ID 100054109
Bos taurus
Annotation release	Cow release 92 (April 2018)^a	NCBI Bos taurus Annotation Release 106 (May 2018)^b	bosTau8.refGene annotation track (last updated on May 2018)^d^,^e
Genome assembly used to derive annotation	UMD3.1, GCA_000003055.3, November 2009	ARS-UCD1.2, GCF_002263795.1, April 2018	UMD3.1.1, GCA_000003055.4, June 2014
HBA1	Annotated as GLNC1 with ID ENSBTAG00000026417	Annotated with Entrez gene ID 100140149	Absent
HBA (also known as HBA2)	Annotated as GLNC1 with ID ENSBTAG00000026418	Annotated with Entrez Gene ID 512439	Annotated with Entrez Gene ID 512439
HBB	Annotated with gene ID ENSBTAG00000038748	Annotated with Entrez Gene ID 280813	Annotated with Entrez Gene ID 280813

Status of current human, porcine, equine, and bovine haemoglobin gene annotations in the Ensembl, NCBI RefSeq, and UCSC databases.

(Zerbino et al., 2018).

(O'Leary et al., 2016).

(Kent et al., 2002).

(Karolchik et al., 2004).

(Tyner et al., 2017).

Gene annotations and summarisation of TPM estimates at the gene level

Using R (3.5.0) within the RStudio IDE (version 1.1.447) (R Studio Team, 2015) and Bioconductor (version 3.7 using BiocInstaller 1.30.0) (Gentleman et al., 2004), the GenomicFeatures (version 1.32.0) (Lawrence et al., 2013) and AnnotationDbi (version 1.42.1) (Pagès et al., 2017) packages were used to obtain corresponding gene and transcript identifiers from the NCBI RefSeq annotation releases pertinent to each species, as detailed in Table 1. Using these identifiers, the tximport (version 1.8.0) package (Soneson et al., 2015) was used to import into R and summarise at gene level the TPM estimates obtained from the Salmon tool. A threshold of greater than or equal to 1 TPM across at least half of the total number of samples (≥12 for human and porcine, ≥18 for equine, and ≥5 for bovine) was applied in order to remove lowly expressed genes.

Data exploration, plotting, and summary statistics

Data wrangling and tidying from all species was performed using the following R packages: tidyverse (version 1.2.1) (Wickham, 2017b), dplyr (version 0.7.5) (Wickham et al., 2017), tidyr (version 0.8.1) (Wickham and Henry, 2017), reshape2 (version 1.4.3) (Wickham, 2017a), and magrittr (version 1.5) (Bache and Wickham, 2017). The ggplot2 (version 2.2.1) (Wickham and Chang, 2017), and ggjoy (version 0.4.1) (Wilke, 2017), packages were used for figure generation. Finally, the mean and standard deviation were calculated for the undepleted and globin-depleted groups in each species using the skimr (version 1.0.2) R package (McNamara et al., 2017).

Results and discussion

Status of human, porcine, equine, and bovine haemoglobin gene annotations

Annotation of the haemoglobin subunit alpha 1 and 2 genes (HBA1 and HBA2, respectively) is well-established for the human genome; however, annotations for these genes in the porcine, equine, and bovine genomes are inconsistent across databases. As shown in Table 1, the porcine HBA gene annotation is absent from Ensembl and the UCSC Table Browser. For the NCBI RefSeq database, this gene has been assigned to two loci (LOC110259958 and LOC100737768) that have similar descriptions (haemoglobin subunit alpha and haemoglobin subunit alpha-like). Therefore, these NCBI LOC symbols were used.

Equine HBA (HBA1) and HBA2 genes are absent from the current Ensembl annotation release. Similarly, bovine HBA1 and HBA (HBA2) have been annotated as GLNC1 in Ensembl, whereas HBA1 is absent from the UCSC Table Browser annotation (Table 1). In the NCBI RefSeq database, equine HBA (HBA1) is described as haemoglobin subunit alpha 1; and bovine HBA (HBA2) is described as haemoglobin subunit alpha 2, thus their descriptions are shown in parenthesis herein. In contrast to these observations, haemoglobin subunit beta (HBB) genes for the four species are well-annotated in Ensembl, NCBI RefSeq and UCSC Genome Browser databases (Table 1).

At the time of writing, NCBI RefSeq is the only database that contains annotations for all three haemoglobin genes in all species analysed. Additionally, equine and bovine gene annotations are based on the latest genome assemblies (Table 1). EquCab3 and ARS-UCD1.2 have incorporated major improvements compared to previous versions, including increased genome coverage (from 6.8 × and 9 × , to 80 × , respectively), and incorporation of PacBio sequencing reads (Kalbfleisch et al., 2018; Rosen et al., 2018).

Basic RNA-seq data outputs

Unfiltered SE (equine libraries) or PE (human, porcine, and bovine libraries) RNA-seq FASTQ files were quality-checked, adapter- and quality-filtered prior to transcript quantification. As shown in Table 2, the human and porcine undepleted groups each had ~40 million (M) raw reads per library, whereas globin-depleted libraries showed a mean of ~37 and 31 M, respectively. Equine and bovine libraries, which did not include a globin depletion step had an average of 24 M raw reads and 21 M raw read pairs, respectively.

Table 2

Species	Treatment	RNA-seq library type	Sequencing mode	Mean no. of reads (SE) or pairs (PE)	Mean no. of reads (SE) or pairs (PE) removed	Mean proportion of reads (SE) or pairs (PE) removed	Mean no. of observed fragments^*	Mean no. of mapped fragments^*	Average mapping rate	Reference source
Homo sapiens	Undepleted	Inward unstranded	PE	40,218,886	8,203,217	20.4%	32,015,669	25,593,239	80.9%	NCBI RefSeq
Homo sapiens	Globin depleted	Inward unstranded	PE	36,874,759	10,704,088	29.0%	26,170,671	20,371,624	75.8%	NCBI RefSeq
Sus scrofa	Undepleted	Inward unstranded	PE	39,036,515	4,613,991	11.8%	28,685,437	25,129,140	87.7%	NCBI RefSeq
Sus scrofa	Globin depleted	Inward unstranded	PE	31,339,886	3,899,427	12.4%	22,867,049	19,959,995	87.3%	NCBI RefSeq
Equus caballus	Undepleted	Unstranded	SE	24,271,141	38,892	0.2%	14,850,797	11,387,774	76.5%	NCBI RefSeq
Bos taurus	Undepleted	Inward stranded forward	PE	20,495,983	3,474,597	17.0%	17,021,386	12,353,147	72.6%	NCBI RefSeq

Summary of RNA-seq filtering/trimming and mapping statistics.

The Salmon tool categorises fragments as single read (for SE RNA-seq libraries) or a read pair (for PE RNA-seq libraries).

After adapter- and quality-filtering of RNA-seq libraries, an average of 20 and 29% read pairs were removed from the human undepleted and globin-depleted libraries, respectively. Conversely, ~12% of read pairs were removed from each of the porcine undepleted and globin-depleted libraries. For the undepleted equine and bovine RNA-seq libraries, an average of 0.2% reads and 17% read pairs were removed, respectively. Detailed information on filtering/trimming of RNA-seq libraries from all species, including technical replicates from libraries sequenced over multiple lanes, is presented in Supplementary Table 4. All data sets exhibited a mean mapping rate >70% (Table 2). Supplementary Tables 5 contain sample-specific RNA-seq mapping statistics.

Transcript quantification

Transcript-level TPM estimates generated using the Salmon tool were imported into the R environment and summarised at gene level with the package tximport (Soneson et al., 2015). Gene-level TPM estimates represent the sum of corresponding transcript-level TPMs and provide results that are more accurate and comprehensible than transcript-level estimates (Soneson et al., 2015). In the current study, gene-level TPM estimates are referred as TPM.

Filtering of lowly expressed genes (see section Gene Annotations and Summarisation of TPM Estimates at the Gene Level) resulted in 12,951 genes expressed across all human samples, and represented 24% of 54,644 total annotated genes and pseudogenes. Porcine samples showed a total of 9,396 expressed genes (31% of 30,334 annotated genes and pseudogenes); and equine and bovine samples exhibited 12,724 (38% of 33,146) and 14,044 (40% of 35,143) expressed genes, respectively.

The density distribution of TPM values for the human and porcine samples improved after globin depletion; this is evident by the shift of gene detection levels toward greater log₁₀ TPM values for the globin-depleted samples in Figure 2. In this regard, it is noteworthy that the undepleted bovine and equine samples also exhibited similar TPM density distributions to the human and porcine globin-depleted samples.

Figure 2

Proportions of human and porcine haemoglobin gene transcripts in undepleted and depleted peripheral blood

In line with previous reports (Field et al., 2007; Mastrokolias et al., 2012), the proportion of haemoglobin gene transcripts (HBA1, HBA2, and HBB) detected in undepleted human peripheral blood samples for the current study averaged 70% (Figure 3 and Supplementary Table 6), which is lower than the mean proportion of 81% reported by Shin et al. (2014). On the other hand, after depletion the human samples exhibited an identical reduction to a 17% proportion of globin sequence reads in both the present study and that of Shin et al. (2014) (Figure 3 and Supplementary Table 6).

Figure 3

In the current study, for the undepleted porcine peripheral blood samples, the percentage of haemoglobin gene transcripts (LOC110259958 [HBA], LOC100737768 [HBA], and HBB) observed as a proportion of the total expressed genes was 72% (Figure 3 and Supplementary Table 6), which is considerably larger than the mean of 46.1% reported in the original study (Choi et al., 2014). Similarly, after depletion, the porcine samples in the present study contained a mean proportion of 22% globin transcripts (Figure 3 and Supplementary Table 6) compared to a mean proportion of 8.9% reported by Choi et al. (2014). Additionally, Table 3 shows the mean TPM for each haemoglobin gene across undepleted or globin-depleted samples.

Table 3

Species	Gene symbol	Treatment	No. of samples	Mean TPM	Standard deviation
Homo sapiens	HBA1	Undepleted	12	191,209	16,601
Homo sapiens	HBA1	Globin depleted	12	66,718	23,557
Homo sapiens	HBA2	Undepleted	12	300,000	29,523
Homo sapiens	HBA2	Globin depleted	12	79,818	31,259
Homo sapiens	HBB	Undepleted	12	200,000	43,262
Homo sapiens	HBB	Globin depleted	12	20,770	6,706
Sus scrofa	LOC110259958 (HBA)	Undepleted	12	86	30
Sus scrofa	LOC110259958 (HBA)	Globin depleted	12	13	13
Sus scrofa	LOC100737768 (HBA)	Undepleted	12	243,864	31,605
Sus scrofa	LOC100737768 (HBA)	Globin depleted	12	84,021	86,095
Sus scrofa	HBB	Undepleted	12	476,284	52,939
Sus scrofa	HBB	Globin depleted	12	136,172	128,232
Equus caballus	HBA (HBA1)	Undepleted	37	443	560
Equus caballus	HBA2	Undepleted	37	653	789
Equus caballus	HBB	Undepleted	37	1,024	1,144
Bos taurus	HBA1	Undepleted	10	21	29
Bos taurus	HBA (HBA2)	Undepleted	10	1,101	1,102
Bos taurus	HBB	Undepleted	10	532	469

Summary statistics for haemoglobin gene-level transcripts per million (TPM).

A number of possible explanations, including the different approaches used for read mapping and transcript quantification, may account for the different proportions of haemoglobin gene transcript detected in human and porcine samples for the present study compared to the original studies (Choi et al., 2014; Shin et al., 2014). For the present study, a recently developed lightweight alignment method was adopted (Salmon and tximport), in contrast to the more traditional methodologies used in the original publications. Shin et al. (2014) used the TopHat and Cufflinks software tools (Trapnell et al., 2012), while Choi et al. (2014) implemented TopHat with Htseq-count (Anders et al., 2015). In addition to this, different gene annotations were used: NCBI Homo sapiens Annotation Release 109 and NCBI Sus scrofa Annotation Release 106 were used for the present study, while UCSC hg18 (H. sapiens) and Ensembl release 71 (S. scrofa) were used by Shin et al. (2014) and Choi et al. (2014), respectively.

Equine and bovine peripheral blood contains extremely low levels of haemoglobin gene transcripts

The equine and bovine peripheral blood samples, which did not undergo globin depletion, had extremely low proportions of haemoglobin gene transcripts to total expressed genes: 0.21 and 0.17%, respectively (Figure 3 and Supplementary Table 6). Notably, similar results have been reported in a transcriptomics study of bovine peripheral blood in response to vaccination against neonatal pancytopenia. In that study, 12 cows were profiled before and after vaccination (24 peripheral blood samples in total), and a mean proportion of 1.0% of RNA-seq reads were observed to map to the bovine α haemoglobin gene cluster on BTA25 or to the β haemoglobin gene cluster on BTA15 (Demasius et al., 2013). To the best of our knowledge, this is the first time that the average number of equine haemoglobin transcripts have been reported for RNA-seq data.

Finally, it is important to note that log₂ TPM values for haemoglobin gene transcripts in the undepleted equine and bovine peripheral blood RNA samples are substantially lower than log₂ TPM values for the globin-depleted human and porcine peripheral blood RNA samples (Figure 4). This is a direct consequence of extremely low levels of circulating reticulocytes in equine and bovine peripheral blood (Tablin and Weiss, 1985; Harper et al., 1994; Hossain et al., 2003; Cooper et al., 2005).

Figure 4

Conclusion

In light of our RNA-seq data analyses, we propose that globin mRNA transcript depletion is not a pre-requisite for transcriptome profiling of bovine and equine peripheral blood samples. This observation greatly simplifies the laboratory and bioinformatics workflows required for RNA-seq studies of whole blood collected from domestic cattle and horses. It will also be directly relevant to future work on blood-based biomarker and biosignature development in the context of infectious disease, reproduction, nutrition, and animal welfare. For example, transcriptomics of peripheral blood has been used extensively in development of new diagnostic and prognostic modalities for human tuberculosis (HTB) disease caused by infection with Mycobacterium tuberculosis (for reviews see: Blankley et al., 2014; Haas et al., 2016; Weiner and Kaufmann, 2017; Goletti et al., 2018). Therefore, as a consequence of this HTB research, comparable transcriptomics studies in cattle (Meade et al., 2007; Killick et al., 2011; Blanco et al., 2012; Churbanov and Milligan, 2012; McLoughlin et al., 2014; Cheng et al., 2015), and the ease with which RNA-seq can be performed in bovine peripheral blood, it should be feasible to develop transcriptomics-based biomarkers and biosignatures for bovine tuberculosis caused by M. bovis infection.

Data accessibility

The RNA-seq data generated for this study using peripheral blood from 10 age-matched healthy male Holstein-Friesian calves can be obtained from the ENA database (PRJEB27764).

Statements

Ethics statement

Animal experimental work for the present study (cattle samples) was carried out according to the UK Animal (Scientific Procedures) Act 1986. The study protocol was approved by the Animal Health and Veterinary Laboratories Agency (AHVLA–Weybridge, UK), now the Animal & Plant Health Agency (APHA), Animal Use Ethics Committee (UK Home Office PCD number 70/6905).

Author contributions

DEM, SG, CC, and KM conceived and designed the project and organised bovine sample collection. KM, NN, DAM, and JB performed RNA extraction and RNA-seq library generation. CC, KM, NN, KR-A, and DEM performed the analyses. CC and DEM wrote the manuscript. All authors reviewed and approved the final manuscript.

Funding

This work was supported by Investigator Grants from Science Foundation Ireland (Nos: SFI/08/IN.1/B2038 and SFI/15/IA/3154), a Research Stimulus Grant from the Department of Agriculture, Food and the Marine (No: RSF 06 405), a European Union Framework 7 Project Grant (No: KBBE-211602- MACROSYS), a Brazilian Science Without Borders—CAPES grant (No: BEX-13070-13-4) and the UCD Wellcome Trust funded Computational Infection Biology Ph.D. Programme (Grant no: 097429/Z/11/Z).

Acknowledgments

The authors wish to express their gratitude to Prof Martin Vordermeier and Dr Bernardo Villarreal-Ramos (Animal and Plant Health Agency, UK) for provision of bovine peripheral blood samples, Prof Graham Plastow (University of Alberta, Canada) for provision of porcine peripheral blood RNA-seq data. We also thank Drs. Gabriella Farries (University College Dublin) and Kerri Malone (EMBL-EBI, Cambridge, UK) for stimulating discussion and advice concerning genome annotations, equine genetics and data visualisation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2018.00278/full#supplementary-material

References

1
AndersS.PylP. T.HuberW. (2015). HTSeq–a Python framework to work with high-throughput sequencing data. Bioinformatics31, 166–169. 10.1093/bioinformatics/btu638
2
AndrewsS. (2016). FastQC: A Quality Control Tool for High Throughput Sequence Data. Available: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- Google Scholar
3
Animal Task Force (2016). A Strategic Research and Innovation Agenda for a Sustainable livestock sector in Europe. Available online at: http://www.animaltaskforce.eu
- Google Scholar
4
BacheS. M.WickhamH. (2017). magrittr: A Forward-Pipe Operator for R. Available online at: https://github.com/tidyverse/magrittr
- Google Scholar
5
BlancoF. C.SoriaM.BiancoM. V.BigiF. (2012). Transcriptional response of peripheral blood mononuclear cells from cattle infected with Mycobacterium bovis. PLoS ONE7:e41066. 10.1371/journal.pone.0041066
6
BlankleyS.BerryM. P.GrahamC. M.BloomC. I.LipmanM.O'GarraA. (2014). The application of transcriptional blood signatures to enhance our understanding of the host response to infection: the example of tuberculosis. Philos. Trans. R. Soc. Lond. B. Biol. Sci.369:20130427. 10.1098/rstb.2013.0427
7
BowyerJ. F.TranterK. M.HanigJ. P.CrabtreeN. M.SchleimerR. P.GeorgeN. I. (2015). Evaluating the stability of RNA-seq transcriptome profiles and drug-induced immune-related expression changes in whole blood. PLoS ONE10:e0133315. 10.1371/journal.pone.0133315
8
BruderC. E.YaoS.LarsonF.CampJ. V.TappR.McBrayerA.et al. (2010). Transcriptome sequencing and development of an expression microarray platform for the domestic ferret. BMC Genomics11:251. 10.1186/1471-2164-11-251
9
ChaussabelD. (2015). Assessment of immune status using blood transcriptomics and potential implications for global health. Semin. Immunol.27, 58–66. 10.1016/j.smim.2015.03.002
10
ChenC.KhaleelS. S.HuangH.WuC. H. (2014). Software for pre-processing Illumina next-generation sequencing short read sequences. Source Code Biol. Med.9:8. 10.1186/1751-0473-9-8
11
ChengY.ChouC.-H.TsaiH.-J. (2015). In vitro gene expression profile of bovine peripheral blood mononuclear cells in early Mycobacterium bovis infection. Exp. Ther. Med.10, 2102–2118. 10.3892/etm.2015.2814
12
ChoiI.BaoH.KommadathA.HosseiniA.SunX.MengY.et al. (2014). Increasing gene discovery and coverage using RNA-seq of globin RNA reduced porcine blood samples. BMC Genomics15:954. 10.1186/1471-2164-15-954
13
ChurbanovA.MilliganB. (2012). Accurate diagnostics for bovine tuberculosis based on high-throughput sequencing. PLoS ONE7:e50147. 10.1371/journal.pone.0050147
14
CooperC.SearsW.BienzleD. (2005). Reticulocyte changes after experimental anemia and erythropoietin treatment of horses. J. Appl. Physiol.99, 915–921. 10.1152/japplphysiol.00438.2005
15
de GreeffA.BikkerP.Smit-HeinsbroekA.BruininxE.ZwolschenH.FijtenH.et al. (2016). Increased fat and polyunsaturated fatty acid content in sow gestation diet has no effect on gene expression in progeny during the first 7 days of life. J. Anim. Physiol. Anim. Nutr.100, 127–135. 10.1111/jpn.12345
16
DebeyS.SchoenbeckU.HellmichM.GathofB. S.PillaiR.ZanderT.et al. (2004). Comparison of different isolation techniques prior gene expression profiling of blood derived cells: impact on physiological responses, on overall expression and the role of different cell types. Pharmacogenomics J.4, 193–207. 10.1038/sj.tpj.6500240
17
DemasiusW.WeikardR.HadlichF.MullerK. E.KuhnC. (2013). Monitoring the immune response to vaccination with an inactivated vaccine associated to bovine neonatal pancytopenia by deep sequencing transcriptome analysis in cattle. Vet. Res.44:93. 10.1186/1297-9716-44-93
18
ElgendyR.GiantinM.CastellaniF.GrottaL.PalazzoF.DacastoM.et al. (2016). Transcriptomic signature of high dietary organic selenium supplementation in sheep: a nutrigenomic insight using a custom microarray platform and gene set enrichment analysis. J. Anim. Sci.94, 3169–3184. 10.2527/jas.2016-0363
19
FanH.HegdeP. S. (2005). The transcriptome in blood: challenges and solutions for robust expression profiling. Curr. Mol. Med.5, 3–10. 10.2174/1566524053152861
20
FieldL. A.JordanR. M.HadixJ. A.DunnM. A.ShriverC. D.EllsworthR. E.et al. (2007). Functional identity of genes detectable in expression profiling assays following globin mRNA reduction of peripheral blood samples. Clin. Biochem.40, 499–502. 10.1016/j.clinbiochem.2007.01.004
21
Free Software Foundation (2013) Bash (4.3.48) Unix Shell Program. Available online at: http://ftp.gnu.org/gnu/bash/
- Google Scholar
22
GentlemanR. C.CareyV. J.BatesD. M.BolstadB.DettlingM.DudoitS.et al. (2004). Bioconductor: open software development for computational biology and bioinformatics. Genome Biol.5:R80. 10.1186/gb-2004-5-10-r80
23
GolettiD.LeeM. R.WangJ. Y.WalterN.OttenhoffT. H. M. (2018). Update on tuberculosis biomarkers: from correlates of risk, to correlates of active disease and of cure from disease. Respirology23, 455–466. 10.1111/resp.13272
24
GreerJ. P.ArberD. A.GladerB.ListA. F.MeansR. T.ParaskevasF.et al. (2013). Wintrobe's Clinical Hematology. Philadelphia, PA: Lippincott, Williams and Wilkins
- Google Scholar
25
HaasC. T.RoeJ. K.PollaraG.MehtaM.NoursadeghiM. (2016). Diagnostic 'omics' for active tuberculosis. BMC Med.14:37. 10.1186/s12916-016-0583-9
26
HarperS. B.HurstW. J.Ohlsson-WilhelmB.LangC. M. (1994). The response of various hematologic parameters in the young bovine subjected to multiple phlebotomies. ASAIO J.40, M816–M825.
- Pubmed Abstract
- Google Scholar
27
HolcombZ. E.TsalikE. L.WoodsC. W.McClainM. T. (2017). Host-based peripheral blood gene expression analysis for diagnosis of infectious diseases. J. Clin. Microbiol.55, 360–368. 10.1128/jcm.01057-16
28
HossainM. A.YamatoO.YamasakiM.OtsukaY.MaedeY. (2003). Relation between reticulocyte count and characteristics of erythrocyte 5'-nucleotidase in dogs, cats, cattle and humans. J. Vet. Med. Sci.65, 193–197. 10.1292/jvms.65.193
29
HuangZ.GallotA.LaoN. T.PuechmailleS. J.FoleyN. M.JebbD.et al. (2016). A nonlethal sampling method to obtain, generate and assemble whole blood transcriptomes from small, wild mammals. Mol. Ecol. Resour.16, 150–162. 10.1111/1755-0998.12447
30
JégouM.GondretF.VincentA.TrefeuC.GilbertH.LouveauI. (2016). Whole blood transcriptomics is relevant to identify molecular changes in response to genetic selection for feed efficiency and nutritional status in the pig. PLoS ONE11:e0146550. 10.1371/journal.pone.0146550
31
KalbfleischT. S.RiceE.DePriestM. S.WalenzB. P.HestandM. S.VermeeschJ. R.et al. (2018). EquCab3, an updated reference genome for the domestic horse. bioRxiv[Preprint]. 306928. 10.1101/306928
- CrossRef
- Google Scholar
32
KarolchikD.HinrichsA. S.FureyT. S.RoskinK. M.SugnetC. W.HausslerD.et al. (2004). The UCSC Table Browser data retrieval tool. Nucleic Acids Res.32, D493–D496. 10.1093/nar/gkh103
33
KentW. J.SugnetC. W.FureyT. S.RoskinK. M.PringleT. H.ZahlerA. M.et al. (2002). The human genome browser at UCSC. Genome Res.12, 996–1006. 10.1101/gr.229102
34
KillickK. E.BrowneJ. A.ParkS. D.MageeD. A.MartinI.MeadeK. G.et al. (2011). Genome-wide transcriptional profiling of peripheral blood leukocytes from cattle infected with Mycobacterium bovis reveals suppression of host immune genes. BMC Genomics12:611. 10.1186/1471-2164-12-611
35
KoE. R.YangW. E.McClainM. T.WoodsC. W.GinsburgG. S.TsalikE. L. (2015). What was old is new again: using the host response to diagnose infectious disease. Expert Rev. Mol. Diagn.15, 1143–1158. 10.1586/14737159.2015.1059278
36
KolliV.UpadhyayR. C.SinghD. (2014). Peripheral blood leukocytes transcriptomic signature highlights the altered metabolic pathways by heat stress in zebu cattle. Res. Vet. Sci.96, 102–110. 10.1016/j.rvsc.2013.11.019
37
KrjutškovK.KoelM.RoostA. M.KatayamaS.EinarsdottirE.JouhilahtiE. M.et al. (2016). Globin mRNA reduction for whole-blood transcriptome sequencing. Sci. Rep.6:31584. 10.1038/srep31584
38
LawrenceM.HuberW.PagesH.AboyounP.CarlsonM.GentlemanR.et al. (2013). Software for computing and annotating genomic ranges. PLoS Comput. Biol.9:e1003118. 10.1371/journal.pcbi.1003118
39
LiuJ.WalterE.StengerD.ThachD. (2006). Effects of globin mRNA reduction methods on gene expression profiles from whole blood. J. Mol. Diagn.8, 551–558. 10.2353/jmoldx.2006.060021
40
MastrokoliasA.den DunnenJ. T.van OmmenG. B.t HoenP. A.van Roon-MomW. M. (2012). Increased sensitivity of next generation sequencing-based expression profiling after globin reduction in human blood RNA. BMC Genomics13:28. 10.1186/1471-2164-13-28
41
McLoughlinK. E.NalpasN. C.Rue-AlbrechtK.BrowneJ. A.MageeD. A.KillickK. E.et al. (2014). RNA-seq transcriptional profiling of peripheral blood leukocytes from cattle infected with Mycobacterium bovis. Front. Immunol.5:396. 10.3389/fimmu.2014.00396
42
McNamaraA.Arino de la RubiaE.ZhuH.LowndesJ.EllisS.WaringE.et al. (2017). skimr: Compact and Flexible Summaries of Data. Available online at: https://github.com/ropenscilabs/skimr
- Google Scholar
43
MeadeK. G.GormleyE.DoyleM. B.FitzsimonsT.O'FarrellyC.CostelloE.et al. (2007). Innate gene repression associated with Mycobacterium bovis infection in cattle: toward a gene signature of disease. BMC Genomics8:400. 10.1186/1471-2164-8-400
44
MejiasA.RamiloO. (2014). Transcriptional profiling in infectious diseases: ready for prime time?J. Infect. 68 (Suppl. 1), S94–99. 10.1016/j.jinf.2013.09.018
45
MoreyJ. S.NeelyM. G.LunardiD.AndersonP. E.SchwackeL. H.CampbellM.et al. (2016). RNA-seq analysis of seasonal and individual variation in blood transcriptomes of healthy managed bottlenose dolphins. BMC Genomics17:720. 10.1186/s12864-016-3020-8
46
NabarroD.WannousC. (2014). The potential contribution of Iivestock to food and nutrition security: the application of the One Health approach in livestock policy and practice. Rev. Off. Int. Epizoot.33, 475–485. 10.20506/rst.33.2.2292
47
O'LearyN. A.WrightM. W.BristerJ. R.CiufoS.HaddadD.McVeighR.et al. (2016). Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res.44, D733–D745. 10.1093/nar/gkv1189
48
O'LoughlinA.LynnD. J.McGeeM.DoyleS.McCabeM.EarleyB. (2012). Transcriptomic analysis of the stress response to weaning at housing in bovine leukocytes using RNA-seq technology. BMC Genomics13:250. 10.1186/1471-2164-13-250
49
PagèsH.CarlsonM.FalconS.LiN. (2017). AnnotationDbi: Annotation Database Interface. R package version 1.38.0. Available online at: https://doi.org/10.18129/B9.bioc.AnnotationDbi
- Google Scholar
50
PatroR.DuggalG.LoveM. I.IrizarryR. A.KingsfordC. (2017). Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods14, 417–419. 10.1038/nmeth.4197
51
Python Software Foundation (2017). Python (version 3.5.2). Available online at: https://www.python.org/downloads/release/python,-352/
- Google Scholar
52
R Core Team (2017). R: A Language and Environment for Statistical Computing. Available online at: https://www.R-project.org
- Google Scholar
53
R Studio Team (2015). RStudio: Integrated Development for R. Available online at: http://www.rstudio.com
- Google Scholar
54
RamiloO.MejiasA. (2009). Shifting the paradigm: host gene signatures for diagnosis of infectious diseases. Cell Host Microbe6, 199–200. 10.1016/j.chom.2009.08.007
55
Ropka-MolikK.Stefaniuk-SzmukierM.ZukowskiK.PiorkowskaK.GurgulA.Bugno-PoniewierskaM. (2017). Transcriptome profiling of Arabian horse blood during training regimens. BMC Genet.18:31. 10.1186/s12863-017-0499-1
56
RosenB. D.BickhartD. M.SchnabelR. D.KorenS.ElsikC. G.ZiminA.et al. (2018). Modernizing the bovine reference genome assembly in Proceedings of the World Congress on Genetics Applied to Livestock Production (Auckland), 802.
- Google Scholar
57
SchwochowD.SerieysL. E.WayneR. K.ThalmannO. (2012). Efficient recovery of whole blood RNA–a comparison of commercial RNA extraction protocols for high-throughput applications in wildlife species. BMC Biotechnol.12:33. 10.1186/1472-6750-12-33
58
ShenJ.ZhouC.ZhuS.ShiW.HuM.FuX.et al. (2014). Comparative transcriptome analysis reveals early pregnancy-specific genes expressed in peripheral blood of pregnant sows. PLoS ONE9:e114036. 10.1371/journal.pone.0114036
59
ShinH.ShannonC. P.FishbaneN.RuanJ.ZhouM.BalshawR.et al. (2014). Variation in RNA-seq transcriptome profiles of peripheral whole blood from healthy individuals with and without globin depletion. PLoS ONE9:e91041. 10.1371/journal.pone.0091041
60
SonesonC.LoveM. I.RobinsonM. D. (2015). Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res4:1521. 10.12688/f1000research.7563.2
61
SongK. D.DowdS. E.LeeH. K.KimS. W. (2013). Long-term dietary supplementation of organic selenium modulates gene expression profiles in leukocytes of adult pigs. Anim. Sci. J.84, 238–246. 10.1111/j.1740-0929.2012.01060.x
62
TablinF.WeissL. (1985). Equine bone marrow: a quantitative analysis of erythroid maturation. Anat. Rec.213, 202–206. 10.1002/ar.1092130212
63
TakahashiJ.WakiS.MatsumotoR.OdakeJ.MiyajiT.TottoriJ.et al. (2012). Oligonucleotide microarray analysis of dietary-induced hyperlipidemia gene expression profiles in miniature pigs. PLoS ONE7:e37581. 10.1371/journal.pone.0037581
64
ThorntonP. K. (2010). Livestock production: recent trends, future prospects. Philos. Trans. R. Soc. Lond. B. Biol. Sci.365, 2853–2867. 10.1098/rstb.2010.0134
65
TrapnellC.RobertsA.GoffL.PerteaG.KimD.KelleyD. R.et al. (2012). Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc.7, 562–578. 10.1038/nprot.2012.016
66
TynerC.BarberG. P.CasperJ.ClawsonH.DiekhansM.EisenhartC.et al. (2017). The UCSC Genome Browser database: 2017 update. Nucleic Acids Res.45, D626–D634. 10.1093/nar/gkw1134
67
WeinerJ.KaufmannS. H. (2017). High-throughput and computational approaches for diagnostic and prognostic host tuberculosis biomarkers. Int. J. Infect. Dis.56, 258–262. 10.1016/j.ijid.2016.10.017
68
WickhamH. (2017a). reshape2: Flexibly Reshape Data: A Reboot of the Reshape Package. Available online at: https://github.com/hadley/reshape
- Google Scholar
69
WickhamH. (2017b). tidyverse. Available online at: http://tidyverse.tidyverse.org
- Google Scholar
70
WickhamH.ChangW. (2017). ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics. Available online at: http://ggplot2.tidyverse.org
- Google Scholar
71
WickhamH.FrancoisR.HenryL.MüllerK. (2017). dplyr: A Grammar of Data Manipulation. Available online at: http://dplyr.tidyverse.org
- Google Scholar
72
WickhamH.HenryL. (2017). tidyr: Easily Tidy Data With 'spread()' and 'gather()' Functions. Available online at: http://tidyr.tidyverse.org
- Google Scholar
73
WilkeC. O. (2017). ggjoy: Joyplots in ‘ggplot2'. Available online at: https://github.com/clauswilke/ggjoy
- Google Scholar
74
WinnM. E.ZapalaM. A.HovattaI.RisbroughV. B.LillieE.SchorkN. J. (2010). The effects of globin on microarray-based gene expression analysis of mouse blood. Mamm. Genome21, 268–275. 10.1007/s00335-010-9261-y
75
WuK.MiyadaG.MartinJ.FinkelsteinD. (2003). Technical Note: Globin Reduction Protocol: A Method for Processing Whole Blood RNA Samples for Improved Array Results. Available online at: https://tools.thermofisher.com/content/sfs/brochures/blood2_technote.pdf
- Google Scholar
76
ZerbinoD. R.AchuthanP.AkanniW.AmodeM. R.BarrellD.BhaiJ.et al. (2018). Ensembl 2018. Nucleic Acids Res.46, D754–D761. 10.1093/nar/gkx1098

Summary

Keywords

blood, cattle, globin, horses, pigs, reticulocyte, RNA-seq, transcriptome

Citation

Correia CN, McLoughlin KE, Nalpas NC, Magee DA, Browne JA, Rue-Albrecht K, Gordon SV and MacHugh DE (2018) RNA Sequencing (RNA-Seq) Reveals Extremely Low Levels of Reticulocyte-Derived Globin Gene Transcripts in Peripheral Blood From Horses (Equus caballus) and Cattle (Bos taurus). Front. Genet. 9:278. doi: 10.3389/fgene.2018.00278

Received

03 April 2018

Accepted

09 July 2018

Published

14 August 2018

Volume

9 - 2018

Edited by

John Anthony Hammond, Pirbright Institute (BBSRC), United Kingdom

Reviewed by

Chuanju Dong, Henan Normal University, China; Luiz Lehmann Coutinho, Universidade de São Paulo, Brazil

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: David E. MacHugh david.machugh@ucd.ie

†Present Address: Nicolas C. Nalpas, Quantitative Proteomics and Proteome Centre Tübingen, Interfaculty Institute for Cell Biology, University of Tübingen, Tübingen, Germany

This article was submitted to Livestock Genomics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Livestock Genomics

ORIGINAL RESEARCH article

RNA Sequencing (RNA-Seq) Reveals Extremely Low Levels of Reticulocyte-Derived Globin Gene Transcripts in Peripheral Blood From Horses (Equus caballus) and Cattle (Bos taurus)

Abstract

Introduction