ORIGINAL RESEARCH article

Front. Vet. Sci., 30 June 2025

Sec. Veterinary Clinical, Anatomical, and Comparative Pathology

Volume 12 - 2025 | https://doi.org/10.3389/fvets.2025.1601306

This article is part of the Research TopicInnovations in Veterinary Diagnostics for Enhancing Animal Health and WelfareView all 5 articles

Strong correlation of gene counts and differentially expressed genes between a 3′ RNA-Seq and an RNA hybridization platform in transcriptome analyses from canine archival tissues

  • 1Department of Veterinary Medicine, Institute of Veterinary Pathology, Freie Universität Berlin, Berlin, Germany
  • 2Department of Mathematics and Computer Science, Bioinformatics Solution Center, Freie Universität Berlin, Berlin, Germany
  • 3Human Biology and Primate Evolution, Institute of Biology, Freie Universität Berlin, Berlin, Germany

Analyses of nucleic acids from archival tissues offer invaluable prospects for numerous fields of veterinary medicine, such as the study of differential gene expression in rare or historic diseases. The establishment of modern methodologies, however, raises questions regarding the comparability and reproducibility of data obtained from unlike tools. 3′ RNA-Seq and direct RNA hybridization are such conceptually different approaches for high-throughput transcriptome analysis. Since both are applicable to short, partially degraded mRNA fragments, they in principle allow investigations of formalin-fixed, paraffin-embedded (FFPE) tissues that are abundantly available in pathology archives. Here, we compared the two methods in several relevant details using the RNA from the same set of 35 FFPE canine tumors as input, including sample- and gene-wise count levels, gene expression strengths and directions, as well as the overlaps of differentially expressed genes (DEGs). Both methods proved suitable for their use on archival tissues with moderately to very strong overall count correlations, as indicated by a range of Pearson and Spearman means between 0.66 and 0.87. Of note, the gene-wise count correlations depended on gene expression strength. In an entity-contrasting comparison, expression directions correlated very strongly ranging from 0.88 to 0.91, but DEGs overlapped only moderately with a Jaccard index of 0.53. Finally, we contrasted the different practically relevant aspects of the two technologies with their distinct advantages that depend on the objectives and design of the study. This comparison will guide and help to select the appropriate method and to validate and interpret the data obtained.

1 Introduction

Research and diagnostic institutions usually store formalin-fixed, paraffin-embedded (FFPE) tissues in archives, accumulating large numbers and variations of specific diseases, samples of rare entities, or otherwise valuable properties. Such archives represent a priceless resource for retrospective biomedical studies allowing basic and applied research in many fields, including comparative oncology. Recent technologies with different conceptual approaches (13) have made it possible to quantify specific RNA sequences from such samples that allow a deeper insight into the dynamics of disease-specific gene expression levels and gene regulation.

However, FFPE specimens pose particular challenges in terms of RNA quality and detectability, such as contamination with RNases and other inhibitory proteins (3, 4) or diverse chemical alterations (5). The latter include RNA degradation prior to fixation, chemical modification via cross-linking with peptides by formaldehyde, and fragmentation by high temperatures during paraffin embedding and prolonged storage (68).

Recently developed 3′ RNA-Seq methods, such as QuantSeq 3′, generate libraries from one sequence per transcript by capturing and sequencing short fragments at the 3′ end of polyadenylated RNA (9, 10), thereby requiring significantly fewer reads compared to conventional RNA-Seq. Moreover, poly(A) enrichment, rRNA depletion, and RNA fragmentation before reverse transcription are no longer needed which simplifies and accelerates processing. Additionally, transcript length bias usually seen in traditional next generation sequencing (NGS) techniques (11) where long transcripts are artificially overrepresented (12) is circumvented, facilitating bioinformatic processing. Moreover, by capturing only short sequences of 60–80 nucleotides (nt) from the 3′ end of mRNA, QuantSeq 3′ can be used for analyzing partially degraded RNA from FFPE samples.

Alternatively, RNA hybridization panels can be employed for the analyses of specific groups of genes involved in specific disease processes. nCounter® (13) represents such a technology, employing a panel of color-coded molecular barcodes to target a pre-selected set of RNA molecules for digital quantification without prior cDNA synthesis or amplification, as opposed to RNA-Seq methods. Each unique transcript target is hybridized to a capture and reporter probe pair to generate a target-probe-complex, yielding a single count per transcript (14). This assay currently allows the detection of 800 pre-selected plus 6–55 optional, user-defined transcripts. The preselection of targeted gene transcripts is either based on commercial gene expression panels, such as select aspects in oncology or immunology, or customizable for individual research endeavors. Importantly, even partially degraded RNA may serve as adequate input.

Despite overlapping applications, several decisive differences distinguish the two methods both in study goal and practical perspectives, which are summarized in Table 1. Depending on the goal and design of a study, the two methods offer different strengths and limitations. The QuantSeq 3′ system is particularly suitable for investigating entire pathways over virtually all cellular functions. Thus, the aim is to obtain a holistic overview of the differences between contrasting groups, or in studies that focus on, for example, biomarker identification. On the other hand, the nCounter® Canine IO Panel is suitable for studies that focus directly on immuno-oncological questions in dogs, including inflammatory subtypes of cancer and clinical research involving, for example, the testing of treatment effects (1517). The major advantage of this assay is that even lowly expressed genes can be detected with a high sensitivity as a result of the targeted analysis, whereas in a whole transcriptome approach these genes are potentially not detected due to the much higher abundance of other genes. On the other hand, prior knowledge about the genes of interest or splice variants of the transcripts is necessary to generate specific nCounter® probes, thus limiting the discovery of new or unexpected transcripts. Furthermore, it is not always possible to differentiate between different known splice variants (18, 19). However, the same applies to the QuantSeq 3′ method, since a small number of nucleotides is used to infer the presence of transcripts.

Table 1
www.frontiersin.org

Table 1. Overview of conceptual and practical differences between the QuantSeq 3′ and nCounter® Canine IO Panel technologies.

Several independent studies have shown that both QuantSeq 3′ (9, 20) and nCounter® (2123) are suitable for gene expression profiling performed on partially degraded RNA isolated from FFPE tissues. In a comparison of nine different methods, both nCounter® and QuantSeq 3′ performed well on up to 15 year-old FFPE tissues (24). In addition, molecular subtypes of specific tumors have been accurately identified with both approaches from FFPE tissue samples (10, 2528).

So far, however, the data output of these two technologies has not been compared to one another in terms of mutual validation and reproducibility when using RNA from canine archival tissues as input. A comparison of the two platforms’ performances on canine FFPE tumor tissue is particularly interesting, as the dog is considered a valuable model for naturally occurring cancers in humans and other comparative oncological studies (2931).

Therefore, this study was designed to determine the correlation of gene expression data generated by QuantSeq 3′ and the nCounter® Canine IO Panel from three types of canine tumors. In order to exploit the full potential of the immuno-oncological emphasis of this nCounter® panel, two different oncological perspectives were chosen that included a strong immunological component. First, a stage-dependent comparison of early and late stages of a tumor with stereotypical spontaneous regression was selected to compare the performance on similar tissues. For this intra-tumor comparison, canine cutaneous histiocytomas (CCH) were employed. CCH is a common benign skin tumor generally of young dogs with progenitor cells to epidermal dendritic (Langerhans) cells as the cell of origin (32). CCH’s unique regression has been speculated to be due to an immune-mediated anti-tumor host response that warrants further investigation (3337). The two stages compared here, namely group 1 and group 3 based on Cockerell & Slauson (38), represent early and late time points in the course of CCH regression: After a short period of complete absence of lymphocytes (group 1), superficial ulceration and basolateral lymphofollicular infiltration ensue (group 2), followed by progressive infiltration of the tumor by lymphocytes (group 3), and final tumor regression with a predominance of lymphocytes and coagulation necrosis of the histiocytic tumor cells (group 4) (38).

Second, an entity-contrasting comparison between the two most common glandular, perianal tumors in the dog – the hepatoid gland adenoma (HGA) and apocrine gland anal sac adenocarcinoma (AGASAC) – was elected. Despite their virtually identical site of origin, they represent both extremes on the dignity spectrum. While HGAs hardly ever develop invasive or metastatic behavior (39, 40), malignant AGASACs commonly metastasize early to regional lymph nodes (4144). Thus, the biological differences and the different cells of origin served as another suitable setting for a technical and bioinformatic comparison of the two methodologies.

2 Materials and methods

2.1 Selection of FFPE tissue samples

A total of 35 FFPE tissue samples from 2012 to 2021 were obtained from the archive of the Institute of Veterinary Pathology. Specimens had originally been surgically excised from privately owned pet dogs and submitted for individual diagnostic and therapeutic purposes. Dog owners had given their consent and the work was ethically approved by the State Office for Health and Social Affairs, Berlin (StN 010/23). The tumors (AGASAC: n = 15, HGA: n = 10, CCH: n = 5 early stage, and n = 5 late stage) were selected based on histopathological diagnosis on hematoxylin and eosin stained slides by a board-certified veterinary pathologist. Tumor-adjacent tissue was removed to obtain largely homogeneous tumor cell masses. Further information on the individual tissue samples is provided in Supplementary Table S1.

2.2 RNA extraction from FFPE samples and RNA quality control

Five 10 μm FFPE scrolls were prepared from entire cross sections, collected in sterile centrifuge tubes and stored at −80°C. Total RNA was extracted using the PureLink™ FFPE Total RNA Isolation Kit (Thermo Fisher) according to the manufacturer’s guidelines. Total RNA concentrations were measured with the NanoDrop™ 2000c spectrophotometer (Thermo Fisher) and quality was determined using the Agilent 5200 Fragment Analyzer (Agilent Technologies, Inc., Santa Clara, United States) employing the DNF-471F33 - SS Total RNA 15 nt - FFPE Illumina DV200 method mode with a range of smear analysis from 200 to 20,000 nt. Only samples with a total RNA quality number (RQN) (45) of > 4 and a DV200 (percentage of RNA fragments over 200 nt in length) (46) of > 64.5% were chosen. For RNA quality classification, the Illumina® recommendations were applied, which denote a DV200 of > 70% as high and a DV200 of 50–70% as medium quality. For data on RNA quality, see Supplementary Table S2. Total RNA was treated with DNase I. Both the nCounter® and QuantSeq 3′ analyses were employed on the total RNA from the same isolation batches.

2.3 Direct mRNA hybridization

The RNA (150–250 ng) was hybridized to the nCounter® Canine IO Panel XT CodeSets (NanoString Technologies, Inc., Seattle, WA, United States), including probes representing 780 pre-selected genes and 20 “housekeeper” genes. A 30 probe Panel Plus (Supplementary Table S3) was added to the hybridization of the HGA and AGASAC samples following the manufacturer’s hybridization protocols (manual IDs: MAN-10023-11, MAN-10056-06). This Panel Plus included genes of further interest for this entity comparison. Hybridized samples were loaded onto the nCounter® MAX Analysis System’s Prep Station (NanoString) for purification and immobilization on sample cartridges, transferred to the Digital Analyzer for data collection and analyzed following the manufacturer’s user manual (manual ID: MAN-C0035-08).

Following the workflow described in the manufacturer’s recommendations (manual IDs: MAN-C0019-08, MAN-C0011-04), the reporter library files (RLF) and reporter code count (RCC) files were imported into the nSolver™ 4.0 Analysis Software (NanoString). Quality control and normalization followed default settings. Differential gene expression (DGE) analysis was implemented with the R 3.3.2-based Advanced Analysis 2.0 plug-in (version 2.0.134) with the recommended statistical settings. The raw, normalized, and DGE data were exported and read into R / Python Jupyter notebooks for further bioinformatic analysis and correlation calculations.

2.4 3′ RNA sequencing

The RNA was sequenced with QuantSeq 3′ (Lexogen GmbH, Vienna, Austria) at Lexogen Services. DNase I treated total RNA (500–1,000 ng from HGA and AGASAC; 375 ng from the early and late CCH stages) was processed with the QuantSeq 3’ mRNA-Seq FWD Library Preparation Kit (Lexogen) according to the manufacturer’s guidelines (user guide: 015UG009V0251) using the low-quality RNA protocol. Quality of the libraries was determined with the Agilent 5300 Fragment Analyzer (DNF-474-33 - HS NGS Fragment 1-6000 bp method mode). The samples were pooled in equimolar ratios. The library pool was quantified using a Qubit™ dsDNA HS assay kit (Thermo Fisher) and sequenced utilizing an Illumina® NextSeq™ 500 system with a SR75 High Output Kit. In total, 0.011 sequencing lanes were used per sample, resulting in an output of 4,025,165 to 7,548,231 trimmed reads for the HGA and AGASAC samples with an average input read length of 63.41 to 71.6 nt and 5,426,266 to 7,783,141 trimmed reads for the CCH samples with an average input length of 66.21 to 71.65 nt.

The FASTQ sequencing files were first pre-processed (adapter trimmed, filtered) using Cutadapt (47) and subsequently aligned to the NCBI Reference Sequence (RefSeq) assembly for the dog (Canis lupus familiaris) CanFam3.1 (48) (GCF_000002285.3) with the STAR aligner (49). Read counts per gene were generated with two rounds of featureCounts (50) to mitigate the effect of too short 3′ UTR annotations. Reads that remained unassigned in the first round were subjected to the second round of featureCounts on an adjusted annotation with a 3′ extension by 2 kilobases. Multi-mapped reads were retained and all counts distributed evenly across all mapping locations. For comparison with nCounter® counts, the raw gene counts were normalized with the edgeR package (5153). DGE analyses were performed with R using DESeq2 (54). The workflow is fully accessible in figshare (see data availability, DOI: 10.6084/m9.figshare.25768587). The significance thresholds were set at a log2 (fold change) (log2FC) of ≤ −1 or ≥ 1 and an adjusted p-value (padj) of ≤ 0.05.

As the nCounter® Canine IO Panel probes were designed using CamFam3.1, the same reference genome for QuantSeq 3′ read alignment was used for proper gene annotation comparison, despite the dog reference genome having been updated since. This same annotation offers a more robust background for comparison, as genes have been either dropped or added in the more recent genome.

2.5 Biological comparisons for differential gene expression

Two biological comparisons were chosen to test whether data sets stemming from an intra- or an inter-tumor contrast might be more prone to errors, possibly reflected in a weaker correlation of results from different measurements. For the inter-tumor, entity-contrasting comparison, the HGA (n = 10) was compared to the AGASAC (n = 15 for gene correlation, n = 11 for DGE) as baseline (from now on “HGA versus AGASAC”) and for the intra-tumor, stage-dependent comparison, the early CCH stage (n = 5) was compared to the late CCH stage (n = 5) as baseline (from now on “CCH early versus late”).

2.6 Correlation calculation

For correlation calculations of gene abundance, nCounter® read counts were normalized with nSolver™, while QuantSeq 3′ read counts were normalized using three different methods based on the full set of known genes (Figure 1). nCounter® reads were normalized in nSolver™ in a two-step process with positive control and “housekeeping” gene normalization. At the level of QuantSeq 3′ data normalization, three different commonly applied normalization methods were compared. These were TMM (Trimmed Means of M-values); the method used by edgeR, CPM (counts per million), and RLE (relative log expression); the method used by DESeq2. All three methods were implemented within the edgeR package.

Figure 1

Figure 1. Study workflow. The schematic representations of QuantSeq 3′ and the nCounter® Canine IO Panel depict the respective basic principles of the technologies. For correlation calculations of gene abundance, nCounter® read counts were normalized with nSolver, while QuantSeq 3′ read counts were normalized using three different methods based on the full set of known genes, not only the panel subset considered by nCounter®. At the level of QuantSeq 3′ data normalization, three different commonly applied normalization methods were compared. These were TMM (Trimmed Means of M-values), CPM (counts per million), and RLE (relative log expression). FFPE = formalin-fixed, paraffin-embedded, DV200 = percentage of RNA fragments over 200 nucleotides (nt) in length, DEG = differentially expressed genes, log2FC = log2 (fold change).

Different correlation coefficients were calculated for each the correlation of counts and the differentially expressed genes (DEGs) based on their log2FC values. The Pearson correlation coefficient indicated the correlation of counts and Spearman’s rank correlation coefficient the correlation of gene rank. To mitigate the effect of outliers, the Pearson correlation was also calculated on log-transformed counts with a pseudocount of 1 (from now on “Pearson-log”) for the sample-wise correlation calculation.

Count correlations were considered on a sample-wise (with Pearson-log and Spearman) and gene-wise (with Pearson and Spearman) level. The sample-wise level evaluated the correlation of counts of all genes per matched FFPE sample in both analysis methods used. The gene-wise level viewed the correlation of counts of an individual gene across the analyzed sample pairs.

The interpretation of the correlation strengths was based on previous publications for application in the medical field (55, 56): ≥ 0.8 = very strong, ≥ 0.6 to < 0.8 = moderately strong, ≥ 0.3 to < 0.6 = fair, and < 0.3 = poor.

For the “HGA versus AGASAC” comparison, all 25 samples were used for the sample- and gene-wise count correlations, as a higher sample size generally improves data reliability. For DEG correlation, only the 21 primary tumor samples were included, thereby decreasing some biological variability in the tissue background within the sample cohort. All 5 CCH samples in either the early or late stage were used in both count and DEG correlations.

2.7 p-value adjustment

Different procedures for the calculation of the false discovery rate (FDR) used for p-value adjustment during DGE calculation are routinely used. The nSolver™ user manual recommends using the Benjamini-Yekutieli (BY) procedure (57). On the other hand, DESeq2 routinely employs the Benjamini-Hochberg (BH) procedure (58). Thus, to allow for better comparisons, the BY method was employed on all data shown here, as it is the more conservative and discriminating of the two (5961).

3 Results

3.1 Normalization of reads from both techniques and correlation of counts

To begin, the genes included on the nCounter® Canine IO Panel were identified which are not annotated in CanFam3.1 and thus could not be routinely found in the QuantSeq 3′ data after alignment to this genome assembly. The genes were considered “missing” or without overlap between the two methods. It was consequently impossible to calculate correlation coefficients for these genes.

In the “HGA versus AGASAC” comparison, out of 830 genes included on the nCounter® panel with Panel Plus, 821 genes (98.9%) were found in the QuantSeq 3′ data. The following nine genes (1.1%) were missing in the QuantSeq 3′ data: FCAR, HAVCR2, IFNA7, IGHG, IGHM, IL2, MIF, TRAC, and TRBC.

Similarly, in the “CCH early versus late” comparison, out of 800 genes included on the nCounter® panel, ten genes (1.25%) were missing in the QuantSeq 3′ data: CCR2, FCAR, IFNA7, IGHG, IGHM, MIF, TRAC, TRBC, TRGC2, and TRGC3. Thus, 790 genes (98.75%) genes overlapped.

To determine the correlations on a count level, counts of all genes per matched FFPE sample analyzed with both nCounter® and QuantSeq 3′ (sample-wise level) were investigated. Overall, the sample-wise count correlations exhibited a narrow distribution of correlations and all correlation medians on this level were very strong in both biological comparisons (> 0.83).

For the “HGA versus AGASAC” comparison, data obtained with TMM provided a very strong count correlation (Pearson-log median: 0.87). All normalization methods, however, similarly yielded very strong count correlations (Pearson-log medians: > 0.84). All gene rank correlations were identically very strong (Spearman medians: 0.86) among all normalization methods (Figure 2A).

Figure 2

Figure 2. Correlation of counts from QuantSeq 3′ and the nCounter® Canine IO Panel. The box plots show the sample-wise (A,B) and gene-wise (C,D) correlations of counts from both methods. Results for the entity-contrasting comparison (“HGA versus AGASAC”; A,C) and the stage-dependent comparison (“CCH early versus late”; B,D) are shown. The Pearson (for gene-wise) or Pearson-log (for sample-wise) correlation (white background) indicates the correlation at the count level, while the Spearman correlation (gray background) reflects the values at the gene rank level. The calculations were performed without normalization (patternless) and with three different normalization methods: Trimmed Means of M-values (TMM; obliquely striped), counts per million (CPM; horizontally striped), and relative log expression (RLE; reticulated). Orange horizontal line represents the median. Circles indicate outliers.

In the “CCH early versus late” comparison, the utilization of TMM also provided a very strong count correlation (Pearson-log median: 0.85). The data from all normalization methods, however, likewise yielded very strong correlations (Pearson-log medians: > 0.83). All gene rank correlations were equally very strong (Spearman medians: 0.84) among all normalization methods (Figure 2B).

For the sample-wise count correlation calculations in both biological comparisons, the application of TMM and CPM (medians: 0.87 and 0.85, respectively) both slightly improved the Pearson-log correlations compared to the correlations with no normalization (medians: 0.86 and 0.84, respectively). However, the application of RLE slightly decreased the correlations (medians: 0.85 and 0.84, respectively). No improvement was seen after implementing the three normalization methods on the Spearman correlations; here all coefficients were identical (medians: 0.86 and 0.84, respectively).

In summary, all sample-wise count correlations were very strong with TMM, CPM, and RLE. Both the Pearson-log and Spearman correlations were slightly stronger in the entity-contrasting comparison (“HGA versus AGASAC”) than in the stage-dependent comparison (“CCH early versus late”).

Next, the correlations on a count level based on the counts of an individual gene across FFPE sample pairs analyzed with both nCounter® and QuantSeq 3′ (gene-wise level) were examined. Altogether, for the gene-wise count correlations, there was a much wider distribution of correlations.

Moderately strong (medians: > 0.63) correlations were calculated for the “HGA versus AGASAC” comparison (Figure 2C). This was observed on the level of counts (Pearson medians: > 0.67) and gene rank (Spearman medians: > 0.63). All normalization methods generated stronger correlations compared to no normalization (Pearson median: 0.67, Spearman median: 0.63). The application of RLE resulted in a slightly stronger correlation of counts (Pearson median: 0.732), compared to TMM (Pearson median: 0.726) and CPM (Pearson median: 0.731). With the use of TMM and CPM, almost identical gene rank correlations (Spearman median: 0.66) were calculated, compared to RLE (Spearman median: 0.65).

The “CCH early versus late” comparison also showed a much larger distribution of correlation values in all normalization methods (Figure 2D). This was observed on the level of counts (Pearson medians: > 0.76) and gene rank (Spearman medians: > 0.71). The application of all three normalization methods only slightly improved the correlations on count level (Pearson medians: > 0.77), compared to no normalization (Pearson median: 0.76). The application of CPM resulted in a slightly stronger correlation for counts (Pearson median: 0.79), compared to TMM and RLE (Pearson medians: 0.777 and 0.779, respectively).

Notably, both the Pearson and Spearman correlations were slightly stronger in the stage-dependent comparison (“CCH early versus late”) than in the entity-contrasting comparison (“HGA versus AGASAC”), contrary to the sample-wise results.

3.2 Gene-wise correlation by gene expression level

To identify the gene-wise correlations of gene counts depending on the respective gene expression level, the Pearson and Spearman correlation coefficients normalized with TMM were plotted against the mean gene expression data from either nCounter® or QuantSeq 3′ (Figure 3). Thresholds were drawn based on the gene distribution within the scatter plots. All gene counts above a respective threshold were grouped into an expression strength category and the correlation coefficients calculated for each of the respective genes. Specifically, the mean expression correlations for all, the top 90%, and 50% of genes were calculated. In both comparisons, the median correlation among all genes including the lowly expressed genes was weaker (“HGA versus AGASAC”: Pearson: 0.73, Spearman: 0.66/“CCH early versus late”: Pearson: 0.77, Spearman: 0.73) than among all genes excluding the lowly expressed genes (“HGA versus AGASAC”: Pearson: 0.77, Spearman: 0.69/“CCH early versus late”: Pearson: 0.8, Spearman: 0.75) and including only the highly expressed genes (“HGA versus AGASAC”: Pearson: 0.85, Spearman: 0.8/“CCH early versus late”: Pearson: 0.83, Spearman: 0.81). Summarizing, the correlation of genes including the highly expressed genes was strongest.

Figure 3

Figure 3. Correlation coefficients for genes with low, medium, and high mean expression. Results for the entity-contrasting comparison (“HGA versus AGASAC”; A–F) and the stage-dependent comparison (“CCH early versus late”; G–L) are shown. Scatter plots present the correlation (y-axis) of the counts of both methods plotted against the mean nCounter® (A,B,G,H) or QuantSeq 3′ (D,E,J,K) log gene count (x-axis). Pearson (A,D,G,J) and Spearman rank (B,E,H,K) correlation coefficients were calculated with the TMM normalization method. The vertical lines indicate the thresholds between all, the top 90%, and top 50% of expressed genes. Box plots depict the correlations of all the genes included in these expression strength groups for nCounter® (C,I) and QuantSeq 3′ (F,L). Pearson (white) and Spearman rank (gray) correlation coefficients were calculated. Orange horizontal lines indicate medians. Circles indicate outliers.

3.3 Correlation of log2FC and differentially expressed genes

To ascertain the correlations of genes based on their differential gene expression in the two biological comparisons, it was first necessary to calculate the correlations of the log2FC values from DESeq2 without log2FC-shrinkage (log2FC-shrinkage = shrinks estimated effect size toward zero, stronger for genes with little information, i.e., low average read counts that can likely yield artificially high log2FC estimates) on the overlapping genes. For the “HGA versus AGASAC” comparison, both correlations were very strong (Pearson: 0.91, Spearman: 0.87) based only on the log2FC values of the genes. For the “CCH early versus late” comparison, on the other hand, both correlations were only moderately strong (Pearson: 0.68, Spearman: 0.72).

In order to determine the overlap of differential gene expression direction, the log2FC values from DESeq2 with shrinkage were collated into scatter plots. Therefore, the log2FC of a given gene from the QuantSeq 3′ data was plotted against the log2FC of the corresponding gene from the nCounter® data. The dots of the corresponding genes were colored according to the significance thresholds padj ≤ 0.05 and log2FC ≤ −1 or ≥ 1 and the correspondence of the expression direction based on the log2FC (≤ −1 = “down”; log2FC ≥ 1 = “up”).

The scatter plot for the “HGA versus AGASAC” comparison showed a total of 599 genes (Figure 4A). One hundred and twenty genes (79 “down”; 41 “up”) had the same log2FC direction (red dots). One hundred and twelve genes were predicted only in one of the methods: 62 in nCounter® (orange dots): 16 “down”; 46 “up” and 50 in QuantSeq 3′ (blue dots): 42 “down”; 8 “up.” No genes were classified in opposite directions. Three hundred and sixty-seven genes fell under the significance thresholds.

Figure 4

Figure 4. Overlap of log2FC direction and significantly differentially expressed genes (DEGs) from the subset of nCounter® Canine IO Panel targets. (A,B) Results for the “HGA versus AGASAC” comparison (entity-contrasting comparison). (C,D) Results for the “CCH early versus late” comparison (stage-dependent comparison). (A,C) Scatter plots show the log2FC of a given gene from the QuantSeq 3′ data (y-axis) plotted against the log2FC of the corresponding gene from the nCounter® data (x-axis). The dots are colored according to the correspondence of expression direction: Significantly (padj ≤ 0.05 and log2FC ≤ −1 or ≥ 1) highly/lowly expressed in both methods (red), significantly differentially expressed in only one method (nCounter®: orange, QuantSeq 3′: blue), or not significantly differentially expressed in both methods (gray). (B,D) Venn diagrams depict the overlap (light gray), if any, of significantly DEGs from nCounter® and QuantSeq 3′. DEGs exclusively detected by nCounter® are colored in dark gray (left), while DEGs found only in QuantSeq 3′ are labeled in slate gray (right). The padj were corrected with the Benjamini-Yekutieli (BY) method and the log2FC were calculated with shrinkage from DESeq2 for the scatter plot and Venn diagram.

For the “CCH early versus late” comparison, a total of 658 genes were displayed in the scatter plot (Figure 4C). However, with a p-value threshold based on the BY adjustment, no genes were recognized by nCounter® and 89 genes (15 “down”; 74 “up”) only by QuantSeq 3′. The other 569 genes fell under the significance thresholds.

The lists of DEGs with their corresponding log2FC values and padj from both methods calculated by DESeq2 with shrinkage (scatter plot groups) from both biological comparisons are provided in Supplementary Table S4.

Differential gene expression (DGE) analysis performed using the BY p-value adjustment and log2FC with shrinkage calculated a total of 182 significantly differentially expressed genes (DEGs) for nCounter® and 170 significantly DEGs for QuantSeq 3′ in the “HGA versus AGASAC” comparison. To identify the overlap of the significantly DEGs in the “HGA versus AGASAC” comparison, a Venn diagram was created using the Matplotlib Python library (62). An overlap of 120 significantly DEGs was found in the “HGA versus AGASAC” comparison (Figure 4B), corresponding to a Jaccard index of 0.52.

In the “CCH early versus late” comparison, no significantly DEGs were calculated for nCounter® and a total of 89 significantly DEGs for QuantSeq 3′ were computed. As there were no significantly DEGs in the nCounter® data, the Venn diagram showed no overlap, matching a Jaccard index of 0 (Figure 4D).

4 Discussion

4.1 Stronger correlations compared to similar correlation studies

Here, we present the first systematic comparison between data resulting from the technical platforms QuantSeq 3′ and nCounter® using the Canine IO Panel on canine archival FFPE tissue. Previous correlation studies had contrasted data from QuantSeq 3′ or nCounter® to other mRNA quantification methods utilizing FFPE tissue. These had, for example, used different human FFPE tissues to directly compare nCounter® gene selections to other RNA-Seq methods or RT-PCR. For instance, a comparison of immune gene expression in 27 clear cell renal cell carcinoma samples between the nCounter® Pan Cancer Immune Profiling Panel and the Oncomine™ Immune Response Research Assay on FFPE specimens had revealed a moderately strong correlation (Spearman: 0.73) of fold changes for 248 shared genes. On a gene-wise level, 226 of these genes had shown positive correlations and 16 had negative correlations. The mean Pearson correlation for all genes had been fair at 0.45 (range: 0.98 to −0.25) (63). Comparing 20 genes in oral carcinoma samples using custom nCounter® CodeSets and RT-PCR, a moderately strong overall correlation (Pearson: 0.59) had been calculated (22) for gene expression levels in 19 FFPE sample pairs. In contrast, the correlation results of our study on canine tissues were stronger on both count and log2FC levels. This could be due to the improved compatibility of nCounter® and QuantSeq 3′ for FFPE material compared to the Oncomine™ assay and RT-PCR that had been employed in the previous studies. Considering RNA quality, the DV200 values of the total RNA in the samples chosen for this study on canine tissues were predominantly of high quality (n = 31 with DV200 > 70%) with some samples with medium quality (n = 4 with DV200 50–70%). One previous study (63) had not provided RNA quality data. The other study (22) had employed only the RNA integrity number (RIN) and disclosed the use of strongly degraded RNA (mean RIN: 2.3, range 1.5–2.5). However, the DV200 has been shown to be a superior criterion to evaluate RNA quality compared to the widely used RIN, especially when employing partially degraded RNA from FFPE samples (46). Thus, somewhat superior RNA quality may have contributed to the slightly better correlations in our comparison.

Other studies performed on cancer cell lines (18) or peripheral blood (64) had shown that the correlations on count level are dependent on the gene expression level. Generally, weaker correlations had been observed for genes with low expression levels and stronger correlations had been calculated for genes with high expression levels. This is in line with our study’s results: both nCounter® and QuantSeq 3′ exhibited the same expression-level dependent correlation phenomenon, with stronger correlations for higher expressed genes.

The strong correlations observed in this study raise much hope for retrospective studies using samples from veterinary pathology archives, as routinely stored FFPE tissues seem principally accessible for transcriptome analyses using both nCounter® and QuantSeq 3′. In comparison to FFPE material, fresh, or fresh-frozen tissue is much more difficult to obtain in veterinary and comparative pathology for logistical, ethical, potentially infectious, and legal reasons. Additionally, the cost of RNA-Seq has greatly decreased since its first introduction and in combination with the easy storage of FFPE material provides a cost-effective research and diagnostic tool for veterinary researchers and clinicians. Especially in favor of Russell’s and Burch’s 3 R (reduce, refine, replace) principles to improve ethical concerns when using experimental animals, it is imperative to obtain the highest possible benefit out of animal tissues. Obviously, routine archiving of FFPE tissues for many decades has proven to be a fortunate circumstance.

4.2 Confounders and influences on transcriptome data from FFPE samples

Confounding factors when using FFPE material, however, include the type, buffering state, and concentration of formalin used, duration of fixation, tissue processing, and storage conditions, which all may impact RNA quality (1, 6, 6568). In the case of this study, in which all tissues stemmed from routinely fixed, processed, and stored samples as part of the institutional diagnostic service, such limitations seemed to have had little, if any impact, on the results based on the RNA quality (Supplementary Table S2).

Still, multiple other aspects likely have an influence on correlations of data from different transcriptome analysis methodologies and might also account for the differences in DEGs detected in both methods in this study. To begin, the differing detection methods may result in slight discrepancies in the identification of transcripts and bioinformatic alignment of sequences after RNA-Seq may not be as specific as the probes used in nCounter®. Furthermore, by focusing on a selection of approximately 800 genes, the nCounter® panel offers the possibility to detect even weakly expressed transcripts, which may be overlooked compared to highly expressed genes, depending on the depth of sequencing in the case of QuantSeq 3′. However, pre-selected gene panels have some inherent limitations due to their focused design compared to genome-wide approaches, such as RNA-Seq. As expression analysis is restricted to specific genes of interest, changes in other genes not included on the panel remain undetected. Thus, potential novel biomarkers, broader biological processes, or unforeseen gene interactions can remain unidentified (69). Incomplete interpretation of underlying mechanisms or the oversight of key regulatory genes involved in a disease phenotype can result. Rather, panels are intended for focused research questions or validation of known targets. The choice of genes selected for inclusion is dependent on current knowledge and thus introduces a bias, restricting the scope of discovery. The gene set enrichment analysis tools available for RNA-Seq are of little statistical value for panels, as the background gene set is limited and biased by the panel’s focus. Thus, pre-selection violates the statistical assumption of unbiased, comprehensive, and genome-wide gene expression data (70).

Furthermore, different normalization methods may have an effect. This study employed the nSolver™-embedded normalization method for the nCounter® data and compared three common normalization methods (TMM, CPM, RLE) for the QuantSeq 3′ data. All methods performed similarly, with only minimal differences that seemingly did not influence the strength of correlations. Extending beyond these normalization methods, there is however a multitude of other approaches, which were developed to correct biases or weaknesses, such as accounting for gene length, refinement for FFPE RNA-Seq data, or microarray input (71, 72). For nCounter® data, further methods have likewise been developed for normalization and differential expression analysis apart from nSolver™. As the three normalization methods employed in this study provided little differences in correlations, it was not deemed necessary to test the impact of alternate normalization strategies on count correlations. Instead, it seems that DESeq2 and edgeR can reliably be used for count normalizations on QuantSeq 3′ data and that the software package one uses for normalization will likely have very little impact on the data. Similarly, other packages or approaches for differential gene expression (DGE) analysis are available (73). Again, in this study, the impact that these alternate DGE calculations may have on DEG correlations was not investigated. Overall, the analysis of DEGs is based on different approaches, i.e., statistical calculations, available software, and pathway lists for enrichment analyses, when using transcriptomic platforms. The nCounter® reads are usually fed into the nSolver™ software with the Advanced Analysis plug-in and pathway analyses are conducted with the program’s inbuilt modules. Annotations are assigned to most genes, which are provided in annotation files. The annotations for the Canine IO Panel are based on two previously developed human panels: the immune response categories originate from the PanCancer Immune Profiling Panel and the functional annotations are based on the IO 360 Panel. These annotations in turn stem from the Gene Ontology (GO) (74, 75), Kyoto Encyclopedia of Genes and Genomes (KEGG) (76), and Reactome (77, 78) databases. If an investigator has another research focus in mind, a gene curation team is able to annotate genes included on a panel with suitable terms. A custom annotation route, using an annotation engine, assigns annotations to customized gene add-ons (Panel Plus) or CodeSets directly from a database based on context (NanoString curation team, personal communication). Additionally, based on a specific research query, each investigator has the ability to manually change and adjust the annotations in an annotation file. This can lead to fluctuations and different statistical outcomes and should be transparently declared in consequent publications. With a multitude of gene annotation databases available for pathway analysis, such as GO, KEGG, Reactome, and Molecular Signatures Database (MSigDB) (79, 80), the nCounter® annotations add further nomenclature for gene grouping categories and pathways, which are not always transferable and/or interchangeable when comparing terms. However, researches may be able to answer questions more target-oriented this way.

Noteworthy, in the stage-dependent comparison conducted in this study, 89 DEGs were detected using QuantSeq 3′, but none using nCounter®. As previously outlined, it is generally to be expected that the two methods will yield divergent results, partly due to the differing scope of potentially detectable transcripts (whole transcriptome versus focus on 800 genes) and partly, to a lesser extent, due to different underlying detection methods used. Still, when comparing the stages of CCH, it must be emphasized that there are only minor differences at the transcriptome level (36). This is also reflected by the low number of detected DEGs using QuantSeq 3′ in the stage-dependent contrary to the entity-contrasting comparison in this study. However, studies that specifically had examined individual transcripts utilizing quantitative PCR found differences between early and late stages of CCH, for example in expression of proinflammatory cytokines (34). Thus, further investigation is required in order to resolve this discrepancy. Larger group sizes than those used in this study (n = 5) might be necessary to detect a larger number of significantly DEGs. Additionally, to the presumably minor differences at transcriptome level, the number of DEGs is likely to be reduced by the implementation of the BY procedure, which has been demonstrated to be more conservative than other FDR calculation methods (5761).

Major differences also lie in data processing. As with all large-scale transcriptome analysis systems, vast amounts of raw data are generated requiring further bioinformatic analysis. The QuantSeq 3′ system provides nucleotide sequences in FASTQ data files for alignment and normalization using standard bioinformatic methods. Prior bioinformatic know-how is thus indispensable. In comparison, the nCounter® system has the advantage of the specially developed nSolver™ software. The generated raw data can be imported into this software, normalized and analyzed by the user without any previous bioinformatic expertise.

4.3 Biological interpretation of transcriptome differences between the tumor entities studied

This technical study was based on two comparisons between contrasting tumor entities, the first between early and late stages of the spontaneously regressing canine cutaneous histiocytoma (CCH) and the second between two common canine perianal tumors with very distinct biological behaviors. The widely overlapping data obtained from the two methodological approaches clearly allowed for several oncologically interesting implications, with mutual confirmations by the two methods used. These results and interpretations, however, are subject of separate publications already published (36) or in preparation.

5 Conclusion

Taken together, our comparison between gene expression data obtained from QuantSeq 3′ and the nCounter® Canine IO Panel revealed very strong to moderately strong correlations when FFPE archival tissues stored up to 8 years long were used. The best correlations were achieved on the sample-wise level in both biological comparisons. Based on these strong correlations, it appears feasible to use either of the approaches to validate data generated by the other. nCounter® seems to be superior for validation compared to PCR for fragmented mRNA from FFPE tissue. There are, however, several more strategical and practical differences between the technologies and their actual use depends on the project’s study goal and design (Table 1). Clearly, both approaches make archival tissue well accessible for transcriptome studies in veterinary and comparative medicine. In particular, studies on rare entities or elusive biological cases will profit immensely from recent technological progress when compared to previous transcriptome methodologies.

Data availability statement

The raw data underlying the results presented in this study are publicly available from the Gene Expression Omnibus (GEO) database under the accession numbers GSE262020 (nCounter® data for CCH samples), GSE261791 (nCounter® data for HGA and AGASAC samples), GSE262022 (QuantSeq 3’ data for CCH samples), and GSE261790 (QuantSeq 3’ data for HGA and AGASAC samples. The source code and processed datasets are available on figshare: dx.doi.org/10.6084/m9.figshare.25768587.

Ethics statement

The animal studies were approved by State Office for Health and Social Affairs, Berlin (StN 010/23). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent was obtained from the owners for the participation of their animals in this study.

Author contributions

AH: Conceptualization, Formal analysis, Investigation, Methodology, Validation, Visualization, Project administration, Writing – original draft, Writing – review & editing. AL: Conceptualization, Formal analysis, Investigation, Methodology, Validation, Visualization, Project administration, Writing – original draft, Writing – review & editing. VJ: Investigation, Software, Writing – original draft, Writing – review & editing. SA: Methodology, Software, Investigation, Visualization, Writing – original draft, Writing – review & editing. AG: Conceptualization, Funding acquisition, Project administration, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was partly funded by a grant from the Berlin Einstein Foundation (EC3R) to AG. Additionally, this work was supported by a TEAMS grant from the Freie Universität Berlin to AG.

Acknowledgments

The authors thank Hedwig Lammert at the Charité University Medicine Berlin, as well as Tiina Berg and Magdalena Mecking from Lexogen Services, Vienna, for their technical assistance.

Conflict of interest

AG, AH, and AL were actively involved in the unremunerated collaborative construction of the nCounter® Canine IO Panel as part of NanoString’s Canine Consortium in 2020/2021. None of the authors have stock options in this publicly traded company.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fvets.2025.1601306/full#supplementary-material

References

1. Klopfleisch, R, Weiss, AT, and Gruber, AD. Excavation of a buried treasure--DNA, mRNA, miRNA and protein analysis in formalin fixed, paraffin embedded tissues. Histol Histopathol. (2011) 26:797–810. doi: 10.14670/hh-26.797

PubMed Abstract | Crossref Full Text | Google Scholar

2. Abrahamsen, HN, Steiniche, T, Nexo, E, Hamilton-Dutoit, SJ, and Sorensen, BS. Towards quantitative mRNA analysis in paraffin-embedded tissues using real-time reverse transcriptase-polymerase chain reaction: a methodological study on lymph nodes from melanoma patients. J Mol Diagn. (2003) 5:34–41. doi: 10.1016/S1525-1578(10)60449-7

PubMed Abstract | Crossref Full Text | Google Scholar

3. Lewis, F, Maughan, NJ, Smith, V, Hillan, K, and Quirke, P. Unlocking the archive – gene expression in paraffin-embedded tissue. J Pathol. (2001) 195:66–71. doi: 10.1002/1096-9896(200109)195:1<66::AID-PATH921>3.0.CO;2-F

PubMed Abstract | Crossref Full Text | Google Scholar

4. Mies, C. Molecular biological analysis of paraffin-embedded tissues. Hum Pathol. (1994) 25:555–60. doi: 10.1016/0046-8177(94)90218-6

PubMed Abstract | Crossref Full Text | Google Scholar

5. Cieslik, M, Chugh, R, Wu, YM, Wu, M, Brennan, C, Lonigro, R, et al. The use of exome capture RNA-seq for highly degraded RNA with application to clinical cancer sequencing. Genome Res. (2015) 25:1372–81. doi: 10.1101/gr.189621.115

PubMed Abstract | Crossref Full Text | Google Scholar

6. von Ahlfen, S, Missel, A, Bendrat, K, and Schlumpberger, M. Determinants of RNA quality from FFPE samples. PLoS One. (2007) 2:e1261. doi: 10.1371/journal.pone.0001261

PubMed Abstract | Crossref Full Text | Google Scholar

7. Masuda, N, Ohnishi, T, Kawamoto, S, Monden, M, and Okubo, K. Analysis of chemical modification of RNA from formalin-fixed samples and optimization of molecular biology applications for such samples. Nucleic Acids Res. (1999) 27:4436–43. doi: 10.1093/nar/27.22.4436

PubMed Abstract | Crossref Full Text | Google Scholar

8. Werner, M, Chott, A, Fabiano, A, and Battifora, H. Effect of formalin tissue fixation and processing on immunohistochemistry. Am J Surg Pathol. (2000) 24:1016–9. doi: 10.1097/00000478-200007000-00014

PubMed Abstract | Crossref Full Text | Google Scholar

9. Corley, SM, Troy, NM, Bosco, A, and Wilkins, MR. Quant seq. 3′ sequencing combined with Salmon provides a fast, reliable approach for high throughput RNA expression analysis. Sci Rep. (2019) 9:18895. doi: 10.1038/s41598-019-55434-x

PubMed Abstract | Crossref Full Text | Google Scholar

10. Mehine, M, Khamaiseh, S, Ahvenainen, T, Heikkinen, T, Äyräväinen, A, Pakarinen, P, et al. 3′RNA sequencing accurately classifies formalin-fixed paraffin-embedded uterine leiomyomas. Cancers. (2020) 12:3839. doi: 10.3390/cancers12123839

PubMed Abstract | Crossref Full Text | Google Scholar

11. Tandonnet, S, and Torres, TT. Traditional versus 3' RNA-seq in a non-model species. Genom Data. (2017) 11:9–16. doi: 10.1016/j.gdata.2016.11.002

PubMed Abstract | Crossref Full Text | Google Scholar

12. Ma, F, Fuqua, BK, Hasin, Y, Yukhtman, C, Vulpe, CD, Lusis, AJ, et al. A comparison between whole transcript and 3’ RNA sequencing methods using Kapa and Lexogen library preparation methods. BMC Genomics. (2019) 20:9. doi: 10.1186/s12864-018-5393-3

PubMed Abstract | Crossref Full Text | Google Scholar

13. Geiss, GK, Bumgarner, RE, Birditt, B, Dahl, T, Dowidar, N, Dunaway, DL, et al. Direct multiplexed measurement of gene expression with color-coded probe pairs. Nat Biotechnol. (2008) 26:317–25. doi: 10.1038/nbt1385

PubMed Abstract | Crossref Full Text | Google Scholar

14. Brumbaugh, CD, Kim, HJ, Giovacchini, M, and Pourmand, N. NanoStriDE: normalization and differential expression analysis of NanoString nCounter data. BMC Bioinformatics. (2011) 12:479. doi: 10.1186/1471-2105-12-479

PubMed Abstract | Crossref Full Text | Google Scholar

15. Barreno, L, Sevane, N, Valdivia, G, Alonso-Miguel, D, Suarez-Redondo, M, Alonso-Diez, A, et al. Transcriptomics of canine inflammatory mammary Cancer treated with empty cowpea mosaic virus implicates neutrophils in anti-tumor immunity. Int J Mol Sci. (2023) 24:14034. doi: 10.3390/ijms241814034

PubMed Abstract | Crossref Full Text | Google Scholar

16. Magee, K, Marsh, IR, Turek, MM, Grudzinski, J, Aluicio-Sarduy, E, Engle, JW, et al. Safety and feasibility of an in situ vaccination and immunomodulatory targeted radionuclide combination immuno-radiotherapy approach in a comparative (companion dog) setting. PLoS One. (2021) 16:e0255798. doi: 10.1371/journal.pone.0255798

PubMed Abstract | Crossref Full Text | Google Scholar

17. Chambers, MR, Foote, JB, Bentley, RT, Botta, D, Crossman, DK, Della Manna, DL, et al. Evaluation of immunologic parameters in canine glioma patients treated with an oncolytic herpes virus. J Transl Genet Genom. (2021) 5:423–42. doi: 10.20517/jtgg.2021.31

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang, W, Petegrosso, R, Chang, J-W, Sun, J, Yong, J, Chien, J, et al. A large-scale comparative study of isoform expressions measured on four platforms. BMC Genomics. (2020) 21:272. doi: 10.1186/s12864-020-6643-8

PubMed Abstract | Crossref Full Text | Google Scholar

19. Eastel, JM, Lam, KW, Lee, NL, Lok, WY, Tsang, AHF, Pei, XM, et al. Application of NanoString technologies in companion diagnostic development. Expert Rev Mol Diagn. (2019) 19:591–8. doi: 10.1080/14737159.2019.1623672

PubMed Abstract | Crossref Full Text | Google Scholar

20. Jang, JS, Holicky, E, Lau, J, McDonough, S, Mutawe, M, Koster, MJ, et al. Application of the 3′ mRNA-Seq using unique molecular identifiers in highly degraded RNA derived from formalin-fixed, paraffin-embedded tissue. BMC Genomics. (2021) 22:759. doi: 10.1186/s12864-021-08068-1

PubMed Abstract | Crossref Full Text | Google Scholar

21. Zheng, C-M, Piao, X-M, Byun, YJ, Song, SJ, Kim, S-K, Moon, S-K, et al. Study on the use of Nanostring nCounter to analyze RNA extracted from formalin-fixed-paraffin-embedded and fresh frozen bladder cancer tissues. Cancer Genet. (2022) 268-269:137–43. doi: 10.1016/j.cancergen.2022.10.143

PubMed Abstract | Crossref Full Text | Google Scholar

22. Reis, PP, Waldron, L, Goswami, RS, Xu, W, Xuan, Y, Perez-Ordonez, B, et al. mRNA transcript quantification in archival samples using multiplexed, color-coded probes. BMC Biotechnol. (2011) 11:46. doi: 10.1186/1472-6750-11-46

PubMed Abstract | Crossref Full Text | Google Scholar

23. Norton, N, Sun, Z, Asmann, YW, Serie, DJ, Necela, BM, Bhagwate, A, et al. Gene expression, single nucleotide variant and fusion transcript discovery in archival material from breast tumors. PLoS One. (2013) 8:e81925. doi: 10.1371/journal.pone.0081925

PubMed Abstract | Crossref Full Text | Google Scholar

24. Turnbull, AK, Selli, C, Martinez-Perez, C, Fernando, A, Renshaw, L, Keys, J, et al. Unlocking the transcriptomic potential of formalin-fixed paraffin embedded clinical tissues: comparison of gene expression profiling approaches. BMC Bioinformatics. (2020) 21:30. doi: 10.1186/s12859-020-3365-5

PubMed Abstract | Crossref Full Text | Google Scholar

25. Jouinot, A, Lippert, J, Sibony, M, Violon, F, Jeanpierre, L, De Murat, D, et al. Transcriptome in paraffin samples for the diagnosis and prognosis of adrenocortical carcinoma. Eur J Endocrinol. (2022) 186:607–17. doi: 10.1530/eje-21-1228

PubMed Abstract | Crossref Full Text | Google Scholar

26. Nielsen, T, Wallden, B, Schaper, C, Ferree, S, Liu, S, Gao, D, et al. Analytical validation of the PAM50-based Prosigna breast Cancer prognostic gene signature assay and nCounter analysis system using formalin-fixed paraffin-embedded breast tumor specimens. BMC Cancer. (2014) 14:177. doi: 10.1186/1471-2407-14-177

PubMed Abstract | Crossref Full Text | Google Scholar

27. Wallden, B, Storhoff, J, Nielsen, T, Dowidar, N, Schaper, C, Ferree, S, et al. Development and verification of the PAM50-based Prosigna breast cancer gene signature assay. BMC Med Genet. (2015) 8:54. doi: 10.1186/s12920-015-0129-6

PubMed Abstract | Crossref Full Text | Google Scholar

28. Veldman-Jones, MH, Brant, R, Rooney, C, Geh, C, Emery, H, Harbron, CG, et al. Evaluating robustness and sensitivity of the NanoString technologies nCounter platform to enable multiplexed gene expression analysis of clinical samples. Cancer Res. (2015) 75:2587–93. doi: 10.1158/0008-5472.Can-15-0262

PubMed Abstract | Crossref Full Text | Google Scholar

29. Dow, S. A role for dogs in advancing cancer immunotherapy research. Front Immunol. (2020) 10:2935. doi: 10.3389/fimmu.2019.02935

PubMed Abstract | Crossref Full Text | Google Scholar

30. Gardner, HL, Fenger, JM, and London, CA. Dogs as a model for cancer. Annu Rev Anim Biosci. (2016) 4:199–222. doi: 10.1146/annurev-animal-022114-110911

PubMed Abstract | Crossref Full Text | Google Scholar

31. Rowell, JL, McCarthy, DO, and Alvarez, CE. Dog models of naturally occurring cancer. Trends Mol Med. (2011) 17:380–8. doi: 10.1016/j.molmed.2011.02.004

PubMed Abstract | Crossref Full Text | Google Scholar

32. Moore, PF. A review of histiocytic diseases of dogs and cats. Vet Pathol. (2014) 51:167–84. doi: 10.1177/0300985813510413

PubMed Abstract | Crossref Full Text | Google Scholar

33. Puff, C, Risha, E, and Baumgärtner, W. Regression of canine cutaneous histiocytoma is associated with an orchestrated expression of matrix metalloproteinases. J Comp Pathol. (2013) 149:208–15. doi: 10.1016/j.jcpa.2013.01.014

PubMed Abstract | Crossref Full Text | Google Scholar

34. Kaim, U, Moritz, A, Failing, K, and Baumgärtner, W. The regression of a canine Langerhans cell tumour is associated with increased expression of IL-2, TNF-alpha, IFN-gamma and iNOS mRNA. Immunology. (2006) 118:472–82. doi: 10.1111/j.1365-2567.2006.02394.x

PubMed Abstract | Crossref Full Text | Google Scholar

35. Kipar, A, Baumgärtner, W, Kremmer, E, Frese, K, and Weiss, E. Expression of major histocompatibility complex class II antigen in neoplastic cells of canine cutaneous histiocytoma. Vet Immunol Immunopathol. (1998) 62:1–13. doi: 10.1016/s0165-2427(97)00170-0

PubMed Abstract | Crossref Full Text | Google Scholar

36. Loriani Fard, AK, Haake, A, Jovanovic, V, Andreotti, S, Hummel, M, Hempel, BF, et al. Immuno-oncologic profiling by stage-dependent transcriptome and proteome analyses of spontaneously regressing canine cutaneous histiocytoma. PeerJ. (2024) 12:e18444. doi: 10.7717/peerj.18444

PubMed Abstract | Crossref Full Text | Google Scholar

37. Diehl, B, and Hansmann, F. Immune checkpoint regulation is critically involved in canine cutaneous histiocytoma regression. Front Vet Sci. (2024) 11:1371931. doi: 10.3389/fvets.2024.1371931

PubMed Abstract | Crossref Full Text | Google Scholar

38. Cockerell, GL, and Slauson, DO. Patterns of lymphoid infiltrate in the canine cutaneous histiocytoma. J Comp Pathol. (1979) 89:193–203. doi: 10.1016/0021-9975(79)90058-6

PubMed Abstract | Crossref Full Text | Google Scholar

39. McCourt, MR, Levine, GM, Breshears, MA, Wall, CR, and Meinkoth, JH. Metastatic disease in a dog with a well-differentiated perianal gland tumor. Vet Clin Pathol. (2018) 47:649–53. doi: 10.1111/vcp.12662

PubMed Abstract | Crossref Full Text | Google Scholar

40. Jardim, J, Kobayashi, PE, Cosentino, PD, Alcaraz, A, Laufer-Amorim, R, and Fonseca-Alves, CE. Clinicopathological and immunohistochemical description of an intrapelvic hepatoid gland carcinoma in a 14-year-old Teckel dog. Vet Q. (2018) 38:9–13. doi: 10.1080/01652176.2017.1404167

PubMed Abstract | Crossref Full Text | Google Scholar

41. Williams, LE, Gliatto, JM, Dodge, RK, Johnson, JL, Gamblin, RM, Thamm, DH, et al. Carcinoma of the apocrine glands of the anal sac in dogs: 113 cases (1985-1995). J Am Vet Med Assoc. (2003) 223:825–31. doi: 10.2460/javma.2003.223.825

PubMed Abstract | Crossref Full Text | Google Scholar

42. Repasy, AB, Selmic, LE, and Kisseberth, WC. Canine apocrine gland anal sac adenocarcinoma: A review. Top Companion Anim Med. (2022) 50:100682. doi: 10.1016/j.tcam.2022.100682

PubMed Abstract | Crossref Full Text | Google Scholar

43. Brown, RJ, Newman, SJ, Durtschi, DC, and LeBlanc, AK. Expression of PDGFR-β and kit in canine anal sac apocrine gland adenocarcinoma using tissue immunohistochemistry. Vet Comp Oncol. (2012) 10:74–9. doi: 10.1111/j.1476-5829.2011.00286.x

PubMed Abstract | Crossref Full Text | Google Scholar

44. Wong, H, Byrne, S, Rasotto, R, Drees, R, Taylor, A, Priestnall, SL, et al. A retrospective study of clinical and histopathological features of 81 cases of canine apocrine gland adenocarcinoma of the anal sac: independent clinical and histopathological risk factors associated with outcome. Animals. (2021) 11:3327. doi: 10.3390/ani11113327

PubMed Abstract | Crossref Full Text | Google Scholar

45. Schroeder, A, Mueller, O, Stocker, S, Salowsky, R, Leiber, M, Gassmann, M, et al. The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol Biol. (2006) 7:3. doi: 10.1186/1471-2199-7-3

PubMed Abstract | Crossref Full Text | Google Scholar

46. Matsubara, T, Soh, J, Morita, M, Uwabo, T, Tomida, S, Fujiwara, T, et al. DV200 index for assessing RNA integrity in next-generation sequencing. Biomed Res Int. (2020) 2020:9349132. doi: 10.1155/2020/9349132

PubMed Abstract | Crossref Full Text | Google Scholar

47. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. (2011) 17:10–2. doi: 10.14806/ej.17.1.200

Crossref Full Text | Google Scholar

48. Hoeppner, MP, Lundquist, A, Pirun, M, Meadows, JRS, Zamani, N, Johnson, J, et al. An improved canine genome and a comprehensive catalogue of coding genes and non-coding transcripts. PLoS One. (2014) 9:e91172. doi: 10.1371/journal.pone.0091172

PubMed Abstract | Crossref Full Text | Google Scholar

49. Dobin, A, Davis, CA, Schlesinger, F, Drenkow, J, Zaleski, C, Jha, S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. (2012) 29:15–21. doi: 10.1093/bioinformatics/bts635

PubMed Abstract | Crossref Full Text | Google Scholar

50. Liao, Y, Smyth, GK, and Shi, W. Feature counts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. (2013) 30:923–30. doi: 10.1093/bioinformatics/btt656

PubMed Abstract | Crossref Full Text | Google Scholar

51. Robinson, MD, McCarthy, DJ, and Smyth, GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. (2009) 26:139–40. doi: 10.1093/bioinformatics/btp616

PubMed Abstract | Crossref Full Text | Google Scholar

52. McCarthy, DJ, Chen, Y, and Smyth, GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. (2012) 40:4288–97. doi: 10.1093/nar/gks042

PubMed Abstract | Crossref Full Text | Google Scholar

53. Chen, Y, Lun, A, and Smyth, G. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Res. (2016) 5:1438. doi: 10.12688/f1000research.8987.2

Crossref Full Text | Google Scholar

54. Love, MI, Huber, W, and Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. (2014) 15:550. doi: 10.1186/s13059-014-0550-8

PubMed Abstract | Crossref Full Text | Google Scholar

55. Chan, YH. Biostatistics 104: correlational analysis. Singapore Med J. (2003) 44:614–9.

PubMed Abstract | Google Scholar

56. Akoglu, H. User's guide to correlation coefficients. Turk J Emerg Med. (2018) 18:91–3. doi: 10.1016/j.tjem.2018.08.001

PubMed Abstract | Crossref Full Text | Google Scholar

57. Benjamini, Y, and Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann Stat. (2001) 29:1165–88. doi: 10.1214/aos/1013699998

Crossref Full Text | Google Scholar

58. Benjamini, Y, and Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Stat Methodol. (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

Crossref Full Text | Google Scholar

59. Brinster, R, Köttgen, A, Tayo, BO, Schumacher, M, and Sekula, Pon behalf of the CC. Control procedures and estimators of the false discovery rate and their application in low-dimensional settings: an empirical investigation. BMC Bioinformatics. (2018) 19:78. doi: 10.1186/s12859-018-2081-x

PubMed Abstract | Crossref Full Text | Google Scholar

60. Chen, S-Y, Feng, Z, and Yi, X. A general introduction to adjustment for multiple comparisons. J Thorac Dis. (2017) 9:1725–9. doi: 10.21037/jtd.2017.05.34

PubMed Abstract | Crossref Full Text | Google Scholar

61. Goeman, JJ, and Solari, A. Multiple hypothesis testing in genomics. Stat Med. (2014) 33:1946–78. doi: 10.1002/sim.6082

PubMed Abstract | Crossref Full Text | Google Scholar

62. Hunter, JD. Matplotlib: a 2D graphics environment. Comput Sci Eng. (2007) 9:90–5. doi: 10.1109/MCSE.2007.55

Crossref Full Text | Google Scholar

63. Talla, SB, Rempel, E, Endris, V, Jenzer, M, Allgäuer, M, Schwab, C, et al. Immuno-oncology gene expression profiling of formalin-fixed and paraffin-embedded clear cell renal cell carcinoma: performance comparison of the NanoString nCounter technology with targeted RNA sequencing. Genes Chromosom Cancer. (2020) 59:406–16. doi: 10.1002/gcc.22843

PubMed Abstract | Crossref Full Text | Google Scholar

64. Bondar, G, Xu, W, Elashoff, D, Li, X, Faure-Kumar, E, Bao, TM, et al. Comparing NGS and NanoString platforms in peripheral blood mononuclear cell transcriptome profiling for advanced heart failure biomarker development. J Biol Methods. (2020) 7:e123. doi: 10.14440/jbm.2020.300

PubMed Abstract | Crossref Full Text | Google Scholar

65. Frankel, A. Formalin fixation in the ‘-omics’ era: a primer for the surgeon-scientist. ANZ J Surg. (2012) 82:395–402. doi: 10.1111/j.1445-2197.2012.06092.x

PubMed Abstract | Crossref Full Text | Google Scholar

66. Medeiros, F, Rigl, CT, Anderson, GG, Becker, SH, and Halling, KC. Tissue handling for genome-wide expression analysis: A review of the issues, evidence, and opportunities. Arch Pathol Lab Med. (2007) 131:1805–16. doi: 10.5858/2007-131-1805-thfgea

PubMed Abstract | Crossref Full Text | Google Scholar

67. Chung, JY, Braunschweig, T, Williams, R, Guerrero, N, Hoffmann, KM, Kwon, M, et al. Factors in tissue handling and processing that impact RNA obtained from formalin-fixed, paraffin-embedded tissue. J Histochem Cytochem. (2008) 56:1033–42. doi: 10.1369/jhc.2008.951863

PubMed Abstract | Crossref Full Text | Google Scholar

68. Xie, R, Chung, J-Y, Ylaya, K, Williams, RL, Guerrero, N, Nakatsuka, N, et al. Factors influencing the degradation of archival formalin-fixed paraffin-embedded tissue sections. J Histochem Cytochem. (2011) 59:356–65. doi: 10.1369/0022155411398488

PubMed Abstract | Crossref Full Text | Google Scholar

69. Chilimoniuk, J, Erol, A, Rödiger, S, and Burdukiewicz, M. Challenges and opportunities in processing NanoString nCounter data. Comput Struct Biotechnol J. (2024) 23:1951–8. doi: 10.1016/j.csbj.2024.04.061

PubMed Abstract | Crossref Full Text | Google Scholar

70. Subramanian, A, Tamayo, P, Mootha, VK, Mukherjee, S, Ebert, BL, Gillette, MA, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. (2005) 102:15545–50. doi: 10.1073/pnas.0506580102

PubMed Abstract | Crossref Full Text | Google Scholar

71. Bullard, JH, Purdom, E, Hansen, KD, and Dudoit, S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics. (2010) 11:94. doi: 10.1186/1471-2105-11-94

PubMed Abstract | Crossref Full Text | Google Scholar

72. Abbas-Aghababazadeh, F, Li, Q, and Fridley, BL. Comparison of normalization approaches for gene expression studies completed with high-throughput sequencing. PLoS One. (2018) 13:e0206312. doi: 10.1371/journal.pone.0206312

PubMed Abstract | Crossref Full Text | Google Scholar

73. Seyednasrollah, F, Laiho, A, and Elo, LL. Comparison of software packages for detecting differential expression in RNA-seq studies. Brief Bioinform. (2013) 16:59–70. doi: 10.1093/bib/bbt086

PubMed Abstract | Crossref Full Text | Google Scholar

74. Ashburner, M, Ball, CA, Blake, JA, Botstein, D, Butler, H, Cherry, JM, et al. Gene ontology: tool for the unification of biology. Nat Genet. (2000) 25:25–9. doi: 10.1038/75556

PubMed Abstract | Crossref Full Text | Google Scholar

75. Aleksander, SA, Balhoff, J, Carbon, S, Cherry, JM, Drabkin, HJ, Ebert, D, et al. The gene ontology knowledgebase in 2023. Genetics. (2023) 224:iyad031. doi: 10.1093/genetics/iyad031

PubMed Abstract | Crossref Full Text | Google Scholar

76. Kanehisa, M, Sato, Y, Kawashima, M, Furumichi, M, and Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. (2015) 44:D457–62. doi: 10.1093/nar/gkv1070

PubMed Abstract | Crossref Full Text | Google Scholar

77. Fabregat, A, Sidiropoulos, K, Viteri, G, Forner, O, Marin-Garcia, P, Arnau, V, et al. Reactome pathway analysis: a high-performance in-memory approach. BMC Bioinformatics. (2017) 18:142. doi: 10.1186/s12859-017-1559-2

PubMed Abstract | Crossref Full Text | Google Scholar

78. Gillespie, M, Jassal, B, Stephan, R, Milacic, M, Rothfels, K, Senff-Ribeiro, A, et al. The reactome pathway knowledgebase 2022. Nucleic Acids Res. (2021) 50:D687–92. doi: 10.1093/nar/gkab1028

PubMed Abstract | Crossref Full Text | Google Scholar

79. Liberzon, A, Birger, C, Thorvaldsdóttir, H, Ghandi, M, Mesirov, JP, and Tamayo, P. The molecular signatures database hallmark gene set collection. Cell Syst. (2015) 1:417–25. doi: 10.1016/j.cels.2015.12.004

PubMed Abstract | Crossref Full Text | Google Scholar

80. Liberzon, A, Subramanian, A, Pinchback, R, Thorvaldsdóttir, H, Tamayo, P, and Mesirov, JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. (2011) 27:1739–40. doi: 10.1093/bioinformatics/btr260

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: dog, FFPE, nCounter, pathology, QuantSeq 3′, oncology, veterinary

Citation: Haake AFH, Loriani Fard AK, Jovanovic VM, Andreotti S and Gruber AD (2025) Strong correlation of gene counts and differentially expressed genes between a 3′ RNA-Seq and an RNA hybridization platform in transcriptome analyses from canine archival tissues. Front. Vet. Sci. 12:1601306. doi: 10.3389/fvets.2025.1601306

Received: 04 April 2025; Accepted: 05 June 2025;
Published: 30 June 2025.

Edited by:

Guadalupe Gómez-Baena, University of Cordoba, Spain

Reviewed by:

Robert Stryiński, University of Warmia and Mazury in Olsztyn, Poland
Jingjing Ling, Elanco, United States

Copyright © 2025 Haake, Loriani Fard, Jovanovic, Andreotti and Gruber. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Achim D. Gruber, YWNoaW0uZ3J1YmVyQGZ1LWJlcmxpbi5kZQ==

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.