Abstract
Background:
The human genome contains approximately 3,200 near full-length autonomous human endogenous retroviral (HERV) genomes distributed across the 23 chromosomes. These autonomous HERV proviral genomes include long terminal repeats (LTRs) capable of promoting RNA transcription. In quiescent cells, most HERV loci remain transcriptionally silent. However, environmental changes, such as epigenetic remodeling of chromatin, can activate these silenced loci.
Methods:
To study HERV reactivation, we previously analyzed autonomous HERV expression patterns in monocytes isolated from peripheral blood mononuclear cells (PBMCs) identified in single-cell RNA sequencing (scRNA-seq) databases using the Azimuth application. We developed a Window-based HERV Alignment (WHA) method, which analyzes aligned DNA sequences using sequential, non-overlapping windows of defined lengths. Samples were scored as positive (>= 9 good/usable windows) or negative (<= 8 good/usable windows).
Results:
Using WHA, we established a control set from 31 normal individuals, with fewer than 8 windows at selected HERV loci. We analyzed scRNA-seq data from three studies of hospitalized COVID-19 patients and found distinct HERV expression patterns in monocytes. Unique patterns were also found in patients with influenza, Dengue virus, or sepsis. We next examined HERV expression at early (<7 days) and late (>14 days) timepoints post COVID-19 recovery and detected HERV loci in both groups. Analyzing 12 patients with post-acute sequelae of COVID-19 (PASC), we identified three HERV loci expressed in all patients. Some loci showed amplified numbers of good/usable windows, indicating longer transcripts and greater sequence depth. The most amplified locus was located within an intron of JAKMIP2, which, along with neighboring host genes, also showed increased transcription.
Conclusion:
Previous studies have shown that viral infections, including COVID-19, influenza, and Dengue virus, as well as sepsis, can induce innate immune memory in monocytes through epigenetic remodeling of hematopoietic stem and myeloid precursor cells. The identification of co-amplified HERV loci and neighboring host gene transcripts in monocytes from PASC patients suggests expansion of epigenetically remodeled myeloid progenitors. The identification of these HERV-host gene patterns provides a foundation needed to understand the clinical features of patients with PASC.
Introduction
Studies have established the importance of innate immune cells, such as monocytes, in the response to various viral and bacterial pathogens (–). Viral infections, including COVID-19, influenza, and Dengue virus, as well as bacterial infections (sepsis), have been shown to alter the function of monocytes (–). Furthermore, there is growing recognition of the ability of these viral infections to induce innate immune memory, that include epigenetic remodeling of bone marrow-derived hematopoietic stem cells and myeloid precursor cells (–). These epigenetically remodeled myeloid precursors have a limited capacity for self-renewal, leading to their expansion following primary viral infection (–).
The human genome contains approximately 3,200 near full-length autonomous human endogenous retroviral (HERV) genomes, which are distributed across the 23 chromosomes (, ). These autonomous HERV proviral genomes contain long terminal repeats (LTR) capable of promoting RNA transcription (, ). In quiescent, normal cells, only a subset of these autonomous HERV loci is transcriptionally active, while the most of the HERV loci are silenced in the resting cell (–). In a previous study, using Single-cell RNA sequencing (scRNA-seq) databases we analyzed the expression patterns of the autonomous HERVs in monocytes identified in resting and in vitro stimulated peripheral blood mononuclear cells (PBMC) (). We developed a method called Window-based HERV Alignment (WHA), which analyzes aligned DNA sequences using sequential, non-overlapping windows of defined nucleotide lengths. Samples are scored as either positive (9 or more good/usable windows detected) or negative (8 or fewer good/usable windows) () (See Supplementary Method). We established a control set consisting of 31 normal individuals identified from several different scRNA-seq analyses, in which the numbers of windows were fewer than 8 for selected HERV loci. Using this analysis, we found positive HERV loci expression patterns in monocytes by analyzing scRNA-seq datasets from PBMCs obtained from individuals with trauma or COVID-19, both of which are known to stimulate in vivo monocyte transcription. Since HERV expression is sensitive to gene silencing, our system provides a useful tool for assessing the impact of different pathological conditions that lead to the in vivo expression of previously silenced HERV loci in innate immune cells, such as the monocyte (, ).
In the current study, we used these 31 individual data sets as a pangenome control to compare the autonomous HERV loci transcriptomes in monocytes from patients with different diseases. WHA analysis revealed that the monocyte populations from acutely infected patients with COVID-19, influenza, Dengue virus, or sepsis exhibited distinct patterns of autonomous HERV loci transcriptome expression compared to the 31-pangenome controls. Additionally, using a dataset from patients with post-acute sequelae of severe acute COVID-19 (PASC) at 8 months post-acute infection, we identified a common set of HERV loci in monocytes present in all 12 patients. A subset of these HERV loci exhibited an increased number of good/usable windows (i.e., sequence length) and greater sequence depth compared to the same loci found in hospitalized patients with acute COVID-19 infections, who tested positive by antigen or RT-PCR. The most amplified HERV loci transcriptome was in the first intron of a cellular gene, JAKMIP2. Further analysis revealed the JAKMIP2, along with other nearby host genes in Chromosome 5 also had significantly increased expression. The patterns of co-amplified HERV loci with surrounding host gene transcripts in monocytes in patients with PASC are consistent with the establishment of trained innate immunity that is characterized by localized epigenetic remodeling of myeloid precursors (, ).
Results
In a previous study, we described Window-based HERV Alignment (WHA), in which DNA from scRNA-seq analysis is aligned to the 3,200 autonomous HERV loci in non-overlapping windows of defined nucleotide lengths (). Usable windows are defined as those with a read depth of 3 or greater and must have a minimum of 9 good/usable windows, corresponding to an extended (i.e., longer) expression of HERV transcripts. In contrast, those HERV loci designated as negative (8 or less windows) do not meet the necessary sequence read depth or number of good/usable windows as determined by WHA. We used the WHA to develop a pangenome control that consisted of scRNA-seq datasets from 31 normal individuals taken from multiple studies. The WHA data was then filtered to eliminate any of the 3,200 HERV loci that had at least one positive (i.e., WHA of greater than 9 windows) in the 31-pangenome samples () (See Supplementary Method).
HERV loci expression in monocytes from COVID-19 exposed, infected with no symptoms and acutely infected hospitalized patients
In the first analysis, we used three scRNA-seq datasets from hospitalized acute COVID-19 patients, along with an scRNA-seq dataset from individuals exposed to or infected with COVID-19 who showed no symptoms (, , , ). Each of these scRNA-seq datasets, along with the pangenome datasets, was first reanalyzed using Azimuth with the specific reference sequences used for PBMC (, ). We analyzed the PBMCs from the patients for monocytes and used the WHA to identify positive HERV loci. We then filtered these HERV loci against the normal individual controls for each study to identify those that were positive in the patients but negative in the controls. Using the comparison, we found no positive HERV loci in the individuals who were exposed to, or showed no COVID-19 symptoms () (Figure 1).
Figure 1
In contrast, we found positive HERV loci in monocytes from three independent studies of acutely infected, hospitalized COVID-19 patients. The patients in Lee et al. (herein referred to as Acute 1 group) and Unterman et al. (herein referred to as the Acute 2 group) recovered after hospitalization. In contrast, in Amrute et al. (herein referred to as the Acute 3 group), one set of patients recovered after hospitalization, while another set did not recover and died (Supplementary Table 1) (
Analysis of HERV loci expression in monocytes obtained from patients with different disease or vaccinated normal individuals
We next expanded the analysis to include scRNA-seq datasets from hospitalized patients with severe influenza, acute respiratory sepsis, and Dengue virus (Figure 2, Supplementary Table 3) (
Figure 2

Distribution of positive HERV loci found in patients infected with influenza, dengue virus or sepsis. The positive HERV loci found in PBMCs from patients infected with influenza, Dengue virus or sepsis is depicted in a vertical column. HERV loci that were accessed were depicted in a horizontal column. Red box indicates a positive HERV loci, yellow indicates a negative HERV loci below cut-off values for depth and windows. Numbers of patients per each group; Flu: 5, Dengue virus: 15, Sepsis: 4. Note some samples contained multiple time points – see Supplementary Table 3 for detailed information on identity of HERV loci.
Figure 3

Distribution of positive HERV loci found in individuals who were administered either the influenza or COVID-19 vaccines. The positive HERV loci found in PBMCs from patients who were administered either the influenza or COVID-19 vaccine is depicted in a vertical column. HERV loci that were accessed were depicted in a horizontal column. Red box indicates a positive HERV loci, yellow indicates a negative HERV loci below cut-off values for depth and windows. Number of individuals per each group; Flu vaccine: 1, COVID vaccine 6. Note some samples contained multiple time points – see Supplementary Table 4 for detailed information on identity of HERV loci.
Analysis of HERV loci expression in patients post acute COVID-19 infection
We next analyzed a scRNA-seq dataset from patients at early times post-acute COVID-19 infection. In the first dataset, two patient groups were analyzed for HERV loci expression: an early group that became RT-PCR negative at or before 7 days and a late group that became RT-PCR negative at 14 days or later (
Figure 4

Distribution of positive HERV loci found in individuals with early or late recovery, or those identified as PASC. The positive HERV loci found in PBMCs from patients post COVID-19 infection, categorized as early, late, or identified as PASC is depicted in a vertical column. HERV loci that were accessed were depicted in a horizontal column. Red box indicates a positive HERV loci, yellow indicates a negative HERV loci below cut-off values for depth and windows. Number of patients per each group; ERS: 5, LRS: 5, PASC 1 (8m): 12, PASC 2: 2. Note some samples contained multiple time points – see Supplementary Table 5 for detailed information on identity of HERV loci.
Numerous studies have described patients with post-acute sequelae of COVID-19 (PASC), also known as long-term COVID-19 (
In summary, we found that the three acute COVID-19 sample sets (totaling 51 samples) and the 12 samples from PASC patients had the highest number of unique HERV loci sites (Supplementary Table 6). A comparison between acute COVID-19 and PASC samples revealed 69 unique HERV loci in the acute samples and 52 unique loci in the PASC samples. Additionally, 51 positive HERV loci were identified in both acute and PASC samples, however, no single HERV locus was found in all acute and PASC patients.
To highlight the differences between acute, the early/late recovered (ERS/LRS), and the PASC samples, we analyzed the number of good/usable windows identified for each positive HERV locus (Figure 5, Supplementary Table 7). Overall, there was a clear difference in the distribution of HERV loci, with PASC samples exhibiting a higher number of good/usable windows compared to acute or early/late samples. The difference was most evident in HERV loci 001-017. In the acute samples, we identified one individual (251-0) with a cluster of HERV loci (HERV 114-121) showing a high number of good/usable windows, such as HERV-118 with HERV-339 good/usable windows. Interestingly, this elevated number of good/usable windows was not observed in the same individual 7 days later (251-7). To quantify these differences, we compared the number of good/usable windows in acute versus PASC samples. We identified 5 HERV loci present only in PASC patients (Figure 6A). The average number of good/usable windows for these 5 HERV loci ranged from 10 to 25. Additionally, we identified 6 HERV loci (HERV001, 004, 006, 012, 015, and 020) where the number of good/usable windows in PASC samples was significantly greater than in acute sample (Figure 6B, Supplementary Figure 1).
Figure 5

Distribution of positive windows found from analysis of samples from acute patients, early or late recovery patients and PASC patients. The number of the positive HERV loci for each locus were compiled and presented in a heatmap. The 8-month PASC patient sample set contained the greatest number of positive HERV loci. See Supplementary Table 7 for detailed information on identity of HERV loci.
Figure 6

Comparison of the number of positive windows obtained from PASC patients and those found in the remaining datasets (acute, vaccinated, sepsis, influenza, Dengue virus, influenza virus and 24-month post-COVID). The number of positive windows for Individual HERV loci in the sample set from 8-month PASC patients was compared with the combined number of positives for the same loci from the remaining datasets. (A) Positive HERV loci identified at specific loci in the 8-month PASC samples with no positive HERV loci detected in any other datasets. The average number of good/usable windows, along with the standard deviation, observed in the PASC samples is shown in the figure. (B) Positive HERV loci identified at specific loci in the 8-month PASC samples compared to those detected at the same loci in the remaining datasets. The average number of good/usable windows for the top HERV loci in the PASC group was used to perform a t-test against the acute group. The number of positive windows for 1) HERV001 loci in PASC samples versus those found in the other samples 163-7 (Acute 3, Amrute et al.), TS4-A (Acute 2, Unterman et al.), 2) HERV015 loci in PASC samples versus those found in the other samples 80-0, 80-7, 154-0, 163-0, 163-7, 251-0, 251-7 (Acute 3, Amrute et al.), 3) HERV012 loci in PASC samples versus those found in the other samples S1-nCOV1, S12-nCOV6 (Acute 1, Lee et al.) and NS-1A, TP-6A, TP-7A, TP9-B, TS-4A, 4) HERV020 loci in PASC samples versus those found in the other samples 145-0, 163-0, 163-7,272-0, 272-7 (Acute 3, Amrute et al.). "**" = p-value < 0.05, "***" = p-value < 0.001.
One of the most striking results was found from the analysis of HERV001, which was detected in all 12 PASC patients with good/usable window counts ranging from 172 to 332 that were all significantly different than that found in patients with acute COVID-19. In contrast, we found 5 HERV loci (HERV025, 054, 057, 097, and 106) that exhibited similar numbers of good/usable windows between acute and PASC samples, with no significant p-value observed (Supplementary Figure 2A). Additionally, 5 HERV loci (HERV083, 067, 087, 065, and 114) were detected only in the remaining COVID-19 patient samples and not in the PASC sample (Supplementary Figure 2B).
We next examined the sequencing read depth of positive HERV loci samples. We found that 2 HERV loci (HERV001 and 015) had significantly higher read depth in PASC samples compared to acute COVID-19 samples (Figure 7). In contrast, no significant difference in read depth was observed between acute and PASC samples for HERV020 and HERV012 (Figure 7, Supplementary Figure 3). These findings suggest two distinct patterns of HERV expression when comparing acute and PASC sample sets: (1) a pattern in which both the number of good/usable windows and sequencing depth are significantly greater in PASC samples compared to acute samples, and (2) a pattern in which PASC samples have significantly more good/usable windows than acute samples but without a significant difference in sequencing read depth.
Figure 7

Comparison of the sequence read depths of PASC patients at the same HERV loci in the remaining datasets. Comparison of the read depths for four HERV loci between the PASC dataset and the acute, vaccinated, sepsis, influenza and Dengue virus datasets using t-test (Supplementary Table 7). Any statistically significant p-value (p < 0.05) are shown in the figure (‘ns’ indicates not significant). (A) The sequence read depth for HERV001 loci in PASC samples versus those found in the acute sample 163-7 (Acute 3, Amrute et al.) and TS4A (Acute 2, Unterman et al.). (B) The sequence read depth for HERV015 loci in PASC samples versus those found in samples 80-0, 80-7, 154-0, 163-0, 163-7, 251-0, 251-7 (Acute 3, Amrute et al.) and LRS1 (ERS/LRS, Wen et al.). (C) The sequence read depth for HERV020 loci in PASC samples versus 145-0, 163-0, 163-7,272-0, 272-7 (Acute 3, Amrute et al) and ERS5 (ERS/LRS, Wen et al.). (D) The sequence read depth for HERV012 loci in PASC samples versus samples S1-nCOV1, S12-nCOV6 (Acute 1, Lee et al.) and NS-1A, TP-6A, TP-7A, TP9-B, TS-4A (Acute 2, Unterman et al.). "**" = p-value < 0.001, "ns"= not significant.
We next mapped the HERV loci identified in Figures 6A, B to the 23 chromosomes (Supplementary Figure 4). Overall, we did not observe clustering of the identified HERV loci; instead, they were in regions of the chromosomes that were not devoid of HERV loci. Chromosome 1 contained the highest number of HERV loci (2), while Chromosomes 2, 3, 5, 6, 7, 12, and X each contained a single HERV loci. HERV001 was mapped to chromosome 5 within an intron 1–2 of the janus kinase and microtubule interacting protein 2 (JAKMIP2) gene (Figure 8). In contrast, HERV015 was mapped to Chromosome 2 in an independent region that was near the Mer proto-oncogene tyrosine kinase (MERTK) which is highly expressed in macrophages, monocytes, and progenitor cells (
Figure 8

Location of Host of HERV-001 and surrounding genes on Chromosome 5. (A) Chromosome 5 is shown with the subregion 5q32 indicated. A red box highlights the genomic region containing HERV-001 and its neighboring host genes. (B) An expanded view of this 772 kb region includes the host genes JAKMIP2, DPYSL3, SPINK1, STK32A, and SCGB3A2 (highlighted in red). HERV-001, located within intron 1–2 of JAKMIP2, is marked with a blue box and annotated as HERV-001 (intron of JAKMIP2). The bolded JAKMIP2-HERV001-SPINK1 reflects the localized amplification of the expression of these genes in the monocytes from the PASC.
Since the HERV-001 locus was the most amplified, we next compared the expression of JAKMIP2 and the neighboring genes: STK32A, DPYSL3, SPINK1, and SCGB3A2. Using ANOVA followed by Tukey’s HSD test, we compared the following groups: control, acute3, PASC, ERS-LRS, and PASC-upto24m (Figure 9). From this analysis, we identified 3 genes (DPYSL3, JAKMIP2, and SPINK1) with significantly higher good/usable window numbers in PASC group. Additionally, 2 genes (SPINK1 and JAKMIP2) exhibited significantly higher read depth in the PASC group (Figure 10). In contrast, the host genes at a greater distance from JAKMIP2 and SPINK1, the expression as determined by number of windows and read depth of STK32A, DPYSL3 and SCGB3A2 were not different between control, acute 3 PASC, ERS-LRS and PASC-upto24m (Figure 9, Supplementary Table 9). Collectively, these results establish a pattern of co-amplified transcription of select HERV loci and neighboring host genes, consistent with localized epigenetic remodeling known to occur following COVID-19 infection (
Figure 9

Analysis of host gene expression in control, acute, early/late, PASC 24 month and 8 month PASC. For each host gene, the number of good/usable windows observed in each patient group was compared to the Control group using ANOVA followed by Tukey’s HSD test. Any statistically significant p-value (p < 0.05) are shown in the figure (‘ns’ indicates not significant). From this analysis, we identified 3 genes (DPYSL3, JAKMIP2, and SPINK1) with significantly higher numbers of good/usable windows in the PASC group.
Figure 10

Analysis of host gene expression in control, acute, early/late, PASC 24 month and 8-month PASC. For each host gene, the sequence depth observed in each patient group was compared to the Control group using ANOVA followed by Tukey’s HSD test. Any statistically significant p-value (p value < 0.05) are shown in the figure (‘ns’ indicates not significant). Two genes (SPINK1 and JAKMIP2) exhibited significantly higher read depth in the PASC group.
Discussion
In this study, we used our previously described WHA to characterize the expression of HERV loci in monocytes from healthy individuals and patients with infections, including COVID-19, influenza, sepsis and Dengue virus. We identified HERV loci that were expressed in infected patients compared to normal individuals and those who were exposed but did not develop an infection. We also analyzed post COVID-19 infection and found extensive positive HERV loci in monocytes from patients after infection and patients with PASC 8 months after acute COVID-19. Distinct patterns of HERV loci expression, determined by the number of good/usable windows and sequence depth, were observed when comparing acute COVID-19 and PASC patients. Finally, in these PASC patients, we demonstrate that the presence of localized transcription where both host genes and HERV loci were co-amplified due to localized epigenetic modification known to occur because of the expansion of myeloid precursors following COVID-19 infection.
In a previous study, we analyzed a scRNA-seq dataset taken from PBMC that had been stimulated in vitro with LPS, a known activator of innate immune cells such as monocytes (
One of the most striking findings in our study was the identification of positive HERV loci in patients even after the resolution of the acute phase of COVID-19. A sporadic pattern of HERV loci expression was observed in samples from patients at both early and late time points following acute infection as determined by RT-PCR analysis. The timing of sample collection after the acute infection in these patients then was well beyond the half-life of circulating monocytes (
Further analysis of the 12 individuals with PASC revealed that some of the positive HERV loci in the 12 PASC samples had a greater number of good/usable windows (indicative of longer transcript lengths), and sequence read depth when compared to samples from patients with acute COVID-19. One that stands out is the good window/depth amplification found in HERV001 that is located between introns 1–2 of the JAKMIP2 gene. In acutely infected COVID-19 patients, we found parallel expression of JAMMIP2 and HERV001 in a few samples, while in the PASC patients we found amplification of both JAKMIP2 and HERV001. Consistent with these results, a previous study noted that JAKMIP2 expression occurs in a temporal fashion in pluripotent stem cells during development (
Conclusion
Our results characterize the expression of select HERV loci following viral infections, which were not expressed in normal monocytes from patient controls. We found distinct patterns of expressed HERV loci in monocytes from PBMC across 3 acute COVID-19 datasets. These patterns were also different from those found in patients with influenza infections, Dengue virus infections and sepsis and in individuals who received the COVID-19 or influenza vaccine. Collectively, these findings are consistent with our previous study and support a patient specific expression pattern for the HERV loci (
Materials and methods
Datasets used in this study
In this study, we used 14 publicly available scRNA-seq datasets from 1) 6 healthy individuals from Amrute et al. (
scRNA-seq data with WHA analysis
The scRNA-seq PBMC datasets were downloaded in either fastq or bam format, depending on availability from NCBI SRA or GEO. If only bam files were available, they were converted to fastq format using 10x Genomics Cell Ranger software (v 7.1.0) bamtofastq (v 1.4.1) (
Processed fastq files were then used for Window-based HERV Alignment (WHA) analysis. Each sample was duplicated for WHA analysis and mapped to 3,220 HERV loci as reference data (Supplementary Table 11) using BWA (v 0.7.13) with minimum percent match threshold of >99% (
The HERV profiles of healthy individuals and patients were compared through manual filtering using Excel files to identify HERV loci unique to patient samples, ensuring that all control samples excluded any positive HERV loci. To achieve this, we selected only negative values for all healthy individuals, while ensuring that at least one positive value remained in the patient samples. From this analysis, a total of 208 HERV loci were selected for further analysis. Heatmaps were generated to visualize differences in HERV loci across various groups using STAMP (
To determine significant differences in sequencing depth and good/usable windows between the acute COVID-19 group and the long-term COVID-19 group, we performed a t-test using the ggplot2 package in R (https://cran.r-project.org/web/packages/ggpubr/index.html). A P value < 0.05 was considered statistically significant for this analysis.
Host gene analysis
To further investigate the genes, STK32A, DPYSL3, JAKMIP2, SPINK1, and SCGB3A2, which are located near our HERV locus of interest (HERV-001), we reprocessed the fastq files (monocytes PBMC) used in our WHA analysis. This time, all files were mapped using each gene as reference data. For each group (control, acute3, PASC, ERS/LRS, and PASC-upto24m), we recorded the sequence depth and number of good/usable windows for each gene. Additionally, we performed an ANOVA statistical analysis followed by Tukey’s HSD test to determine statistical significance (P value < 0.05) for each gene.
Statements
Data availability statement
The original sequencing dataset used in this study were downloaded from NCBI GEO or NGDC. Accession numbers are as follows: GSE192391, GSE226488, GSE162806, GSE166992, GSE149689, GSE171555, GSE155224, GSE260763, GSE148633, PRJCA002413, GSE235050, GSE262861, GSE151263, GSE220969.
Ethics statement
Ethical approval was not required for the studies involving humans because The original sequencing dataset used in this study were downloaded from NCBI GEO or NGDC. The studies were conducted in accordance with the local legislation and institutional requirements. The human samples used in this study were acquired NCBI GEO or NGDC. Written informed consent to participate in this study was not required fromn the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.
Author contributions
HK: Formal Analysis, Validation, Visualization, Writing – original draft, Writing – review & editing. CM: Conceptualization, Supervision, Validation, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1621657/full#supplementary-material
Supplementary Figure 1Comparison of the number of positive windows obtained from PASC patients and those found in the remaining datasets. The average number of good/usable windows, along with the standard deviation, observed in the PASC samples is shown in the figure.
Supplementary Figure 2Comparison of the number of positive windows obtained from PASC patients and those found in the remaining datasets. (A) The average number of good/usable windows, along with the standard deviation, observed in the PASC samples is shown in the figure. (B) Positive HERV loci identified not in the 8-month PASC group, but only in the remaining groups.
Supplementary Figure 3Comparison of the sequence read depths of PASC with the remaining data sets. Comparison of the read depths for two HERV loci between PASC and the remaining Acute COVID group using t-test.
Supplementary Figure 4Chromosomal distribution of HERV loci. This figure shows the full chromosomal map illustrating the distribution of all identified HERV loci across the human genome. The specific regions highlighted in red/blue correspond to the HERV loci used in Figures 6A and B. All other black markers represent the full set of HERV loci analyzed in this study. See Supplementary Table 11 for the full set of HERV genome coordinates.
References
1
TaksEJMoorlagSJNeteaMGVan-der-MeerJW. Shifting the immune memory paradigm: trained immunity in viral infections. Annu Rev Virol. (2022) 9:469–89. doi: 10.1146/annurev-virology-091919-072546
2
JanewayCAJr.MedzhitovR. Introduction: the role of innate immunity in the adaptive immune response. Semin Immunol. (1998) 10:349–50. doi: 10.1006/smim.1998.0142
3
WolfAAYáñezABarmanPKGoodridgeHS. The ontogeny of monocyte subsets. Front Immunol. (2019) 10:1642. doi: 10.3389/fimmu.2019.01642
4
GretenTFMannsMPKorangyF. Myeloid derived suppressor cells in human diseases. Int Immunopharmacol. (2011) 11:802–7. doi: 10.1016/j.intimp.2011.01.003
5
AgratiCSacchiABordoniVCiminiENotariSGrassiGet al. Expansion of myeloid-derived suppressor cells in patients with severe coronavirus disease (COVID-19). Cell Death Differentiation. (2020) 27:3196–207. doi: 10.1038/s41418-020-0572-6
6
JiangYRosboroughBRChenJDasSKitsiosGDMcVerryBJet al. Single cell RNA sequencing identifies an early monocyte gene signature in acute respiratory distress syndrome. JCI Insight. (2020) 5:e135678. doi: 10.1172/jci.insight.135678
7
UntermanASumidaTSNouriNYanXZhaoAYGasqueVet al. Single-cell multi-omics reveals dyssynchrony of the innate and adaptive immune system in progressive COVID-19. Nat Commun. (2022) 13:440. doi: 10.1038/s41467-021-27716-4
8
SrikiatkhachornAMathewARothmanAL. Immune-mediated cytokine storm and its role in severe dengue. Semin immunopathology. (2017) 39:563–574. doi: 10.1007/s00281-017-0625-1
9
WimmersFDonatoMKuoAAshuachTGuptaSLiCet al. The single-cell epigenomic and transcriptional landscape of immunity to influenza vaccination. Cell. (2021) 184:3915–35. doi: 10.1016/j.cell.2021.05.039
10
DavittEDavittCMazerMBAretiSSHotchkissRSRemyKE. COVID-19 disease and immune dysregulation. Best Pract Res Clin Haematology. (2022) 35:101401. doi: 10.1016/j.beha.2022.101401
11
Schulte-SchreppingJReuschNPaclikDBaßlerKSchlickeiserSZhangBet al. Severe COVID-19 is marked by a dysregulated myeloid cell compartment. Cell. (2020) 182:1419–40. doi: 10.1016/j.cell.2020.08.001
12
PhetsouphanhCDarleyDRWilsonDBHoweAMunierCMLPatelSKet al. Immunological dysfunction persists for 8 months following initial mild-to-moderate SARS-CoV-2 infection. Nat Immunol. (2022) 23:210–6. doi: 10.1038/s41590-021-01113-x
13
AmruteJMPerryAMAnandGCruchagaCHockKGFarnsworthCWet al. Cell specific peripheral immune responses predict survival in critical COVID-19 patients. Nat Commun. (2022) 13:882. doi: 10.1038/s41467-022-28505-3
14
DebisarunPAGösslingKLBulutOKilicGZoodsmaMLiuZet al. Induction of trained immunity by influenza vaccination-impact on COVID-19. PloS Pathog. (2021) 17:e1009928. doi: 10.1371/journal.ppat.1009928
15
LercherACheongJGBaleMJJiangCHoffmannHHAshbrookAWet al. Antiviral innate immune memory in alveolar macrophages following SARS-CoV-2 infection ameliorates secondary influenza A virus disease. Immunity. (2024) 57:2530–46. doi: 10.1016/j.immuni.2024.08.018
16
GuJLiuQZhangJXuS. COVID-19 and trained immunity: the inflammatory burden of long covid. Front Immunol. (2023) 14:1294959. doi: 10.3389/fimmu.2023.1294959
17
NeteaMGZiogasABennCSGiamarellos-BourboulisEJJoostenLAArditiMet al. The role of trained immunity in COVID-19: Lessons for the next pandemic. Cell Host Microbe. (2023) 31:890–901. doi: 10.1016/j.chom.2023.05.004
18
BoesMFalter-BraunP. Long-COVID-19: the persisting imprint of SARS-CoV-2 infections on the innate immune system. Signal transduction targeted Ther. (2023) 8:460. doi: 10.1038/s41392-023-01717-9
19
NasiriKMohammadzadehsalianiSKheradjooHShabestariAMEshaghizadehPPakmehrAet al. Spotlight on the impact of viral infections on Hematopoietic Stem Cells (HSCs) with a focus on COVID-19 effects. Cell Communication Signaling. (2023) 21:103. doi: 10.1186/s12964-023-01122-3
20
CheongJ-GRavishankarASharmaSParkhurstCNGrassmannSAWingertCKet al. Epigenetic memory of coronavirus infection in innate immune cells and their progenitors. Cell. (2023) 186:3882–902. doi: 10.1016/j.cell.2023.07.019
21
ChengHZhengZChengT. New paradigms on hematopoietic stem cell differentiation. Protein Cell. (2020) 11:34–44. doi: 10.1007/s13238-019-0633-0
22
ZaretskyAGEngilesJBHunterCA. Infection-induced changes in hematopoiesis. J Immunol. (2014) 192:27–33. doi: 10.4049/jimmunol.1302061
23
HircheCFrenzTHaasSFDöringMBorstKTegtmeyerPKet al. Systemic virus infections differentially modulate cell cycle state and functionality of long-term hematopoietic stem cells in vivo. Cell Rep. (2017) 19:2345–56. doi: 10.1016/j.celrep.2017.05.063
24
LanderESLintonLMBirrenBNusbaumCZodyMCBaldwinJet al. Erratum: Initial sequencing and analysis of the human genome: international human genome sequencing consortium (Nature (2001) 409 (860-921)). Nature. (2001) 412:565–6. doi: 10.1038/35057062
25
VargiuLRodriguez-ToméPSperberGOCadedduMGrandiNBlikstadVet al. Classification and characterization of human endogenous retroviruses; mosaic forms are common. Retrovirology. (2016) 13:1–29. doi: 10.1186/s12977-015-0232-y
26
CohenCJLockWMMagerDL. Endogenous retroviral LTRs as promoters for human genes: a critical assessment. Gene. (2009) 448:105–14. doi: 10.1016/j.gene.2009.06.020
27
TokuyamaMKongYSongEJayewickremeTKangIIwasakiA. ERVmap analysis reveals genome-wide transcription of human endogenous retroviruses. Proc Natl Acad Sci. (2018) 115:12565–72. doi: 10.1073/pnas.1814589115
28
ChuongEBEldeNCFeschotteC. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science. (2016) 351:1083–7. doi: 10.1126/science.aad5497
29
GeisFKGoffSP. Silencing and transcriptional regulation of endogenous retroviruses: an overview. Viruses. (2020) 12:884. doi: 10.3390/v12080884
30
KooHMorrowCD. Shared and unique patterns of autonomous human endogenous retrovirus loci transcriptomes in CD14+ monocytes from individuals with physical trauma or infection with COVID-19. Retrovirology. (2024) 21:17. doi: 10.1186/s12977-024-00652-z
31
MaoJZhangQZhuangYZhangYLiLPanJet al. Reactivation of senescence-associated endogenous retroviruses by ATF3 drives interferon signaling in aging. Nat Aging. (2024) p:1–19. doi: 10.1038/s43587-024-00745-6
32
MaoJZhangQCongY-S. Human endogenous retroviruses in development and disease. Comput Struct Biotechnol J. (2021) 19:5978–86. doi: 10.1016/j.csbj.2021.10.037
33
OchandoJMulderWJMadsenJCNeteaMGDuivenvoordenR. Trained immunity—basic concepts and contributions to immunopathology. Nat Rev Nephrol. (2023) 19:23–37. doi: 10.1038/s41581-022-00633-5
34
YuCLittletonSGirouxNSMathewRDingSKalnitskyJet al. Mucosal-associated invariant T cell responses differ by sex in COVID-19. Med. (2021) 2:755–72. doi: 10.1016/j.medj.2021.04.008
35
LeeJSParkSJeongHWAhnJYChoiSJLeeHet al. Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19. Sci Immunol. (2020) 5:eabd1554. doi: 10.1126/sciimmunol.abd1554
36
SatijaRFarrellJAGennertDSchierAFRegevA. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. (2015) 33:495–502. doi: 10.1038/nbt.3192
37
StuartTButlerAHoffmanPHafemeisterCPapalexiEMauckWMet al. Comprehensive integration of single-cell data. Cell. (2019) 177:1888–902. doi: 10.1016/j.cell.2019.05.031
38
GhitaLYaoZXieYDuranVCagiriciHBSamirJet al. Global and cell type-specific immunological hallmarks of severe dengue progression identified via a systems immunology approach. Nat Immunol. (2023) 24:2150–63. doi: 10.1038/s41590-023-01654-3
39
TerzoliSMarzanoPCazzettaVPiazzaRSandrockIRavensSet al. Expansion of memory Vδ2 T cells following SARS-CoV-2 vaccination revealed by temporal single-cell transcriptomics. Vaccines. (2024) 9:63. doi: 10.1038/s41541-024-00853-9
40
WenWSuWTangHLeWZhangXZhengYet al. Immune cell profiling of COVID-19 patients in the recovery stage by single-cell sequencing. Cell Discov. (2020) 6:31. doi: 10.1038/s41421-020-0168-9
41
PelusoMJKellyJDLuSGoldbergSADavidsonMCMathurSet al. Persistence, magnitude, and patterns of postacute symptoms and quality of life following onset of SARS-CoV-2 infection: cohort description and approaches for measurement. Open Forum Infect diseases. (2022) 9:ofab640. doi: 10.1093/ofid/ofab640
42
PelusoMJDeitchmanANTorresLIyerNSMunterSENixonCCet al. Long-term SARS-CoV-2-specific immune and inflammatory responses in individuals recovering from COVID-19 with and without post-acute symptoms. Cell Rep. (2021) 36:109518. doi: 10.1016/j.celrep.2021.109518
43
FilesJKBoppanaSPerezMDSarkarSLowmanKEQinKet al. Sustained cellular immune dysregulation in individuals recovering from SARS-CoV-2 infection. J Clin Invest. (2021) 131:e140491. doi: 10.1172/JCI140491
44
FilesJKSarkarSFramTRBoppanaSSterrettSQinKet al. Duration of post–COVID-19 symptoms is associated with sustained SARS-CoV-2–specific immune responses. JCI Insight. (2021) 6:e151544. doi: 10.1172/jci.insight.151544
45
YouMChenLZhangDZhaoPChenZQinEQet al. Single-cell epigenomic landscape of peripheral immune cells reveals establishment of trained immunity in individuals convalescing from COVID-19. Nat Cell Biol. (2021) 23:620–30. doi: 10.1038/s41556-021-00690-1
46
YinKPelusoMJLuoXThomasRShinMGNeidlemanJet al. Long COVID manifests with T cell dysregulation, inflammation and an uncoordinated adaptive immune response to SARS-CoV-2. Nat Immunol. (2024) 25:218–25. doi: 10.1038/s41590-023-01724-6
47
PhetsouphanhCJackaBBallouzSJacksonKJWilsonDBManandharBet al. Improvement of immune dysregulation in individuals with long COVID at 24-months following SARS-CoV-2 infection. Nat Commun. (2024) 15:3315. doi: 10.1038/s41467-024-47720-8
48
MonacoGLeeBXuWMustafahSHwangYYCarreCet al. RNA-Seq signatures normalized by mRNA abundance allow absolute deconvolution of human immune cell types. Cell Rep. (2019) 26:1627–40. doi: 10.1016/j.celrep.2019.01.041
49
ChenLOzatoK. Innate immune memory in hematopoietic stem/progenitor cells: myeloid-biased differentiation and the role of interferon. Front Immunol. (2021) 12:621333. doi: 10.3389/fimmu.2021.621333
50
DerboisCPalomaresMADeleuzeJFCabannesEBonnetE. Single cell transcriptome sequencing of stimulated and frozen human peripheral blood mononuclear cells. Sci Data. (2023) 10:433. doi: 10.1038/s41597-023-02348-z
51
NagaiYGarrettKPOhtaSBahrunUKouroTAkiraSet al. Toll-like receptors on hematopoietic progenitor cells stimulate innate immune system replenishment. Immunity. (2006) 24:801–12. doi: 10.1016/j.immuni.2006.04.008
52
AmarasingheHEZhangPWhalleyJPAllcockAMiglioriniGBrownACet al. Mapping the epigenomic landscape of human monocytes following innate immune activation reveals context-specific mechanisms driving endotoxin tolerance. BMC Genomics. (2023) 24:595. doi: 10.1186/s12864-023-09663-0
53
PatelAAGinhouxFYonaS. Monocytes, macrophages, dendritic cells and neutrophils: an update on lifespan kinetics in health and disease. Immunology. (2021) 163:250–61. doi: 10.1111/imm.v163.3
54
RommelMGWalzLFotopoulouFKohlscheenSSchenkFMiskeyCet al. Influenza A virus infection instructs hematopoiesis to megakaryocyte-lineage output. Cell Rep. (2022) 41:111447. doi: 10.1016/j.celrep.2022.111447
55
MillrudCRBergenfelzCLeanderssonK. On the origin of myeloid-derived suppressor cells. Oncotarget. (2016) 8:3649. doi: 10.18632/oncotarget.12278
56
BernardoASJouneauAMarksHKenschePKobolakJFreudeKet al. Mammalian embryo comparison identifies novel pluripotency genes associated with the naïve or primed state. Biol Open. (2018) 7:bio033282. doi: 10.1242/bio.033282
57
YiSZhangCLiMQuTWangJ. Machine learning and experiments identifies SPINK1 as a candidate diagnostic and prognostic biomarker for hepatocellular carcinoma. Discover Oncol. (2023) 14:231. doi: 10.1007/s12672-023-00849-2
58
ElahiS. Hematopoietic responses to SARS-CoV-2 infection. Cell Mol Life Sci. (2022) 79:187. doi: 10.1007/s00018-022-04220-6
59
JernPCoffinJM. Effects of retroviruses on host genome function. Annu Rev Genet. (2008) 42:709–32. doi: 10.1146/annurev.genet.42.110807.091501
60
KinTOnoY. Idiographica: a general-purpose web application to build idiograms on-demand for human, mouse and rat. Bioinformatics. (2007) 23:2945–6. doi: 10.1093/bioinformatics/btm455
61
TreangenTJSalzbergSL. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet. (2012) 13:36–46. doi: 10.1038/nrg3117
62
LiMZhangXAngKSLingJSethiRLeeNYSet al. DISCO: a database of Deeply Integrated human Single-Cell Omics data. Nucleic Acids Res. (2022) 50:D596–602. doi: 10.1093/nar/gkab1020
63
ChenTDelanoMJChenKSperryJLNamasRALamparelloAJet al. A road map from single-cell transcriptome to patient classification for the immune response to trauma. JCI Insight. (2021) 6:145108. doi: 10.1172/jci.insight.145108
64
ThompsonEACascinoKOrdonezAAZhouWVaghasiaAHamacherBradyAet al. Metabolic programs define dysfunctional immune responses in severe COVID-19 patients. Cell Rep. (2021) 34:108863. doi: 10.1016/j.celrep.2021.108863
65
TurnerJSZhouJQHanJSchmitzAJRizkAAAlsoussiWBet al. Human germinal centres engage memory and naive B cells after influenza vaccination. Nature. (2020) 586:127–32. doi: 10.1038/s41586-020-2711-0
66
ZhengGXZhouJQHanJSchmitzAJRizkAAAlsoussiWBet al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. (2017) 8:14049. doi: 10.1038/ncomms14049
67
ParksDHTysonGWHugenholtzPBeikoRG. STAMP: statistical analysis of taxonomic and functional profiles. Bioinformatics. (2014) 30:3123–4. doi: 10.1093/bioinformatics/btu494
68
TeamRC. R: A language and environment for statistical computing [Computer software manual]. Austria: Vienna (2016).
Summary
Keywords
post-acute COVID sequelae, monocytes, epigenetic remodeling, ScRNA-seq, human endogenous retrovirus (HERV)
Citation
Koo H and Morrow CD (2025) Amplification of select autonomous HERV loci and surrounding host gene transcription in monocytes from patients with post-acute sequelae of COVID-19. Front. Immunol. 16:1621657. doi: 10.3389/fimmu.2025.1621657
Received
01 May 2025
Accepted
05 June 2025
Published
26 June 2025
Volume
16 - 2025
Edited by
Prof. Pei-Hui Wang, Shandong University, China
Reviewed by
Xiaodi Yang, Peking University, China
Hsiang-Chi Huang, National Cheng Kung University, Taiwan
Updates

Check for updates
Copyright
© 2025 Koo and Morrow.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hyunmin Koo, khmkhm87@uab.edu; Casey D. Morrow, caseym@uab.edu
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.