Abstract
Outbreaks of disease caused by coronaviruses, which infect the respiratory tracts of birds and mammals, appear to be increasing worldwide. The dangerous coronaviruses that have emerged recently are MERS-CoV, SARS-CoV, and SARS-CoV-2, which cause respiratory illness and even failure of several organs. However, the full impact of coronavirus infection on host cells remains elusive. In this study, we analyzed the transcriptomes of human lung-derived cells infected with MERS-CoV, SARS-CoV, or SARS-CoV-2, and observed that infection with each of these coronaviruses increased retrotransposon expression, accompanied by upregulation of TET genes. Retrotransposon upregulation was also observed in SARS-CoV-2-infected human intestinal organoids. Retrotransposon upregulation may lead to increased genome instability and to enhanced expression of genes through readthrough from retrotransposons. Therefore, people with a higher basal level of retrotransposon expression, such as cancer patients and aged people, may have an increased risk of symptomatic infection. Additionally, we show evidence supporting long-term epigenetic inheritance of retrotransposon upregulation. We also observed chimeric transcripts of retrotransposon and SARS-CoV-2 RNA, which could allow viral fragments to invade the human genome; the front and rear parts of the SARS-CoV-2 genome formed chimeric RNA more readily. Thus, we suggest that primers and probes for nucleic acid detection be designed in the middle of the viral genome to increase the probability of identifying live virus. In summary, we hypothesize that coronavirus invades human cells and interacts with retrotransposons, eliciting more severe symptoms in patients with underlying diseases. In treating patients with coronavirus infection, it may be necessary to pay more attention to the potential harm caused by retrotransposon dysregulation.
Introduction
Emerging coronaviruses often spread rapidly from person to person, and outbreaks of the diseases they cause appear to be increasing worldwide. MERS-CoV and SARS-CoV are two rare coronavirus strains that cause not only severe lung infection but also serious complications (Ksiazek et al., 2003; Rota et al., 2003; Zaki et al., 2012; Arabi et al., 2017). More recently, coronavirus disease 2019 (COVID-19), caused by the novel coronavirus SARS-CoV-2, has spread rapidly across the globe, raising urgent health issues (Chan et al., 2020; Guan et al., 2020; Huang et al., 2020; Zhou et al., 2020). Although the cell receptors and routes of infection of these coronaviruses have been identified (Li et al., 2003; Raj et al., 2013; Zhou et al., 2020; Wrapp et al., 2020), their complex impact on human cells is far from clear.
Transposable elements (TEs) are mobile DNA elements found in virtually all eukaryotes and comprise more than 40% of the human genome (Dewannieux et al., 2003). They can self-replicate and insert into various locations in the genome. Dysregulation of TEs may lead to various illnesses such as inflammatory diseases (Saleh et al., 2019). The only active TEs are retrotransposons, which “copy and paste” themselves through an RNA intermediate. Examples of retrotransposons include long interspersed nuclear elements (LINEs), short interspersed nuclear elements (SINEs), and long terminal repeats (LTRs). Expression of most retrotransposons is suppressed in somatic cells; they are active only in the brain, germ cells, early embryos, and pathological conditions (Munoz-Lopez et al., 2016). About 5% of newborn babies show a new retrotransposon integration event (Cordaux et al., 2006). Abnormal upregulation of retrotransposons causes insertions, deletions, and inversions in the genome (Gilbert et al., 2002; Symer et al., 2002), resulting in compromised genetic stability and even cell death (Malki et al., 2014; Newkirk et al., 2017). Evidence accumulated in recent years has also shown their importance in the orchestration of gene expression (Izsvak et al., 2016), regulation of chromatin structure (Fadloun et al., 2013), and modulation of developmental programs (Percharde et al., 2018; Lu et al., 2020).
LINEs are common autonomous retrotransposons and comprise about 17% of the human genome (Cordaux and Batzer, 2009). Some LINE-1 elements can be transcribed and translated in cells. After reverse transcription of LINE-1 RNA, the resulting cDNA can be integrated back into the genome (Babushok et al., 2006). Normally, LINE expression is repressed in most cell types. LINE RNA is mainly heritable during early embryogenesis because of its enrichment and high retrotransposition activity in early embryos (Grow et al., 2015). A transgenic mouse model carrying a mouse/human LINE-1 retrotransposition reporter demonstrated that this activity creates somatic mosaicism during development (Kano et al., 2009). Besides LINEs, SINEs and LTRs are also abundant retrotransposons in the human genome, and mobilization of SINEs relies on LINE-1-encoded proteins (Dewannieux et al., 2003).
In this study, we analyzed publicly available transcriptome data from human cells infected with the coronaviruses MERS-CoV, SARS-CoV, and SARS-CoV-2, and observed enhanced expression of TEs, including several retrotransposons, as well as of genes related to inflammation, immunity, and apoptosis. We further noticed potential fusion of SARS-CoV-2 RNA with retrotransposon transcripts, especially those of LINEs and SINEs. Therefore, further examination of the genomes and transcriptomes of cells from patients and model systems will be valuable for evaluating potential crosstalk between coronaviruses and retrotransposons.
Methods
Cell Types Used for Transcriptome Study of Coronavirus Infection
The following cell types were used in this study: Calu-3, a human lung cancer cell line; MRC5, a human fetal lung cell strain; A549, a human adenocarcinomic alveolar basal epithelial cell line; and NHBE, primary human bronchial epithelial cells. Each group of these cells had three replicates. For human intestinal organoids, each group had two replicates.
RNA-Seq Data Processing
Raw reads were quality-trimmed with cutadapt v1.16 using default parameters except for quality-cutoff=20 and pair-filter=both. To include as many non-uniquely mapped reads as possible, trimmed reads were first aligned to the human or mouse genome (hg19 or mm10) with STAR (v2.5.1b) using otherwise default settings and the parameters “--outFilterMismatchNmax 10 --winAnchorMultimapNmax 2000 --outFilterMultimapNmax 1000”. RSEM was used to calculate FPKM values of genes. The annotation and FASTA files of consensus transposable element sequences were downloaded from Repbase (version 20.01) (Bao et al., 2015). The TEtranscripts program (Jin et al., 2015) was used with default parameters to obtain counts for transposable elements. Read counts of gene and TE transcripts were normalized by total aligned counts. For alignment of RNA-seq reads to coronavirus genomes, the MERS-CoV (NC_019843), SARS-CoV (NC_004718), and SARS-CoV-2 (NC_045512) genomes were downloaded from NCBI, and trimmed reads were aligned to the corresponding coronavirus genome with STAR (v2.5.1b) using default parameters. To identify potential chimeric transcripts of coronavirus and cellular RNA from single-end RNA-seq data, the 30 nt at each end of every raw read were extracted, and both ends were aligned to the human and SARS-CoV-2 genomes. The non-viral ends of the chimeric reads were mapped to consensus transposable element sequences using STAR with the parameters “--winAnchorMultimapNmax 2000 --outFilterMultimapNmax 1000” to obtain transposon counts. Integrative Genomics Viewer (IGV) and the UCSC Genome Browser were used for snapshots of the transcriptome. The R package DESeq2 was used to identify differentially expressed genes. Metascape was used to visualize functional profiles of genes and gene clusters (Zhou et al., 2019). Graphs were created with R or Excel. Images were organized with Adobe Illustrator.
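As a rough illustration of the trimming and multi-mapper-tolerant alignment steps, the sketch below assembles the two command lines in Python. It assumes paired-end input; the file names, index directory, thread count, output prefix, and the helper names `cutadapt_cmd` and `star_cmd` are hypothetical, and only the option values quoted in the text are taken from this study.

```python
import shlex

def cutadapt_cmd(r1, r2, out1, out2):
    """Quality-trim a read pair with the two non-default cutadapt options from the text."""
    return [
        "cutadapt",
        "--quality-cutoff", "20",   # trim low-quality 3' ends (Q < 20)
        "--pair-filter=both",       # discard a pair only if both mates fail filtering
        "-o", out1, "-p", out2,     # trimmed mate 1 / mate 2 outputs
        r1, r2,
    ]

def star_cmd(genome_dir, reads, prefix, threads=8):
    """Align trimmed reads while retaining heavily multi-mapped (repeat-derived) reads."""
    return [
        "STAR",
        "--runThreadN", str(threads),            # assumption: thread count is arbitrary
        "--genomeDir", genome_dir,               # assumption: pre-built STAR index
        "--readFilesIn", *reads,
        "--outFilterMismatchNmax", "10",
        "--winAnchorMultimapNmax", "2000",
        "--outFilterMultimapNmax", "1000",
        "--outFileNamePrefix", prefix,
    ]

if __name__ == "__main__":
    trim = cutadapt_cmd("s_1.fastq", "s_2.fastq", "s_1.trim.fastq", "s_2.trim.fastq")
    align = star_cmd("hg19_star_index", ["s_1.trim.fastq", "s_2.trim.fastq"], "s_")
    print(shlex.join(trim))
    print(shlex.join(align))
```

Building the commands as argument lists keeps them easy to inspect or to pass to `subprocess.run` without shell quoting problems; the two multimap limits are what allow reads from repetitive elements to be kept rather than discarded.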
Accession Number
RNA sequencing data of coronavirus-infected human lung-derived cells are from GSE122876 (transcriptome of MERS-CoV-infected Calu-3 cells; single-read; MOI 2, treated for 24 h) (Yuan et al., 2019), GSE56192 (transcriptome of MERS-CoV- and SARS-CoV-infected MRC5 cells; paired-end; MOI 2, treated for 24 h), and GSE147507 (transcriptome of SARS-CoV-2-infected A549 cells, Calu-3 cells, and NHBE cells; MOI 2, treated for 24 h) (Blanco-Melo et al., 2020). RNA sequencing data of SARS-CoV-2-infected human intestinal organoids are from GSE149312 (MOI 1, treated for 24 and 60 h, grown in differentiation medium) (Lamers et al., 2020). SARS-CoV-2-infected Calu-3 cells were used to identify chimeric transcripts of coronavirus and cellular RNA. RNA sequencing data of IRF1 knockout and control human hepatocytes infected with hepatitis A virus are from GSE114916. RNA sequencing data of STAT1 knockout and control human HepG2 cells treated with IFN are from GSE98372 (Chen et al., 2017). RNA sequencing data of human tissues and cell types are from GSE83115 (Zhu et al., 2016). RNA sequencing data of human early embryos and embryonic stem cells are from GSE36552 (Yan et al., 2013). RNA sequencing data of 8-cell mouse embryos and adult mouse islets developed from zygotes injected with sperm tsRNAs from high-fat-diet males are from GSE75544 (Chen et al., 2016).
Results and Discussion
Coronavirus Infection Disturbs Diverse Biological Processes in Human Cells and Can Stimulate ACE2 Expression Through IRF1 and STAT1
Coronavirus infection leads not only to respiratory failure but also to multiple organ dysfunction syndrome, indicating that coronaviruses affect a wide range of human cells (Wang et al., 2020). Transcriptome analysis may provide valuable information on how human cells respond to coronavirus entry.
To examine whether coronavirus infection disturbs the expression of specific gene sets in human cells, we analyzed publicly available RNA-seq data from human lung-derived cells infected with MERS-CoV, SARS-CoV, or SARS-CoV-2. By comparing transcriptomes before and after infection, we identified thousands of dysregulated genes (adjusted p-value < 0.05) in each group (Figure 1A). Among these, 26 genes were commonly upregulated after infection with the three coronaviruses (Figure 1B), whereas very few genes were commonly downregulated (Figure 1C). GO analysis of the 26 commonly upregulated genes showed enrichment in pathways related to inflammation, immunity, and apoptosis (Figure 1B). Based on the relative viral sequence content of the transcriptomes, the three coronaviruses can infect various human lung-derived cells (Figure 1D); however, a low viral dose or the use of NHBE cells did not support coronavirus replication (Figure S1).
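The selection of commonly upregulated genes can be pictured as a threshold filter followed by a set intersection. The sketch below assumes each infection-versus-mock comparison yields a table of log2 fold changes and adjusted p-values; the gene names, the numbers, and the helper names `upregulated` and `common_genes` are hypothetical and are not values from this study.

```python
def upregulated(results, padj_cutoff=0.05):
    """Return genes with adjusted p-value below the cutoff and positive log2 fold change."""
    return {
        gene
        for gene, (log2fc, padj) in results.items()
        if padj is not None and padj < padj_cutoff and log2fc > 0
    }

def common_genes(*gene_sets):
    """Genes shared by every set (empty if no sets are given)."""
    return set.intersection(*gene_sets) if gene_sets else set()

# Toy differential-expression tables: gene -> (log2 fold change, adjusted p-value).
# Gene names and numbers are placeholders, not values from the study.
mers = {"GENE_A": (2.1, 0.001), "GENE_B": (1.4, 0.20), "GENE_C": (-1.0, 0.01)}
sars = {"GENE_A": (1.7, 0.004), "GENE_B": (1.9, 0.01), "GENE_C": (-0.8, 0.03)}
sars2 = {"GENE_A": (3.0, 0.0001), "GENE_B": (2.2, 0.002), "GENE_C": (0.5, 0.04)}

shared_up = common_genes(upregulated(mers), upregulated(sars), upregulated(sars2))
print(sorted(shared_up))  # ['GENE_A']
```

In this toy input only `GENE_A` passes the cutoff with a positive fold change in all three comparisons, which mirrors how a small shared set can emerge from thousands of per-virus hits.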
Figure 1
ACE2 is the cell receptor of SARS-CoV-2 (Zhou et al., 2020; Wrapp et al., 2020). In contrast to the robust expression of ACE2 in Calu-3 cells, ACE2 expression was undetectable in A549 cells, but a low level of ACE2 was observed after SARS-CoV-2 infection (Figure 1E). This indicates that transcription factors responding to coronavirus infection induced ACE2 expression. A recent report showed that ACE2 can be stimulated by interferon and proposed IRF1- and STAT1-binding sites near the ACE2 transcription start site (Figure S2) (Ziegler et al., 2020). Here, we noticed that expression of both IRF1 and STAT1 was increased after SARS-CoV-2 infection, and that ACE2 expression was reduced when IRF1 was depleted in virus-infected human cells or STAT1 was depleted in interferon-treated human cells (Figure 1E). These results confirm that IRF1 and STAT1 are essential upstream activators of ACE2 upon virus infection. We therefore propose that SARS-CoV-2 might enter A549 cells with low efficiency by bulk-phase endocytosis, inducing IRF1 and STAT1 expression, which in turn enhances ACE2 expression to facilitate receptor-mediated viral entry.
Coronavirus Infection Enhanced Retrotransposon Expression in Human Lung-Derived Cells
Next, we asked whether TE expression is affected by coronavirus infection. We first examined the transcriptome of the human lung adenocarcinoma cell line Calu-3 after 24 h of MERS-CoV infection (Yuan et al., 2019). Expression of TEs, including retrotransposons, was generally activated after coronavirus infection (Figure 2A). Further examination showed that subfamilies of LINEs, SINEs, and LTRs were upregulated to different extents by coronavirus (Figure 2B). LINE-1 is the most well-studied autonomous retrotransposon. Most LINE-1 elements are inactivated in somatic cells, but some escape the various silencing mechanisms that have evolved. Hence, we asked whether evolutionarily old and young retrotransposons were affected differently by coronavirus infection. We compared the fold change in expression of specific LINE-1 elements ordered by predicted evolutionary age (Khan et al., 2006), and found that older and younger LINE-1 elements were similarly affected (Figure 2C). One of the major mechanisms of LINE-1 silencing is DNA methylation, so we examined the expression of genes encoding DNA methyltransferases (DNMTs) and the ten-eleven translocation (TET) enzymes that mediate active DNA demethylation. TET genes were generally upregulated after coronavirus infection (Figure 2D), and increased DNA demethylation activity may lead to demethylation of retrotransposon promoters. This result supports the idea that increased retrotransposon expression was caused by genome-wide DNA demethylation. We obtained similar results in MERS-CoV- or SARS-CoV-infected MRC5 cells, which are noncancerous human lung fibroblasts (Figures 2A–D).
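The comparison of LINE-1 subfamilies by evolutionary age can be sketched as a normalization by total aligned reads followed by a log2 fold change per subfamily. In the example below, the per-million scaling, the pseudocount, the read counts, and the helper names `normalize` and `log2_fold_change` are assumptions made for illustration; only the idea of normalizing by total aligned counts and ordering subfamilies by age comes from the text.

```python
import math

def normalize(counts, total_aligned, scale=1e6):
    """Scale raw counts by total aligned reads (reads per million aligned)."""
    return {name: c * scale / total_aligned for name, c in counts.items()}

def log2_fold_change(infected, mock, pseudocount=1.0):
    """log2((infected + p) / (mock + p)) for every element present in both samples."""
    return {
        name: math.log2((infected[name] + pseudocount) / (mock[name] + pseudocount))
        for name in infected
        if name in mock
    }

# Youngest-to-oldest ordering of a few LINE-1 subfamilies (illustrative subset).
AGE_ORDER = ["L1HS", "L1PA2", "L1PA3", "L1PA4", "L1PA5"]

# Hypothetical raw counts; not data from the study.
mock_raw = {"L1HS": 100, "L1PA2": 200, "L1PA3": 300, "L1PA4": 400, "L1PA5": 500}
inf_raw = {"L1HS": 300, "L1PA2": 600, "L1PA3": 900, "L1PA4": 1200, "L1PA5": 1500}

mock = normalize(mock_raw, total_aligned=10_000_000)
inf = normalize(inf_raw, total_aligned=15_000_000)
lfc = log2_fold_change(inf, mock)
for name in AGE_ORDER:
    print(f"{name}\t{lfc[name]:.2f}")
```

If the fold changes show no trend along `AGE_ORDER`, young and old subfamilies respond similarly, which is the pattern reported in Figure 2C.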
Figure 2
The recent COVID-19 outbreak is caused by the novel coronavirus SARS-CoV-2. Here, we explored the transcriptomes of SARS-CoV-2-infected A549 and Calu-3 cells. As with MERS-CoV and SARS-CoV infection, we found a general increase in multiple transposable elements (Figures 2A, B) and no bias between older and younger LINE-1 elements in the response to SARS-CoV-2 infection (Figure 2C). SARS-CoV-2 infection also upregulated TET gene expression (Figure 2D). Similarly, SARS-CoV-2 was able to infect human intestinal organoids (Figure 2E), and retrotransposon expression increased after infection in a time-dependent manner (Figure 2F).
Therefore, retrotransposon upregulation seems to be a common event induced by coronavirus infection, possibly through enhanced global DNA demethylation activity. Despite similar upregulation of retrotransposon families by the three coronaviruses, individual retrotransposons are dysregulated differently, which may cause various phenotypes in human cells. Note that the above results were from 24-h coronavirus infections, and the impact of long-term infection is expected to be more severe. Moreover, retrotransposons can encode proteins and form retrovirus-like particles (Grow et al., 2015), so examination of coronavirus-infected samples may need to discriminate coronavirus from retrovirus-like particles arising from retrotransposon upregulation.
Upregulation of Retrotransposon May Be Long-Term Memorized Epigenetically
We then asked whether retrotransposon upregulation can be inherited long term through several generations of cell division. A mouse model of transgenerational epigenetic inheritance of acquired traits may provide molecular insights into this question.
tRNA-derived small RNAs (tsRNAs) in sperm were reported to transmit abnormal epigenetic information to the preimplantation embryo, and the epigenetic abnormality was further inherited by adult tissue, causing metabolic disorders (Chen et al., 2016). Two kinds of tsRNAs were previously found to regulate LTR retrotransposons (Schorn et al., 2017), so we asked whether abnormal retrotransposon activity is heritable during this process. We analyzed the transcriptomes of cleavage-stage mouse embryos and adult islets derived from zygotes injected with sperm tsRNAs from male mice fed a normal or high-fat diet (HFD). LINE, SINE, and LTR retrotransposons were all upregulated in 8-cell embryos when HFD tsRNAs were injected (Figure 3A). Notably, LTR retrotransposons were also upregulated in adult islets (Figure 3B). Further analysis of LTR families supported that upregulation of ERV1 expression was inherited from early embryos (Figure 3C) to adult islets (Figure 3D), probably through inheritance of DNA methylation at ERV1 loci. These results indicate that enhanced retrotransposon expression, ERV1 in this case, may be inherited long term, even from cleavage-stage embryos to adult tissues, with changes in DNA methylation as the potential molecular mechanism (Figure 3E).
Figure 3
SARS-CoV-2 RNA May Form Chimeric Transcripts With Retrotransposon RNA Especially LINE for Potential Insertion Into Host Genome
Coronaviruses are RNA viruses and are not expected to integrate into the host genome by themselves. However, several RNA viruses were reported to be able to recombine with retrotransposons and invade the host genome (Geuking et al., 2009). Given that SARS-CoV-2 RNA contributed as much as 15.32% of the total transcriptome in infected Calu-3 cells (Figure 1D), we searched the transcriptome for potential chimeric transcripts of SARS-CoV-2 and cellular RNA, and obtained a subtranscriptome of chimeric reads.
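The chimeric-read search described in the Methods can be pictured as follows: take the 30 nt at each end of a read, map each end separately, and keep reads whose two ends land on different genomes. The sketch below is a simplified illustration; `toy_map` is a hypothetical exact-match stand-in for the STAR alignments used in the study, and the reference sequences are invented.

```python
def split_ends(seq, k=30):
    """Return the first and last k bases of a read, or None if the ends would overlap."""
    if len(seq) < 2 * k:
        return None
    return seq[:k], seq[-k:]

def is_chimeric(left_hit, right_hit):
    """A read is a virus-host candidate when its two ends map to different genomes."""
    return {left_hit, right_hit} == {"human", "virus"}

def toy_map(fragment, references):
    """Stand-in for an aligner: name of the first reference containing the fragment exactly."""
    for name, sequence in references.items():
        if fragment in sequence:
            return name
    return None

# Invented reference sequences, used only to exercise the logic.
refs = {"human": "ACGT" * 20 + "TTTTGGGGCCCC" * 5, "virus": "GATTACA" * 20}
# A synthetic read joining 35 nt of "human" to 35 nt of "virus".
read = refs["human"][10:45] + refs["virus"][7:42]

left, right = split_ends(read)
left_hit, right_hit = toy_map(left, refs), toy_map(right, refs)
print(left_hit, right_hit, is_chimeric(left_hit, right_hit))  # human virus True
```

Reads shorter than twice the end length are skipped here because their two ends would overlap; how such reads were handled in the study is not stated.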
We found that 0.23% of SARS-CoV-2 RNA formed chimeric transcripts with non-TE genes and 0.14% with TEs (Figure 4A). Surprisingly, TE-virus chimeric reads made up 37.36% of total mapped chimeric reads, whereas TE reads made up only 2.83% of total mapped reads (Figure 4B), indicating that TEs form chimeric transcripts with SARS-CoV-2 RNA much more efficiently than non-TE genes do. We randomly extracted reads from the subtranscriptome of chimeric transcripts of SARS-CoV-2 and cellular RNA, and confirmed the identity of the chimeric reads (Figure 4C).
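The size of this enrichment follows directly from the two percentages reported for Figure 4B. The short calculation below is only an arithmetic illustration; the function name `fold_enrichment` is hypothetical.

```python
def fold_enrichment(share_in_subset, share_in_background):
    """Ratio of a category's share in a subset to its share in the background."""
    if share_in_background <= 0:
        raise ValueError("background share must be positive")
    return share_in_subset / share_in_background

te_share_chimeric = 37.36  # % of mapped chimeric reads that involve a TE (Figure 4B)
te_share_total = 2.83      # % of all mapped reads that are TE-derived (Figure 4B)

print(f"{fold_enrichment(te_share_chimeric, te_share_total):.1f}-fold")  # 13.2-fold
```

In other words, TE-derived sequence is roughly 13 times more frequent among chimeric reads than in the transcriptome as a whole.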
Figure 4
We further analyzed the distribution of TE subfamilies in the total transcriptome and in the subtranscriptome of chimeric reads, and found that LINE, SINE, and LTR retrotransposon reads were all enriched in the subtranscriptome of chimeric reads (Figure 4D). Unexpectedly, only LINE RNA was overrepresented in the subtranscriptome of chimeric reads relative to the total transcriptome, and further analysis showed that virus-LINE-1 reads were overrepresented among virus-LINE reads (Figure 4E). This demonstrates the high efficiency of the LINE family, especially LINE-1, in forming chimeric transcripts with SARS-CoV-2 RNA. LINE-1 is an autonomous retrotransposon with retrotransposition activity, and RNA-RNA ligation mediated by the endogenous RNA ligase RtcB was previously reported to allow LINE-1 to carry other types of RNA into the host genome (Moldovan et al., 2019), so similar mechanisms may apply to SARS-CoV-2 transcripts. Further examination of genomic DNA from SARS-CoV-2-infected human cells or biopsies will be particularly important to identify whether coronavirus RNA integrates into the human genome.
Moreover, to identify which regions of SARS-CoV-2 RNA are prone to form chimeric transcripts with cellular RNA, we extracted the SARS-CoV-2 reads from the subtranscriptome of chimeric transcripts, aligned them to the SARS-CoV-2 genome, and viewed the alignment in IGV. The front and rear parts of the coronavirus RNA, especially the rear part, were overrepresented in chimeric transcripts (Figure 4F). Further examination showed that only the rear part of SARS-CoV-2 RNA is prone to form chimeric RNA with TEs/LINEs (Figure 4F). However, more direct evidence is needed to prove the existence of chimeric transcripts and potential integration into the human genome, for example, through genome sequencing of blood cells from coronavirus-infected patients. Based on the above analysis, we suggest that primers and probes for SARS-CoV-2 testing be designed in the middle of the SARS-CoV-2 genome.
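A simple way to apply this suggestion is to check whether a candidate amplicon lies away from both ends of the genome. The sketch below is illustrative only: the text does not define where the "front" and "rear" parts end, so the 25% flank fraction, the amplicon coordinates, and the helper names `region` and `amplicon_in_middle` are assumptions.

```python
GENOME_LENGTH = 29903  # length of the SARS-CoV-2 reference NC_045512 in nucleotides

def region(position, genome_length=GENOME_LENGTH, flank_fraction=0.25):
    """Label a 1-based coordinate as 'front', 'middle', or 'rear'.

    flank_fraction is an assumed cutoff, not a value from the study.
    """
    if not 1 <= position <= genome_length:
        raise ValueError("position outside genome")
    flank = int(genome_length * flank_fraction)
    if position <= flank:
        return "front"
    if position > genome_length - flank:
        return "rear"
    return "middle"

def amplicon_in_middle(start, end, **kwargs):
    """True when both ends of an amplicon fall in the middle region."""
    return region(start, **kwargs) == "middle" and region(end, **kwargs) == "middle"

print(amplicon_in_middle(14000, 14120))  # hypothetical mid-genome amplicon -> True
print(amplicon_in_middle(28300, 28400))  # hypothetical amplicon near the 3' end -> False
```

A stricter or looser definition of the flanks can be explored by changing `flank_fraction`; the appropriate cutoff would need to be derived from coverage profiles such as the one in Figure 4F.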
Our Hypothesis on Coronavirus-Retrotransposon Interaction
Based on the above analysis, we hypothesize that coronavirus infection may increase retrotransposon expression by modulating TET activity to reduce global DNA methylation. The increased retrotransposon RNA may then form chimeric transcripts with coronavirus RNA and integrate viral genomic fragments into the human genome. Moreover, enforced retrotransposon expression may be harmful and is probably inherited long term (Figure 5A).
Figure 5
TEs are widely expressed in human tissues (Figure 5B), with the highest enrichment in early human embryos (Figure 5C). The cells used in this study are mainly derived from human lung and also robustly express TEs (Figure 5D). Moreover, expression of TE subfamilies varies among cell types (Figures 5E–G), suggesting that global retrotransposon upregulation has extensive but cell type-specific phenotypic consequences.
The first concern regarding global retrotransposon upregulation is genome instability. Retrotransposition activity is high in the early embryo (Grow et al., 2015) and brain (Zhao et al., 2019) during normal development, so potential integration of coronavirus sequences into the human genome should be scrutinized in these cells. Retrotransposon upregulation was also reported to correlate positively with tumor progression (Jung et al., 2018) and to cause genomic deletion, translocation, and duplication (Rodriguez-Martin et al., 2020). Moreover, increased expression of the retrotransposon LINE-1 contributes to age-associated inflammation in several tissues (De Cecco et al., 2019). Additionally, vapers and smokers show higher retrotransposon expression and hypomethylation at the associated loci (Caliri et al., 2020). People with neurological disorders may also have higher retrotransposon expression and retrotransposition activity (Terry and Devine, 2019). These reports not only show that upregulation of retrotransposon expression may cause several diseases, but also suggest that people with a higher basal level of retrotransposon expression may be more susceptible to coronavirus infection and have an increased risk of symptomatic infection. In support of this, recent analyses of SARS-CoV-2 patients showed that cancer patients (Liang et al., 2020) and aged people (Wu et al., 2020) develop more severe symptoms after infection. Therefore, inhibition of reverse transcriptase activity in human cells may be necessary during pharmaceutical treatment of coronavirus-infected patients, especially those with a higher basal level of retrotransposon expression.
The second concern regarding global retrotransposon upregulation is disturbance of the expression of genes adjacent to retrotransposons. Accumulating evidence shows that retrotransposons are not just genomic fossils but have molecular functions. For example, a physically adjacent retrotransposon activates the promoter of TMEM156 or MYADM by a readthrough mechanism (Figure 5H, Figures S3-S5, Table S1) in both SARS-CoV-2-infected A549 and Calu-3 cells, and readthrough at the BCL3 gene is seen in SARS-CoV-2-infected A549 cells (Figure S6). Also, transcripts of LINEs, SINEs, and low-complexity repeats physically interact with specific genomic regions to play distinct roles (Ding et al., 2004).
The third concern regarding global retrotransposon upregulation is whether coronavirus RNA can enter the nucleus and associate with specific genomic regions through sequence homology, similar to the behavior of retrotransposon RNA (Ding et al., 2004; Fadloun et al., 2013). BLAST analysis at NCBI using the SARS-CoV-2 genome found no similar sequence in the human genome. We further used the CENSOR program (Jurka, 1998) to analyze the SARS-CoV-2 genome, and all predicted candidate repetitive elements were shorter than 200 bp. Therefore, no evidence supports the idea that SARS-CoV-2 RNA can recognize the human genome through homologous sequences, even if these transcripts enter the nucleus by chance.
Conclusions
Taken together, we show that coronavirus infection increases retrotransposon expression in human cells, possibly through global DNA hypomethylation, and that the increased retrotransposon RNA may form chimeric transcripts with coronavirus RNA, allowing viral genomic fragments to integrate into the human genome. The enhanced retrotransposon expression may be inherited long term and harm host organs. Therefore, we propose that retrotransposon upregulation induced by coronavirus infection may contribute to coronavirus-caused symptoms, and we suggest careful transcriptome examination and genetic testing in future investigations of coronavirus-infected patients. Finally, we note that our hypothesis needs more direct validation.
Funding
This work was supported by the National Key R&D Program of China [2018YFC1004502, 2018YFC1004001] and the National Natural Science Foundation of China [NSFC 31771661, 32000488].
Statements
Data availability statement
RNA sequencing data of MERS-CoV-infected Calu-3 cells (Yuan et al., 2019) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA506733 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA506733/). RNA sequencing data of MERS-CoV-infected and SARS-CoV-infected MRC5 cells were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA233943 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA233943). RNA sequencing data of SARS-CoV-2-infected A549 cells, Calu-3 cells, and NHBE cells (Blanco-Melo et al., 2020) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA615032 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA615032). RNA sequencing data of SARS-CoV-2-infected human intestinal organoids (Lamers et al., 2020) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA628628 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA628628/). RNA sequencing data of IRF1 knockout and control human hepatocytes infected with hepatitis A virus were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA473130 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA473130/). RNA sequencing data of STAT1 knockout and control human HepG2 cells treated with IFN (Chen et al., 2017) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA384926 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA384926/). RNA sequencing data of human tissues and cell types (Zhu et al., 2016) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA324812 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA324812/). RNA sequencing data of human early embryos and embryonic stem cells (Yan et al., 2013) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA153427 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA153427/).
RNA sequencing data of 8-cell mouse embryos and adult mouse islets developed from zygotes injected with sperm tsRNAs from high-fat-diet males (Chen et al., 2016) were obtained from the NCBI Sequence Read Archive with BioProject ID: PRJNA304514 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA304514/).
Author contributions
L-QZ and XH conceived and designed the project. YY analyzed the data and wrote the manuscript. X-ZL performed analysis on chimeric transcripts. L-QZ and XH revised the manuscript. All authors contributed to the article and approved the submitted version.
Acknowledgments
We thank Dr. Bing Li from Shanghai Jiao Tong University for assistance with data analysis of retrotransposon expression. This manuscript has been released as a preprint at ResearchSquare (Yin et al., 2020).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2021.609160/full#supplementary-material
Supplementary Figure 1. Related to Figure 1: viral dose and cell type influence SARS-CoV-2 replication. Bar graph indicates the percentage of reads mapped to the SARS-CoV-2 genome relative to total mapped reads in human cells infected with SARS-CoV-2.
Supplementary Figure 2. Related to Figure 1: potential binding sites of IRF1 and STAT1 at human ACE2. Scheme displays the locations of potential genomic binding sites of IRF1 and STAT1 near the transcription start site of the human ACE2 gene.
Supplementary Figure 3. Related to Figure 5H: representative alignment near the SINE-UTR junction at the TMEM156 locus in SARS-CoV-2-infected A549 or Calu-3 cells.
Supplementary Figure 4. UCSC Genome Browser view of an example of retrotransposon-initiated MYADM gene expression by the readthrough mechanism.
Supplementary Figure 5. Related to Figure S4: representative alignment near the LTR-UTR junction at the MYADM locus in SARS-CoV-2-infected A549 or Calu-3 cells.
Supplementary Figure 6. Representative alignment near the SINE-UTR junction at the BCL3 locus in SARS-CoV-2-infected A549 cells.
Supplementary Table 1. The ratio of readthrough reads to total initiating TE reads for the TMEM156, MYADM, and BCL3 genes.
Abbreviations
LINE, long interspersed nuclear element; LTR, long terminal repeat; SINE, short interspersed nuclear element; TE, transposable element.
References
1
ArabiY. M.BalkhyH. H.HaydenF. G.BouchamaA.LukeT.BaillieJ. K.et al. (2017). Middle East Respiratory Syndrome. N Engl. J. Med.376 (6), 584–594. doi: 10.1056/NEJMsr1408795
2
BabushokD. V.OstertagE. M.CourtneyC. E.ChoiJ. M.KazazianH. H. Jr. (2006). L1 integration in a transgenic mouse model. Genome Res.16 (2), 240–250. doi: 10.1101/gr.4571606
3
BaoW.KojimaK. K.KohanyO. (2015). Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA6, 11. doi: 10.1186/s13100-015-0041-9
4
Blanco-MeloD.Nilsson-PayantB. E.LiuW. C.UhlS.HoaglandD.MollerR.et al. (2020). Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19. Cell181 (5), 1036–45.e9. doi: 10.1016/j.cell.2020.04.026
5
CaliriA. W.CaceresA.TommasiS.BesaratiniaA. (2020). Hypomethylation of LINE-1 repeat elements and global loss of DNA hydroxymethylation in vapers and smokers. Epigenetics 15 (8), 816–829. doi: 10.1080/15592294.2020.1724401
6
ChanJ. F.YuanS.KokK. H.ToK. K.ChuH.YangJ.et al. (2020). A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet395 (10223), 514–523. doi: 10.1016/S0140-6736(20)30154-9
7
ChenQ.YanM.CaoZ.LiX.ZhangY.ShiJ.et al. (2016). Sperm tsRNAs contribute to intergenerational inheritance of an acquired metabolic disorder. Science351 (6271), 397–400. doi: 10.1126/science.aad7977
8
ChenK.LiuJ.LiuS.XiaM.ZhangX.HanD.et al. (2017). Methyltransferase SETD2-Mediated Methylation of STAT1 Is Critical for Interferon Antiviral Activity. Cell170 (3), 492–506. doi: 10.1016/j.cell.2017.06.042
9
CordauxR.BatzerM. A. (2009). The impact of retrotransposons on human genome evolution. Nat. Rev. Genet.10 (10), 691–703. doi: 10.1038/nrg2640
10
CordauxR.HedgesD. J.HerkeS. W.BatzerM. A. (2006). Estimating the retrotransposition rate of human Alu elements. Gene373, 134–137. doi: 10.1016/j.gene.2006.01.019
11
De CeccoM.ItoT.PetrashenA. P.EliasA. E.SkvirN. J.CriscioneS. W.et al. (2019). L1 drives IFN in senescent cells and promotes age-associated inflammation. Nature566 (7742), 73–78. doi: 10.1038/s41586-018-0784-9
12
DewannieuxM.EsnaultC.HeidmannT. (2003). LINE-mediated retrotransposition of marked Alu sequences. Nat. Genet.35 (1), 41–48. doi: 10.1038/ng1223
13
DingY.HeL.ZhangQ.HuangZ.CheX.HouJ.et al. (2004). Organ distribution of severe acute respiratory syndrome (SARS) associated coronavirus (SARS-CoV) in SARS patients: implications for pathogenesis and virus transmission pathways. J. Pathol.203 (2), 622–630. doi: 10.1002/path.1560
14
FadlounA.Le GrasS.JostB.Ziegler-BirlingC.TakahashiH.GorabE.et al. (2013). Chromatin signatures and retrotransposon profiling in mouse embryos reveal regulation of LINE-1 by RNA. Nat. Struct. Mol. Biol.20 (3), 332–338. doi: 10.1038/nsmb.2495
15
GeukingM. B.WeberJ.DewannieuxM.GorelikE.HeidmannT.HengartnerH.et al. (2009). Recombination of retrotransposon and exogenous RNA virus results in nonretroviral cDNA integration. Science323 (5912), 393–396. doi: 10.1126/science.1167375
16
GilbertN.Lutz-PriggeS.MoranJ. V. (2002). Genomic deletions created upon LINE-1 retrotransposition. Cell110 (3), 315–325. doi: 10.1016/S0092-8674(02)00828-0
Grow, E. J., Flynn, R. A., Chavez, S. L., Bayless, N. L., Wossidlo, M., Wesche, D. J., et al. (2015). Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature 522 (7555), 221–225. doi: 10.1038/nature14308
Guan, W. J., Ni, Z. Y., Hu, Y., Liang, W. H., Ou, C. Q., He, J. X., et al. (2020). Clinical Characteristics of Coronavirus Disease 2019 in China. N. Engl. J. Med. 382 (18), 1708–1720. doi: 10.1056/NEJMoa2002032
Huang, C., Wang, Y., Li, X., Ren, L., Zhao, J., Hu, Y., et al. (2020). Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395 (10223), 497–506. doi: 10.1016/S0140-6736(20)30183-5
Izsvak, Z., Wang, J., Singh, M., Mager, D. L., Hurst, L. D. (2016). Pluripotency and the endogenous retrovirus HERVH: Conflict or serendipity? Bioessays 38 (1), 109–117. doi: 10.1002/bies.201500096
Jin, Y., Tam, O. H., Paniagua, E., Hammell, M. (2015). TEtranscripts: a package for including transposable elements in differential expression analysis of RNA-seq datasets. Bioinformatics 31 (22), 3593–3599. doi: 10.1093/bioinformatics/btv422
Jung, H., Choi, J. K., Lee, E. A. (2018). Immune signatures correlate with L1 retrotransposition in gastrointestinal cancers. Genome Res. 28 (8), 1136–1146. doi: 10.1101/gr.231837.117
Jurka, J. (1998). Repeats in genomic DNA: mining and meaning. Curr. Opin. Struct. Biol. 8 (3), 333–337. doi: 10.1016/S0959-440X(98)80067-5
Kano, H., Godoy, I., Courtney, C., Vetter, M. R., Gerton, G. L., Ostertag, E. M., et al. (2009). L1 retrotransposition occurs mainly in embryogenesis and creates somatic mosaicism. Genes Dev. 23 (11), 1303–1312. doi: 10.1101/gad.1803909
Khan, H., Smit, A., Boissinot, S. (2006). Molecular evolution and tempo of amplification of human LINE-1 retrotransposons since the origin of primates. Genome Res. 16 (1), 78–87. doi: 10.1101/gr.4001406
Ksiazek, T. G., Erdman, D., Goldsmith, C. S., Zaki, S. R., Peret, T., Emery, S., et al. (2003). A novel coronavirus associated with severe acute respiratory syndrome. N. Engl. J. Med. 348 (20), 1953–1966. doi: 10.1056/NEJMoa030781
Lamers, M. M., Beumer, J., van der Vaart, J., Knoops, K., Puschhof, J., Breugem, T. I., et al. (2020). SARS-CoV-2 productively infects human gut enterocytes. Science 369 (6499), 50–54. doi: 10.1126/science.abc1669
Li, W., Moore, M. J., Vasilieva, N., Sui, J., Wong, S. K., Berne, M. A., et al. (2003). Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus. Nature 426 (6965), 450–454. doi: 10.1038/nature02145
Liang, W., Guan, W., Chen, R., Wang, W., Li, J., Xu, K., et al. (2020). Cancer patients in SARS-CoV-2 infection: a nationwide analysis in China. Lancet Oncol. 21 (3), 335–337. doi: 10.1016/S1470-2045(20)30096-6
Lu, J. Y., Shao, W., Chang, L., Yin, Y., Li, T., Zhang, H., et al. (2020). Genomic Repeats Categorize Genes with Distinct Functions for Orchestrated Regulation. Cell Rep. 30 (10), 3296–3311.e5. doi: 10.1016/j.celrep.2020.02.048
Malki, S., van der Heijden, G. W., O’Donnell, K. A., Martin, S. L., Bortvin, A. (2014). A role for retrotransposon LINE-1 in fetal oocyte attrition in mice. Dev. Cell 29 (5), 521–533. doi: 10.1016/j.devcel.2014.04.027
Moldovan, J. B., Wang, Y., Shuman, S., Mills, R. E., Moran, J. V. (2019). RNA ligation precedes the retrotransposition of U6/LINE-1 chimeric RNA. Proc. Natl. Acad. Sci. U.S.A. 116 (41), 20612–20622. doi: 10.1073/pnas.1805404116
Munoz-Lopez, M., Vilar-Astasio, R., Tristan-Ramos, P., Lopez-Ruiz, C., Garcia-Perez, J. L. (2016). Study of Transposable Elements and Their Genomic Impact. Methods Mol. Biol. 1400, 1–19. doi: 10.1007/978-1-4939-3372-3_1
Newkirk, S. J., Lee, S., Grandi, F. C., Gaysinskaya, V., Rosser, J. M., Vanden Berg, N., et al. (2017). Intact piRNA pathway prevents L1 mobilization in male meiosis. Proc. Natl. Acad. Sci. U.S.A. 114 (28), E5635–E5644. doi: 10.1073/pnas.1701069114
Percharde, M., Lin, C. J., Yin, Y., Guan, J., Peixoto, G. A., Bulut-Karslioglu, A., et al. (2018). A LINE1-Nucleolin Partnership Regulates Early Development and ESC Identity. Cell 174 (2), 391–405.e19. doi: 10.1016/j.cell.2018.05.043
Raj, V. S., Mou, H., Smits, S. L., Dekkers, D. H., Muller, M. A., Dijkman, R., et al. (2013). Dipeptidyl peptidase 4 is a functional receptor for the emerging human coronavirus-EMC. Nature 495 (7440), 251–254. doi: 10.1038/nature12005
Rodriguez-Martin, B., Alvarez, E. G., Baez-Ortega, A., Zamora, J., Supek, F., Demeulemeester, J., et al. (2020). Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition. Nat. Genet. 52 (3), 306–319. doi: 10.1038/s41588-019-0562-0
Rota, P. A., Oberste, M. S., Monroe, S. S., Nix, W. A., Campagnoli, R., Icenogle, J. P., et al. (2003). Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science 300 (5624), 1394–1399. doi: 10.1126/science.1085952
Saleh, A., Macia, A., Muotri, A. R. (2019). Transposable Elements, Inflammation, and Neurological Disease. Front. Neurol. 10, 894. doi: 10.3389/fneur.2019.00894
Schorn, A. J., Gutbrod, M. J., LeBlanc, C., Martienssen, R. (2017). LTR-Retrotransposon Control by tRNA-Derived Small RNAs. Cell 170 (1), 61–71. doi: 10.1016/j.cell.2017.06.013
Symer, D. E., Connelly, C., Szak, S. T., Caputo, E. M., Cost, G. J., Parmigiani, G., et al. (2002). Human L1 retrotransposition is associated with genetic instability in vivo. Cell 110 (3), 327–338. doi: 10.1016/S0092-8674(02)00839-5
Terry, D. M., Devine, S. E. (2019). Aberrantly High Levels of Somatic LINE-1 Expression and Retrotransposition in Human Neurological Disorders. Front. Genet. 10, 1244. doi: 10.3389/fgene.2019.01244
Wang, C., Horby, P. W., Hayden, F. G., Gao, G. F. (2020). A novel coronavirus outbreak of global health concern. Lancet 395 (10223), 470–473. doi: 10.1016/S0140-6736(20)30185-9
Wrapp, D., Wang, N., Corbett, K. S., Goldsmith, J. A., Hsieh, C. L., Abiona, O., et al. (2020). Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367 (6483), 1260–1263. doi: 10.1126/science.abb2507
Wu, J. T., Leung, K., Bushman, M., Kishore, N., Niehus, R., de Salazar, P. M., et al. (2020). Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China. Nat. Med. 26 (4), 506–510. doi: 10.1038/s41591-020-0822-7
Yan, L., Yang, M., Guo, H., Yang, L., Wu, J., Li, R., et al. (2013). Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat. Struct. Mol. Biol. 20 (9), 1131–1139. doi: 10.1038/nsmb.2660
Yin, Y., Liu, X., He, X., Zhou, L. (2020). Exogenous coronavirus interacts with endogenous retrotransposon in human cells. Research Square [Preprint]. doi: 10.21203/rs.3.rs-40063/v1
Yuan, S., Chu, H., Chan, J. F., Ye, Z. W., Wen, L., Yan, B., et al. (2019). SREBP-dependent lipidomic reprogramming as a broad-spectrum antiviral target. Nat. Commun. 10 (1), 120. doi: 10.1038/s41467-018-08015-x
Zaki, A. M., van Boheemen, S., Bestebroer, T. M., Osterhaus, A. D., Fouchier, R. A. (2012). Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N. Engl. J. Med. 367 (19), 1814–1820. doi: 10.1056/NEJMoa1211721
Zhao, B., Wu, Q., Ye, A. Y., Guo, J., Zheng, X., Yang, X., et al. (2019). Somatic LINE-1 retrotransposition in cortical neurons and non-brain tissues of Rett patients and healthy individuals. PLoS Genet. 15 (4), e1008043. doi: 10.1371/journal.pgen.1008043
Zhou, Y., Zhou, B., Pache, L., Chang, M., Khodabakhshi, A. H., Tanaseichuk, O., et al. (2019). Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10 (1), 1523. doi: 10.1038/s41467-019-09234-6
Zhou, P., Yang, X. L., Wang, X. G., Hu, B., Zhang, L., Zhang, W., et al. (2020). A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 579 (7798), 270–273. doi: 10.1038/s41586-020-2012-7
Zhu, J., Chen, G., Zhu, S., Li, S., Wen, Z., Bin, L., et al. (2016). Identification of Tissue-Specific Protein-Coding and Noncoding Transcripts across 14 Human Tissues Using RNA-seq. Sci. Rep. 6, 28400. doi: 10.1038/srep28400
Ziegler, C. G. K., Allon, S. J., Nyquist, S. K., Mbano, I. M., Miao, V. N., Tzouanas, C. N., et al. (2020). SARS-CoV-2 receptor ACE2 is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues. Cell 181 (5), 1016–1035. doi: 10.1016/j.cell.2020.04.035
Keywords: coronavirus, retrotransposon, SARS-CoV-2, TET, long interspersed nuclear element
Citation: Yin Y, Liu X, He X and Zhou L (2021) Exogenous Coronavirus Interacts With Endogenous Retrotransposon in Human Cells. Front. Cell. Infect. Microbiol. 11:609160. doi: 10.3389/fcimb.2021.609160
Received: 29 October 2020; Accepted: 18 January 2021; Published: 25 February 2021
Volume: 11 - 2021
Edited by: Jianfeng Dai, Soochow University, China
Reviewed by: Ting Ni, Fudan University, China; Sevgi Marakli, Amasya University, Turkey
Copyright © 2021 Yin, Liu, He and Zhou.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Li-quan Zhou, zhouliquan@hust.edu.cn; Ximiao He, XimiaoHe@hust.edu.cn
This article was submitted to Virus and Host, a section of the journal Frontiers in Cellular and Infection Microbiology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.