Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol., 19 September 2025

Sec. Virus and Host

Volume 15 - 2025 | https://doi.org/10.3389/fcimb.2025.1623390

This article is part of the Research TopicStrategies of Virus PersistenceView all articles

A highly divergent sample from a nearly extinct SARS-CoV-2 lineage in a patient with long-term COVID-19

Elena Nabieva&#x;Elena Nabieva1†Galya V. Klink*&#x;Galya V. Klink2*†Andrey B. Komissarov&#x;Andrey B. Komissarov3†Stanislav V. ZaitsevStanislav V. Zaitsev4Maria SergeevaMaria Sergeeva3Artem V. FadeevArtem V. Fadeev3Kseniya KomissarovaKseniya Komissarova3Anna IvanovaAnna Ivanova3Maria PisarevaMaria Pisareva3Kira KudryaKira Kudrya3Daria DanilenkoDaria Danilenko3Dmitry LioznovDmitry Lioznov3Ryan HisnerRyan Hisner5Federico GueliFederico Gueli6Thomas P. PeacockThomas P. Peacock7Cornelius Roemer,Cornelius Roemer8,9Georgii A. BazykinGeorgii A. Bazykin1
  • 1Russian Academy of Sciences, Moscow, Russia
  • 2International Laboratory of Statistical and Computational Genomics, HSE University, Moscow, Russia
  • 3Smorodintsev Research Institute of Influenza, Saint-Petersburg, Russia
  • 4Kaluga Regional Specialized Centre for Infectious Diseases and AIDS, Kaluga, Russia
  • 5University of Cape Town, Rondebosch, South Africa
  • 6Independent Researcher, Como, Italy
  • 7Department of Infectious Disease, Imperial College London, London, United Kingdom
  • 8Biozentrum, University of Basel, Basel, Switzerland
  • 9Swiss Institute of Bioinformatics, Lausanne, Switzerland

Background: Although COVID-19 is primarily an acute disease, there are cases of persistent infection, primarily in immunocompromised patients. It is hypothesized that at least some variants of concern (VOCs) have arisen in such persistent cases, with the virus “spilling over” into the general population after accumulating intra-host mutations. Additionally, a growing body of evidence hints at the gastrointestinal (GI) tract as a reservoir of long-term infection, at least in some cases.

Results: We report the genomic analysis of a highly divergent SARS-CoV-2 sample obtained in October 2022 from an HIV+ patient with presumably long-term COVID-19 infection. Phylogenetic analysis indicates that the sample is characterized by a gain of 89 mutations since divergence from its nearest sequenced neighbor, which had been collected in September 2020 and belongs to the B.1.1 lineage, largely extinct by 2022. Of these mutations, 33 were nonsynonymous and occurred in the Spike protein. Of these, 17 are lineage-defining in some VOCs or are at sites where another mutation is lineage-defining in a VOC, and/or have been shown to be involved in antibody evasion, and/or have been detected in other cases of persistent COVID-19; these include some “usual suspects,” such as Spike:L452R, E484Q, K417T, Y453F, and N460K. Molecular clock analysis indicates that mutations in this lineage accumulated at an increased rate compared with the ancestral B.1.1 strain. This increase is driven by the accumulation of non-synonymous mutations, with an average dN/dS value of 2.2, indicating strong positive selection during within-patient evolution. Additionally, the presence of mutations that are rare in the general population samples but common in samples from wastewater suggests that the virus had persisted for at least some time in the GI tract.

Conclusions: Our analysis adds to the growing body of evidence that the evolution of SARS-CoV-2 in chronically infected patients can be a major source of novel epidemiologically important variants and points to the potential role of the GI tract in long-term infection.

Introduction

Although COVID-19 is typically an acute disease lasting days or weeks, rare cases of persistent SARS-CoV-2 infection are known, and these are usually associated with suppressed host immune systems (Choi et al., 2020; Kemp et al., 2021; Harari et al., 2022; Wilkinson et al., 2022; Stanevich et al., 2023). In such infections, mutation accumulation continues throughout the disease and tends to be faster than in inter-person transmission (Leung et al., 2022; Quaranta et al., 2022; Stanevich et al., 2023). It is hypothesized that this long-term mutational accumulation under weakened immune pressure sometimes allows the virus to evolve the beneficial combinations of mutations with immune evasion potential and that this mechanism could explain the emergence of variants of concern (VOCs) on long phylogenetic branches (Rambaut et al., 2020; Hill et al., 2022; Viana et al., 2022; Neverov et al., 2023). Certain mutations appear repeatedly in different patients with long-term infection; some of them are also lineage-defining in variants of interest (VOIs) or VOCs. The “hotspots” for such mutations are primarily in the Spike gene; Spike:L452R and E484K are prominent examples (Wilkinson et al., 2022). Moreover, the increased substitution rate observed in the Spike protein in some cases of long-term infection mirrors the increased substitution rate in the Spike protein associated with the emergence of the Omicron variant (Ma et al., 2022), lending further evidence in favor of the persistent-infection hypothesis of the origin of Omicron (Du et al., 2022).

While SARS-CoV-2 has primarily been associated with respiratory syndrome, it has been detected in various organs (Van Cleemput et al., 2021; Stein et al., 2022). An early study of intra-host viral evolution in an immunosuppressed patient detected organ-specific mutations, some of which would be lineage-defining in VOCs that emerged after that patient’s death (Van Cleemput et al., 2021). GI symptoms in particular are now considered part of the common COVID-19 symptom set (CDC, 2020). Consistent with this observation is the detection via autopsy of viral RNA in the intestines of some patients (Stein et al., 2022). Intriguingly, wastewater sequencing, which presumably detects virus particles shed through the GI tract, has discovered, at high frequency, certain mutations that are otherwise rare in the general-population sequencing associated with respiratory transmission (Gregory et al., 2022). These mutations may therefore be adaptive specifically to the GI tract. Furthermore, there is evidence that at least in some patients, the virus persists in the GI tract after it is no longer detected oropharyngeally (Natarajan et al., 2022). Together, these findings suggest that the GI tract may act as a sort of reservoir where the virus resides after the acute respiratory phase in at least some long-COVID patients.

Here, we describe the case of an HIV+ patient with a highly divergent SARS-CoV-2 genotype identified in October 2022 that is likely a result of very long-term infection. Interestingly, the patient was initially admitted to the hospital with GI symptoms. Consistent with these symptoms, the numerous mutations private to the patient’s lineage included several that were specific to wastewater sequencing, providing suggestive evidence for viral adaptation to the GI tract. Additionally, the patient’s sample included a number of mutations that are associated with immune escape and/or are lineage-defining in VOCs. Together, these observations lend further support to the hypothesis that VOCs may originate in immunosuppressed patients with long-term infection, and additionally, suggests that the GI tract may sometimes act as a reservoir for such infections.

Results

Case description

The female patient was admitted to the hospital in the city of Kaluga, Russia, in September 2022 with symptoms of irritable bowel syndrome, including diarrhea and borborygmus. She had not been vaccinated against COVID-19. Initial screening tests revealed positive results for both SARS-CoV-2, detected by RT-PCR from a nasopharyngeal swab, and HIV, detected by an antigen/antibody HIV assay. The patient (henceforth referred to as “patient K” for “Kaluga”; the identifier is not known to anyone outside the research group and is only used for purposes of this publication) claimed that she had been HIV-positive since 2005 and never received highly active antiretroviral therapy (HAART). The patient also had a dry cough and fatigue but was afebrile. Over the course of her hospital stay, the patient was tested for SARS-CoV-2 by RT-PCR four more times (on days 3, 6, 9, and 13 post-admission), with all results being positive. She has had no recent travel history.

Despite the COVID-19 diagnosis, the patient exhibited no signs of lung injury on X-ray imaging and did not require oxygen support. However, her complete blood count was remarkable for leukopenia (1.6%–2.2%) and a low hemoglobin level (86–108 g/L). On day 14 post-admission, the patient refused any further medical treatment and discharged herself from the hospital.

In April 2023, the patient was admitted to a hospital with diarrhea, myalgia, borborygmus, and progressing fatigue.

A SARS-CoV-2 sample on an ultra-long branch

We sequenced the COVID-19 sample from patient K obtained on October 2, 2022 (13 days post-admission), and deposited it in Global Initiative on Sharing All-Influenza Data (GISAID) as hCoV-19/Russia/KLU-RII-MH107906S/2022. Bioinformatic analysis of the sequence strongly suggests that it comes from a persistent infection. Specifically, UShER (Turakhia et al., 2021) identified the sample as belonging to the B.1.1 lineage. This lineage has become nearly extinct by late 2022: apart from the variant reported here, it comprised just 0.004% (549 out of 12,648,328) of GISAID sequences with a collection date later than 1 September 2022 (GISAID queried on 18 February 2025), and was only present in 5 of the 76,460 sequences (0.06%) collected at that time in Russia. To verify the phylogenetic placement of the sample in the B.1.1 clade, we considered alternative tools: pangoLEARN (O’Toole et al., 2021) and NextClade (Aksamentov et al., 2021). The latter uses an approach similar to that of UShER, whereas the former employs an entirely different machine-learning method. Both agreed that the most likely placement was B.1.1.

Despite the robust placement of the sample in the early B.1.1 clade, its lineage is highly divergent, with 89 point mutations since its divergence from the last common ancestor (LCA) with the closest related sample. This closest neighbor, hCoV-19/Russia/MOW-CSP-KOM-0303_S25/2020, was collected in Moscow, Russia, on 23 September 2020 (Figure 1). More broadly, out of the 14 phylogenetically adjacent samples, 13 were obtained in Russia between 16 July 2020 and 4 June 2021 (Figure 1), confirming that the lineage likely originated from a community infection around the estimated time of the internal node. According to Bayesian estimation with Bayesian Evolutionary Analysis Sampling Trees (BEAST), the median age of the patient K branch was 2.1 years with 90% HPD of 1.8 to 2.3 years. This puts the date of its ancestor to the end of August 2020, consistent with the dating of the nearest outgroup sequences. Together, these observations strongly suggest that this is indeed a case of persistent (up to 2 years or more) SARS-CoV-2. Since we only have a single sample from patient K, it is possible that some mutations unique to its lineage had occurred in previous unsampled hosts. However, mutations accumulated in the general population probably constituted a minority, if any, of the mutations in the patient K lineage, as a lineage circulating for a long time would have been sampled.

Figure 1
A radial phylogenetic tree shows genetic relationships among various specimens, each represented by colored dots on spiral lines. A red line connects a specific dot to a detailed list of identifiers, including the origins and dates of the sequences, primarily from Russia. The colors represent different clusters.

Figure 1. Placement of hCoV-19/Russia/KLU-RII-MH107906S/2022 (red branch) in the phylogenetic tree by UShER (Turakhia et al., 2021), with global tree downsampled (Turakhia et al., 2021) to 2,000 sequences. The inset shows the local phylogenetic vicinity of the patient K sample. Tree visualization by Nextstrain (Hadfield et al., 2018) and Taxonium (Sanderson, 2022).

Long-term infection: the most likely explanation for the number of mutations

In the absence of time-course samples from the patient, we establish long-term infection as the most likely explanation for the high number of mutations observed by, first, showing that it could not have resulted from a series of undetected community infections with the normal evolutionary rate, and, second, ruling out the two alternative possibilities that could also explain the “long branch” observation, namely, contamination and recombination.

The 89 mutations observed in the patient K lineage are inconsistent with a series of community infections. To show this, we estimate the rate of evolution of the B.1.1 lineage in Russia outside the patient K lineage using BEAST. Consistent with other estimates (Hill et al., 2022; Markov et al., 2023), this rate is 4.07×10−4 (95% HPD 3.32–4.83×10−4) substitutions per nucleotide per year. However, if the patient K lineage has also evolved at this rate, this would imply that it had diverged from the last common ancestor with other B.1.1 samples 7.36 years prior to its sampling in 2022 (Figure 2). This dating is clearly inconsistent with the phylogenetic placement of the patient K lineage within B.1.1 which originated in 2020 (p < 0.0001), and even with the origin of SARS-CoV-2 itself in 2019 (Pagani et al., 2023). This inconsistency indicates that the bulk of the patient K lineage has been evolving in an accelerated regime, implying that it cannot correspond to a series of community infections. The conjecture that this lineage arose during chronic infection is further supported by its elevated substitution rate (see next section).

Figure 2
Histogram showing the distribution of samples from the posterior over a range of years from 2 to 10. The vertical axis ranges from 0 to 300 samples. A vertical red line at the left indicates the sampling time of the nearest sample on a tree.

Figure 2. Posterior distribution of the time required for SARS-CoV-2 to accumulate 89 mutations, according to phylodynamic analysis (see text). The red line shows the time period between the sampling dates of the patient K sample and its phylogenetically nearest neighbor (2.02 years).

We rule out contamination by observing that the sample does not have an excess of variants at intermediate read frequencies. In particular, of the 104 nucleotide point mutations and indels observed in the sample (i.e., having >0.5 of reads supporting them), just a few are at intermediate read frequencies: only 10 have read fractions <0.8, and of these, only 2 have read fractions <0.6.

To see if the observed sequence may be a result of recombination event(s), we ran it through sc2rf (https://github.com/lenaschimmel/sc2rf) to look for evidence of recombination between CoVariants clades (https://covariants.org). As an alternative approach, we used RIPPLES (Turkahia et al., 2021). Neither method detected any recombination events. The “interleaving” of mutations from different VOCs and non-VOC mutations, as well as the presence of non-VOC derived alleles at sites of VOC-defining mutations (Table 1), also contradict the hypothesis of the sequence resulting from recombination event(s) involving phylogenetically distant lineages.

Table 1
www.frontiersin.org

Table 1. Select Spike mutations in the patient K sample.

Elevated mutation accumulation rate in the patient K sample

We plotted the number of mutations accumulated since divergence from the Wuhan ancestral sequence for the patient K sample and sequences from the UShER-selected subtree containing it (Turakhia et al., 2021) (Figure 3) using TempEst (Rambaut et al., 2016). The rate of mutation accumulation “jumped” dramatically at the beginning of 2022, corresponding to the disproportionately large number of changes at the origin of the Omicron variants (Du et al., 2022; Ma et al., 2022). Remarkably, the patient K sample (red dot) falls on the higher end of the Omicron cloud corresponding to the “BA.2” samples obtained around the same time, even though it belongs to the much more slowly evolving B.1.1 lineage. Thus, the rate of mutation accumulation in the lineage of the patient K sample has been comparable with the rate at the origin of the Omicron clade. Notably, while the rates are similar, the identity of the accumulated mutations is different: the patient K sample shares just a few mutations with BA.2 (see below). The median posterior rate of evolution of patient K lineage was 15×10−4 (90% HPD 12–18×10−4) substitutions per nucleotide per year, 3.7 times higher than for other B.1.1 samples, which again confirms that this lineage arose during chronic infection.

Figure 3
Scatter plot showing root-to-tip divergence versus sampling date from 2020 to 2023. It includes colored dots representing different variants: Kaluga sample (red), other B.1.1 (orange), Delta (peach), Omicron (yellow), and other (gray). Two trend lines indicate overall divergence trends.

Figure 3. Regression of root-to-tip distance on sampling date. The regression lines are calculated for all non-Omicron, non-patient K samples (thin line, slope = 6.3 × 10−4) and just for the B.1.1 samples (thick line, slope = 4.7 × 10−4). The patient K sample is high above both regression lines, indicating accelerated evolution at this lineage.

Single-nucleotide substitutions were non-synonymous (83%), similar to the values observed for samples from immunocompromised patients and from ancestral branches of VOCs (Harari et al., 2022). To evaluate whether the rapid evolution of the patient K sample was caused by positive selection, we applied the Branch-Site Unrestricted Statistical Test for Episodic Diversification (BUSTED) method implemented in HyPhy (Pond et al., 2005). We detected a positive selection on the branch leading to the patient K sample both for the entire concatenated coding sequence (excluding nucleotide positions with overlapping reading frames; dN/dS=2.2 for the branch leading to the patient K sample vs. 0.6 for the rest of the B.1.1 tree, p = 0.04) and for Spike (dN/dS=5.7 vs. 0.7, p = 0.0034). The positive selection (dN/dS > 1) observed for the patient K branch contrasts with the predominantly negative selection (dN/dS < 0.5) observed in the global evolution of SARS-CoV-2 (Rochman et al., 2021). Therefore, the increase in mutation accumulation rate on the patient K branch was likely driven by a positive selection specific to this branch, acting (mainly) on the mutations in the Spike protein.

Mutations in the Spike protein are typical for long-term infection, immune escape, and VOCs

Of the 89 private point mutations in the sample, 33 occur in the Spike gene, with an additional two in-frame deletions (Table 1, Figure 4). These include 17 point mutations that have been seen in VOCs or in sites where other mutations have been found in VOCs, and/or in immunocompromised patients, and/or are associated with immune escape. Among these mutations are L452R, E484Q, K417T, N460K, and Q498H. Beyond Spike, the sequence also has several ORF1a mutations that are associated with hypermutated, chronic-infection sequences, most notably T1638I (NSP3:T820I) and K1795Q (NSP3:K977Q) (Wilkinson et al., 2022); the latter has been identified additionally in a number of New York City (NYC) wastewater cryptic lineages, in several of which its fraction was close to 100% (Gregory et al., 2022). These mutations may constitute a response of the virus to selective pressure from the host immune system.

Figure 4
Diagram of mutations across the SARS-CoV-2 spike protein. Circles represent mutations with different colors indicating their occurrence: white (in patient K), red (in VOC), brown (chronic infection), pink (in wastewaters), and blue (immune evasion). The amino acid positions are labeled above, ranging from T19A to A1165E. Below, the protein domains are marked with colored boxes: NTD, RBD, RBM, FP, HR1, CH, CD, HR2, TM, and CT.

Figure 4. A schematic representation of the Spike mutations in the patient K sample and their overlap with the mutations that were also observed in the ancestors of VOCs, previously described mutations found in chronic infections, associated with immune evasion, or overrepresented in wastewater samples (see Table 1 for corresponding references). Spike domain coordinates were obtained from ref (Shrivastava et al., 2021).

In addition to the point mutations, the sample also contains two deletions in the S gene: Δ141–144 and Δ210. They occur in two of the four recurrent deletion regions (RDRs) in the NTD (RDR2 and RDR3) in the McCarthy et al. (2021) nomenclature and map to distinct antibody epitopes (Wilkinson et al., 2022) and overlap or are adjacent to deletions in VOCs: Alpha (Δ144) and Omicron/BA.1 (Δ211) (Mittal et al., 2022).

The table lists the mutations that are lineage-defining in some of the VOCs or are in sites where another mutation is lineage-defining in a VOC, and/or shown to be involved in antibody evasion, and/or detected in other cases of persistent COVID-19, and/or are characteristic of wastewater samples (see Methods). For the latter three groups, the table cells list the corresponding references.

Other mutations in the S gene are H49R, H66Q, G72R, V126A, G181E, F186I, L216F, A222D, Y248H, Q271R, A372T, R765L, V772A, T1006I, A1078S, and D1165E.

To the gastrointestinal tract and back again?

The pattern of Spike receptor binding domain (RBD) mutations in the patient K sequence shows a remarkable overlap with the set of mutations that have been repeatedly found in “cryptic lineages” detected in wastewater (Gregory et al., 2022), which are hypothesized to be long-term chronic infections with a GI tropism. Some of them are also found in VOCs, e.g., K417T (Gamma), K444T (BQ.1), and N460K (BA.2.75, BQ.1, XBB), whereas others have been found recurrently in wastewater but are otherwise rare in GISAID. One example is Spike:Q498H, which is found in only 96 sequences in GISAID (accessed on 22 December 2022) but in 7/9 sewersheds studied in (Gregory et al., 2022) (a different substitution at the same site, Q498R, is present in a number of Omicron lineages, and is therefore widespread). This is consistent with the fact that the patient has presented with GI symptoms. Remarkably, around the time of the patient K infection, K444T, N460K, and Q498H had very low frequencies in the GISAID (0.0023%, 0.0026%, and 0.0006%, respectively, as of 15 March 2022 (Gregory et al., 2022)), making their chance of co-occurrence in the same genome negligible, and increasing the likelihood that these mutations constitute a GI adaptation.

Moreover, of particular interest is the double mutation in codon 828 of Spike (CTT->TTG). Conceivably, during the presumed long intra-host evolution, the sequence could go through the intermediate state of TTT (Spike:L828F), which was later followed by the F828L reversion to a different leucine codon. L828F is remarkable in that it comes up repeatedly in cryptic lineages found in wastewater, while being extremely rare in general SARS-CoV-2 sequences (almost exclusively sampled from the respiratory tract) (Gregory et al., 2022). We propose that L828F may represent a specific adaptation to the GI tract during chronic infection. Based on this proposed chain of events and on the overlap with other mutations recurrently found in wastewater samples described above, we suggest that the virus has resided within and adapted to the patient’s GI tract during some period of time in the long course of the infection. Eventually, however, the 828L reversion restored the consensus sequence, presumably better adapted for nasopharyngeal localization, and the virus was ultimately detected in the nasopharynx. Admittedly, this suggested mechanism is highly speculative: the 828F variant by itself was not observed in the patient K sample, the double CTT→TTG mutation could have proceeded through the synonymous (L) CTG rather than the non-synonymous (F) TTT intermediate, and the pathway for the reverse transfer of the GI-adapted variant into nasopharyngeal localization is unclear. Still, this finding adds further, albeit circumstantial, evidence to the idea that a subset of chronic infections may be GI (likely accounting for cryptic wastewater lineages). At present, it is thought that such GI chronic infections likely represent an evolutionary dead end due to scant evidence of transmission of these lineages. However, in this sequence, we find tentative evidence that this virus has re-adapted back to the respiratory tract, giving a potential mechanism for how such a virus could re-evolve to become transmissible by the airborne route.

Impaired growth in cell culture compared with the B.1 ancestor

In an attempt to find particular markers of phenotypic adaptation, we isolated the virus (KLU, sequence deposited in GISAID as EPI_ISL_18647516) from the patient K swab specimen and compared the growth rate of the KLU virus and ancestral B.1 strain in the Caco-2 human intestinal epithelial cell line (ATCC, Cat # HTB-37). First, we checked three different doses (0.1, 0.01, and 0.001 TCID50/cell) and two incubation temperatures (34°C and 37°C). The temperature dependence of the growth rate was the same for both viruses, indicating no loss of adaptation to the upper respiratory site of infection for the KLU virus (Figure 5A). Second, we checked three different pH of the culture media: neutral pH = 7.5, acidic pH = 6.5, and alkaline pH = 8.5. The pH dependence of the growth rate was the same for both viruses with no difference between the neutral and low pH and impaired growth at higher pH, contrary to the expectation of adaptation of the KLU virus to the potentially different environment of the gastroenterological tract (Figure 5B). For both viruses, no cytopathic effect was observed at any time point. At the same time, we found that the KLU virus growth was overall significantly impaired in comparison with the B.1 strain in the Caco-2 model cell line. The observed difference was similar to that described earlier for the Omicron strains and could be associated with various mechanisms. The first is the previously described greater efficiency of TMPRSS2-dependent cell entry of early lineages of SARS-CoV-2 compared with Omicron (Meng et al., 2022; Willett et al., 2022). We repeated the growth curve experiment in Calu-3 cells, characterized by a different level of TMPRSS2 expression than Caco-2 (Shuai et al., 2020) (Figure 5C). Overall, the growth of both viral strains was slightly increased in Calu-3 in comparison with Caco-2 cells; however, nearly the same difference between the strains was obtained in Calu-3, namely, the decreased growth rate of the KLU strain compared with the B.1 virus (Figure 5C). The second possible explanation for KLU strain inferiority could be its less effective antagonizing of the interferon response in human cells due to mutations in certain viral proteins (Mautner et al., 2022).

Figure 5
Graph A shows virus titer over time for B.1 and KLU strains at 34°C and 37°C with different moi values: 0.1, 0.01, and 0.001. Graph B compares virus titers for B.1 and KLU at 37°C across pH levels 6.5, 7.5, and 8.5. Graph C compares virus titers over time for B.1 and KLU at 37°C in Calu-3 and Caco-2 cell cultures. Error bars indicate variability.

Figure 5. Growth curve of the KLU virus and the B.1 ancestral strain in Caco-2. (A) Cells were infected at indicated multiplicity of infection (moi) (TCID50/cell) and incubated at 34°C or 37°C. (B) Cells were infected at moi = 0.01 TCID50/cell and incubated at 37°C in the culture medium with indicated pH. (C) Calu-3 or Caco-2 cells were infected at moi = 0.001 TCID50/cell and incubated at 37°C. Samples of the culture medium, taken with an 8–24-h interval after inoculation, were titrated for viral infectious activity in the VeroE6/TMPRSS2 cell line.

Discussion

There is growing support for the hypothesis that at least some VOCs, such as Alpha and Omicron, have evolved in patients with chronic infection and escaped into the general population (Rambaut et al., 2020; Hill et al., 2022; Viana et al., 2022). We have described a case of a likely long-term infection in an HIV+ patient. Sampled in 2022, it is most confidently placed in the B.1.1 clade that was largely extinct by that time. In addition to the long branch from the putative root indicative of a long-term infection, remarkable features of the sample include the presence of mutations defining several VOC lineages and/or associated with immune evasion, and an apparently elevated mutation accumulation rate. In fact, the rate is similar to that at the origin of the Omicron clade, suggesting that the evolutionary processes that shaped its evolution may be similar to those that shaped the evolution of the Omicron variant and, conversely, lending further support to the hypothesis that VOCs such as Omicron emerged in the course of chronic infection (Moulana et al., 2022).

It has been shown that many of the lineage-defining mutations in VOCs are associated with antibody escape (providing selective advantage in the largely immune-competent population at the later stages of the pandemic) but that at least some of them reduce RBD affinity for ACE2, an effect that is rescued in BA.1 by the compensatory mutations Q498R and N501Y (Moulana et al., 2022). Neither is observed in our sample, although it contains a different mutation in the 498 codon, namely, Q498H. The rarity of Q498H in GISAID suggests that it may not be responsible for airborne spread, although it has been experimentally shown to increase the ACE2 affinity on a non-N501Y spike background (Mutational effects on ACE2-binding affinity and expression in SARS-CoV-2 variant RBDs). Furthermore, the Y453F mutation greatly increases the ACE2 affinity in both BA.1 and BA.2, as well as Wuhan Hu-1 backgrounds, whereas R403K substantially increases ACE2 affinity on a BA.2 spike background but not a BA.1 or Wuhan Hu-1 spike background (Mutational effects on ACE2-binding affinity and expression in SARS-CoV-2 variant RBDs). Since this sample shares mutations with both BA.1 and BA.2, there is a theoretical possibility of its increased infectivity.

The major limitation of our study is that it is based on only a single sample that was available from this patient. Therefore, the persistence of the virus cannot be verified by repeated positive PCR tests throughout the presumed duration of infection. Instead, the evidence for persistent infection is circumstantial and is based on the likely immunosuppressed (HIV+) status of the patient together with the phylogenetic placement of this sample.

Similarly, the absence of information about the patient’s history (or lack thereof) of COVID-19 infection, as well as the ultimate lack of evidence for community spread of the virus, limit our ability to draw any definitive conclusions about its ability to evade the immune response “in the wild”, beyond the theoretical possibility suggested by the presence of a number of mutations that have been linked to immune evasion (Table 1).

The increased fraction of non-synonymous mutations indicates a strong positive selection in the patient K lineage, particularly in the spike protein. The sampled viral genome is characterized by the presence of mutations previously observed in persistently infected individuals. On top of that, an intriguing feature of the patient K sample is the presence of several mutations that have been found almost exclusively in wastewater lineages. This observation suggests the possibility of persistence for some period in the GI tract. Importantly, this suggestion is based on the highly circumstantial evidence and is countered by our finding of lack of pH dependence of the growth rate, which could be expected under GI adaptation. Regardless, the connection between immune-evading long-term infection, GI symptoms, and the risk of emergence of VOCs suggested by the observations in this sample highlight the need for wastewater monitoring.

As of 18 February 2025, the hCoV-19/Russia/KLU-RII-MH107906S/2022 sample remains the sole representative of this lineage in GISAID, although GISAID carries 92,865 additional samples from Russia, including 642 from Kaluga region, with 278 having later sampling dates. Together with the fact that the virus has been sampled from the likely source of long-term evolution (the HIV+ patient), this suggests that this lineage has not led to substantial onward transmission. Still, it indicates that SARS-CoV-2 has not yet run out of opportunities for further adaptation.

Methods

Sample collection

Nasopharyngeal swab was collected in virus transport medium. Total RNA was extracted using the AutoPure-96 Nucleic Acid Purification System (Allsheng, China) and NAmagp DNA/RNA extraction kit (Biolabmix, Russia). Extracted RNA was immediately tested for SARS-CoV-2 using Novel Coronavirus (2019-nCoV) Nucleic Acid Diagnostic Kit (Sansure Biotech, China).

The COVID-19 sample from patient K was obtained on 2 October 2022. It had been collected as part of the ongoing surveillance of SARS-CoV-2 variability. The main authors had no access to information that could identify the participant at any stage of the project.

Sequencing

Whole-genome amplification (WGA) of the SARS-CoV-2 virus genome was performed using the Midnight 1,200-bp amplicon primer set (Freed and Silander, 2021) and BioMaster RT-PCR Premium Kit (Biolabmix, Russia). Library preparation was performed with the Fast PCR-FREE FS DNA Library Prep Set (MGI, China). Finally, sequencing was performed on MGI DNBSEQ-G400 using the FCL SE100 high-throughput sequencing set.

Bioinformatic analysis

Phylogenetic placement of the K sample was reconstructed using UShER (Turakhia et al., 2021) based on 16,859,038 GISAID, GenBank, COVID-19 Genomics UK Consortium (COG-UK), and China National Center for Bioinformation (CNCB) sequences (accessed on 18 February 2025). Positive selection was tested with the BUSTED (Murrell et al., 2015) method of the HyPhy package (Pond et al., 2005). To allow likelihood inference while preserving the richness of the data, we subsampled the sequences by merging the B.1.1 samples that differed in fewer than five positions with cd-hit (Fu et al., 2012) and then taking a random subset of 500 samples. We then removed samples with premature stop codons, added the patient K sample, and reconstructed a phylogenetic tree with IQTree (Nguyen et al., 2015).

The distribution of evolutionary rates of the Russian B.1.1 clade was estimated using a subset of 100 random B.1.1 samples from Russia (excluding the patient K sample) with BEAST v. 1.10.4 (Suchard et al., 2018) and the following parameters: substitution model GTR+Г4+I, lognormal strict clock prior with mean 0.001, and standard deviation 0.005; 500 million Markov Chain Monte Carlo (MCMC) chain length; the other parameters were default for BEAST, sampling each 50,000th step, for a total of 10,000 samples. The first 10% of steps were discarded before the subsequent analysis. To obtain a distribution of time needed for the accumulation of 89 substitutions, we divided 89 by the rates sampled by BEAST. To obtain the age of the patient K lineage and its evolution rate, we added the patient K sample to this subset and ran the BEAST analysis assuming random local clock with approximate continuous time Markov chain rate reference prior (Ferreira and Suchard, 2008) while keeping the other parameters the same. Again, the first 10% of samples were discarded before the subsequent analysis. The subsequent analysis and visualization were performed in R 4.4.0 (R Core Team, 2013) using the basic functions and the “trackViewer” library (Ou and Zhu, 2019).

Virus isolation and growth curve

Live virus (KLU) was isolated from the patient K swab samples in VeroE6/TMPRSS2 cells (JCRB, #JCRB1819). Culture was inoculated for 2 h with the swab material diluted 1/10 in Dulbecco's Modified Eagle Medium (DMEM) (Biolot) supplemented with 2% heat-inactivated fetal bovine serum (SC, Biolot, St Petersburg, Russia), 1% anti–anti (Gibco, New York, USA) and then incubated for 4 days until 70% cytopathic effect was reached. The sample was subsequently passaged one time in VeroE6/TMPRSS2 cells to generate the working stock. The sequence of the cultivated virus has been deposited in GISAID (EPI_ISL_18647516).

Growth curve experiments were performed in Caco-2 cells (ATCC, #HTB-37) and Calu-3 cells (ATCC, #HTB-55). KLU virus was compared with the B.1 strain hCoV-19/St_Petersburg-3524V/2020 virus (GISAID EPI_ISL_415710). Cells were grown until 100% monolayer in 12-well plates in Modified Eagle Medium (MEM) culture medium (Gibco) supplemented with 20% FBS (Gibco), 1% GlutaMAX (Gibco), 1% sodium pyruvate (Gibco), and 1% NEAA (Biolot). Monolayer cells were inoculated with a particular viral dose (0.1, 0.01, or 0.001 TCI50/cell) in a 0.5-mL neutral medium with 5% FBS for 1h at 37°C, and then inoculum was removed, and 3-mL medium with 5% FBS was added. The neutral (original) medium pH was 7.5 at 37°C. To get the low-pH medium, we added 1-M MES (Amresco, Solon (OH), USA) until pH = 6.5; to get the high-pH medium, we added 1-M Tris-base (Amresco) until pH = 8.5. Samples of culture supernatant were taken at 8–24-h intervals after inoculation and titrated for viral infectious activity in the VeroE6/TMPRSS2 cell line by the standard TCID50 method.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://gisaid.org/, KLU-RII-MH107906S.

Ethics statement

The sample used in this study was collected as part of the ongoing surveillance of SARS-CoV-2 variability routinely conducted at the laboratories of the CoRGI consortium (CoRGI). Genomic epidemiology study conducted by the CoRGI consortium was reviewed and deemed exempt by the Local Ethics Committee of Smorodintsev Research Institute of Influenza (protocol No. 152, 18 June 2020). This case report involved a retrospective retrieval, review, and analysis of clinical data. These activities were approved by the Local Ethics Committee of Smorodintsev Research Institute of Influenza (protocol no. 203, 18 May 2023) in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. The patient gave written informed consent prior to sample collection, according to CARE guidelines (See: https://www.care-statement.org/). To maintain patient privacy, all identifying information has been removed from the publication.

Author contributions

EN: Visualization, Methodology, Formal Analysis, Writing – review & editing, Conceptualization, Writing – original draft. GK: Conceptualization, Writing – review & editing, Funding acquisition, Writing – original draft, Methodology, Visualization, Formal Analysis. AK: Writing – review & editing, Investigation, Supervision. SZ: Investigation, Writing – review & editing. MS: Investigation, Writing – review & editing. AF: Investigation, Writing – review & editing. KKo: Investigation, Writing – review & editing. AI: Investigation, Writing – review & editing. MP: Investigation, Writing – review & editing. KKu: Writing – review & editing, Investigation. DD: Investigation, Writing – review & editing. DL: Writing – review & editing. RH: Conceptualization, Writing – review & editing. FG: Conceptualization, Writing – review & editing. TP: Writing – review & editing, Conceptualization. CR: Conceptualization, Writing – review & editing. GB: Writing – original draft, Conceptualization, Writing – review & editing, Methodology, Supervision.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. Data analysis and publication was supported within the framework of the HSE University Basic Research Program. Sequencing has been supported by Russian Government subsidy for SARS-CoV-2 genomic sequencing (decree No. 267-r, 17.02.2022) to DAL.

Acknowledgments

We are thankful to patient K for her participation in this study. We are grateful to all GISAID submitting and originating labs (Supplementary File) for the rapid open release of SARS-CoV-2 sequencing data. This research was supported in part through computational resources of the High-Performance Computing (HPC) facilities at Higher School of Economics (HSE) University (Kostenetskiy et al., 2021). EN and GB was involved in this study prior to 2023.

Conflict of interest

At the time of final submission of this manuscript, FG was a contractor for Invivyd, a company that develops monoclonal antibodies to treat COVID-19.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2025.1623390/full#supplementary-material

References

Aksamentov, I., Roemer, C., Hodcroft, E. B., and Neher, R. A. (2021). Nextclade: clade assignment, mutation calling and quality control for viral genomes. J. Open Source Software 6, 3773. doi: 10.21105/joss.03773

Crossref Full Text | Google Scholar

Mutational effects on ACE2-binding affinity and expression in SARS-CoV-2 variant RBDs. Available online at: https://jbloomlab.github.io/SARS-CoV-2-RBD_DMS_Omicron/RBD-heatmaps/ (Accessed September 5, 2025).

Google Scholar

CDC (2020). COVID-19 and your health (Atlanta, GA, USA: Centers for Disease Control and Prevention). Available online at: https://www.cdc.gov/coronavirus/2019-ncov/symptoms-testing/symptoms.html (Accessed September 5, 2025).

Google Scholar

Choi, B., Choudhary, M. C., Regan, J., Sparks, J. A., Padera, R. F., Qiu, X., et al. (2020). Persistence and evolution of SARS-coV-2 in an immunocompromised host. N. Engl. J. Med. 383, 2291–2293. doi: 10.1056/NEJMc2031364

PubMed Abstract | Crossref Full Text | Google Scholar

Du, P., Gao, G. F., and Wang, Q. (2022). The mysterious origins of the Omicron variant of SARS-CoV-2. Innovation 3, 100206. doi: 10.1016/j.xinn.2022.100206

PubMed Abstract | Crossref Full Text | Google Scholar

Ferreira, M. A. R. and Suchard, M. A. (2008). Bayesian analysis of elapsed times in continuous-time Markov chains. Can. J. Stat. 36, 355–368. doi: 10.1002/cjs.5550360302

Crossref Full Text | Google Scholar

Freed, N. and Silander, O. (2021). SARS-CoV2 genome sequencing protocol (1200bp amplicon ‘midnight’ primer set, using Nanopore Rapid kit). doi: 10.17504/protocols.io.btsrnnd6

Crossref Full Text | Google Scholar

Fu, L., Niu, B., Zhu, Z., Wu, S., and Li, W. (2012). CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152. doi: 10.1093/bioinformatics/bts565

PubMed Abstract | Crossref Full Text | Google Scholar

Gregory, D. A., Trujillo, M., Rushford, C., Flury, A., Kannoly, S., San, K. M., et al. (2022). Genetic diversity and evolutionary convergence of cryptic SARS- CoV-2 lineages detected via wastewater sequencing. PloS Pathog. 18, e1010636. doi: 10.1371/journal.ppat.1010636

PubMed Abstract | Crossref Full Text | Google Scholar

Hadfield, J., Megill, C., Bell, S. M., Huddleston, J., Potter, B., Callender, C., et al. (2018). Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 34, 4121–4123. doi: 10.1093/bioinformatics/bty407

PubMed Abstract | Crossref Full Text | Google Scholar

Harari, S., Tahor, M., Rutsinsky, N., Meijer, S., Miller, D., Henig, O., et al. (2022). Drivers of adaptive evolution during chronic SARS-CoV-2 infections. Nat. Med. 28, 1501–1508. doi: 10.1038/s41591-022-01882-4

PubMed Abstract | Crossref Full Text | Google Scholar

Hill, V., Du Plessis, L., Peacock, T. P., Aggarwal, D., Colquhoun, R., Carabelli, A. M., et al. (2022). The origins and molecular evolution of SARS-CoV-2 lineage B.1.1.7 in the UK. Virus Evol. 8, veac080. doi: 10.1093/ve/veac080

PubMed Abstract | Crossref Full Text | Google Scholar

Jian, F., Yu, Y., Song, W., Yisimayi, A., Yu, L., Gao, Y., et al. (2022). Further humoral immunity evasion of emerging SARS-CoV-2 BA.4 and BA.5 subvariants. Lancet Infect. Dis. 22, 1535–1537. doi: 10.1016/S1473-3099(22)00642-9

PubMed Abstract | Crossref Full Text | Google Scholar

Kemp, S. A., Collier, D. A., Datir, R. P., Ferreira, I. A. T. M., Gayed, S., Jahun, A., et al. (2021). SARS-CoV-2 evolution during treatment of chronic infection. Nature 592, 277–282. doi: 10.1038/s41586-021-03291-y

PubMed Abstract | Crossref Full Text | Google Scholar

Kostenetskiy, P. S., Chulkevich, R. A., and Kozyrev, V. I. (2021). HPC resources of the higher school of economics. J. Phys. Conf. Ser. 1740, 012050. doi: 10.1088/1742-6596/1740/1/012050

Crossref Full Text | Google Scholar

Leung, W. F., Chorlton, S., Tyson, J., Al-Rawahi, G. N., Jassem, A. N., Prystajecky, N., et al. (2022). COVID-19 in an immunocompromised host: persistent shedding of viable SARS-CoV-2 and emergence of multiple mutations: a case report. Int. J. Infect. Dis. 114, 178–182. doi: 10.1016/j.ijid.2021.10.045

PubMed Abstract | Crossref Full Text | Google Scholar

Ma, W., Yang, J., Fu, H., Su, C., Yu, C., Wang, Q., et al. (2022). Genomic perspectives on the emerging SARS-coV-2 omicron variant. Genomics Proteomics Bioinf. 20, 60–69. doi: 10.1016/j.gpb.2022.01.001

PubMed Abstract | Crossref Full Text | Google Scholar

Markov, P. V., Ghafari, M., Beer, M., Lythgoe, K., Simmonds, P., Stilianakis, N. I., et al. (2023). The evolution of SARS-coV-2. Nat. Rev. Microbiol. 21, 361–379. doi: 10.1038/s41579-023-00878-2

PubMed Abstract | Crossref Full Text | Google Scholar

Mautner, L., Hoyos, M., Dangel, A., Berger, C., Ehrhardt, A., Baiker, A., et al. (2022). Replication kinetics and infectivity of SARS-CoV-2 variants of concern in common cell culture models. Virol. J. 19, 76. doi: 10.1186/s12985-022-01802-5

PubMed Abstract | Crossref Full Text | Google Scholar

McCarthy, K. R., Rennick, L. J., Nambulli, S., Robinson-McCarthy, L. R., Bain, W. G., Haidar, G., et al. (2021). Recurrent deletions in the SARS-CoV-2 spike glycoprotein drive antibody escape. Science 371, 1139–1142. doi: 10.1126/science.abf6950

PubMed Abstract | Crossref Full Text | Google Scholar

Meng, B., Abdullahi, A., Ferreira, I. A. T. M., Goonawardane, N., Saito, A., Kimura, I., et al. (2022). Altered TMPRSS2 usage by SARS-CoV-2 Omicron impacts infectivity and fusogenicity. Nature 603, 706–714. doi: 10.1038/s41586-022-04474-x

PubMed Abstract | Crossref Full Text | Google Scholar

Mittal, A., Khattri, A., and Verma, V. (2022). Structural and antigenic variations in the spike protein of emerging SARS-CoV-2 variants. PloS Pathog. 18, e1010260. doi: 10.1371/journal.ppat.1010260

PubMed Abstract | Crossref Full Text | Google Scholar

Moulana, A., Dupic, T., Phillips, A. M., Chang, J., Nieves, S., Roffler, A. A., et al. (2022). Compensatory epistasis maintains ACE2 affinity in SARS-CoV-2 Omicron BA.1. Nat. Commun. 13, 7011. doi: 10.1038/s41467-022-34506-z

PubMed Abstract | Crossref Full Text | Google Scholar

Murrell, B., Weaver, S., Smith, M. D., Wertheim, J. O., Murrell, S., Aylward, A., et al. (2015). Gene-wide identification of episodic selection. Mol. Biol. Evol. 32, 1365–1371. doi: 10.1093/molbev/msv035

PubMed Abstract | Crossref Full Text | Google Scholar

Natarajan, A., Zlitni, S., Brooks, E. F., Vance, S. E., Dahlen, A., Hedlin, H., et al. (2022). Gastrointestinal symptoms and fecal shedding of SARS-CoV-2 RNA suggest prolonged gastrointestinal infection. Med 3, 371–387.e9. doi: 10.1016/j.medj.2022.04.001

PubMed Abstract | Crossref Full Text | Google Scholar

Neverov, A. D., Fedonin, G., Popova, A., Bykova, D., and Bazykin, G. (2023). Coordinated evolution at amino acid sites of SARS-CoV-2 spike. eLife 12, e82516. doi: 10.7554/eLife.82516.sa2

PubMed Abstract | Crossref Full Text | Google Scholar

Nguyen, L.-T., Schmidt, H. A., von Haeseler, A., and Minh, B. Q. (2015). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274. doi: 10.1093/molbev/msu300

PubMed Abstract | Crossref Full Text | Google Scholar

Toole, ?. ?., Scher, E., Underwood, A., Jackson, B., Hill, V., McCrone, J. T., et al. (2021). Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 7. doi: 10.1093/ve/veab064

PubMed Abstract | Crossref Full Text | Google Scholar

Ou, J. and Zhu, L. J. (2019). trackViewer: a Bioconductor package for interactive and integrative visualization of multi-omics data. Nat. Methods 16, 453–454. doi: 10.1038/s41592-019-0430-y

PubMed Abstract | Crossref Full Text | Google Scholar

Pagani, I., Ghezzi, S., Alberti, S., Poli, G., and Vicenzi, E. (2023). Origin and evolution of SARS-coV-2. Eur. Phys. J. Plus 138, 157. doi: 10.1140/epjp/s13360-023-03719-6

PubMed Abstract | Crossref Full Text | Google Scholar

Pond, S. L. K., Frost, S. D. W., and Muse, S. V. (2005). HyPhy: hypothesis testing using phylogenies. Bioinformatics 21, 676–679. doi: 10.1093/bioinformatics/bti079

PubMed Abstract | Crossref Full Text | Google Scholar

Qu, P., Evans, J. P., Faraone, J. N., Zheng, Y. M., Carlin, C., Anghelina, M., et al. (2023). Enhanced neutralization resistance of SARS-CoV-2 Omicron subvariants BQ.1, BQ.1.1, BA.4.6, BF.7, and BA.2.75.2. Cell Host Microbe 31, 9–17.e3. doi: 10.1016/j.chom.2022.11.012

PubMed Abstract | Crossref Full Text | Google Scholar

Quaranta, E. G., Fusaro, A., Giussani, E., D??™Amico, V., Varotto, M., Pagliari, M., et al. (2022). SARS-CoV-2 intra-host evolution during prolonged infection in an immunocompromised patient. Int. J. Infect. Dis. 122, 444–448. doi: 10.1016/j.ijid.2022.06.023

PubMed Abstract | Crossref Full Text | Google Scholar

Rambaut, A., Lam, T. T., Max Carvalho, L., and Pybus, O. G. (2020). Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations (University of Edinburgh: Virological). Available online at: https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563 (Accessed September 5, 2025).

Google Scholar

Rambaut, A., Lam, T. T., Max Carvalho, L., and Pybus, O. G. (2016). Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2, vew007. doi: 10.1093/ve/vew007

PubMed Abstract | Crossref Full Text | Google Scholar

R Core Team (2013). R: A language and Environment for Statistical Computing (Vienna, Austria: R Foundation for Statistical Computing).

Google Scholar

Rochman, N. D., Wolf, Y. I., Faure, G., Mutz, P., Zhang, F., Koonin, E. V., et al. (2021). Ongoing global and regional adaptive evolution of SARS-CoV-2. Proc. Natl. Acad. Sci. 118, e2104241118. doi: 10.1073/pnas.2104241118

PubMed Abstract | Crossref Full Text | Google Scholar

Sanderson, T. (2022). Taxonium, a web-based tool for exploring large phylogenetic trees. eLife 11, e82392. doi: 10.7554/eLife.82392.sa2

PubMed Abstract | Crossref Full Text | Google Scholar

Shrivastava, T., Singh, B., Rizvi, Z. A., Verma, R., Goswami, S., Vishwakarma, P., et al. (2021). Comparative immunomodulatory evaluation of the receptor binding domain of the SARS-coV-2 spike protein; a potential vaccine candidate which imparts potent humoral and th1 type immune response in a mouse model. Front. Immunol. 12. doi: 10.3389/fimmu.2021.641447

PubMed Abstract | Crossref Full Text | Google Scholar

Shuai, H., Chu, H., Hou, Y., Yang, D., Wang, Y., Hu, B., et al. (2020). Differential immune activation profile of SARS-CoV-2 and SARS-CoV infection in human lung and intestinal cells: Implications for treatment with IFN-β and IFN inducer. J. Infect. 81, e1–e10. doi: 10.1016/j.jinf.2020.07.016

PubMed Abstract | Crossref Full Text | Google Scholar

Stanevich, O. V., Alekseeva, E. I., Sergeeva, M., Fadeev, A. V., Komissarova, K. S., Ivanova, A. A., et al. (2023). SARS-CoV-2 escape from cytotoxic T cells during long-term COVID-19. Nat. Commun. 14, 149. doi: 10.1038/s41467-022-34033-x

PubMed Abstract | Crossref Full Text | Google Scholar

Starr, T. N., Greaney, A. J., Dingens, A. S., and Bloom, J. D. (2021). Complete map of SARS-CoV-2 RBD mutations that escape the monoclonal antibody LY-CoV555 and its cocktail with LY-CoV016. Cell Rep. Med. 2. doi: 10.1016/j.xcrm.2021.100255

PubMed Abstract | Crossref Full Text | Google Scholar

Stein, S. R., Ramelli, S. C., Grazioli, A., Chung, J. Y., Singh, M., Yinda, C. K., et al. (2022). SARS-CoV-2 infection and persistence in the human body and brain at autopsy. Nature 612, 758–763. doi: 10.1038/s41586-022-05542-y

PubMed Abstract | Crossref Full Text | Google Scholar

Suchard, M. A., Lemey, P., Baele, G., Ayres, D. L., Drummond, A. J., and Rambaut, A.. (2018). Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, vey016. doi: 10.1093/ve/vey016

PubMed Abstract | Crossref Full Text | Google Scholar

Tada, T., Dcosta, B. M., Zhou, H., Vaill, A., Kazmierski, W., and Landau, N. R.. (2021). Decreased neutralization of SARS-CoV-2 global variants by therapeutic anti-spike protein monoclonal antibodies. bioRxiv 2021, 2.18.431897. doi: 10.1101/2021.02.18.431897

PubMed Abstract | Crossref Full Text | Google Scholar

Tada, T., Zhou, H., Dcosta, B. M., Samanovic, M. I., Cornelius, A., Herati, R. S., et al. (2022). High-titer neutralization of Mu and C.1.2 SARS-CoV-2 variants by vaccine-elicited antibodies of previously infected individuals. Cell Rep. 38. doi: 10.1016/j.celrep.2021.110237

PubMed Abstract | Crossref Full Text | Google Scholar

Takatsuki, Y., Takahashi, Y., Nakajima, J., Iwasaki, Y., Nagano, K., Tani-Sassa, C., et al. (2023). Viral load of SARS-CoV-2 Omicron BA.5 is lower than that of BA.2 despite the higher infectivity of BA.5. Immun. Inflamm. Dis. 11, e783. doi: 10.1002/iid3.783

PubMed Abstract | Crossref Full Text | Google Scholar

Turakhia, Y., Thornlow, B., Hinrichs, A. S., De Maio, N., Gozashti, L., Lanfear, R., et al. (2021). Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic. Nat. Genet. 53, 809–816. doi: 10.1038/s41588-021-00862-7

PubMed Abstract | Crossref Full Text | Google Scholar

Turkahia, Y., et al. (2021). Pandemic-scale phylogenomics reveals elevated recombination rates in the SARS-coV-2 spike region. Nature. 609 (7929), 994–997. doi: 10.1101/2021.08.04.455157

Crossref Full Text | Google Scholar

Uraki, R., Kiso, M., Imai, M., Yamayoshi, S., Ito, M., Fujisaki, S., et al. (2022). Therapeutic efficacy of monoclonal antibodies and antivirals against SARS-CoV-2 Omicron BA.1 in Syrian hamsters. Nat. Microbiol. 7, 1252–1258. doi: 10.1038/s41564-022-01170-4

PubMed Abstract | Crossref Full Text | Google Scholar

Van Cleemput, J., van Snippenberg, W., Lambrechts, L., Dendooven, A., D’Onofrio, V., Couck, L., et al. (2021). Organ-specific genome diversity of replication-competent SARS-CoV-2. Nat. Commun. 12, 6612. doi: 10.1038/s41467-021-26884-7

PubMed Abstract | Crossref Full Text | Google Scholar

Viana, R., Moyo, S., Amoako, D. G., Tegally, H., Scheepers, C., Althaus, C. L., et al. (2022). Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature 603, 679–686. doi: 10.1038/s41586-022-04411-y

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, R., Chen, J., Gao, K., and Wei, G.-W. (2021). Vaccine-escape and fast-growing mutations in the United Kingdom, the United States, Singapore, Spain, India, and other COVID-19-devastated countries. Genomics 113, 2158–2170. doi: 10.1016/j.ygeno.2021.05.006

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, Z., Schmidt, F., Weisblum, Y., Muecksch, F., Barnes, C. O., Finkin, S., et al. (2021). mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants. Nature 592, 616–622. doi: 10.1038/s41586-021-03324-6

PubMed Abstract | Crossref Full Text | Google Scholar

Wilkinson, S. A. J., Richter, A., Casey, A., Osman, H., Mirza, J. D., Stockton, J., et al. (2022). Recurrent SARS-CoV-2 mutations in immunodeficient patients. Virus Evol. 8, veac050. doi: 10.1093/ve/veac050

PubMed Abstract | Crossref Full Text | Google Scholar

Willett, B. J., Grove, J., MacLean, O. A., Wilkie, C., De Lorenzo, G., Furnon, W., et al. (2022). SARS-CoV-2 Omicron is an immune escape variant with an altered cell entry pathway. Nat. Microbiol. 7, 1161–1179. doi: 10.1038/s41564-022-01143-7

PubMed Abstract | Crossref Full Text | Google Scholar

Wu, J., Zhang, L., Zhang, Y., Wang, H., Ding, R., Nie, J., et al. (2021). The antigenicity of epidemic SARS-coV-2 variants in the United Kingdom. Front. Immunol. 12, 687869. doi: 10.3389/fimmu.2021.687869

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: persistent COVID-19, intrahost evolution, immunosuppressed host, gastrointestinal COVID-19, intrapatient evolution of SARS-CoV-2

Citation: Nabieva E, Klink GV, Komissarov AB, Zaitsev SV, Sergeeva M, Fadeev AV, Komissarova K, Ivanova A, Pisareva M, Kudrya K, Danilenko D, Lioznov D, Hisner R, Gueli F, Peacock TP, Roemer C and Bazykin GA (2025) A highly divergent sample from a nearly extinct SARS-CoV-2 lineage in a patient with long-term COVID-19. Front. Cell. Infect. Microbiol. 15:1623390. doi: 10.3389/fcimb.2025.1623390

Received: 05 May 2025; Accepted: 20 August 2025;
Published: 19 September 2025.

Edited by:

Emma Poole, University of Cambridge, United Kingdom

Reviewed by:

Siyuan Hao, University of Texas Southwestern Medical Center, United States
Mahan Ghafari, University of Oxford, United Kingdom

Copyright © 2025 Nabieva, Klink, Komissarov, Zaitsev, Sergeeva, Fadeev, Komissarova, Ivanova, Pisareva, Kudrya, Danilenko, Lioznov, Hisner, Gueli, Peacock, Roemer and Bazykin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Galya V. Klink, Z2Fsa2FrbGlua0BnbWFpbC5jb20=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.