ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol., 04 June 2025

Sec. Virus and Host

Volume 15 - 2025 | https://doi.org/10.3389/fcimb.2025.1513392

This article is part of the Research TopicPerspectives in Virus and Host: 2025View all 6 articles

From transmission to adaptive evolution: genomic surveillance of Getah virus

Yuge YuanYuge Yuan1Yujia HaoYujia Hao1Chengcheng PengChengcheng Peng1Duo Zhang,Duo Zhang1,2Wenzhou MaWenzhou Ma1Pengpeng Xiao*Pengpeng Xiao1*Nan Li*Nan Li1*
  • 1Wenzhou Key Laboratory for Virology and Immunology, Institute of Virology, Wenzhou University, Wenzhou, China
  • 2College of Veterinary Medicine, Jilin University, Changchun, China

Getah virus (GETV) is a member of the Alphavirus of the Togaviridae. It is a single-stranded positive-RNA virus that is mainly transmitted by mosquitoes. In recent years, the spread of GETV has become increasingly serious, causing serious losses to the animal economy and posing a potential threat to public health. GETV infected animals extend from traditional domestic animals such as horses and pigs to cattle, foxes and other animals. Especially in China, the virus has been detected in many provinces in recent years. In addition, GETV-specific antibodies were detected in healthy humans. However, the threat posed by GETV in China has not received enough attention. In this study, we downloaded all available GETV genome-wide serials (82 serials in total) from the NCBI as of December 2023. We integrate multiple bioinformatics approaches to understand the characteristics of GETV from the perspectives of epidemiology, virus-host co-evolution, and viral adaptation analysis. The results of this study show that GETV is rapidly expanding its host range and geographical distribution at high evolutionary rates due to the lack of commercially available vaccines. Second, we clearly reveal the cross-species transmission of GETV. Finally, we identified important adaptive and active selection sites. GETV and its media are widely distributed in China, and new host infections continue to appear. Therefore, strengthening surveillance and prevention to avoid serious losses to the pandemic is an important task we face today.

1 Introduction

Getah virus (GETV) is a member of the Alphavirus within the Togaviridae (Zhang et al., 2020). In 1955, GETV was first isolated from Aedes albopictus living in rubber plantations in Malaysia, and it subsequently spread widely across island countries in the South Pacific region. Since its isolation, GETV has been found in various vertebrates, including humans, monkeys, pigs, horses, and other mammals. Infections in these species are crucial for maintaining the transmission cycle of zoonotic diseases like GETV in nature (Scherer et al., 1962). Studies have shown (Li et al., 2022) that GETV can be amplified by mosquito-vertebrate host-mosquito circulation and infected horses, pigs, and cattle. GETV infects pigs of all ages, especially newborn piglets, resulting in high morbidity, mortality and severe economic losses.

GETV is an enveloped, single-stranded positive-sense RNA virus primarily transmitted by mosquitoes. The virus’s genome is 11–12 kb in length, containing untranslated regions (5’ UTR and 3’ UTR) at both ends. The 5’ end features a methylated cap structure (7-methylguanosine), and the 3’ end has a polyadenylated tail. The genome consists of two open reading frames (ORFs). The first ORF occupies about two-thirds of the genome and encodes four non-structural polyproteins (NSP1 to NSP4), which are responsible for RNA transcription, replication, polyprotein cleavage, and RNA capping. This region is followed by the 26S RNA junction region, which facilitates the transcription of subgenomic 26S RNA within the cell. The second ORF, occupying about one-third of the genome, encodes the structural proteins Cap, E3, E2, 6K, and E1. The capsid protein comprises two distinct structural domains, the N-terminal and the C-terminal. The C-terminal domain is highly conserved across all Alphavirus capsid proteins, while the N-terminal domain is highly variable. E2 is the primary protein mediating the virus’s entry into host cells during infection (Chen et al., 2018; Carey et al., 2019; Wang et al., 2022).

The seroepidemiology of GETV-infected animals has attracted attention due to the outbreak of GETV in animals such as horses and pigs, among which the most detailed investigation and study has been conducted in Japan, and in 1972, a serological study was conducted in local horses in Japan, and serum neutralization tests showed that the positive rates of GETV antibodies in local horses before and after the outbreak were 6% (14/232) and 61.2% (172/282), respectively (Imagawa et al., 1981). Serological surveys indicate (Zhang et al., 2020) that GETV-specific antibodies have been detected in the sera of humans, pigs, horses, cattle, goats, dogs, rabbits, kangaroos, chickens, and wild birds from countries in Asia, Europe, and Oceania. For a long time, GETV was considered pathogenic only to pigs and horses, causing fever, widespread rashes, hindlimb edema in horses, arthritis in piglets, and reproductive disorders in sows (Fukunaga et al., 2000; Bannai et al., 2019). Several outbreaks of diseases caused by GETV infections in pigs and horses have been reported in Japan and India (Nemoto et al., 2015). In China, the first GETV strain was isolated from Culex gelidus mosquitoes in Hainan Province in 1964 and was named M1 (Zhai et al., 2008). In our study, we illustrate the cross-species transmission of GETV that has been taking place in recent years through a visualization of the relationship between GETV whole genome sequences and hosts in the NCBI database.

Viruses are a potential threat to global epidemics and future pathogens (Thakur et al., 2024). GETV is a neglected zoonotic amorphous virus. From a surveillance perspective, the silent circulation of GETV in asymptomatic livestock and wild animal hosts makes the early warning system more complex. In addition, the threat posed by the expanding host range and geographical transmission of the GETV need to be paid attention. Since the early 21st century, the geographic distribution of GETV has expanded, with new transmission vectors emerging, an increasing number of infected animals, and more frequent outbreaks. This has resulted in significant economic losses and poses a potential threat to public health (Li et al., 2022). GETV-specific antibodies were detected in healthy people and patients with fever of unknown cause, suggesting that the scope of GETV infection was expanding, and the impact on public health could not be ignored. Due to the public health importance of GETV (Zhang et al., 2020), this study investigates the epidemiology, host diversity, and adaptive evolution of GETV based on whole genome sequences downloaded from NCBI.

2 Materials and methods

2.1 Sequence selection

We retrieved complete GETV genome sequences from the NCBI GenBank database (https://www.ncbi.nlm.nih.gov/) up to December 2023. Relevant information such as GenBank accession number, host, country, region, and collection date (year) was extracted. Based on this information, we analyzed the geographic distribution of GETV in Asia, and the relationship with hosts, and conducted analyses on the transmission, mutation, and evolution of GETV.

2.2 Spatial Distribution of GETV

The downloaded data from NCBI was summarized based on GenBank accession number, host, country, region, and collection date (year) (Supplementary Data). Using this data, a map of GETV distribution in Asia was created. The Asian distribution map was drawn using the online tool MapChart (https://www.mapchart.net/), which allows the selection of different map types and region coloring and supports downloading the final map image.

2.3 Distribution of susceptible hosts of GETV

Based on the sequence information downloaded from NCBI, sequences were classified by country, province, and host type. Cartoon images of different hosts were downloaded from the Biorender website (https://www.biorender.com/) and used to create a distribution map of susceptible hosts using Microsoft Office PowerPoint, based on the sorted sequence information.

2.4 Cross-species analysis of GETV and hosts

To evaluate the co-phylogenetic relationships between GETV and its hosts, we compared the phylogenetic trees of the virus and its hosts. Firstly, we constructed a phylogenetic tree using 82 GETV whole genome sequences from the NCBI database. Multiple sequence alignment (MSA) was performed using MAFFT v7.543 (Katoh et al., 2002), visually inspected, and manually edited using Aliview v1.27 (Larsson, 2014). After obtaining the MSA, the phylogenetic tree of the virus was constructed using IQ-TREE v1.6.12 (Nguyen et al., 2015) with 1000 bootstrap replicates. Based on the BIC value calculated using MEGA11, the best-fit nucleotide substitution model was GTR+G+I. Visualization and annotation of the tree were done using Figtree v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/). Coloring of the phylogenetic tree was done using the iTOL online tool (Letunic and Bork, 2021) (https://itol.embl.de/). For constructing the host phylogenetic tree, mitochondrial cytb sequences of hosts were downloaded from the NCBI database and processed similarly using MAFFT v7.543 (Katoh et al., 2002), manually edited in Aliview v1.27 (Larsson, 2014), and a phylogenetic tree was constructed using IQ-TREE v1.6.12 (Nguyen et al., 2015) with the best-fit BIC model TPM2u+F+R3. The co-phylogenetic relationships between the virus and host trees were manually annotated in Microsoft Office PowerPoint.

2.5 Transmission and adaptive evolution of GETV

To illustrate the routes of horizontal and vertical transmission of GETV, a transmission route map was created using the Biorender online tool (https://www.biorender.com/). For analyzing amino acid variations in GETV, BioAider v1.527 (Zhou et al., 2020) was used to analyze amino acid mutation sites across the whole genome sequences. The first original strain MM2021 uploaded in 1955 was used as a reference for amino acid mutation analysis. MSA was performed using MAFFT v7.543 (Katoh et al., 2002), and the aligned nucleotide sequences were translated into amino acid sequences in Aliview v1.27 (Larsson, 2014). The amino acid sequences were then analyzed for mutation sites and visualized as lollipop plots using BioAider v1.527 (Zhou et al., 2020). To further identify positively selected sites, the whole genome sequences of GETV were analyzed using the datamonkey online tool (http://www.datamonkey.org/) for positive selection site prediction. Selection pressure at sites was detected using several methods: MEME (Mixed Effects Model of Evolution), FEL (Fixed Effects Likelihood), FUBAR (Fast, Unconstrained Bayesian Approximation), and SLAC (Single-Likelihood Ancestor Counting), MEME uses a mixed-effect maximum likelihood method to test the hypothesis that a single locus is affected by contextual positive selection; FEL assumes that the selection pressure at each locus is constant throughout phylogenesis, and FEL uses maximum likelihood, whereby the MG94xREV model is used to infer the non-synonymous and synonymous substitution rates for each codon locus after optimizing for branch length and nucleotide substitution parameters. FUBAR uses the Bayesian method to infer the d-non-synonymous replacement rate (dN) and synonymous replacement rate (dS) for each locus based on the given coding sequence arrangement and the corresponding phylogenetic relationship. SLAC assumes that the selection pressure at each locus is constant throughout phylogenesis. SLAC first optimizes branch length and nucleotide substitution parameters under the MG94xREV model, using ML to infer the most likely ancestor sequence at each node of phylogeny (Kosakovsky Pond and Frost, 2005; Murrell et al., 2012; Smith et al., 2015). The significance levels for SLAC, FEL, and MEME were set at 0.1, and the p-value threshold for FUBAR was 0.9. Sites identified by more than two algorithms as positively selected were considered significant and tested for statistical significance using methods described by previous researchers (Li et al., 2020). Positively selected sites were visualized using the Weblogo online tool (https://weblogo.berkeley.edu/logo.cgi).

2.6 Statistical analysis

In this study, the sequence information of GETV strains was statistically sorted and analyzed using Microsoft Excel. All bar charts in this study were created using Microsoft Excel.

3 Results

3.1 GETV exists in several countries in Asia

To investigate the distribution of GETV in Asia, we downloaded all available GETV whole genome sequences from the NCBI database, spanning from 1955 to 2023 (Supplementary Data 1). A total of 82 sequences were obtained. According to the sequence information from NCBI, GETV sequences originated from eight countries: China, Japan, Malaysia, Mongolia, South Korea, Thailand, and Russia (Figure 1A). The majority of GETV sequences uploaded from China belonged to the GETV Group III strain, mainly from Guangdong,Guangxi, and 13 other provinces (Figure 1B). This indicates that the GETV Group III strain is the predominant circulating strain in China, with Guangdong province having the most uploads, likely due to its climate and mosquito species diversity.

Figure 1
www.frontiersin.org

Figure 1. Distribution of GETV. (A) Distribution of GETV in Asia. Different color blocks represent different countries. (B) Distribution of GETV in China. The left chart shows the distribution of two different genotypes of GETV across 15 provinces in China, with both Fujian and Yunnan provinces containing two genotypes of GETV. The right chart shows the number of sequences uploaded from the 15 provinces. The X-axis represents the number of virus sequences, and the Y-axis represents the name of the province in China.

In all countries where the sequence is uploaded, Among these, the majority of sequences were from China (60 sequences), followed by Japan (15 sequences). Sequences from other countries were fewer: Malaysia (2 sequences), Mongolia (1 sequence), Russia (1 sequence), South Korea (2 sequences), and Thailand (1 sequence) (Figure 2A). In terms of genotype, Group III had the highest number of sequences (77), followed by Group IV, and finally Groups I and II, with two sequences each, originating from Malaysia and Japan respectively (Figure 2B).

Figure 2
www.frontiersin.org

Figure 2. Source and genotypes of GETV. (A) Sources of GETV(Asia). The horizontal axis represents the source countries, the vertical axis represents the number of sequences, and different countries are represented by different color blocks. The X-axis represents the country and the Y-axis represents the number of viral sequences (B) Genotypes of GETV. The horizontal axis represents the different genotypes of GETV, the vertical axis represents the number of sequences, and Group III is the dominant genotype of GETV. The X-axis represents the four different groupings of GETV, and the Y-axis represents the number of viral sequences.

3.2 GETV has host diversity

Based on the sequence information of all GETV whole genomes obtained (Supplementary Data 1), 81 sequences contained host information, while 5 sequences lacking host information were excluded from the analysis. GETV was found to be transmitted among various hosts through mosquitoes, primarily including vertebrates from the phylum Chordata, subphylum Vertebrata, and the phylum Arthropoda. Vertebrate hosts at the family level included Bovidae, Manidae, Equidae, Canidae, Sciuridae, and Suidae. Arthropoda hosts were exclusively from the Culicidae (Figure 3A).

Figure 3
www.frontiersin.org

Figure 3. Susceptible host types and distribution. (A) Susceptible host types of GETV at the family level (Asia). The X-axis represents the two family levels, and the Y-axis represents the number of viral sequences. (B) Distribution of GETV among susceptible hosts in China. Different colors on the map represent different genotypes. (BI) The color blocks on the map indicate the host classification, and the circles indicate the specific host species (BII) The X-axis represents the number of virus sequences, and the Y-axis represents the provinces of China. (BIII) The X-axis represents the provinces of China, and the Y-axis represents the number of virus sequences.

In China, GETV-susceptible hosts were mainly distributed across 15 provinces: Guangdong, Guangxi, Fujian, Henan, Jiangxi, Jilin, Yunnan, Hebei, Shandong, Hainan, Heilongjiang, Hubei, Jiangsu, Shanxi, and Sichuan (Figures 3BI-II). Provinces with infections in both vertebrates and arthropods included Hebei and Jilin. Provinces with infections only in vertebrates included Guangdong, Guangxi, Fujian, Henan, Jiangxi, Shandong, Heilongjiang, Hubei, Jiangsu, and Shanxi. Provinces with mosquito-only infections were Hainan and Sichuan (Figure 3BIII).

Downloading the mitochondrial cytb gene sequences of GETV hosts from the NCBI database, and based on the sequence information of GETV and hosts from NCBI, phylogenetic trees were constructed for the 82 GETV whole-genome sequences (Figure 4B, right) and the 10 hosts whole-genome sequences (Figure 4A, left). The entanglement relationships between hosts and viruses were also constructed (Figure 4B). By performing multiple sequence alignments and phylogenetic analyses, a total of 82 whole-genome sequences from NCBI were studied. Phylogenetic analysis shows that GETV has gradually evolved into four distinct genetic groups, of which the group I originated from the original GETV strain of mosquitoes found in Malaysia (MM2021), which is located at the root of the phylogenetic tree, indicating that it is the oldest strain. The group II included only two strains isolated from Japanese mosquitoes in 1956. The group IV included one strain isolated in 2000 from mosquitoes in Russia, one strain isolated in 2017 from wild boar in Thailand, and three strains isolated from China. Of the three sequences from China, one strain was isolated from mosquitoes in Yunnan in 2012, one from pangolins in 2020, and one from red-backed squirrels in Fujian Province in 2022. The remaining sequences all clustered into Group III in the phylogenetic evolutionary tree (Figure 4A). Notably, there were several intertwined linkages between mosquitoes, horses, and pigs in infection with GETV, which may be the result of frequent co-infections in these species.

Figure 4
www.frontiersin.org

Figure 4. Complete genome phylogenetic analysis and entanglement relationship between GETV and hosts. (A) The squares represent the groupings, and the circles represent the country of origin. (B) A phylogenetic tree was constructed using 82 full-genome sequences of GETV, with the best model selection being GTR+G+I. Additionally, 10 different host mitochondrial DNA sequences were used for tree construction, with the best model selection being TPM2u+F+R3. Branches and associations are colored according to host types. Tree scale 1 represents the unit in which each unit length corresponds to 1 nucleotide or amino acid substitution; Tree Scale 10 represents the unit in which each unit length corresponds to 10 nucleotide or amino acid substitutions.

3.3 Transmission and adaptive evolution of GETV

Based on the information uploaded to NCBI, it is evident that mosquitoes can horizontally transmit GETV to cows, pangolins, horses, dogs, squirrels, and pigs. The transmission of GETV is mainly caused by mosquito bites and thus spread to various hosts. Vertical transmission of arboviruses in mosquitoes (from infected females to their offspring) is considered a mechanism for maintaining the virus during unfavorable horizontal transmission conditions among vertebrate hosts, such as harsh winters, dry conditions, and interepidemic periods limited by herd immunity, and may influence the epidemiology of arbovirus infections (Figure 5) (Lequime et al., 2016).

Figure 5
www.frontiersin.org

Figure 5. Vector transmission of GETV. The vector transmission of GETV occurs through three main modes: horizontal transmission between vertebrate hosts and vectors, vertical transmission from infected female mosquitoes to their offspring, and sexual transmission from infected female mosquitoes to male mosquitoes and vice versa.

We analyzed the mutation sites in the 82 GETV whole genome sequences, revealing that most mutations occurred in non-structural proteins, with eight mutations in the NSP1 gene and 11 in the NSP3 gene. Mutations were also detected in structural proteins, although fewer than in non-structural proteins, with four mutations in the C gene and five in the E2 gene (Figure 6). Under positive selection, we identified seven positively selected sites in the 82 GETV whole genome sequences. The positions of these sites in the whole genome sequence were: site 427, site 1770, site 1835, site 2372, site 2542, site 2783, and site 3551 (Table 1). Visualization of amino acid variations encoding adaptive sites showed that only at site 427, there was an amino acid variation where arginine changed to serine (Figure 7).

Figure 6
www.frontiersin.org

Figure 6. Evolution of amino acid adaptive sites in GETV. Single nucleotide variants in each gene of GETV strains obtained from NCBI. Amino acid variation in polyproteins is distributed along the genome. The 82 GETV complete genome sequences downloaded from the NCBI database show only mutations that appear in more than 10 serial lines. NSP1–4 represents non-structural proteins, C, E3, E2, 6K, E1 represent structural proteins, and UTR represents untranslated regions.

Table 1
www.frontiersin.org

Table 1. Analysis of positive selection sites in GETV amino acids.

Figure 7
www.frontiersin.org

Figure 7. Analysis of amino acid variations in GETV. Visualization analysis of the amino acid positive selection sites detected in Table 1, with the horizontal axis representing the positions of amino acids.

4 Discussion

Due to the ability of Alphaviruses to cross species barriers multiple times, the transmission of viruses within the Alphavirus genus poses a significant risk to both animal and human health. Bannai et al. demonstrated that GETV can be transmitted from horses to pigs (Bannai et al., 2017). Therefore, we examined the characteristics of GETV in terms of epidemiology, host diversity, and adaptive evolution.

In our study, by integrating sequence information uploaded to NCBI, we found that the GETV full-genome sequences mainly come from eight countries: China, Japan, Malaysia, Mongolia, South Korea, Thailand, Russia, and others. The sequences from Russia are geographically located in the northeastern part of Asia. Globally, since the original GETV strain (MM2021) was first isolated from Malaysia in 1955, GETV has been isolated in 13 countries, including Indochina countries in Asia (Vietnam, Thailand, Cambodia), Southeast Asian countries (Malaysia, Philippines), South Asian subcontinent (India, Sri Lanka), Australia, Japan, South Korea, Mongolia, China, and Russia (Ksiazek et al., 1981; Peiris et al., 1994; Bryant et al., 2005; Liu et al., 2017; Li et al., 2021). China is the country with the most uploaded GETV sequences, followed by Japan. China, Japan, and South Korea serve as the hubs for GETV transmission, covering the vast majority of strains. These three countries have rapidly developed and extensive livestock industries, with most animal products sold to neighboring regions, thereby accelerating the local spread of GETV. Studies on the transmission pathways of GETV have shown that Malaysia, Japan, and Yunnan province in China are the three main transmission sites of the virus. GETV probably spread from Malaysia to Japan in 1917, from Japan to Hainan province in China in the 1960s, from Japan to Yunnan province in China in the 1970s, and then through Yunnan province in China to Hebei province, Shanghai, Gansu province, and Sichuan province, China (Bryant et al., 2005; Li et al., 2017; Shi et al., 2022). In these countries, numerous convergence points in the GETV transmission routes form a closed co-circulation loop (Shi et al., 2022). This reminds us that at the moment of increasingly globalization, various countries have close exchanges, and the conditions for the dissemination of GETV should be strictly controlled by the GETV detection methods for the import and export of animal products of various countries.

In our study, through multiple sequence alignments and phylogenetic analysis of known GETV full genomes, the results show that GETV can be divided into four genotypes (Group I-IV). Group I consists of the original GETV strain (MM2021) isolated in Malaysia in 1955. Group II includes only the strain isolated from Japanese mosquitoes in 1956. Group IV comprises one strain isolated from mosquitoes in Russia (LEIV-16275-MAG, 2000), one strain from Yunnan province, China (YN12031, 2012), one strain isolated from pangolins in China (2020), one strain from red-backed squirrels in Fujian, China (Rbsq202206, 2022), and one strain from Thailand (SW). This indicates that Group IV has still spread in China in recent years and has been found in new hosts. The continued spread of Group IV virus indicates that GETV is breaking through the traditional host boundaries. This highlights the need for preventive measures against GETV in animals to prevent outbreaks. The remaining strains are classified into Group III. In terms of numbers, Group III is the dominant strain of GETV. The absolute advantage of the number of Group III strains reveals the adaptive evolution of GETV during transmission to promote the continuous transmission of Group III strains. The phylogenetic tree results show that Groups I, II, and IV have become stable viral populations in local regions, while the geographical range of Group III has rapidly expanded. Therefore, enhancing the detection and monitoring of GETV infections is crucial for the timely identification of animal outbreaks, thereby reducing economic losses in livestock. Additionally, although GETV outbreaks causing human diseases have not been reported, monitoring people involved in livestock farming (especially those working with horses and pigs) in GETV-endemic areas is particularly important to avoid cross-species transmission to humans. In China, a total of 15 provinces have uploaded full-genome sequences of GETV. The first isolation of GETV in China was from a Culex mosquito collected on Hainan Island in 1964, later identified as GETV. Guangdong and Guangxi provinces have uploaded the most sequences, likely due to the extensive livestock industry and widespread GETV infection in pigs in these regions. In our study, two provinces uploaded sequences from two different GETV genotypes, one of which is Yunnan province. Among the provinces where GETV has been found, Yunnan has been conducting GETV-related research earlier and has isolated the most strains. Yunnan is located in tropical and subtropical regions, with year-round high temperatures suitable for mosquito survival, providing favorable conditions for the circulation of GETV. GETV strains have been isolated in multiple locations in Yunnan, such as Tengchong and Ruili (Zhang et al., 2020).

It has long been widely believed that GETV primarily infects horses and pigs (Bannai et al., 2019; Sun et al., 2022). Pigs are considered the primary amplifying hosts because they develop viremia after being infected with GETV (Kuwata et al., 2018). Horses are important hosts for the amplification and circulation of GETV (Bannai et al., 2015). In our study, the host species of GETV include ten different types: mosquitoes (Culex, Aedes, and Armigeres), cattle, pangolins, horses, dogs, foxes, squirrels, and pigs. Notably, from 2014 to 2023, GETV whole genome sequences have been isolated from various hosts each year. Specifically, in 2020, a strain containing the complete genome sequence of GETV was isolated from a pangolin, and in 2022, strains containing the whole genome sequences of GETV were isolated from a squirrel and a dog. The continual discovery and isolation of GETV strains in new host species indicate that the range of hosts for GETV is expanding, and the virus is spreading among different hosts. In our study, by intertwining the phylogenetic tree of host species with the phylogenetic tree of GETV whole genome sequences, we further confirmed the co-circulation of GETV between mosquitoes and other animals. The infection in different species is essential for maintaining the zoonotic transmission cycle of GETV in nature. In recent years, new species infected with GETV have been frequently reported in China. However, due to the lack of in-depth research on the cross-species transmission events of GETV, GETV has only been around for 70 years, so the scale of evolutionary events of host and virus differentiation varies by orders of magnitude. In conclusion, our study suggests that cross-species transmission of GETV in recent years should be taken seriously to prevent GETV outbreaks in new hosts.

Azerigyik et al. (2023) have shown that mosquitoes are the primary vectors of GETV. Mosquitoes carrying GETV have been isolated from four genera and eight species, including four species of Culex (Culex tritaeniorhynchus, Culex pseudovishnui, Culex gelidus, and Culex annulirostris), one species of Aedes (Aedes albopictus), one species of Anopheles (Anopheles sinensis), one species of Armigeres (Armigeres subalbatus), and one species of mixed mosquitoes. Among these, Culex tritaeniorhynchus and Armigeres subalbatus are widely distributed in China, and often found around rural livestock pens and breeding farms. They have high reproductive capacity and large populations, making them considered dominant mosquito species with significant importance in carrying and transmitting GETV. Hubálek et al. (2014) have also isolated GETV from biting midges Culicoides spp. These studies indicate that the variety of mosquito species transmitting GETV is continually expanding, and the diversity of vectors has increased. Surveys conducted among febrile patients and healthy individuals have found the presence of GETV-specific IgM and IgG antibodies in some humans. The antibody titers in febrile patients were significantly higher than in healthy individuals, suggesting that GETV infection may be associated with human diseases (Li et al., 2019; Lu et al., 2019). GETV-specific antibodies have also been detected in serum samples from chickens, ducks, goats, and birds. These studies collectively indicate that the types of hosts infected by GETV are increasing, and the host range is expanding. Seroepidemiological tests in animals or humans infected with GETV can be detected by serological methods, such as neutralization test (NT) (Matsumura et al., 1981), complement fixation test (CFT) (Li et al., 2016), hemagglutination inhibition test (HIT) (Hohdatsu et al., 1990), enzyme-linked immunosorbent assay (ELSA) (Kuwata et al., 2018), etc. ELSA is by far the most commonly used method. At present, there is no commercially available antigen detection method, so the etiological detection of GETV mainly includes viral gene amplification and virus isolation, viral gene amplification mainly includes reverse transcription-polymerase chain reaction (RT-PCR) (Wang et al., 2018), and for virus isolation, the blood-sucking insect specimens or animal specimens that can be collected are ground and centrifuged, and the supernatant is inoculated into tissue culture cells (Simpson et al., 1975).

Studies have shown (Gorchakov et al., 2012) that single nucleotide substitutions can significantly alter Alphavirus vector specificity and pathogenicity. We conducted whole-genome mutation, selection, and adaptive evolution analyses of GETV, identifying 21 amino acid mutations in non-structural proteins and 11 in structural proteins. The non-structural protein mutations include eight in NSP1 (M313V, I427T, Q465R, E468K, M475T, S478G, A498V, T521A), 11 in NSP3 (P1661S, A1672T, V1713T, Q1717R, I1730T, A1742T, Q1774P, L1775S, I1790T, T1793P, G1799E), and two in NSP4 (R2036Q, T2372A). The structural protein mutations include four in the capsid (F2487Y, S2537P, T2544A, T2546A), five in E2 (S2826F, H2885Y, T2889V, I2921T, D3061N), and two in E1 (K3353M, T3659M). Since the interaction between E1 and E2 is crucial for viral budding and entry, studying the amino acid variations in E1 and E2 is essential (Mukhopadhyay et al., 2006). At the same time, this study revealed the molecular adaptation mechanism of GETV through positive selection site analysis, and identified seven forward selection sites. It is worth noting that E2 and E1 proteins are labeled as potential selection pressure sites by multiple algorithms. These mutations provide evidence of genetic diversity in the GETV genome. Further research into the adaptive evolution of GETV amino acids through positive selection analysis identified seven positively selected sites, encoding Arg, Lys, Asp, Tyr, Val, Val, Gly. Hydrophilic amino acids (e.g., Arg, Lys, Asp) typically interact with the aqueous environment on protein surfaces, while hydrophobic amino acids (e.g., Val) are more likely to be found inside proteins, helping to form a stable hydrophobic core.

Our study offers novel insights into the distribution of GETV in Asia, the diversity of susceptible hosts, and adaptive sites. As GETV continues to evolve and its host range expands, comprehensive monitoring of GETV is necessary to prevent outbreaks from escalating into pandemics, especially in countries like China, Japan, and South Korea, where GETV is widespread.

In this study, we found that China is the country with the most uploaded sequences in the world, although we know that the number of GETV sequences is not enough compared to that of other alphaviruses, which may also be due to the delay in sequence upload in some countries and regions. In the future, when the number of GETV sequences increases, the study of GETV may be more complete to better prevent the epidemic of GETV. It is undeniable that this study also has certain limitations, due to the lack of research on GETV, the coverage of the whole genome sequence of GETV obtained from the NCBI database is limited, and secondly, the study only integrates genomic and geographical distribution data, but does not combine serological investigation, host population distribution, disease incidence, etc., which has certain limitations, despite the above challenges and limitations, this study systematically reveals the three key characteristics of GETV in China, and provides an important scientific basis for the formulation of prevention and control strategies. First, the continuous selection pressure of key sites such as NSP1 protein suggests that the virus may be breaking through the host immune barrier. Second, the risk of zoonoses has escalated, and antibodies have been detected in healthy people, indicating that potential public health threats are accumulating. Finally, it should be noted that group III virus has become a regional epidemic in China, which suggests that we need timely prevention of GETV. In summary, the number of vertebrate infections in GETV in China has increased rapidly in recent years, and the host range has been expanding. However, the threat posed by the Chinese GETV has not yet attracted sufficient attention. In order to prevent and control GETV, a series of relevant measures must be taken.

5 Conclusion

In this study, we identified that GETV is rapidly expanding its host range and geographic distribution at a high evolutionary rate. Coevolutionary analysis of GETV and its host showed that GETV had cross-species transmission events. Finally, we identified important adaptive and positive selection sites. Our study shows that GETV is widely distributed in China and is constantly infecting new hosts.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Author contributions

YY: Writing – original draft, Investigation, Methodology, Software. YH: Software, Writing – original draft. CP: Conceptualization, Writing – review & editing. DZ: Methodology, Writing – review & editing. WM: Methodology, Writing – review & editing. PX: Writing – review & editing. NL: Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the National Natural Science Foundation of China [grant number 32002312], Science and Technology Project of Wenzhou, Zhejiang, China [grant number Y2023024].

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2025.1513392/full#supplementary-material

References

Azerigyik, F. A., Faizah, A. N., Kobayashi, D., Amoa-Bosompem, M., Matsumura, R., Kai, I., et al. (2023). Evaluating the mosquito host range of Getah virus and the vector competence of selected medically important mosquitoes in Getah virus transmission. Parasit Vectors. 16, 99. doi: 10.1186/s13071-023-05713-4

PubMed Abstract | Crossref Full Text | Google Scholar

Bannai, H., Nemoto, M., Niwa, H., Murakami, S., Tsujimura, K., Yamanaka, T., et al. (2017). Geospatial and temporal associations of Getah virus circulation among pigs and horses around the perimeter of outbreaks in Japanese racehorses in 2014 and 2015. BMC Vet. Res. 13, 187. doi: 10.1186/s12917-017-1112-6

PubMed Abstract | Crossref Full Text | Google Scholar

Bannai, H., Nemoto, M., Ochi, A., Kikuchi, T., Kobayashi, M., Tsujimura, K., et al. (2015). Epizootiological investigation of Getah virus infection among racehorses in Japan in 2014. J. Clin. Microbiol. 53, 2286–2291. doi: 10.1128/JCM.00550-15

PubMed Abstract | Crossref Full Text | Google Scholar

Bannai, H., Nemoto, M., Tsujimura, K., Yamanaka, T., and Kokado, H. (2019). Development of an enzyme-linked immunosorbent assay for Getah virus infection in horses using recombinant E2 protein as an antigen. J. Virol. Methods 271, 113681. doi: 10.1016/j.jviromet.2019.113681

PubMed Abstract | Crossref Full Text | Google Scholar

Bryant, J. E., Crabtree, M. B., Nam, V. S., Yen, N. T., Duc, H. M., and Miller, B. R. (2005). Isolation of arboviruses from mosquitoes collected in northern Vietnam. Am. J. Trop. Med. Hyg. 73, 470–473. doi: 10.4269/ajtmh.2005.73.470

PubMed Abstract | Crossref Full Text | Google Scholar

Carey, B. D., Bakovic, A., Callahan, V., Narayanan, A., and Kehn-Hall, K. (2019). New World Alphavirus protein interactomes from a therapeutic perspective. Antiviral Res. 163, 125–139. doi: 10.1016/j.antiviral.2019.01.015

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, R., Mukhopadhyay, S., Merits, A., Bolling, B., Nasar, F., Coffey, L. L., et al. (2018). ICTV virus taxonomy profile: togaviridae. J. Gen. Virol. 99, 761–762. doi: 10.1099/jgv.0.001072

PubMed Abstract | Crossref Full Text | Google Scholar

Fukunaga, Y., Kumanomido, T., and Kamada, M. (2000). Getah virus as an equine pathogen. Vet. Clin. North Am. Equine Pract. 16, 605–617. doi: 10.1016/S0749-0739(17)30099-8

PubMed Abstract | Crossref Full Text | Google Scholar

Gorchakov, R., Wang, E., Leal, G., Forrester, N. L., Plante, K., Rossi, S. L., et al. (2012). Attenuation of Chikungunya virus vaccine strain 181/clone 25 is determined by two amino acid substitutions in the E2 envelope glycoprotein. J. Virol. 86, 6084–6096. doi: 10.1128/JVI.06449-11

PubMed Abstract | Crossref Full Text | Google Scholar

Hohdatsu, T., Ide, S., Yamagishi, H., Eiguchi, Y., Nagano, H., Maehara, N., et al. (1990). Enzyme–linked immunosorbent assay for the serological survey of Getah virus in pigs. Jpn. J. Vet. Sci. 52, 835–837. doi: 10.1292/jvms1939.52.835

PubMed Abstract | Crossref Full Text | Google Scholar

Hubálek, Z., Rudolf, I., and Nowotny, N. (2014). Arboviruses pathogenic for domestic and wild animals. Adv. Virus Res. 89, 201–275. doi: 10.1016/B978-0-12-800172-1.00005-7

PubMed Abstract | Crossref Full Text | Google Scholar

Imagawa, H., Ando, Y., Kamada, M., Sugiura, T., Kumanomido, T., Fukunaga, Y., et al. (1981). Seroepizootiological survey on Getah virus infection in light horses in Japan. Jpn. J. Vet. Sci. 43, 797–802. doi: 10.1292/jvms1939.43.797

PubMed Abstract | Crossref Full Text | Google Scholar

Katoh, K., Misawa, K., Kuma, K., and Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30, 3059–3066. doi: 10.1093/nar/gkf436

PubMed Abstract | Crossref Full Text | Google Scholar

Kosakovsky Pond, S. L. and Frost, S. D. (2005). Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol. Biol. Evol. 22, 1208–1222. doi: 10.1093/molbev/msi105

PubMed Abstract | Crossref Full Text | Google Scholar

Ksiazek, T. G., Trosper, J. H., Cross, J. H., and Basaca-Sevilla, V. (1981). Isolation of Getah virus from Nueva Ecija province, Republic of the Philippines. Trans. R. Soc Trop. Med. Hyg. 75, 312–313. doi: 10.1016/0035-9203(81)90346-1

PubMed Abstract | Crossref Full Text | Google Scholar

Kuwata, R., Shimoda, H., Phichitraslip, T., Prasertsincharoen, N., Noguchi, K., Yonemitsu, K., et al. (2018). Getah virus epizootic among wild boars in Japan around 2012. Arch. Virol. 163, 2817–2821. doi: 10.1007/s00705-018-3897-4

PubMed Abstract | Crossref Full Text | Google Scholar

Larsson, A. (2014). AliView: a fast and lightweight alignment viewer and editor for large datasets. Bioinformatics 30, 3276–3278. doi: 10.1093/bioinformatics/btu531

PubMed Abstract | Crossref Full Text | Google Scholar

Lequime, S., Paul, R. E., and Lambrechts, L. (2016). Determinants of arbovirus vertical transmission in mosquitoes. PLoS Pathog. 12, e1005548. doi: 10.1371/journal.ppat.1005548

PubMed Abstract | Crossref Full Text | Google Scholar

Letunic, I. and Bork, P. (2021). Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296. doi: 10.1093/nar/gkab301

PubMed Abstract | Crossref Full Text | Google Scholar

Li, Y., Fu, S., Guo, X., Li, X., Li, M., Wang, L., et al. (2019). Serological survey of Getah virus in domestic animals in yunnan province, China. Vector Borne Zoonotic Dis. 19, 59–61. doi: 10.1089/vbz.2018.2273

PubMed Abstract | Crossref Full Text | Google Scholar

Li, N., He, Y. W., Zuo, Y. Y., Meng, J. X., Xu, T. G., and Wang, J. L. (2021). Analysis of molecular biological characteristics of structural genes of Getah virus (SZC30) isolated from midge. China Anim. Husb. Vet. Med. 48, 1149–1160. doi: 10.16431/j.cnki.1671-7236.2021.04.001

Crossref Full Text | Google Scholar

Li, Y. Y., Liu, H., Li, X. L., Fu, S. H., Gao, X. Y., Lei, W. W., et al. (2017). Phylogenctics of Getah virus and its migration. Chin. J. Zoonoses 33, 293–299. doi: 10.3969/j.issn.1002-2694.2017.04.001

Crossref Full Text | Google Scholar

Li, M., Shi, Z., Fang, C., Gao, A., Li, C. M., and Yu, L. (2016). Versatile microfluidic complement fixation test for disease biomarker detection. Anal Chim Acta. 916, 67–76. doi: 10.1016/j.aca.2016.02.026

PubMed Abstract | Crossref Full Text | Google Scholar

Li, B., Wang, H., and Liang, G. (2022). Getah virus (Alphavirus): an emerging, spreading zoonotic virus. Pathogens 11, 945. doi: 10.3390/pathogens11080945

PubMed Abstract | Crossref Full Text | Google Scholar

Li, K., Yan, S., Wang, N., He, W., Guan, H., He, C., et al. (2020). Emergence and adaptive evolution of Nipah virus. Transbound Emerg. Dis. 67, 121–132. doi: 10.1111/tbed.13330

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, S., Jiapaer, H., Xue, X. M., Ding, M. Y., Ma, X. J., Ye, F., et al. (2017). Isolation and identification of Getah virus from culicoides in Xinjiang. Aeta Vet. Zootech. Sin. 48, 1998–2004. doi: 10.11843/j/issn.0366-6964.2017.10.024

Crossref Full Text | Google Scholar

Lu, G., Ou, J., Ji, J., Ren, Z., Hu, X., Wang, C., et al. (2019). Emergence of getah virus infection in horse with fever in China, 2018. Front. Microbiol. 10, 1416. doi: 10.3389/fmicb.2019.01416

PubMed Abstract | Crossref Full Text | Google Scholar

Matsumura, T., Goto, H., Shimizu, K., Ando, Y., Imagawa, H., Sugiura, T., et al. (1981). Prevalence of antibodies against Getah virus in horses raised in Hokkaido. Jpn. J. Vet. Sci. 43, 783–786. doi: 10.1292/jvms1939.43.783

PubMed Abstract | Crossref Full Text | Google Scholar

Mukhopadhyay, S., Zhang, W., Gabler, S., Chipman, P. R., Strauss, E. G., Strauss, J. H., et al. (2006). Mapping the structure and function of the E1 and E2 glycoproteins in Alphaviruses. Structure 14, 63–73. doi: 10.1016/j.str.2005.07.025

PubMed Abstract | Crossref Full Text | Google Scholar

Murrell, B., Wertheim, J. O., Moola, S., Weighill, T., Scheffler, K., and Kosakovsky Pond, S. L. (2012). Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 8, e1002764. doi: 10.1371/journal.pgen.1002764

PubMed Abstract | Crossref Full Text | Google Scholar

Nemoto, M., Bannai, H., Tsujimura, K., Kobayashi, M., Kikuchi, T., Yamanaka, T., et al. (2015). Getah virus infection among racehorses, Japan, 2014. Emerg. Infect. Dis. 21, 883–885. doi: 10.3201/eid2105.141975

PubMed Abstract | Crossref Full Text | Google Scholar

Nguyen, L. T., Schmidt, H. A., von Haeseler, A., and Minh, B. Q. (2015). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274. doi: 10.1093/molbev/msu300

PubMed Abstract | Crossref Full Text | Google Scholar

Peiris, J. S., Amerasinghe, P. H., Amerasinghe, F. P., Calisher, C. H., Perera, L. P., Arunagiri, C. K., et al. (1994). Viruses isolated from mosquitoes collected in Sri Lanka. Am. J. Trop. Med. Hyg. 51, 154–161. doi: 10.4269/ajtmh.1994.51.154

PubMed Abstract | Crossref Full Text | Google Scholar

Scherer, W. F., Funkenbusch, M., Buescher, E. L., and Izumit (1962). Sagiyama virus, a new group A arthropod-borne virus from Japan. I. Isolation, immunologic classification, and ecologic observations. Am. J. Trop. Med. Hyg. 11, 255–268. doi: 10.4269/ajtmh.1962.11.255

PubMed Abstract | Crossref Full Text | Google Scholar

Shi, N., Zhu, X., Qiu, X., Cao, X., Jiang, Z., Lu, H., et al. (2022). Origin, genetic diversity, adaptive evolution and transmission dynamics of Getah virus. Transbound Emerg. Dis. 69, e1037–e1050. doi: 10.1111/tbed.14395

PubMed Abstract | Crossref Full Text | Google Scholar

Simpson, D. I., Way, H. J., Platt, G. S., Bowen, E. T., Hill, M. N., Kamath, S., et al. (1975). Arbovirus infections in Sarawak, October 1968–February 1970: GETAH virus isolations from mosquitoes. Trans. R Soc Trop. Med. Hyg. 69, 35–38. doi: 10.1016/0035-9203(75)90008-5

PubMed Abstract | Crossref Full Text | Google Scholar

Smith, M. D., Wertheim, J. O., Weaver, S., Murrell, B., Scheffler, K., and Kosakovsky Pond, S. L. (2015). Less is more: an adaptive branch-site random effects model for efficient detection of episodic diversifying selection. Mol. Biol. Evol. 32, 1342–1353. doi: 10.1093/molbev/msv022

PubMed Abstract | Crossref Full Text | Google Scholar

Sun, Q., Xie, Y., Guan, Z., Zhang, Y., Li, Y., Yang, Y., et al. (2022). Seroprevalence of Getah virus in pigs in eastern China determined with a recombinant E2 protein-based indirect ELISA. Viruses 14, 2173. doi: 10.3390/v14102173

PubMed Abstract | Crossref Full Text | Google Scholar

Thakur, P., Thakur, V., Sapra, M., Srivastava, S., and Patel, S. K. S. (2024). Post COVID-19 era: Re-emergence of known and future (X) viral pathogens. Vacunas (English Edition) 25, 403–414. doi: 10.1016/j.vacune.2024.08.010

Crossref Full Text | Google Scholar

Wang, N., Zhai, X., Li, X., Wang, Y., He, W. T., Jiang, Z., et al. (2022). Attenuation of Getah virus by a single amino acid substitution at residue 253 of the E2 protein that might be part of a new heparan sulfate binding site on alphaviruses. J. Virol. 96, e0175121. doi: 10.1128/jvi.01751-21

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, A., Zhou, F., Chang, H., Wang, X., Chen, L., Cui, D., et al. (2018). Molecular detection, isolation and identification of the porcine Getah virus from pig herds in four provinces, China. Chin. J. Virol. 34, 522–532. doi: 10.13242/j.cnki.bingduxuebao.003398

Crossref Full Text | Google Scholar

Zhai, Y. G., Wang, H. Y., Sun, X. H., Fu, S. H., Wang, H. Q., Attoui, H., et al. (2008). Complete sequence characterization of isolates of Getah virus (genus Alphavirus, family Togaviridae) from China. J. Gen. Virol. 89, 1446–1456. doi: 10.1099/vir.0.83607-0

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, H. L., Hu, T. S., and Zhang, F. Q. (2020). Progress on Getah virus research in China and its public health significance. Chin. J. Zoonoses. 36, 229–233,244. doi: 10.3969/j.issn.1002-2694.2020.00.012

Crossref Full Text | Google Scholar

Zhou, Z. J., Qiu, Y., Pu, Y., Huang, X., and Ge, X. Y. (2020). BioAider: An efficient tool for viral genome analysis and its application in tracing SARS-CoV-2 transmission. Sustain Cities Soc 63, 102466. doi: 10.1016/j.scs.2020.102466

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Getah virus, epidemiological situation, host range, cross-species, adaptive evolution

Citation: Yuan Y, Hao Y, Peng C, Zhang D, Ma W, Xiao P and Li N (2025) From transmission to adaptive evolution: genomic surveillance of Getah virus. Front. Cell. Infect. Microbiol. 15:1513392. doi: 10.3389/fcimb.2025.1513392

Received: 18 October 2024; Accepted: 12 May 2025;
Published: 04 June 2025.

Edited by:

Yong Qi, Huadong Research Institute for Medicine and Biotechniques, China

Reviewed by:

Sanjay Kumar Singh Patel, Hemwati Nandan Bahuguna Garhwal University, India
Hao Liu, Foshan University, China

Copyright © 2025 Yuan, Hao, Peng, Zhang, Ma, Xiao and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Pengpeng Xiao, eGlhb3BlbmdwZW5nMTAxMEAxMjYuY29t; Nan Li, MjAxOTAyODNAd3p1LmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.