Skip to main content


Front. Microbiol., 04 March 2020
Sec. Virology

Metagenomic Analysis of Plant Viruses Associated With Papaya Ringspot Disease in Carica papaya L. in Kenya

\r\nNaomi Nzilani Mumo*Naomi Nzilani Mumo1*George Edward MamatiGeorge Edward Mamati1Elijah Miinda AtekaElijah Miinda Ateka1Fredah K. RimberiaFredah K. Rimberia1George Ochieng&#x; Asudi,George Ochieng’ Asudi2,3Laura M. BoykinLaura M. Boykin4Eunice M. MachukaEunice M. Machuka5Joyce Njoki NjugunaJoyce Njoki Njuguna5Roger PelleRoger Pelle5Francesca Stomeo*&#x;Francesca Stomeo5*†
  • 1Department of Horticulture and Food Security, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya
  • 2Department of Biochemistry, Microbiology and Biotechnology, Kenyatta University, Nairobi, Kenya
  • 3Department of Plant Physiology, Faculty of Bioscience, Matthias-Schleiden-Institute, Friedrich Schiller University Jena, Jena, Germany
  • 4ARC Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, WA, Australia
  • 5Biosciences Eastern and Central Africa-International Livestock Research Institute Hub, Nairobi, Kenya

Carica papaya L. is an important fruit crop grown by small- and large-scale farmers in Kenya for local and export markets. However, its production is constrained by papaya ringspot disease (PRSD). The disease is believed to be caused by papaya ringspot virus (PRSV). Previous attempts to detect PRSV in papaya plants showing PRSD symptoms, using enzyme-linked immunosorbent assay (ELISA) and reverse transcriptase-polymerase chain reaction (RT-PCR) procedures with primers specific to PRSV, have not yielded conclusive results. Therefore, the nature of viruses responsible for PRSD was elucidated in papaya leaves collected from 22 counties through Illumina MiSeq next-generation sequencing (NGS) and validated by RT-PCR and Sanger sequencing. Viruses were detected in 38 out of the 48 leaf samples sequenced. Sequence analysis revealed the presence of four viruses: a Potyvirus named Moroccan watermelon mosaic virus (MWMV) and three viruses belonging to the genus Carlavirus. The Carlaviruses include cowpea mild mottle virus (CpMMV) and two putative Carlaviruses—closely related but distinct from cucumber vein-clearing virus (CuVCV) with amino acid and nucleotide sequence identities of 75.7–78.1 and 63.6–67.6%, respectively, in the coat protein genes. In reference to typical symptoms observed in the infected plants, the two putative Carlaviruses were named papaya mottle-associated virus (PaMV) and papaya mild mottle-associated virus (PaMMV). Surprisingly, and in contrast to previous studies conducted in other parts of world, PRSV was not detected. The majority of the viruses were detected as single viral infections, while a few were found to be infecting alongside another virus (for example, MWMV and PaMV). Furthermore, the NGS and RT-PCR analysis identified MWMV as being strongly associated with ringspot symptoms in infected papaya fruits. This study has provided the first complete genome sequences of these viruses isolated from papaya in Kenya, together with primers for their detection—thus proving to be an important step towards the design of long-term, sustainable disease management strategies.


Papaya (Carica papaya L.) is an important fruit crop both in the tropical and subtropical regions (Mishra et al., 2007). The fruit crop is ranked fourth worldwide among tropical fruits, after banana, mango, and pineapple (Edward and Ballen, 2015). Ripe fruits are rich in vitamins A and C, and based on the recommended daily allowance for these vitamins, papaya is ranked first among the 38 most common fruits (Ming et al., 2008). The levels of vitamins A and C from one medium-sized papaya (whose edible portion is about 350 g) exceed the Dietary Reference Intakes of 3000 IU for vitamin A and 90 mg for vitamin C, as established by the United States Food and Nutrition Board for adult minimum daily requirements (Ming et al., 2008). Consumption of this fruit is important in preventing vitamin A deficiency, a cause of childhood blindness in many developing countries, including Kenya (Oyunga et al., 2016). The fruit is reasonably priced and is rich in nutrients, making it a “common man’s” fruit. Papaya is also a source of papain, a proteolytic enzyme obtained by collecting and drying the latex exuded from scratches on the surfaces of slightly immature papaya fruits. The enzyme is purified and used in foods, beverages, pharmaceuticals, and by manufacturing industries (Yogiraj et al., 2014).

In Kenya, the fruit crop is grown in many regions (mostly by small-scale farmers for subsistence), and is ranked sixth for its economic importance after the banana, pineapple, mango, avocado, and watermelon (Asudi, 2010; HCDA, 2016). Large quantities of fruits are mainly consumed locally as a dessert or used in making jams and purée, while the leaves serve as compost. There are few large-scale farmers who produce the fruit as a source of income for local and export markets (Asudi, 2010).

Despite its importance, national economies of many papaya-growing nations are jeopardized by the papaya ringspot disease (PRSD). The disease affects papaya plants at all stages of growth and naturally spreads very quickly, leading to infection of the whole orchard within 3–7 months with severe yield losses of up to 100% (Ventura et al., 2004; Tripathi et al., 2008; Sharma and Tripathi, 2014). A characteristic symptom of the disease on infected plants is the production of ringed spots on the fruits (Gonsalves, 1998; Sharma et al., 2014). Other symptoms of the disease include vein clearing, mottling, mosaic chlorotic spots, leaf curling, green blisters, and distortion of leaves termed as “shoe-stringing.” Reduction in size of the leaf canopies, as the disease advances, results in stunted growth of the plant. Irregular oily, or water-soaked streaks or marks are seen on stems and leaf petioles. These symptoms can occur together or separately. Fruits affected by this disease are of poor quality with low sugar levels, attracting low prices both at local and export market levels (Tripathi et al., 2008; Sharma and Tripathi, 2014). In addition, if plants are infected with the disease—either at the seedling stage or within 2 months after planting—they fail to produce mature fruits, and the affected papaya orchards have a short lifespan of less than a year (Gonsalves, 1998; Tennant et al., 2007). The impact of the disease on rural farming communities has been extreme, due to the fact that papaya trees can no longer be grown without a high possibility of being damaged (Sakuanrungsirikul et al., 2014). The disease is known to be caused by papaya ringspot virus (PRSV), a Potyvirus in the family Potyviridae (Tripathi et al., 2007, 2008; Sharma and Tripathi, 2014).

A study conducted in Kenya documenting papaya cultivation between 2008 and 2009 reported PRSD as the main constraint to papaya production in several regions of the country, including the Coast, Central, Rift Valley, Western, and Eastern regions (Asudi, 2010; Ombwara et al., 2014). Several attempts to detect PRSV in papaya plants that showed PRSD symptoms using double antibody sandwich enzyme-linked immunosorbent assay (ELISA) were not conclusive (Ombwara et al., 2014). Reverse transcriptase-polymerase chain reaction (RT-PCR) procedures using published PRSV primers (Hema and Prasad, 2004; Jain et al., 2004; Diallo et al., 2008; Omar et al., 2011; Srinivasulu and Sai Gopal, 2011; Mohammed et al., 2012; Martínez et al., 2014) also failed to detect the virus. The same failure occurred using primers designed based on PRSV sequences available in GenBank to amplify the virus in symptomatic plants (unpublished data). This has resulted in the notion that there could be a different strain of PRSV in Kenya, or a different virus(es) infecting papaya in the country.

With the development of next-generation sequencing (NGS) technology, plant virus discovery, diagnostics, and evolutionary studies have increased and improved enormously (Roossinck, 2017). The technology can be used to identify plant viruses in a given sample with or without prior knowledge of the viral types present, and it can also reveal the presence of novel and unsuspected agents. The approach has also been helpful for viral co-infection detection in many plants (Candresse et al., 2014; Roossinck, 2015; Akinyemi et al., 2016; Blawid et al., 2017). Therefore, we used NGS approach coupled with RT-PCR and Sanger sequencing to identify and characterize the virus(es) causing symptoms associated with ringspot disease on papaya in Kenya. We believe this to be the first application of NGS technology in assessing viruses associated with PRSD infecting this important fruit crop in Kenya and that it is likely to help in the design of long-term and sustainable disease management strategies in the country.

Materials and Methods

Sample Collection

Field surveys and sampling were carried out during February–April of 2017 in 22 administrative regions (counties) in Kenya—namely, Taita Taveta, Kwale, Kilifi, Bungoma, Busia, Siaya, Vihiga, Kisumu, Homabay, Migori, Nakuru, Baringo, Elgeyo Marakwet, Kiambu, Murang’a, Kirinyaga, Embu, Tharaka Nithi, Meru, Makueni, Machakos, and Kitui (Figure 1). These counties were selected based on reported papaya production and the presence of the PRSD-related symptoms (Asudi, 2010; Ombwara et al., 2014). A total of 287 leaf samples (200 with PRSD symptoms and 87 symptomless) were collected from randomly selected plants using sterile forceps and then immediately immersed in RNAlater® (InvitrogenTM) solution to prevent the degradation of RNA. The samples were then transported to the Biosciences eastern and central Africa–International Livestock Research Institute (BecA-ILRI) Hub in Nairobi, Kenya, and stored at 4°C until RNA extraction. Forty-eight samples (34 with and 14 without ringspot disease symptoms) were randomly selected for NGS analysis based on the region, and symptoms were observed. In every county, a representative sample with or without symptoms was selected, and in counties where more than one sample was chosen, differences in symptoms exhibited by the plants were considered. The geographic coordinates of the sample collection sites were obtained using a handheld global positioning system (GPS), and the data were converted into GIS using ARCGIS 10.4 (Figure 1).


Figure 1. Counties in Kenya sampled during this study.

RNA Extraction, Library Preparation, and Illumina MiSeq Sequencing

Leaf samples selected for NGS analysis were dried using a clean, absorbent paper towel to remove the RNAlater® and then powdered in liquid nitrogen with sterile mortars and pestles. Total RNA was extracted from 0.1 g of leaf samples using the RNeasy® plant mini Kit (Qiagen Inc.), following the manufacturer’s instructions. The RNA was eluted in 70 μl of RNase-free water, visualized on a 0.8% (w/v) agarose gel electrophoresis at 100 V for 30 min, and its quantity was measured using the ssRNA assay on the Qubit® 2.0 fluorometer (InvitrogenTM) system. The extracted RNA was then stored at −80°C. The cDNA libraries were prepared from 1 μg of the total RNA using the Illumina TruSeq® RNA sample preparation protocol according to the manufacturer’s instructions (Illumina, San Diego, CA, United States). Briefly, poly-A containing mRNA molecules were purified using oligo-dT, and then fragmented into small pieces using the Illumina “Elute, Prime, Fragment Mix.” The fragmented RNA was copied into the first-strand cDNA using reverse transcriptase and random primers, and second-strand cDNA was synthesized using DNA polymerase I and RNase H. The double-stranded cDNA was purified using Agencourt AMPure® XP magnetic beads (Beckman Coulter, Inc., Indianapolis, IN, United States). The end-repair of synthesized cDNA was performed using End Repair mix. Thereafter, 3′ ends were adenylated and unique adaptors for each library ligated to the 5′ and 3′ ends ds cDNA. The dsDNA was enriched through PCR to create the final cDNA library under the following cycling conditions: one cycle of 98°C for 30 s; 15 cycles of 98°C for 10 s; 60°C for 30 s; and 72°C for 30 s, with a final extension of 72°C for 5 min.

The final size and concentration of the cDNA libraries were estimated with the Agilent Tape Station 2200 system (Agilent Technologies, Santa Clara, CA, United States) and Qubit® 2.0 fluorometer (InvitrogenTM), respectively. The cDNA libraries, each with unique adaptor, were normalized to 4 nm and pooled for multiplex sequencing. A pooled library consisted of 24 biological samples, each at equal molar concentration (hence, two flow cells) were used. The libraries were sequenced using a 2 × 300 cycle PE V3 Illumina kit (Illumina, San Diego, CA, United States). Paired-end reads were generated using the Illumina MiSeq System at the BecA-ILRI Hub in Nairobi, Kenya.

RNA Sequence Processing and de novo Assembly

Paired-end reads generated in the Illumina MiSeq System were checked for quality using FastQC. The low-quality reads and sequencing adapters were removed using Trimmomatic V 0.33 (Bolger et al., 2014). The host genome was removed by mapping all the reads to the papaya plant genome (GenBank accession number ABIM01000000) (Ming et al., 2008), using Bowtie2 V 2.2.8 (Langmead and Salzberg, 2013). The remaining reads (unmapped) were then assembled de novo to obtain contigs using metaSPAdes V 3.9.0 (Nurk et al., 2017) with default settings.

Virus Identification and Reference Mapping of the Assembled de novo Contigs

The resulting de novo contigs were compared with other sequences in the National Center for Biotechnology Information (NCBI) GenBank database1 (Benson et al., 2012) and the Plant Virus Genome Database (Camacho et al., 2009) using BLASTn search, and the top hit accession was used for virus identification. For each viral species identified, the most frequent annotated accessions in the NCBI was used as reference for alignment and also for the estimation of sequence similarity. The Krona web-based tool (Ondov et al., 2011) was used to visualize BLAST results.

Reference assemblies were performed for complete virus genome sequences by mapping the de novo sequences against the most similar existing viral genomes using the read mapping module of CLC genomics workbench version 5.5.12. The sequences were assigned as complete genomes based on comparison with the reference sequences used in the mapping process obtained from BLASTn search results. The de novo consensus sequences and consensus sequences from reference mapping were then compared through visual inspection of individual mappings to ensure no artifacts were incorporated as a result of sequencing errors or errors during genome assembly. De novo sequences were, however, chosen over the consensus of reference assembly as a precautionary measure in case the viruses identified had considerably diverged from similar viral genome sequences in the GenBank database.

Validation of Assembled Virus Sequences Through RT-PCR and Sanger Sequencing

The assembled viral sequences were validated through reverse transcription (RT) followed by polymerase chain reaction (PCR) and sent to Macrogen (Europe) for Sanger sequencing. Briefly, viral sequences generated from the Illumima MiSeq were aligned using CLC genomics and consensus sequences used for designing primers using Primer 3. Designed primers were evaluated for specificity using Primer-BLAST (Ye et al., 2012) and tested on the samples in which the viruses had earlier been detected (by NGS). The target viruses were amplified from these samples through RT of total RNA using SuperScriptTM III Reverse Transcriptase (InvitrogenTM) followed by PCR using the designed primer sequences. Viral amplicons were purified using the QIAquick PCR Purification Kit (Qiagen Inc.), according to manufacturer instructions and quantified using the NanoDrop Spectrophotometer (Thermo ScientificTM). The amplicons were sequenced using Macrogen Sanger sequencing. The sequences were trimmed and assembled using the CLC Genomics Workbench version 8.03 with the default settings. The consensus sequences were used for BLASTn search in the NCBI and for comparison with the sequences generated on the Illumina MiSeq System.

Analysis of Virus Sequences Associated With Papaya Ringspot Disease

The obtained viral sequences from Illumina MiSeq system were used to determine sequence identity (%), open reading frames (ORFs), conserved motifs, and phylogeny. Sequence identities were computed using the Sequence Identity and Similarity (SIAS) tool3; the search for ORFs was done using the ORF finder4; and conserved protein domains were identified using NCBI conserved domain search program (Marchler-Bauer et al., 2011), whereas the conserved motifs were identified through comparisons with known viral sequences. The sequences from this study and other previously identified viral sequences retrieved from the GenBank database were used to determine phylogenetic relationships among members of the same genus. Briefly, sequences were imported into the CLC Genomics Workbench, aligned and exported in FASTA format, converted to MEGA format, and used for distance and phylogenetic analysis using MEGA 6 software (Tamura et al., 2013). The phylogenetic trees were constructed using the maximum likelihood method based on the JTT matrix (Jones et al., 1992), using 1000 replicates for bootstrap analysis. The recombination detection program (RDP)4 package (Martin et al., 2015) was used to detect recombination in the nucleotide sequences of the identified viruses using RDP (Martin and Rybicki, 2000), GENECONV (Padidam et al., 1999), Bootscan (Martin et al., 2005), MaxChi (Smith, 1992), Chimaera (Posada and Crandall, 2001), 3Seq (Boni et al., 2007), and SiScan (Gibbs et al., 2000) programs implemented in the package with default parameters.


Illumina MiSeq Sequencing Statistics

To provide an insight into viruses associated with PRSD symptoms in Kenya, 48 leaf samples were sequenced using the Illumina MiSeq platform. A total of 50,247,269 reads of length between 35 and 151 bp were generated from two runs. The raw reads were filtered to remove those of low quality, leaving a total of 47,800,743 reads with read length ranging from 60 to 151 bp. The number of reads per sample ranged from 465,116 in S1 to 1,809,690 in S43. The GC content ranged from 42 to 48%. Viruses were detected in 38 out of the 48 samples sequenced (Supplementary Table S1).

Viruses Detected in Symptomatic and Asymptomatic Papaya Leaf Samples

The reads were assembled into 49 contigs, ranging from 469 bases in S11 (collected from Kiambu County) to 10,292 bases in S4 (collected from Nakuru County) (Table 1). The BLASTn search of the de novo assembled sequences against the NCBI non-redundant database indicated the presence of Moroccan watermelon mosaic virus (MWMV), cowpea mild mottle virus (CpMMV), and cucumber vein-clearing virus (CuVCV). The BLASTn results of 35 de novo assembled sequences from 31 samples shared between 80 and 90% sequence similarities with MWMV genome sequences previously reported in Tunisia, South Africa, Democratic Republic of Congo, and Morocco (GenBank accession numbers: LN810061, EF579955, KU315176 EF211959, and AF305545, respectively). The CuVCV genome sequences were recorded in 11 samples with nucleotide sequence similarities of 72–77% to CuVCV previously reported in Tanzania (GenBank accession number JN591720). Three samples showed sequences closely related to CpMMV and had between 76 and 86% nucleotide sequence similarities to CpMMV sequences reported previously in North America, Brazil, and Ghana (GenBank accession numbers: KC774020, KF554101, and HQ184471, respectively) (Table 1).


Table 1. Viruses identified in papaya and their sequence similarity (%) with closest homologs in the databases.

Cases of single and co-infections of the viruses obtained in this study were also observed in the samples. Single virus infections of MWMV were detected in 26 samples collected from Makueni, Nakuru, Homabay, Taita Taveta, Kiambu, Busia, Kilifi, Murang’a, Kirinyaga, Embu, Meru, and Machakos counties, whereas single CuVCV infection was found in two samples collected from Makueni and Kwale (Table 1). The co-infections of MWMV and CuVCV were the majority and were detected in seven samples collected from Kitui, Embu, Machakos, Tharaka Nithi, Meru, and Makueni counties. The existence of mixed infections of MWMV and CpMMV, CuVCV, and CpMMV, as well as MWMV, CpMMV, and CuVCV were reported in one sample each from Baringo, Kitui, and Meru counties, respectively. Due to the misleading interpretations of partial sequences for virus identity (Jo et al., 2017), we used complete genome sequences of the viruses identified through BLASTn search in subsequent analysis. In the GenBank database, the complete genome of MWMV is about 9.7 kbp and CpMMV is 8.1 kbp, while the CuVCV genome sequence is partial (5218 bp). However, we were able to designate sequences of CuVCV as complete or near-complete genomes based on the information of genome features in the species Carlavirus in the database. Out of the 22 counties surveyed, 22 complete virus genome sequences detected in 11 counties—namely, Homabay, Kisumu, Busia, Kiambu, Meru, Embu, Kirinyaga, Murang’a, Machakos, Makueni, and Nakuru—were closely related to MWMV, the most prevalent virus in the samples analyzed (Table 1). The CuVCV was the second most prevalent, and eight complete viral genome sequences were detected in samples collected from Makueni, Kwale, Tharaka Nithi, Meru, Kitui, and Machakos counties. Only one complete genome of CpMMV with 8196 bases was detected in a sample collected from Kitui County (Table 1).

The Relationship Between Virus Detection and Disease

The symptoms observed on infected leaves varied significantly and ranged from mottling, mosaic, shoe–stringing, curling, and puckering. Similarly, on fruits, symptoms such as concentric water-soaked lesions, circular ringspots, and necrotic rings were observed. On the upper part of the stem and leaf petioles, numerous water-soaked lesions were seen (Table 1 and Figure 2). There was a clear association between virus presence and symptoms in papaya plants. Hence, the majority of the symptomatic plants (32 out of 34) tested positive for viral infection. There were, however, cases in which plants were asymptomatic but viruses were detected through NGS (six out of 14) (Table 1). More interestingly, in cases where fruits exhibited concentric water-soaked lesions, circular ringspots, and necrotic rings, single infections with MWMV were detected through NGS, while in occurrences of single infection of CuVCV, mottling was observed. Co-infections of MWMV and CuVCV were found in plants that were either asymptomatic or in those exhibiting mottling, mosaic, vein clearing, or puckering symptoms. Co-infection of CuVCV with CpMMV was detected by NGS in plants exhibiting mottling symptoms. To exclude the presence of PRSV and to confirm that MWMV was associated with ringspots, the spots were excised from the infected fruits and the viruses concentrated using protocol described by Blomström et al. (2010). RNA was extracted using TRIzol LS Reagent (Invitrogen) and further purified using RNeasy Mini Kit columns (Qiagen). The RNA was sequenced using Illumina Miseq system and a de novo assembled 9700 bp sequence was obtained that shared an 80% sequence similarity with MWMV sequences in GenBank. This strongly suggested that MWMV is associated with the symptoms and is the putative cause of the ringspots on papaya fruits in Kenya.


Figure 2. Diversity in PRSD symptoms observed during the field survey. (A) Puckering. (B) Mottling. (C) Mosaic. (D) Leaf deformation. (E) Oily streaked petioles. (F) Oily streaked stem. (G) Necrotic rings. (H,I) Circular rings. (J) Concentric water soaked lesions. (K) Asymptomatic fruits. (L) Asymptomatic leaves.

RT-PCR and Sanger Sequencing Validation of Viruses

Sanger sequencing of the viral amplicons obtained from RT-PCR yielded sequences that were 100% identical to those generated de novo from assembled Illumina sequences, confirming that the de novo assembly gave accurate sequences of the viruses’ genomes. Primers designed, their sequences, the target region in the genome, and the expected size of the virus amplicon are shown in Table 2. To further authenticate the association of MWMV with ringspot symptoms on fruits, extracts from ringed spots were tested using RT-PCR approach in few samples among the 48 sequenced, indicating a strong association of the virus with ringspots on the fruits (Supplementary Figure S1).


Table 2. Primers, target region of virus genome, and expected amplicon size.

Genome Organization of MWMV in This Study and Determination of Its Phylogenetic Affinities

The obtained viral genome sequences of MWMV were deposited in GenBank under accession numbers MH595736 through MH595746. The genomes are 9712–9725 nucleotides (nt) long organized into 142–155 non-coding nt at their 5′ terminus, followed by 9375 nt encoding the polyprotein, from which all the proteins of the virus are derived, and 194–197 non-coding nt at 3′ terminus. The polyprotein codes for 3124 amino acids (aa) with a molecular weight of between 353.9 and 354.5 kDa. The base composition includes 31.4–31.6% adenine, 18.8–19.0% cytosine, 23.3–23.4% uracil, and 26.1–26.4% guanine.

The genomes are single-stranded positive-sense RNA virus with a single ORF that is translated into a single large polyprotein. The polyprotein has nine putative cleavage sites, yielding 10 functional proteins. The length (nt and aa) and organization of the 10 proteins are as follows: P1 (1035/232), helper component protease (Hc-Pro) (1371/457), P3 (1041/347), 6K1 (156/52), cylindrical inclusion (CI) (1905/635), 6K2 (171/57), VPg (570/190), nuclear inclusion a (NIa) (717/239), nuclear inclusion b (NIb) (1550/517), and coat protein (CP) (855/285).

Several conserved motifs found in potyviruses were identified in the MWMV genomes detected in this study. In the HC-Pro gene there are highly conserved “RITC,” “CSC,” and “PTR” motifs, which are associated with virus transmission (Huet et al., 1994; Blanc et al., 1998) and “FRNK(X)12CDN” that is involved in symptom development (Gal-On, 2000). The RNA helicase function motifs “GAVGSGKST” and “PTR” were found to be conserved in the N-terminal region of the CI. Three RdRp motifs “YCDADGS,” “GNNSGQPSTVVDNTLMV,” and “NGDDL” responsible for potyviral genome replication (Hong and Hunt, 1996) were present in NIb. The well-characterized DAG motif, highly conserved among all aphid transmissible potyviruses (Atreya et al., 1995) was found in the N-terminus of the CP. A stretch of glutamic acid and lysine repeats (KE repeats) was found after the “DAG” motif in the N-terminus of the CP in all MWMV signatures in this study.

Potyviruses are usually classified based on the percentage of sequence identity in the polyprotein or CP. Viruses sharing more than 75% nt and 80% aa sequence identity in the CP or polyprotein are considered the same species (Adams et al., 2005). The MWMV viruses from this study share 98.1–98.6% aa (97.5–98.5% nt) sequence identity among themselves in the polyprotein region. They also share 89.8–90.0% aa (79.2–79.6% nt) with isolate from Tunisia (GenBank accession number EF579955), and 90.3–90.5% aa (79.6–79.7% nt) with isolate from South Africa (GenBank accession KU315176) in the polyprotein. Furthermore, sequence identity of 93.7–95.1% aa (84.3–84.9% nt) and 94.1–95.1% aa (83.9–85.3% nt) with Tunisian and South African MWMV isolates in the CP was observed, respectively (Table 3).


Table 3. Nucleotide and amino acid sequence identities (%) between Kenyan MWMV isolates and those from Tunisia and South Africa.


Table 4. Viruses used in phylogenetic analysis, host plants, country of origin, and GenBank accession numbers.

Phylogenetic analysis built using the complete polyprotein aa sequences showed that all the MWMV isolates detected in this study formed a single cluster separate from South Africa, Tunisia, and Greece sequences. A clear geographical clustering was also observed. Isolates from South Africa clustered separately from the Mediterranean ones. The PRSV that was previously believed to be responsible for the ringspot symptoms showed a distinct evolutionary pathway from MWMV as depicted in the phylogenetic tree (Figure 3).


Figure 3. Phylogenetic relationships among MWMV isolates and closely related potyviruses. The phylogenetic tree was generated using the maximum likelihood method based on JTT matrix-based model. The sequence generated in this study is shown by black circle. The viruses, host plant, country of origin, and GenBank accession numbers used in phylogenetic tree are shown in Table 4.

Cowpea Mild Mottle Virus: Identification and Phylogeny

The complete genome sequence of the CpMMV identified in this study is 8151 nt long, excluding the poly-A tail (GenBank accession number MK984605). The genome is single-stranded, positive-sense with six ORFs. The ORF1 encodes a RdRp gene consisting of 1859 aa with an estimated molecular weight of 211.3 kDa and four conserved motifs, including viral methyltransferase (Rozanov et al., 1992), RdRp (Koonin, 1991), Carlavirus endopeptidase (Peptidase C23) (Lawrence et al., 1995), and viral (superfamily 1) RNA helicase (Viral_helicase1) (Gorbalenya and Koonin, 1989). The ORFs 2, 3, and 4 encode the triple gene block proteins (TGB1-3p, with molecular weights of 25.8, 11.6, and 7.6 kDa, respectively), that are essential for virus movement. The ORF 5 encodes the CP comprising of 288 aa with a molecular weight of 32 kDa and contains a strong conserved motif “His-X8Asp-X15Thr-Gly-Gly” at aa position 246–273 in the C-terminal region of the CP (Naidu et al., 1998). The ORF 6 encodes a cysteine-rich protein (CRP) with nucleic acid-binding protein (NaBP) consisting of 109 aa with a molecular mass of 12.3 kDa.

The CP sequence comparison of CpMMV isolate from this study with sequences in the database, indicated that the Kenyan CpMMV shares 84.7, 84, and 82.6% aa sequence identities with Brazilian (GenBank accession number AGS13100), Ghanaian (GenBank accession number YP-004035878), and Indian (GenBank accession number ATV94962) isolates, respectively. However, in the RdRp gene, the Kenyan CpMMV isolate shares 90.7, 88.6, and 72% aa sequence identities with the Indian, Brazilian and Ghanaian isolates, respectively. CpMMV sequences in this study clustered together with those of several other CpMMV isolates from the GenBank with a strong bootstrap support of 100% based on either CP or RdRp gene (Figures 4, 5).


Figure 4. Phylogenetic analysis of coat protein amino acid sequences among CpMMV, PaMMV, PaMV, and closely related Carlavirus generated using maximum likelihood method based on JTT matrix-based model. The sequences generated in this study are shown by black circles. The viruses, host plant, country of origin, and GenBank accession numbers used in phylogenetic tree are shown in Table 4.


Figure 5. Phylogenetic analysis of RdRp amino acid sequences among CpMMV, PaMMV, PaMV, and closely related Carlavirus generated using maximum likelihood method based on JTT matrix-based model. The sequences in this study are shown with black circles. The viruses, host plant, country of origin, and GenBank accession numbers used in phylogenetic tree are shown in Table 4.

Analysis of the ORF1 nucleotide sequences using seven different algorithms showed evidence of recombination. A recombination was detected with major parent being a Ghanaian isolate (YP-004035878) at positions 1–34 and 5330–5650; and the minor parent being a Brazilian isolate (AGS13100) at positions 35–5329. The recombination was detected by four programs; MaxChi Chimaera, Siscan, and 3Seq with a P-value of 1.893 E-06. No recombination was detected in the other genes.

Molecular Characterization of Putative Carlaviruses Identified Papaya

The genome sequences of the putative Carlavirus (PaMV and PaMMV) are linear, single-stranded positive-sense RNA viruses with a poly-A tail consisting of six ORFs, encoding the following proteins; RdRp, movement proteins (i.e., triple gene block, CP, and CRP with NaBP with arrangement typical of the genus Carlavirus) (Martelli et al., 2007; Adams et al., 2012). A BLAST search using individual ORF sequences revealed that these sequences belong to the genus Carlavirus in the family Betaflexiviridae. Functional protein domains known to be conserved in Carlaviruses were also detected in isolates from this study. In the ORF1, there are viral RNA methyltransferase (Vmethyltransf), RdRp_2 Superfamily, Carlavirus endopeptidase (Peptidase_C23), and Viral (Superfamily 1) RNA helicase (Viral_helicase1). The RdRp domain also contains the characteristic core motif SGX3TX2NT22GDD found in Carlaviruses (Martelli et al., 2007; Adams et al., 2012), while the CP has the conserved CP domain of carlviruses Flexi_CP_N and Flexi_CP.

These viruses were found in a mixed infection with MWMV or CpMMV. The association of symptoms to them hence becomes problematic. However, in cases where these viruses were identified as a single infection through NGS, severe and mild mottling symptoms were observed. Accordingly, we have proposed naming of these viruses papaya mottle-associated virus (PaMV) and papaya mild mottle-associated virus (PaMMV).

The PaMV detected in this study is 9061–9071 nt long, excluding the poly-A tail (GenBank accession numbers MK984599, MK984600, MK984601, MK984603, and MK984604) and were obtained from samples collected from Machakos, Meru, Tharaka Nithi, and Makueni counties. The size of the ORFs is as follows: ORF1 1558 aa (partial) for MK984600 and 2162 aa (complete) (175.1–248 kDa), ORF2 228 aa (25.3 kDa), ORF3 108 aa (11.6 kDa), ORF4 63 aa (6.9 kDa) ORF5 276 aa (30.8 kDa), and ORF 6 102 aa (11.6 kDa). The PaMMV, on the other hand, was identified in three samples collected from Makeuni, Kwale, and Kitui counties (GenBank accession numbers MK984597, MK984598, and MK984602). The genomes are 9028-, 9023-, and 9070-nt long, respectively (excluding the poly-A tail). The ORF1 comprises of 2154 aa (245 kDa); ORF2 228 aa (25.3 kDa); ORF3 108 aa (11.6 kDa); ORF4 75 aa (8.1 kDa); ORF5 288 aa (32.1 kDa); and ORF6 103 aa (11.9 kDa).

A BLASTn search using PaMV and PaMMV genomes as query sequences in the GenBank always returned CuVCV as the most similar sequence. However, close analysis indicated that there were differences in the size of ORF1, ORF4, ORF5, and ORF6 between the two sets of viruses. The ORF1 in PaMV comprises 2162 aa, whereas PaMMV has 2154 aa. The ORF4 consists of 63 and 75 aa; the ORF5 276 and 288 aa; and the ORF6 102 and 103 aa for PaMV and PaMMV, respectively. There were insertions or deletions (indels) of aa sequences observed on these ORFs when sequence alignment was performed, contributing to differences in the sizes of these ORFs. These indels are unlikely to be due to sequencing error because they were also found in other published Carlaviruses. Furthermore, these variations were mapped to a common area in the samples analyzed (as was the case on the 5′ end of the ORFs).

Based on the species demarcation criterion of 72% nt and 80% aa similarity in the CP or RdRp among Carlviruses (Adams et al., 2012), it is clear that the two viruses could be considered distinct species within the genus Carlavirus. The percentage of sequence identity indicated that the isolates in this study shared 75.7–78.1% aa and 63.6–67.6% nt sequence identities in the CP gene with CuVCV isolate from Tanzania (GenBank accession number AEP83730) (Table 5), values below the threshold for species discrimination in Carlviruses. Additionally, PaMV and PaMMV shared less than 80% aa and 75% nt sequence identities in the CP, thereby qualifying them to be different species of the same Carlavirus genus.


Table 5. Percentage sequence identities in the coat protein of PaMV and PaMMV Carlaviruses with the closest homolog CuVCV from Tanzania (AEP83730).

The phylogenetic trees generated using the CP and the RNA-dependent RNA polymerase gene (Figures 4, 5, respectively) support the proposed species classification within the genus Carlavirus. The PaMV isolates formed a monophyletic group, whereas PaMMV isolates clustered together, closer to the CuVCV isolate from Tanzania (AEP83730).

A recombination event was detected in PaMV isolate MK984599, collected from Tharaka Nithi county—with the major parent being MK984603 from Machakos County (at positions 1–1169 and 2280–6806) and the minor parent being MK984601 from Meru County (at positions 1170–2279) (Figure 5). The recombination was supported by four programs: MaxChi, Chimaera, Siscan, and 3Seq, with a P-value of 4.718 E-06. No recombination was detected in the coat protein gene.


Through Illumina MiSeq sequencing, we characterized complete genome sequences of MWMV, a Potyvirus; CpMMV, a Carlavirus, and two novel yet divergent Carlaviruses (namely, PaMV and PAMMV) in symptomatic and asymptomatic papaya leaves collected from Kenyan fields. This study provides the first report of these viruses in papaya in Kenya. We also report for the first time the infection of papaya with CpMMV, PaMMV, and PaMV. The presence of MWMV in papaya crops in Kenya would suggest that either the virus is increasing in its geographical distribution or that it has been present in papaya and/or in other host plants but has previously gone undetected. The study also suggests the emergence of new viruses (CpMMV, PaMMV, and PaMV)—or that the viruses have been present but have recently moved to papaya from other hosts—that are now posing a serious threat to papaya production in the country. The sequencing strategy used in this study targeted viruses with poly-A tail, as per the TruSeq RNA Illumina protocol used. The possibility of new or additional viruses falling outside of this detection approach possibly infecting papaya crop cannot be ruled out. Additional viral metagenomics studies could help in elucidating the complete diversity of viruses that are infecting papaya in Kenya.

The characterization of a plant virus disease with a known etiology usually relies on the symptoms expressed in the host plants because they are easy to recognize, especially if they are disease-specific. Symptoms also aid in roguing of diseased plants as a strategy for preventing virus spread (Naidu and Hughes, 2003). Potyviruses have limited host ranges and can be identified based on the characteristic symptoms they produce in certain host plants (Shukla and Ward, 1989). In this study, the symptoms observed on papaya plants included those that are attributed to PRSV infection (Tripathi et al., 2008; Zhao et al., 2016), although PRSV was not detected in our samples. This explains why earlier attempts to detect PRSV in diseased plants through ELISA and RT-PCR procedures using primers specific to PRSV (Ombwara et al., 2014) were unsuccessful. Failure to detect PRSV in papaya plants exhibiting the above symptoms shows the limitations of using symptoms for disease diagnosis (Candresse et al., 2014). Although we did not complete Koch’s postulates for the identified viruses, results from NGS and RT-PCR established a strong association between MWMV and the ringspot symptoms we observed in Kenyan papaya.

Establishing associations between a specific viral infection and the symptoms expressed in host plants can be further complicated by mixed viral infections (Marais et al., 2015). For instance, in the co-infection of papaya with MWMV and PaMV, we were not able to associate specific symptoms to either virus. However, Illumina sequencing of RNA extracted from the ringed spots of fruits strongly suggested MWMV to be associated with the ringspots on Kenyan papaya. This virus has also been reported in papaya plants exhibiting ringspots in Congo (Arocha et al., 2008). However, papaya plants showing ringspot symptoms could also be infected with other viruses such as PaMV, PaMMV, or CpMMV, as observed in this study. The occurrence of mixed viral infections in papaya has also been reported in Mexico (Noa-Carrazana et al., 2006). In many samples sequenced in this study, MWMV was found with other viruses, suggesting that co-infections of these viruses in papaya plants is not an uncommon phenomenon. The co-infection of MWMV with CuVCV has also been reported in watermelon in Tanzania (Menzel et al., 2011). Plant viruses co-infecting the same host may generally interact in either a synergistic or antagonistic way (Syller, 2012). Whether this is the case for viruses present in the papaya in Kenya remains to be determined.

The clear association between virus presence and symptoms expression in papaya plants was clearly observed with the majority of the symptomatic plants testing positive for virus(es) infection. Some of our samples, despite not having clear visual viral symptoms showed the presence of virus(es) when sequenced on the Illumina MiSeq platform. Most of these viral sequences from asymptomatic leaves were partial except in one instance in Tharaka Nithi County, where we recovered complete viral genome of PaMV. Several factors may have contributed to the absence of symptoms in these infected samples, including papaya variety or cultivar, plant age and number of days post-infection at the time samples were collected (Singh and Shukla, 2011) and virus titers (Ghoshal and Sanfaçon, 2015). Further, masking of symptoms occurs in virus infected papaya plants depending on the environmental conditions during the season (Kabir et al., 2017). Thorough screening of asymptomatic plants is therefore paramount for better disease management.

Since the discovery of MWMV in papaya in Congo a decade ago (Arocha et al., 2008), the virus has not been reported again in papaya. This work represents the second report of MWMV in papaya worldwide. MWMV has been reported in Africa and the Mediterranean region mostly in cucurbits such as Cucurbita pepo (Yakoubi et al., 2008; Ibaba et al., 2016; Kidanemariam et al., 2019), Cucumis melo (Lecoq et al., 2001), Lagenaria bleviflora, and Adenopus breviflorus (Owolabi et al., 2012; Mofunanya and Edu, 2015). Our findings would suggest that papaya is an additional natural host for MWMV and that there could be more wild or cultivated hosts requiring further determination.

Phylogenetic analysis of MWMV revealed a clear geographical clustering pattern showing the Kenyan isolates on one clade and the South African and Mediterranean ones in separate groups. Similar geographical grouping of MWMV isolates was also reported in Tunisia (Yakoubi et al., 2008). If this clustering based on host or geographic origin remains to be determined. Nevertheless, all MWMV isolates in this study show high sequence identity values, despite their different counties of origin suggesting a recent introduction in the country.

Recombination and mutation events are major forces attributed to evolution in plant viruses and are associated to adaptation to new hosts, often leading to emergence of new variants and resistance breaking strains (Ohshima et al., 2002; García-Arenal et al., 2003; Nagy, 2008; Desbiez et al., 2011; Kwak et al., 2015; Xie et al., 2016). BLASTn search of PaMV and PaMMV sequences showed CuVCV to be their closest species genetically. Similarly, these viruses seem to have a common ancestor (from the phylogenetic analysis). Moreover, recombination events were detected in PaMV sequences within the RdRp genes although these recombination events did not change the phylogenetic groupings of the isolates. Further comparison between PaMV and PaMMV ORFs showed several indels. From the results, it is likely that these two viruses evolved from a common ancestor. However, a detailed analysis of these viruses from different hosts and locations will be critical in elucidating their evolutionary paths and for determining if these events have any biological significance such as host range and virulence. The CpMMV under this study is a recombinant between Ghanaian (YP_004035878) and Brazilian (AGS13100) isolates. In the phylogenetic tree, it clustered away from both parents. This result points to the likelihood that CpMMV from Kenya is a separate strain from its parents, which could have been caused by recombination event.

The detection of single or co-infections of potyviruses and carlaviruses associated with PRSD in the papaya fruit crop in Kenya in both symptomatic and asymptomatic samples are causes for concern. These viruses cause symptoms resembling other viral diseases and could escape routine detection, thus resulting in a considerable reduction in fruit yield and quality. The inability to recognize a symptomless plant harboring a virus could also result in inadvertent exposure of other crops in the country to a potential inoculum source. Although the insect vectors transmitting these viruses and the mode of transmission in papaya are yet to be established, the MWMV in papaya is probably transmitted by aphids, while CpMMV could be vectored by whiteflies (Naidu et al., 1998). Because of the close relationship between PaMMV and PaMV with the white-fly transmitted cucumber vein-clearing virus, there is a likelihood that they are also transmitted by whiteflies (Menzel et al., 2011). Papaya in Kenya is propagated by seeds and the possibility that any of these viruses is seed transmitted cannot be ruled out. However, further studies are needed to identify specific insect vectors and examine the likelihood of virus transmission to papaya in Kenya and their likely wild hosts of these viruses.

As the viruses identified continue to impact negatively on the livelihoods of many farming householders, there is an urgent need to develop an integrated management strategy for the different virus diseases. Current management practices include the use of chemicals to control the insect vectors and to reduce their populations and limit the spread of viruses, and also by rouging diseased plants which act as sources of viral inoculum. However, these measures are mostly not effective. A larger percentage of farmers also do not employ any control measures (Mumo et al., 2019, unpublished data). Papaya cultivars resistant to the pathogens or less attractive to the insect vectors are also not available to the poor farmers. Therefore, promoting the use of virus-free seeds and seedlings and implementing quarantine measures could help prevent the spread of these viruses to areas that are currently virus-free.

Further, virus-specific primers developed in the current study will help to regularly monitor both symptomatic and asymptomatic plants where necessary and discover new infections. These will help prevent future spread of the viruses as well as developing ways of combating and reducing their effects on papaya crops. However, other management and control options, such as identification of tolerant germplasm as well as alternative hosts, need to be explored.


Two viruses, namely, MWMV and CpMMV, in addition to two newly discovered viruses infecting papaya, tentatively named PaMV and PaMMV, have been discovered in this study and are associated with PRSD in Kenya. Given the rate at which papaya planting materials are exchanged between farmers in different counties in Kenya (Asudi, 2010), it is likely that these viruses, although currently restricted in specific counties, could quickly spread to other papaya-growing areas in the region. Additional and extensive surveys should be carried out to determine the prevalence and distribution of these viruses in adjacent areas to develop and establish control and management measures to prevent further spread. However, further studies are needed for complete classification of PaMV and PaMMV and to understand the risk they pose to papaya and other crops in the region.

Data Availability Statement

RNAseq raw data are available at NCBI, Bioproject ID: PRJNA595878. Complete genomes were deposited to NCBI under the following GenBank Accession Numbers: MWMV (MH595736-MH595746), CpMMV (MK984605), PaMV (MK984599, MK984600, MK984601, MK984603, and MK984604), and PaMMV (MK984597, MK984598, and MK984602).

Author Contributions

NM, GM, EA, FR, GA, LB, RP, and FS conceived and designed the experiments. NM, EM, RP, and FS performed the experiments. NM, JN, and FS analyzed the data. NM, GM, EA, FR, GA, LB, EM, JN, RP, and FS wrote the manuscript. All authors read and approved the final version of the manuscript.


This work was supported by Africa Biosciences Challenge Fund (ABCF) program through the BecA-ILRI Hub. The ABCF Program is funded by the Australian Department for Foreign Affairs and Trade (DFAT) through the BecA-CSIRO partnership; the Syngenta Foundation for Sustainable Agriculture (SFSA); the Bill and Melinda Gates Foundation (BMGF); the UK Department for International Development (DFID); and the Swedish International Development Cooperation Agency (SIDA).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


The authors thank the papaya farmers in the 22 Kenyan counties for their permission to sample.

Supplementary Material

The Supplementary Material for this article can be found online at:


  1. ^
  2. ^
  3. ^
  4. ^


Adams, M. J., Antoniw, J. F., and Fauquet, C. M. (2005). Molecular criteria for genus and species discrimination within the family Potyviridae. Arch. Virol. 150, 459–479. doi: 10.1007/s00705-004-0440-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Adams, M. J., Candresse, T., Hammond, J., Kreuze, J. F., Martelli, G. P., Namba, S., et al. (2012). “Family Betaflexiviridae,” in Virus Taxonomy, Ninth Report of the International Committee on Taxonomy of Viruses, eds A. M. Q. King, M. J. Adams, E. B. Carstens, and E. J. Lefkowitz (San Diego, CA: Academic Press), 920–941. doi: 10.1016/b978-0-12-384684-6.00078-1

CrossRef Full Text | Google Scholar

Akinyemi, I. A., Wang, F., Zhou, B., Qi, S., and Wu, Q. (2016). Ecogenomic survey of plant viruses infecting Tobacco by Next generation sequencing. Virol. J. 13, 1–12. doi: 10.1186/s12985-016-0639-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Arocha, Y., Vigheri, N., Nkoy-Florent, B., Bakwanamaha, K., Bolomphety, B., Kasongo, M., et al. (2008). First report of the identification of Moroccan watermelon mosaic virus in papaya in Democratic Republic of Congo. Plant Pathol. 57:387. doi: 10.1111/j.1365-3059.2007.01658.x

CrossRef Full Text | Google Scholar

Asudi, G. O. (2010). Collection, Morphological and Molecular Characterization of Papaya. Ph.D. thesis, Jomo Kenyatta University of Agriculture and Technology, Juja.

Google Scholar

Atreya, P. L., Lopez-Moya, J. J., Chu, M., Atreya, C. D., and Pirone, T. P. (1995). Mutational analysis of the coat protein N-terminal amino acids involved in Potyvirus transmission by aphids. J. Gen. Virol. 76, 265–270. doi: 10.1099/0022-1317-76-2-265

PubMed Abstract | CrossRef Full Text | Google Scholar

Benson, D. A., Karsch-Mizrachi, I., Clark, K., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J., et al. (2012). GenBank. Nucleic Acids Res. 40, 48–53. doi: 10.1093/nar/gkr1202

PubMed Abstract | CrossRef Full Text | Google Scholar

Blanc, S., Ammar, E. D., Garcia-Lampasona, S., Dolja, V. V., Llave, C., Baker, J., et al. (1998). Mutations in the Potyvirus helper component protein: effects on interactions with virions and aphid stylets. J. Gen. Virol. 79, 3119–3122. doi: 10.1099/0022-1317-79-12-3119

PubMed Abstract | CrossRef Full Text | Google Scholar

Blawid, R., Silva, J. M. F., and Nagata, T. (2017). Discovering and sequencing new plant viral genomes by next-generation sequencing: description of a practical pipeline. Ann. Appl. Biol. 170, 301–314. doi: 10.1111/aab.12345

CrossRef Full Text | Google Scholar

Blomström, A. L., Widén, F., Hammer, A. S., Belák, S., and Berg, M. (2010). Detection of a novel astrovirus in brain tissue of mink suffering from shaking mink syndrome by use of viral metagenomics. J. Clin. Microbiol. 48, 4392–4396. doi: 10.1128/JCM.01040-10

PubMed Abstract | CrossRef Full Text | Google Scholar

Bolger, A. M., Lohse, M., and Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. doi: 10.1093/bioinformatics/btu170

PubMed Abstract | CrossRef Full Text | Google Scholar

Boni, M. F., Posada, D., and Feldman, M. W. (2007). An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 176, 1035–1047. doi: 10.1534/genetics.106.068874

PubMed Abstract | CrossRef Full Text | Google Scholar

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., et al. (2009). BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421

PubMed Abstract | CrossRef Full Text | Google Scholar

Candresse, T., Filloux, D., Muhire, B., Julian, C., Galzi, S., Fort, G., et al. (2014). Appearances can be deceptive: revealing a hidden viral infection with deep sequencing in a plant quarantine context. PLoS One 9:e102945. doi: 10.1371/journal.pone.0102945

PubMed Abstract | CrossRef Full Text | Google Scholar

Desbiez, C., Joannon, B., Wipf-Scheibel, C., Chandeysson, C., and Lecoq, H. (2011). Recombination in natural populations of watermelon mosaic virus: new agronomic threat or damp squib? J. Gen. Virol. 92, 1939–1948. doi: 10.1099/vir.0.031401-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Diallo, H. A., Monger, W., Kouassi, N. K., Yoro, T. D., and Jones, P. (2008). Infecting papaya in Ivory Coast. Plant Viruses 2, 42–51. doi: 10.1094/PDIS-06-10-0416

PubMed Abstract | CrossRef Full Text | Google Scholar

Edward, A. E., and Ballen, F. H. (2015). An Overview of Global Papaya Production, Trade, and Consumption. Washington, DC: U.S. Department of Agriculture, The Institute of Food and Agricultural Sciences (IFAS).

Google Scholar

Gal-On, A. (2000). A point mutation in the FRNK motif of the Potyvirus helper component-protease gene alters symptom expression in cucurbits and elicits protection against the severe homologous virus. Phytopathology 90, 467–473. doi: 10.1094/PHYTO.2000.90.5.467

PubMed Abstract | CrossRef Full Text | Google Scholar

García-Arenal, F., Fraile, A., and Malpica, J. M. (2003). Variation and evolution of plant virus populations. Int. Microbiol. 6, 225–232. doi: 10.1007/s10123-003-0142-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Ghoshal, B., and Sanfaçon, H. (2015). Symptom recovery in virus-infected plants: revisiting the role of RNA silencing mechanisms. Virology 47, 167–179. doi: 10.1016/j.virol.2015.01.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Gibbs, M. J., Armstrong, J. S., and Gibbs, A. J. (2000). Sister-scanning: a Monte Carlo procedure for assessing signals in rebombinant sequences. Bioinformatics 16, 573–582. doi: 10.1093/bioinformatics/16.7.573

PubMed Abstract | CrossRef Full Text | Google Scholar

Gonsalves, D. (1998). Control of Papaya ringspot virus in papaya: a case study. Annu. Rev. Phytopathol. 36, 415–437. doi: 10.1146/annurev.phyto.36.1.415

PubMed Abstract | CrossRef Full Text | Google Scholar

Gorbalenya, A. E., and Koonin, E. V. (1989). Viral proteins containing the purine NTP-binding sequence pattern. Nucleic Acids Res. 17, 8413–8438. doi: 10.1093/nar/17.21.8413

PubMed Abstract | CrossRef Full Text | Google Scholar

HCDA. (2016). Horticulture Validated Report. Nairobi Kenya.

Google Scholar

Hema, M. V., and Prasad, D. T. (2004). Comparison of the coat protein of a South Indian strain of PRSV with other strains from different geographical locations. J. Plant Pathol. 86, 31–38.

Google Scholar

Hong, Y., and Hunt, A. G. (1996). RNA polymerase activity catalyzed by a Potyvirus-encoded RNA-dependent RNA polymerase. Virology 226, 146–151. doi: 10.1006/viro.1996.0639

PubMed Abstract | CrossRef Full Text | Google Scholar

Huet, H., Gal-On, A., Meir, E., Lecoq, H., and Raccah, B. (1994). Mutations in the helper component protease gene of zucchini yellow mosaic virus affect its ability to mediate aphid transmissibility. J. Gen. Virol. 75, 1407–1414. doi: 10.1099/0022-1317-75-6-1407

PubMed Abstract | CrossRef Full Text | Google Scholar

Ibaba, J. D., Laing, M. D., and Gubba, A. (2016). Genome sequence analysis of two South African isolates of Moroccan watermelon mosaic virus infecting cucurbits. Virus Genes 52, 896–899. doi: 10.1007/s11262-016-1372-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Jain, R. K., Sharma, J., Sivakumar, A. S., Sharma, P. K., Byadgi, A. S., Verma, A. K., et al. (2004). Variability in the coat protein gene of Papaya ringspot virus isolates from multiple locations in India. Arch. Virol. 149, 2435–2442. doi: 10.1007/s00705-004-0392-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Jo, Y., Choi, H., Kim, S. M., Lee, B. C., and Cho, W. K. (2017). The pepper virome: natural co-infection of diverse viruses and their quasispecies. BMC Genomics 18:453. doi: 10.1186/s12864-017-3838-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Jones, D. T., Taylor, W. R., and Thornton, J. M. (1992). The rapid generation of mutation data matricies from protein sequences. Comput. Appl. Biosci. 8, 275–282. doi: 10.1093/bioinformatics/8.3.275

PubMed Abstract | CrossRef Full Text | Google Scholar

Kabir, M. R., Begum, F., Faruq, A. N., Lee, J., and Tonu, N. N. (2017). Field survey on Papaya viral disease in major papaya growing districts in Bangladesh. Int. J. Res. Sci. Manag. 4, 57–66. doi: 10.3153/jfscom.2008021

CrossRef Full Text | Google Scholar

Kidanemariam, D. B., Sukal, A. C., Abraham, A. D., Juguna, J. N., Stomeo, F., Dale, J. L., et al. (2019). Molecular characterisation of a putative new Polerovirus infecting pumpkin (Cucurbita pepo) in Kenya. Arch. Virol. 164, 1717–1721. doi: 10.1007/s00705-019-04219-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Koonin, E. V. (1991). The phylogeny of RNA-dependent RNA polymerases of positive-strand RNA viruses. J. Gen. Virol. 72, 2197–2206. doi: 10.1099/0022-1317-72-9-2197

PubMed Abstract | CrossRef Full Text | Google Scholar

Kwak, H. R., Kim, J., Kim, M. K., Seo, J. K., Jung, M. N., and Kim, J. S. (2015). Molecular characterization of five Potyviruses infecting Korean sweet potatoes based on analyses of complete genome sequences. Plant Pathol. J. 31, 388–401. doi: 10.5423/PPJ.OA.04.2015.0072

PubMed Abstract | CrossRef Full Text | Google Scholar

Langmead, B., and Salzberg, S. L. (2013). Bowtie2. Nat. Methods 9, 357–359. doi: 10.1038/nmeth.1923.Fast

CrossRef Full Text | Google Scholar

Lawrence, D. M., Rozanov, M. N., and Hillman, B. I. (1995). Autocatalytic processing of the 223-kDa protein of blueberry scorch Carlavirus by a papain-like proteinase. Virology 207, 127–135. doi: 10.1006/viro.1995.1058

PubMed Abstract | CrossRef Full Text | Google Scholar

Lecoq, H., Végétale, S. D. P., Cedex, M., Wipf-Scheibel, C., Delécolle, B., Lanina, T., et al. (2001). Biological and molecular characterization of Moroccan watermelon mosaic virus and a Potyvirus isolate from Eastern Sudan. Plant Dis. 85, 547–552. doi: 10.1094/PDIS.2001.85.5.547

PubMed Abstract | CrossRef Full Text | Google Scholar

Marais, A., Faure, C., Mustafayev, E., and Candresse, T. (2015). Characterization of new isolates of Apricot vein clearing-associated virus and of a new Prunus-infecting virus: evidence for recombination as a driving force in betaflexiviridae evolution. PLoS One 10:e0129469. doi: 10.1371/journal.pone.0129469

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchler-Bauer, A., Lu, S., Anderson, J. B., Chitsaz, F., Derbyshire, M. K., DeWeese-Scott, C., et al. (2011). CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 39, 225–229. doi: 10.1093/nar/gkq1189

PubMed Abstract | CrossRef Full Text | Google Scholar

Martelli, G. P., Adams, M. J., Kreuze, J. F., and Dolja, V. V. (2007). Family flexiviridae: a case study in Virion and genome plasticity. Annu. Rev. Phytopathol. 45, 73–100. doi: 10.1146/annurev.phyto.45.062806.094401

PubMed Abstract | CrossRef Full Text | Google Scholar

Martin, D., and Rybicki, E. (2000). RDP: detection of recombination amongst aligned sequences. Bioinformatics 16, 562–563. doi: 10.1093/bioinformatics/16.6.562

PubMed Abstract | CrossRef Full Text | Google Scholar

Martin, D. P., Murrell, B., Golden, M., Khoosal, A., and Muhire, B. (2015). RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 1, 1–5. doi: 10.1093/ve/vev003

PubMed Abstract | CrossRef Full Text | Google Scholar

Martin, D. P., Posada, D., Crandall, K. A., and Williamson, C. (2005). A modified bootscan algorithm for automated identification of recombinant sequences and recombination breakpoints. AIDS Res. Hum. Retroviruses 21, 98–102. doi: 10.1089/aid.2005.21.98

PubMed Abstract | CrossRef Full Text | Google Scholar

Martínez, D. R., Sousa, P. D., Olmedo, J. G., and Figueira, A. D. S. (2014). Molecular and biological studies of Papaya ringspot virus isolates from Brazil and Cuba. Am. J.Agric. For. 2, 209–218. doi: 10.11648/j.ajaf.20140205.11

CrossRef Full Text | Google Scholar

Menzel, W., Abang, M. M., and Winter, S. (2011). Characterization of cucumber vein-clearing virus, a whitefly (Bemisia tabaci G.)-transmitted Carlavirus. Arch. Virol. 156, 2309–2311. doi: 10.1007/s00705-011-1124-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Ming, R., Hou, S., Feng, Y., Yu, Q., Alexandre, D., Saw, J. H., et al. (2008). The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 452, 991–996. doi: 10.1038/nature06856

PubMed Abstract | CrossRef Full Text | Google Scholar

Mishra, M., Rames, C., and Sangeeta, S. (2007). “Genome mapping and molecular breeding in plants,” in Fruits and Nuts.©, ed. C. Kole (Heidelberg: Springer-Verlag Publisher), 343–351.

Google Scholar

Mofunanya, A. A. J., and Edu, E. A. (2015). Physiological and biochemical changes in Cucurbita Moschata duch. ex. poir inoculated with a nigerian strain of Moroccan watermelon mosaic virus (MWMV): Lagenaria breviflora isolate. Int. J. Plant Pathol. 6, 36–47. doi: 10.3923/ijpp.2015.36.47

CrossRef Full Text | Google Scholar

Mohammed, H., Manglli, A., Zicca, S., El Hussein, A., Mohamed, M., Tomassoli, T., et al. (2012). First report of Papaya ringspot virus in pumpkin in Sudan. New Dis. Reports. 26:26. doi: 10.5197/j.2044-0588.2012.026.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Nagy, P. (2008). “Recombination in plant RNA viruses,” in Plant Virus Evolution, ed. M. J. Roossinck (Heideberg: Springer-Verlag Publisher), 133–156. doi: 10.1007/978-3-540-75763-4_8

CrossRef Full Text | Google Scholar

Naidu, R. A., Gowda, S., Satyanarayana, T., Boyko, V., Reddy, A. S., Dawson, W. O., et al. (1998). Evidence that whitefly-transmitted cowpea mild mottle virus belongs to the genus Carlavirus. Arch. Virol. 143, 769–780. doi: 10.1007/s007050050328

PubMed Abstract | CrossRef Full Text | Google Scholar

Naidu, R. A., and Hughes, J. d’A (2003). “Methods for the detection of plant virus diseases,” in Proceedings of a Conference on Plant Virology in Sub Saharan Africa Organized by IITA, Ibadan, 233–260.

Google Scholar

Noa-Carrazana, J. C., González-De-León, D., Ruiz-Castro, B. S., Piñero, D., and Silva-Rosales, L. (2006). Distribution of Papaya ringspot virus and Papaya mosaic virus in papaya plants (Carica papaya) in Mexico. Plant Dis. 90, 1004–1011. doi: 10.1094/PD-90-1004

PubMed Abstract | CrossRef Full Text | Google Scholar

Nurk, S., Meleshko, D., Korobeynikov, A., and Pevzner, P. A. (2017). MetaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834. doi: 10.1101/gr.213959.116

PubMed Abstract | CrossRef Full Text | Google Scholar

Ohshima, K., Yamaguchi, Y., Hirota, R., Hamamoto, T., Tomimura, K., Tan, Z., et al. (2002). Molecular evolution of Turnip mosaic virus: evidence of host adaptation, genetic recombination and geographical spread. J. Gen. Virol. 83, 1511–1521. doi: 10.1099/0022-1317-83-6-1511

PubMed Abstract | CrossRef Full Text | Google Scholar

Omar, A. F., El-Kewey, S. A., Sidaros, S. A., and Shimaa, A. K. (2011). Egyptian isolates of Papaya ringspot virus form a molecularly distinct clade. J. Plant Pathol. 93, 569–576.

Google Scholar

Ombwara, F. K., Asudi, G. O., Rimberia, F. K., Ateka, E. M., and Wamocho, L. S. (2014). The distribution and prevalence of Papaya Ring Spot Virus (PRSV) in Kenyan Papaya. Acta Hortic. 1, 119–124. doi: 10.17660/actahortic.2014.1022.15

CrossRef Full Text | Google Scholar

Ondov, B. D., Bergman, N. H., and Phillippy, A. M. (2011). Interactive metagenomic visualization in a web browser. BMC Bioinformatics 12:385. doi: 10.1186/1471-2105-12-385

PubMed Abstract | CrossRef Full Text | Google Scholar

Owolabi, A. T., Rabenstein, F., Ehrig, F., Maiss Edgar, M., and Vetten, H. J. (2012). Strains of Moroccan watermelon mosaic virus isolated from lagenaria breviflorus and Coccinia barteri in Calabar, Southeastern Nigeria. Int. J. Virol. 8, 258–270. doi: 10.3923/ijv.2012.258.270

CrossRef Full Text | Google Scholar

Oyunga, M. A., Grant, F., Omondi, D., Ouedraogo, H., Levin, C., Low, J. W., et al. (2016). Prevalence and predictors of vitamin a deficiency among infants in Western Kenya using a cross-sectional analysis. African J. Food Agric. Nutr. Dev. 16, 10765–10785. doi: 10.18697/ajfand.73.16190

PubMed Abstract | CrossRef Full Text | Google Scholar

Padidam, M., Sawyer, S., and Fauquet, C. M. (1999). Possible emergence of new geminiviruses by frequent recombination. Virology 265, 218–225. doi: 10.1006/viro.1999.0056

PubMed Abstract | CrossRef Full Text | Google Scholar

Posada, D., and Crandall, K. A. (2001). Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc. Natl. Acad. Sci. U.S.A. 98, 13757–13762. doi: 10.1073/pnas.241370698

PubMed Abstract | CrossRef Full Text | Google Scholar

Roossinck, M. J. (2015). Plants, viruses and the environment: ecology and mutualism. Virology 479–480, 271–277. doi: 10.1016/j.virol.2015.03.041

PubMed Abstract | CrossRef Full Text | Google Scholar

Roossinck, M. J. (2017). Deep sequencing for discovery and evolutionary analysis of plant viruses. Virus Res. 239, 82–86. doi: 10.1016/j.virusres.2016.11.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Rozanov, M. N., Koonin, E. V., and Gorbalenya, A. E. (1992). Conservation of the putative methyltransferase domain: a hallmark of the ‘Sindbis-like’ supergroup of positive-strand RNA viruses center for biotechnology information, national library of medicine, building 38A, national institutes of health, 8600. J. Gen. Virol. 73, 2129–2134. doi: 10.1099/0022-1317-73-8-2129

PubMed Abstract | CrossRef Full Text | Google Scholar

Sakuanrungsirikul, S., Sarindu, N., Prasartsee, V., Chaikiatiyos, S., Siriyan, R., Sriwatanakul, M., et al. (2014). Update on the development of virus-resistant papaya: virus-resistant transgenic papaya for people in rural communities of Thailand. Food Nutr. Bull. 26, 422–426.

PubMed Abstract | Google Scholar

Sharma, P., Sahu, A. K., Verma, R. K., Ritesh, M., Choudhary, D. K., Gaur, R. K., et al. (2014). Current status of Potyvirus in India. Arch. Phytopathol. Plant Prot. 47, 906–918. doi: 10.1080/03235408.2013.825963

CrossRef Full Text | Google Scholar

Sharma, S. K., and Tripathi, S. (2014). “Papaya ringspot virus-P. Overcoming limitations of resistance breeding in carica papaya L,” in Plant Virus-Host Interaction: Molecular Approaches and Viral Evolution, eds R. K. Gaur, T. Hohn, and P. Sharma (Amsterdam: Elsevier), 177–194.

Google Scholar

Shukla, D. D., and Ward, C. W. (1989). Identification and classification of Potyviruses on the basis of coat protein sequence data and serology. Arch. Virol. 106, 171–200. doi: 10.1007/bf01313952

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, V., and Shukla, K. (2011). Influence of inoculation time on severity of virus disease caused by Papaya ringspot virus. Ann Pl Protec Sci 19, 142–146.

Google Scholar

Smith, J. M. (1992). Analyzing the mosaic structure of genes. J. Mol. Evol. 34, 126–129. doi: 10.1007/BF00182389

PubMed Abstract | CrossRef Full Text | Google Scholar

Srinivasulu, M., and Sai Gopal, D. V. R. (2011). Coat protein sequence comparison of south Indian isolates of Papaya ringspot virus with other Indian subcontinent isolates. Phytopathol. Mediterr. 50, 359–369.

Google Scholar

Syller, J. (2012). Facilitative and antagonistic interactions between plant viruses in mixed infections. Mol. Plant Pathol. 13, 204–216. doi: 10.1111/j.1364-3703.2011.00734.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Tamura, K., Stecher, G., Peterson, D., Filipski, A., and Kumar, S. (2013). MEGA6: molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 30, 2725–2729. doi: 10.1093/molbev/mst197

PubMed Abstract | CrossRef Full Text | Google Scholar

Tennant, P. F., Fermin, G. A., and Roye, M. E. (2007). Viruses infecting Papaya (Carica papaya L.): etiology, pathogenesis, and molecular biology. Plant Viruses 1, 178–188.

Google Scholar

Tripathi, S., Suzuki, J., and Gonsalves, D. (2007). Development of genetically engineered resistant papaya for Papaya ringspot virus in a timely manner a comprehensive and successful approach. Methods Mol. Biol. 354, 197–240. doi: 10.1385/1-59259-966-4:197

PubMed Abstract | CrossRef Full Text | Google Scholar

Tripathi, S., Suzuki, J. Y., Ferreira, S. A., and Gonsalves, D. (2008). Papaya ringspot virus-P: characteristics, pathogenicity, sequence variability and control. Mol. Plant Pathol. 9, 269–280. doi: 10.1111/j.1364-3703.2008.00467.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Ventura, J. A., Costa, H., Tatagiba, J., and da, S. (2004). “Papaya diseases and integrated control,” in Diseases of Fruits and Vegetables, Vol. II, ed. S. A. M. Naqvi (Dordrecht: Kluwer Academic Publishers), 201–268. doi: 10.1007/1-4020-2607-2_7

CrossRef Full Text | Google Scholar

Xie, X., Chen, W., Fu, Q., Zhang, P., An, T., Cui, A., et al. (2016). Molecular variability and distribution of Sugarcane mosaic virus in Shanxi, China. PLoS One 11:e0151549. doi: 10.1371/journal.pone.0151549

PubMed Abstract | CrossRef Full Text | Google Scholar

Yakoubi, S., Desbiez, C., Fakhfakh, H., Wipf-Scheibel, C., Marrakchi, M., Lecoq, H., et al. (2008). Biological characterization and complete nucleotide sequence of a Tunisian isolate of Moroccan watermelon mosaic virus. Arch. Virol. 153, 117–125. doi: 10.1007/s00705-007-1074-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Ye, J., Coulouris, G., Zaretskaya, I., Cutcutache, I., Rozen, S., Madden, T. L., et al. (2012). Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinformatics 13:134. doi: 10.1186/1471-2105-13-134

PubMed Abstract | CrossRef Full Text | Google Scholar

Yogiraj, V., Goyal, P. K., and Chauhan, C. S. (2014). Carica papaya Linn: an overview. Int. J. Herb. Med. 2, 1–8.

Google Scholar

Zhao, H., Jia, R. Z., Zhang, Y.-L., Zhu, Y. J., Zeng, H. C., Kong, H., et al. (2016). Geographical and genetic divergence among Papaya ringspot virus populations within Hainan Province, China. Phytopathology 106, 937–944. doi: 10.1094/phyto-05-15-0111-r

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: papaya, next-generation sequencing, Carlavirus, Potyvirus, ringspot disease, diagnostic primers

Citation: Mumo NN, Mamati GE, Ateka EM, Rimberia FK, Asudi GO, Boykin LM, Machuka EM, Njuguna JN, Pelle R and Stomeo F (2020) Metagenomic Analysis of Plant Viruses Associated With Papaya Ringspot Disease in Carica papaya L. in Kenya. Front. Microbiol. 11:205. doi: 10.3389/fmicb.2020.00205

Received: 24 October 2019; Accepted: 29 January 2020;
Published: 04 March 2020.

Edited by:

Ralf Georg Dietzgen, The University of Queensland, Australia

Reviewed by:

Neil Boonham, Newcastle University, United Kingdom
Jan Frederik Kreuze, International Potato Center, Peru

Copyright © 2020 Mumo, Mamati, Ateka, Rimberia, Asudi, Boykin, Machuka, Njuguna, Pelle and Stomeo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Naomi Nzilani Mumo,; Francesca Stomeo,

present address: Francesca Stomeo, European Molecular Biology Laboratory, Heidelberg, Germany

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.