- 1Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- 2Department of Plant Pathology and Microbiology, National Taiwan University, Taipei, Taiwan
The periwinkle leaf yellowing (PLY) disease was first reported in Taiwan in 2005. This disease was caused by an uncultivated bacterium in the genus “Candidatus phytoplasma.” In subsequent years, this bacterium was linked to other plant diseases and caused losses in agriculture. For genomic investigation of this bacterium and its relatives, we conducted whole genome sequencing of a PLY phytoplasma from an infected periwinkle collected in Taoyuan. The de novo genome assembly produced eight contigs with a total length of 824,596 bp. The annotation contains 775 protein-coding genes, 63 pseudogenes, 32 tRNA genes, and two sets of rRNA operons. To characterize the genomic diversity across populations, a second strain that infects green onions in Yilan was collected for re-sequencing analysis. Comparison between these two strains identified 337 sequence polymorphisms and 10 structural variations. The metabolic pathway analysis indicated that the PLY phytoplasma genome contains two regions with highly conserved gene composition for carbohydrate metabolism. Intriguingly, each region contains several pseudogenes and the remaining functional genes in these two regions complement each other, suggesting a case of duplication followed by differential gene losses. Comparative analysis with other available phytoplasma genomes indicated that this PLY phytoplasma belongs to the 16SrI-B subgroup in the genus, with “Candidatus Phytoplasma asteris” that causes the onion yellowing (OY) disease in Japan as the closest known relative. For characterized effectors that these bacteria use to manipulate their plant hosts, the PLY phytoplasma has homologs for SAP11, SAP54/PHYL1, and TENGU. For genome structure comparison, we found that potential mobile unit (PMU) insertions may be the main factor that drives genome rearrangements in these bacteria. A total of 10 PMU-like regions were found in the PLY phytoplasma genome. Two of these PMUs were found to harbor one SAP11 homolog each, with one more similar to the 16SrI-B type and the other more similar to the 16SrI-A type, suggesting possible horizontal transfer. Taken together, this work provided a first look into population genomics of the PLY phytoplasmas in Taiwan, as well as identified several evolutionary processes that contributed to the genetic diversification of these plant-pathogenic bacteria.
Introduction
Phytoplasmas are a group of plant-pathogenic bacteria with significant impact on agriculture worldwide (Lee et al., 2000; Hogenhout et al., 2008; Bertaccini and Lee, 2018). These wall-less bacteria belong to the class Mollicutes, with animal-pathogenic Mycoplasma and insect-symbiotic Spiroplasma as their close relatives (Gasparich, 2010; Chen et al., 2012). However, unlike Mycoplasma and Spiroplasma, axenic culture of phytoplasmas has not been successfully established yet, such that these bacteria are assigned to a “Candidatus” (“Ca.”) genus and lack formal taxonomy (The IRPCM Phytoplasma/Spiroplasma Working Team - Phytoplasma taxonomy group, 2004). For classification, the current system relies on the restriction fragment length polymorphism (RFLP) profiles of their 16S rRNA gene (Lee et al., 1993, 1998; Zhao et al., 2009; Zhao and Davis, 2016).
To date, >30 phytoplasmal 16S rRNA gene RFLP (16Sr) groups and many more subgroups have been established, with each group containing one or more “Ca. Phytoplasma” species (Zhao and Davis, 2016). Among these, “Ca. P. asteris” (Lee et al., 2004) that was assigned to the 16SrI group is the best characterized one. Strains belonging to “Ca. P. asteris” have a worldwide distribution and are associated with >100 economically important plant diseases. In Taiwan, a “Ca. P. asteris” phytoplasma that causes the periwinkle leaf yellowing (PLY) disease was first reported in Taoyuan in 2005 (Chen et al., 2011). In addition to leaf yellowing, this phytoplasma also induces other symptoms such as virescence, phyllody, and witches’ broom. Subsequent characterization of this phytoplasma revealed that its potential vectors include multiple leafhoppers, such as Macrosteles orientalis, Cicadulina bipunctella, Phlogotettix cyclops, and Balclutha sp. (Chen et al., 2011). Moreover, it can infect other cultivated plants such as chrysanthemum (Chrysanthemum sp.), cosmos (Cosmos bipinnatus), torenia (Torenia fournieri), Persian violet (Exacum affine), and cucumber (Cucumis sativus), also well as a weed, goosegrass (Eleusine indica) (Chen et al., 2011).
Since the initial disease report, this PLY phytoplasma continued to have recurring occurrences in Taoyuan and affected the commercial production of periwinkles. For more detailed characterization of this phytoplasma, we conducted whole genome sequencing of a PLY strain (i.e., strain DY2014; see section “Materials and Methods”) for comparative analysis with other 16SrI phytoplasmas. Particularly, we are interested in the gene content of this PLY phytoplasma with regard to effectors (Sugio et al., 2011b; Oshima et al., 2013). Moreover, to characterize the genomic diversity across different populations of the PLY phytoplasmas in Taiwan, a second strain (i.e., SS2016) was collected for re-sequencing analysis. The findings and implications are reported in this work.
Materials and Methods
Sample Collection and Characterization
Two strains of phytoplasmas were collected for this study. The first strain, DY2014, was collected from an infected periwinkle (Catharanthus roseus) in an herbal flower nursery located in the Dayuan District (Taoyuan City, Taiwan) in 2014. The second strain, SS2016, was collected from an infected green onion (Allium fistulosum) in a commercial field located in the Sanshing Township (Yilan County, Taiwan) in 2016. In both cases, the diseased plants were provided by the producers to the authors for diagnosis and research purpose.
After the initial symptom observation, total DNA of the infected plant was extracted using the Wizard Genomic DNA Purification Kit (Pomega, United States) for PCR-based detection of phytoplasmas and whole genome shotgun sequencing (Chung et al., 2013). For the periwinkle sample, leaf midribs were used for DNA extraction to increase the proportion of phytoplasma DNA. For the green onion sample, young leaves exhibiting the yellowing symptom were used without any attempt to cut out the vascular tissues. After the standard genomic DNA preparation, no additional enrichment for phytoplasma DNA (e.g., cesium chloride density-gradient centrifugation) was performed. The phytoplasma-specific primer set P1/P7 was used to amplify a partial sequence of the rRNA operon, followed by Sanger sequencing using an ABI Prism 3700 Genetic Analyzer (Applied Biosystems, United States). The resulting sequences were used as the queries to run BLASTN (Camacho et al., 2009) searches against the NCBI non-redundant nucleotide database (Benson et al., 2018) to verify the identity of the phytoplasma found. Furthermore, the 16Sr group assignment was performed by using iPhyClassifier (Zhao et al., 2009).
For the green onion sample, transmission electron microscopy (TEM) was performed to visualize phytoplasma cells within the phloem tissue. The sample was fixed with 2.5% glutaraldehyde and 4% paraformaldehyde in 0.1 M sodium phosphate buffer (pH 7.0) for 4 h at room temperature. Afterward, the sample was cleaned three times (20 min each) with the buffer and postfixed with 1% OsO4 in the same buffer for 4 h at room temperature, followed by three changes of the buffer (20 min each). The dehydration step was performed by using an ethanol series and propylene oxide. After dehydration, the sample was embedded in Spurr’s resin and sectioned with a Lecia Reichert Ultracut S ultramicrotome (Leica, Germany). The ultra-thin sections (70–90 nm) were stained with uranyl acetate and lead citrate. A FEI G2 Tecnai Spirit Twin transmission electron microscope (FEI, United States) at 80 KV was used for viewing and the images were taken with a Gatan Orius CCD camera (Gatan, United States).
Genome Sequencing and Analysis
The procedures for genome sequencing and analysis were based on those described in our previous work on phytoplasma genomes (Chung et al., 2013; Orlovskis et al., 2017; Seruga Music et al., 2019). All bioinformatics tools were used with the default settings unless stated otherwise. For shotgun sequencing, one paired-end library with ∼550-bp insert size was prepared for each sample and sequenced using the Illumina MiSeq (Illumina, United States) platform.
The strain DY2014 was maintained by side grafting of periwinkles in a greenhouse at the National Taiwan University (Taipei, Taiwan). The genomic DNA used for whole genome shotgun sequencing was prepared from an artificially infected plant collected in 2016, 2 years after the initial collection of the naturally infected plant. The de novo assembly was performed using Velvet v1.2.10 (Zerbino and Birney, 2008) with the following settings: k = 191, exp_cov = auto, cov_cutoff = 10, max_coverage = 500, min_contig_lgth = 2000, and scaffolding = no. All contigs were used as the queries to run BLASTX (Camacho et al., 2009) searches against the NCBI non-redundant protein database (Benson et al., 2018) to identify putative phytoplasma contigs. The draft assembly was iteratively improved by mapping all raw reads to the contigs using BWA v0.7.17 (Li and Durbin, 2009) and checking for paired-reads that could extend the contigs, provide evidence for scaffolding, or fill gaps using SAMTOOLS v1.9 (Li et al., 2009) and IGV v2.5.0 (Robinson et al., 2011). Additionally, primer walking and Sanger sequencing were used to close the assembly gaps. The gene prediction was performed using RNAmmer v1.2 (Lagesen et al., 2007), tRNAscan-SE v1.3.1 (Lowe and Eddy, 1997), and Prodigal v2.6.3 (Hyatt et al., 2010). The homologous gene clusters among available phytoplasma genomes were identified by using OrthoMCL (Li et al., 2003). These results were used as the basis for annotation and comparative analysis. For manual curation of the annotation results, the KEGG database (Kanehisa et al., 2010) was used as the reference. Circos v0.69-6 (Krzywinski et al., 2009) was used for preparing the genome map. Mummer v3.23 (Kurtz et al., 2004) was used for inferring pairwise genome alignments with the setting “-maxmatch -c 2000.” For phylogenetic inference, MUSCLE v3.8.31 (Edgar, 2004) was used to generate multiple sequence alignments and PhyML v3.3 (Guindon and Gascuel, 2003) was used to infer maximum likelihood phylogenies.
For strain SS2016, the genomic DNA was prepared from a naturally infected green onion collected in 2016. A re-sequencing approach was used to identify its differences from DY2014. All Illumina raw reads derived from the SS2016 sample were mapped to the finalized assembly of DY2014 genome using BWA v0.7.17 (Li and Durbin, 2009). The mapping results were programmatically checked using the MPILEUP program in SAMTOOLS package v1.9 (Li et al., 2009), and visually inspected using IGV v2.5.0 (Robinson et al., 2011).
To predict the putative effector genes, all protein-coding genes were examined using SignalP v5.0 (Armenteros et al., 2019) to check for the presence of signal peptide based on the Gram-positive bacteria model. The program TMHMM v2.0 (Krogh et al., 2001) was used to verify that these putative secreted proteins have no transmembrane domain. Furthermore, the program COILS-WRAP (Lupas et al., 1991) was used to predict the presence of coiled coil domain with the settings “-m MTIDK -w 1”; regions with a probability of >50% in the sliding window analysis were identified. Finally, for those effectors that have been experimentally characterized (i.e., SAP11, SAP54/PHYL1, and TENGU), the functional domains involved in nuclear localization or virulence were identified based on the results of previous studies (Sugawara et al., 2013; Maejima et al., 2014; Sugio et al., 2014).
Results and Discussion
Symptom Observation and Phytoplasma Detection
The strain DY2014 is associated with the symptom of virescence on periwinkles (Figure 1A), as have been reported in the early characterization of PLY disease in the same region (i.e., Taoyuan, Taiwan) (Chen et al., 2011). The strain SS2016 is associated with chlorosis and dwarfism on green onions (Figure 1B). Furthermore, TEM observation of the leaf midvein confirmed that a large number of wall-less phytoplasma cells could be found within the plant sieve element cells (Figure 1C).
 
  Figure 1. Disease symptoms. (A) Virescence (indicated by yellow arrowheads) on the periwinkle infected by strain DY2014. Photo provided by C-PL. (B) Chlorosis and dwarfism on green onion (front: infected by strain SS2016; back: healthy). Photo provided by Wei-An Tsai. (C) Transmission electron microscopy (TEM) observation of the sample in (B); leaf longitudinal section. Note the large number of wall-less phytoplasma cells within the plant sieve element cell in the center.
The molecular diagnostic based on 16S rDNA PCR found that both strains are 100% identical to the reference record of the PLY phytoplasma collected in 2005 from Taoyuan, Taiwan (GenBank accession FJ437568). Based on the in silico RFLP analysis by iPhyClassifier (Zhao et al., 2009), this sequence may represent a new subgroup within the 16SrI group and is most similar to the 16SrI-B representative (GenBank accession AP006628) with a similarity coefficient of 0.97. The maximum likelihood phylogeny based on the 16S rRNA gene provided a similar inference, namely that these PLY phytoplasma strains in Taiwan are most closely related to the 16SrI-B onion yellows (OY-M) phytoplasma found in Japan (Figure 2).
 
  Figure 2. Maximum likelihood phylogeny of representative phytoplasmas based on the 16S rRNA gene. The numbers on the internal branches indicate the level of bootstrap support based on 1,000 resampling (only values ≥60% are shown). The strains with genome sequences available are highlighted in bold (∗draft; ∗∗complete). The 16Sr group assignments are provided in brackets. The three major clusters of phytoplasmas are highlighted by colored background (blue, yellow, and green). Acholeplasma laidlawii is included as the outgroup.
Genomic Characterization
The genome sequencing of these two phytoplasma strains generated ∼5.2 Gb of raw reads for DY2014 (NCBI accession: PRJNA530090) and ∼4.6 Gb for SS2016 (PRJNA529747). For DY2014 that was found in an infected periwinkle, ∼1.9% of the Illumina reads were mapped to the finalized phytoplasma assembly, while the remaining reads were mostly derived from the plant host. This sequencing depth corresponded to ∼124-fold coverage of the phytoplasma genome and the de novo assembly produced eight contigs totaling 824,596 bp (Figure 3). These contigs can be arranged into one linear scaffold based on PCR confirmation; only the connection between contigs II and III was not verified. Unfortunately, due to the high sequence composition biases (i.e., 27.6% genome-wide G + C content) and highly abundant repeats (e.g., homopolymers and di-/tri-nucleotide repeats), the Sanger sequencing of those PCR products were unsuccessful and we were unable to close those assembly gaps.
 
  Figure 3. Genome map of the periwinkle yellowing phytoplasma DY2014. Rings from the outside in: (1) contig id (I-VIII); (2) scale marks (kb); (3 and 4) protein-coding genes on the forward and reverse strand, respectively (color-coded by functional category); (5) PMU-like regions (orange), putative secreted proteins (red), and pseudogenes (gray); (6) sequencing coverage of DY2014 (purple); (7) sequencing coverage of SS2016 (green), polymorphic sites in comparison with DY2014 are highlighted in red; (8) GC skew (dark gray: positive; light gray: negative); (9) GC content (dark gray: above average; light gray: below average).
For strain SS2016 that was derived from an infected green onion, only ∼0.17% of the Illumina reads were assigned to phytoplasma, which corresponded to ∼11-fold coverage of this bacterial genome. De novo assembly with such low coverage was not feasible so a re-sequencing analysis was performed by using the DY2014 genome as the reference. Based on the mapping results, 10 structural variations and 337 sequence polymorphisms were found between these two genomes (Figure 3 and Supplementary Table S1). Among these sequence polymorphisms, 201 (60%) were found within protein-coding regions. These include six indels that affected six different genes (two encode putative secreted proteins and four encode hypothetical proteins) and 123 single nucleotide polymorphisms (SNPs) that introduced non-synonymous mutations in 93 protein-coding genes. Compared to a previous population genomics study of the maize bushy stunt phytoplasmas (MBSP) collected in Brazil (Orlovskis et al., 2017), the genetic divergence observed between these two PLY phytoplasmas in Taiwan is much higher. For the MBSP study, six strains were collected from two locations in the state of São Paulo (i.e., Guaíra and Piracicaba; ∼280 km apart) and only 86 sequence polymorphisms were found. In comparison, the two PLY strains included in this study were collected from locations that are ∼63 km apart. Even after adjusting for the difference in genome size, the density of polymorphic sites in these PLY phytoplasmas is still >twofold higher. Because those six MBSP strains were all collected from infected maize plants, while the two PLY strains in this study were collected from different plant hosts (i.e., periwinkle and green onion), it is possible that the specialization for different hosts may have contributed to the genetic divergence observed. Nevertheless, many other factors may be involved in shaping the genetic diversity of phytoplasmas. For example, although Taoyuan and Yilan have nearly the same average monthly temperatures, Yilan is located in the eastern coast of Taiwan and receives much more rainfall during winter. Additionally, compared to the large maize farms in Brazil, the periwinkle and green onion production in Taiwan are operated at much smaller scales, which may restrict the effective population sizes of phytoplasmas and their vectors/hosts, thus resulting in higher levels of genetic drift. Given these complexities, it is possible that no straightforward approach exists for quantifying the relative contribution of each factor to phytoplasma genetic diversity.
Despite the polymorphisms found, the genome-wide sequence identity between these two PLY phytoplasmas is >99.9%. Base on this high similarity, as well as the availability of a high-quality de novo assembly, DY2014 was selected as the representative for more detailed analysis and comparison with other phytoplasmas. The annotation of this DY2014 draft assembly (SRMC00000000.1) contains two complete sets of 16S-23S-5S rRNA gene clusters, 32 tRNA genes covering all 20 amino acids, 775 protein-coding genes, and 63 pseudogenes (Table 1). Similar to other phytoplasmas with complete genome sequences available, particularly those belonging to “Ca. P. asteris” within the 16SrI group (Oshima et al., 2004; Bai et al., 2006; Orlovskis et al., 2017), the gene content and metabolic pathways are highly reduced. Among the protein-coding genes, 264 (34%) were annotated as hypothetical proteins and 424 (55%) lacked any specific COG functional category assignment. For those with functional category assignments, 206 (27% of total, 59% among those with COG assignments) are related to the processing of genetic information (i.e., translation [J], transcription [K], and replication [L]). Other than information processing, genes encode putative secreted proteins (76; 10% of total) and transporters (44; 6% of total) are two of the largest groups. These findings are similar to the results from previous characterization of phytoplasma genomes (Kube et al., 2012; Chung et al., 2013; Oshima et al., 2013; Chen et al., 2014; Seruga Music et al., 2019). Two major inferences could be made from these observations. First, a substantial fraction of phytoplasma genes are still yet to be characterized and the current reference databases are inadequate. Second, the overall gene content of all characterized phytoplasma genomes fits the profile of obligate parasites that rely on hosts for survival and replication. Other than the genes necessary for processing their own genetic information, substantial fractions of genes are devoted to manipulate their hosts (e.g., secreted proteins) and to acquire nutrients (e.g., transporters).
Compare to other bacteria that form obligate symbiosis with their hosts, phytoplasmas are similar in having small genomes but are distinct in having high levels of genomic plasticity (McCutcheon and Moran, 2012). We found a set of two 27-kb duplicated regions in this PLY phytoplasma (Figure 4). These regions contain genes involved in glycolysis pathway, which is the major energy-yielding pathway for phytoplasmas (Oshima et al., 2004, 2013; Kube et al., 2012). Intriguingly, differential gene losses have occurred in these two regions, such that both regions are required for the complete glycolysis pathway. Based on a model proposed for symbiont genome evolution (Lo et al., 2016), it is expected that eventually those pseudogenes will be removed through accumulation of mutations due to the deletional biases observed in bacterial genomes (Mira et al., 2001; Kuo and Ochman, 2009). At that point, the disruption of synteny may appear to have been resulted from multiple events of translocation that involved few genes at a time.
 
  Figure 4. (A) Synteny between two duplicated regions within the DY2014 genome. The contig ids (i.e., VII and VIII) and the start-end positions of these regions are labeled. Genes participated in the glycolysis are highlighted in yellow. Intact genes are drawn with solid borders, putative pseudogenes are drawn with dotted borders. (B) Glycolysis pathway and the genes involved. Four of these genes (i.e., fbaA, tpiA, gapA, and pgk) are located in other regions of contig VIII. Correspondence between gene name and product: pgi, glucose-6-phosphate isomerase; pfkA, 6-phosphofructokinase; fbaA, fructose-bisphosphate aldolase; tpiA, triosephosphate isomerase; gapA, glyceraldehyde-3-phosphate dehydrogenase; pgk, 3-phosphoglycerate kinase; gpmI, phosphoglycerate mutase; eno, enolase; pyk, pyruvate kinase.
A similar finding regarding the duplication of glycolytic genes has been reported in an onion yellows phytoplasma strain OY-W (Oshima et al., 2007). Intriguingly, OY-W causes severe symptoms in its plant hosts, while its closely related strain OY-M causes only mild symptoms of leaf yellowing and has only one glycolytic gene cluster, suggesting that gene duplications may affect bacterial physiology and their virulence.
Comparative Genomics Among 16SrI Group Phytoplasmas
In phytoplasma genome organization and evolution, putative mobile units (PMUs) are known to play an important role (Bai et al., 2006). A total of 10 PMUs were found in this DY2014 genome (Supplementary Figure S1), including a set of nested PMUs (i.e., PMU2 was inserted within PMU3, disrupting the dnaB of the latter). Based on pairwise genome alignments, most of the synteny breakpoints correspond to PMU insertion sites (Figure 5). Although DY2014 is most closely related to OY-M based on 16S phylogeny (Figure 2), genome characteristics, and geographical locations (Table 1), the overall chromosome structure differ considerably between these two strains (Figure 5). This lack of conservation appeared to be resulted from a large inversion and multiple expansions of repetitive regions in the OY-M genome. If the PMUs were excluded, DY2014 has a chromosomal organization that is highly similar to the MBSP strain M3, which is a 16SrI-B strain with a much smaller genome (Table 1). Such conservation could be found even in the 16SrI-A strain AYWB (Figure 5). Based on these observations, it is likely that the increase of PMU copy numbers and genome sizes observed in DY2014 and OY-M represent derived traits. Nevertheless, more high-quality genome assemblies are necessary for examining the contribution of PMU abundance to phytoplasma genome size variations in general.
 
  Figure 5. Pairwise genome alignments. The genome of DY2014 was used as the reference for comparison with other 16SrI group representatives. Matches on the same strand and the opposite strand are labeled in red and blue, respectively. PMU regions are highlighted in orange. A region on contig II has a darker shade of orange due to the insertion of PMU2 into the middle of PMU3 (see Supplementary Figure S1).
In addition to genome size expansion and chromosomal restructuring, PMUs also play an important role in phytoplasma evolution because these mobile genetic elements often carry putative effector genes that these bacteria use to manipulate their hosts (Bai et al., 2006; Sugio and Hogenhout, 2012). Horizontal transfer of PMUs between diverse phytoplasmas have been inferred for several lineages within the genus (Chung et al., 2013; Ku et al., 2013; Seruga Music et al., 2019), suggesting that effector gene exchanges were prevalent in their evolutionary history. Moreover, sequence polymorphisms for a putative effector gene located on a PMU among field isolates of the MBSP in Brazil have been shown to be correlated with the symptoms induced by these bacteria (Orlovskis et al., 2017). Examination of the three phytoplasma effectors that have been functionally characterized, namely SAP11 (Bai et al., 2009; Sugio et al., 2011a, 2014; Lu et al., 2014; Chang et al., 2018), SAP54/PHYL1 (MacLean et al., 2011, 2014; Maejima et al., 2014; Orlovskis and Hogenhout, 2016), and TENGU (Hoshi et al., 2009; Sugawara et al., 2013; Minato et al., 2014), found that the homologous genes for all these effectors are present in these two PLY phytoplasma strains in Taiwan. Comparison of these effector genes between the two PLY phytoplasmas found only one non-synonymous mutation affecting a SAP11 homolog (locus tag: PLY_1110). This low level of sequence divergence may be explained by purifying selection acting on these important genes, or the lack of sufficient divergence time for mutation accumulation. Similar to other 16SrI phytoplasmas, the SAP11 and SAP54/PHYL1 homologs in PLY phytoplasma are all located within PMUs (Figure 6).
 
  Figure 6. (A) Comparison of effector-containing putative mobile units (PMUs). The genomic location (contig: start-end) of each PMU is labeled (“chr” is the abbreviation of chromosome for strains with complete genome sequence). Genes that are commonly associated with PMUs are highlighted in colors, putative secreted proteins are colored in red. The two focal effector genes, SAP11 (top) and SAP54/PHYL1 (bottom), are labeled with an asterisk. Intact genes are drawn with solid borders, putative pseudogenes are drawn with dotted borders. Nucleotide sequence similarities between conserved regions are illustrated by different shades of gray colors. (B) Sequence alignment to illustrate a 14-bp deletion (highlighted in yellow) in the contig IV of DY2014 that resulted in a frameshift mutation in its SAP11 homolog. The reading frames (i.e., +1 and +3) are labeled. Polymorphic sites are highlighted in red. The black arrowhead indicated the putative signal peptide cleavage site.
Intriguingly, these two PLY phytoplasma strains in Taiwan (i.e., DY2014 and SS2016), as well as one onion yellows strain in Japan (i.e., OY-V) (Kakizawa et al., 2014), were found to harbor two copies of SAP11 gene. In addition to a vertically inherited 16SrI-B type copy (locus tag: PLY_1110), a second copy may have originated from horizontal acquisition of a 16SrI-A type PMU that exhibited synteny conservation with one PMU-like region in the AYWB phytoplasma genome (Figure 6A). Other than synteny conservation in the neighboring regions, multiple sequence alignment and molecular phylogenetic analysis further support that these PLY/OY-V SAP11 homologs could be classified into two types (Figure 7). Based on the previous characterization of 16SrI group phytoplasmas (Lee et al., 2004), 16SrI-A strains are mainly distributed in the eastern United States and infect Asteraceae hosts, while 16SrI-B strains are found in the western United States as well as worldwide and have highly diverse plant hosts (e.g., those infect celery in California, maize in Brazil, onion in Japan, periwinkle in Taiwan, etc). Thus, it is plausible that the United States is the location for these phytoplasmas to have gene flow. More comprehensive sampling of phytoplasma genomes, particularly those 16SrI-B strains in the United States, is necessary to test such hypothesis.
 
  Figure 7. Comparison of phytoplasma effectors. (A) SAP11 homologs, (B) SAP54/PHYL1 homologs, and (C) TENGU homologs. Left, maximum likelihood phylogenies based on the protein sequence alignments. Branches colored in black indicate those with a bootstrap support of ≥60%, those in gray have a bootstrap support of <60%. The locus tags are color-coded according to the 16Sr group assignments. Right, multiple sequence alignments with highlights for functional domains.
Regarding the putatively acquired 16SrI-A type SAP11 in PLY (locus tag: PLY_3690), a 14-bp deletion in the coding region has introduced a frameshift mutation (Figure 6B). Although an alternative start codon may be used for translation, this truncated product lacks the full-length signal peptide conserved in other SAP11 homologs (Figure 7A) and may not be functional. A previous work that characterized the function of several SAP11 homologs has demonstrated that the AYWB SAP11 (i.e., 16SrI-A type) exhibited strong localization to plant cell nuclei and induced severe phenotypes of leaf crinkling in Arabidopsis thaliana, while the OY-M homolog (i.e., 16SrI-B type) did not (Chang et al., 2018). It is unclear if there is any advantage for a phytoplasma to harbor both types of SAP11 homologs.
The presence and localization of coiled coil domains exhibited some variations among SAP11 homologs (Figure 7A). For example, in the central part of sequences, homologs belonging to the 16SrI-B type (e.g., PLY_1110, PAM_519, and OYV_05380) all exhibited a weak signal (i.e., ∼10% probability) for an additional coiled coil domain, yet CWO85_02055 (from “Candidatus Phytoplasma ziziphi” in the 16SrV group) is the only one that reached the significance threshold (i.e., 60–70% probability). Similar to these findings for SAP11 homologs, SAP54/PHYL1 homologs also exhibited high variation in the exact boundaries of their coiled coil domains despite having high sequence similarities (Figure 7B). These findings require future experimental studies to validate the bioinformatic inference and to establish the biological significance. Finally, in contrast to the nearly genus-wide distribution of SAP11 and SAP54/PHYL1 homologs, TENGU homologs appeared to be restricted to those 16SrI group phytoplasmas and have extremely high levels of sequence conservation (Figure 7C).
Conclusion
The results from this work provided insights into the genomic diversity among phytoplasmas at the levels of populations and species. When comparing across populations, differentiations in plant hosts may play a more important role than geographical distance in shaping genetic diversification. However, such inference requires more comprehensive sampling of closely related phytoplasma genomes for further tests. At the level of species or sub-species comparisons, mobile genetic elements such as PMUs played an important role in genome rearrangement, gene content evolution, and horizontal gene transfer. Particularly, the exchange of effector genes among different phytoplasma lineages may facilitate their adaptation. Furthermore, duplication of chromosomal segments followed by differential gene loss, such as that commonly observed after whole genome duplication in eukaryotes (Kellis et al., 2004), could be another evolutionary process the drives the genome divergence of these bacteria. This finding provided novel insights into the genome evolution of pathogenic bacteria with highly reduced genomes.
Data Availability Statement
The Illumina raw reads were deposited in NCBI Sequence Read Archive (SRA) under the accessions PRJNA530090 (strain DY2014) and PRJNA529747 (strain SS2016). The de novo genome assembly of DY2014 was deposited in GenBank under the accession SRMC00000000. The version described in this manuscript is version SRMC01000000. The genomic variations between these two strains are provided in the Supplementary Material.
Author Contributions
C-PL provided the biological materials. S-TC, C-PL, and C-HK performed the experiments. S-TC and C-HK analyzed the data. S-TC prepared the figures and Supplementary Material. C-HK designed the experiments, acquired the funding, wrote the manuscript, and supervised the project.
Funding
This work was supported by Academia Sinica and the Ministry of Science and Technology of Taiwan (MOST 106-2311-B-001-028-MY3) to C-HK. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank Ai-Ping Chen, Shu-Jen Chou, Mei-Jane Fang, Wann-Neng Jane, Hung-Jui Kung, and Wei-An Tsai for technical assistance. The TEM service was provided by the Plant Cell Biology Core Facility (Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan). The service for Sanger sequencing and Illumina sequencing library preparation was provided by the Genomic Technology Core Facility (Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan). The Illumina paired-end sequencing service was provided by the Genomics Core Facility (Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan).
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2019.02194/full#supplementary-material
FIGURE S1 | Organization of putative mobile units (PMUs) in the DY2014 genome. The genomic location of each PMU (i.e., contig id and start-end position) is provided. Note that PMU2 is located within PMU3.
TABLE S1 | Polymorphic sites between the two phytoplasma strains characterized in this study.
References
Armenteros, J. J. A., Tsirigos, K. D., Sønderby, C. K., Petersen, T. N., Winther, O., Brunak, S., et al. (2019). SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 37, 420–423. doi: 10.1038/s41587-019-0036-z
Bai, X., Correa, V. R., Toruño, T. Y., Ammar, E.-D., Kamoun, S., and Hogenhout, S. A. (2009). AY-WB phytoplasma secretes a protein that targets plant cell nuclei. Mol. Plant Microb. Interact. 22, 18–30. doi: 10.1094/MPMI-22-1-0018
Bai, X., Zhang, J., Ewing, A., Miller, S. A., Jancso Radek, A., Shevchenko, D. V., et al. (2006). Living with genome instability: the adaptation of phytoplasmas to diverse environments of their insect and plant hosts. J. Bacteriol. 188, 3682–3696. doi: 10.1128/JB.188.10.3682-3696.2006
Benson, D. A., Cavanaugh, M., Clark, K., Karsch-Mizrachi, I., Ostell, J., Pruitt, K. D., et al. (2018). GenBank. Nucleic Acids Res. 46, D41–D47. doi: 10.1093/nar/gkx1094
Bertaccini, A., and Lee, I.-M. (2018). “Phytoplasmas: an update,” in Phytoplasmas: Plant Pathogenic Bacteria - I: Characterisation and Epidemiology of Phytoplasma - Associated Diseases, eds G. P. Rao, A. Bertaccini, N. Fiore, and L. W. Liefting (Singapore: Springer), 1–29. doi: 10.1007/978-981-13-0119-3-1
Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., et al. (2009). BLAST+: architecture and applications. BMC Bioinform. 10:421. doi: 10.1186/1471-2105-10-421
Chang, S. H., Tan, C. M., Wu, C.-T., Lin, T.-H., Jiang, S.-Y., Liu, R.-C., et al. (2018). Alterations of plant architecture and phase transition by the phytoplasma virulence factor SAP11. J. Exp. Bot. 69, 5389–5401. doi: 10.1093/jxb/ery318
Chen, L.-L., Chung, W.-C., Lin, C.-P., and Kuo, C.-H. (2012). Comparative analysis of gene content evolution in phytoplasmas and mycoplasmas. PLoS One 7:e34407. doi: 10.1371/journal.pone.0034407
Chen, W., Li, Y., Wang, Q., Wang, N., and Wu, Y. (2014). Comparative genome analysis of wheat blue dwarf phytoplasma, an obligate pathogen that causes wheat blue dwarf disease in China. PLoS One 9:e96436. doi: 10.1371/journal.pone.0096436
Chen, W.-Y., Huang, Y.-C., Tsai, M.-L., and Lin, C.-P. (2011). Detection and identification of a new phytoplasma associated with periwinkle leaf yellowing disease in taiwan. Australas. Plant Pathol. 40, 476–483. doi: 10.1007/s13313-011-0062-x
Chung, W.-C., Chen, L.-L., Lo, W.-S., Lin, C.-P., and Kuo, C.-H. (2013). Comparative analysis of the peanut witches’-broom phytoplasma genome reveals horizontal transfer of potential mobile units and effectors. PLoS One 8:e62770. doi: 10.1371/journal.pone.0062770
Edgar, R. C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucl Acids Res. 32, 1792–1797. doi: 10.1093/nar/gkh340
Gasparich, G. E. (2010). Spiroplasmas and phytoplasmas: microbes associated with plant hosts. Biologicals 38, 193–203. doi: 10.1016/j.biologicals.2009.11.007
Guindon, S., and Gascuel, O. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52, 696–704. doi: 10.1080/10635150390235520
Hogenhout, S. A., Oshima, K., Ammar, E.-D., Kakizawa, S., Kingdom, H. N., and Namba, S. (2008). Phytoplasmas: bacteria that manipulate plants and insects. Mol. Plant Pathol. 9, 403–423. doi: 10.1111/j.1364-3703.2008.00472.x
Hoshi, A., Oshima, K., Kakizawa, S., Ishii, Y., Ozeki, J., Hashimoto, M., et al. (2009). A unique virulence factor for proliferation and dwarfism in plants identified from a phytopathogenic bacterium. Proc. Natl. Acad. Sci. U.S.A. 106, 6416–6421. doi: 10.1073/pnas.0813038106
Hyatt, D., Chen, G.-L., LoCascio, P., Land, M., Larimer, F., and Hauser, L. (2010). Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 11:119. doi: 10.1186/1471-2105-11-119
Kakizawa, S., Makino, A., Ishii, Y., Tamaki, H., and Kamagata, Y. (2014). Draft genome sequence of Candidatus Phytoplasma asteris strain OY-V, an unculturable plant-pathogenic bacterium. Genome Announc. 2:e00944-14. doi: 10.1128/genomeA.00944-14
Kanehisa, M., Goto, S., Furumichi, M., Tanabe, M., and Hirakawa, M. (2010). KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res. 38, D355–D360. doi: 10.1093/nar/gkp896
Kellis, M., Birren, B. W., and Lander, E. S. (2004). Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428, 617–624. doi: 10.1038/nature02424
Krogh, A., Larsson, B., von Heijne, G., and Sonnhammer, E. L. L. (2001). Predicting transmembrane protein topology with a hidden markov model: application to complete genomes. J. Mol. Biol. 305, 567–580. doi: 10.1006/jmbi.2000.4315
Krzywinski, M., Schein, J., Birol, İ, Connors, J., Gascoyne, R., Horsman, D., et al. (2009). Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645. doi: 10.1101/gr.092759.109
Ku, C., Lo, W.-S., and Kuo, C.-H. (2013). Horizontal transfer of potential mobile units in phytoplasmas. Mob. Genet. Elements 3:e26145. doi: 10.4161/mge.26145
Kube, M., Mitrovic, J., Duduk, B., Rabus, R., and Seemüller, E. (2012). Current view on phytoplasma genomes and encoded metabolism. Sci. World J. 2012, 1–25. doi: 10.1100/2012/185942
Kuo, C.-H., and Ochman, H. (2009). Deletional bias across the three domains of life. Genome Biol. Evol. 1, 145–152. doi: 10.1093/gbe/evp016
Kurtz, S., Phillippy, A., Delcher, A. L., Smoot, M., Shumway, M., Antonescu, C., et al. (2004). Versatile and open software for comparing large genomes. Genome Biol. 5:R12. doi: 10.1186/gb-2004-5-2-r12
Lagesen, K., Hallin, P., Rodland, E. A., Staerfeldt, H.-H., Rognes, T., and Ussery, D. W. (2007). RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35, 3100–3108. doi: 10.1093/nar/gkm160
Lee, I.-M., Davis, R. E., and Gundersen-Rindal, D. E. (2000). Phytoplasma: phytopathogenic mollicutes. Annu. Rev. Microbiol. 54, 221–255. doi: 10.1146/annurev.micro.54.1.221
Lee, I.-M., Gundersen-Rindal, D. E., Davis, R. E., and Bartoszyk, I. M. (1998). Revised classification scheme of phytoplasmas based on RFLP analyses of 16S rRNA and ribosomal protein gene sequences. Int. J. Syst. Evol. Microbiol. 48, 1153–1169. doi: 10.1099/00207713-48-4-1153
Lee, I.-M., Gundersen-Rindal, D. E., Davis, R. E., Bottner, K. D., Marcone, C., and Seemüller, E. (2004). Candidatus Phytoplasma asteris, a novel phytoplasma taxon associated with aster yellows and related diseases. Int. J. Syst. Evol. Microbiol. 54, 1037–1048. doi: 10.1099/ijs.0.02843-0
Lee, I.-M., Hammond, R., Davis, R., and Gundersen, D. (1993). Universal amplification and analysis of pathogen 16S rDNA for classification and identification of mycoplasmalike organisms. Phytopathology 83, 834–842. doi: 10.1094/phyto-83-834
Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. doi: 10.1093/bioinformatics/btp352
Li, L., Stoeckert, C. J., and Roos, D. S. (2003). OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 13, 2178–2189. doi: 10.1101/gr.1224503
Lo, W.-S., Huang, Y.-Y., and Kuo, C.-H. (2016). Winding paths to simplicity: genome evolution in facultative insect symbionts. FEMS Microbiol. Rev. 40, 855–874. doi: 10.1093/femsre/fuw028
Lowe, T., and Eddy, S. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964. doi: 10.1093/nar/25.5.955
Lu, Y.-T., Li, M.-Y., Cheng, K.-T., Tan, C. M., Su, L.-W., Lin, W.-Y., et al. (2014). Transgenic plants that express the phytoplasma effector SAP11 show altered phosphate starvation and defense responses. Plant Physiol. 164, 1456–1469. doi: 10.1104/pp.113.229740
Lupas, A., Dyke, M. V., and Stock, J. (1991). Predicting coiled coils from protein sequences. Science 252, 1162–1164. doi: 10.1126/science.252.5009.1162
MacLean, A. M., Orlovskis, Z., Kowitwanich, K., Zdziarska, A. M., Angenent, G. C., Immink, R. G. H., et al. (2014). Phytoplasma effector SAP54 hijacks plant reproduction by degrading MADS-box proteins and promotes insect colonization in a RAD23-dependent manner. PLoS Biol. 12:e1001835. doi: 10.1371/journal.pbio.1001835
MacLean, A. M., Sugio, A., Makarova, O. V., Findlay, K. C., Grieve, V. M., Tóth, R., et al. (2011). Phytoplasma effector SAP54 induces indeterminate leaf-like flower development in arabidopsis plants. Plant Physiol. 157, 831–841. doi: 10.1104/pp.111.181586
Maejima, K., Iwai, R., Himeno, M., Komatsu, K., Kitazawa, Y., Fujita, N., et al. (2014). Recognition of floral homeotic MADS domain transcription factors by a phytoplasmal effector, phyllogen, induces phyllody. Plant J. 78, 541–554. doi: 10.1111/tpj.12495
McCutcheon, J. P., and Moran, N. A. (2012). Extreme genome reduction in symbiotic bacteria. Nat. Rev. Microbiol. 10, 13–26. doi: 10.1038/nrmicro2670
Minato, N., Himeno, M., Hoshi, A., Maejima, K., Komatsu, K., Takebayashi, Y., et al. (2014). The phytoplasmal virulence factor TENGU causes plant sterility by downregulating of the jasmonic acid and auxin pathways. Sci. Rep. 4:7399. doi: 10.1038/srep07399
Mira, A., Ochman, H., and Moran, N. A. (2001). Deletional bias and the evolution of bacterial genomes. Trends Genet. 17, 589–596. doi: 10.1016/S0168-9525(01)02447-7
Orlovskis, Z., Canale, M. C., Haryono, M., Lopes, J. R. S., Kuo, C.-H., and Hogenhout, S. A. (2017). A few sequence polymorphisms among isolates of maize bushy stunt phytoplasma associate with organ proliferation symptoms of infected maize plants. Ann. Bot. 119, 869–884. doi: 10.1093/aob/mcw213
Orlovskis, Z., and Hogenhout, S. A. (2016). A bacterial parasite effector mediates insect vector attraction in host plants independently of developmental changes. Front. Plant Sci. 7:885. doi: 10.3389/fpls.2016.00885
Oshima, K., Kakizawa, S., Arashida, R., Ishii, Y., Hoshi, A., Hayashi, Y., et al. (2007). Presence of two glycolytic gene clusters in a severe pathogenic line of Candidatus phytoplasma asteris. Mol. Plant Pathol. 8, 481–489. doi: 10.1111/j.1364-3703.2007.00408.x
Oshima, K., Kakizawa, S., Nishigawa, H., Jung, H.-Y., Wei, W., Suzuki, S., et al. (2004). Reductive evolution suggested from the complete genome sequence of a plant-pathogenic phytoplasma. Nat. Genet. 36, 27–29. doi: 10.1038/ng1277
Oshima, K., Maejima, K., and Namba, S. (2013). Genomic and evolutionary aspects of phytoplasmas. Front. Microbiol. 4:230. doi: 10.3389/fmicb.2013.00230
Robinson, J. T., Thorvaldsdottir, H., Winckler, W., Guttman, M., Lander, E. S., Getz, G., et al. (2011). Integrative genomics viewer. Nat. Biotech. 29, 24–26. doi: 10.1038/nbt.1754
Seruga Music, M., Samarzija, I., Hogenhout, S. A., Haryono, M., Cho, S.-T., and Kuo, C.-H. (2019). The genome of ‘Candidatus phytoplasma solani’ strain SA-1 is highly dynamic and prone to adopting foreign sequences. Syst. Appl. Microbiol. 42, 117–127. doi: 10.1016/j.syapm.2018.10.008
Sugawara, K., Honma, Y., Komatsu, K., Himeno, M., Oshima, K., and Namba, S. (2013). The alteration of plant morphology by small peptides released from the proteolytic processing of the bacterial peptide TENGU. Plant Physiol. 162, 2005–2014. doi: 10.1104/pp.113.218586
Sugio, A., and Hogenhout, S. A. (2012). The genome biology of phytoplasma: modulators of plants and insects. Curr. Opin Microbiol. 15, 247–254. doi: 10.1016/j.mib.2012.04.002
Sugio, A., Kingdom, H. N., MacLean, A. M., Grieve, V. M., and Hogenhout, S. A. (2011a). Phytoplasma protein effector SAP11 enhances insect vector reproduction by manipulating plant development and defense hormone biosynthesis. Proc. Natl. Acad. Sci. U.S.A. 108, E1254–E1263. doi: 10.1073/pnas.1105664108
Sugio, A., MacLean, A. M., Kingdom, H. N., Grieve, V. M., Manimekalai, R., and Hogenhout, S. A. (2011b). Diverse targets of phytoplasma effectors: from plant development to defense against insects. Annu. Rev. Phytopathol. 49, 175–195. doi: 10.1146/annurev-phyto-072910-095323
Sugio, A., MacLean, A. M., and Hogenhout, S. A. (2014). The small phytoplasma virulence effector SAP11 contains distinct domains required for nuclear targeting and CIN-TCP binding and destabilization. New Phytol. 202, 838–848. doi: 10.1111/nph.12721
The IRPCM Phytoplasma/Spiroplasma Working Team - Phytoplasma taxonomy group, (2004). Candidatus phytoplasma, a taxon for the wall-less, non-helical prokaryotes that colonize plant phloem and insects. Int. J. Syst. Evol. Microbiol. 54, 1243–1255. doi: 10.1099/ijs.0.02854-0
Zerbino, D. R., and Birney, E. (2008). Velvet: algorithms for de novo short read assembly using de bruijn graphs. Genome Res. 18, 821–829. doi: 10.1101/gr.074492.107
Zhao, Y., and Davis, R. E. (2016). Criteria for phytoplasma 16Sr group/subgroup delineation and the need of a platform for proper registration of new groups and subgroups. Int. J. Syst. Evol. Microbiol. 66, 2121–2123. doi: 10.1099/ijsem.0.000999
Zhao, Y., Wei, W., Lee, I.-M., Shao, J., Suo, X., and Davis, R. E. (2009). Construction of an interactive online phytoplasma classification tool, iPhyClassifier, and its application in analysis of the peach X-disease phytoplasma group (16SrIII). Int. J. Syst. Evol. Microbiol. 59, 2582–2593. doi: 10.1099/ijs.0.010249-0
Keywords: phytoplasma, plant pathogen, Mollicutes, genome, molecular evolution, effector
Citation: Cho S-T, Lin C-P and Kuo C-H (2019) Genomic Characterization of the Periwinkle Leaf Yellowing (PLY) Phytoplasmas in Taiwan. Front. Microbiol. 10:2194. doi: 10.3389/fmicb.2019.02194
Received: 25 July 2019; Accepted: 06 September 2019;
Published: 19 September 2019.
Edited by:
Ludmila Chistoserdova, University of Washington, United StatesReviewed by:
Kenro Oshima, Hosei University, JapanAmit Yadav, National Centre for Cell Science, India
Copyright © 2019 Cho, Lin and Kuo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chih-Horng Kuo, Y2hrQGdhdGUuc2luaWNhLmVkdS50dw==
