Bacillus amyloliquefaciens, Bacillus velezensis, and Bacillus siamensis Form an “Operational Group B. amyloliquefaciens” within the B. subtilis Species Complex

The plant growth promoting model bacterium FZB42T was proposed as the type strain of Bacillus amyloliquefaciens subsp. plantarum (Borriss et al., 2011), but has been recently recognized as being synonymous to Bacillus velezensis due to phylogenomic analysis (Dunlap C. et al., 2016). However, until now, majority of publications consider plant-associated close relatives of FZB42 still as “B. amyloliquefaciens.” Here, we reinvestigated the taxonomic status of FZB42 and related strains in its context to the free-living soil bacterium DSM7T, the type strain of B. amyloliquefaciens. We identified 66 bacterial genomes from the NCBI data bank with high similarity to DSM7T. Dendrograms based on complete rpoB nucleotide sequences and on core genome sequences, respectively, clustered into a clade consisting of three tightly linked branches: (1) B. amyloliquefaciens, (2) Bacillus siamensis, and (3) a conspecific group containing the type strains of B. velezensis, Bacillus methylotrophicus, and B. amyloliquefaciens subsp. plantarum. The three monophyletic clades shared a common mutation rate of 0.01 substitutions per nucleotide position, but were distantly related to Bacillus subtilis (0.1 substitutions per nucleotide position). The tight relatedness of the three clusters was corroborated by TETRA, dDDH, ANI, and AAI analysis of the core genomes, but dDDH and ANI values were found slightly below species level thresholds when B. amyloliquefaciens DSM7T genome sequence was used as query sequence. Due to these results, we propose that the B. amyloliquefaciens clade should be considered as a taxonomic unit above of species level, designated here as “operational group B. amyloliquefaciens” consisting of the soil borne B. amyloliquefaciens, and plant associated B. siamensis and B. velezensis, whose members are closely related and allow identifying changes on the genomic level due to developing the plant-associated life-style.

The plant growth promoting model bacterium FZB42 T was proposed as the type strain of Bacillus amyloliquefaciens subsp. plantarum , but has been recently recognized as being synonymous to Bacillus velezensis due to phylogenomic analysis . However, until now, majority of publications consider plant-associated close relatives of FZB42 still as "B. amyloliquefaciens." Here, we reinvestigated the taxonomic status of FZB42 and related strains in its context to the free-living soil bacterium DSM7 T , the type strain of B. amyloliquefaciens. We identified 66 bacterial genomes from the NCBI data bank with high similarity to DSM7 T . Dendrograms based on complete rpoB nucleotide sequences and on core genome sequences, respectively, clustered into a clade consisting of three tightly linked branches: (1) B. amyloliquefaciens, (2) Bacillus siamensis, and (3) a conspecific group containing the type strains of B. velezensis, Bacillus methylotrophicus, and B. amyloliquefaciens subsp. plantarum. The three monophyletic clades shared a common mutation rate of 0.01 substitutions per nucleotide position, but were distantly related to Bacillus subtilis (0.1 substitutions per nucleotide position). The tight relatedness of the three clusters was corroborated by TETRA, dDDH, ANI, and AAI analysis of the core genomes, but dDDH and ANI values were found slightly below species level thresholds when B. amyloliquefaciens DSM7 T genome sequence was used as query sequence. Due to these results, we propose that the B. amyloliquefaciens clade should be considered as a taxonomic unit above of species level, designated here as "operational group B. amyloliquefaciens" consisting of the soil borne B. amyloliquefaciens, and plant associated B. siamensis and B. velezensis, whose members are closely related and allow identifying changes on the genomic level due to developing the plant-associated life-style.

INTRODUCTION
At the time of writing, the genus Bacillus (Gordon et al., 1973), consisted of 318 species with validly published names (http://www.bacterio.net/bacillus.html) with Bacillus subtilis as the type species (Cohn, 1872;Skerman et al., 1980). The industrial important species B. subtilis, Bacillus amyloliquefaciens, Bacillus licheniformis, and Bacillus pumilus are representing a group of phylogenetically and phenetically homogeneous species called, in the vernacular, the B. subtilis species complex (Fritze, 2004). For many years, it has been recognized that these species are hardly to distinguish on the basis of traditional phenotypic methods. Moreover, phylogenetic analysis of the 16S rRNA gene also fails to differentiate species within the complex due to the highly conserved nature of the gene (Rooney et al., 2009).
Some representatives of B. amyloliquefaciens were found plant-root-associated and to act beneficial on plant growth (Idriss Abbreviations: AAI, average amino acid identity; ANI, average nucleotide identity; CDS, coding sequence; DDH, DNA-DNA hybridization; dDDH, digital DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; TETRA, tetranucleotide frequency distribution. et al., 2002). Reva et al. (2004) reported that seven Bacillus isolates from plants or soil are closely related to but distinct from B. amyloliquefaciens type strain DSM7 T . These strains are more proficient for rhizosphere colonization than other members of the B. subtilis group . B. amyloliquefaciens strains GB03 (Choi et al., 2014), and FZB42 (Chen et al., 2007) are widely used in different commercial formulations to promote plant growth.
With the advent of comparative genomics and the availability of an increasing number of whole genome sequences, it became possible to distinguish two subspecies within B. amyloliquefaciens: B. amyloliquefaciens subsp. amyloliquefaciens (type strain DSM7 T ), and B. amyloliquefaciens subsp. plantarum (type strain: FZB42 T ). Spectroscopic DDH performed with hydroxylapatite-purified chromosomal DNA from DSM7 T and FZB42 T yielded DNA-DNA relatedness values ranging between 63.7 and 71.2% which apparently did not sufficiently support discrimination of both taxons on the species level . According to this view the subspecies "plantarum" represented a distinct ecotype of plant-associated B. amyloliquefaciens strains (Reva et al., 2004), which is increasingly used as biofertilizer and biocontrol agents in agriculture (Borriss, 2011).
Whilst many researchers are still using this classification (e.g., Hossain et al., 2015), recent phylogenomic studies showed a high degree of similarity between the genomes of the B. methylotrophicus, B. velezensis, B. oryzicola, and B. vanillea type strains, and the genome of the B. amyloliquefaciens subsp. plantarum type strain FZB42 T (= DSM 23117 T = BGSC 10A6 T ). Due to this finding it was proposed that the taxon B. amyloliquefaciens subsp. plantarum should be considered as a later heterotypic synonym of either B. methylotrophicus  or, more correctly due to priority rule, of B. velezensis . In spite of this increasingly complex taxonomic situation, we conducted here an extended phylogenomic analysis based on 66 core genomes displaying a high degree of similarity with the type strain of B. amyloliquefaciens DSM7 T . It ruled out that three tightly linked clades including a conspecific group consisting of FZB42 T , B. methylotrophicus KACC 13103 T , and B. velezensis KCTC13012 T , could be distinguished. The tight relatedness of the three clades consisting of representatives of B. amyloliquefaciens, B. velezensis, and B. siamensis was validated by rpoB gene sequence homology, and, ANI, AAI, dDDH, and TETRA analysis of the core genomes. We propose to introduce the term "operational group B. amyloliquefaciens" to underline their close phylogenomic relationship.

Alignment of DNA rpoB Sequences
Alignment of DNA rpoB sequences was performed by the Clustal Omega program accessible at http://www.ebi.ac.uk/Tools/msa/ clustalo/. A distance matrix was calculated from this alignment by DNA distance matrix calcuation (DNADIST program), and the matrix was then transformed into a tree by the NEIGHBOR program. In order to verify the accuracy of the tree multiple data sets were generated with the SEQBOOT program using 200 bootstrap replicates. A tree was built from each replicate with the DNADIST program, and then bootstrap values were computed with the CONSENSE program. The phylogenetic tree was visualized with TreeViewX (http://taxonomy.zoology. gla.ac.uk/rod/treeview.html). The programs used to construct the phylogenetic tree were obtained from the PHYLIP package, v.3.65 (Felsenstein, 1989), which is accessible at http://evolution. genetics.washington.edu/phylip.html.

Comparative Genome Analysis
Comparative genome analysis was performed using the EDGAR 1.3 software framework. For orthology estimation EDGAR uses a generic orthology threshold calculated from the similarity statistics of the compared genomes (Blom et al., 2016;http:// edgar.computational.bio.uni-giessen.de). A private project was constructed comprising 66 genomes closely related to B. amyloliquefaciens DSM7 T and selected other representatives of the B. subtilis species complex. To construct a phylogenetic tree for this project, around 2000 core genes were computed by pairwise iterative comparison of a set of genomes (Blom et al., 2016). In a following step multiple alignments of the core genes were generated using MUSCLE, non-matching parts of the alignment were masked by GBLOCKS and subsequently removed. The remaining parts of all alignments were concatenated to one large alignment. The PHYLIP package was used to generate a phylogenetic tree of this alignment, represented in newick format.
The EDGAR software framework was also used to calculate average nucleotide identity (ANI) and average amino acid identity (AAI), matrices for a selected set of genomes. The blast hits between the orthologous genes of the core of the selected genome were analyzed for their mean/median percent identity values. The recommended species cut-off was 95% for the ANI and AAI indices (Richter and Rosselló-Móra, 2009). In addition, JSpeciesWS (http://jspecies.ribohost.com/jspeciesws/) was used to determine ANIb (average nucleotide identity based on BLAST+) and ANIm (average nucleotide identity based on MUMmer) values by pairwise genome comparisons. Correlation indexes of their Tetra-nucleotide signatures (TETRA) were determined by using the JSpeciesWS software (Richter et al., 2016).

Digital DNA-DNA Hybridization (dDDH)
The genome-to-genome-distance calculator (GGDC) version 2.1 provided by DSMZ (http://ggdc.dsmz.de/) was used for genome-based species delineation (Meier-Kolthoff et al., 2013) and genome-based subspecies delineation (Meier-Kolthoff et al., 2014). Distances were calculated by (i) comparing two genomes using the chosen program to obtain HSPs/MUMs and (ii) inferring distances from the set of HSPs/MUMs using three distinct formulas. Next, the distances were transformed to values analogous to DDH. The DDH estimates were based on an empirical reference dataset comprising real DDH values and genome sequences. The DDH estimate resulted from a generalized linear model (GLM) which also provided the estimate's confidence interval (after the ± sign). Three formulas are available for the calculation: Formula: 1 (HSP length/total length), formula: 2 (identities/HSP length) and formula 3 (identities/total length). Formula 2, which is especially appropriate to analyze draft genomes, was used.

Phylogenomics of the B. Subtilis Species Complex
The core genomes of 20 type strains of the B. subtilis species complex were used for phylogenomic analysis applying the EDGAR software package (Figure 1). Four main monophyletic groups were corroborated by 100% bootstrap values. Clade I ("subtilis") is early diverged into two branches comprising B. atrophaeus, and B. subtilis and its close relatives; clade II ("amyloliquefaciens") comprises B. amyloliquefaciens, B. siamensis, and a conspecific group containing the type strains of B. amyloliquefaciens subsp. plantarum, B. velezensis, and B. methylotrophicus; clade III ("licheniformis") consists of B. licheniformis and B. sonorensis; and clade IV ("pumilus") comprises B. pumilus, B. safensis, B. xiamenensis, and a conspecific group involving the type strains of B. altitudinis, B. stratosphericus, and B. aerophilus. The members of clade II appeared closely related. This is indicated by the high number of orthologous CDSs (2794) shared by the five type strains of clade II. A similar cladogram has been published recently  suggesting that the B. subtilis species complex can be divided into four groups above species level, which need further characterization. We have directed our further analysis to clade II (named from now on "operational group B. amyloliquefaciens"), which clearly shows the highest degree of compactness.

Phylogenetic Analysis of Clade II Based on Complete rpoB Nucleotide Sequence
It is obvious, that 16S rRNA sequences are not sufficient to discriminate representatives of the B. subtilis species complex. For example, comparison of the complete 16S rRNA sequences of B. amyloliquefaciens DSM7 T and B. subtilis 168 T revealed 99.48% identity (Table 1), which is well above of the recommended threshold of >98.65% for species delineation (Kim et al., 2014). In order to elucidate more precisely the phylogenetic and taxonomic relationship of the members of the B. subtilis species complex belonging to the "operational group B. amyloliquefaciens, " we used two methods. (i) Tetra correlation search (TCS, Richter et al., 2016) was performed with the complete genome of DSM7 T and (ii) the complete RNA polymerase beta-subunit (rpoB) gene of DSM7 T was used for BLASTN comparison with the corresponding sequences extracted from complete genomes or genome assemblies. Fifty-Two genomes, which were in range with the intraspecific Tetranucleotide signature correlation index (>0.99) were detected in the JSpecies data bank. The TCS value determined for B. subtilis was only 0.954, suggesting that using this alignmentfree parameter allows discriminating of B. subtilis and B. amyloliquefaciens (Table 1). Complete rpoB gene sequencing has been proposed as phylogenetic marker (Klenk et al., 1994) and as a supplement to DDH (Adékambi et al., 2008). The power and potential of complete rpoB gene sequence in taxonomic, phylogenetic and evolutionary studies has been previously reported (Sharma and Patil, 2011). Our BLASTN search revealed that at least 66 genomes present in the NCBI data bank contain rpoB gene sequences with more than 98% identity to the rpoB gene from DSM7 T , the type strain of B. amyloliquefaciens (Priest et al., 1987). For comparison, the rpoB gene from B. subtilis subsp. subtilis 168 T displayed only 90.3% identity to B. amyloliquefaciens. The rpoB gene identities among strains assigned as being B. amyloliquefaciens, B. siamensis, B. amyloliquefaciens subsp. plantarum, B. methylotrophicus, B. velezensis, and B. vanillea are listed in Table 1. The list of strains containing rpoB genes with high similarity to B. amyloliquefaciens DSM7 T includes also strains obviously not correctly assigned, such as B. subtilis, Bacillus sp., or Paenibacillus polymyxa. It is interesting to note that majority of the strains representing the conspecific B. velezensis/B.methylotrophicus/B.amyloliquefaciens subsp. plantarum group were isolated from plant sources, whilst B. amyloliquefaciens sensu stricto seems to be soil-borne. The main source of the salt tolerant B. siamensis/B.vanillea group was fermented plant food ( Table 1).
The phylogenetic tree based on complete rpoB gene sequence suggests existence of three tightly connected monophyletic groups: (i) B. amyloliquefaciens containing six strains including type strain DSM7 T ; (ii) B. siamensis cluster consists of three strains: the type strain KCTC 13613 T , strain XY18, originally assigned as type strain for B. vanillea (Chen et al., 2014) but recently reclassified as being B. siamensis (Dunlap, 2015), and a strain assigned as being B. amyloliquefaciens JJC33M; (3) the conspecific complex comprising B. velezensis, B. methylotrophicus, and B. amyloliquefaciens subsp. Plantarum contained 57 strains. The tree is robust displaying high bootstrap values for all three groupings, although the three clusters are closely related and separated by only 0.01-0.02 substitutions per nucleotide position. By contrast, taxonomic distance to B. subtilis is around tenfold larger (Figure 2).

Phylogenomic Analysis of Clade II (Operational Group B. amyloliquefaciens)
In order to confirm the phylogenetic analysis based on rpoB sequences we calculated the core genomes using the EDGAR 1.3 program package. A total of 1998 CDSs were shared by the 66 core genome sequences extracted in that analysis. It ruled out that the phylogenomic tree based on complete core genome sequences (Figure 3) did reflect the phylogenomic distances similar as the phylogenetic tree based on rpoB nucleotide sequences (Figure 2). The same robust monophyletic groups as in Figure 2 were obtained. The B. siamensis cluster consisting of three representatives shared a core genome of 3097 CDSs; the B. amyloliquefaciens cluster consisting of six representatives shared a core genome of 3139 CDSs; and the conspecific group containing 57 plant-growth promoting Bacilli including FZB42 T shared a relatively small core genome consisting of only 2295 CDSs, which is mainly due to the high number of genomes included in this analysis. Subgroups of this cluster shared core genomes ranging from 2659 to 3137 CDSs (Figure 3). Again, the NJ tree suggested that B. amyloliquefaciens subsp. Plantarum, B. methylotrophicus, and B. velezensis formed a monophyletic group corroborating recent findings Wu et al., 2015;.
At next we tried to elucidate the taxonomic status of these closely related genomes. Different phylogenetic and phylogenomic methods were used to analyze relationship of all 65 genomes with that of B. amyloliquefaciens DSM7 T . As shown above, rpoB sequence similarity, exceeding threshold of species delineation, and the intraspecific Tetra-nucleotide signature correlation index (>0.99) suggested that all strains analyzed belong to the species B. amyloliquefaciens. TETRA analysis (Jspecies) demonstrated that the six type strains of clade II were closely related and yielded pairwise Tetra results (tetranucleotide signature correlation index) in species range (≥0.989, Figure 4 lower part). Deviations of the mean G+C content calculated for the whole genomes were less than one percent which does not contradict species definition ( Table 2). Grouping of all strains into a single species, B. amyloliquefaciens, was further supported by the AAI values ( Table 1). The mean AAI values of the 66 core genomes selected by their rpoB similarity to DSM7 T were ≥96.5%, exceeding the proposed cut-off of 96% for species delineation. However, parameters, considered recently as being most important for genome-based species delineation, such as ANI and dDDH (Federhen et al., 2016), did not support this conclusion (Table 1).
ANI analysis performed with the EDGAR program package discriminated clearly two clusters corresponding to clades I  Table 1.
Frontiers in Microbiology | www.frontiersin.org FIGURE 3 | NJ phylogenomic tree, constructed from the 66 core genomes with the highest similarity to DSM7 T (Table 1). The B. subtilis genome was used as outgroup. The number of core genome CDSs is indicated at the nodes. They were calculated for the respective subsets of genomes. Bootstrap values obtained from 200 repetitions are also indicated at the nodes. Type strains (T) are indicated by bold letters. Bar, 0.02 substitutions per nucleotide position.
Frontiers in Microbiology | www.frontiersin.org  Table) and Tetra-nucleotide correlation signatures (lower part of the Table). The median nucleotide percent identity values between the orthologous genes of the core of the selected genomes after pairwise BLASTN comparison are indicated. Standard deviation values are given in parentheses.
and II of the B. subtilis species complex (Figure 4). Clade II representing the B. amyloliquefaciens group was divided into three groups consisting of B. amyloliquefaciens DSM7 T (i), B. siamensis and B. vanillea (ii), and the conspecific complex formed by the type strains B. methylotrophicus KACC13105 T , B. velezensis KCTC 13102 T and B. amyloliquefaciens subsp. plantarum FZB42 T (iii). The latter group displayed ANI values of >98% exceeding the cut-off for species delineation when compared with each other suggesting that the members of the conspecific complex belong to a single species. B. methylotrophicus KACC 13105 T , B. velezensis KCTC 13102 T , and B. amyloliquefaciens subsp. plantarum FZB42 T displayed similar median ANI values ranging between 94.3 and 94.8% when compared with B. amyloliquefaciens DSM7 T (1) and B. siamensis KCTC-13613 T (2), respectively. Given a calculated deviation of ±2.2-2.3% the ANI matrix values suggests a high degree of relatedness to B. amyloliquefaciens, B. siamensis and the conspecific group formed by B. amyloliquefaciens subsp. plantarum, B. methylotrophicus, and B. velezensis, but did not sufficiently support species delineation (Figure 4, upper part).
According to more recent findings the recommended cut-off point for species delineation corresponds to ∼96% ANI (Colston et al., 2014). Similar results were obtained when ANIb and ANIm values were determined by using the JSpecies program package for all the 66 genomes included in this analysis. Threshold values sufficient for species delineation were only obtained, when representatives of B. amyloliquefaciens (6 genomes), B. siamensis (3 genomes), and of the conspecific group (57 genomes) were compared with their respective type strains. However, comparison of the 57 strains of the conspecific group (e.g., FZB42, B. velezensis) with either B. amyloliquefaciens DSM7 T or B. siamensis KCTC 13613 T yielded ANI values slightly below the cut-off for species delineation. The same was true when the three members of the B. siamensis group were compared with either FZB42 T or DSM7 T (Table 2) suggesting that according to ANI analysis the members of clade II represent three discrete, although closely related, species.
In order to finally decide, whether all strains of clade II belong to one species or not, electronic DNA-DNA hybridization (dDDH) was applied in a quantitative analysis involving all 66 genomes. As shown previously, dDDH is useful to mimic the wet-lab DDH and can be used for genome-based species delineation and genome-based subspecies delineation (Meier-Kolthoff et al., 2013. For calculating dDDH three different formulas can be applied (see Materials and Methods), but only results obtained with the recommended formula 2 were used in our analysis ( Table 2). When comparing members of the "siamensis group 2" and the "conspecific B. velezensis group" with B. amyloliquefaciens DSM7 T , dDDH values of <70%, the defined threshold for species delineation, were obtained. All in all, dDDH supports our previous finding about a close relationship within clade II, but did not support their classification into one single species. The results are summarized in Table 2 and  Supplementary Table 1.

Gene Clusters Involved in Nonribosomal Synthesis of Secondary Metabolites
Compared to other members of the B. subtilis species complex, the plant-associated B. amyloliquefaciens possess an enormous potential to synthesize bioactive secondary metabolites. Besides five gene clusters, known from B. subtilis to mediate nonribosomal synthesis of secondary metabolites, four giant gene clusters absent in B. subtilis 168 were identified in FZB42 (Chen et al., 2007). The nine gene clusters that direct the synthesis of bioactive peptides and polyketides by modularly organized mega-enzymes define both nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKS). Three (bmyD, dfn, and mln) are not present in B. subtilis 168, but occur in all members of the "operational group B. amyloliquefaciens." Except for the gene cluster encoding bacilysin synthesis, the functional activities of the remaining gene clusters depend on Sfp, an enzyme that transfers 4 ′ -phosphopantetheine from coenzyme A to the carrier proteins of nascent peptide or polyketide chains. A direct comparison revealed that the nine gene cluster responsible for nonribosomal synthesis of bioactive secondary metabolites including macrolactin are only present in FZB42 and in the other members of the conspecific B. velezensis group, whilst the gene cluster involved in macrolactin synthesis was not detected in B. siamensis and B. amyloliquefaciens (Table 3). Noteworthy, the gene cluster responsible for synthesis of the polyketide difficidin was present in B. siamensis, but not in any other member of the B. subtilis species complex suggesting a stepwise loss of the ability to synthesize secondary metabolites in the order B. velezensis (including FZB42) : B. siamensis : B. amyloliquefaciens.

DISCUSSION
The B. subtilis species complex consists of a steadily increasing number of validly described species (see Introduction), which display an extremely high degree of similarity. They are very difficult to distinguish by using classical taxonomy parameters: morphological and physiological characteristics, cell wall compositions, 16S rRNA sequence, G+C content, and FAME. Also, experimental determination of DNA-DNA relatedness (DDH), gold-standard of bacterial taxonomy for (ii) Mean G+C % content is only 0.5% different ranging between 45.9% (subsp. amyloliquefaciens), 46.1% (subsp. siamensis), and 46.4% (subsp. plantarum); (iii) Tetranucleotide signatures, TETRA, were determined as above the cut-off for species delineation (>0.989); (iv) AAI values are well above 96%, representing the intraspecific threshold.
On the other hand, ANIb and ANIm were calculated as around 93 to 94% identity to B. amyloliquefaciens on the nucleotide level, which is slightly lower than the threshold proposed for species delineation (95-96% ANI, Kim et al., 2014). Moreover, electronic DDH calculation using formula 2 yielded only 56% identity, which is clearly below the cut-off for species delineation. In spite of these contradictory results, we have to conclude that three discrete species exist within clade II, given that results obtained by ANI and dDDH are more important in modern taxonomy (Auch et al., 2010;Meier-Kolthoff et al., 2013) and outcompete the other results favoring a B. amyloliquefaciens subspecies concept. However, due to the close relationship of all three species comprised in clade II we propose an "operational group B. amyloliquefaciens" comprising B. amyloliquefaciens, B. siamensis, and B. velezensis. The introduction of this "operational group" above species level should improve hierarchical classification within the B. subtilis species complex. The members of the "operational group B. amyloliquefaciens are distinguished from B. subtilis and its closest relatives by their ability to synthesize nonribosomally antifungal acting lipopopeptides of the iturin group, mostly bacillomycin D or iturin A. The ecotype of plant-associated B. amyloliquefaciens is well introduced since many years (Reva et al., 2004) and includes the most important biocontrol-and plant-growth-promoting Bacilli, which are successfully used as environmental-friendly means in agriculture (Borriss, 2011). In addition, numerous studies have been published in recent years in order to identify and to understand the specific features of the group of B. amyloliquefaciens strains able to colonize plant organs and to withstand strong plant response reactions. As in B. subtilis it is now widely recognized that a relevant part of metabolism of plant-associated B. amyloliquefaciens is devoted to metabolic interactions with plants (Belda et al., 2013). Metabolites produced by the plantassociated B. amyloliquefaciens FZB42 and other members of the conspecific B. velezensis group represent a substantial part of the diversity of nonribosomal secondary metabolites from the genus Bacillus. For example, they produce three types of polyene polyketides (difficidin, macrolactin, and bacillaene) with strong antibiotic action (Chen et al., 2007). By contrast, B. siamensis does only produce two (difficidin and bacillaene) and soil-borne B. amyloliquefaciens does only produce one polyketide (bacillaene). It is highly desirable to apply a correct taxonomic designation to distinguish the plant-associated (= B. velezensis) and the soilborne B. amyloliquefaciens" (= B. amyloliquefaciens) strains, but also to take into consideration their high degree of relatedness. This should be reflected by their grouping into the "operational B. amyloliquefaciens group, " as a novel taxonomic unit above species level but below the "B. subtilis species complex." Introduction of the novel taxonomic unit seems also be recommended in spite of a permanent misuse in describing taxonomy important Bacillus biocontrol strains such as GB03 (Choi et al., 2014) and QST713 (Kinsella et al., 2009), which are often designated as B. subtilis although they are true representatives of B. velezensis and simultaneously members of the "operational group B. amyloliquefaciens." In summary, due to their differences in ANI and dDDH values, which are slightly below species level thresholds, we propose that B. amyloliquefaciens, B. velezensis, and B. siamensis should keep their status as species of its own, as proposed by Dunlap . The close relatedness of the three species is well reflected by the novel taxonomic unit "operational group B. amyloliquefaciens." Introducing of this novel taxonomic unit should improve also understanding of previous and recent scientific investigations performed with "plant-associated B. amyloliquefaciens" strains which often have not been designated correctly.
Another less surprising finding from our analysis was that many of the publically available Bacillus genomes that we analyzed are inconsistently assigned. Fortunately, a recent initiative has been started to correct such mistakes in Genbank entries (Federhen et al., 2016).