ORIGINAL RESEARCH article

Front. Microbiol., 16 June 2025

Sec. Antimicrobials, Resistance and Chemotherapy

Volume 16 - 2025 | https://doi.org/10.3389/fmicb.2025.1515483

Prospection for potential new non-ribosomal peptide gene clusters in Bacillus genus isolated from fermented foods and soil through genome mining

  • 1. Laboratory of Applied Biochemistry and Immunology, University Joseph KI-ZERBO, Ouagadougou, Burkina Faso

  • 2. National Key Laboratory of Agricultural Microbiology, Key Laboratory of Environment Correlative Dietology, College of Food Science and Technology, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan, China

  • 3. College of Food Science and Technology, Zhejiang University of Technology, Hangzhou, China

Article metrics

View details

4

Citations

3,1k

Views

404

Downloads

Abstract

Experimental studies, though often very costly, lead to the discovery of known antimicrobial products. Yet, pathogenic microorganisms are proving increasingly resistant to pre-existing antimicrobial molecules, and this is a cause for worldwide concern. Therefore, it is necessary to search for new molecules that could serve as alternatives in the food, medical and agricultural sectors. Thus, 123 complete genomes of Bacillus strains isolated from soil and fermented foods were analyzed and annotated using bioinformatics prediction and characterization tools. The view was to discover new gene clusters for the biosynthesis of non-ribosomal peptides (lipopeptides, siderophores, antibiotics). This study revealed that 83% of the genomes analyzed possess biosynthetic gene clusters for the production of siderophore bacillibactin, 61% for surfactins, 37% for fengycins, 23% for iturins, 15% for kurstakins and 3% for bacitracin. Besides, seven new biosynthetic gene clusters coding Non Ribosomal Peptide Synthetases (NRPS) have been identified in B. velezensis ATR2, B. velezensis DSYZ, B. velezensis CGMCC11640, B. amyloliquefaciens HM618, B. amyloliquefaciens WF02, B. cereus CMCC P0011, B. cereus CMCC P0021, B. subtilis SJ-10 and B. anthracis CMF9. The results of this study revealed a significant potential of the genus Bacillus to produce new non-ribosomally synthesized peptides. Now, these predicted new antimicrobial molecules can be easily studied experimentally as many new gene clusters have been identified.

Highlights

  • Seven novel biosynthetic gene clusters that encode Non-Ribosomal Peptide Synthetases (NRPS) were discovered.

  • A strain of Bacillus can potentially co-produce 4 families of lipopeptides.

  • Bacillus genus has a significant potential to develop new non-ribosomally produced peptides.

  • Genome mining strategy makes it possible to discover new metabolites.

1 Introduction

Bacillus are Gram-positive bacteria belonging to the firmicute genus and Bacillaceae family. They are spore-forming, aerobic or facultative aero-anaerobic and get energy by respiration or fermentation (Tamang et al., 2016). Their sporulating ability allows them to resist adverse environmental conditions. They are found in water, soil, dust, plants, food and in animals digestive tract. Some species of the genus Bacillus have important roles as antibiotics or antifungals producers (Khurana et al., 2020). For example, locillomycins and surfactin from B. subtilis 916 show antibacterial activity against Xanthomonas oryzae and Fusarium oxysporum (Zhao et al., 2018). Nowadays, 788 species from the genus Bacillus are sequenced, leading to various studies and explorations (www.ncbi.nlm.nih.gov/genome/?term=Bacillus, consulted 05/07/2024).

Indeed, many Bacillus strains have the ability to produce a wide variety of extracellular enzymes and nonribosomal secondary metabolites such as lipopeptides, the bacillibactin siderophore or antimicrobial compounds such as bacitracin which is a semi-cyclic peptide antibiotic commercialized as a mixture of polypeptides for the treatment of Gram-positive bacterial infections (May et al., 2001; Tapi et al., 2010). Other peptides, such as the lipopeptides have antimicrobial properties and can be used in food or soil biopreservation (Zhao et al., 2018).

Bacillus lipopeptides are subdivided into five families (fengycin, iturin, surfactin, kurstakin, and locillomycin) in which various subfamilies and variants can be found (Lam et al., 2021; Waongo et al., 2023). They are produced by modular mega-enzymes or assembly lines referred to as NRPSs (Iqbal et al., 2021). The NRPSs are composed of modules responsible for the incorporation of amino acids in the final nonribosomal peptide (Théatre et al., 2021). The modules contain the following main catalytic domains: the adenylation domain (A), responsible for the amino acid selection and activation, the carrier protein (CP), responsible for tethering the amino acid to the enzyme, and the condensation domain (C), forming a peptide bond between two amino acids attached to two consecutive modules (Leclère et al., 2016). Some NRPSs also include optional domains that can modify the incorporated amino acids during the synthesis, as the Epimerisation domain (E) leading to the D-isomery (Duban et al., 2022). The enzymatic domains work step by step to assemble the monomers into the peptide, so that NRPSs can be compared to assembly lines. A particular trait of NRPSs synthesising Lipopeptides (LP), except those belonging to the iturin and locillomycin families, is the presence of a condensation domain starting the assembly line. The role of this so-called C-starter is to condense the fatty acid into the first amino acid of the peptide chain. The fengycin family includes decapeptides fengycin A, B, C, and plipastatins A, B and S. The iturin family includes bacillomycins (D, DC, F, L and Lc), mycosubtilin, mojavensin A, subtulene A, mixirin and iturins (A, AL and C), all containing 7 amino acids. Heptapeptides are also found in the surfactin family and variant forms of lichenysin, pumilacidin, esperin and kurstakin. The fifth family contains the nonapeptides locillomycins A, B, and C (Luo et al., 2015a; Waongo et al., 2023).

Many drugs remain effective; however, the emergence of resistant organisms has become a major concern, drawing global attention and promoting the “One Health” concept (Aslam et al., 2021). This concept highlights the close links between human, animal, and environmental health. In this context, nonribosomal secondary metabolites produced by the genus Bacillus can be a source of interesting active compounds. Indeed, some antimicrobial peptides have been shown to be antibiotics with a broad spectrum against pathogens and lipopeptides and siderophores, which are not antibiotics but represent an alternative for food preservation and plant protection against phytopathogenic fungi (Huan et al., 2020).

An urgent challenge today is to discover new natural drugs to tackle emerging human, animal and plant pathogens. The identification of these biomolecules is usually performed by experimental studies including isolation, characterization, purification and activity tests (Fanaei et al., 2021). With the development of bioinformatics tools, this approach can now be accelerated and complemented by sequencing and subsequent genome or metagenome mining to identify natural product biosynthetic pathways (Blin et al., 2021).

Regarding nonribosomal peptide synthesis, specific tools have been developed to identify biosynthetic gene clusters in the genome sequences, to decipher the organization of NRPS into modules and domains, and to predict the nature of the monomers activated by each Adenylation-domain and their isomery in regard to the presence of epimerization domains (Blin et al., 2023). Notably, bacterial genome mining via in silico analysis, using the program antiSMASH, offers an attractive opportunity to discover new secondary metabolites such as NRPs (Rahman et al., 2014; Aleti et al., 2015). AntiSMASH 7 allows better visualization of enzyme assembly chain and good structuring of predicted molecules. Predicted peptides are likely to be characterized by comparison with other peptides available in Norine database (Flissi et al., 2020). The questions addressed here are (i) to review the structural and functional information together with annotation of gene clusters for known lipopeptides (LPs), antibiotics and siderophores produced by the genus Bacillus and (ii) to elucidate through genome mining the potential products of yet uncharacterized nonribosomal gene clusters. In addition, a bibliographical search was carried out to better understand the lipopeptides produced by Bacillus. The objective of this study is to combine literature exploration and genome analysis to identify and characterize the diversity of nonribosomal peptides as antibiotics, siderophores, and lipopeptides, which are potentially produced by Bacillus strains isolated from fermented foods and soil.

2 Materials and methods

2.1 An overview of known structures of nonribosomal lipopeptides produced by Bacillus strains

An overview of known structures of nonribosomal lipopeptides generated by Bacillus strains was performed. In brief, Scopus, PubMed, Web of Science, and ResearchGate databases were queried on October 2nd, 2024 using the keywords “lipopeptides AND Bacillus, lipopeptides AND structure, surfactin AND Bacillus, surfactin AND structure, fengycin AND Bacillus, Fengycin AND structure, kurstakin AND Bacillus, Kurstakin AND structure, locillomycin AND Bacillus, locillomycin AND structure, iturin AND Bacillus, iturin AND structure, New lipopeptides AND Bacillus, New lipopeptides Bacillus AND structure, non-ribosomal peptide AND Bacillus.”

2.2 Database search for genome sequences

Sequences (including chromosome and plasmid) of Bacillus isolated from soil and fermented foods were retrieved from the NCBI nucleotide database.1 A total of 123 complete genomes from Bacillus strains isolated from soil and fermented food samples, were selected according to the quality (Table 1). Thus, information on assembly, genome size, number of contigs/scaffolds and N50 were examined.

Table 1

NumberStrainsMoleculesAccession numberIsolation sourceCountryMolecule sizeContig N50Completeness (%)Contamination (%)
1B. aerophilus KJ82ChromosomeCP091093.1SoilChina3,754,440 bp3.8 Mb98.920.59
2B. albus B-9ChromosomeNZ_JAAAWQ000000000.1SoilChina5,371,297 bp200.4 kb99.330.1
3B. albus YK87ChromosomeNZ_CP142670.1SoilChina5,347,990 bp5.3 Mb98.820.10
4B. altitudinis B4133ChromosomeNZ_JXCN00000000.1Fermented foodNetherlands3,718,745 bp95.4 kb99.120.29
5B. altitudinis GR-8ChromosomeCP009108.1SoilChina3,674,849 bp3.7 Mb99.120.00
6B. altitudinis GR-8PlasmidNZ_CP009109.1SoilChina6,935 bp3.7 Mb99.120.00
7B. altitudinis NJ-V2ChromosomeNZ_CP012482.1SoilChina3,787,818 bp3.8 Mb99.120.29
8B. altitudinis NJ-M2ChromosomeNZ_CP012329.1SoilChina3,793,419 bp3.8 Mb99.120.29
9B. altitudinis NJ-VChromosomeNZ_CP012330.1SoilChina3,792,999 bp3.8 Mb99.120.29
10B. altitudinis G6S2ChromosomeNZ_CP126096.1SoilUnknown3,797,173 bp3.8 Mb99.410.29
11B. amyloliquefaciens S499ChromosomeCP014700.1SoilCongo3,927,922 bp3.9 Mb97.180.98
12B. amyloliquefaciens S499PlasmidNZ_CP014701.1SoilCongo8,008 bp3.9 Mb97.180.98
13B. amyloliquefaciens 205ChromosomeCP054415.1SoilChina4,006,790 bp4 Mb95.661.32
14B. amyloliquefaciens WF02ChromosomeCP053376.1SoilTaiwan4,026,648 bp4 Mb98.241.46
15B. amyloliquefaciens bm1ChromosomeCP088005.1SoilChina3,929,792 bp3.9 Mb98.031.25
16B. amyloliquefaciens LS1-002-014 sChromosomeNZ_CP089530.1Fermented foodChina4,214,996 bp4.2 Mb98.260.94
17B. amyloliquefaciens HM618ChromosomeCP029466.1SoilChina4,021,851 bp4 Mb97.710.78
18B. amyloliquefaciens B3ChromosomeNZ_JAJNPN000000000.1Fermented foodChina4,001,777 bp171.8 kb97.252.09
19B. anthracis HYU01ChromosomeCP008846.1SoilSouth Korea5,213,498 bp5.2 Mb98.090.48
20B. anthracis HYU01PlasmidCP008847.1SoilSouth Korea181,894 bp5.2 Mb98.090.48
21B. anthracis HYU01PlasmidCP008848.1SoilSouth Korea94,732 bp5.2 Mb98.090.48
22B. anthracis 9080-GChromosomeCM002398.1SoilGeorgia5,232,192 bp195.9 kb98.430.71
23B. anthracis 9,080-GPlasmidCM002399.1SoilGeorgia181,656 bp195.9 kb98.430.71
24B. anthracis 9,080-GPlasmidNZ_CM002400.1SoilGeorgia94,815 bp195.9 kb98.430.71
25B. anthracis CMF9ChromosomeCP085402.1SoilChina5,324,354 bp5.3 Mb96.093.04
26B. arachidis YX15ChromosomeCP127376.1SoilChina4,913,320 bp4.9 Mb98.530.00
27B. atrophaeus PENSV20ChromosomeCP050705.1SoilCanada4,148,820 bp4.1 Mb99.120.00
28B. badius NBPM-293ChromosomeCP082363.1SoilUnknown3,868,812 bp3.9 Mb98.881.16
29B. bombysepticus F12ChromosomeCP085406.1SoilChina5,244,749 bp5.2 Mb99.120.05
30B. bombysepticus Cuernavaca_S2ChromosomeCP126590.1SoilMexico5,288,525 bp5.3 Mb98.820.00
31B. cellulasensis NJ-V2ChromosomeCP012482.1SoilChina3,787,818 bp3.8 Mb99.120.29
32B. cellulasensis NJ-M2ChromosomeCP012329.1SoilChina3,793,419 bp3.8 Mb99.120.29
33B. cereus CMCC P0011ChromosomeCP011153.1SoilChina5,506,876 bp5.5 Mb98.970.06
34B. cereus CMCC P0011PlasmidNZ_CP011154.1SoilChina591,112 bp5.5 Mb98.970.06
35B. cereus M3ChromosomeCP016316.1Fermented foodSouth Korea5,218,997 bp5.2 Mb98.430.22
36B. cereus CMCC P0021ChromosomeCP011151.1SoilChina5,521,782 bp5.5 Mb99.130.06
37B. cereus CMCC P0021PlasmidNZ_CP011152.1SoilChina591,110 bp5.5 Mb99.130.06
38B. cereus NJ-WChromosomeCP012483.1SoilChina5,370,032 bp5.4 Mb99.220.06
39B. cereus NJ-WPlasmidNZ_CP012485.1SoilChina11,744 bp5.4 Mb99.220.06
40B. cereus NJ-WPlasmidNZ_CP012484.1SoilChina9,032 bp5.4 Mb99.220.06
41B. cereus NJ-WPlasmidNZ_CP012486.1SoilChina7,703 bp5.4 Mb99.220.06
42B. cereus C-1ChromosomeCP089601.1SoilChina5,268,934 bp5.3 Mb97.790.06
43B. gobiensis FJAT-4402ChromosomeCP012600.1SoilChina4,597,707 bp2.8 Mb98.771.69
44B. haikouensis MNJ12ChromosomeCP076017.1SoilChina4,454,189 bp4.5 Mb99.411.78
45B. halotolerans ZB201702ChromosomeCP029364.1SoilChina4,154,245 bp4.2 Mb99.410.59
46B. inaquosorum DE111ChromosomeCP013984.1SoilUSA4,143,890 bp4.1 Mb98.820.00
47B. inaquosorum LBA001ChromosomeNZ_CP127095.1SoilMexico4,200,707 bp4.2 Mb98.820.00
48B. infantis 63–11ChromosomeNZ_JAJBAP020000001.1Fermented foodThailand4,841,671 bp4.8 Mb99.410.21
49B. licheniformis SCDB 14ChromosomeCP014842.1Fermented foodSouth Korea4,136,986 bp4.1 Mb97.231.98
50B. licheniformis P8_B2ChromosomeCP045814.1SoilDenmark4,343,379 bp4.3 Mb97.892.24
51B. licheniformis SCK B11ChromosomeCP014795.1SoilSouth Korea4,300,706 bp4.3 Mb98.331.24
52B. licheniformis 14ADL4ChromosomeCP026673.1Fermented foodSouth Korea4,332,232 bp4.3 Mb97.733.06
53B. methanolicus MGA3ChromosomeCP007739.1SoilUSA3,337,035 bp3.3 Mb97.882.66
54B. mojavensis B-41341ChromosomeNZ_JARLZB000000000.1SoilIsrael3,856,892 bp148 kb99.410.00
55B. mojavensis B-41812ChromosomeNZ_JARSHG000000000.1SoilArgentina3,924,370 bp428.5 kb98.820.00
56B. mycoides KBAB4ChromosomeNC_010184.1SoilUnknown5,262,775 bp5.3 Mb98.820.32
57B. mycoides KBAB4PlasmidNC_010183.1SoilUnknown52,830 bp5.3 Mb98.820.32
58B. mycoides KBAB4PlasmidNC_010182.1SoilUnknown64,977 bp5.3 Mb98.820.32
59B. mycoides KBAB4PlasmidNC_010181.1SoilUnknown75,107 bp5.3 Mb98.820.32
60B. mycoides BGSC 4BQ1ChromosomeNZ_NFDI00000000.1SoilSpain5,673,703 bp639 kb99.410.15
61B. mycoides Gnyt1ChromosomeCP020743.1SoilChina5,597,907 bp5.6 Mb99.410.00
62B. mycoides ATCC 6462ChromosomeCP009692.1SoilUnknown5,255,868 bp5.3 Mb99.410.00
63B. pacificus AT31ChromosomeNZ_CP142003.1SoilChina4,903,194 bp4.9 Mb99.410.00
64B. pacificus MP6ChromosomeCP093424.1Fermented foodChina5,038,982 bp5 Mb99.410.00
65B. pacificus MP6PlasmidCP093425.1Fermented foodChina291,500 bp5 Mb99.410.00
66B. paralicheniformis UBBLI-30ChromosomeNZ_SULF00000000.1Fermented foodIndia4,404,722 bp332.9 kb99.410.06
67B. paralicheniformis 14DA11ChromosomeNZ_CP023168.1Fermented foodSouth Korea4,535,069 bp4.5 Mb95.350.05
68B. paralicheniformis CP47ChromosomeNZ_CP133705.1Fermented foodSouth Korea4,537,254 bp4.5 Mb98.190.05
69B. paranthracis SL1ChromosomeCP093423.1Fermented foodChina5,243,627 bp5.2 Mb99.410.29
70B. paranthracis SL1PlasmidCP093422.1Fermented foodChina325,335 bp5.2 Mb99.410.29
71B. paranthracis Gxun-30ChromosomeNZ_CP065149.1SoilChina5,149,464 bp5.1 Mb99.120.00
72B. paranthracis KF11ChromosomeCP085413.1SoilChina5,276,246 bp5.3 Mb99.410.29
73B. paranthracis Bt C4ChromosomeNZ_CP101135.1SoilChina5,245,328 bp5.2 Mb98.820.29
74B. pumilus NJ-VChromosomeCP012330.1SoilChina3,792,999 bp3.8 Mb99.120.29
75B. pumilus DSM 1794ChromosomeCP187664.1SoilUnknown3,786,030 bp3.8 Mb95.312.17
76B. pumilus MS32ChromosomeNZ_CP092829.1SoilGermany3,824,664 bp3.8 Mb95.532.6
77B. pumilus B4127ChromosomeNZ_JXCL00000000.1Fermented foodNetherlands3,886,280 bp142 kb95.373.01
78B. pumilus B4127PlasmidMH581228.1Fermented foodNetherlands6,580 bp142 kb95.373.01
79B. safensis G6S3ChromosomeCP128114.1SoilGermany3,670,853 bp3.7 Mb99.410.29
80B. spizizenii T30ChromosomeNZ_CP011051.1SoilRussia4,031,727 bp4 Mb95.760.00
81B. spizizenii AS2ChromosomeNZ_MUXL00000000.1SoilOman4,041,230 bp573.3 kb98.820.29
82B. spizizenii HUK15ChromosomeNZ_LSMU00000000.1SoilFrance4,254,363 bp322.1 kb98.820.29
83B. stratosphericus MRPD-01ChromosomePPGB00000000.1SoilIndia3,687,760 bp322 kb99.220.29
84B. subtilis SJ-10ChromosomeNZ_CP025258.1Fermented foodSouth Korea4,041,647 bp4 Mb99.360.59
85B. subtilis MEC_B298ChromosomeNZ_CP100436.1SoilChina4,030,831 bp4 Mb97.410.67
86B. subtilis s-16ChromosomeNZ_CP063150.1SoilChina4,209,504 bp4.2 Mb97.671.35
87B. subtilis BSP1ChromosomeNZ_CP160396.1SoilLebanon4,043,759 bp4 Mb96.281.21
88B. subtilis KH2ChromosomeCP018184.1SoilChina4,138,265 bp4.1 Mb96.731.28
89B. subtilis FUA2231ChromosomeNZ_CP154918.1Fermented foodZimbabwe4,475,230 bp4.5 Mb97.311.62
90B. subtilis UD1022ChromosomeCP011534.1SoilUSA4,025,326 bp4 Mb97.530.91
91B. subtilis SFA-H43ChromosomeKZ836066.1SoilChina4,018,162 bp2.1 Mb96.731.48
92B. subtilis CGMCC 2108ChromosomeCP014471.1SoilChina4,122,154 bp4.1 Mb96.731.08
93B. subtilis CGMCC 2108PlasmidNZ_CP014473.1SoilChina65,774 bp4.1 Mb96.731.08
94B. subtilis CGMCC 2108PlasmidNZ_CP014472.1SoilChina5,820 bp4.1 Mb96.731.08
95B. subtilis T30ChromosomeCP011051.1SoilRussia4,031,727 bp4 Mb95.760.00
96B. subtilis NCIB 3610ChromosomeNZ_CP094361.1SoilChina4,210,909 bp4.2 Mb98.511.33
97B. subtilis ZD01ChromosomeNZ_CP046448.1SoilChina4,015,360 bp4 Mb97.50.67
98B. subtilis 73ChromosomeCP045826.1SoilGermany4,166,516 bp4.2 Mb97.331.97
99B. subtilis G8ChromosomeNZ_AP025224.1Fermented foodIndonesia4,017,503 bp4 Mb96.731.08
100B. subtilis FUA2232ChromosomeNZ_CP154920.1Fermented foodZimbabwe4,475,228 bp4.5 Mb97.311.61
101B. subtilis SRCM103517ChromosomeNZ_CP035226.1Fermented foodSouth Korea4,192,706 bp4.2 Mb96.851.87
102B. subtilis S1ChromosomeNZ_JAGFPW000000000.1Fermented foodIndia4,488,261 bp153.3 kb95.771.85
103B. thuringiensis HD12ChromosomeCP014847.1SoilChina5,776,895 bp5.8 Mb98.390.96
104B. thuringiensis HD12PlasmidNZ_CP014853.1SoilChina345,196 bp5.8 Mb98.390.96
105B. thuringiensis HD12PlasmidNZ_CP014852.1SoilChina161,353 bp5.8 Mb98.390.96
106B. thuringiensis HD12PlasmidNZ_CP014851.1SoilChina112,429 bp5.8 Mb98.390.96
107B. thuringiensis HD12PlasmidNZ_CP014850.1SoilChina39,023 bp5.8 Mb98.390.96
108B. thuringiensis HD12PlasmidNZ_CP014848.1SoilChina17,228 bp5.8 Mb98.390.96
109B. thuringiensis Bt185ChromosomeCP014282.1SoilChina5,243,635 bp5.2 Mb98.261.05
110B. thuringiensis Bt185PlasmidNZ_CP014284.1SoilChina293,705 bp5.2 Mb98.261.05
111B. thuringiensis Bt185PlasmidNZ_CP014285.1SoilChina55,372 bp5.2 Mb98.261.05
112B. thuringiensis Bt185PlasmidNZ_CP014286.1SoilChina54,205 bp5.2 Mb98.261.05
113B. thuringiensis Bt185PlasmidNZ_CP014287.1SoilChina41,937 bp5.2 Mb98.261.05
114B. thuringiensis Bt185PlasmidNZ_CP014288.1SoilChina12,487 bp5.2 Mb98.261.05
115B. thuringiensis Bt185PlasmidNZ_CP014290.1SoilChina7,486 bp5.2 Mb98.261.05
116B. thuringiensis c25ChromosomeNZ_CP022345.1SoilSouth Korea5,334,660 bp5.3 Mb98.911.07
117B. thuringiensis 97–27ChromosomeNC_005957.1SoilUnknown5,237,682 bp5.2 Mb99.440.32
118B. thuringiensis YWC2-8ChromosomeCP013055.1SoilChina5,674,369 bp5.7 Mb98.381.01
119B. thuringiensis YWC2-8PlasmidNZ_CP013056.1SoilChina250,706 bp5.7 Mb98.381.01
120B. thuringiensis YWC2-8PlasmidNZ_CP013057.1SoilChina84,491 bp5.7 Mb98.381.01
121B. thuringiensis YWC2-8PlasmidNZ_CP013058.1SoilChina82,531 bp5.7 Mb98.381.01
122B. thuringiensis YWC2-8PlasmidNZ_CP013059.1SoilChina80,699 bp5.7 Mb98.381.01
123B. thuringiensis YWC2-8PlasmidNZ_CP013060.1SoilChina46,634 bp5.7 Mb98.381.01
124B. thuringiensis YWC2-8PlasmidNZ_CP013061.1SoilChina8,512 bp5.7 Mb98.381.01
125B. thuringiensis BGSC 4AA1ChromosomeCP010577.1SoilChina5,652,292 bp5.7 Mb97.243.40
126B. thuringiensis BGSC 4AA1PlasmidNZ_CP010578.1SoilChina232,994 bp5.7 Mb97.243.40
127B. thuringiensis BGSC 4AA1PlasmidNZ_CP010579.1SoilChina92,619 bp5.7 Mb97.243.40
128B. thuringiensis BGSC 4AA1PlasmidNZ_CP010580.1SoilChina76,979 bp5.7 Mb97.243.40
129B. thuringiensis BGSC 4AA1PlasmidNZ_CP010581.1SoilChina68,444 bp5.7 Mb97.243.40
130B. thuringiensis BGSC 4AA1PlasmidNZ_CP010582.1SoilChina51,723 bp5.7 Mb97.243.40
131B. thuringiensis BGSC 4AA1PlasmidNZ_CP010583.1SoilChina4,845 bp5.7 Mb97.243.40
132B. thuringiensis CTCChromosomeCP013274.1SoilChina5,327,397 bp5.3 Mb97.631.10
133B. thuringiensis CTCPlasmidNZ_CP013273.1SoilChina25,529 bp5.3 Mb97.631.10
134B. thuringiensis HS18-1ChromosomeCP012099.1SoilChina5,292,526 bp5.3 Mb98.680.55
135B. thuringiensis HS18-1PlasmidNZ_CP012101.1SoilChina337,579 bp5.3 Mb98.680.55
136B. thuringiensis HS18-1PlasmidNZ_CP012102.1SoilChina92,085 bp5.3 Mb98.680.55
137B. thuringiensis HS18-1PlasmidNZ_CP012103.1SoilChina94,695 bp5.3 Mb98.680.55
138B. thuringiensis HS18-1PlasmidNZ_CP012104.1SoilChina42,726 bp5.3 Mb98.680.55
139B. thuringiensis HS18-1PlasmidNZ_CP012105.1SoilChina14,336 bp5.3 Mb98.680.55
140B. thuringiensis HS18-1PlasmidNZ_CP012106.1SoilChina4,669 bp5.3 Mb98.680.55
141B. thuringiensis HS18-1PlasmidNZ_CP012107.1SoilChina8,287 bp5.3 Mb98.680.55
142B. thuringiensis HS18-1PlasmidNZ_CP012108.1SoilChina7,386 bp5.3 Mb98.680.55
143B. thuringiensis JW-1ChromosomeNZ_CP045030.1SoilChina5,500,376 bp5.5 Mb99.300.74
144B. thuringiensis HD-1ChromosomeCP004870.1SoilUSA5,631,672 bp5.6 Mb97.671.75
145B. thuringiensis YBT-1518ChromosomeCP005935.1SoilChina6,002,284 bp6 Mb98.603.18
146B. toyonensis UTDF19-29BChromosomeCP081872.1SoilUSA5,240,743 bp5.2 Mb99.410.10
147B. toyonensis Monterrey_S3ChromosomeCP126524.1SoilMexico5,309,620 bp5.3 Mb99.410.10
148B. toyonensis Cuernavaca_S4ChromosomeCP126520.1SoilMexico5,270,039 bp5.3 Mb98.820.10
149B. tropicus CK18ChromosomeCP085399.1SoilChina5,237,233 bp5.2 Mb99.410.00
150B. tropicus T36S-23ChromosomeNZ_CP119875.1SoilSouth Korea5,262,398 bp5.3 Mb98.820.62
151B. vallismortis NBIF-001ChromosomeCP020893.1SoilChina3,929,787 bp3.9 Mb98.820.00
152B. vallismortis DSM 11031ChromosomeCP026362.1SoilUSA4,286,362 bp4.3 Mb99.220.00
153B. velezensis ATR2ChromosomeCP018133.1SoilChina4,006,746 bp4 Mb99.410.00
154B. velezensis SYP-B637ChromosomeCP043546.1SoilChina3,915,550 bp3.9 Mb99.410.00
155B. velezensis CGMCC 11640ChromosomeCP026610.1SoilChina4,322,979 bp4.3 Mb99.410.00
156B. velezensis Lzh-a42ChromosomeCP025308.1SoilChina4,246,605 bp4.2 Mb99.410.00
157B. velezensis YJ0-1ChromosomeNZ_CP128184.1SoilChina4,043,341 bp4 Mb99.411.24
158B. velezensis L-S60ChromosomeCP011278.1SoilChina3,903,017 bp3.9 Mb99.410.00
159B. velezensis CBMB205ChromosomeCP014838.1SoilSouth Korea3,929,745 bp3.9 Mb99.410.00
160B. velezensis SB1216ChromosomeCP015417.1SoilUSA3,814,720 bp3.8 Mb97.560.00
161B. velezensis DSYZChromosomeCP030150.1SoilChina4,258,978 bp4.3 Mb99.410.00
162B. velezensis DR-08ChromosomeCP028437.1SoilSouth Korea3,929,794 bp3.9 Mb99.360.00
163B. velezensis CMF18ChromosomeCP085388.1SoilChina3,963,155 bp4 Mb99.410.00
164B. velezensis CK17ChromosomeCP085706.1SoilChina3,921,806 bp3.9 Mb99.340.00
165B. velezensis DMB06ChromosomeNZ_CP083763.1Fermented foodSouth Korea4,157,945 bp4.2 Mb99.40.00
166B. velezensis UD6-2ChromosomeNZ_JAGFMB000000000.1Fermented foodThailand3,951,373 bp1 Mb99.410.00
167B. velezensis DMB05ChromosomeNZ_CP083715.1Fermented foodSouth Korea3,262,563 bp3.3 Mb99.120.00
168B. velezensis DMB05PlasmidNZ_CP083716.1Fermented foodSouth Korea806,695 bp3.3 Mb99.120.00
169B. velezensis DMB05PlasmidNZ_CP083717.1Fermented foodSouth Korea72,020 bp3.3 Mb99.120.00
170B. wiedmannii LN15ChromosomeCP126099.1SoilChina5,391,143 bp5.4 Mb98.820.10
171B. wiedmannii JAS08/1ChromosomeNZ_CP036073.1SoilPoland5,513,922 bp5.5 Mb99.410.10
172B. wiedmannii JAS07/5ChromosomeCP036070.1SoilPoland5,239,489 bp5.2 Mb99.411.76
173B. wiedmannii JAS07/5PlasmidNZ_CP036072.1SoilPoland536,465 bp5.2 Mb99.411.76
174B. wiedmannii JAS07/5PlasmidNZ_CP036071.1SoilPoland79,335 bp5.2 Mb99.411.76
175B. wiedmannii PL1ChromosomeAP022643.1SoilJapan5,309,441 bp5.3 Mb97.220.00
176B. safensis ZK-1ChromosomeNZ_CP095759.1SoilChina3,735,476 bp3.7 Mb99.060.29
177B. safensis SRCM125915ChromosomeNZ_CP116774.1SoilSouth Korea3,769,976 bp3.8 Mb98.920.59
178B. safensis PLA 1006ChromosomeNZ_CP114177.1SoilChina3,891,304 bp3.9 Mb98.920.59

Genome sequences including chromosomes and plasmids for the different Bacillus strains.

2.3 Detection of nonribosomal peptide biosynthetic gene clusters (BGCs)

The search for nonribosomal peptide sequences from the genome sequences of 123 Bacillus strains was performed following a workflow previously described (Leclère et al., 2016). First, the prediction of biosynthetic gene clusters (BGCs) was performed using antiSMASH version 7 (Blin et al., 2023). The regions encoding nonribosomal synthetases were further analyzed. The monomer composition of the predicted peptide was compared to all known NRPs annotated in the Norine database2 (Flissi et al., 2020) in order to identify them or predict them as a new peptide or new variant. When it was incomplete with truncated or fragmented clusters, we further investigated antiSMASH results to reconstruct partial or complete BGCs by assembling cluster fragments scattered in the shotgun genome. The number of genes, the order of modules, and the domains in the NRPSs and the predictions of the A-domain specificity helped identify clusters of known NRP families, even when they were fragmented.

Since poor sequence assembly can affect the quality of predicted metabolites, only high-quality complete genomes were considered in this study.

3 Results

Out of 123 Bacillus chromosomes selected, 115 were found to carry BGC NRPSs. Among these NRPSs, those responsible for the synthesis of a siderophore (102), antibiotics (4) or lipopeptides (93) were found.

3.1 Prediction of NRP siderophore

The screened sequences harbor genetic potential to produce the siderophore bacillibactin. The gene cluster is composed of three modules with A-domain specificity for 2,3-dihydroxybenzoate (Dhb or diOH-Bz in Norine), glycine and threonine. The corresponding NRPS is known to follow an iterative mode of biosynthesis, which could lead to the production of active iron-chelating compounds named bacillibactins. This small cluster was predicted in 83% of the studied genomes.

3.2 Overview of lipopeptide BGCs produced by Bacillus

The literature search revealed diverse non-ribosomal lipopeptides synthesized by the genus Bacillus. The structures of the different molecules are shown in Table 2.

Table 2

Name of lipopeptidesMonomer composition of the peptide moietyMain fatty acid chainsReferences
Fengycin familyDecapeptide with a lactone ring between carboxy-terminal groupβ-OH fatty acids
Fengycin A1L-Glu,D-Orn,D-Tyr,D-aThr,L-Glu,D-Ala,L-Pro,L-Gln,L-Tyr,L-IleC14-C19Ait Kaki et al. (2020)
Fengycin A2L-Glu,D-Orn,D-Tyr,D-aThr,L-Glu,D-Ala,L-Pro,L-Gln,L-Tyr,L-ValC14, C15, C17Ongena and Jacques (2007)
Fengycin B1L-Glu-D-Orn-D-Tyr-D-aThr,L-Glu,D-Val,L-Pro,L-Gln,L-Tyr,L-IleC14-C17Fanaei et al. (2021)
Fengycin B2L-Glu,D-Orn,D-Tyr,D-aThr,L-Glu,D-Val,L-Pro,L-Gln,L-Tyr,L-ValC14-C21Ongena and Jacques (2007)
Fengycin C1L-Glu,D-Orn,D-Tyr,D-aThr,L-Glu,D-Leu/D-Ile,L-Pro,L-Gln,L-Tyr,L-ValC15-C19Vater et al. (2002)
Fengycin C2L-Glu,D-Orn,D-Tyr,D-aThr,L-Glu,D-Val,L-Pro,L-Gln,L-aThr,L-IleC14-C18Ongena and Jacques (2007)
Fengycin SL-Glu,D-Orn,D-Tyr,D-Ser,L-Glu,D-Val,L-Pro,L-Gln,L-Tyr,L-IleC17Sang-cheol et al. (2010)
Plipastatin A1, A2L-Glu,D-Orn,D-Tyr,D-a-Thr,L-Glu,D-Ala,L-Pro,L-Gln,D-Tyr,L-IlenC16, aiC17Nishikiori et al. (1986)
Plipastatin B1, B2L-Glu,D-Orn,D-Tyr,D-a-Thr,L-Glu,D-Val,L-Pro,L-Gln,L-Tyr,L-IlenC16, aiC17Nishikiori et al. (1986)
Iturin familyHeptapeptide cyclised by a lactam ring with the β-NH2 group of the acid chainβ-NH2 fatty acids
Iturin A, ALL-Asn,D-Tyr,D-Asn,L-Gln,L-Pro,D-Asn,L-SernC14, iC15, aiC15, nC16, iC16Pathak and Keharia (2014)
Iturin CL-Asp,D-Tyr,D-Asn,L-Gln,L-Pro,D-Asn,L-SernC14, iC15, aiC15Pathak and Keharia (2014)
Bacillomycin DL-Asn,D-Tyr,D-Asn,L-Pro,L-Gln,D-Ser,L-ThrnC14, iC15, aiC15Jin et al. (2017)
Bacillomycin DCL-Asn,D-Tyr,D-Asn,L-Pro,L-Glu,D-Ser,L-ThrC13Jin et al. (2017)
Bacillomycin FL-Asn,D-Tyr,D-Asn,L-Gln,L-Pro,D-Asn,L-ThrC16, iC17, aiC17Peypoux et al. (1985)
Bacillomycin LL-Asp,D-Tyr,D-Asn,L-Ser,L-Glu,D-Ser,L-ThrnC14, iC15, aiC15Volpon et al. (2007)
Bacillomycin LcaL-Asn,D-Tyr,D-Asn,L-Ser,L-Glu,D-Ser,L-ThrnC14, iC15, aiC15, iC16Volpon et al. (2007)
MycosubtilinL-Asn,D-Tyr,D-Asn,L-Gln,L-Pro,D-Asn/D-Ser,L-Ser/L-AsnnC16, iC16, aiC17Peypoux et al. (1986)
Mojavensin AL-Asp,D-Tyr,D-Asn,L-Gln,L-Pro,D-Asn,L-AsnaiC12Ma et al. (2012)
Subtulene AL-Asn,D-Tyr,D-Asn,L-Gln,L-Pro,D-Asn,L-Ser(CH2)4HC = CH(CH2)3CH(CH3)2Thasana et al. (2010)
MixirinL-Asn,D-Tyr,D-Asn,L-Gln,L-Ser,D-Asn,L-Pro(A) C11, (B) C8, (C) aiC10Zhang et al. (2004)
Kurstakin familyHeptapeptide with a lactone ring between carboxy-terminal group of Gln7 and OH group of Ser4
KurstakinbD-Thr,L-Gly,D-Ala,L-Ser,L-His,D-Gln,L-GlniC11, nC12, iC12, iC13Hathout et al. (2000)
Surfactin familyHeptapeptide closed by a lactone ring with the b-OH group of the fatty acid chainβ-OH fatty acids
Bamylocin AcGlu,Leu,Met,Leu,Pro,Leu,Leu,LeuC13Lee et al. (2007)
SurfactinL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-LeuiC14, nC14, iC15, aC15Bonmatin et al. (2003)
[Orn2] surfactinL-Glu,L-Orn,D-Leu,L-Val,L-Asp,D-Leu,L-IleC13-C15Bonmatin et al. (2003)
[Orn7] surfactinL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-OrnBonmatin et al. (2003)
[Phe7] surfactinL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-PheBonmatin et al. (2003)
[Cys7] surfactinL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-CysBonmatin et al. (2003)
[Ala4] surfactinL-Glu,L-Leu,D-Leu,L-Ala,L-Asp,D-Leu,L-LeuiC14, nC14, iC15, aiC15Bonmatin et al. (2003)
[Ile4] surfactinL-Glu,L-Leu,D-Leu,L-Ile,L-Asp,D-Leu,L-LeuaiC15Bonmatin et al. (2003)
[Ile2, 7] surfactinL-Glu,L-Ile,D-Leu,L-Val,L-Asp,D-Leu,L-IleC13-C15Bonmatin et al. (2003)
[Ile4, 7] surfactinL-Glu,L-Leu,D-Leu,L-Ile,L-Asp,D-Leu,L-IleC13-C15Bonmatin et al. (2003)
[Leu4] surfactinL-Glu,L-Leu,D-Leu,L-Leu,L-Asp,D-Leu,L-LeuiC15Bonmatin et al. (2003)
[Val7] surfactinL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-ValiC14, nC14, iC15, aiC15Bonmatin et al. (2003)
[Ile7] surfactinL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-IleiC14, nC14, iC15, aiC15Bonmatin et al. (2003)
[Ile2, 4] surfactinL-Glu,L-Ile,D-Leu,L-Ile,L-Asp,D-Leu,L-LeuaiC15Bonmatin et al. (2003)
[Val2, 7] surfactinL-Glu,L-Val,D-Leu,L-Val,L-Asp,D-Leu,L-ValC13-C15Bonmatin et al. (2003)
[Val2, Ile7] surfactinL-Glu,L-Val,D-Leu,L-Val,L-Asp,D-Leu,L-IleC13-C15Bonmatin et al. (2003)
[Ile2, Val7] surfactinL-Glu,L-Ile,D-Leu,L-Val,L-Asp,D-Leu,L-ValC13-C15Bonmatin et al. (2003)
[Ile2, 4, 7] surfactinL-Glu,L-Ile,D-Leu,L-Ile,L-Asp,D-Leu,L-IleaiC15Bonmatin et al. (2003)
LichenysineL-Gln,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-IleiC13, aC13, nC14 iC15, aC15Bonmatin et al. (2003)
[Ile4] lichenysinL-Gln,L-Leu,D-Leu,L-Ile,L-Asp,D-Leu,L-IleaiC15Bonmatin et al. (2003)
[Val7] lichenysinL-Gln,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-ValiC13, aiC13, nC14, iC15, aiC15Bonmatin et al. (2003)
[Ile2, 4] lichenysinL-Gln,L-Ile,D-Leu,L-Ile,L-Asp,D-Leu,L-IleaiC15Bonmatin et al. (2003)
EsperindL-Glu,L-Leu,D-Leu,L-Val,L-Asp,D-Leu,L-XE7-COOHC13, C14, C15Thomas and Ito (1969)
PumilacidinL-Glu,L-Leu,D-Leu,L-Leu,L-Asp,D-Leu,L-XP7aC15, iC15, nC16, iC16, aC17, iC17Naruse et al. (1990)
Locillomycin familyPartially cyclized nonapeptides
Locillomycin A, B, CL-Thr,D-Gln,L-Asp,L-Gly,L-Asn,L-Asp,L-Gly,L-Tyr,L-ValC13, C14, C15Luo et al. (2015a, 2015b)
Cerexin familyLinear decapeptides
Cerexin AD-Asp,D-Val,D-val,L-Asn,D-Asn,L-ϒHyl,D-Thr,L-Ser,D-Trp,D-IleiC11h3Shoji et al. (1976b)
Cerexin BD-Asn,D-val,D-Phe,L-Asn,D-Asn,L-ϒHyl,D-Thr,L-Gly,D-Trp,D-IleiC10h3, nC10h3, iC11h3, aiC11h3Shoji and Kato (1976)
Cerexin CD-Asp,D-Val,D-val,L-Asn,D-Asn,L-Lys,D-Thr,L-Ser,D-Trp,D-IleiC11h3Shoji et al. (1976a)
Cerexin DD-Asp,D-Val,D-Phe,L-Asn,D-Asn,L-Lys,D-Thr,L-Gly,D-Trp,D-IleiC10h3, nC10h3, iC11h3, aiC11h3Shoji et al. (1976a)
Marihysin A familyCyclic heptapeptide
Marihysin APro,Gln,Asn,Ser,Asn,Tyr,AsnC14Liu et al. (2010)
Octapeptin family
Octapeptin A1, A2, A3, A4D-Dab,L-Dab,L-Dab,L-Dab,L-Dab,D-Leu,L-Leu,L-LeuaiC11h3, iC10h3, nC10h3, iC11h3Meyers et al. (1976)
Octapeptin B1, B2, B3, B4D-Dab,L-Dab,L-Dab,L-Dab,L-Dab,D-Leu,L-Leu,L-PheaiC11h3, iC10h3, nC10h3, iC9h3Meyers et al. (1976)
Octapeptin C1, C2, C3, C4D-Dab,L-Dab,L-Dab,L-Dab,L-Dab,L-Leu,L-Leu,D-PheaiC9h3, aiC9h3, iC10h3, nC10h3Meyers et al. (1976)
Octapeptin D1, D2, D3, D4D-Ser-L-Dab-L-Dab-D-Leu-L-Leu-L-Dab-L-Dab-L-LeuaiC11h3, iC10h3, nC10h3, iC11h3Kato and Shoji (1980)

Structure of different lipopeptides classified by families.

aBacillomycin Lca; or bacillopeptin.

bD forms of amino acid residue are deduced from NRPS modular structure.

cL and D forms are not specified.

dThe b-carboxyl of Asp5 is engaged in the lactone.

eOr halobacillin With XE7 = Leu or Val; XP7 = Val or Ile.

ai, anteiso; i, iso; n, normal; h3, 3-hydroxy; L-Hyl, L-threo—hydroxylysine; Dab, 2,4-Diaminobutyric acid.

3.3 Prediction of the lipopeptide structures

Within BGCs detected by antiSMASH, special attention was paid to traits specific to LP synthesis. Thus, a condensation domain allowing the incorporation of fatty acid (FA) (so-called C-starter) was sought. C-starter is usually present in LP NRPS except for LP belonging to iturin and locillomycin families (Figure 1). As up to now, no bioinformatics tool enables the prediction of fatty acid (FA) structure, the work was focused on peptide moieties (Table 3).

Figure 1

Table 3

Predicted LPStructure analysisCommentsStrains
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. aerophilus KJ82
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,Leu)(Ile)CompleteB. altitudinis B4133
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. altitudinis GR-8
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. altitudinis NJ-V2
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. altitudinis NJ-M2
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. altitudinis NJ-V
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. altitudinis G6S2
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. amyloliquefaciens S499
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. amyloliquefaciens S499
Surfactin(Glu,Leu,D-Leu)(…)(Leu)Partial and fragmentedB. amyloliquefaciens S499
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. amyloliquefaciens 205
Fengycin(……)(…)(…)(…Glu,D-Tyr)(Ile)Partial and fragmented (Glu instead of Gln)B. amyloliquefaciens 205
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. amyloliquefaciens 205
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. amyloliquefaciens WF02
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. amyloliquefaciens WF02
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. amyloliquefaciens WF02
Surfactin(Glu,Leu,D-Leu)(Val,Asp,Leu)(Leu)CompleteB. amyloliquefaciens bm1
Fengycin A/B(Glu,D-Orn,Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. amyloliquefaciens bm1
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. amyloliquefaciens bm1
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. amyloliquefaciens LS1-002-014 s
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. amyloliquefaciens LS1-002-014 s
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. amyloliquefaciens LS1-002-014 s
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. amyloliquefaciens HM618
Fengycin(……)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Partial (X could be Val or Ala)B. amyloliquefaciens HM618
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. amyloliquefaciens HM618
Surfactin(Glu,Leu,…)(Val,Asp,…)(Leu)Partial and fragmentedB. amyloliquefaciens B3
Fengycin(……)(…)(…,D-X)(Pro,Gln,D-Tyr)(Ile)PartialB. amyloliquefaciens B3
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. atrophaeus PENSV20
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. atrophaeus PENSV20
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(…,Ser)PartialB. atrophaeus PENSV20
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. bombysepticus F12
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. bombysepticus Cuernavaca_S2
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. cellulasensis NJ-V2
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)completeB. cellulasensis NJ-M2
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. cereus CMCC P0011
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. cereus CMCC P0021
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. cereus NJ-W
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. cereus C-1
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. halotolerans ZB201702
Fengycin(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Glu,…)(…)Partial (Glu instead of Gln and X could be Val or Ala)B. halotolerans ZB201702
Fengycin(…,D-Orn)(Tyr,D-Thr)(…,…)(Pro,…,…)(Ile)Partial and fragmentedB. inaquosorum DE111
Surfactin(Glu,Leu,D-Leu)(Val,Asp,…)(Leu)Partial and fragmentedB. inaquosorum DE111
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. inaquosorum LBA001
Fengycin(…,D-Orn)(Tyr,D-Thr)(Glu,…)(Pro,Glu,…)(Ile)Partial and fragmented (Glu instead of Gln)B. inaquosorum LBA001
Bacillomycin F(Asn)(D-Tyr,D-Asn,Pro,Gln)(D-Asn,Thr)CompleteB. inaquosorum LBA001
Lichenysin A(Gln,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. licheniformis SCDB 14
Lichenysin A(Gln,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. licheniformis P8_B2
Lichenysin A(Gln,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. licheniformis SCK B11
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. licheniformis 14ADL4
Fengycin(…)(…)(…,Glu,D-Tyr)(Ile)Partial and fragmentedB. licheniformis 14ADL4
Fengycin(…,D-Orn)(Tyr,D-Thr)(…,D-X)(Pro,Glu,D-Tyr)(Ile)Partial and fragmented (Glu instead of Gln and X could be Val)B. mojavensis B-41341
Surfactin(Glu,Leu,…)(Val,Asp,…)(Leu)Partial and fragmentedB. mojavensis B-41341
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. mojavensis B-41812
Fengycin B variant(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Glu,D-Tyr)(Ile)Incomplete (Glu instead of Gln and X could be Val)B. mojavensis B-41812
Kurstakin variant(D-Thr)(X,Ala)(Ser,Leu,D-Thr,Glu)Incomplete (Leu instead of His; Thr instead of Gln; Glu instead of Gln; X could be Gly)B. mycoides BGSC 4BQ1
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. mycoides Gnyt1
Lichenysin A(Gln,Leu,D-Leu)(Val,Asp,…)(Ile)Partial and fragmentedB. paralicheniformis UBBLI-30
Fengycin B(…,D-Orn)(Tyr,D-Thr)(…,D-Val)(Pro,Gln,D-Tyr)(Ile)Partial and fragmentedB. paralicheniformis UBBLI-30
Lichenysin A(Gln,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. paralicheniformis 14DA11
Fengycin B(Glu,…)(Tyr,D-Thr)(Glu,Val)(Pro,Gln,D-Tyr)(Ile)Partial and fragmentedB. paralicheniformis 14DA11
Lichenysin A(Gln,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. paralicheniformis CP47
Fengycin B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-Val)(Pro,Gln,D-Tyr)(Ile)CompleteB. paralicheniformis CP47
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. pumilus NJ-V
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. pumilus DSM 1794
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Ile)CompleteB. pumilus MS32
[Ile7] surfactin(Glu,Leu,D-Leu)(Val,Asp,…)(Ile)PartialB. pumilus B4127
[Ile4,7] surfactin(Glu,Leu,D-Leu)(Ile,Asp,D-Leu)(Ile)CompleteB. safensis G6S3
Surfactin(Glu,Leu,D-Leu)(Val,Asp,Leu)(Leu)CompleteB. spizizenii T30
Mycosubtilin(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Ser,Asn)CompleteB. spizizenii T30
Mycosubtilin(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Ser,Asn)CompleteB. spizizenii AS2
Surfactin(Glu,Leu,…)(Val,Asp,…)(Leu)Partial and fragmentedB. spizizenii AS2
Mycosubtilin(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Ser,Asn)CompleteB. spizizenii HUK15
Surfactin(Glu,Leu,…)(Val,Asp,…)(Leu)Partial and fragmentedB. spizizenii HUK15
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis SJ-10
Fengycin B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)CompleteB. subtilis SJ-10
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. subtilis SJ-10
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis MEC_B298
Fengycin B variant(…,…)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Partial (Glu instead of Gln)B. subtilis MEC_B298
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis s-16
Fengycin B variant(Glu,…)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Partial and fragmented (Glu instead of Gln)B. subtilis s-16
Fengycin B variant(Glu,…)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Partial and fragmented (Glu instead of Gln)B. subtilis BSP1
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis BSP1
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis KH2
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis FUA2231
Fengycin B variant(Glu,D-Orn,Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Complete (Glu instead of Gln)B. subtilis FUA2231
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis UD1022
Fengycin B variant(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Complete (Glu instead of Gln)B. subtilis UD1022
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis SFA-H43
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis CGMCC 2108
Mycosubtilin(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Ser,Asn)CompleteB. subtilis T30
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis T30
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis NCIB 3610
Fengycin(Glu,D-Orn)(Tyr,D-Thr)(…,…)(Pro,Glu,D-Tyr)(Ile)Partial and fragmented (Glu instead of Gln)B. subtilis NCIB 3610
Fengycin B variant(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Complete (Glu instead of Gln)B. subtilis ZD01
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis ZD01
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis 73
Fengycin(…)(Glu,D-Tyr)(Ile)Partial and fragmentedB. subtilis 73
[Ile7] surfactin(Glu,Leu,…)(Val,Asp,…)(Ile)Partial and fragmentedB. subtilis G8
Fengycin(…)(…)(…,Glu,D-Tyr)(…)Partial and fragmentedB. subtilis G8
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis FUA2232
Fengycin B variant(Glu,D-Orn,Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Complete (Glu instead of Gln)B. subtilis FUA2232
Surfactin(Glu,Leu, D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. subtilis SRCM103517
Fengycin B variant(Glu,D-Orn,Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Complete (Glu instead of Gln)B. subtilis SRCM103517
Surfactin(Glu,Leu,…)(Val,Asp,…)(Leu)Partial and fragmentedB. subtilis S1
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis HD12
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis Bt185
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis YWC2-8
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis BGSC 4AA1
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis HS18-1
Kurstakin(D-Thr)(X-Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis c25
Kurstakin(D-Thr)(X-Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis JW-1
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis HD-1
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. thuringiensis YBT-1518
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. vallismortis NBIF-001
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X Could be Val or Ala)B. vallismortis NBIF-001
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. vallismortis NBIF-001
Fengycin B variant(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-Val)(Pro,Glu,D-Tyr)(Ile)Complete (Glu instead of Gln)B. vallismortis DSM 11031
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. vallismortis DSM 11031
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis ATR2
Fengycin(Glu,D-Orn)(Tyr,D-Thr)(……)(Pro,Gln…)(Ile)Partial and fragmentedB. velezensis ATR2
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis ATR2
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis SYP-B637
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis SYP-B637
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis SYP-B637
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis CGMCC 11640
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis CGMCC 11640
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis CGMCC 11640
Surfactin(Glu,Leu,D-Leu)(Val,Asp,…)(Leu)Partial and fragmentedB. velezensis Lzh-a42
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis Lzh-a42
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis Lzh-a42
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis YJ0-1
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis YJ0-1
Surfactin(Glu,Leu,D-Leu)(Val)(Asp,D-Leu)(Leu)CompleteB. velezensis YJ0-1
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis L-S60
Fengycin(Glu,D-Orn)(Tyr,D-Thr)(……)(Pro,Gln,D-Tyr)(Ile)Partial and fragmentedB. velezensis L-S60
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis L-S60
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis CBMB205
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis CBMB205
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis CBMB205
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis SB1216
Bacillomycin L(Asn)(D-Tyr,D-Asn,Ser,Glu)(D-Ser,Thr)CompleteB. velezensis SB1216
Fengycin(………)(Tyr,D-Thr)(……)(Pro,……)(Ile)Partial and fragmentedB. velezensis SB1216
Surfactin(Glu,Leu,…)(Val,Asp,D-Leu)(Leu)Partial and fragmentedB. velezensis DSYZ
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis DSYZ
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis DSYZ
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis DR-08
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis DR-08
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis DR-08
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis CMF18
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis CMF18
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis CMF18
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis CK17
Fengycin A/B(Glu,D-Orn)(Tyr,D-Thr)(Glu,D-X)(Pro,Gln,D-Tyr)(Ile)Incomplete (X could be Val or Ala)B. velezensis CK17
Iturin A(Asn)(D-Tyr,D-Asn,Gln,Pro)(D-Asn,Ser)CompleteB. velezensis CK17
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis DMB06
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis DMB06
Fengycin(………)(…)(…,D-X)(Pro,…)(Ile)Partial and fragmentedB. velezensis DMB06
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis UD6-2
Fengycin(……)(…)(…,D-X)(Pro…)(Ile)Partial and fragmentedB. velezensis UD6-2
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis UD6-2
Surfactin(Glu,Leu,D-Leu)(Val,Asp,D-Leu)(Leu)CompleteB. velezensis DMB05
Fengycin(……)(……)(…X)(Pro,…)(Ile)Partial and fragmentedB. velezensis DMB05
Bacillomycin D(Asn)(D-Tyr,D-Asn,Pro,Glu)(D-Ser,Thr)CompleteB. velezensis DMB05
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. wiedmannii JAS07/5
Kurstakin(D-Thr)(X,Ala)(Ser,Leu,D-X,Glu)Incomplete (Leu instead of His; Glu instead of Gln; X, X could be Gly and Gln respectively)B. wiedmannii PL1
[Ile4,7] surfactin(Glu,Leu,D-Leu)(Ile,Asp,D-Leu)(Ile)CompleteB. safensis ZK-1
[Ile4,7] surfactin(Glu,Leu,D-Leu)(Ile,Asp,D-Leu)(Ile)CompleteB. safensis SRCM125915
[Ile4,7] surfactin(Glu,Leu,D-Leu)(Ile,Asp,D-Leu)(Ile)CompleteB. safensis PLA 1006

Structures of predicted and identified lipopeptides among different genomic sequences of Bacillus strains.

The monomers in brackets are those synthesized by a single gene. Monomers in bold differ from the known variants described in the same position. When the gene cluster was incomplete because scattered on several contigs or located on a too small contig, it was specify in the comment column and symbolized the truncated genes by “…”.

In silico analysis of the genomic sequences revealed the potential production of four lipopeptide families (Table 2). These are the surfactin, iturin, fengycin and kurstakin families depending on the strains. Over the 123 chromosomes analyzed, 75 (61%) possessed gene clusters responsible for surfactin production, 29 (23%) for iturin, 46 (37%) for fengycin and 19 (15%) for kurstakin (Table 3). No locillomycin synthetic gene cluster was detected in the genomes explored in this study. All chromosomes carrying kurstakin BGCs came from species belonging to the Bacillus cereus group. When this cluster was present, the other clusters (surfactin, fengycin, iturin) were absent. A kurstakin variant was detected in Bacillus mycoîde BGSC4BQ1 (Table 3, Figure 2). A variation was found in monomer 6, which is a threonine instead of Gln or Glu, which are most often found in this position. In addition, 25 chromosomes contained only clusters of surfactin genes. No chromosomes contained only fengycin or iturin gene clusters. A co-existence of gene clusters was also observed. As far surfactin and fengycin gene clusters are concerned, they were found in 21 chromosomes, while surfactin and iturin gene clusters were found in 4 chromosomes. A total of 25 chromosomes contained surfactin, fengycin and iturin gene clusters (Figure 3). In addition, six chromosomes contained four gene clusters (surfactin, fengycin, iturin, and novel gene clusters). All B. subtilis, B. velezensis, B. licheniformis, B. paralicheniformis, B. amlyloliquefaciens, B. pumilus, B. cellulasensis, B. altitudinis, B. spizizenii, B. safensis, B. vallismortis, B. halotolerans, B.inaquosorum, B. stratosphericus, B. aerophilus and B. atrophaeus have gene clusters to produce a known surfactin or variant. In addition, all B. licheniformis and B. paralicheniformis have gene clusters which produce lichenysin A (Table 3). All 29 iturin-producing strains belong to the B. subtilis group (B. subtilis, B. velezensis, B. amyloliquefaciens, B. spizizenii, B. atrophaeus). Gene clusters for the biosynthesis of bacillomycin D and bacillomycin L, variants of iturin were detected in the chromosomes of B. subtilis, B. amyloliquefaciens and B. velezensis species. In species such as B. atrophaeus and B. spizizenii, chromosomes possess gene clusters producing iturin A and mycosubtilin, respectively. For the fengycin family, A or B production gene clusters were detected in 1/3 of the chromosomes of producing strains. In fact, the 6th monomer, which may be Val in the case of fengycin B or Ala in the case of fengycin A, was not predicted during the study. A fengycin B variant was detected in the genomes of 10 strains belonging to the B. subtilis species (Table 3). However, all the genomes of B. paralicheniformis strains contained clusters of fengycin B biosynthesis genes (Table 3). The D-allothreonine observed in fengycin B is predicted to be a D-threonine by antiSMASH because these two threonine isomers can be selected by the same A-domain. A small difference is noticed in the beta carbon. Beta carbon of D-allothreonine (CαD, CβD) belongs to the D series whereas the beta carbon of D-threonine (CαD, CβL) belongs to the L series.

Figure 2

Figure 3

3.4 Prevalence of BGC NRPSs in chromosomes of the different species

The 123 strains represent 33 species with distributions as follows: 19 species were found in the soil samples only, 12 species were found in both soil and fermented food isolation media and 2 species were found only in the fermented food. Bacillus species that were only found in soil samples included B. albus, B. arachidis, B. atrophaeus, B. badius, B. bombysepticus, B. cellulasensis, B. gobiensis, B. haikouensis, B. halotolerans, B. methanolicus, B. mycoides, B. inaquosorum, B. stratosphericus, B. thuringiensis, B. toyoninsis, B. tropicus, B. anthracis, B. vallismortis and B. aerophilus (Figure 4A). Species found in soil and fermented foods were: B. licheniformis, B. paralicheniformis, B. paranthracis, B. pacificus, B. pumilus, B. safensis, B. spizizenii, B. subtilis, B. velezensis, B. wiedmannii, B. amyloliquefaciens and B. cereus. Two species, B. altitudinis and B. infantis, were only found in fermented foods (Figure 4B). Among genomes of soil Bacillus species, only B. atrophaeus had a gene cluster for iturin biosynthesis. Furthermore, only the genome of B. thuringiensis species carried a gene clusters for the biosynthesis of the antibiotic bacitracin and a new gene cluster. Analysis of genomes of the 12 species found in isolation media (soil and fermented food) revealed the presence of new gene clusters in species such as B. velezensis, B. amyloliquefaciens, B. subtilis and B. cereus. Prevalence of new gene clusters was higher in B. velezensis and B. amyloliquefaciens, respectively (Figure 4B) and the genomes of these two strains carried 3 new BGC NRPSs. The genome of B. infantis, a species that was only present in fermented foods had no NRPSs gene cluster (Figure 4B). Moreover, bacitracin biosynthesis genes were carried only by the genomes of B. paralicheniformis and B. thuringiensis. Genomes of B. pacificus, B. cereus, B. wiedmannii and B. paranthracis lacked surfactin biosynthesis genes (Figure 4).

Figure 4

3.5 Bacitracin production

Among the 123 genomes screened, only 4 (B. paralicheniformis 14DA11, B. paralicheniformis CP47, B. paralicheniformis UBBLI-30, and B. thuringiensis Bt185) bear BGC corresponding to the synthetic pathway for antibiotic bacitracin. This antibiotic is a semi-cyclic peptide constituted of 12 amino acids. In B. paralicheniformis CP47 chromosome sequence, a BGC encoding an NRPS containing 12 modules was found. These 12 modules correspond exactly to the 12 modules of bacitracin synthetase. In the genomes of B. thuringiensis Bt185, B. paralicheniformis 14DA11, and B. paralicheniformis UBBLI-30, modular structuring and amino acid composition were similar to bacitracin A1 (Figure 1), similarities ranged from 85 to 100%.

3.6 BGC potentially producing new metabolites

A total of 7 new molecules carried by 7 new gene clusters were detected in this study, based on the exploration of 123 complete genomes of strains belonging to the Bacillus genus and isolated either from fermented foods or from soil. These newly identified gene clusters would be capable of producing other new lipopeptide families and new antibiotics. All newly predicted molecules were first compared with those identified subsequently, and then with the non-ribosomal peptides available in the Norine database. Low similarities ranged from 28.6 to 50%. The first new gene cluster detected consisted of 3 genes and 4 modules incorporating Thr, X, Asn and Asn monomers, respectively (Figure 2). Amino acid incorporated by module 2 was not detected, hence the letter X at this position (Figure 2). The predicted tetrapeptide showed a low similarity of 39.4% to Cis-7-tetradecenoyl-D-Asparagine. This new gene cluster was carried by plasmids from B. cereus CMCC P0011 and B. cereus CMCC P0021 strains. Two new gene clusters consisting of 6 modules each were detected by exploring the genomes of B. velezensis ATR2, B. velezensis DSYZ, B. velezensis CGMCC 11640 and B. amyloliquefaciens WF02. The hexapeptide predicted in the B. velezensis ATR2 genome was structured as follows: D-Cys-Ser-Cys-Ala-X-D-Asn, and contained a CAL (Co-enzyme A ligase) domain. The hexapeptide predicted in the genomes of B. velezensis DSYZ, B. velezensis CGMCC 11640 and B. amyloliquefaciens WF02 was structured: Val-D-Phe-Asp-D-Asn-Gly-D-Glu. Formation is ensured by the combination of an NRPS system and a PKS (polyketide synthase) system (Figure 2). CAL and C-starter domains were not detected in this hexapeptide. Therefore, it is probably an antibiotic. Literature searches and structural analyses in Norine did not reveal any similar molecules.

In silico analyses also revealed the presence of 3 new NRPS clusters capable of producing 3 different heptapeptides depending on the modular organization and amino acid composition of each. The first predicted heptapeptide detected was carried by B. velezensis DSYZ and B. subtilis SJ-10 genomes and is structured as follows: D-Cys-Ser-Cys-Ala-X-Asn-D-Asn (Figure 2). Monomer 5 has not been predicted. A CAL domain was detected at the start of the peptide chain (Figure 2). Structural analysis of this heptapeptide in Norine showed little similarity to iturin A1 (32.3%). Furthermore, literature searches revealed no molecules with a similar structure. For the last two predicted heptapeptides, each gene cluster consists of 7 modules incorporating 7 amino acids. One had the following structure: D-Phe-D-Leu-Phe-D-Thr-Val-Ala-Thr and was carried by the B. amyloliquefaciens HM618 genome while the other had the following structural architecture: D-Phe-Tyr-Ile-X-D-Phe-Leu-Leu and was carried by the B. anthracis CMF9 genome (Figure 2). Their structures were similar to kahalalide A (39.4%) and axinastatin 5 (40%) respectively. The heptapeptide identified in the B. amyloliquefaciens HM618 genome contained a thioesterase (TE) domain marking the end of the peptide chain. The heptapeptide identified in the B. anthracis genome contained a CAL domain at the start of the peptide chain and a C condensation domain at the end of module 7 (Figure 2).

Exploration of the plasmid sequence of B. cereus CMCCPOO11 and B. cereus CMCCPOO21 revealed a BGC probably responsible for the synthesis of a new octapeptide nonribosomal peptide (Figure 2). The chemical structure of unpredictable monomers are represented by the letter X in the peptide chain. The structure search on Norine showed little similarity (50%) with cyanostatin B. No molecules with a similar structure were found in the literature.

4 Discussion

Bacillus species are known to produce a large variety of secondary metabolites. This production is influenced by environmental conditions. Thus, this study explored 123 complete genomes of Bacillus isolated from soil and fermented foods. Thus, in silico analysis of chromosomal and plasmid sequences using bio-informatics tools specific to non-ribosomal peptides revealed a potential gene cluster responsible for the biosynthesis of lipopeptides (sufactin, fengycin, iturin, kurstakin), the antibiotic bacitracin and the siderophore bacillibactin. Only B. subtilis 916 (Luo et al., 2015a) produced locillomycin, which was not detected in this study. However, surfactins are lipoheptapeptides with variants such as esperin, lichenysin, pumilacidin and surfactin (Waongo et al., 2023). Two distinct amino acids were predicted to be incorporated in the first module: either a glutamate (Glu) or a glutamic acid (Gln) (Hu et al., 2019). As for iturins, they are lipoheptapeptides whose main variants are: iturins A and C, bacillomycins D, F and L, mycosubtilin and mojavensin (Waongo et al., 2023). Bacillibactin is a siderophore whose sequence is Dhb-Gly-Thr (May et al., 2001). As a result, the modular organization and monomer composition of predicted peptides from the surfactin and iturin groups as well as bacillibactin, were similar to literature data. Compared with fengycin, the lipodecapeptide predicted in genomes of B. subtilis UD1022, B. vallismortis DSM11031, B. subtilis ZD01, B. halotolerans ZB201702, B. mojavensis B-41812, B. mojavensis B-41341, B. subtilis MEC_B298, B. subtilis s-16, B. subtilis FUA2231, B. subtilis FUA2232, B. subtilis SRCM103517, and B. subtilis BSP1, contains at position 8 a glutamic acid (Glu). In contrast, various authors have reported that module 8 incorporates glutamine (Gln) during peptide chain formation (Ait Kaki et al., 2020; Hussein, 2019). This observed variability shows that certain Bacillus could produce a fengycin B variant. Also, structural analysis of predicted kurstakins shows the presence of a threonine (Thr) at position 6 in the B. mycoide BGSC 4BQ1 genome. Generally, the amino acid occupying this position in the case of kurstakin is glutamine (Gln) (Béchet et al., 2012). Although monomers 2 and 6 have not been predicted in the genomes of other B. cereus group species (B. thuringiensis, B. cereus, B. wiedmannii, B. bombysepticus), it is quite possible that B. mycoide BGSC 4BQ1 would be capable of producing a kurstakin variant. Nevertheless, the two putative variants identified through this in silico screening approach will need to be further investigated through in vitro experiments.

This study revealed the coexistence of lipopeptide biosynthesis genes. Indeed, Luo et al. (2015b) reported the coexistence of surfactin (sfr), bacillomycin (bmy), fengycin (fen) and locillomycin (Loc) gene clusters in B. subtilis 916 genome. In this study, B. amyloliquefaciens WF02, B. amyloliquefaciens HM618, B. velezensis ATR2, B. velezensis CGMCC 11640, B. velezensis Lzh-a42, B. subtilis SJ-10 and B. velezensis DSYZ genomes also contained three known lipopeptide biosynthesis gene clusters and novel gene cluster that could produce a new lipopeptide family. To our knowledge, no study has demonstrated the coexistence of four or five gene clusters of lipopeptide in genomes of species such as B. amyloliquefaciens and B. velezensis. According to Luo et al. (2015b), a co-production of surfactin, fengycin, iturin and locillomycin by B. subtilis 916 is at the origin of its inhibitory capacity against multi-resistant Staphylococcus aureus. Similarly, the co-production of lipopeptides would reduce hemolytic activity of the producing strains (Luo et al., 2019; Waongo et al., 2023).

The emergence of multi-resistant strains to commonly used antibiotics is a major challenge (Eduardo-Correia et al., 2020). The exploration of new molecules that could serve as alternatives remains a necessity. Thus, the study identified seven new gene clusters (NRPS and NRPS/PKS), two of which were carried by plasmid sequences. These new gene clusters were responsible for the biosynthesis of a tetrapeptide, two hexapeptides, three heptapeptides and octapeptide. The predicted tetrapeptide and octapeptide were synthesized by gene clusters carried by plasmids. The structural architecture and monomer composition of all predicted molecules differed from the peptides available in the database (Flissi et al., 2023). Furthermore, literature searches did not reveal any molecules with similar structure and monomer composition. Except for the hexapeptide predicted in the chromosomes of B. velezensis DSYZ and B. amyloliquefaciens WF02, all the others carried a CAL or C-starter domain in the first module. In fact, lipopeptides are made up of chains of amino acids and fatty acids. Their structure is characterized by the presence of C-starter and CAL domains, which play a role in fatty acid chain activation (Ongena and Jacques, 2007). Thus, their presence in the first module of the seven predicted new peptides means that they could belong to the lipopeptide family. However, certain antibiotics of purely peptidic clinical interest, such as vancomycin and bacitracin, are characterized by the presence of an adenylation domain (A) in the first module (Konz et al., 1997; Wageningen et al., 1998). This A domain is responsible for selecting and activating the first amino acid to be integrated into the growing peptide chain. Consequently, the new hexapeptide predicted in the chromosomes of B. velezensis DSYZ and B. amyloliquefaciens WF02 would be a peptide antibiotic. Esmaeel et al. (2016) reported the synthesis of a new lipopeptide by analyzing the genomes of genus Burkholderia. On the other hand, since lipopeptides are synthesized non-ribosomally, the presence of gene clusters responsible for their synthesis in plasmids is rare. According to the literature, no study has revealed the presence of NRPS lipopeptide synthase genes on a Bacillus plasmid, only on the Burkholderia plasmid (Esmaeel et al., 2016). As a result, the putative new BGCs would constitute new families of lipopeptides as well as new antibiotics, which could have unique and interesting biological properties (e.g., antifungal, antibacterial, antiviral). Biomolecules newly predicted in this study could be at the center of current research in the interest of their future use in several fields such as agri-food, cosmetics and medicine. The results of this study reveal that the genomes of Bacillus strains available in databases contain many unknown molecules that could play important roles in antimicrobial control. However, it would be interesting to elucidate the biological functions of all these new molecules. Although the bioinformatic discovery of novel molecules often captures attention, rigorous experimental validation remains the crucial step in translating promising ideas into tangible scientific findings and ensuring genuine progress.

5 Conclusion

Exploration of microorganisms genomes allows for the rapid identification of gene clusters located on chromosomes coding for new beneficial molecules. Thus, in addition to chromosomes, it would be interesting to analyze plasmids that could have BGC clusters of NRPs or lipopeptides. Then, this study allowed us to identify seven new gene clusters synthesizing new non-ribosomal peptides. Our results suggest that several NRPs capable of being produced by Bacillus strains isolated from fermented foods and soil are still uncharacterized and their properties still unknown. Therefore, targeted research should be conducted experimentally to validate these different predicted molecules.

Statements

Data availability statement

Genome mining work was done base on the whole genome sequences of 123 Bacillus strains is listed in Table 1, which can be obtained from NCBI nucleotide database (https://www.ncbi.nlm.nih.gov/nuccore). All the other data supporting the findings is contained within the manuscript.

Ethics statement

The manuscript presents research on animals that do not require ethical approval for their study.

Author contributions

BW: Conceptualization, Investigation, Methodology, Writing – original draft, Writing – review & editing. LN: Investigation, Methodology, Writing – review & editing. FT: Methodology, Writing – review & editing. W-SAZ: Methodology, Writing – review & editing. JL: Supervision, Writing – review & editing. AS: Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We would like to thank University Joseph KI-ZERBO of Burkina Faso and Huazhong Agricultural University of China for their financial support.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1

    Ait KakiA.SmargiassoN.OngenaM.Kara AliM.MoulaN.de PauwE.et al. (2020). Characterization of new fengycin cyclic lipopeptide variants produced by Bacillus amyloliquefaciens (ET) originating from a salt lake of eastern Algeria. Curr. Microbiol.77, 443451. doi: 10.1007/s00284-019-01855-w

  • 2

    AletiG.SessitschA.BraderG. (2015). Genome mining: prediction of lipopeptides and polyketides from Bacillus and related Firmicutes. Comput. Struct. Biotechnol. J.13, 192203. doi: 10.1016/j.csbj.2015.03.003

  • 3

    AslamB.KhurshidM.ArshadM. I.MuzammilS.RasoolM.YasmeenN.et al. (2021). Antibiotic resistance: one health one world outlook. Front. Cell. Infect. Microbiol.11, 120. doi: 10.3389/fcimb.2021.771510

  • 4

    BéchetM.CaradecT.HusseinW.AbderrahmaniA.CholletM.LecléreV.et al. (2012). Structure, biosynthesis, and properties of kurstakins, nonribosomal lipopeptides from Bacillus spp. Appl. Microbiol. Biotechnol.95, 593600. doi: 10.1007/s00253-012-4181-2

  • 5

    BlinK.ShawS.AugustijnH. E.ReitzZ. L.BiermannF.AlanjaryM.et al. (2023). antiSMASH 7.0:new and improved predictions for detection, regulation, chemical structures and visualisation. Nucleic Acids Res.51, 4650. doi: 10.1093/nar/gkad344

  • 6

    BlinK.ShawS.KloostermanA. M.Charlop-PowersZ.Van WezelG. P.MedemaM. H.et al. (2021). AntiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res.49, W29W35. doi: 10.1093/nar/gkab335

  • 7

    BonmatinJ.-M.LaprevoteO.PeypouxF. (2003). Diversity among microbial cyclic Lipopeptides: Iturins and Surfactins. Activity-structure relationships to design new bioactive agents. Comb. Chem. High Throughput Screen.6, 541556. doi: 10.2174/138620703106298716

  • 8

    DubanM.CociancichS.LeclèreV. (2022). Nonribosomal peptide synthesis definitely working out of the rules. Microorganisms.10, 119. doi: 10.3390/microorganisms10030577

  • 9

    Eduardo-CorreiaB.Morales-FilloyH.AbadJ. P. (2020). Bacteria from the multi-contaminated Tinto river estuary (SW, Spain) show high multi-resistance to antibiotics and point to Paenibacillus spp. as antibiotic-resistance-dissemination players. Front. Microbiol.10, 118. doi: 10.3389/fmicb.2019.03071

  • 10

    EsmaeelQ.PupinM.KieuN. P.ChataignéG.BéchetM.DeravelJ.et al. (2016). Burkholderia genome mining for nonribosomal peptide synthetases reveals a great potential for novel siderophores and lipopeptides synthesis. MicrobiologyOpen.5, 512526. doi: 10.1002/mbo3.347

  • 11

    FanaeiM.JurcicK.EmtiaziG. (2021). Detection of simultaneous production of kurstakin, fengycin and surfactin lipopeptides in Bacillus mojavensis using a novel gel-based method and MALDI-TOF spectrometry. World J. Microbiol. Biotechnol.37, 111. doi: 10.1007/s11274-021-03064-9

  • 12

    FlissiA.DubanM.JacquesP.LeclèreV.PupinM. (2023). Norine: bioinformatics methods and tools for the characterization of newly discovered nonribosomal peptidesFlissiA.DubanM.JacquesP.LeclèreV.PupinM.Methods Mol. Biol.2670, 303318. doi: 10.1007/978-1-0716-3214-7_16

  • 13

    FlissiA.RicartE.CampartC.ChevalierM.DufresneY.MichalikJ.et al. (2020). Norine: update of the nonribosomal peptide resource. Nucleic Acids Res.48, 465469. doi: 10.1093/nar/gkz1000

  • 14

    HathoutY.HoY.RyzhovV.DemirevP.FenselauC. (2000). Kurstakins: a new class of lipopeptides isolated from Bacillus thuringiensis. J. Nat. Prod.63, 14921496. doi: 10.1021/np000169q

  • 15

    HuF.LiuY.LiS. (2019). Rational strain improvement for surfactin production: enhancing the yield and generating novel structures. Microb. Cell Factories.18, 113. doi: 10.1186/s12934-019-1089-x

  • 16

    HuanY.KongQ.MouH.YiH. (2020). Antimicrobial peptides: classification, design, application and research progress in multiple fields. Front. Microbiol.11, 121. doi: 10.3389/fmicb.2020.582779

  • 17

    HusseinW. (2019). Fengycin or plipastatin? A confusing question in bacilli. Biotechnologia.100, 4755. doi: 10.5114/bta.2019.83211

  • 18

    IqbalS.UllahN.JanjuaH. A. (2021). In vitro evaluation and genome mining of Bacillus subtilis strain RS10 reveals its biocontrol and plant growth-promoting potential. Agriculture.11:1273. doi: 10.3390/agriculture11121273

  • 19

    JinP.WangH.LiuW.FanY.MiaoW. (2017). A new cyclic lipopeptide isolated from Bacillus amyloliquefaciens HAB-2 and safety evaluation. Pestic. Biochem. Physiol.147, 4045. doi: 10.1016/j.pestbp.2017.08.015

  • 20

    KatoT.ShojiJ. (1980). The structure of octapeptin D (studies on antibiotics from the genus Bacillus. XXVIII). J. Antibiot.33, 186191. doi: 10.7164/antibiotics.33.186

  • 21

    KhuranaH.SharmaM.VermaH.LopesB. S.LalR.NegiR. K. (2020). Genomic insights into the phylogeny of Bacillus strains and elucidation of their secondary metabolic potential. Genomics.112, 31913200. doi: 10.1016/j.ygeno.2020.06.005

  • 22

    KonzD.KlensA.SchörgendorferK.MarahielM. A. (1997). The bacitracin biosynthesis operon of Bacillus licheniformis ATCC 10716: molecular characterization of three multi-modular peptide synthetases. Chem. Biol.4, 927937. doi: 10.1016/S1074-5521(97)90301-X

  • 23

    LamV. B.MeyerT.AriasA. A.OngenaM.OniF. E.HöfteM. (2021). Bacillus cyclic lipopeptides iturin and fengycin control rice blast caused by Pyricularia oryzae in potting and acid sulfate soils by direct antagonism and induced systemic resistance. Microorganisms9, 125. doi: 10.3390/microorganisms9071441

  • 24

    LeclèreV.WeberT.JacquesP.PupinM. (2016). Bioinformatics tools for the discovery of new nonribosomal peptides. Methods Mol. Biol.1401, 209232. doi: 10.1007/978-1-4939-3375-4_14

  • 25

    LeeS. C.KimS. H.ParkI. H.ChungS. Y.ChoiY. L. (2007). Isolation and structural analysis of bamylocin a, novel lipopeptide from Bacillus amyloliquefaciens LP03 having antagonistic and crude oil-emulsifying activity. Arch. Microbiol.188, 307312. doi: 10.1007/s00203-007-0250-9

  • 26

    LiuR.ZhangD.LiY.TaoL.TianL. (2010). A new antifungal cyclic lipopeptide from Bacillus marinus B-9987. Helv. Chim. Acta.93, 24192425. doi: 10.1002/hlca.201000094

  • 27

    LuoC.ChenY.LiuX.WangX.WangX.LiX.et al. (2019). Engineered biosynthesis of cyclic lipopeptide locillomycins in surrogate host Bacillus velezensis FZB42 and derivative strains enhance antibacterial activity. Appl. Microbiol. Biotechnol.103, 44674481. doi: 10.1007/s00253-019-09784-1

  • 28

    LuoC.LiuX.ZhouX.GuoJ.TruongJ.WangX.et al. (2015a). Unusual biosynthesis and structure of locillomycins from Bacillus subtilis 916. Appl. Environ. Microbiol.81, 66016609. doi: 10.1128/AEM.01639-15

  • 29

    LuoC.LiuX.ZhouH.WangX.ChenZ. (2015b). Nonribosomal peptide synthase gene clusters for lipopeptide biosynthesis in Bacillus subtilis 916 and their phenotypic functions. Appl. Environ. Microbiol.81, 422431. doi: 10.1128/AEM.02921-14

  • 30

    MaZ.WangN.HuJ.WangS. (2012). Isolation and characterization of a new iturinic lipopeptide, mojavensin a produced by a marine-derived bacterium Bacillus mojavensis B0621A. J. Antibiot.65, 317322. doi: 10.1038/ja.2012.19

  • 31

    MayJ. J.WendrichT. M.MarahielM. A. (2001). The dhb operon of Bacillus subtilis encodes the biosynthetic template for the catecholic siderophore 2, 3-dihydroxybenzoate-glycine-threonine trimeric ester bacillibactin *. J. Biol. Chem.276, 72097217. doi: 10.1074/jbc.M009140200

  • 32

    MeyersE.ParkerW. L.BrownW. E. (1976). A nomenclature proposal for the octapeptin antibiotics. J. Antibiot.29, 12411242. doi: 10.7164/antibiotics.29.1241

  • 33

    NaruseN.TenmyoO.KobaruS.KameiH.MiyakiT.KonishiM.et al. (1990). Pumilacidin, a complex of new antiviral antibiotics: production, isolation, chemical properties, structure and biological activity. J. Antibiot.43, 267280. doi: 10.7164/antibiotics.43.267

  • 34

    NishikioriT.NaganawaH.MuraokaY.AoyagiT.UmezawaH. (1986). Plipastatins: new inhibitors of phospholipase A2, produced by Bacillus cereus BMG302-fF67. J. Antibiot.39, 755761. doi: 10.7164/antibiotics.39.755

  • 35

    OngenaM.JacquesF. (2007). Bacillus lipopeptides: versatile weapons for plant disease biocontrol. Trends Microbiol.16, 115125. doi: 10.1016/j.tim.2007.12.009

  • 36

    PathakK. V.KehariaH. (2014). Identification of surfactins and iturins produced by potent fungal antagonist, Bacillus subtilis K1 isolated from aerial roots of banyan (Ficus benghalensis) tree using mass spectrometry. 3 Biotech4, 283295. doi: 10.1007/s13205-013-0151-3

  • 37

    PeypouxF.MarionD.Maget-DanaR.PtakM.DasB. C.MichelG. (1985). Structure of bacillomycin F, a new peptidolipid antibiotic of the iturin group. Eur. J. Biochem.153, 335340. doi: 10.1111/j.1432-1033.1985.tb09307.x

  • 38

    PeypouxF.PommierM. T.MarionD.PtakM.DasB. C.MichelG. (1986). Revised structure of mycosubtilin, a peptidolipid antibiotic from Bacillus subtilis. J. Antibiot.39, 636641. doi: 10.7164/antibiotics.39.636

  • 39

    RahmanM. A.NooreM. S.HasanM. A.UllahM. R.RahmanM. H.HossainM. A.et al. (2014). Identification of potential drug targets by subtractive genome analysis of Bacillus anthracis A0248: an in silico approach. Comput. Biol. Chem.52, 6672. doi: 10.1016/j.compbiolchem.2014.09.005

  • 40

    Sang-cheolL.KimS.ParkI.ChungS.ChandraM. S.ChoiY. (2010). Isolation, purification, and characterization of novel fengycin S from Bacillus amyloliquefaciens LSC04 degrading-crude oil. Biotechnol. Bioprocess Eng.15, 246253. doi: 10.1007/s12257-009-0037-8

  • 41

    ShojiJ.KatoT. (1976). The structure of cerexin B (studies of antibiotics from the genus Bacillus. XVII). J. Antibiot.29, 12751280.

  • 42

    ShojiJ.KatoT.MatsumotoK.TakahashiY.MayamaM. (1976a). Production and isolation of cerexins C and D (studies on antibiotics from the genus Bacillus. XVIII). J. Antibiot.29, 12811285. doi: 10.7164/antibiotics.29.1268

  • 43

    ShojiJ.KatoT.SakazakiR. (1976b). The total structure of cerexin a (studies on antibiotics from the genus Bacillus. XVI). J. Antibiot.29, 12681274. doi: 10.7164/antibiotics.29.1275

  • 44

    TamangJ. P.ShinD.-H.JungS. J.ChaeS.-W. (2016). Functional properties of microorganisms in fermented foods. Front. Microbiol.7, 113. doi: 10.3389/fmicb.2016.00578

  • 45

    TapiA.CholletM.ScherensB.JacquesP. (2010). New approach for the detection of non-ribosomal peptide synthetase genes in Bacillus strains by polymerase chain reaction. Appl. Microbiol. Biotechnol.85, 15211531. doi: 10.1007/s00253-009-2176-4

  • 46

    ThasanaN.PrapagdeeB.RangkadilokN.SallabhanR.AyeS. L.RuchirawatS.et al. (2010). Bacillus subtilis SSE4 produces subtulene a, a new lipopeptide antibiotic possessing an unusual C15 unsaturated β-amino acid. FEBS Lett.584, 32093214. doi: 10.1016/j.febslet.2010.06.005

  • 47

    ThéatreA.Cano-PrietoC.BartoliniM.LaurinY.DeleuM.NiehrenJ.et al. (2021). The surfactin-like lipopeptides from Bacillus spp.: natural biodiversity and synthetic biology for a broader application range. Front. Bioeng. Biotechnol.9:623701. doi: 10.3389/fbioe.2021.623701

  • 48

    ThomasD. W.ItoT. (1969). The revised structure of the peptide antibiotic esperin, established by mass spectrometry. Tetrahedron.25, 19851990. doi: 10.1016/S0040-4020(01)82819-2

  • 49

    VaterJ.KablitzB.WildeC.FrankeP.MehtaN.CameotraS. S. (2002). Matrix-assisted laser desorption ionization-time of flight mass spectrometry of lipopeptide biosurfactants in whole cells and culture filtrates of Bacillus subtilis C-1 isolated from petroleum sludge. Appl. Environ. Microbiol.68, 62106219. doi: 10.1128/AEM.68.12.6210-6219.2002

  • 50

    VolponL.TsanP.MajerZ.VassE.HollósiM.NoguéraV.et al. (2007). NMR structure determination of a synthetic analogue of bacillomycin Lc reveals the strategic role of l-Asn1 in the natural iturinic antibiotics. Spectrochim. Acta A Mol. Biomol. Spectrosc.67, 13741381. doi: 10.1016/j.saa.2006.10.027

  • 51

    WageningenA. A.KirkpatrickP. N.WilliamsD. H.HarrisB. R.KershawJ. K.LennardN. J.et al. (1998). Sequencing and analysis of genes involved in the biosynthesis of a vancomycin group antibiotic. Chem. Biol.5, 155162. doi: 10.1016/S1074-5521(98)90060-6

  • 52

    WaongoB.PupinM.DubanM.ChataigneG.ZongoO.CisseH.et al. (2023). Kawal: a fermented food as a source of Bacillus strain producing antimicrobial peptides. Sci. Afr.20:e01714. doi: 10.1016/j.sciaf.2023.e01714

  • 53

    ZhangH. L.HuaH. M.PeiY. H.YaoX. S. (2004). Three new cytotoxic cyclic acylpeptides from marine Bacillus sp. Chem. Pharm. Bull.52, 10291030. doi: 10.1248/cpb.52.1029

  • 54

    ZhaoP.XueY.GaoW.LiJ.ZuX.FuD.et al. (2018). Bacillaceae-derived peptide antibiotics since 2000. Peptides. 101, 1016. doi: 10.1016/j.peptides.2017.12.018

Summary

Keywords

Bacillus, nonribosomal peptide, lipopeptide, biosynthetic gene clusters, genome mining

Citation

Waongo B, Ndayishimiye L, Tapsoba F, Zongo W-SA, Li J and Savadogo A (2025) Prospection for potential new non-ribosomal peptide gene clusters in Bacillus genus isolated from fermented foods and soil through genome mining. Front. Microbiol. 16:1515483. doi: 10.3389/fmicb.2025.1515483

Received

23 October 2024

Accepted

21 May 2025

Published

16 June 2025

Volume

16 - 2025

Edited by

Jørgen J. Leisner, University of Copenhagen, Denmark

Reviewed by

Maria Carla Martini, Worcester Polytechnic Institute, United States

Sajid Iqbal, Oujiang Laboratory (Zhejiang Laboratory for Regenerative Medicine, Vision, and Brain Health), China

Updates

Copyright

*Correspondence: Blaise Waongo,

ORCID: François Tapsoba, https://orcid.org/0000-0001-6964-0442

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics