Your new experience awaits. Try the new design now and help us make it even better

DATA REPORT article

Front. Genet., 21 January 2026

Sec. Livestock Genomics

Volume 17 - 2026 | https://doi.org/10.3389/fgene.2026.1720096

This article is part of the Research TopicBacterial Taxono-genomics and Comparative genomicsView all articles

Whole-genome sequence and assembly of the sporogenic Bacillus paralicheniformis T7 strain with high proteolytic and amylolytic activities

  • 1Laboratory for Genetics and Biochemistry of Microorganisms, National Center for Biotechnology, Astana, Kazakhstan
  • 2Faculty of Natural Sciences, L. N. Gumilyev Eurasian National University, Astana, Kazakhstan
  • 3Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Astana, Kazakhstan

1 Introduction

The diverse enzymatic characteristics of Bacillus species render them advantageous for several applications, including food processing, agriculture, biomedicine, biofuel generation, hydrolysis, bioremediation, and natural polymer processing (Muras et al., 2021; Sharma et al., 2021; Zhao et al., 2021). Several bacillary strains exhibit elevated production capabilities (Contesini et al., 2018), with Bacillus paralicheniformis strains being prominent in the enzyme manufacturing sector. For instance, B. paralicheniformis MKU3 is recognized for its ability to produce proteases that effectively break down feather keratin (Santha Kalaikumari et al., 2019), while B. paralicheniformis BL. HK produces extracellular proteases used in the enzymatic processing of animal hides (Akhtar et al., 2024). Furthermore, B. paralicheniformis HR-1 and Bacillus haynesii HR-5, isolated from bottom sediments, are employed for the production of alkaline proteases applicable in the textile and leather industries (Thakor et al., 2025). Proteases obtained from B. paralicheniformis T7 also exhibit significant keratinase activity in the hydrolysis of avian feathers, wool, horns, hooves, and hides (Aktayeva and Khassenov, 2024b; Aktayeva and Khassenov, 2024a). Alongside proteases and keratinases, B. paralicheniformis strains produce α-amylases (Božić et al., 2020), phosphatases (Abdelgalil et al., 2021), esterases (Ganesh Kumar et al., 2021), and xylanases (Ngom et al., 2023). Specifically, the α-amylase produced by B. paralicheniformis ATCC 9945a effectively hydrolyzes starch without pretreatment (Božić et al., 2020). Additionally, the α-amylase from B. paralicheniformis GRA2 was integrated into a multienzyme preparation for the hydrolysis of food waste (Roslan et al., 2021). Meanwhile, B. paralicheniformis APSO efficiently produces thermostable alkaline phosphatase (Abdelgalil et al., 2021), and B. paralicheniformis strain G1, isolated from Arabian Sea sediments, secretes esterases that effectively biodegrade polystyrene (Ganesh Kumar et al., 2021).

In addition to enzymes, the antimicrobial peptides produced by B. paralicheniformis are of particular interest to researchers. For instance, bacitracin, produced by B. paralicheniformis UBBLi30, inhibits the growth of Micrococcus luteus, methicillin-resistant Staphylococcus aureus (MRSA), Streptococcus pyogenes, and Propionibacterium acnes, while additionally inhibiting biofilm formation by M. luteus and MRSA (Ahire et al., 2020). The antimicrobial peptide produced by B. paralicheniformis exhibits a bacteriostatic effect against Salmonella typhi and Listeria monocytogenes (Choyam et al., 2021) while remaining neutral against Lactobacillus. The advantage of these antimicrobial peptides is their resistance to degradation by physiological proteases or under high pH and temperature. Given these beneficial properties, such antimicrobial peptides may serve as viable alternatives to antibiotics as well as biopreservatives (Ahire et al., 2020).

Genomic methods are applied for the characterization of bacteria, enabling the identification of genes responsible for the biochemical and phenotypic features of a strain. Analysis of the B. paralicheniformis G1 genome revealed the presence of genes encoding serine, metallo-, and cysteine proteases. Most of the identified varieties were metalloproteases from the M20 and M50 families, along with serine proteases from the S8, S9, and S33 families (Santha Kalaikumari et al., 2021). Similarly, the B. paralicheniformis NBG-07 genome includes genes that code for various enzymes, including alpha-amylase, protease, cellulase, and laccase (Ramzan et al., 2024). Genomic sequencing of the polystyrene-degrading bacterium B. paralicheniformis G1 identified genes encoding monooxygenase, dioxygenase, peroxidase, esterase, and hydrolases capable of the degradation of synthetic polymers. Genes associated with bacterial motility and biofilm formation have also been identified in Bacillus species (Ganesh Kumar et al., 2021).

The present investigation focuses on the B. paralicheniformis T7 strain, isolated from Kazakhstan soil, which exhibits enhanced protease and keratinase activities. This strain also demonstrates α-amylase, esterase, phosphatase, and phytase activities, indicating that it may serve as a potential source for multienzyme production. Acquiring a comprehensive understanding of the strain’s biochemical features necessitates a genome-wide analysis. Such insights may then be applied in genetic engineering and in the formulation of strategies for improving the production of extracellular enzymes for biotechnological applications.

The objective of this work was to perform whole-genome sequencing and de novo genome assembly of the B. paralicheniformis T7 strain using Nanopore sequencing technology. A comprehensive investigation of the genome is expected to provide significant insights into the mechanisms of enzyme action. Moreover, this investigation will generate genetic data that is both valuable for future research and relevant for practical applications.

2 Materials and methods

2.1 Media, culture, and storage conditions

The B. paralicheniformis T7 strain was cultured on several media containing (w/v) lysogeny broth (0.5% yeast extract [Condalab, Madrid, Spain, Lot # 1702.00], 1% tryptone plus [Sigma-Aldrich, St. Louis, MO, United States, Lot # BCCB8073], 0.5% NaCl [Sigma-Aldrich, Lot # SZBE2960V0], pH 7.0), nutrient broth (Himedia, Mumbai, India, Lot # 0000044915), and feather medium (0.03% NaH2PO4, 0.035% Na2HPO4, and 0.75% feather powder, pH 7.0). The agar media used for the isolation and screening of the strain comprised (w/v) nutrient agar (TM Media, Rajastan, India, Lot # M3E1EV01), LB agar (lysogeny broth with 1.5% agar [TM Media, Lot # B1CA1HT01]), skimmed milk agar (2% skimmed milk, 0.1% NaCl, 1% tryptone plus, 1% agar), gelatin agar (0.4% peptone [TM Media, Product Code # 1506], 0.1% yeast extract, 1.5% gelatin, 1.5% agar), and feather agar (0.17% NaH2PO4, 0.035% feather powder, 1.5% agar). Sporulation was induced in Difco sporulation medium (w/v) (0.8% nutrient broth, 0.1% KCl, 0.012% MgSO4, 0.001% MnCl2, 0.005549% CaCl2, 0.15191 × 10−3% FeSO4, pH 7.2), Arret-Kirshbaum sporulation agar (0.6% pancreatic digest of gelatin, 0.4% casein enzyme, 0.3% g/L yeast extract, 0.15 g/L beef extract, 0.1% dextrose, 0.03% MnSO4, 1.5% agar, pH 7.0), and modified nutrient agar (0.28% nutrient agar, 0.01% CaCl2, 0.005% MnSO4, pH 6.9). The specific preparation methods are detailed in Aktayeva and Khassenov (2024a).

The strain was cultivated either in liquid broth at 30 °C–37 °C with shaking (150–200 rpm) for 10–16 h, or on solid agar medium at 30 °C–37 °C for 18–20 h. For short-term maintenance, the strain was stored on LB agar or nutrient agar plates at 4 °C for up to 30 days. For long-term storage, the cells were suspended in lysogeny broth supplemented with 50% sterile glycerol and stored at −80 °C. Storage at −80 °C for 5 years did not affect its characteristics, including growth conditions, sporulation, and enzymatic properties.

2.2 Isolation and identification of the strain

The strain was isolated from soil samples collected in Kazakhstan (42°54′00″N, 71°22′00″E) and was identified based on the morphological features of the colonies and cultures, as well as by light microscopic analysis (Bergey and Holt, 1994). For the identification of genetic features, the 16S rRNA gene was amplified using universal primers 27F and 1492R and subjected to Sanger sequencing (Sanger et al., 1977) using the BigDye Terminator v.3.1 Cycle Sequencing Kit (Thermo Fisher Scientific, Waltham, MA, United States) according to the manufacturer’s protocol. DNA fragments were separated using an ABI 3730xl automated sequencer (Applied Biosystems, Foster City, CA, United States). The obtained chromatograms were analyzed and compared with reference sequences using Vector NTI software v.11 (Thermo Fisher Scientific) and the NCBI database (http://blast.ncbi.nlm.nih.gov/Blast.cgi).

2.3 Enzyme assays

Keratinase activity was determined using azokeratin and keratin azure (St. Louis, MO, United States) as a substrate in 50 mM Tris-HCl (pH 9.0) at both 60 °C and 70 °C (Lin et al., 1992; Alane Beatriz Vermelho and Sabrina Martins Lage, 2013; Aktayeva and Khassenov, 2024b; Aktayeva and Khassenov, 2024a). Proteolytic activity was assessed according to Coêlho et al. (2016) using azocasein (Sigma-Aldrich) as a substrate in 50 mM Tris-HCl (pH 9.0) at 60 °С. Collagenase activity was evaluated according to Chavira et al. (1984) using azocoll (Sigma-Aldrich) as a substrate in 50 mM Tris-HCl (pH 9.0) supplemented with 1 mM CaCl2 at 37 °C. Milk clotting activity was measured following the method of Akishev et al. (2022) using cow milk as a substrate. Alpha-amylase activity was determined in 100 mM phosphate buffer, pH 6.0, at 85 °C via the reducing sugar method using potato starch as the substrate (Sigma-Aldrich) (Kiribayeva et al., 2024). Esterase activity was ascertained at 40 °C in 50 mM phosphate buffer, pH 7.0, according to Zhao et al. (2022) employing 4-nitrophenyl acetate (Thermo Scientific, Waltham, MA, United States) or 4-nitrophenyl octanoate (Thermo Fisher, Kandel, Germany) as a substrate. Alkaline phosphatase activity was assayed following the method of Abdelgalil et al. (2021) using p-nitrophenyl phosphate disodium salt hexahydrate (PanReacAppliChem, Darmstadt, Germany) as a substrate in 100 mM phosphate buffer, pH 10.3, at 70 °C. Phytase activity was assessed as described by Choi et al. (2001) using phytic acid sodium salt as a substrate in 100 mM Tris-HCl (pH 8.0) at 60 °C. All experiments were performed in triplicate. Enzymatic activity measurement data were derived from independent activity assays, with mean values, standard deviations (SD), and p-values calculated using GraphPad Prism version 8.0.1 (GraphPad Software, La Jolla, CA, United States, www.graphpad.com). All data are presented as means ± SD (n = 3).

2.4 Isolation and purification of endospores

Endospores were generated by culturing the strain on Arret-Kirshbaum sporulation agar or modified nutrient agar at 37 °C, and in Difco sporulation medium in a shaking incubator (150 rpm) at 37 °C. Cultivation time was determined by microbiological observation using Schaeffer-Fulton staining. Cultivation was stopped when the spore-to-cell ratio reached 1:1 or when spores predominated. Cells and spores were harvested by centrifugation. To remove the remaining vegetative cells, the suspension was heated for 20 min at 90 °C.

2.5 Antibiotic resistance

The strain’s antibiotic resistance profile was tested using the disk diffusion method as described by Matuschek et al. (2014). The following antibiotics were used: ampicillin (TM Media, 10 µg/disc, Lot # 041210), chloramphenicol (TM Media, 30 µg/disc, Lot # 0712109), ciprofloxacin (TM Media, 5 µg/disc, Lot # 0812110), clindamycin (TM Media, 2 µg/disc, Lot # 0012109), erythromycin (TM Media, 15 µg/disc, Lot # 0412109), gentamicin (TM Media, 120 µg/disc, Lot # 0012012), kanamycin (TM Media, 30 µg/disc, Lot # 0612110), nalidixic acid (TM Media, 30 µg/disc, Lot # 0922108), penicillin-G (TM Media, 10 µg/disc, Lot # 0422108), rifampicin (TM Media, 5 µg/disc, Lot # 0712109), streptomycin (TM Media, 10 µg/disc, Lot # 0912109), tetracycline (TM Media, 30 µg/disc, Lot # 0012109), tobramycin (TM Media, 10 µg/disc, Lot # 1212109), and cefazolin (HiMedia, 30 µg/disc, Lot # 0000156850).

2.6 Zymography and proteomic analysis

The culture supernatant of the strain was used for zymographic and mass spectrometric analysis. Zymographic analysis was performed accordingly to Aktayeva et al. (2022) using casein, keratin, and gelatin as substrates. To determine the specific protease classes, the following inhibitors were employed: phenylmethylsulfonyl fluoride, EDTA, E64, and Pepstatin A. Mass spectrometric analysis of the secreted proteome was performed using a Maxis Impact II Instrument (Bruker, Germany). The extracellular enzymes were identified via the Mascot platform.

2.7 Whole-genome investigation

The strain was cultivated in 10 mL of nutrient broth. The cells were harvested by centrifugation at 6,000 g for 7 min at 4 °C. Genomic DNA was isolated with a Genomic Wizard Purification Kit (Promega, Madison, WI, United States) following the manufacturer’s protocol. The quantity and quality of the DNA were determined using a NanoDrop OneC spectrophotometer (Thermo Scientific, Waltham, MA, United States) and agarose gel electrophoresis. The concentration obtained was 1.79 ng/µL, which corresponds to a yield of 38 μg of genomic DNA. Whole-genome libraries were generated using the Oxford Nanopore Technologies (ONT) Ligation Sequencing Kit (SQK-LSK109), which entailed the ligation of sequencing adapters following the manufacturer’s protocol (https://nanoporetech.com/document/gDNA-sqk-lsk109). The generated libraries were quantified using a Qubit 2.0 fluorometer (Invitrogen) and sequenced on the MinION platform using a FLO-MIN106 (R9.4.1) flow cell. Raw data underwent base calling using Guppy v3.4.1, followed by the removal of low-quality reads to ensure high-quality data.

Sequencing generated a cumulative count of 149,880 sequencing reads with a median read length of 5,597 base pairs and an average read quality score of 11.98. The “epi2me-labs/wf-bacterial-genomes” module was employed for downstream analysis. De novo genome assembly was performed using the Flye v.2.9.1-b1780 algorithm (Kolmogorov et al., 2019). A single circular contig of 4,360,494 bp with a G + C content of 45.93% and a high mean contig coverage of 360X was generated (Supplementary File 1, Supplementary Table S1; Supplementary Table S2). Before annotation, the assembly was polished using Medaka (v.1.7.2) based on the same ONT reads. Genomic features were visualized via DNA Features Viewer. Additional genome annotation and circular genome visualization (Figure 1) were performed using Proksee (Grant et al., 2023). The PATRIC resource was accessed via the BV-BRC platform, which integrates and extends the former PATRIC database (Wattam et al., 2017; Olson et al., 2023). Genome completeness was assessed against the bacillus_odb12 database using BUSCO v.6.0.0 (Manni et al., 2021). The results showed that the assembly contained 97.2% complete BUSCOs (97.0% single-copy, 0.1% duplicated), 1.8% fragmented BUSCOs, and 1.0% missing BUSCOs (Supplementary File 1, Supplementary Table S3). The quality of the assembled B. paralicheniformis T7 strain genome was further verified using the CheckM2 tool (Supplementary File 1, Supplementary Table S4; Chklovski et al., 2023). Phylogenetic analysis was performed using CSI Phylogeny v.1.4 (Call SNPs and Infer Phylogeny) (Kaas et al., 2014) via the Center for Genomic Epidemiology (CGE) web service. As input, genome assemblies (FASTA format) of the studied strain and selected reference genomes were analyzed using a reference-based SNP-calling workflow implemented in CSI Phylogeny, with the B. paralicheniformis genome serving as the reference. Default alignment and SNP-calling settings were applied. SNPs were filtered based on a minimum depth at SNP positions of 10×, a minimum relative depth of 10%, a minimum distance between SNPs (pruning) of 10 bp, a minimum SNP quality score of 30, a minimum mapping quality of 25, and a minimum Z-score of 1.96. High-confidence SNPs were concatenated into a single alignment, and a maximum-likelihood phylogenetic tree was inferred using FastTree as implemented in CSI Phylogeny. Branch support values were estimated using the Shimodaira–Hasegawa-like (SH-like) approximate likelihood ratio test. The resulting phylogenetic tree was visualized and annotated using iTOL v.6 (Letunic and Bork, 2024).

Figure 1
Circular genome map of Bacillus paralicheniformis T7. The map displays color-coded features: coding sequences (cds) in purple, transfer RNA (tRNA) in blue, ribosomal RNA (rRNA) in green, and GC content in black. GC skew+ is highlighted in green, GC skew- in purple, and CARD in red. Genes are labeled around the map, with positions marked in million base pairs (Mbp).

Figure 1. Circular genome representation of de novo assembled Bacillus paralicheniformis T7 genome. From the outside to the center: (1) predicted protein-coding sequences (CDSs) on the forward strand; (2) predicted protein-coding sequences (CDSs) on the reverse strand; (3) tRNA genes; (4) rRNA genes; (5) GC content plotted relative to the genome average; (6) GC skew, where positive values indicate G > C (green) and negative values indicate C > G (purple). Antimicrobial resistance genes identified using the CARD database are highlighted. The genome size and coordinate scale (in Mbp) are shown in the inner circle. The figure provides an overview of genome organization, gene distribution, and compositional features of the B. paralicheniformis T7 chromosome.

2.8 Morphological and biochemical features of the strain

The present study provides the genetic sequence of strain T7, which was isolated from soil in Kazakhstan. The bacterial cells are Gram-positive, rod-shaped, motile, and capable of sporulation. Bacterial cells occur both singly and in chains. When cultured on LB agar at 37 °C for 16 h, the strain forms milky, irregular colonies (2–4 mm in diameter), characterized by a glossy surface and cloudy, opaque regions. When cultured in nutrient broth under aerobic conditions at 37 °C with shaking (150 rpm), the strain shows profuse growth within 24 h. During growth, the medium becomes cloudy and acquires a characteristic odor, and bacterial flakes form in the culture medium. A difficult-to-break film forms at the liquid-air interface. Based on these characteristics, the strain was classified as belonging to the genus Bacillus. Sequencing of the 16S rRNA gene identified the strain as B. paralicheniformis (100% identity). B. paralicheniformis T7 grew well in both nutrient and lysogeny broths. When cultivated on milk, keratin, and gelatin agar, clearance zones appeared, indicative of its protease, keratinase, and gelatinase activities. The strain grows across a wide temperature range of 30 °C–55 °C and a pH range of 6.0–8.5. These phenotypic and biochemical characteristics are typical of Bacillus species and are consistent with those of other B. paralicheniformis species (Dunlap et al., 2015). When cultivated in a bioreactor, B. paralicheniformis T7 achieves maximum proteolytic activity within 24 h, which is shorter than that reported for Bacillus subtilis KT004404 (Rehman et al., 2017) and Bacillus sp. CL33A (Ignatova et al., 1999). After 24 h of submerged fermentation in a bioreactor on minimal feather medium (0.3 g/L NaH2PO4, 0.35 g/L Na2HPO4, 7.5 g/L feather powder, pH 7.0), the culture supernatant demonstrated keratinase, protease, collagenase, milk-clotting, amylase, esterase, phosphatase, and phytase activities. The specific activities and their respective substrates are detailed in Table 1. For comparative context, the keratinase activity of Bacillus licheniformis ALW1 is 72.2 U/mL (Abdel-Fattah et al., 2018); the protease activity of Bacillus cereus FT and Bacillus sp. DPUA 1728 is 187 and 86.27 U/mL, respectively (Lima et al., 2015; Asha and Palanisamy, 2018); and the amylase activity of B. licheniformis 104 K is 163 U/mL (Kholikov et al., 2025). The high levels of keratinase, protease, and amylase, in combination with collagenase, amylase, esterase, phosphatase, and phytase activities, indicate that the B. paralicheniformis T7 strain has significant potential as a source of multienzyme preparations (Aktayeva and Khassenov, 2024a). This strain is capable of effectively hydrolyzing keratin-containing raw materials, particularly bird feathers, horns, hooves, wool, and cattle skin, resulting in the release of peptones and free amino acids (Aktayeva and Khassenov, 2024b). While keratinase and protease activity have also been observed in B. paralicheniformis MKU3, this strain differs from B. paralicheniformis T7 by the lack of collagenase activity (Santha Kalaikumari et al., 2019). During 5 days of cultivation on Difco sporulation medium, Arret-Kirschbaum agar, or modified nutrient agar, the strain produced endospores capable of withstanding a temperature of 121 °C and a pressure of 1.1 bar for 20 min. Additionally, this strain was sensitive to the following antibiotics: ampicillin, cefazolin, chloramphenicol, ciprofloxacin, clindamycin, erythromycin, gentamicin, kanamycin, nalidixic acid, penicillin, rifampicin, streptomycin, tetracycline, and tobramycin.

Table 1
www.frontiersin.org

Table 1. Enzymatic activity of the supernatant from Bacillus paralicheniformis T7 after submerged fermentation. All measurements were performed three times independently, and the average of the three replicates was reported as the defined result with standard deviation (±SD).

Zymographic analysis using protease inhibitors—PMSF, EDTA, E64, and pepstatin A—complemented by proteomic studies, revealed that this strain secretes enzymes with molecular masses ranging from 20 to 60 kDa. These enzymes predominantly belong to the S8 and S41 families of serine peptidases (James, 1978) and the M14, M42, and M55 families of metallopeptidases (Hazra et al., 2012), with peak activity observed at 60 °C and pH 9.0. The enzyme extract of the strain has amylase activity, which reaches a maximum at pH 7.0 and at 85 °C. The strain’s high production capacity and bioreactor fermentation efficiency, along with its protease and keratinase activities, render it a promising candidate for industrial applications.

2.9 Genomic data

The whole genome of B. paralicheniformis T7 was sequenced to provide a comprehensive understanding of its genetic characteristics. Aseptic cultures of the strain were cultivated in 10 mL of nutritional broth in a shaking incubator at 37 °C and 150 rpm for 18 h. After cultivation, the cells were harvested by centrifugation at 6,000 g for 7 min at 4 °C.

De novo assembly yielded a single circular contig of 4,360,494 bp exhibiting a high mean contig coverage of 360X (Supplementary File 1). Whole-genome assembly annotation via the PATRIC database identified a total of 4,652 protein-coding sequences, along with 82 tRNA molecules, 38 repetitive elements, and 24 rRNA molecules (Supplementary File 2). Among the identified proteins, 972 were classified as hypothetical, and 3,680 were assigned functional roles. Within the functional assignments, 1,099 proteins were associated with Enzyme Commission (EC) numbers, 926 with Gene Ontology (GO) terms, and 807 were mapped to KEGG pathways (Supplementary File 3). Comparative genomic analyses and phylogenetic tree reconstruction were performed using Bacillus genus reference strains retrieved from the NCBI database (Figure 2).

Figure 2
Phylogenetic tree depicting the evolutionary relationships among various Bacillus species. Branches are labeled with distance values indicating evolutionary divergence. Species include Bacillus amyloliquefaciens, Bacillus mojavensis, Bacillus vallismortis, Bacillus subtilis, Bacillus spizizenii, Bacillus sp. NSP9.1, Bacillus sonorensis, Bacillus licheniformis, and Bacillus paralicheniformis, each denoted with strain identifiers.

Figure 2. Phylogenetic tree depicting the evolutionary relationships among various Bacillus species. Branches are labeled with distance values indicating evolutionary divergence. Species include Bacillus amyloliquefaciens, Bacillus mojavensis, Bacillus vallismortis, Bacillus subtilis, Bacillus spizizenii, Bacillus sp. NSP9.1, Bacillus sonorensis, Bacillus licheniformis, and Bacillus paralicheniformis, each denoted with strain identifiers.

The whole-genome assembly for B. paralicheniformis T7 was deposited in NCBI GenBank under accession number CP124861 and BioProject number PRJNA967188. This assembly provides an extensive overview of the genetic information contained in this strain and facilitates the identification of the specific genetic components responsible for the observed morphological characteristics. The analytical pipeline encompassed bacterial culture, growth monitoring under various conditions, genomic DNA isolation, library creation, Oxford Nanopore sequencing, de novo genome assembly, genome annotation, and comparative genomic analysis. The final assembly is publicly available in NCBI GenBank to support additional comparative analyses and research.

3 Limitations

The genome sequence of the B. paralicheniformis T7 strain was generated using Oxford Nanopore technology, allowing for complete genome assembly. Nanopore sequencing enables the direct detection of nucleotides without the need for further DNA synthesis (Branton et al., 2008; Deamer et al., 2016) or imaging devices for nucleotide detection. By enhancing portability, this technology significantly reduces the initial cost of whole-genome sequencing (Branton et al., 2008). The fidelity of this sequencing method depends on the frequency at which a DNA strand is translocated through the pore (Mikheyev and Tin, 2014). This approach has exhibited robust performance in studies involving whole-genome sequencing (Mussakhmetov et al., 2024). Through the application of updated and validated bioinformatics pipelines and high sequencing coverage, the final genome assembly yielded a single circular contig.

Data availability statement

The data reported in the present paper are readily available and can be accessed from NCBI GenBank under the BioProject number PRJNA967188.

Author contributions

AM: Formal Analysis, Methodology, Writing – original draft, Data curation, Investigation, Validation. SA: Investigation, Formal Analysis, Data curation, Validation, Methodology, Writing – original draft. AS: Validation, Methodology, Writing – original draft. AD: Methodology, Software, Formal Analysis, Writing – original draft, Visualization. BK: Supervision, Conceptualization, Methodology, Writing – review and editing, Writing – original draft, Investigation, Funding acquisition, Validation, Resources. UK: Resources, Funding acquisition, Visualization, Formal Analysis, Writing – review and editing, Investigation, Writing – original draft, Software, Methodology.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was funded by the Science Committee of the Ministry of Science and Higher Education of the Republic of Kazakhstan (Grants No. AP23488270 and AP23490594).

Acknowledgements

Concerning the ONT MinION sequencing platform, the authors would like to acknowledge A. Shevtsov and A. Amirgazin from the National Center for Biotechnology.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2026.1720096/full#supplementary-material

Footnotes

Abbreviations:ONT, Oxford Nanopore Technology; DNA, Deoxyribonucleic acid; bp, base pair; NCBI, National Center for Biotechnology Information.

References

Abdel-Fattah, A. M., El-Gamal, M. S., Ismail, S. A., Emran, M. A., and Hashem, A. M. (2018). Biodegradation of feather waste by keratinase produced from newly isolated Bacillus licheniformis ALW1. J. Genet. Eng. Biotechnol. 16, 311–318. doi:10.1016/j.jgeb.2018.05.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Abdelgalil, S. A., Soliman, N. A., Abo-Zaid, G. A., and Abdel-Fattah, Y. R. (2021). Dynamic consolidated bioprocessing for innovative lab-scale production of bacterial alkaline phosphatase from Bacillus paralicheniformis strain APSO. Sci. Rep. 11, 6071. doi:10.1038/s41598-021-85207-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Ahire, J. J., Kashikar, M. S., Lakshmi, S. G., and Madempudi, R. (2020). Identification and characterization of antimicrobial peptide produced by indigenously isolated Bacillus paralicheniformis UBBLi30 strain. 3 Biotech. 10, 112. doi:10.1007/s13205-020-2109-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Akhtar, M. A., Butt, M. Q. S., Afroz, A., Rasul, F., Irfan, M., Sajjad, M., et al. (2024). Approach towards sustainable leather: characterization and effective industrial application of proteases from Bacillus sps. for ecofriendly dehairing of leather hide. Int. J. Biol. Macromol. 266, 131154. doi:10.1016/j.ijbiomac.2024.131154

PubMed Abstract | CrossRef Full Text | Google Scholar

Akishev, Z., Aktayeva, S., Kiribayeva, A., Abdullayeva, A., Baltin, K., Mussakhmetov, A., et al. (2022). Obtaining of recombinant camel chymosin and testing its milk-clotting activity on cow’s, goat’s, ewes’, camel’s and mare’s milk. Biology 11, 1545. doi:10.3390/biology11111545

PubMed Abstract | CrossRef Full Text | Google Scholar

Aktayeva, S., and Khassenov, B. (2024a). High keratinase and other types of hydrolase activity of the new strain of Bacillus paralicheniformis. PLOS ONE 19, e0312679. doi:10.1371/journal.pone.0312679

PubMed Abstract | CrossRef Full Text | Google Scholar

Aktayeva, S., and Khassenov, B. (2024b). New Bacillus paralicheniformis strain with high proteolytic and keratinolytic activity. Sci. Rep. 14, 22621. doi:10.1038/s41598-024-73468-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Aktayeva, S., Baltin, K., Kiribayeva, A., Akishev, Z., Silayev, D., Ramankulov, Y., et al. (2022). Isolation of Bacillus sp. A5.3 strain with keratinolytic activity. Biol. (Basel) 11, 244. doi:10.3390/biology11020244

PubMed Abstract | CrossRef Full Text | Google Scholar

Alane Beatriz Vermelho, A. M. M., and Sabrina Martins Lage, C. (2013). Keratinases: detection methods, methods to determine enzymatic activity. arXiv. doi:10.2174/97816080530011130101

CrossRef Full Text | Google Scholar

Asha, B., and Palanisamy, M. P. (2018). Optimization of alkaline protease production by Bacillus cereus FT 1isolated from soil. J. Appl. Pharm. Sci. 8 (02), 119–127. doi:10.7324/JAPS.2018.8219

CrossRef Full Text | Google Scholar

Bergey, D. H., and Holt, J. G. (1994). Bergey's manual of determinative bacteriology. Baltimore: Lippincott Williams and Wilkins.

Google Scholar

Božić, N., Rozeboom, H. J., Lončar, N., Slavić, M., Janssen, D. B., and Vujčić, Z. (2020). Characterization of the starch surface binding site on Bacillus paralicheniformis α-amylase. Int. J. Biol. Macromol. 165, 1529–1539. doi:10.1016/j.ijbiomac.2020.10.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Branton, D., Deamer, D. W., Marziali, A., Bayley, H., Benner, S. A., Butler, T., et al. (2008). The potential and challenges of nanopore sequencing. Nat. Biotechnol. 26, 1146–1153. doi:10.1038/nbt.1495

PubMed Abstract | CrossRef Full Text | Google Scholar

Chavira, R., Jr., Burnett, T. J., and Hageman, J. H. (1984). Assaying proteinases with azocoll. Anal. Biochem. 136, 446–450. doi:10.1016/0003-2697(84)90242-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Chklovski, A., Parks, D. H., Woodcroft, B. J., and Tyson, G. W. (2023). CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning. Nat. Methods 20, 1203–1212. doi:10.1038/s41592-023-01940-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Choi, Y. M., Suh, H. J., and Kim, J. M. (2001). Purification and properties of extracellular phytase from bacillus sp. KHU-10. J. Protein Chem. 20, 287–292. doi:10.1023/A:1010945416862

PubMed Abstract | CrossRef Full Text | Google Scholar

Choyam, S., Jain, P. M., and Kammara, R. (2021). Characterization of a potent new-generation antimicrobial peptide of Bacillus. Front. Microbiol. 12, 710741. doi:10.3389/fmicb.2021.710741

PubMed Abstract | CrossRef Full Text | Google Scholar

Coêlho, D. F., Saturnino, T. P., Fernandes, F. F., Mazzola, P. G., Silveira, E., and Tambourgi, E. B. (2016). Azocasein substrate for determination of proteolytic activity: reexamining a traditional method using bromelain samples. BioMed Res. Int. 2016, 8409183. doi:10.1155/2016/8409183

PubMed Abstract | CrossRef Full Text | Google Scholar

Contesini, F. J., Melo, R. R., and Sato, H. H. (2018). An overview of bacillus proteases: from production to application. Crit. Rev. Biotechnol. 38, 321–334. doi:10.1080/07388551.2017.1354354

PubMed Abstract | CrossRef Full Text | Google Scholar

Deamer, D., Akeson, M., and Branton, D. (2016). Three decades of nanopore sequencing. Nat. Biotechnol. 34, 518–524. doi:10.1038/nbt.3423

PubMed Abstract | CrossRef Full Text | Google Scholar

Dunlap, C. A., Kwon, S. W., Rooney, A. P., and Kim, S. J. (2015). Bacillus paralicheniformis sp. Nov., isolated from fermented soybean paste. Int. J. Syst. Evol. Microbiol. 65, 3487–3492. doi:10.1099/ijsem.0.000441

PubMed Abstract | CrossRef Full Text | Google Scholar

Ganesh Kumar, A., Hinduja, M., Sujitha, K., Nivedha Rajan, N., and Dharani, G. (2021). Biodegradation of polystyrene by deep-sea Bacillus paralicheniformis G1 and genome analysis. Sci. Total Environ. 774, 145002. doi:10.1016/j.scitotenv.2021.145002

PubMed Abstract | CrossRef Full Text | Google Scholar

Grant, J. R., Enns, E., Marinier, E., Mandal, A., Herman, E. K., Chen, C. Y., et al. (2023). Proksee: in-depth characterization and visualization of bacterial genomes. Nucleic Acids Res. 51, W484–W492. doi:10.1093/nar/gkad326

PubMed Abstract | CrossRef Full Text | Google Scholar

Hazra, S., Guha, R., Jongkey, G., Palui, H., Mishra, A., Vemuganti, G. K., et al. (2012). Modulation of matrix metalloproteinase activity by EDTA prevents posterior capsular opacification. Mol. Vis. 18, 1701–1711.

PubMed Abstract | Google Scholar

Ignatova, Z., Gousterova, A., Spassov, G., and Nedkov, P. (1999). Isolation and partial characterisation of extracellular keratinase from a wool degrading thermophilic actinomycete strain Thermoactinomyces candidus. Can. J. Microbiol. 45, 217–222. doi:10.1139/w98-230

PubMed Abstract | CrossRef Full Text | Google Scholar

James, G. T. (1978). Inactivation of the protease inhibitor phenylmethylsulfonyl fluoride in buffers. Anal. Biochem. 86, 574–579. doi:10.1016/0003-2697(78)90784-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaas, R. S., Leekitcharoenphon, P., Aarestrup, F. M., and Lund, O. (2014). Solving the problem of comparing whole bacterial genomes across different sequencing platforms. PLoS ONE 9 (8), e104984. doi:10.1371/journal.pone.0104984

PubMed Abstract | CrossRef Full Text | Google Scholar

Kholikov, A., Vokhidov, K., Murtozoyev, A., Tóth, Z. S., Nagy, G. N., Vértessy, B. G., et al. (2025). Characterization of a thermostable α-Amylase from Bacillus licheniformis 104.K for industrial applications. Microorganisms 13, 1757. doi:10.3390/microorganisms13081757

PubMed Abstract | CrossRef Full Text | Google Scholar

Kiribayeva, A., Silayev, D., Akishev, Z., Baltin, K., Aktayeva, S., Ramankulov, Y., et al. (2024). An impact of N-glycosylation on biochemical properties of a recombinant α-amylase from Bacillus licheniformis. Heliyon 10, e28064. doi:10.1016/j.heliyon.2024.e28064

PubMed Abstract | CrossRef Full Text | Google Scholar

Kolmogorov, M., Yuan, J., Lin, Y., and Pevzner, P. A. (2019). Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546. doi:10.1038/s41587-019-0072-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Letunic, I., and Bork, P. (2024). Interactive tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 52 (W1), W78–W82. doi:10.1093/nar/gkae268

PubMed Abstract | CrossRef Full Text | Google Scholar

Lima, L. A., Cruz Filho, R. F., Dos Santos, J. G., and Silva, W. C. (2015). Protease with collagenolytic activity produced by Bacillus sp. DPUA 1728 from Amazonian soil. Braz J. Microbiol. 46, 1217–1223. doi:10.1590/s1517-83822015005030656

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, X., Lee, C. G., Casale, E. S., and Shih, J. C. (1992). Purification and characterization of a keratinase from a feather-degrading Bacillus licheniformis strain. Appl. Environ. Microbiol. 58, 3271–3275. doi:10.1128/aem.58.10.3271-3275.1992

PubMed Abstract | CrossRef Full Text | Google Scholar

Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A., and Zdobnov, E. M. (2021). BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38, 4647–4654. doi:10.1093/molbev/msab199

PubMed Abstract | CrossRef Full Text | Google Scholar

Matuschek, E., Brown, D. F., and Kahlmeter, G. (2014). Development of the EUCAST disk diffusion antimicrobial susceptibility testing method and its implementation in routine microbiology laboratories. Clin. Microbiol. Infect. 20, O255–O266. doi:10.1111/1469-0691.12373

PubMed Abstract | CrossRef Full Text | Google Scholar

Mikheyev, A. S., and Tin, M. M. Y. (2014). A first look at the Oxford nanopore MinION sequencer. Mol. Ecol. Resour. 14, 1097–1102. doi:10.1111/1755-0998.12324

PubMed Abstract | CrossRef Full Text | Google Scholar

Muras, A., Romero, M., Mayer, C., and Otero, A. (2021). Biotechnological applications of Bacillus licheniformis. Crit. Rev. Biotechnol. 41, 609–627. doi:10.1080/07388551.2021.1873239

PubMed Abstract | CrossRef Full Text | Google Scholar

Mussakhmetov, A., Kiribayeva, A., Daniyarov, A., Bulashev, A., Kairov, U., and Khassenov, B. (2024). Genome sequence and assembly of the amylolytic Bacillus licheniformis T5 strain isolated from Kazakhstan soil. BMC Genom. Data. 25, 3. doi:10.1186/s12863-023-01177-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Ngom, S. I., Maski, S., Rached, B., Chouati, T., Oliveira Correia, L., Juste, C., et al. (2023). Exploring the hemicellulolytic properties and safety of Bacillus paralicheniformis as stepping stone in the use of new fibrolytic beneficial microbes. Sci. Rep. 13, 22785. doi:10.1038/s41598-023-49724-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Olson, R. D., Assaf, R., Brettin, T., Conrad, N., Cucinell, C., Davis, J. J., et al. (2023). Introducing the bacterial and viral bioinformatics resource center (BV-BRC): a resource combining PATRIC, IRD and ViPR. Nucleic Acids Res. 51, D678–D689. doi:10.1093/nar/gkac1003

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramzan, R., Ali, A., Farooq, M., Ali, H., Khaliq, S., and Rashid, M. H. (2024). Draft genome sequence of extracellular enzyme-producing Bacillus paralicheniformis strain NBG-07. Microbiol. Resour. Announc 13, e0009624. doi:10.1128/mra.00096-24

PubMed Abstract | CrossRef Full Text | Google Scholar

Rehman, R., Ahmed, M., Siddique, A., Hasan, F., Hameed, A., and Jamal, A. (2017). Catalytic role of thermostable metalloproteases from Bacillus subtilis KT004404 as dehairing and destaining agent. Appl. Biochem. Biotechnol. 181, 434–450. doi:10.1007/s12010-016-2222-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Roslan, M. A.M., Jefri, N., Ramlee, N., Rahman, N. A. A., Chong, N. H. H., Bunawan, H., et al. (2021). Enhancing food waste biodegradation rate in a food waste biodigester with the synergistic action of hydrolase-producing Bacillus paralicheniformis GRA2 and Bacillus velezensis TAP5 co-culture inoculation. Saudi J. Biol. Sci. 28, 3001–3012. doi:10.1016/j.sjbs.2021.02.041

PubMed Abstract | CrossRef Full Text | Google Scholar

Sanger, F., Nicklen, S., and Coulson, A. R. (1977). DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. U. S. A. 74, 5463–5467. doi:10.1073/pnas.74.12.5463

PubMed Abstract | CrossRef Full Text | Google Scholar

Santha Kalaikumari, S., Vennila, T., Monika, V., Chandraraj, K., Gunasekaran, P., and Rajendhran, J. (2019). Bioutilization of poultry feather for keratinase production and its application in leather industry. J. Clean. Prod. 208, 44–53. doi:10.1016/j.jclepro.2018.10.076

CrossRef Full Text | Google Scholar

Santha Kalaikumari, S., Sivakumar, R., Gunasekaran, P., and Rajendhran, J. (2021). Whole-genome sequencing and mining of protease coding genes in Bacillus paralicheniformis MKU3, and its degradomics in feather meal medium. Curr. Microbiol. 78, 206–217. doi:10.1007/s00284-020-02271-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Sharma, D., Uniyal, S., and Tewari, L. (2021). Biocatalytic transesterification of algal oil employing a heterogenous methanol tolerant lipase enzyme aggregate from Bacillus mycoides strain CV18. Process Biochem. 111, 43–52. doi:10.1016/j.procbio.2021.10.005

CrossRef Full Text | Google Scholar

Thakor, R., Mistry, H., Almoallim, H. S., Ansari, M. J., Patel, A., Yadav, V. K., et al. (2025). Enhanced synthesis, purification, and characterization of a marine bacterial consortium-derived protease enzyme with destaining and keratinolytic activity. Biotechnol. Appl. Biochem. 72, 963–973. doi:10.1002/bab.2711

PubMed Abstract | CrossRef Full Text | Google Scholar

Wattam, A. R., Davis, J. J., Assaf, R., Boisvert, S., Brettin, T., Bun, C., et al. (2017). Improvements to PATRIC, the all-bacterial bioinformatics database and analysis resource center. Nucleic Acids Res. 45, D535–D542. doi:10.1093/nar/gkw1017

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, T., Yong, X., Zhao, Z., Dolce, V., Li, Y., and Curcio, R. (2021). Research status of bacillus phytase. 3 Biotech. 11, 415. doi:10.1007/s13205-021-02964-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, J., Ma, M., Yan, X., Zhang, G., Xia, J., Zeng, G., et al. (2022). Expression and characterization of a novel lipase from Bacillus licheniformis NCU CS-5 for application in enhancing fatty acids flavor release for low-fat cheeses. Food Chem. 368, 130868. doi:10.1016/j.foodchem.2021.130868

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Bacillus paralicheniformis, Kazakhstan, oxford nanopore sequencing, proteolytic activity, whole genome

Citation: Mussakhmetov A, Aktayeva S, Sarsen A, Daniyarov A, Khassenov B and Kairov U (2026) Whole-genome sequence and assembly of the sporogenic Bacillus paralicheniformis T7 strain with high proteolytic and amylolytic activities. Front. Genet. 17:1720096. doi: 10.3389/fgene.2026.1720096

Received: 07 October 2025; Accepted: 12 January 2026;
Published: 21 January 2026.

Edited by:

Ramasamy Dhamodharan, Sri Balaji Vidyapeeth University, India

Reviewed by:

Ximena Blanco Crivelli, University of Buenos Aires, Argentina

Copyright © 2026 Mussakhmetov, Aktayeva, Sarsen, Daniyarov, Khassenov and Kairov. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Bekbolat Khassenov, a2hhc3Nlbm92QGJpb2NlbnRlci5reg==; Ulykbek Kairov, dWx5a2Jlay5rYWlyb3ZAbnUuZWR1Lmt6

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.