ORIGINAL RESEARCH article

Front. Plant Sci., 02 March 2026

Sec. Plant Symbiotic Interactions

Volume 17 - 2026 | https://doi.org/10.3389/fpls.2026.1718185

Development of molecular biomarkers for monitoring of arable crops colonization with Methylobacterium symbioticum SB0023/3, a methylotrophic bacterium commonly used as a biostimulant in agriculture

  • 1. GOBIO GmbH - Institute for Aquatic Ecology and Applied Biology, Aarbergen, Germany

  • 2. Laboratory of Applied Microbial Ecology, Institute of Bioengineering, Faculty of Biology, University of Warsaw, Warsaw, Poland

Abstract

Introduction:

The intensive use of synthetic fertilizers has led to increased nitrous oxide emissions, declining soil fertility, and reduced biodiversity. Biological alternatives, such as the use of endophytic bacteria to improve plant growth, are promising alternatives but require reliable monitoring tools to assess colonization success and biological effectiveness under field conditions. One of the most commonly used microbial biostimulants is Methylobacterium symbioticum SB0023/3; thus, monitoring the efficacy of inoculation and maintenance of this strain is required for adequate evaluation of fertilization practices.

Methodology:

The resequencing of the genome of M. symbioticum SB0023/3, followed by comparative genomics and functional annotation were performed. Specific real-time PCR primers were developed and validated for strain-specific detection. The colonization of various crops (wheat, corn, rapeseed, peas, and tomatoes) was tested under controlled conditions using developed molecular markers.

Results:

The resequencing of the SB0023/3 genome revealed novel genetic content and updated previous records. The resequenced genome showed 121 novel regions with 165 protein-coding genes and five tRNA. Based on the newly obtained genome, two highly specific biomarker genes (copG and ubik) were identified and, together with the (Methylobacterium spp./methylotrophs-specific) xoxF gene, validated for their strain/genus-specificity. The developed real-time PCR assays using copG and ubik biomarkers demonstrated high specificity for M. symbioticum SB0023/3, distinguishing it from related species. In contrast, the xoxF gene showed relaxed specificity and cannot be used for SB0023/3 detection. Successful endophytic colonization was confirmed in all tested crops, with high detection rates exceeding 80% in tomatoes. Classical culturing on a novel nitrogen-free medium additionally confirmed colonization, with the same validating the real-time PCR assays.

Discussion:

This study provides a robust, genome-informed molecular detection system for monitoring M. symbioticum SB0023/3 in crops. The presented approach enables direct detection from plant tissues, facilitating studies on colonization dynamics and biosafety. This methodology can be extended to other microbial biostimulants, supporting sustainable agricultural practices.

1 Introduction

The intensive use of synthetic fertilizers has been a major driver of the drastic increase in nitrous oxide emissions (Tian et al., 2020). It is also recognized as an important contributor to the decline in soil fertility (Tripathi et al., 2020) and the alteration or loss of natural biodiversity (Pahalvi et al., 2021), one of the most pressing global crises (Newbold et al., 2016).

The substitution of synthetic nitrogen fertilizers with biological nitrogen sources, therefore, represents both a promising strategy and a major challenge on the path toward sustainable agriculture (Mus et al., 2016). Among the various approaches, the inoculation of non-leguminous crops with endophytic diazotrophic bacteria has been extensively investigated as a means to improve plant nitrogen supply (Soumare et al., 2020). Field trials are critical to evaluate both colonization and functional efficacy of these bacteria under agricultural conditions. While numerous greenhouse and laboratory studies demonstrate plant growth–promoting effects of microbial biostimulants with respect to yield and quality (Silveira et al., 2016), comparatively only a few field trials have provided direct evidence of successful endophytic colonization. Moreover, field results are often inconsistent, with instances showing limited or no measurable effects. In some cases, improvements in plant performance are used as indirect evidence of colonization, but direct confirmation is frequently lacking (De Almeida Lopes et al., 2016; Ahmad et al., 2022; Torres Vera et al., 2024; Tsipinana et al., 2024). The scarcity of rigorously documented field studies is largely due to the absence of simple, reliable, and strain-specific detection methods.

Traditional approaches, such as enumeration of microorganisms on selective nutrient media, generally fail to differentiate natural from artificially introduced endophytic colonization (Zinniel et al., 2002). While 16S rRNA gene sequencing provides robust genus-level identification, species-level resolution is often constrained by the quality of available reference databases and methodological biases, including interference from plant DNA, intragenomic variation in 16S rRNA operons, and PCR artifacts (Church et al., 2020). This highlights the urgent need to develop new, strain-specific molecular biomarkers and detection systems suitable for monitoring inoculated microorganisms with biofertilizing and/or biostimulating potential in agricultural settings.

One of the most widely used microbial biofertilizers/biostimulants, with applications covering more than 2 million hectares worldwide in 2024 (M. Dzikowski, Corteva Agriscience, personal communication, July 8th, 2025), contains Methylobacterium symbioticum SB0023/3, a Gram-negative alphaproteobacterium. This species is characterized by its pink pigmentation, attributable to carotenoid biosynthesis, and its ability to utilize C1 carbon sources such as methanol (Toyama et al., 1998; Van Dien et al., 2003). M. symbioticum can grow in nitrogen-free media, although its nitrogen fixation mechanism remains unclear (Giller et al., 2025). Nonetheless, its plant growth-promoting effects are well documented, supporting its continued use in crop production (Agafonova et al., 2013; Dourado et al., 2015; Pascual et al., 2020; Juma et al., 2022; Pappalettere et al., 2024). However, its monitoring in the field and assessment of colonization efficiency remain challenging.

In this study, we performed resequencing of M. symbioticum SB0023/3 (first reported genomic sequence: GenBank acc. no. GCF_902141845.1), intending to refine the genomic reference and identify additional genetic features relevant to its ecological function. Particular emphasis was placed on the systematic identification of molecular biomarkers that could facilitate strain-specific biomonitoring, which fills the significant knowledge gap for the application of that agriculturally-relevant strain. These biomarkers are designed to be compatible with high-throughput laboratory methods and adaptable to diverse agricultural contexts, including various crop species, biological matrices, and potential vectors that may influence the bacterium’s distribution in agroecosystems. Establishing such a methodological framework is critical for the development of robust molecular tools to enable precise tracking of M. symbioticum SB0023/3 during field applications and to advance our understanding of its ecological interactions. It is also important to emphasize that the development of strain-specific biomarkers is critical, as it provides an advantage over biomarkers specific to an entire functional group or genus (e.g., the xoxF gene), as well as over general bacterial biomarkers (e.g., the 16S rRNA gene). The latter requires further bioinformatic analysis and lacks specificity toward the target group of interest, instead providing insight into the overall environmental biodiversity. Additionally, the use of strain-specific biomarkers and molecular methods offers advantages over classical approaches, including cultivation-based methods, which are time-consuming and costly. Overall, the study aims to fill a significant methodological gap by providing a genomic, strain-specific tool for directly tracking M. symbioticum SB0023/3 in plants, which, to our knowledge, has been largely lacking in previous work on Methylobacterium and other bacterial biostimulants. Furthermore, this work proposes a pipeline for the general implementation of the presented methods for use in biomonitoring.

2 Materials and methods

2.1 Bacterial strains and cultivation conditions

M. symbioticum SB0023/3 was obtained from Corteva Agriscience GmbH, Munich, Germany. Lyophilized M. symbioticum (1.7 mg/ml) was dissolved in DPBS buffer (Carl Roth, Karlsruhe, Germany). Then the strain was cultivated on a nitrogen-free medium (Cf-Nf). Pink colonies were seen after 8 days of incubation at 28 °C. The taxonomy of the strain was confirmed via 16S rDNA sequencing.

In this study, a novel nitrogen-free medium – Cf-Nf – was developed, and methanol was used as a single carbon source in this medium. The Cf-Nf medium was composed of: 17 g/L Noble Agar (Thermo Scientific, Darmstadt, Germany), K2HPO4 (0.5 g/L) (Merck, Darmstadt, Germany), MgSO4 × 7H2O (0.2 g/L) (Carl Roth, Karlsruhe, Germany), FeSO4 × 7H2O (0.1 g/L) (Carl Roth), CaCl2 × 2H2O (0.01 g/L) (Carl Roth), NaCl (0.1 g/L) (Life Technologies, Darmstadt, Germany). To this, 2 mL micronutrient solution containing CuSO4 × 5H2O (0.4 g/L) (Carl Roth), ZnSO4 × 7 H2O (0.12 g/L) (Carl Roth), MgSO4 × 7 H2O (0.2 g/L) (Carl Roth), H3BO3 (1.4 g/L) (Merck), Na2MoO4 × 2H2O (1 g/L) (Carl Roth), MnSO4 × H2O (1.5 g/L) (Carl Roth); 2 mL bromothymol blue solution (0,5g/100 mL 0,2M KOH) (Carl Roth); 4 mL Fe(III)-EDTA (1.64% final concentration) (Carl Roth); and 1 mL vitamin solution (10 mg biotin (Carl Roth) and 20 mg pyridoxal HCl (Carl Roth) in 100 mL H2O) was added. The micronutrient and vitamin solutions were autoclaved individually prior to their incorporation into the base medium. The pH level of the Cf-Nf medium was adjusted to 7. The Cf-Nf medium was autoclaved at 121 °C for 20 minutes. It was cooled down to 55 °C, and 2% methanol was added before plating.

The reference strains of M. radiotolerans DSM 760, M. mesophilicum DSM 1708 and M. dankookense SW08-7 (DSM 22415) were obtained from DSMZ (German Collection of Microorganisms and Cell Cultures) and cultured on nutrient agar [peptone from casein (10 g/L) (Carl Roth), yeast extract (5 g/L) (Carl Roth) and agar (15 g/L) (Carl Roth)]. For the DSM 760 and DSM 1708 strains, no growth was seen on the Cf-Nf selective medium. M. dankookense showed comparable growth to M. symbioticum on the selective nutrient medium, but was also cultivated on full nutrient medium, as were the other reference strains.

2.2 DNA isolation

M. symbioticum SB0023/3 was cultivated in R2A medium at 20 °C with shaking at 160 rpm for 48 hours. After incubation, the culture was centrifuged at 6,000 × g for 5 min (benchtop centrifuge 5424R, Eppendorf, Hamburg, Germany), and the resulting cell pellet was used for genomic DNA extraction with the Bacterial & Yeast Genomic DNA Purification Kit (EURx, Gdansk, Poland) according to the manufacturer’s instructions. To enhance cell lysis efficiency, lysozyme (Merck, Darmstadt, Germany) was added during the lysis step to a final concentration of 1 mg/mL. DNA concentration was measured using a Qubit™ 2.0 Fluorometer (Invitrogen, Carlsbad, CA, USA), while purity was assessed with a NanoReady Touch Microvolume Spectrophotometer (Life Real, Hangzhou, China) and further confirmed by electrophoresis in a 0.8% agarose gel.

2.3 Genomic DNA sequencing and processing

Whole-genome resequencing of M. symbioticum SB0023/3 was performed using both short-read (Illumina) and long-read (Oxford Nanopore Technologies, ONT) sequencing approaches. Illumina sequencing was carried out by Eurofins Genomics GmbH (Ebersberg, Germany). Libraries were prepared following the standard Eurofins protocol and sequenced on an Illumina NovaSeq 6000 platform with an S4 flow cell, generating 2 × 150 bp paired-end reads. For ONT sequencing, libraries were prepared using the Native Barcoding Kit 24 v14 (Oxford Nanopore Technologies) according to the manufacturer’s protocol, and sequencing was performed on an R10.4 flow cell with a MinION Mk1B device.

The Illumina dataset comprised 13.67 million reads with a total length of 2 Gbp. Raw reads were processed with fastp v0.23.4 in paired-end mode using the parameters --cut_tail --detect_adapter_for_pe --trim_poly_x --cut_window_size 10 --cut_mean_quality 25. The ONT dataset comprised 245 thousand reads, including 7300 duplex ones, with a total length of 576 Mbp. The reads were basecalled with DORADO v0.9.0 using dna_r10.4.1_e8.2_400bps_sup@v5.0.0 model, followed by barcode trimming and reads filtering based on the length to keep only reads longer than 2000 bp. Filtered reads were assembled with Unicycler v0.5.1 wrapping SPAdes v4.0.0 (Prjibelski et al., 2020), Racon v1.5.0 (Vaser et al., 2017), minimap2 v2.28 (Li, 2021), and miniasm v0.3 [https://github.com/lh3/miniasm]. The coverage was calculated using CoverM v0.7.0 (Aroney et al., 2025), wrapping samtools 1.19 (Danecek et al., 2021) and with either bwa-mem2 v2.1.1 (Vasimuddin et al., 2019) or minimap2 v2.28 for short and long reads, respectively. Complete plasmid replicons were reoriented using the dnaapler v1.3 (Bouras et al., 2024).

2.4 Genome annotation

The cleaned genome assembly was automatically annotated using bakta v1.7.0 (database v5.1) (Schwengers et al., 2021) and eggnog-mapper v2.1.12 (Huerta-Cepas et al., 2019; Cantalapiedra et al., 2021). The results were subsequently manually evaluated and refined with MAISEN v2.1.3 (Dziurzynski et al., 2021). During annotation, the following tools and databases were employed: tRNAscan-SE v2.0 (Chan et al., 2021), Aragorn (Laslett and Canback, 2004), Infernal (Nawrocki and Eddy, 2013), PilerCR (Edgar, 2007), Pyrodigal (Hyatt et al., 2010; Larralde, 2022), Diamond (Buchfink et al., 2021), BLAST+ (Camacho et al., 2009), PyHMMER (Eddy, 2011; Larralde, 2022), AMRFinderPlus (Feldgarden et al., 2021), HHPred (Hildebrand et al., 2009; Zimmermann et al., 2018), MMseqs2 (Steinegger and Söding, 2017), UniProt (The UniProt Consortium et al., 2021), RCSB Protein Data Bank (Burley et al., 2021), ECOD (Cheng et al., 2014), SCOPe (Chandonia et al., 2022), TIGRFAMs (Li et al., 2021), and the Conserved Domain Database (Wang et al., 2023).

2.5 Comparative genomics

Comparative genomics between the genomic sequences of M. symbioticum SB0023/3, i.e., GenBank acc. no. GCF_902141845.1, and the newly resequenced genome was performed using Panaroo v1.5.0 (Tonkin-Hill et al., 2020) with the parameters --clean-mode sensitive --threshold 0.98 --family_threshold 0.7, nucleotide BLAST using e-value 1e-20 as the threshold, and MMseqs2 clustering with 80% sequence coverage and identity thresholds. Additional searches were conducted against RefSeq-annotated genomes (Li et al., 2021) and the NCBI NR database using the parameters -s 7.5 -c 0.3 -e 1e-3 --cov_mode 0 (Sayers et al., 2022). The RefSeq-annotated assembly was downloaded from the RefSeq database and used as a reference for comparative analysis. Whole-genome comparisons at the nucleotide level were carried out to identify regions that were not conserved between the two assemblies.

2.6 Real-time PCR and validation of the specificity of the molecular markers

PCR primers were synthesized by Eurofins Genomics (Cologne, Germany). Amplifications were performed in 20 µl reaction volumes using a StepOnePlus Real-Time PCR System (Applied Biosystems, Thermo Fisher Scientific, Bensheim, Germany). Each reaction contained 100 ng of genomic DNA, 10 µl of PowerUp SYBR Green Master Mix (Thermo Fisher Scientific Baltics UAB), and 0.4 nM of each primer. Three primer pairs were used: (i) GoBio_Xoxf_f (5′-GGTCGAGCTTGGGATCGTAG-3′)/GoBio_Xoxf_r (5′-GAAGAAGGGCGAGACCAACA-3′); (ii) GoBio_CopG_f (5′-TCATCACCCAAGCCAACCAG-3′)/GoBio_CopG_r (5′-TCATGGTCGATCCGTCCTCT-3′); (iii) GoBio_Ubik_f (5′-GTTCATCGCCCTTGAGGTAG-3′)/GoBio_Ubik_r (5′-GTTCCTTGCGAATGCGGG-3′).

The thermal cycling protocol consisted of an initial denaturation at 95 °C for 2 min, followed by 40 cycles of 95 °C for 30 s, 60 °C for 30 s, and 72 °C for 45 s. Melt curve analysis was subsequently performed by increasing the temperature from 72 °C to 95 °C at a rate of 0.3 °C/s, with continuous fluorescence acquisition.

Melt curve analysis was used to confirm the presence of M. symbioticum in the tested samples. During amplification, SYBR Green binds to double-stranded DNA and emits fluorescence. Gradual denaturation of the amplicons released SYBR Green, resulting in decreased fluorescence. Plotting fluorescence intensity against temperature produced a melt curve characteristic of each amplicon. The melting profile was influenced by amplicon length, sequence composition, GC content, and strand complementarity. Samples were considered positive for M. symbioticum when the cycle threshold (Ct) value was ≤30 and the melting temperature (Tm) differed by no more than 0.5 °C from that of the positive control.

To evaluate the strain specificity of the molecular markers and the detection protocol, three additional Methylobacterium species: M. radiotolerans DSM 760, M. mesophilicum DSM 1708, and M. dankookense SW08–7 were used.

2.7 Plant cultivation and leaf samples collection

Experiments were carried out in a climate chamber (Model FPG-450, Faithful Instrument [Hebei] Co., Ltd., Cangzhou, China). Five arable plant species were selected for experimentation: wheat (Triticum aestivum), corn (Zea mays), oilseed rape (Brassica napus), pea (Pisum sativum), and tomato (Solanum lycopersicum). Seeds were cultivated in a hydroponic medium inside the chamber for 45 days under a 16 h light/8 h dark photoperiod and 60% relative humidity. Plants were divided into two groups: a control group and a treatment group inoculated with M. symbioticum SB0023/3. The inoculum was prepared by suspending M. symbioticum SB0023/3 in DPBS (Dulbecco’s Phosphate-Buffered Saline; VWR International, Belgium/France) at a concentration of 10,000 CFU/ml. The suspension was applied to the leaves of test plants using a standard hand pump sprayer.

2.8 Leaf samples preparation and DNA isolation

Leaves from untreated and inoculated plants were collected 14 (n = 6) and 28 (n = 3) days after treatment. Each sample originated from a different plant but from the same developmental level (youngest fully developed leaves). Due to the rapid growth of maize and oilseed rape, the available space in the climate chamber became limited; relocating plants would have compromised comparability, and therefore data from the second time point are not shown for those crops. No plant was sampled more than once to avoid potential effects of leaf removal. To prevent cross-contamination, untreated control plants were cultivated in a second climate chamber of the same design and with identical environmental conditions.

Leaf samples underwent surface sterilization through sequential washes: 3% sodium hypochlorite (Life Technologies) for 30 s, 3% hydrogen peroxide (35%, Carl Roth) for 30 s, 70% ethanol (VWR, Darmstadt, Germany) with 0.01% Tween-20 (Carl Roth) for 5 s, and 70% ethanol (VWR) for 5 s (Figure 1). Samples were dried with sterile filter paper, transferred into 15 ml Falcon tubes (Roth, Karlsruhe, Germany), and kept on dry ice. Homogenization was performed with a sterile pestle, after which 2 ml of DPBS buffer was added. Samples were incubated at 28 °C for 12 h. On the following day, leaf extracts were plated onto CF-Nf agar. The remaining extract was filtered through sterile gauze into a new 15 ml Falcon tube, supplemented with 2 ml DPBS containing 2% methanol, and incubated for another 12 h at 28 °C.

Figure 1

On the third day, samples were vortexed for 20 s and centrifuged (Heraeus Multifuge 4 KR, Thermo Scientific) at 4,650 × g for 25 min. The supernatant was discarded, and the pellet was resuspended in 100 µl nuclease-free distilled water. For DNA extraction, 12 µl of 20% Chelex 100 Resin (Sigma-Aldrich, St. Louis, MO, USA) was mixed with 38 µl of the leaf extract in 500 µl PCR tubes. Tubes were incubated at 95 °C for 15 min, cooled at 6 °C for 3 min, vortexed briefly, and centrifuged at 16,000 × g for 2 min. The supernatant was transferred to 200 µl PCR tubes, and the extracted DNA was stored at -20 °C until further analyses.

3 Results

3.1 Characterization of the resequenced M. symbioticum SB0023/3 genome

The hybrid assembly of the M. symbioticum SB0023/3 resulted in the reconstruction of a complete genome of a total length of 6,203,152 bp (Table 1). The genome is composed of a single circular chromosome, two circular plasmids (pMSB0023-3_1 and pMSB0023-3_2), and a prophage (pMSB0023-3_3) existing in the genome as an integrated prophage or an episome. The GC content varies from 65.9 – 70.3% with an average of 70.2%. Total coding density was estimated as 85.8%. The resequenced genome comprised 5,744 protein-coding genes, 61 tRNAs, 15 rRNAs, 12 ncRNAs, and 1 tmRNA (Table 1). Among the protein-coding genes, 533 were annotated as hypothetical proteins, and 289 carried domains of unknown function.

Table 1

Genomic featureComplete genomeChromosomepMSB0023-3_1pMSB0023-3_2Prophage as episome (genome location: 5,964,342.6,021,680)
Length (bp)6,203,1526,021,680123,59557,87757,339
Short/Long reads depth316×/83×313×/84×539×/170×232×/93×492×/158×
GC-content (%)70.270.266.665.969.7
CDSs5,7445,5641275484
tRNAs6160011
tmRNAs11000
rRNAs1515000
nsRNAs1212000
Regulatory ncRNAs1919000

General characteristics of the resequenced M. symbioticum SB0023/3 genome.

While complete genomes of all replicons were successfully reconstructed, it must be noted that the prophage genome (chromosome coordinates: 5,964,342.6,021,680) was also observed in an episome state. The assembly performed by Unicycler and further evaluation of the assembly graph indicated a dual-state of the prophage genome in part of the cell population, i.e., integrated within a chromosome or excised and circularized as an episome. This observation was confirmed by manual evaluation of reads overlapping the ends of the prophage genome as an episome, the adjacent regions of the chromosome of the prophage integration site, as well as reads stretching over the SB0023/3 chromosome and prophage genomes when integrated. Additionally, the coverage of the prophage region was approx. 1.6× higher compared to the chromosome.

Resequencing of the SB0023/3 genome identified 121 regions (at least 10-bp-long) across the chromosome that exclusively lacked significant similarity (e-value > 1e−10) to the previously deposited genome of M. symbioticum SB0023/3 (GenBank acc. no. GCF_902141845.1), with a total length of 88,574 bp. These novel regions contained predicted 165 protein- and 5 tRNA-coding genes (Supplementary Material Table 1). Comparative analysis with Panaroo revealed functional annotation discrepancies between the two assemblies, including the presence or absence of certain proteins. In total, 7 proteins were unique to the original SB0023/3 assembly, whereas 144 proteins were unique to the resequenced genome (Supplementary Material Table 2). The observed discrepancies in functional annotation strongly suggest discontinuity of the original only-Illumina-based sequencing due to repetitive regions and the presence of mobile genetic elements and related proteins. Among the overrepresented proteins in the complete genome, there are 16 insertion sequences (IS110 – 8, IS256 – 4, IS3 – 2, IS3/IS911 – 2), 5 integrase domain-containing proteins, and plasmid replication protein RepA. Importantly, proteins involved in the cell surface and envelope composition, in particular extracellular polysaccharides (EPS) metabolism, influencing the biofilm formation and the host-plant interaction of the SB0023/3 strain were also identified. These include polysaccharide biosynthesis/export proteins (Tripathi et al., 2020) together with polysaccharide chain length determinant domain-containing protein (Tian et al., 2020), UDP-glucose 6-dehydrogenase (Pahalvi et al., 2021), porin (Tripathi et al., 2020), transglycosylase (Tripathi et al., 2020), FtsI peptidoglycan transpeptidase (Tian et al., 2020), and DegP and DegP-like periplasmic peptidase (Tripathi et al., 2020). The functions of other proteins, among others, relate to motility (e.g., flagellar hook-length control FliK – 2), chemotaxis (CheW – 2), stress response (e.g., SOS-response peptidases – 2), DNA repair (e.g., DNA Pol Y or RecF proteins), and virulence/secretion (e.g., TssA of T6SS). On the other hand, the complete assembly is missing an ANT(3”)-Ia aminoglycoside nucleotidyltransferase (WP_001206316.1). Although the protein sequence does not present any significant similarity (e-value 0.01) to reference sequences of the Comprehensive Antibiotic Resistance Database (McArthur et al., 2013), there is a wide range of nearly identical proteins in the NCBI nr database.

In the resequenced genome, genes were assigned to clusters of orthologous groups (COGs) (Figure 2). Comparison with the COG database revealed that approximately 1,999 genes (30%) were associated with metabolism-related categories. 1,246 genes (21%) lacked functional assignment, and 890 genes (15%) were annotated as encoding proteins of unknown function. In addition, 1,185 genes (20%) were assigned to cellular processes and signaling, while 791 genes (13%) were associated with information storage and processing. Within the metabolic category, subgroup E (amino acid transport and metabolism) was the most abundant, comprising 430 genes, surpassed only by the group of genes with uncharacterized functions. Subgroups P (inorganic ion transport and metabolism) and C (energy production and conversion) were the next most represented, with 398 and 352 genes, respectively, underscoring their importance in the metabolic capacity of M. symbioticum (Figure 2).

Figure 2

3.2 Development of xoxF-specific PCR primers

The xoxF gene encodes a methanol dehydrogenase, which plays a central role in methanol utilization and can thus serve as a marker for the presence of methylotrophic bacteria, particularly members of the genus Methylobacterium, and not specifically M. symbioticum SB0023/3 (Alessa et al., 2021). We hypothesized that combining xoxF-based molecular detection with, e.g., a cultivation approach employing a selective medium with methanol as the sole carbon source would enable reliable biomonitoring of methylotrophic and diazotrophic bacteria in plants and soil. The xoxF gene is highly conserved in Methylobacterium, and therefore the xoxF1 gene (locus_tag MET9862_RS14105; NCBI acc. no. NZ_CABFPH010000037), encoding the lanthanide-dependent methanol dehydrogenase XoxF1 of M. symbioticum SB0023/3, was selected as the target for primer design. The primer design criteria were as follows: (i) PCR amplicon length > 300 bp, suitable for real-time PCR analyses; (ii) primer self-complementarity between 0 and 4; (iii) GC content between 40% and 60%. As a result, the following primer pair was designed: GoBio_Xoxf_f (5′-GGTCGAGCTTGGGATCGTAG-3′)/GoBio_Xoxf_r (5′-GAAGAAGGGCGAGACCAACA-3′).

3.3 Selection of the molecular biomarkers specific for M. symbioticum SB0023/3

To identify a set of potential unique molecular markers specific for M. symbioticum, all Methylobacterium spp. genomes available in the RefSeq database as of August 2nd, 2024, were downloaded. Their protein sequences were subjected to clustering with MMseqs2. As a result, 789 proteins encoded by the resequenced SB0023/3 strain were classified as singletons (Figure 3; step 1). To further refine this set, these proteins were queried against the NCBI non-redundant protein database, and candidates with the fewest homologous sequences and the lowest sequence identity were retained. After manual curation, this step yielded 23 genes with predicted functions (Figure 3; step 2). Genes annotated as hypothetical proteins were excluded from subsequent analyses.

Figure 3

The identified genes ranged in length from 93 bp to 1,176 bp. Genes shorter than 300 bp (16 in total) were deemed unsuitable due to the expected small amplicon size or questionable annotation (Figure 3; step 3). Primer-BLAST was then performed in NCBI for the remaining seven sequences. The resulting primer pairs were evaluated according to the following quality criteria: (i) self-complementarity scores between 0 and 4, and (ii) GC content between 40% and 60%. An additional requirement was the absence of mispriming within the family Methylobacteriaceae, which could otherwise produce comparably long PCR products (Figure 3; step 4).

After applying these selection criteria, two biomarker genes were retained. The first encodes a ribbon-helix-helix protein containing a CopG domain (contig_1240), and the second encodes an accessory factor UbiK family protein (contig_25530). The following primer pairs were designed GoBio_CopG_f (5′-TCATCACCCAAGCCAACCAG-3′)/GoBio_CopG_r (5′-TCATGGTCGATCCGTCCTCT-3′); GoBio_Ubik_f (5′-GTTCATCGCCCTTGAGGTAG-3′)/GoBio_Ubik_r (5′-GTTCCTTGCGAATGCGGG-3′).

3.4 Validation of the selected molecular markers and optimization of the real-time PCR methodology

The specificity of the selected molecular markers (xoxF, copG, and ubik) and PCR primers for biomarker-specific real-time-PCR-based identification was assessed using three reference species – M. radiotolerans DSM 760, M. mesophilicum DSM 1708, and M. dankookense SW08-7 – alongside M. symbioticum SB0023/3. The primer pairs GoBio_CopG_f/_r and GoBio_UbiK_f/_r demonstrated high specificity, yielding amplification exclusively in M. symbioticum (Figure 4). In contrast, the GoBio_XoxF_f/_r primers were non-specific, producing amplification across all four species (Figure 4). Melting curve analysis (Figure 4) revealed that GoBio_CopG_f/_r and GoBio_UbiK_f/_r produced single, sharp melting peaks at 84.6 °C and 93.2 °C, respectively, exclusively in M. symbioticum SB0023/3. By comparison, the non-specific GoBio_XoxF_f/_r primers generated a melting peak at 90.3 °C in all tested species. The gradual temperature increase during the creation of the melting curve was carried out in 0.5 °C increments. This means that the measurement uncertainty of the method is 0.5 °C.

Figure 4

Genomic DNA concentrations were quantified using a SpectraDrop Microplate in a SpectraMax iD3 spectrophotometer (Molecular Devices, San Jose, USA) and standardized to 100 ng per reaction to ensure comparability among PCR samples. Amplification of the copG gene in M. symbioticum was detected at a cycle threshold (Ct) value of 20, whereas the ubiK gene was amplified at a Ct value of 11. The xoxF-specific primers amplified DNA from M. symbioticum (Ct = 21), M. dankookense (Ct = 22), M. mesophilicum DSM 1708 (Ct =30), and M. radiotolerans (Ct = 32).

Fluorescence intensities of the melt peaks further supported these findings. GoBio_CopG_f/_r and GoBio_UbiK_f/_r produced melt peaks reaching approximately 17,000 and 21,000 fluorescence units, respectively, specific to M. symbioticum. For GoBio_XoxF_f/_r, melting peak intensity varied among species: M. symbioticum exhibited the highest signal (40,000), followed by M. radiotolerans (34,000), M. mesophilicum (32,000), and M. dankookense (19,000).

3.5 Proof-of-concept for monitoring plant colonization success with M. symbioticum SB0023/3 using a real-time-PCR-based method of xoxF, copG, and ubik gene detection under controlled conditions

In the next stage of the study, all three primer pairs specific to xoxF, copG, and ubik gene, respectively, were employed to evaluate the success of colonization with M. symbioticum SB0023/3 in various arable crops. This experiment served as a proof-of-concept for their application under plant cultivation conditions, rather than solely in pure bacterial cultures. Successful endophytic colonization by M. symbioticum SB0023/3 was confirmed in all inoculated plant species (wheat, corn, oilseed rape, pea, and tomato), whereas no bacteria were detected in the non-inoculated controls (Figure 5). The real-time PCR analysis of leaf tissues yielded positive amplification with at least one of the three primer pairs (GoBio_CopG_f/_r, GoBio_UbiK_f/_r, or GoBio_XoxF_f/_r) in ≥50% of the samples at both sampling time points (14 and 28 days post-inoculation). The only exception was observed for the GoBio_UbiK_f/_r primers in pea plants at 14 days post-inoculation, where only 30% of the samples tested positive.

Figure 5

For three crops (pea, tomato, and wheat), where samples were analyzed 28 days post-inoculation, the frequency of positive real-time PCR results increased compared to the earlier (14 days) time point. No obvious differences in colonization frequency were detected among the crop species tested. The highest colonization success was observed in tomato plants, where positive detection rates consistently exceeded 80%.

To complement the molecular approach, bacterial re-isolation was performed from leaf extracts using a specially designed selective Cf-Nf medium supplemented with methanol. This classical culturing method additionally confirmed successful colonization by M. symbioticum SB0023/3.

4 Discussion

The resequencing and comprehensive analysis of the M. symbioticum SB0023/3 genome presented in this study substantially expands our understanding of the genetic composition and variability of this plant-associated bacterium, widely used as a biostimulant. Compared with the previously assembled genome (GenBank acc. no. GCF_902141845.1), the resequenced genome revealed notable structural and functional differences, including 121 novel genomic regions (88,574 bp in total) comprising 165 protein-coding genes and five tRNA.

The genomic discrepancies observed between the two assemblies may reflect biological variability among sequenced clones, technical differences in sequencing platforms or assembly pipelines (Rhie et al., 2021), or potential evolutionary events (O’Donnell et al., 2023). Of particular interest is the abundance of mobile genetic elements within the SB0023/3 genome. Several contigs displayed elevated sequencing depth and a hallmark often associated with extrachromosomal elements. Annotation and geNomad analysis confirmed that these contigs correspond to complete bacteriophage genomes, with terminal repeats characteristic of active phage forms (Camargo et al., 2024). This indicates a dynamic phage-host interaction, which may have contributed to genome plasticity in M. symbioticum SB0023/3 (Bobay et al., 2013; Piel et al., 2022). In addition, two plasmids were detected. Such elements may facilitate horizontal gene transfer, potentially influencing host adaptation and niche specialization (Dziewit et al., 2014; diCenzo et al., 2016; Lu et al., 2019).

Functionally, the annotated genome contains the typical complement of protein-coding genes and RNA elements expected in Methylobacterium spp (Jirakkakul et al., 2023). Clustering into COG categories revealed a distribution consistent with other soil- and phyllosphere-associated bacteria, with approximately 30% of genes assigned to metabolic processes (Carbonetto et al., 2014). A relatively high fraction of hypothetical proteins (~10%) and proteins with domains of unknown function underscores the extent of uncharacterized genetic content and highlights opportunities for novel functional insights.

Importantly, this study addressed the need for reliable molecular tools to specifically detect and monitor M. symbioticum SB0023/3. Experimental validation of three primer sets confirmed their utility. The GoBio_CopG_f/_r and GoBio_UbiK_f/_r primers exhibited high specificity, amplifying products exclusively in M. symbioticum, whereas the GoBio_XoxF_f/_r primers, targeting the xoxF gene – widely present and even duplicated across methylobacterial species (Beck et al., 2015; Taubert et al., 2015) – showed relaxed specificity. Melt curve analysis corroborated these results, differentiating M. symbioticum SB0023/3 from M. radiotolerans, M. mesophilicum, and M. dankookense when using GoBio_CopG_f/_r and GoBio_UbiK_f/_r. Notably, the absence of amplification in M. dankookense, despite its close phylogenetic relationship (Pascual et al., 2020; Leducq et al., 2022), further emphasizes the specificity of these markers. This highlights the potential of the molecular markers developed in comparison to other identification methods. For example, it is not possible to clearly distinguish between M. symbioticum and M. dankookense via 16S rRNA analysis. The threshold value for reliable species identification is typically given as approximately 98.65-99% (Kim et al., 2014; Rodriguez-R et al., 2018; Hackmann, 2025). The 16S rRNA identity between M. symbioticum and M. dankookense is 98.7%. Differences were also reflected in Ct values for GoBio_XoxF_f/_r, with M. symbioticum amplifying earlier and with higher fluorescence intensity, consistent with reduced template matching and specificity in non-SB0023/3 strains. Nevertheless, GoBio_XoxF_f/_r alone cannot be recommended for reliable detection of M. symbioticum SB0023/3.

A proof-of-concept under controlled conditions for the developed detection system was provided by tracking the colonization efficiency of various crop plants at 14- and 28-day post-inoculation. The results demonstrated successful endophytic colonization of pea, maize, tomato, wheat, and oilseed species by M. symbioticum. The real-time-PCR-based method enabled direct detection of colonization from leaf tissues without the need for prior bacterial cultivation on selective media.

Microbial complexity under real-life conditions (e.g., soil, rhizosphere, phyllosphere) may negatively influence the sensitivity, specificity, and quantitative accuracy of molecular detection. Nevertheless, real-time PCR assays can operate even using extremely low DNA concentrations in complex environments, giving the chance of detecting microorganisms occurring in minorities. To ensure accuracy, careful primer-focused assay design and testing against closely related species are essential to avoid misleading false positives and negatives (Wilcox et al., 2013). Taking these principles into account, we have placed great emphasis on the quality parameters of the primers used (Figures 3, 6) and on validating them against the NCBI database resources and ensuring their selectivity in comparison to the closest known related species. Although databases are becoming increasingly comprehensive, and in silico testing of primers can provide a more holistic approach than in vitro testing could ever achieve, cultivation remains a valuable complementary approach. Since bacterial DNA may persist in plant tissues after cell death, PCR-based assays cannot determine bacterial viability at the time of sampling. For this reason, we recommend a combined application of cultivation and molecular techniques, as demonstrated in this study. Here, plating was performed using the same plant extracts analyzed by real-time PCR, allowing direct comparison of results while minimizing methodological variability (Figure 6).

Figure 6

However, cultivation is not recommended as a routine preliminary step for three main reasons. First, isolation of viable bacteria requires advanced methodology during sampling, preparation of extracts, and transport – often a significant challenge under field conditions. Second, rigorous surface sterilization necessary for eliminating epiphytic bacterial contaminants may inadvertently reduce endophyte viability. Third, despite selective conditions targeting diazotrophs and methylotrophs, non-target organisms may proliferate, sometimes exploiting Methylobacterium spp. as pioneer colonizers providing limiting nutrients (Torres et al., 2022).

This study demonstrated that by combining one non-specific primer set (GoBio_XoxF_f/_r) with two highly specific sets (GoBio_CopG_f/_r and GoBio_UbiK_f/_r), a robust strategy was established to distinguish between naturally occurring methylobacterial colonization (by indigenous soil or phyllosphere strains) and deliberate inoculation with M. symbioticum SB0023/3. The development of a simple, rapid, and reliable detection system is important not only from a biosafety perspective – for instance, monitoring unintended spread and potential impacts on soil microbial biodiversity – but also for practical field applications. Well-validated molecular biomarkers provide a valuable tool for tracing bacterial dispersal dynamics and investigating potential mechanisms of transmission into untreated areas. For M. symbioticum SB0023/3 in particular, accurate detection methods are critically needed, as the strain is widely used as a biostimulant in North and South America, Europe, Asia, and Australia. Reliable confirmation of colonization is essential to differentiate between genuine biological inefficacy and technical failures during application or cultivation that may result in unsuccessful inoculation (Beirne, 1985; Knights et al., 2021). Robust detection systems, combined with a clear understanding of colonization mechanisms, are therefore essential for accurately diagnosing the underlying causes of failed inoculation/colonization events.

The detection system developed in this study was evaluated under controlled laboratory conditions, with only preliminary field trials conducted to date. Although the initial field results were encouraging, further rounds of testing are required and are constrained by vegetation cycles. Given the extended time frame necessary to complete comprehensive field validation, we have chosen to present the method to the scientific community at this stage. In the spirit of open scientific evaluation, we anticipate that early dissemination will facilitate broader and more rigorous validation through independent field studies, alongside our ongoing investigations.

Nevertheless, several challenges should be considered when evaluating the system under field conditions. The principal limitations include: (i) natural microbial complexity, which may influence primer performance – although the formation of non-specific amplification products is considered unlikely due to comprehensive in silico validation against the NCBI database, a more relevant concern is the effective titration of primers through non-specific binding to abundant background (environmental) DNA; (ii) inhibition of amplification by naturally occurring PCR inhibitors that may be present in environmental samples; and (iii) technical constraints related to experimental design, including the number of samples required to achieve statistically robust outcomes under heterogeneous field conditions.

In addition, general limitations associated with the use of molecular markers in biomonitoring, particularly in biosafety-related applications, must be acknowledged. PCR-based approaches detect DNA irrespective of cell viability; therefore, the presence of a molecular signal does not necessarily indicate viable or metabolically active cells. This may lead to an overestimation of persistence or colonization in environmental samples. Furthermore, DNA-based markers provide limited insight into the physiological state or functional activity of the detected organism. As a result, detection confirms presence but does not directly demonstrate metabolic activity or ecological impact.

It is also important to consider general limitations arising from the applied experimental framework (Figure 6), particularly from a future biomonitoring perspective. First, the presence of mobile genetic elements in any bacterial strain intended for monitoring represents an important factor influencing long-term assay robustness. In the present study, the selected biomarkers (copG and ubik) are chromosomally encoded, which reduces the risk of target loss during cultivation and minimizes variability in copy number between cells. However, when designing biomarkers for tracking biofertilizers or biostimulants more broadly, it is essential to determine whether candidate loci are located on mobile elements such as plasmids, prophages, or transposon-associated regions. Markers residing on such elements may be horizontally transferred to other bacteria, potentially leading to false-positive signals in field monitoring and complicating biosafety assessments. Furthermore, mobile genetic elements often occur in multiple copies and may exhibit variable copy numbers across populations or environmental conditions (e.g., episomal versus integrated prophage states). This variability can artificially elevate qPCR signals and compromise longitudinal comparability of quantitative data. Therefore, future biomarker development pipelines should explicitly include replicon-level localization of target genes (chromosome vs. plasmid/phage), evaluation of copy-number stability, and validation using field reisolates as well as different production batches. Second, while this proof-of-concept focused on leaf tissues, the same marker framework can be adapted to other matrices relevant for agricultural monitoring (e.g., seed coats, rhizosphere/soil, or root tissues), but such extensions require matrix-specific optimization due to increased microbial complexity and PCR inhibitors in soil, as well as the higher likelihood of detecting extracellular DNA. Finally, false positives and false negatives have distinct consequences in field monitoring: false positives could lead to overestimation of inoculant persistence, whereas false negatives could mask successful colonization and confound interpretation of performance trials. To mitigate these risks, a conservative interpretation strategy is recommended, including strict assay thresholds, use of multiple independent markers (as implemented in this study), appropriate inhibition controls, and – where decisions depend on viability – complementary cultivation-based verification or viability-informed molecular approaches.

In summary, the results presented here establish a foundation for the molecular characterization of M. symbioticum SB0023/3 and highlight the potential of genome-informed biomarker design in environmental monitoring and agricultural biotechnology. The ability to detect M. symbioticum SB0023/3 directly from plant tissues without cultivation greatly facilitates studies on colonization dynamics, ecological interactions, and biosafety. Moreover, the experimental approach described here could be extended (as a pipeline) to other microbial strains used as biostimulants or biocontrol agents, supporting their responsible and traceable application in sustainable agriculture (Figure 6).

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/genbank/, PRJNA1330767.

Author contributions

RA: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Validation, Visualization, Writing – original draft, Writing – review & editing. PD: Data curation, Methodology, Software, Visualization, Writing – review & editing. TA: Investigation, Software, Writing – review & editing. AB: Investigation, Methodology, Writing – review & editing. LD: Conceptualization, Formal Analysis, Supervision, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared financial support was received for thiswork and/or its publication. The research project was funded entirely by GOBIO GmbH, under application of the research grant pursuant to Section 6 of the Research Allowance Act (FZulG) on research and development projects (R&D projects). The funder had no role in the study design, data collection, data analysis, interpretation of data, decision to publish, or preparation of the manuscript beyond the contributions described above.

Acknowledgments

The authors would like to thank Yen Pham and Katherin Sugiarto for their excellent technical assistance. We thank Petra Stahschmidt-Allner and Bernhard Allner for fruitful discussions.

Conflict of interest

Authors RA and TA were employed by the company GOBIO GmbH.

The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2026.1718185/full#supplementary-material

References

  • 1

    AgafonovaN. V.KaparullinaE. N.DoroninaN. V.Trotsenko YuA. (2013). Phosphate-solubilizing activity of aerobic methylobacteria. Microbiology.82, 864867. doi: 10.1134/S0026261714010020

  • 2

    AhmadT.FarooqS.MirzaD. N.KumarA.MirR. A.Riyaz-Ul-HassanS. (2022). Insights into the endophytic bacterial microbiome of Crocus sativus: Functional characterization leads to potential agents that enhance the plant growth, productivity, and key metabolite content. Microb. Ecol.83, 669688. doi: 10.1007/s00248-021-01810-y

  • 3

    AlessaO.OguraY.FujitaniY.TakamiH.HayashiT.SahinN.et al. (2021). Comprehensive comparative genomics and phenotyping of Methylobacterium species. Front. Microbiol.12. doi: 10.3389/fmicb.2021.740610

  • 4

    AroneyS. T. N.NewellR. J. P.NissenJ. N.CamargoA. P.TysonG. W.WoodcroftB. J. (2025). CoverM: Read alignment statistics for metagenomics. Bioinformatics41, btaf147. doi: 10.1093/bioinformatics/btaf147

  • 5

    BeckD. A.McTaggartT. L.SetboonsarngU.VorobevA.GoodwinL.ShapiroN.et al. (2015). Multiphyletic origins of methylotrophy in Alphaproteobacteria, exemplified by comparative genomics of Lake Washington isolates. Environ. Microbiol.17, 547554. doi: 10.1111/1462-2920.12736

  • 6

    BeirneB. P. (1985). Avoidable obstacles to colonization in classical biological control of insects. Can. J. Zool.63, 743747. doi: 10.1139/z85-108

  • 7

    BobayL.-M.RochaE. P. C.TouchonM. (2013). The adaptation of temperate bacteriophages to their host genomes. Mol. Biol. Evol30, 737751. doi: 10.1093/molbev/mss279

  • 8

    BourasG.GrigsonS. R.PapudeshiB.MallawaarachchiV.RoachM. J. (2024). Dnaapler: A tool to reorient circular microbialgenomes. JOSS.9, 5968. doi: 10.21105/joss.05968

  • 9

    BuchfinkB.ReuterK.DrostH.-G. (2021). Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods18, 366368. doi: 10.1038/s41592-021-01101-x

  • 10

    BurleyS. K.BhikadiyaC.BiC.BittrichS.ChenL.CrichlowG. V.et al. (2021). RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. Nucleic Acids Res.49, D437D451. doi: 10.1093/nar/gkaa1038

  • 11

    CamachoC.CoulourisG.AvagyanV.MaN.PapadopoulosJ.BealerK.et al. (2009). BLAST+: architecture and applications. BMC Bioinf.10, 421. doi: 10.1186/1471-2105-10-421

  • 12

    CamargoA. P.RouxS.SchulzF.BabinskiM.XuY.HuB.et al. (2024). Identification of mobile genetic elements with geNomad. Nat. Biotechnol.42, 13031312. doi: 10.1038/s41587-023-01953-y

  • 13

    CantalapiedraC. P.Hernández-PlazaA.LetunicI.BorkP.Huerta-CepasJ. (2021). eggNOG-mapper v2: Functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol. Biol. Evol38, 58255829. doi: 10.1093/molbev/msab293

  • 14

    CarbonettoB.RascovanN.ÁlvarezR.MentaberryA.VázquezM. P. (2014). Structure, composition and metagenomic profile of soil microbiomes associated to agricultural land use and tillage systems in Argentine pampas. PloS One9, e99949. doi: 10.1371/journal.pone.0099949

  • 15

    ChanP. P.LinB. Y.MakA. J.LoweT. M. (2021). tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res.49, 90779096. doi: 10.1093/nar/gkab688

  • 16

    ChandoniaJ.-M.GuanL.LinS.YuC.FoxN. K.BrennerS. E. (2022). SCOPe: improvements to the structural classification of proteins – extended database to facilitate variant interpretation and machine learning. Nucleic Acids Res.50, D553D559. doi: 10.1093/nar/gkab1054

  • 17

    ChengH.SchaefferR. D.LiaoY.KinchL. N.PeiJ.ShiS.et al. (2014). ECOD: An evolutionary classification of protein domains. PloS Comput. Biol.10, e1003926. doi: 10.1371/journal.pcbi.1003926

  • 18

    ChurchD. L.CeruttiL.GürtlerA.GrienerT.ZelaznyA.EmlerS. (2020). Performance and application of 16S rRNA gene cycle sequencing for routine identification of Bacteria in the clinical microbiology laboratory. Clin. Microbiol. Rev.33, e00053e00019. doi: 10.1128/CMR.00053-19

  • 19

    DanecekP.BonfieldJ. K.LiddleJ.MarshallJ.OhanV.PollardM. O.et al. (2021). Twelve years of SAMtools and BCFtools. GigaScience10, giab008. doi: 10.1093/gigascience/giab008

  • 20

    De Almeida LopesK. B.Carpentieri-PipoloV.OroT. H.Stefani PagliosaE.DegrassiG. (2016). Culturable endophytic bacterial communities associated with field-grown soybean. J. Appl. Microbiol.120, 740755. doi: 10.1111/jam.13046

  • 21

    diCenzoG. C.CheccucciA.BazzicalupoM.MengoniA.VitiC.DziewitL.et al. (2016). Metabolic modelling reveals the specialization of secondary replicons for niche adaptation in Sinorhizobium meliloti. Nat. Commun.7, 12219. doi: 10.1038/ncomms12219

  • 22

    DouradoM. N.Aparecida Camargo NevesA.SantosD. S.AraújoW. L. (2015). Biotechnological and agronomic potential of endophytic pink-pigmented methylotrophic Methylobacterium spp. BioMed. Res. Int.2015, 119. doi: 10.1155/2015/909016

  • 23

    DziewitL.CzarneckiJ.WibbergD.RadlinskaM.MrozekP.SzymczakM.et al. (2014). Architecture and functions of a multipartite genome of the methylotrophic bacterium Paracoccus aminophilus JCM 7686, containing primary and secondary chromids. BMC Genomics15, 124. doi: 10.1186/1471-2164-15-124

  • 24

    DziurzynskiM.DecewiczP.CiuchcinskiK.GoreckiA.DziewitL. (2021). “ Simple, reliable, and time-efficient manual annotation of bacterial genomes with MAISEN,” in Bacterial pangenomics, vol. 2242 . Eds. MengoniA.BacciG.FondiM. ( Springer US, New York, NY), 221229. Available online at: https://link.springer.com/10.1007/978-1-0716-1099-2_14. doi: 10.1007/978-1-0716-1099-2_14

  • 25

    EddyS. R. (2011). Accelerated profile HMM searches. PloS Comput. Biol.7, e1002195. doi: 10.1371/journal.pcbi.1002195

  • 26

    EdgarR. C. (2007). PILER-CR: Fast and accurate identification of CRISPR repeats. BMC Bioinf.8, 18. doi: 10.1186/1471-2105-8-18

  • 27

    FeldgardenM.BroverV.Gonzalez-EscalonaN.FryeJ. G.HaendigesJ.HaftD. H.et al. (2021). AMRFinderPlus and the Reference Gene Catalog facilitate examination of the genomic links among antimicrobial resistance, stress response, and virulence. Sci. Rep.11, 12728. doi: 10.1038/s41598-021-91456-0

  • 28

    GillerK. E.JamesE. K.ArdleyJ.UnkovichM. J. (2025). Science losing its way: examples from the realm of microbial N2-fixation in cereals and other non-legumes. Plant Soil511, 124. doi: 10.1007/s11104-024-07001-1

  • 29

    HackmannT. J. (2025). Setting new boundaries of 16S rRNA gene identity for prokaryotic taxonomy. Int. J. System Evol Microbiol.75, 6747. doi: 10.1099/ijsem.0.006747

  • 30

    HildebrandA.RemmertM.BiegertA.SödingJ. (2009). Fast and accurate automatic structure prediction with HHpred. Proteins.77, 128132. doi: 10.1002/prot.22499

  • 31

    Huerta-CepasJ.SzklarczykD.HellerD.Hernández-PlazaA.ForslundS. K.CookH.et al. (2019). eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res.47, D309D314. doi: 10.1093/nar/gky1085

  • 32

    HyattD.ChenG. L.LoCascioP. F.LandM. L.LarimerF. W.HauserL. J. (2010). Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinf.11, 119. doi: 10.1186/1471-2105-11-119

  • 33

    JirakkakulJ.KhoiriA. N.DuangfooT.DulsawatS.SutheeworapongS.PetsongK.et al. (2023). Insights into the genome of Methylobacterium sp. NMS14P, a novel bacterium for growth promotion of maize, chili, and sugarcane. PloS One18, e0281505. doi: 10.1371/journal.pone.0281505

  • 34

    JumaP. O.FujitaniY.AlessaO.OyamaT.YurimotoH.SakaiY.et al. (2022). Siderophore for lanthanide and iron uptake for methylotrophy and plant growth promotion in Methylobacterium aquaticum strain 22A. Front. Microbiol.13. doi: 10.3389/fmicb.2022.921635

  • 35

    KimM.OhH.-S.ParkS.-C.ChunJ. (2014). Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes. Int. J. System Evol Microbiol.64, 346351. doi: 10.1099/ijs.0.059774-0

  • 36

    KnightsH. E.JorrinB.HaskettT. L.PooleP. S. (2021). Deciphering bacterial mechanisms of root colonization. Environ. Microbiol. Rep.13, 428444. doi: 10.1111/1758-2229.12934

  • 37

    LarraldeM. (2022). Pyrodigal: Python bindings and interface to Prodigal, an efficient method for gene prediction in prokaryotes. J. Open Source Software7, 4296. doi: 10.21105/joss.04296

  • 38

    LaslettD.CanbackB. (2004). ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res.32, 1116. doi: 10.1093/nar/gkh152

  • 39

    LeducqJ.-B.SneddonD.SantosM.Condrain-MorelD.BourretG.Martinez-GomezN. C.et al. (2022). Comprehensive phylogenomics of Methylobacterium reveals four evolutionary distinct groups and underappreciated phyllosphere diversity. Genome Biol. Evol.14, evac123. doi: 10.1093/gbe/evac123

  • 40

    LiH. (2021). New strategies to improve minimap2 alignment accuracy. Bioinformatics.37, 45724574. doi: 10.1093/bioinformatics/btab705

  • 41

    LiW.O’NeillK. R.HaftD. H.DiCuccioM.ChetverninV.BadretdinA.et al. (2021). RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation. Nucleic Acids Res.49, D1020D1028. doi: 10.1093/nar/gkaa1105

  • 42

    LuX.WangW.ZhangL.HuH.XuP.WeiT.et al. (2019). Molecular mechanism of N,N -dimethylformamide degradation in Methylobacterium sp. strain DM1. Appl. Environ. Microbiol.85, e00275e00219. doi: 10.1128/AEM.00275-19

  • 43

    McArthurA. G.WaglechnerN.NizamF.YanA.AzadM. A.BaylayA. J.et al. (2013). The comprehensive antibiotic resistance database. Antimicrob. Agents Chemother.57, 33483357. doi: 10.1128/AAC.00419-13

  • 44

    MusF.CrookM. B.GarciaK.Garcia CostasA.GeddesB. A.KouriE. D.et al. (2016). Symbiotic nitrogen fixation and the challenges to its extension to nonlegumes. Appl. Environ. Microbiol.82, 36983710. doi: 10.1128/AEM.01055-16

  • 45

    NawrockiE. P.EddyS. R. (2013). Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics29, 29332935. doi: 10.1093/bioinformatics/btt509

  • 46

    NewboldT.HudsonL. N.ArnellA. P.ContuS.De PalmaA.FerrierS.et al. (2016). Has land use pushed terrestrial biodiversity beyond the planetary boundary? A global assessment. Science.353, 288291. doi: 10.1126/science.aaf2201

  • 47

    O’DonnellS.YueJ.-X.SaadaO. A.AgierN.CaradecC.CokelaerT.et al. (2023). Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae. Nat. Genet.55, 13901399. doi: 10.1038/s41588-023-01459-y

  • 48

    PahalviH. N.RafiyaL.RashidS.NisarB.KamiliA. N. (2021). “ Chemical fertilizers and their impact on soil health,” in Microbiota and biofertilizers, vol. 2 . Eds. DarG. H.BhatR. A.MehmoodM. A.HakeemK. R. ( Springer International Publishing, Cham). Available online at: https://link.springer.com/10.1007/978-3-030-61010-4_1. doi: 10.1007/978-3-030-61010-4_1

  • 49

    PappalettereL.BartoliniS.ToffaninA. (2024). Auxin-producing bacteria used as microbial biostimulants improve the growth of tomato (Solanum lycopersicum L.) seedlings in hydroponic systems. BioTech13, 32. doi: 10.3390/biotech13030032

  • 50

    PascualJ. A.RosM.MartínezJ.CarmonaF.BernabéA.TorresR.et al. (2020). Methylobacterium symbioticum sp. nov., a new species isolated from spores of Glomus Iranicum var. tenuihypharum. Curr. Microbiol.77, 20312041. doi: 10.1007/s00284-020-02101-4

  • 51

    PielD.BrutoM.LabreucheY.BlanquartF.GoudenègeD.Barcia-CruzR.et al. (2022). Phage–host coevolution in natural populations. Nat. Microbiol.7, 10751086. doi: 10.1038/s41564-022-01157-1

  • 52

    PrjibelskiA.AntipovD.MeleshkoD.LapidusA.KorobeynikovA. (2020). Using SPAdes de novo assembler. CP Bioinf.70, e102. doi: 10.1002/cpbi.102

  • 53

    RhieA.McCarthyS. A.FedrigoO.DamasJ.FormentiG.KorenS.et al. (2021). Towards complete and error-free genome assemblies of all vertebrate species. Nature.592, 737746. doi: 10.1038/s41586-021-03451-0

  • 54

    Rodriguez-RL. M.CastroJ. C.KyrpidesN. C.ColeJ. R.TiedjeJ. M.KonstantinidisK. T. (2018). How much do rRNA gene surveys underestimate extant bacterial diversity? Appl. Environ. Microbiol.84, e00014e00018. doi: 10.1128/AEM.00014-18

  • 55

    SayersE. W.BoltonE. E.BristerJ. R.CaneseK.ChanJ.ComeauD. C.et al. (2022). Database resources of the national center for biotechnology information. Nucleic Acids Res.50, D20D26. doi: 10.1093/nar/gkab1112

  • 56

    SchwengersO.JelonekL.DieckmannM. A.BeyversS.BlomJ.GoesmannA. (2021). Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microbial Genomics7, 685. doi: 10.1099/mgen.0.000685

  • 57

    SilveiraA. P. D. D.SalaV. M. R.CardosoE. J. B. N.LabancaE. G.CiprianoM. A. P. (2016). Nitrogen metabolism and growth of wheat plant under diazotrophic endophytic bacteria inoculation. Appl. Soil Ecol107, 313319. doi: 10.1016/j.apsoil.2016.07.005

  • 58

    SoumareA.DiedhiouA. G.ThuitaM.HafidiM.OuhdouchY.GopalakrishnanS.et al. (2020). Exploiting biological nitrogen fixation: A route towards a sustainable agriculture. Plants.9, 1011. doi: 10.3390/plants9081011

  • 59

    SteineggerM.SödingJ. (2017). MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol.35, 10261028. doi: 10.1038/nbt.3988

  • 60

    TaubertM.GrobC.HowatA. M.BurnsO. J.DixonJ. L.ChenY.et al. (2015). XoxF encoding an alternative methanol dehydrogenase is widespread in coastal marine environments. Environ. Microbiol.17, 39373948. doi: 10.1111/1462-2920.12896

  • 61

    The UniProt ConsortiumBatemanA.MartinM.-J.OrchardS.MagraneM.AgivetovaR.et al. (2021). UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res.49, D480D489. doi: 10.1093/nar/gkaa1100

  • 62

    TianH.XuR.CanadellJ. G.ThompsonR. L.WiniwarterW.SuntharalingamP.et al. (2020). A comprehensive quantification of global nitrous oxide sources and sinks. Nature.586, 248256. doi: 10.1038/s41586-020-2780-0

  • 63

    Tonkin-HillG.MacAlasdairN.RuisC.WeimannA.HoreshG.LeesJ. A.et al. (2020). Producing polished prokaryotic pangenomes with the Panaroo pipeline. Genome Biol.21, 180. doi: 10.1186/s13059-020-02090-4

  • 64

    TorresM. J.Gonzalez-BallesterD.Gómez-OsunaA.GalvánA.FernandezE.DubiniA. (2022). Chlamydomonas-Methylobacterium oryzae cooperation leads to increased biomass, nitrogen removal, and hydrogen production. Bioresour Technol. 352, 127088. doi: 10.1016/j.biortech.2022.127088

  • 65

    Torres VeraR.Bernabé GarcíaA. J.Carmona ÁlvarezF. J.Martínez RuizJ.Fernández MartínF. (2024). Application and effectiveness of Methylobacterium symbioticum as a biological inoculant in maize and strawberry crops. Folia Microbiol.69, 121131. doi: 10.1007/s12223-023-01078-4

  • 66

    ToyamaH.AnthonyC.LidstromM. E. (1998). Construction of insertion and deletion mxa mutants of Methylobacterium extorquens AM1 by electroporation. FEMS Microbiol. Lett166, 17. doi: 10.1111/j.1574-6968.1998.tb13175.x

  • 67

    TripathiS.SrivastavaP.DeviR. S.BhadouriaR. (2020). “ Influence of synthetic fertilizers and pesticides on soil health and soil microbiology,” in Agrochemicals Detection, Treatment and Remediation (Oxford, United Kingdom and Cambridge, MA, USA: Elsevier), 2554. Available online at: https://linkinghub.elsevier.com/retrieve/pii/B9780081030172000027. doi: 10.1016/B978-0-08-103017-2.00002-7

  • 68

    TsipinanaS.ObiL.AmooS.AdelekeR. (2024). Plant growth-promoting potential of bacterial endophytes isolated from Lessertia frutescens. South Afr. J. Bot.174, 768778. doi: 10.1016/j.sajb.2024.09.043

  • 69

    Van DienS. J.OkuboY.HoughM. T.KorotkovaN.TaitanoT.LidstromM. E. (2003). Reconstruction of C3 and C4 metabolism in Methylobacterium extorquens AM1 using transposon mutagenesis. Microbiology.149, 601609. doi: 10.1099/mic.0.25955-0

  • 70

    VaserR.SovićI.NagarajanN.ŠikićM. (2017). Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res.27, 737746. doi: 10.1101/gr.214270.116

  • 71

    VasimuddinM. D.MisraS.LiH.AluruS. (2019). “ Efficient architecture-aware acceleration of BWA-MEM for multicore systems,” in 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS), (Rio de Janeiro, Brazil: IEEE). 314–324. doi: 10.1109/IPDPS.2019.00041

  • 72

    WangJ.ChitsazF.DerbyshireM. K.GonzalesN. R.GwadzM.LuS.et al. (2023). The conserved domain database in 2023. Nucleic Acids Res.51, D384D388. doi: 10.1093/nar/gkac1096

  • 73

    WilcoxT. M.McKelveyK. S.YoungM. K.JaneS. F.LoweW. H.WhiteleyA. R.et al. (2013). Robust detection of rare species using environmental DNA: The importance of primer specificity. PloS One8, e59520. doi: 10.1371/journal.pone.0059520

  • 74

    ZimmermannL.StephensA.NamS. Z.RauD.KüblerJ.LozajicM.et al. (2018). A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core. J. Mol. Biol.430, 22372243. doi: 10.1016/j.jmb.2017.12.007

  • 75

    ZinnielD. K.LambrechtP.HarrisN. B.FengZ.KuczmarskiD.HigleyP.et al. (2002). Isolation and characterization of endophytic colonizing bacteria from agronomic crops and prairie plants. Appl. Environ. Microbiol.68, 21982208. doi: 10.1128/AEM.68.5.2198-2208.2002

Summary

Keywords

biomonitoring, biostimulant, Methylobacterium symbioticum, plant colonization, real-time PCR

Citation

Allner R, Decewicz P, Allner T, Bluszcz A and Dziewit L (2026) Development of molecular biomarkers for monitoring of arable crops colonization with Methylobacterium symbioticum SB0023/3, a methylotrophic bacterium commonly used as a biostimulant in agriculture. Front. Plant Sci. 17:1718185. doi: 10.3389/fpls.2026.1718185

Received

03 October 2025

Revised

29 January 2026

Accepted

09 February 2026

Published

02 March 2026

Volume

17 - 2026

Edited by

Lénia Rodrigues, University of Évora, Portugal

Reviewed by

Ali Baghdadi, University of Saskatchewan, Canada

Muhammad Shahbaz Farooq, Pakistan Agricultural Research Council, Pakistan

Updates

Copyright

*Correspondence: Lukasz Dziewit,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics