- 1Department of Biotechnology, University of Inland Norway, Hamar, Norway
- 2TINE SA, Oslo, Norway
Long-read metagenomic sequencing allows for the rapid, culture-independent, and accurate identification of causative pathogens and antimicrobial resistance (AMR) profiles, supporting precise antibiotic use and reducing the spread of resistance. However, its application to mastitis milk is challenging due to the complex milk matrix, low bacterial count, and high somatic cell content. This study primarily aimed to further optimize our previously developed culture-free nanopore sequencing protocol for milk samples from mastitis cases. Additional optimizations included combining centrifugation, gradient centrifugation, and fat fraction treatment with Tween 20 and citric acid. Subsequently, four DNA extraction kits (Blood and Tissue, Molysis Complete5, HostZero, and SPINeasy Host depletion) were evaluated for their ability to remove host DNA and enrich bacterial DNA for long-read sequencing with Oxford Nanopore technologies. qPCR was used to quantify bacterial and bovine DNA, allowing comparison of host depletion efficiency among the kits. Our results show that simple centrifugation effectively concentrates bacterial cells, removing the need for chemical treatments. The HostZero kit consistently produced higher DNA yields, improved DNA integrity, and more effective host DNA depletion. Using nanopore sequencing, both Gram-positive and Gram-negative mastitis pathogens, along with their AMR genes, were successfully detected. Overall, this study underscores the importance of an effective DNA extraction method for the direct sequencing of mastitis milk samples. Additionally, our findings support the potential of direct metagenomic sequencing as a rapid, culture-free approach for identifying mastitis pathogens and their resistance profiles.
Introduction
Mastitis is an inflammation of the mammary gland, usually caused by various pathogens invading the udder tissue, and occasionally by mechanical or chemical trauma (Malcata et al., 2020). Clinical and subclinical mastitis are the second and third costliest dairy cattle diseases globally, with estimated annual costs of approximately US$13B and US$9B, respectively (Rasmussen et al., 2024) due to reduced milk production, milk wastage, treatment costs, early culling, and, in severe cases, mortality (Seegers et al., 2003). In Norway, records from the Norwegian Dairy Herd Recording System (NDHRS) indicate that mastitis accounts for more than one-third of all reported diseases in dairy cows and is the leading cause of antibiotic use (TINE, 2025).
Although over 134 pathogens, including bacteria, viruses, mycoplasma, yeasts, and algae, have been linked to bovine mastitis, bacteria are responsible for approximately 95% of all cases (Zigo et al., 2019). Staphylococcus aureus is the leading cause of both clinical and subclinical mastitis in Norway, while other commonly identified pathogens are non-aureus Staphylococci and Mammaliicocci (NASM), Escherichia coli, and Streptococcus species (Smistad et al., 2023). Mastitis is typically categorized into two types: clinical and subclinical. Clinical mastitis is further divided into mild, moderate, and severe levels depending on symptom intensity. It is marked by visible changes in the milk, such as clots, discoloration, blood, or a watery look, along with signs of inflammation in the udder, like swelling, heat, redness, and pain. Conversely, subclinical mastitis often shows no obvious symptoms but can be identified through increased somatic cell count (SCC) and bacterial cultures. It acts as a reservoir for pathogen spread within the herd (Urrutia-Angulo et al., 2024).
Mastitis is primarily treated with antimicrobials based on the clinical diagnosis report. The current gold standard for diagnosis involves culturing milk samples, identifying pathogens through biochemical tests or MALDI-TOF mass spectrometry, and performing culture-based antibiotic susceptibility testing. This traditional method takes 3–5 days. It has notable limitations in mastitis diagnosis because the sensitivity of culture-based detection is relatively low (Imam et al., 2024), either due to the presence of mixed bacterial populations or the absence of detectable growth. The delayed information on the infection-causing pathogens and their susceptibility to antibiotics leads to the empirical use of broad-spectrum antibiotics (Strich et al., 2020), which contributes to the emergence of AMR, a growing global public health concern (WHO, 2016). Overusing antimicrobials for an extended period can lead to the accumulation of drug residues in milk (Zhang et al., 2009), contributing to further economic losses and the spread of antimicrobial resistance. Furthermore, inappropriate or delayed treatment compromises animal welfare by causing prolonged pain, discomfort, and reduced quality of life (Ruegg, 2017). Therefore, developing rapid and reliable methods to diagnose mastitis is essential for improving diagnostic accuracy and promoting the responsible use of antibiotics.
Molecular techniques based on PCR offer high sensitivity, but they can only detect a preset number of pathogens and antimicrobial resistance genes (ARGs) (Yamin et al., 2023). In contrast, metagenomic next-generation sequencing enables unbiased sequencing of DNA from known, unexpected, rare, or even novel pathogens without prior assumptions (Gu et al., 2019). In recent years, metagenomics and long-read sequencing technologies have become faster, more accurate, and affordable, gaining significant attention as powerful tools for diagnostics (Satam et al., 2023). Among these, Oxford Nanopore technology is especially promising because it produces long sequencing reads that can be analyzed in real-time, thus speeding up the identification of pathogens along with AMR and Virulence Factor (VF) genes in clinical settings. Detecting AMR genes helps predict whether a pathogen is likely to be resistant and indicates the pathogen’s potential for pathogenicity (Nguyen et al., 2020). Many studies have demonstrated the effectiveness of Oxford Nanopore sequencing technology to identify pathogens and ARGs in human clinical samples, such as blood (Ali et al., 2024; Gu et al., 2025), urine (Liu M. et al., 2023; Bellankimath et al., 2024), bronchoalveolar lavage fluid (Li et al., 2025), as well as in bovine milk samples (Ahmadi et al., 2023; Usui et al., 2023). One major hurdle in sequencing-based diagnostics is separating pathogenic DNA from clinical samples that contain far more host DNA. This is particularly true for mastitis milk due to its high somatic cell content, which easily exceeds 200,000 cells/ml, even in subclinical mastitis (Liu J. et al., 2023). Additionally, the complex matrix containing high levels of fat and protein in milk samples makes DNA isolation from these samples more challenging.
In our previous work (Ahmadi et al., 2023), we described a culture- and amplification-independent sequencing approach to identify pathogens and antibiotic resistance genes in mastitis milk samples, which could potentially reduce diagnostic time to 5–9 h. This study aimed to optimize the sample treatment protocols before DNA extraction to remove the matrix without affecting the viability of bacterial cells from clinical mastitis milk samples. Additionally, we compared four commercial DNA extraction kits for effective host depletion and microbial DNA isolation from mastitis milk samples, suitable for Oxford Nanopore metagenomic sequencing to identify the causative pathogens and their AMR profiles.
Methodology
Milk samples
In this study, 10 quarter milk samples from 10 different cows diagnosed with clinical mastitis caused by various gram-positive (S. aureus, Streptococcus dysgalactiae, and Streptococcus uberis) and gram-negative (E. coli) bacteria were provided by TINE SA from the routine mastitis diagnostics. Initial bacterial identification was performed at TINE’s Mastitis Laboratory (Molde, Norway), using overnight culturing followed by MALDI TOF mass spectrometry. To minimize the freezing effect on genomic material, 30% glycerol (v/v) was added to the samples. The frozen samples were shipped to the INN laboratory in Hamar, Norway, and kept at −20 °C. On the day of the experiment, samples were thawed at room temperature and re-cultured in Brain Heart Infusion (BHI) agar (15 g/L Agar, 37 g/L BHI Broth, VWR Life Science, USA) to determine the CFU/mL values post-freezing. The experimental design overview is presented in Figure 1.
Figure 1. A graphical overview of the experimental design. Experiment A: Evaluation of different preDNA extraction sample treatment methods; Experiment B: The optimized sample treatment method from Experiment A was employed for DNA extraction using four different commercial kits, followed by qPCR, TapeStation analysis, nanopore sequencing, and bioinformatics analysis.
Experimental design
Experiment A: optimizing the pre-DNA extraction sample treatment
In our previous study (Ahmadi et al., 2023), the MolYsis™ Complete5 kit (Molzym, Bremen, Germany) demonstrated optimal performance when an additional centrifugation step was included before proceeding with the manufacturer’s protocol for DNA isolation. In this study, three different methods were tested to optimize the recovery of bacterial cells that may become trapped in fat globules and are often lost in the supernatant during centrifugation. The workflow of these three methods is presented in (Supplementary Figure 1). The pre-DNA extraction sample treatment optimization was performed using three representative milk samples with varying bacterial loads: high (∼107 CFU/mL), medium (∼105 CFU/mL), and low (∼103 CFU/mL). Each milk sample was divided into three 1 mL aliquots and subjected to the following pretreatment methods.
Method 1 – Centrifugation only
Milk samples were centrifuged at 4500 x g for 20 min at 4 °C to separate the fat and whey layers from the cellular components. The upper fat and whey fractions were carefully removed, and the remaining pellet was retained. To reduce residual components, the pellet was washed with 1 mL of phosphate-buffered saline (PBS) and centrifuged at 13000 x g for 1 min (Ahmadi et al., 2023). The washing step was performed twice.
Method 2 - Centrifugation followed by chemical treatment
Initial centrifugation was performed as described in Method 1 (4,500 × g for 20 min at 4 °C). The resulting pellet was kept on ice, while the supernatant, comprising the fat and whey layers, was subjected to further processing. To disrupt protein and fat components and release any bacterial cells potentially trapped within them, the supernatant was incubated with 0.1% Tween 20 and 2% citric acid at room temperature for 15 min (Duarte and Porcellato, 2024). The treated layer was centrifuged at 8000 x g for 10 min at 4 °C, and the pellet was combined with the original pellet from the initial centrifugation. The combined pellet was subsequently washed twice with PBS as described in Method 1.
Method 3 - Gradient centrifugation combined with chemical treatment
An equal volume of Percoll solution (1.050 g/ml) was added to the milk sample to create a density gradient, followed by centrifugation at 4500 x g for 15 min at room temperature (Meisel et al., 2011). The supernatant was carefully transferred to a clean microcentrifuge tube, then chemically treated and combined with the initial pellet. The combined pellet was washed twice and resuspended in PBS.
Following sample treatment, a 100 μL aliquot of the final supernatant from all three tested conditions was plated on BHI agar (VWR Life Sciences, USA) and incubated at 37 °C for 24 h. The CFUs were counted to assess potential bacterial loss compared to the initial CFUs of the selected samples. The resulting pellets, presumed to contain concentrated bacterial cells, were suspended in 1 mL PBS for DNA extraction. The modified protocol (Mol Com5cent–nuc) of the MolYsis™ Complete5 kit, containing an additional micrococcal nuclease treatment after host depletion, was used for extracting DNA as previously described (Ahmadi et al., 2023). The extracted DNA was assessed using qPCR, with primers specific to S. aureus and bovine (Supplementary Table 1). The reaction condition and thermal profile for qPCR were as described below.
Experiment B: evaluation of DNA extraction kits
Direct DNA extraction from milk samples
Four commercial DNA extraction kits were evaluated for their effectiveness in isolating microbial DNA from mastitic milk, including three kits specifically designed for the selective depletion of host DNA. The kits tested were: a modified version of the MolYsis™ Complete 5 kit (Mol Com5cent–nuc) as described by Ahmadi et al. (2023), HostZERO Microbial DNA Kit (Zymo Research), SPINeasy® Host Depletion Microbial DNA Kit (MP Biomedicals), and DNeasy Blood & Tissue Kit (Qiagen), hereafter Mol Com5, HostZero, SPINeasy and Blood and Tissue, respectively.
The study utilized five clinical mastitis milk samples, each infected with commonly encountered bovine mastitis pathogens, including S. aureus, S. dysgalactiae, and S. uberis, and E. coli. Sample pretreatment was performed according to the previously described “Method 1”. After washing, the pellets were resuspended in sterile PBS, and volumes were adjusted to meet the input requirements specified for each kit. DNA extraction was performed according to the respective manufacturers’ protocols. Final DNA elution was performed using 100 μL of elution buffer for Mol Com5cent–nuc and Blood and Tissue kits, and 50 μL for HostZERO and SPINeasy kits.
DNA quality assessment
All samples were evaluated for quantity, purity, and fragment length after DNA extraction. DNA concentration was measured using the Qubit High Sensitivity Assay kit and the Qubit 4.0 fluorometer (Invitrogen, USA), following the manufacturer’s protocol. Sample purity was assessed using a Nanodrop ND-1000 spectrophotometer (NanoDrop Technologies, Rockland, DE, United States), which measured the absorption ratios at 260/280 nm and 260/230 nm. The fragment length and integrity of the extracted DNA (one representative gram-positive: S. aureus and one gram-negative: E. coli) were analyzed using the Agilent 4150 TapeStation System using Genomic DNA ScreenTape Analysis (Agilent Technologies, USA) for the kits that include host depletion mechanisms (Mol Com5, HostZero, and SPINeasy kits).
qPCR
To determine the relative proportions of bacterial and bovine DNA, qPCR was performed using primers specific to the pathogens detected in the milk samples, following overnight culturing and subsequent MALDI-TOF analysis (Supplementary Table 1). Each reaction was conducted in a total volume of 15 μL, containing 3 μL of 5X Hotfire Pol EvaGreen qPCR supermix (Solis BioDyne, Estonia), 0.3 μL each of 10 μM forward and reverse primers, and 1 μL of template DNA. Nucleic acid- and nuclease-free water was used as a substitute for template DNA in negative control reactions.
qPCR amplification was carried out using a 7500 Fast Real-Time PCR system (Invitrogen™, USA) under the following thermal cycling conditions: initial denaturation at 95 °C for 12 min, followed by 40 cycles of 95 °C for 25 s, 60 °C for 45 s (data collection stage), and 72 °C for 1 min.
MinION library preparation and sequencing
DNA samples from one mastitis milk sample infected with S. aureus (gram-positive) infection and one from E. coli (gram-negative) infection were selected for sequencing. For each species, DNA was extracted using Mol Com5 and HostZero kits. Before library preparation, the DNA samples were purified and concentrated using AMPure XP beads (Beckman Coulter™, USA) to improve purity and yield. Library preparation was performed using the Oxford Nanopore Technologies Rapid PCR Barcoding kit 24 V14 (SQK-RPB114.24), according to the manufacturer’s instructions. Sequencing was performed for over 24 h using the R10.4.1 flow cell (FLO-MIN 114) mounted on a MinION MK1D device (Oxford Nanopore Technologies).
Bioinformatic analysis
Raw sequencing reads were generated and base-called in real-time using the ONT MinKNOW GUI software (version 6.0.11) in FAST base-calling mode, with the Dorado basecaller (version 7.4.13). Sequencing read statistics, including read length, read quality, and N50, were assessed using NanoStat v1.4.0 (De Coster et al., 2018).
To identify potential pathogens, reads were mapped using BLASTn against the NCBI Prokaryotic Reference Genomes collection (RefProk). The BLASTn search was performed with the following parameters: word size-28, maximum target sequences−150, and e-value cutoff −0.000001. The following cutoff values were used for bacterial identification: minimum percent identity: 80%, minimum read coverage in alignment: 65%, minimum read length: 200 nt. Sequencing reads that did not meet the specified criteria or failed to align with the RefProk database were classified as non-aligned, primarily representing the host (bovine) derived sequences.
To create an assembly, the reads were first mapped to the reference genome of the top-identified pathogen using Minimap2 v2.29 (Li, 2018). Correctly mapped reads were extracted using SAMtools v1.13 (Danecek et al., 2021) and subsequently used for de novo assembly with Flye v2.9.6 (Kolmogorov et al., 2019). The quality and completeness of the assemblies were evaluated using the QUality Assessment Tool (QUAST) v5.3.0 (Gurevich et al., 2013) and Benchmarking Universal Single-Copy Orthologs (BUSCO) 5.2.2 (Simão et al., 2015). To determine how well the assembly represents the original sequencing data, reads were mapped back to the assembly using Minimap2 v2.29, enabling an assessment of sequencing depth and the contribution of reads to the assembly. The sorted SAM files from the mapping were used to extract the mapping statistics using samtools (Li et al., 2009).
Assembled genomes were used to identify ARGs and VF genes using ABRicate v1.0.1 (Seemann, 2025). To identify the ARGs, the NCBI resistance database (Feldgarden et al., 2019), as well as the CARD 2023 database (Alcock et al., 2023), and to identify VF genes, the core VF database (Liu et al., 2019) within ABricate were used.
Statistical analyses
The statistical analyses were performed using GraphPad Prism version 10.6.0 for Windows (GraphPad Software, Boston, Massachusetts, USA1). The Ct values across kits/treatments were compared using a non-parametric ANOVA approach (Friedman test for paired samples). Post hoc pairwise comparisons were conducted using Dunn’s test, with p-values adjusted for multiple testing (α = 0.05).
Results
The centrifugation-only method is adequate, and additional fat fraction treatment did not improve bacterial DNA recovery in milk samples
Method 1: Centrifugation only
To concentrate the bacterial cells and remove the unwanted milk components before DNA extraction, milk samples were centrifuged at 4500 x g for 20 min at 4 °C. The CFU count in the supernatant fat and whey layer, which were discarded after centrifugation, indicated an approximate loss of 5%–18% of bacterial cells (Figure 2A). This loss was inversely related to the initial bacterial load, with greater losses observed in samples with lower bacterial concentrations.
Figure 2. (A) Percentage of bacterial loss in the supernatant after three pre-DNA extraction sample treatment methods applied to mastitis milk samples with high, medium, and low bacterial loads (CFU counts in Supplementary Table 3a). (B) Ct values following qPCR targeting the nuc gene (Staphylococcus aureus) and BGb gene (bovine) in DNA isolated from mastitis milk samples treated with three pretreatment methods. All qPCR reactions were performed in triplicate, and data presented as mean ± SD. The upper and lower connection bars with * show the significant groups for bacterial and host Ct values, respectively (Supplementary Table 3b).
Method 2: Centrifugation combined with chemical treatment
To recover bacterial cells trapped in the fat layer after centrifugation (method 1), the fat and whey fractions were additionally treated with 0.1% Tween 20 and 2% citric acid (Supplementary Figure 1), aiming to emulsify the fat and release the associated bacterial cells. Following this chemical treatment and subsequent centrifugation, minimal loss of viable bacteria was observed in the resulting supernatant for samples with high and medium bacterial loads. However, approximately 2.5% of bacterial cell loss was seen in samples with a low initial bacterial concentration.
Method 3: Gradient centrifugation combined with chemical treatment
A Percoll gradient was used to improve the separation of bacterial cells trapped in the fat layer (Figure 1). The resulting CFU counts in the supernatant after treatment were negligible across all samples, regardless of their bacterial loads (Figure 2A). These results suggest that bacterial recovery from milk samples is more efficient when chemical treatment is combined with gradient centrifugation.
qPCR results
qPCR was performed on the total DNA extracted from treated milk samples to evaluate the effectiveness of different pre-DNA extraction sample treatment strategies in enriching bacterial DNA. The cycle threshold (Ct) values for both bacterial (nuc) and host (BGb) targets are presented in Figure 2B. In samples with a high bacterial load, Ct values for the bacterial gene target were comparable across methods, ranging from 16.7 to 17.4. The centrifugation-only method produced the lowest Ct value (16.7), indicating slightly more efficient bacterial DNA recovery. In samples with a low bacterial load, the centrifugation-only method also resulted in a lower bacterial Ct value (21.4) compared to methods 2 (24.2) and 3 (24.7), suggesting better bacterial DNA enrichment. For the medium bacterial load sample, method 1 exhibited the slightly highest bacterial Ct value, indicating lower overall bacterial DNA recovery compared to methods 2 and 3 (Figure 2B).
We also examined the differential Ct value (Ctdiff) for bacteria and host (Ctbacteria – Cthost) across all methods in samples with low, medium, and high bacterial loads. Results showed a lower and significantly different average Ctdiff for method 1 (−2.13 compared to 1.24 in method 2 and 8.99 in method 3*) in samples with low bacterial loads. The same pattern was observed for samples with a medium bacterial load (5.91 for method 1, 12.76 for method 2*, and 11.95 for method 3). These findings suggest that method 1 effectively depletes the host and enriches bacteria in samples with low and medium bacterial loads. In samples with a high bacterial load, method 3 produced the lowest Ctdiff, although the difference was not statistically significant (2.21 for method 1, 2.47 for method 2, and 0.56 for method 3).
HostZero provides superior DNA yield and integrity compared to other host depletion kits
DNA concentration, total yield, and purity ratios (260/280 and 260/230) were evaluated across four commercial DNA extraction kits. The Blood and Tissue kit produced the highest DNA concentration and overall yield in all samples. The HostZero kit provided the most consistent yields across samples infected with different pathogens, with DNA amounts ranging from 13.3 to 350 ng. In contrast, the SPINeasy kit performed the poorest, with DNA concentrations below the detection limit of the Qubit High Sensitivity Assay kit (<0.005 ng/μL) in three samples and very low yields in the remaining two (0.51 and 19.7 ng). The Mol Com5 kit also generated relatively low DNA quantities, ranging from 1.22 to 89.2 ng.
Purity ratios varied across different extraction methods and samples. The Blood and tissue kit typically delivered the highest purity, with 260/280 ratios around 1.8 and 260/230 ratios close to the optimal range of 2.0–2.2. In contrast, both the HostZero and Mol Com5 kits consistently exhibited lower 260/280 and 260/230 ratios, indicating the presence of residual protein and salt contamination. The SPINeasy kit often yielded negative or suboptimal 260/230 ratios (see Supplementary Table 2), suggesting potential carryover of solvents or salts.
DNA fragmentation analysis using the Agilent TapeStation system revealed that the HostZero kit yielded higher-integrity DNA (DIN 7.4–7.9) compared to Mol Comp5 (0–1.8). However, DNA extracted with the SPINeasy kit had very low concentration and was not within the detectable range of the Genomic DNA Screentape used for the TapeStation; therefore, the fragment size or DNA integrity number could not be determined for this kit (Supplementary Figure 2).
Host depletion varies across different samples
Among the four kits tested, the Blood and Tissue kit lacks any host depletion mechanism. Meanwhile, HostZero, Mol Com5, and SPINeasy kits are specifically designed for the selective removal of host DNA. Each kit was evaluated across biological replicates, representing milk samples with different pathogens and varying bacterial loads. qPCR was used to quantify bacterial and bovine DNA, and Ct values for each sample are shown in Figure 3.
Figure 3. Comparison of Ct values for bacterial and bovine targets in DNA extracted using four different DNA extraction and host depletion kits. Each dot or cross represents the Ct value for one of five samples. Lower Ct values indicate higher DNA abundance. Higher Ct values for bovine DNA suggest more effective host depletion. Data is presented as mean (black circle) ± SD. The * indicates a significant change (p-value < 0.05) (Supplementary Table 3b).
Ct values obtained with the Blood and Tissue kit for bacterial targets ranged from 15 to 35, while host Ct values were consistently lower (15–23), indicating a higher abundance of host DNA relative to bacterial DNA (Figure 3). Since it lacks a host depletion mechanism, these results are expected to serve as a baseline for comparing the performance of host-depleting kits. The HostZero kit exhibited minimal variation in host Ct values across samples, indicating consistent depletion of host DNA, regardless of bacterial load and pathogen. However, the average host Ct remained lower than the bacterial Ct, indicating that host DNA was still present in abundance. The Mol Com5 kit provided better and significant Ctdiff (−6.2 – 5.4) compared to Ctdiff in the Blood and Tissue kit. The Ctdiff ranged from −5.2 to 9.1 for HostZero and −8.6 to 13.1 for the SPINeasy kit, with none of them being significant compared to BT (Supplementary Table 2). Mol Com5 was the only kit where the average host Ct exceeded the bacterial Ct, indicating effective depletion of host DNA. This kit also produced low bacterial Ct values, supporting efficient bacterial DNA recovery. Although some variation in Ct values was observed across samples, this is expected due to natural biological differences in bacterial load. Overall, Molysis Complete5 showed the strongest performance in selectively enriching bacterial DNA while reducing host background.
Nanopore sequencing & bioinformatic analysis
Based on the evaluation of DNA yield, integrity, and the bacterial-to-host DNA ratio, we selected samples extracted using the Mol Com5 and HostZero kits for downstream nanopore sequencing. The chosen samples include one representative gram-positive and one gram-negative sample. These samples were infected with a high bacterial load (107 CFU/ml) and showed similar DNA yields and Ct values for both bacterial and host DNA across extractions with both kits. Sequencing read length, quality, and taxonomic classification results are summarized in Table 1.
Table 1. An overview of DNA quality and sequencing metrics for mastitis milk samples using two DNA isolation kits.
Read length, quality, and taxonomy classification
The average read length across all sequenced samples from both kits ranged from approximately 3,100 to 4,100 base pairs (Table 1). Although the S. aureus sample processed with the Mol Com5 kit exhibited the highest mean read length among the sequenced samples, it also showed the most significant degree of fragmentation, with notably smaller DNA fragments, as indicated by TapeStation.
Mean read quality (Q-scores) was slightly higher for the S. aureus (11.4–11.6) sample compared to E. coli (10.4–10.6) in both extraction methods. The number of reads generated from samples extracted using the HostZero kit was nearly 10 times greater than those from the Mol Com5 kit. After taxonomic classification, 88 and 82% of reads from samples extracted with the HostZero kit were assigned to the target pathogen, compared to 77 and 76% of reads from samples extracted with the Mol Com5 kit. Similarly, about 12% of the reads from the HostZero method were from the host (bovine reads), which was lower than the 23% and 15% of reads classified as host in the samples extracted with Mol Com5 (Table 1).
Genome assembly
Both the S. aureus and E. coli assemblies, generated using the HostZero kit, were of the highest quality among the evaluated methods (Table 2). Assembly quality was assessed using fragmentation metrics, where the HostZero kit produced assemblies with higher N50 values (415684 bp for E. coli and 1518621 bp for S. aureus) and higher AuN values (457120.8 bp and 1350486.9 bp, respectively), along with a lower number of contigs (23 and 4 contigs for E. coli and S. aureus), compared to Mol Com5. AuN is a newly defined metric representing the area under the Nx curve, and it is considered more robust and less sensitive to large variations in contig length (Heng Li’s blog., n.d.). The superior quality of the assemblies generated with the HostZero kit was further supported by BUSCO analysis, which revealed a higher proportion of complete BUSCO genes (88% for E. coli and 98% for S. aureus) compared to (44 and 52%, respectively) completeness in the assemblies generated by Mol Com5. In addition, the mean depth of coverage for reads contributing to the final assemblies was remarkably higher with the HostZero kit (63.2x for E. coli and 136.6x for S. aureus) than with the Mol Com5 method (8.1x and 11.7x, respectively).
Table 2. Genome assembly and mapping statistics for mastitis milk samples processed with two DNA isolation kits, Mol Com5 and HostZero.
AMR gene and VF detection
Using DNA extracted with the HostZero kit, 8 AMR genes in the S. aureus sample were identified with an average depth coverage of 109.1 ± 21.2, and 59 VF genes with an average depth coverage of 114.4 ± 39.2. In E. coli, 47 AMR genes (47 ± 14) and 44 VF (51.6 ± 22.5) were identified. In contrast, samples processed with the Mol Com5 kit yielded fewer genes and lower genome coverage: in S. aureus, 7 AMR genes (12 ± 2.4) and 59 VF genes (10.9 ± 4.8) were detected, while in E. coli, 32 AMR genes (5.8 ± 2.4) and 37 VF genes (6.1 ± 2.5) were detected (Figure 4 and Supplementary Table 4). These results highlight the superior performance of the HostZero kit in recovering both AMR and virulence genes from metagenomic milk samples.
Figure 4. Heatmap of the AMR and VF genes detected through metagenomic sequencing of DNA extracted using the Mol Com5 and HostZero kits from milk samples. The panels in the figure represent (top) VF genes in the Staphylococcus aureus sample, (middle) VF genes in the Escherichia coli sample, and (bottom) AMR genes associated with both samples. Genes were considered detected (green) or not detected (red) based on alignment to VFDB, CARD, and NCBI resistance finder using ABricate.
Discussion
Our previous study successfully identified pathogens and detected antibiotic resistance directly from mastitis milk samples, following optimization of a commercial kit and the use of long-read sequencing (Ahmadi et al., 2023). However, one of the main challenges in our previous work was enriching bacterial cells while minimizing inhibitory components such as proteins, lipids, and somatic cells present in the mastitis milk samples. To enrich bacterial cells, high-speed centrifugation is a commonly used method; however, prior studies (Stinson et al., 2021; Sun et al., 2019) have reported that a significant proportion of bacteria remain trapped in the fat fraction after centrifugation of milk samples. In this study, we evaluated three pre-DNA extraction sample treatment methods across milk samples with varying bacterial loads (high, medium, and low CFU), aiming to recover bacterial cells trapped in the fat layer.
The supernatant of the milk samples after applying all three tested treatment methods contained some culturable bacterial cells. Using the standard centrifugation method (Method 1), overall bacterial recovery ranged from 82 to 95%, with greater losses in samples that had low initial bacterial concentrations. These results highlight the limitation of centrifugation in recovering all bacterial cells from milk samples, especially when the bacterial concentration is low. This observed bacterial loss in the fat layer is consistent with the findings of Sun et al. (2019), who reported a 7% loss in the fat layer under similar centrifugation conditions in milk samples spiked with 107 CFU/ml of S. aureus. Similarly, Brewster and Paul (2016) reported that less than 7% of the bacteria in the pellet from raw milk were recovered following centrifugation, primarily due to bacterial entrapment within the fat layer. At higher bacterial loads, the binding capacity of the cream layer becomes saturated, resulting in improved recovery rates as a greater proportion of cells remain in the pellet (Brewster and Paul, 2016). A study conducted by Stinson et al. (2021) isolated both bacterial and human DNA from fat layers of the centrifuged human milk samples, highlighting that a significant amount of cells are trapped in the fat layer after centrifugation (Stinson et al., 2021). Although fat levels are typically lower in the infected milk samples (Bochniarz et al., 2023), the fat portion can vary considerably across samples, introducing variability in the efficiency of cell recovery by centrifugation.
In this study, to release the entrapped bacteria, the fat and whey fractions obtained after centrifugation were further treated with 0.1% Tween 20 and 2% citric acid. Tween 20 is a non-ionic surfactant commonly used to emulsify fats and oils in aqueous solutions (Frederick et al., 2013; Reichler et al., 2023). Meanwhile, the minimum inhibitory concentration of tween20 is 25% (Spadini et al., 2024), and at a lower concentration, such as 0.1%, tween20 has no antimicrobial effect (Dikici et al., 2013). When combined with citric acid, an acidulant that clarifies the protein matrix (Seth and Bajwa, 2015), it might improve the microbial recovery from milk samples. The significant decrease in the CFU count in the supernatant after treating it with Tween 20 and citrate water (Figure 2A) suggests that chemical emulsification can release bacteria trapped in the fat layer. The use of gradient centrifugation can be advantageous for effectively separating cells in complex matrices, such as those found in milk. Previous studies (Fukushima et al., 2007; Meisel et al., 2011) have demonstrated the ability of Percoll gradients to effectively separate bacterial cells from milk and other complex food matrices.
We extracted DNA from the pellets obtained after treating aliquots of the same milk samples with all three methods and compared the bacterial and host Ct values. Although Method 1 led to the greatest loss of culturable bacterial cells (CFU count) in the supernatant, it consistently produced the lowest bacterial Ct and the highest host Ct, along with a balanced bacterial-to-host DNA ratio, as shown by qPCR analysis. This is important in applications like metagenomics or pathogen detection, where host DNA can dominate sequencing results and hide microbial signals. Methods 2 and 3 are more time-consuming, labor-intensive, and do not improve bacterial DNA recovery. Therefore, treating milk samples with Method 1 (centrifugation alone) is the preferred choice for microbial metagenomics and rapid diagnosis.
Host DNA depletion is a vital step in direct metagenomic sequencing of clinical samples because the high amount of host DNA can overshadow microbial DNA and interfere with accurate microbial profiling. In our previous study, we tested various DNA isolation kits specifically designed for microbial DNA extraction from food samples. However, pathogen and AMR identification was only successful with the Mol Com5 kit, which included additional micrococcal nuclease treatment to remove host DNA and enrich microbial DNA selectively (Ahmadi et al., 2023). In this study, we evaluated the performance of various commercial DNA kits with host depletion mechanisms in recovering bacterial DNA while minimizing host DNA contamination from bovine mastitis milk samples for downstream microbial metagenomics analysis. The Blood and Tissue kit, which lacks a selective host DNA depletion mechanism, served as a reference in this study. Although this kit produced higher DNA yield and better purity ratios, these results do not make it the best for metagenomics, as they reflect the total DNA from both host and pathogen, rather than specifically microbial DNA. Several studies (Ahmadi et al., 2023; Yap et al., 2020; Liu M. et al., 2023; Bellankimath et al., 2024) have utilized DNA extraction kits without specific host depletion steps for milk and other clinical samples and reported that the majority of sequencing reads were of host origin. This high proportion of host DNA overshadows microbial DNA, which reduces the effectiveness of metagenomic sequencing in clinical diagnosis.
Three widely used kits for selective host DNA depletion and microbial DNA extraction were tested. The DNA yield was comparatively higher in samples with greater bacterial concentrations (CFU/ml) (Supplementary Table 2); however, the overall DNA yield remained low across all kits. The lower yield may result from the removal of host DNA, as the Blood and Tissue kit, which does not specifically deplete host DNA, consistently produced higher DNA yields and good purity ratios in all samples. Similarly, lower purity ratios for host depletion kits may result from lower DNA concentrations. At low DNA absorbance levels in UV-Vis spectrophotometry, the DNA signal contributes minimally, making it challenging to distinguish it from background contaminants. As a result, contaminants appear disproportionately in the purity ratio calculation, resulting in misleading values. The fragment sizes and DNA integrity values of DNA extracted with the HostZero kit were notably greater than those from the Mol Com5 kit, indicating better preservation of high-molecular-weight DNA. The longer DNA fragments recovered by the HostZero kit are beneficial for metagenomic nanopore sequencing, where longer DNA molecules support better genome assembly and taxonomic classification (Quince et al., 2017; Maghini et al., 2021). No fragment size and DIN values were obtained for the DNA produced by the SPINeasy kit, which may be due to extremely low yield (Supplementary Table 2) or excessive fragmentation, potentially caused by suboptimal lysis or purification steps. Such degradation or loss of DNA significantly limits the usefulness of this kit for downstream metagenomic analysis, where both DNA quantity and integrity are crucial for accurate microbial profiling.
The qPCR-based assessment of bacterial and bovine DNA showed distinct differences in performance among the samples. The Blood and Tissue kit, as expected, yielded moderate bacterial Ct values but consistently low Ct values for the host target, indicating substantial presence of host DNA. Both the HostZero kit and Mol Com5 kit demonstrated consistent host DNA depletion across all tested samples, while preserving the bacterial DNA, which is supported by the findings of Marchukov et al. (2023). However, complete depletion of host DNA is not achievable, and it heavily depends on the number of somatic cells and the change in sample composition due to infection (Marchukov et al., 2023). Samples with lower bacterial concentrations yielded very less DNA and had very high Ct values for both bacterial and host targets compared to samples with higher CFU. Studies have indicated that the number of somatic cells and bacterial count greatly affect both microbial and total DNA yield, with samples containing fewer somatic cells also being challenging for DNA extraction (Duarte and Porcellato, 2024). This suggests the need to develop a DNA extraction approach that effectively reduces host DNA while preserving bacterial cells to facilitate metagenomic diagnosis in samples with low bacterial content. The elevated Ct values for both bacterial and bovine targets in DNA extracted with the SPINeasy kit may result from inefficient extraction processes or excessively vigorous depletion steps, causing non-specific cell lysis and loss of DNA. This interpretation is reinforced by the notably low DNA yield from this method.
Samples with higher bacterial concentrations exhibited negative Ctdiff values across all three host depletion kits, indicating efficient host depletion and a higher proportion of bacterial DNA (Supplementary Table 2). This further highlights the need to develop DNA extraction methods suitable for samples with low bacterial loads. Both the Mol Com5 kit and the HostZero kit demonstrated optimal and comparable performance in samples with higher CFU counts. However, in samples with lower CFU, the Mol Com5 kit showed a smaller Ctdiff than the other tested kits, suggesting it has a better ability to enrich bacterial DNA in low-biomass samples. Despite this advantage, the Mol Com5 kit produced comparatively lower total DNA, which could limit its use in downstream sequencing applications. Additionally, the Mol Com5 kit effectively recovers DNA from gram-positive bacteria after host depletion. However, its performance with gram-negative bacteria was less optimal compared to the HostZero and Blood and Tissue kits (Supplementary Figure 3). Gram-negative bacteria have a thinner cell wall and are more easily lysed than gram-positive bacteria. The chaotropic buffer used in the Mol Com5 protocol might be too harsh for fragile Gram-negative bacterial cells, leading to over-lysis and consequently lower DNA recovery. Several previously published studies have also demonstrated the negative effect of the Mol Com5 kit on the recovery of gram-negative bacterial DNA (Horz et al., 2008; Loonen et al., 2013).
To overcome the limitation of low DNA yield, the ONT Rapid PCR Barcoding kit was used for library preparation, which is optimized for low input samples and requires less than 5 ng of starting DNA. Previous studies (Zhang et al., 2022; Simpson et al., 2023) have successfully used a PCR barcoding kit for low biomass samples. This kit involves PCR amplification of DNA, which produces amplicons of 2–5 kb (Oxford Nanopore Technologies, 2018). The comparable mean read lengths observed across all sequenced samples reflect the uniform amplicon sizes generated during the PCR, rather than inherent differences in initial DNA fragment sizes. However, we observed that samples with higher initial DNA integrity produced more reads and a higher quality assembly (Tables 1, 2). This has been clearly reflected in the E. coli sample extracted with the Mol Com5 kit, where, despite having a higher DNA fragment size, a low number of reads were generated. The TapeStation analysis revealed a peak of DNA fragments at 16,555; however, the peak concentration was very low (0.238 ng), and the DIN value was 1.8, which appears to be the reason for the lower number of reads and poor genome mapping. In this study, for DNA isolated using both HostZero and Mol Com5 kits, more than 75% of collected reads were assigned to the target pathogen, which is similar to the findings of Ahmadi et al. (2023) and Wright et al. (2023). The number of reads assigned to the target pathogen is slightly lower in Mol Com5.
Both samples sequenced in this study contained very high concentrations of bacteria (107 CFU/ml). A limitation of this study is the small sample size and the focus on samples with high bacterial loads. The ability of these methods to detect pathogens and AMR genes in samples with lower bacterial concentrations using nanopore sequencing remains to be tested. However, Grützke et al. (2021) reported the identification of the pathogen using metagenomics shotgun sequencing from milk samples spiked with as low as 101 CFU/ml of Brucella abortus was isolated with the HostZero kit. Although ONT offers adaptive sequencing, where only the DNA strand of interest is sequenced, we decided to disable this feature to gain a comprehensive understanding of the kit’s performance in host depletion and direct sequencing.
The Ct values for the bacterial target (17.6 and 17.2) and the bovine target (21.8 and 22) were similar in S. aureus infected samples, where DNA was extracted using both Mol Com5 and HostZero kits. The Ctdiff was −4.2 for Mol Com5 and −4.8 for HostZero, a difference of only 0.6. Despite this small difference, HostZero produced about 10% more bacterial reads and 10% fewer bovine reads (Table 1) than Mol Com5. In contrast, for the E. coli sample, the bacterial Ct values were 18.8 (Mol Com5) and 14.1 (HostZero), and bovine Ct values were 25 (Mol Com5) and 19.3 (HostZero), resulting in Ctdiff values of −6.2 (Mol Com5) and −5.2 (HostZero), a ΔCt difference of 1. This resulted in only 6% more target bacterial reads and 2% fewer bovine reads with the HostZero kit. These findings suggest that while Ctdiff values from qPCR can give a rough estimate of host depletion and bacterial enrichment, they do not necessarily correlate proportionally with differences in sequencing read distributions.
Contiguity is essential for downstream genomic analysis, including taxonomic identification, AMR, and virulence factor detection in diagnostics, as well as structural variant detection and other analyses. Previous studies have shown that fragmentation during extraction adversely affects contiguity, affecting genome completeness and accuracy (Hillmann et al., 2018; Nicholls et al., 2019). Although a PCR barcoding kit was used in this study, which produces sequencing reads of similar length, a high number of reads and comprehensive genome assembly were achieved using the high-integrity DNA produced by the HostZero kit. The superior contiguity attained with HostZero supports its application in workflows requiring high-fidelity genome reconstruction.
Conclusion
Sample preparation, which includes bacterial cell enrichment and DNA extraction, is crucial for culture-independent nanopore sequencing. This study demonstrates that centrifugation alone is sufficient to enrich bacterial cells from milk samples, eliminating the need for additional fat and whey fraction treatment with Tween 20 and citric acid. Additionally, effective host DNA depletion and microbial DNA enrichment are vital for diagnosing mastitis from infected milk samples. Among the tested methods with a selective host depletion mechanism, the HostZero kit proved to be the most effective in producing higher DNA with better integrity, which is beneficial for long-read sequencing and subsequent bioinformatics analysis. Ct values from qPCR provided insight into host depletion, which was reflected in sequencing; however, they may not directly correspond to the proportion of host and pathogen reads obtained from sequencing. This study supports and confirms the ability of a culture-free metagenomic nanopore sequencing approach to identify both gram-positive and gram-negative pathogens, as well as their antibiotic resistance profiles, in bovine milk samples from mastitic bovines, consistent with our previous findings from Ahmadi et al. (2023). Future studies will focus on sequencing a larger number of samples infected with different mastitis pathogens.
Data availability statement
The data presented in the study are deposited in the European Nucleotide Archive (ENA) repository, accession number PRJEB95132.
Author contributions
CC: Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing, Data curation, Software, Validation, Visualization. AK: Formal analysis, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing JA: Formal analysis, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing. MS: Writing – review & editing. LS: Writing – Writing – review & editing. RA: Investigation, Methodology, Supervisions, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work is supported by funding from the OH-AMR-Diag project, which is financed by the Research Council of Norway (project number 336420).
Conflict of interest
MS and LS were employed by TINE SA.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2025.1680165/full#supplementary-material
Footnotes
References
Ahmadi, A., Khezri, A., Nørstebø, H., and Ahmad, R. (2023). A culture-, amplification-independent, and rapid method for identification of pathogens and antibiotic resistance profile in bovine mastitis milk. Front. Microbiol. 13:1104701. doi: 10.3389/fmicb.2022.1104701
Alcock, B. P., Huynh, W., Chalil, R., Smith, K. W., Raphenya, A. R., Wlodarski, M. A., et al. (2023). CARD 2023: Expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database. Nucleic Acids Res. 51, D690–D699. doi: 10.1093/nar/gkac920
Ali, J., Johansen, W., and Ahmad, R. (2024). Short turnaround time of seven to nine hours from sample collection until informed decision for sepsis treatment using nanopore sequencing. Sci. Rep. 14:6534. doi: 10.1038/s41598-024-55635-z
Bellankimath, A. B., Chapagain, C., Branders, S., Ali, J., Wilson, R. C., Johansen, T. E. B., et al. (2024). Culture and amplification-free nanopore sequencing for rapid detection of pathogens and antimicrobial resistance genes from urine. Eur. J. Clin. Microbiol. Infect. Dis. 43, 2177–2190. doi: 10.1007/s10096-024-04929-1
Bochniarz, M., Błaszczyk, P., Szczubiał, M., Vasiu, I., Adaszek, Ł, Michalak, K., et al. (2023). Comparative analysis of total protein, casein, lactose, and fat content in milk of cows suffering from subclinical and clinical mastitis caused by Streptococcus spp. J. Vet. Res. 67, 251–257. doi: 10.2478/jvetres-2023-0028
Brewster, J. D., and Paul, M. (2016). Short communication: Improved method for centrifugal recovery of bacteria from raw milk applied to sensitive real-time quantitative PCR detection of Salmonella spp. J. Dairy Sci. 99, 3375–3379. doi: 10.3168/jds.2015-9655
Danecek, P., Bonfield, J. K., Liddle, J., Marshall, J., Ohan, V., Pollard, M. O., et al. (2021). Twelve years of SAMtools and BCFtools. GigaScience 10:giab008. doi: 10.1093/gigascience/giab008
De Coster, W., D’Hert, S., Schultz, D. T., Cruts, M., and Van Broeckhoven, C. (2018). NanoPack: Visualizing and processing long-read sequencing data. Bioinformatics 34, 2666–2669. doi: 10.1093/bioinformatics/bty149
Dikici, A., Arslan, A., Yalcin, H., Ozdemir, P., Aydin, I., and Calicioglu, M. (2013). Effect of Tween 20 on antibacterial effects of acidic, neutral and alkaline decontaminants on viability of Salmonella on chicken carcasses and survival in waste decontamination fluids. Food Control 30, 365–369. doi: 10.1016/j.foodcont.2012.07.043
Duarte, V. D. S., and Porcellato, D. (2024). Host DNA depletion methods and genome-centric metagenomics of bovine hindmilk microbiome. mSphere 9:e00470-23. doi: 10.1128/msphere.00470-23
Feldgarden, M., Brover, V., Haft, D. H., Prasad, A. B., Slotta, D. J., Tolstoy, I., et al. (2019). Validating the AMRFinder tool and resistance gene database by using antimicrobial resistance genotype-phenotype correlations in a collection of isolates. Antimicrob. Agents Chemother. 63:e00483-19. doi: 10.1128/AAC.00483-19
Frederick, J. L., Walker, S. P., Crofcheck, C. L., Newman, M., and Payne, F. (2013). Evaluation of chemical additives for the separation and recovery of bacteria from food matrices. Biol. Eng. Trans. 105–115. doi: 10.13031/bet.6.10084
Fukushima, H., Katsube, K., Hata, Y., Kishi, R., and Fujiwara, S. (2007). Rapid separation and concentration of food-borne pathogens in food samples prior to quantification by viable-cell counting and real-time PCR. Appl. Environ. Microbiol. 73, 92–100. doi: 10.1128/AEM.01772-06
Grützke, J., Gwida, M., Deneke, C., Brendebach, H., Projahn, M., Schattschneider, A., et al. (2021). Direct identification and molecular characterization of zoonotic hazards in raw milk by metagenomics using Brucella as a model pathogen. Microb. Genomics 7:000552. doi: 10.1099/mgen.0.000552
Gu, W., Miller, S., and Chiu, C. Y. (2019). Clinical metagenomic next-generation sequencing for pathogen detection. Annu. Rev. Pathol. Mech. Dis. 14, 319–338. doi: 10.1146/annurev-pathmechdis-012418-012751
Gu, W., Wang, J., Qin, X., Duan, M., Wang, M., Guan, Y., et al. (2025). The performance of nanopore sequencing in rapid detection of pathogens and antimicrobial resistance genes in blood cultures. Diagn. Microbiol. Infect. Dis. 111:116720. doi: 10.1016/j.diagmicrobio.2025.116720
Gurevich, A., Saveliev, V., Vyahhi, N., and Tesler, G. (2013). QUAST: Quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075. doi: 10.1093/bioinformatics/btt086
Heng Li’s blog. (n.d.). auN: a new metric to measure assembly contiguity. Available online at: https://lh3.github.io/2020/04/08/a-new-metric-on-assembly-contiguity (accessed September 13, 2025).
Hillmann, B., Al-Ghalith, G. A., Shields-Cutler, R. R., Zhu, Q., Gohl, D. M., Beckman, K. B., et al. (2018). Evaluating the information content of shallow shotgun metagenomics. mSystems 3:e00069-18. doi: 10.1128/msystems.00069-18
Horz, H.-P., Scheer, S., Huenger, F., Vianna, M. E., and Conrads, G. (2008). Selective isolation of bacterial DNA from human clinical specimens. J. Microbiol. Methods 72, 98–102. doi: 10.1016/j.mimet.2007.10.007
Imam, T., Horsman, S., Wood, B., Grewar, J. D., Langhorne, C., Price, R., et al. (2024). Assessment of sensitivity and specificity of bacterial culture and the VetMAX™ MastiType Multi Kit in detecting Streptococcus uberis and Escherichia coli in milk samples from dairy cows with clinical mastitis in subtropical Australia. Prev. Vet. Med. 233:106358. doi: 10.1016/j.prevetmed.2024.106358
Kolmogorov, M., Yuan, J., Lin, Y., and Pevzner, P. A. (2019). Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546. doi: 10.1038/s41587-019-0072-8
Li, G., Sun, H., Ye, Y., Chen, L., Zhang, W., Yu, S., et al. (2025). Clinical utility of nanopore-targeted sequencing for diagnosing and treating pulmonary infectious diseases from bronchoalveolar lavage fluid. Front. Cell. Infect. Microbiol. 15:1469440. doi: 10.3389/fcimb.2025.1469440
Li, H. (2018). Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100. doi: 10.1093/bioinformatics/bty191
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., et al. (2009). The sequence alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079. doi: 10.1093/bioinformatics/btp352
Liu, B., Zheng, D., Jin, Q., Chen, L., and Yang, J. (2019). VFDB 2019: A comparative pathogenomic platform with an interactive web interface. Nucleic Acids Res. 47, D687–D692. doi: 10.1093/nar/gky1080
Liu, J., Liu, H., Cao, G., Cui, Y., Wang, H., Chen, X., et al. (2023). Microbiota characterization of the cow mammary gland microenvironment and its association with somatic cell count. Vet. Sci. 10:699. doi: 10.3390/vetsci10120699
Liu, M., Yang, S., Wu, S., Chen, L., Li, S., Li, Z., et al. (2023). Detection of pathogens and antimicrobial resistance genes directly from urine samples in patients suspected of urinary tract infection by metagenomics nanopore sequencing: A large-scale multi-centre study. Clin. Transl. Med. 13:e824. doi: 10.1002/ctm2.824
Loonen, A. J. M., Bos, M. P., Van Meerbergen, B., Neerken, S., Catsburg, A., Dobbelaer, I., et al. (2013). Comparison of pathogen DNA isolation methods from large volumes of whole blood to improve molecular diagnosis of bloodstream infections. PLoS One 8:e72349. doi: 10.1371/journal.pone.0072349
Maghini, D. G., Moss, E. L., Vance, S. E., and Bhatt, A. S. (2021). Improved high-molecular-weight DNA extraction, nanopore sequencing and metagenomic assembly from the human gut microbiome. Nat. Protoc. 16, 458–471. doi: 10.1038/s41596-020-00424-x
Malcata, F. B., Pepler, P. T., O’Reilly, E. L., Brady, N., Eckersall, P. D., Zadoks, R. N., et al. (2020). Point-of-care tests for bovine clinical mastitis: What do we have and what do we need? J. Dairy Res. 87, 60–66. doi: 10.1017/S002202992000062X
Marchukov, D., Li, J., Juillerat, P., Misselwitz, B., and Yilmaz, B. (2023). Benchmarking microbial DNA enrichment protocols from human intestinal biopsies. Front. Genet. 14:1184473. doi: 10.3389/fgene.2023.1184473
Meisel, S., Stöckel, S., Elschner, M., Rösch, P., and Popp, J. (2011). Assessment of two isolation techniques for bacteria in milk towards their compatibility with Raman spectroscopy. Analyst 136:4997. doi: 10.1039/c1an15761b
Nguyen, M., Olson, R., Shukla, M., VanOeffelen, M., and Davis, J. J. (2020). Predicting antimicrobial resistance using conserved genes. PLoS Comput. Biol. 16:e1008319. doi: 10.1371/journal.pcbi.1008319
Nicholls, S. M., Quick, J. C., Tang, S., and Loman, N. J. (2019). Ultra-deep, long-read nanopore sequencing of mock microbial community standards. GigaScience 8:giz043. doi: 10.1093/gigascience/giz043
Oxford Nanopore Technologies. (2018). Rapid sequencing DNA - PCR Barcoding (SQK-RPB004). Available online at: https://nanoporetech.com/document/rapid-pcr-barcoding (accessed June 4, 2025).
Quince, C., Walker, A. W., Simpson, J. T., Loman, N. J., and Segata, N. (2017). Shotgun metagenomics, from sampling to analysis. Nat. Biotechnol. 35, 833–844. doi: 10.1038/nbt.3935
Rasmussen, P., Barkema, H. W., Osei, P. P., Taylor, J., Shaw, A. P., Conrady, B., et al. (2024). Global losses due to dairy cattle diseases: A comorbidity-adjusted economic analysis. J. Dairy Sci. 107, 6945–6970. doi: 10.3168/jds.2023-24626
Reichler, S. J., Orta-Ramirez, A., Martin, N. H., and Wiedmann, M. (2023). Culture-independent bacterial cell extraction from fluid milk and oat-based beverage for basic qualitative microscopy. JDS Commun. 4, 65–69. doi: 10.3168/jdsc.2022-0320
Ruegg, P. L. (2017). A 100-year review: Mastitis detection, management, and prevention. J. Dairy Sci. 100, 10381–10397. doi: 10.3168/jds.2017-13023
Satam, H., Joshi, K., Mangrolia, U., Waghoo, S., Zaidi, G., Rawool, S., et al. (2023). Next-generation sequencing technology: Current trends and advancements. Biology 12:997. doi: 10.3390/biology12070997
Seegers, H., Fourichon, C., and Beaudeau, F. (2003). Production effects related to mastitis and mastitis economics in dairy cattle herds. Vet. Res. 34, 475–491. doi: 10.1051/vetres:2003027
Seemann, T. (2025). tseemann/abricate. Available online at: https://github.com/tseemann/abricate (accessed May 29, 2025).
Seth, K., and Bajwa, U. (2015). Effect of acidulants on the recovery of milk constituents and quality of Mozzarella processed cheese. J. Food Sci. Technol. 52, 1561–1569. doi: 10.1007/s13197-013-1176-7
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V., and Zdobnov, E. M. (2015). BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212. doi: 10.1093/bioinformatics/btv351
Simpson, A. C., Tighe, S., Wong, S., Leo, P., Parker, C., Chander, A. M., et al. (2023). Analysis of microbiomes from ultra-low biomass surfacesusing novel surface sampling and nanopore sequencing. J. Biomol. Tech. 34, doi: 10.7171/3fc1f5fe.bac4a5b3
Smistad, M., Bakka, H. C., Sølverød, L., Jørgensen, H. J., and Wolff, C. (2023). Prevalence of udder pathogens in milk samples from Norwegian dairy cows recorded in a national database in 2019 and 2020. Acta Vet. Scand. 65:19. doi: 10.1186/s13028-023-00681-2
Spadini, C., Iannarelli, M., Carrillo Heredero, A. M., Montanaro, S. L., Mezzasalma, N., Simoni, M., et al. (2024). Stability of the antimicrobial activity of selected essential oils and nature identical compounds and their interaction with Tween 20 against reference bacterial strains of zootechnical interest. Ital. J. Anim. Sci. 23, 189–199. doi: 10.1080/1828051X.2024.2304042
Stinson, L. F., Ma, J., Rea, A., Dymock, M., and Geddes, D. T. (2021). Centrifugation does not remove bacteria from the fat fraction of human milk. Sci. Rep. 11:572. doi: 10.1038/s41598-020-79793-y
Strich, J. R., Heil, E. L., and Masur, H. (2020). Considerations for empiric antimicrobial therapy in sepsis and septic shock in an era of antimicrobial resistance. J. Infect. Dis. 222, S119–S131. doi: 10.1093/infdis/jiaa221
Sun, L., Dicksved, J., Priyashantha, H., Lundh, Å, and Johansson, M. (2019). Distribution of bacteria between different milk fractions, investigated using culture-dependent methods and molecular-based and fluorescent microscopy approaches. J. Appl. Microbiol. 127, 1028–1037. doi: 10.1111/jam.14377
TINE. (2025). Statistikksamlingen 2024 - utviklingstrekk i norsk melkeproduksjon. Available online at: https://medlem.tine.no/fag-og-forskning/statistikksamlingen-2024-utviklingstrekk-i-norsk-melkeproduksjon (accessed April 24, 2025).
Urrutia-Angulo, L., Ocejo, M., Oporto, B., Aduriz, G., Lavín, J. L., and Hurtado, A. (2024). Unravelling the complexity of bovine milk microbiome: Insights into mastitis through enterotyping using full-length 16S-metabarcoding. Anim. Microbiome 6:58. doi: 10.1186/s42523-024-00345-0
Usui, M., Akiyoshi, M., Fukuda, A., Iwano, H., and Kato, T. (2023). 16S rRNA nanopore sequencing for rapid diagnosis of causative bacteria in bovine mastitis. Res. Vet. Sci. 161, 45–49. doi: 10.1016/j.rvsc.2023.06.006
WHO. (2016). Global action plan on antimicrobial resistance. Available online at: https://www.who.int/publications/i/item/9789241509763 (accessed April 24, 2025)
Wright, M. L., Podnar, J., Longoria, K. D., Nguyen, T. C., Lim, S., Garcia, S., et al. (2023). Comparison of commercial DNA extraction kits for whole metagenome sequencing of human oral, vaginal, and rectal microbiome samples. bioRxiv [Preprint] doi: 10.1101/2023.02.01.526597
Yamin, D., Uskoković, V., Wakil, A., Goni, M., Shamsuddin, S., Mustafa, F., et al. (2023). Current and future technologies for the detection of antibiotic-resistant bacteria. Diagnostics 13:3246. doi: 10.3390/diagnostics13203246
Yap, M., Feehily, C., Walsh, C. J., Fenelon, M., Murphy, E. F., McAuliffe, F. M., et al. (2020). Evaluation of methods for the reduction of contaminating host reads when performing shotgun metagenomic sequencing of the milk microbiome. Sci. Rep. 10:21665. doi: 10.1038/s41598-020-78773-6
Zhang, H., Ren, Y., and Bao, X. (2009). Simultaneous determination of (fluoro)quinolones antibacterials residues in bovine milk using ultra performance liquid chromatography–tandem mass spectrometry. J. Pharm. Biomed. Anal. 49, 367–374. doi: 10.1016/j.jpba.2008.10.043
Zhang, L., Huang, W., Zhang, S., Li, Q., Wang, Y., Chen, T., et al. (2022). Rapid detection of bacterial pathogens and antimicrobial resistance genes in clinical urine samples with urinary tract infection by metagenomic nanopore sequencing. Front. Microbiol. 13:858777. doi: 10.3389/fmicb.2022.858777
Keywords: bovine mastitis, DNA extraction, nanopore sequencing, culture-free sequencing, bioinformatics, metagenomics, cow milk
Citation: Chapagain C, Khezri A, Ali J, Smistad M, Sølverød LS and Ahmad R (2025) Optimizing sample preparation for culture-free nanopore sequencing to enable rapid pathogen and antimicrobial resistance profiling in bovine mastitis. Front. Microbiol. 16:1680165. doi: 10.3389/fmicb.2025.1680165
Received: 05 August 2025; Accepted: 07 October 2025;
Published: 06 November 2025.
Edited by:
Kevin M. Koo, The University of Queensland, AustraliaReviewed by:
Samah Attia Algharib, Benha University, EgyptWill Anderson, The University of Queensland, Australia
Copyright © 2025 Chapagain, Khezri, Ali, Smistad, Sølverød and Ahmad. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Rafi Ahmad, cmFmaS5haG1hZEBpbm4ubm8=
Liv Synnøve Sølverød2