- Yunnan Center for Disease Control and Prevention (Yunnan Academy of Preventive Medicine), Kunming, China
Background: Whole genome sequencing (WGS) can provide valuable insights into the transmission patterns of Mycobacterium tuberculosis, which can inform effective intervention strategies for controlling tuberculosis (TB) in local areas.
Methods: We conducted a retrospective study using strains isolated from L County between January 2019 and July 2020. Demographic data of patients were retrieved from National Tuberculosis Information Management System. A whole genome sequencing-based clustering analysis was performed to assess the transmission characteristics of pulmonary TB in L County. Univariate and multivariate logistic regression analyses were employed to identify potential risk factors associated with clustering.
Results: A total of 143 strains from 136 confirmed TB cases were available for analysis, among them 74.26% of M. tuberculosis strains belonged to Lineage 2, while Lineage 4 comprised the reminder. Among the pulmonary TB cases, 38.97% (53/136) exhibited genomic-clustered strains (≤12 SNP threshold). Notably, 29 (62.26%) strains belonging to 15 clusters were isolated from the same township, while 10 (18.87%) strains belonging to 5 clusters originated from the same villages. The younger age group (≤44) exhibited a higher risk of clustering (aOR 19.21, 95% CI: 2.37–155.80, p = 0.006). Additionally, 9 (27.27%) drug resistant (DR) strains were identified within cluster, and 4 serial DR strains of 2 patients showed the accumulation of acquired drug resistance.
Conclusion: A significant proportion of tuberculosis cases in L County are attributed to recent transmission occurring within communities or neighborhoods across townships for both sensitive and drug-resistant TB. These findings suggest that an active case-finding strategy should be implemented in this area to identify patients at the early stages. Furthermore, active screening should not only target key populations such as close household contacts or old population but also extend to all residents over 15 years. Improving chemotherapy treatment quality and patient follow-up is essential in L County.
Background
Tuberculosis, caused by Mycobacterium tuberculosis, continues to pose a significant global public health threat. According to the Global Tuberculosis Report 2024, an estimate of 10.8 million new TB cases were reported worldwide, resulting in an incidence rate of 134 per 100,000 population. This reflects a sustained increase since 2021 and suggests that TB once again emerge as the leading cause of death from a single infectious agent (1). China ranked third among the 30 high-TB burdens countries, which highlighting ongoing challenges in the prevention and control of this disease (1).
The previous studies have demonstrated significant regional variations in the incidence and mortality of pulmonary tuberculosis (PTB) in China (2). Yunnan Province, located in the southwest of China, also bears a high burden of TB, the geospatial difference of TB prevalence distribution has been observed in Yunnan (3), and the current decline rate in this area is insufficient to meet the targets which is set by the World Health Organization (WHO) End TB Strategy (4, 5).
Controlling human-to-human transmission of TB is crucial for achieving the targets set by the WHO (6, 7). Prevention and control strategies that aligned with transmission dynamics could significantly accelerate the decline of this epidemic. However, due to the nature of the TB, traditional genotyping methods, including IS6110-RFLP, mycobacterial interspersed repetitive unit-variable number tandem repeats (MIRU-VNTRs) typing and spoligotyping, are insufficient for accurately decoding transmission patterns within populations, particularly in Beijing genotype strains are predominant areas, due to their low discriminatory power (8). Our previous molecular epidemiological study has established the predominance of the Beijing genotype across Yunnan province, however, it offered limited insights into the actual transmission networks (9).
WGS data can provide valuable insights into transmission dynamic, in which strains with near-identical genetic variants are considered to possess a transmission relationship (10, 11). However, previous WGS-based studies on TB transmission in China have predominantly focused on large cities characterized by high population density and frequent movement, such as Shenzhen (12). In contrast, rural areas of Yunnan, which are marked by geographical isolation and stable demographics, may exhibit distinct transmission patterns. Our study aimed to elucidate the dynamics of tuberculosis transmission in Yunnan’s region with the high TB burden through WGS analysis and generate evidence-based data to inform targeted TB control strategies in this endemic area.
Methods
Study population and setting
L County, located in northwestern Yunnan Province, served as one of 33 provincial-level drug-resistance surveillance sites in Yunnan. This mountainous region encompasses eight townships with 82.1% of residents live in rural mountainous areas (95% of the county’s terrain consists of high mountains and valleys). Furthermore, this county was the high incidence area of tuberculosis in Yunnan Province. We conducted a retrospective study using Mycobacterium tuberculosis strains collected in L County between January 2019 and July 2020. All confirmed TB cases of were consecutively enrolled for sputum culture at the laboratory of county-level hospital, culture-positive strains were subsequently transferred to the Tuberculosis Reference Laboratory of Yunnan province for further analysis. The demographic data of patients were retrieved from National Tuberculosis Information Management System. This retrospective study cause no harm to human subjects and involves no sensitive personal information or commercial interests, could be exempted from ethical review in accordance with the national legislation.
Whole-genome sequence and bioinformatic analysis
Genomic DNA were extracted using the cetyl-trimethylammonium bromide-sodium chloride (CTAB) method (13). WGS was performed on the HiSeq (Illumina1) platform with an expected coverage of 100×, raw sequence data quality was assessed using FastQC (v0.12.0) (14), followed by adapter trimming and quality filtering, high-quality reads were aligned to the M. tuberculosis H37Rv reference genome (GenBank accession NC_000962.3) using BWA-MEM (v0.7.17). The low-quality SNPs and indels were filtered using bcftools (v1.21) with thresholds for minimum quality (Q ≥ 30), read depth (DP ≥ 5), and genotype call rate (≥90%). Variants with an alternate allele frequency (AF) < 0.75 were also excluded. Mutations included synonymous and nonsynonymous mutation, missense, frameshift mutations were annotated with SnpEff (v4.3t). Drug resistance and lineage were identified by TB-profiler (15, 16). Mutations matrix based on the mutation sites of each strain was generated by Snippy (v4.4.3), pairwise SNP distances were calculated using snp-dists (v0.7.0).
Phylogenetic analysis
Based on the identified SNPs, a phylogeny tree was constructed with IQtree, using the maximum likelihood method with 100 bootstraps and visualized with iTOL (17). The minimum spanning tree was generated with Phyloviz. To minimize the potential distort of phylogenetic signals and transmission clustering, variations in known drug-resistant genes or intergenic regions, direct repeat regions, microsatellite-like sequences, transposition insertion sequences (such as IS6110), ESX secretion system protein genes, and PE/PPE/PGRS family genes were masked (18, 19). Transmission patterns were classified according to the phylogenetic clustering and geographic distribution. Strains sharing ≤12 SNPs between genomes (20–22) and originating from the same township were considered indicative of community transmission. Clustered strains from different townships were classified as cross-township transmission events.
Data analysis
The clustering rate was calculated as following: (Number of clustered strains/Total number of sequenced strains) × 100%. We distinguished between drug-resistant tuberculosis (DR-TB) strains with acquired resistance and potential primary sensitive TB or DR-TB strains based on the presence of additional resistance-associated mutations. If strains within a cluster shared the same resistant mutation, we inferred that the emergence of drug resistant tuberculosis was attributable to the transmission of drug-resistant strains. Conversely, if the strains within a cluster exhibited different genotypic resistance and were isolated from previously treated patients, we considered it acquired drug resistance during transmission.
The distribution of continuous and categorical variables between groups was compared using the Wilcoxon rank sum test or the chi-square test when applicable. Logistic regression analysis was used to calculate the odds ratios (OR) and 95% confidence intervals (CI) for risk factors associated with genomic clustering. Variables with p-values less than 0.2 in the univariable analysis were included in the multivariable analysis to calculate the adjusted odds ratios (aOR). Factors with a p-value less than 0.05 in the final model were considered statistical significantly. All analysis were performed use SPSS (version17.0).
Results
Demographic characteristics of the patients
From January 2019 to July 2020, we collected 145 Mycobacterium tuberculosis isolates from 138 patients (6 patients provided multiple isolates), two isolates were excluded from subsequent analyses due to sequencing failure (n = 1) and contamination (n = 1). Finally, 136 patients (36.03% were smear-negative cases) included analysis. Among them, 75.0% were male, 96.32%were farmers, 67.65% were new cases, and the mean age ± standard deviation (SD) was 46.48 ± 14.30 years (Table 1).
Drug resistant analysis
The genomic drug resistance prediction based on whole genome sequencing revealed that the overall drug resistance rate of 136 strains against 21 anti-tuberculosis drug was 24.26%. The rates of Rifampicin resistance (RR), multi-drug resistance (MDR), and extensive drug resistance (XDR) were 13.24, 4.41, and 1.47%, respectively. No resistance related mutations were found for new antituberculosis drugs.
Phylogenetic analysis
Among the total 136 strains, the majority of M. tuberculosis strains were belonged to the East-Asian lineage (Lineage2) (74.26%, 101/136), followed by the Euro-American lineage (Lineage4) (25.74%, 35/136), Lineage 2.2.1 (72.06%, 98/136) was the main sub-lineage among 136 strains, following by Lineage4.5 (15.44%, 21/136) (Figure 1A). A geographical divergence in lineage distribution was observed in different township, with Lineage 2 (Beijing genotype) ranging from 20 to 94.74%, and ZP township exhibiting the highest prevalence of this lineage. In contrast, Lineage 4 prevalence varied from 5.26 to 80%, with TE township showing the highest distribution (Figure 1B).
Figure 1. The composition and distribution of lineages. (A) The composition of different lineages and sub-lineages, (B) the main lineages distribution among 8 townships.
The transmission clusters
The maximum-likelihood phylogeny tree (Figure 2) showed limited diversity among L County. Fifty-three isolates grouped into 21 clusters, with a cluster rate of 38.97%, each cluster contained 2 to 6 strains. Among the clustered strains, 29 (54.72%) belonged to 15 clusters isolated from the same township, while 10 (18.87%) belonged to 5 clusters and were isolated from the same villages. The results of genomic clustering analysis suggested that recent transmission may have occurred among approximately 38.97% of patients in L County, with a majority of these transmissions were likely taken place within community or neighborhood.
Figure 2. The maximum likelihood phylogeny tree of 136 strains. The inner branch indicates the genetic distance, the pink belt of inner circle indicates genomic-clustered strains differing by ≤12 single-nucleotide polymorphisms, the different color of middle circle indicates the drug resistant type and the outermost colored dots indicates the resistance to 11 anti-TB drugs. Mycobacterium canettii was used as an outgroup.
To further investigate the characteristics of TB transmission in L County, we conducted a comprehensive analysis of the origins and drug resistance types of the clustered strains. Twelve clusters comprising 34 isolates were classified as cross-township transmissions, while 7 clusters with 15 isolates originated from distinct townships. Additionally, another 5 clusters consisting of 19 isolates included both intra- and inter-township cases, suggesting localized transmission with occasional spillover (Figure 3A). Furthermore, among the identified clusters, 7 contained drug-resistant strains (n = 9), the drug-resistant strains clustering rate was 27.27% (9/33). Notably, five clusters comprised both drug-resistant (n = 5) and drug-susceptible strains, those 5 drug-resistant strains within a cluster exhibited different genotypic resistance and were isolated from previously treated patients, we considered it indicative of resistance during transmission. Two clusters consisted solely of drug-resistant strains (2 strains per cluster), each strain of the same cluster shared with the incidental drug related mutations (2 strains with mutation of rpoB Ser450Leu, the other 2 with mutation of rpsL Lys88Met) profile indicating the primary transmission of resistance (Figure 3B), based on these, we can speculate that recent TB transmission in this area is active for both sensitive and drug-resistant forms.
Figure 3. (A) The minimum spanning tree of 136 strains. The size of the circle represents the number of isolates in the cluster, the color of each circle represents the originated township of strain (TE, ZP, SD, YP were defined as high-burden townships), circle compromised of different color represents the cross-township transmission of this cluster. (B) The minimum spanning tree of 136 strains, The size of the circle represents the number of isolates in the cluster, the color of each circle represents drug resistance type of strains, circle compromised of non-green color represents the cluster contained drug-resistant strains.
Longitudinal analysis of drug resistance acquisition
We obtained the drug resistance acquisition under the selection pressure of anti-tuberculosis drugs by analyzing 13 serial Mycobacterium tuberculosis isolates collected from six patients at various time points during anti-TB therapy. Three isolates (May 2019, May 2020, June 2020) from a previously treated but drug-sensitive patient exhibited no resistance mutations by WGS analysis. Six isolates from three initially drug-sensitive patients (2 new cases, 1 previously treated case) maintained susceptibility throughout standard therapy. In contrast, a retreated MDR-TB patient developed additional resistance to ethambutol (embB Met306Val) and pyrazinamide (pncA Gly24Asp) after 8 months of treatment with 2HRZE/10HRE. A new TB case with initial isoniazid (katG Ser315Thr) and streptomycin (rpsL Lys43Arg) resistance acquired aminoglycoside resistance (rrs_r.1401a > g) after 6 months treatment. This observation suggests that acquisition of drug resistance occurs may be more readily in DR-TB, particularly among those who received sub-optimal anti-tuberculosis therapy, the speculation based on this observation requires further verification.
Factors associated with genomics clustering
In our analysis, we employed logistic regression to identify risk factors associated with clustering. The univariate analysis revealed that the less than 45 age group was significantly linked to clustering, and this association remained significant in the multivariate analysis. Compared to individuals over 65 years old, those in the younger age group exhibited a higher risk of clustering (aOR 19.21, 95% CI: 2.37–155.80, p = 0.006) (Table 2).
Discussion
Tuberculosis remains a significant public health challenge in China, exhibiting considerable variations in transmission characteristics across different regions. Our study presents the demographic characteristics, transmission dynamics, and acquisition drug resistance of Mycobacterium tuberculosis in Yunnan, which burdened by high rates of TB. Our findings provide an initial overview of the transmission patterns within this high-burden context, indicating a limited genetic diversity characterized exclusively by lineage 2 and lineage 4 strains. Our data specifically indicate that 74.26% of the strains belong to lineage 2, while 25.74% are classified as lineage 4. A significant proportion of the clustered strains (73.58%) is attributed to sub-lineage 2.2.1, which represents a modern sub-lineage within the Beijing family. These findings are consistent with previous reports that emphasize the predominance and widespread distribution of these lineages across China (23–25).
In L County, it was observed that 38.97% (53/136) of pulmonary tuberculosis cases exhibited genomic clustering within a period of less than 2 years. This clustering rate significantly exceeds those reported in Luo dian (13.1%) and Kashgar Prefecture (25.6%) from a 1-year study (24, 26), as well as surpassing the national average clustering rate of 23.0% (27). These findings indicate a significantly higher level of community transmission in L County compared to other regions of China. Several factors may contribute to this, first, L County experiences low population mobility while simultaneously experiencing a high prevalence of tuberculosis, coupled with the frequent gatherings that occur during traditional festivals, these factors create optimal conditions for airborne transmission of TB. Second, insufficient health awareness and limited access to healthcare services have resulted in prolonged diagnostic delays (exceeding 2 months) following the onset of symptoms, consequently, tuberculosis transmission due to these delays is significantly more pronounced in this region compared to other areas in Yunnan. Third, we observed suboptimal treatment outcomes in six cases examined, these instances of non-converted may contribute to ongoing transmission even after prolonged treatment duration.
According to the analysis of risk factors, younger individuals demonstrate a higher clustering rate compared to older adults (≥65), likely attributable to their increased social and occupational activities, which is consistent with findings from Shanghai (28). The disparities in lineage distribution highlight the necessity for region-specific strategies in tuberculosis control programs. Research conducted in Central Asia has revealed a significant association between Beijing strains and the spread of multidrug-resistant TB (MDR-TB) (29). Our study found Beijing genotype strains and the drug resistant were not independent risk factor for recent transmission, which contradicts findings from other studies conducted in China (26, 30). Compared to these mentioned studies, our sample size may have limited the statistical power necessary to detect transmission associated with Beijing strains, additionally, the inclusion of only a few clustered drug-resistant strains included could introduce bias when analyzing risk factors. Therefore, larger cohort studies are required to validate our current observations.
This study presents several limitations, primarily its retrospective design as opposed to a prospective approach, the absence of socioeconomic data for comprehensive analysis, and the lack of detailed epidemiological linkage data. Consequently, potential transmission clusters were inferred solely based on genetic distances among strains. Furthermore, accurately estimating the precise timing of potential transmission events presents significant challenges, clusters that originating years ago may still be propagating. The second limitation pertains sample size and composition, which may influence the interpret ability of our findings. During the study period, approximately 75% of patients in L County were presented as smear-negative. However, due to the low culture positivity rate in local laboratories, recruiting additional smear-negative cases for whole-genome sequencing proved challenging. Given that smear-negative cases have been shown to account for at least 30% of secondary transmissions (31), the inclusion of only 49 such cases in this study may lead an underestimation of the true clustering rate. Furthermore, because molecular epidemiological studies typically utilize sampling periods spanning 2 years or longer, transmission chains may be lost during surveillance efforts that are brief (as TB usually takes 6–12 months post-infection before noticeable disease manifests). Consequently, recent transmission events in L County could be far more severe than our findings. In the last, we observed the dissemination of drug-resistant strain and the accumulation of acquired drug resistance, however, only 4 clustered DR-TB strains involved the primary transmission, and serial sampling was limited to just 6 patients. Consequently, we cannot verified the confidence frequency of those events in this study. Therefore, a substantially enlarged cohort is needed to determine the true transmission pattern and drug-acquired characteristic of drug resistance TB in L County.
In conclusion, this study has demonstrated the utility of WGS for tuberculosis TB epidemiology in regions with a high burden of TB, The analysis provides insights into genomic population characteristics of TB specific to L County and indicates that a significant proportion of cases are attributable to recent transmissions— insights that are critical for developing targeted interventions against the transmission of TB. We believe that implementing continuous active case-finding strategies will significantly aid early patient identification within this area—not just focusing on key populations such as close household contacts or old population but expanding screening efforts across all residents aged over 15 years old. Furthermore, treatment regimens should strictly adhere to national guidelines alongside rigorous monitoring protocols, introducing electronic medication adherence tracking tools is also essential as effective intervention measures for enhancing chemotherapy quality and patient follow-up processes. Additionally, rapid drug susceptibility testing recommended by the WHO should be systematically implemented when diagnosing drug-resistant tuberculosis, surging that all close contacts of DR-TB patients are identified. Greater emphasis should also be placed on health monitoring among patients with drug-resistant TB to prevent further accumulation of acquired resistance and subsequent transmissions.
To enhance the characterization of transmission dynamics in future studies, the prospective investigations should focus on incorporating larger representative samples while employing robust analytical methodologies for data, including social contacts, which may ultimately contribute to the development of comprehensive models regarding transmission patterns. Enhancing the rates of favorable culture through direct-from-sputum sequencing, coupled with the implementation of ongoing surveillance, constitutes essential measures required for establishing a comprehensive network that delineates TB transmissions throughout L County. Such initiatives would yield solid scientific evidence enhancing understanding surrounding dynamic aspects related specifically toward rural mountainous regions characterized by low population mobility yet facing considerable burdens posed by tuberculosis.
Data availability statement
All Mycobacterium tuberculosis genomes data is provided by the National Microbiology Data Center. The name of the repository and accession number can be found below: https://nmdc.cn/resource/genomics/project/detail/NMDC10020193.
Ethics statement
Ethical approval was exempt for the study involving humans in accordance with the national legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.
Author contributions
XY: Conceptualization, Formal Analysis, Project administration, Writing – original draft. HR: Data curation, Investigation, Methodology, Writing – review & editing. TC: Data curation, Investigation, Methodology, Writing – original draft. LC: Conceptualization, Project administration, Writing – review & editing. QN: Data curation, Investigation, Methodology, Writing – original draft. SY: Data curation, Investigation, Methodology, Writing – original draft. RY: Formal Analysis, Software, Writing – review & editing. JC: Formal Analysis, Software, Writing – review & editing. LX: Conceptualization, Funding acquisition, Resources, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the National Natural Science Foundation of China (Grant No. 82060613). The funding body had no role in the design of the study, collection, analysis, and interpretation of data, or in writing the manuscript.
Acknowledgments
We would like to thank all the participants who participated in this study. We thank members of the Working Group on Yunnan Provincial Tuberculosis Drug Resistance Surveillance. We also thank the tireless contributions of designated hospital and CDCs for both county and prefecture level.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
References
1. Global tuberculosis report. Geneva: World Health Organization. (2024). Licence: CC BY-NC-SA 3.0 IGO.
2. Guo, J, Liu, C, Liu, F, Zhou, E, Ma, R, Zhang, L, et al. Tuberculosis disease burden in China: a spatio-temporal clustering and prediction study. Front Public Health. (2025) 12:1436515. doi: 10.3389/fpubh.2024.1436515
3. Chen, J, Qiu, Y, Yang, R, Li, L, Hou, J, Lu, K, et al. The characteristics of spatial-temporal distribution and cluster of tuberculosis in Yunnan Province, China, 2005-2018. BMC Public Health. (2019) 19:1715. doi: 10.1186/s12889-019-7993-5
4. Churchyard, G, Kim, P, Shah, NS, Rustomjee, R, Gandhi, N, Mathema, B, et al. What we know about tuberculosis transmission: an overview. J Infect Dis. (2017) 216:S629–35. doi: 10.1093/infdis/jix362
5. Floyd, K, Glaziou, P, Zumla, A, and Raviglione, M. The global tuberculosis epidemic and progress in care, prevention, and research: an overview in year 3 of the end TB era. Lancet Respir Med. (2018) 6:299–314. doi: 10.1016/S2213-2600(18)30057-2
6. Dheda, K, Gumbo, T, Maartens, G, Dooley, KE, McNerney, R, Murray, M, et al. The epidemiology, pathogenesis, transmission, diagnosis, and management of multidrug-resistant, extensively drug-resistant, and incurable tuberculosis. Lancet Respir Med. (2017) 5:291–360. doi: 10.1016/S2213-2600(17)30079-6
7. World Health Organization (WHO). End TB strategy. Available online at: https://www.who.int/publications/i/item/WHO-HTM-TB-2015.19 (2015). (Accessed August 16, 2015).
8. Yang, C, and Gao, Q. Recent transmission of Mycobacterium tuberculosis in China: the implication of molecular epidemiology for tuberculosis control. Front Med. (2018) 12:76–83. doi: 10.1007/s11684-017-0609-5
9. Chen, L, Pang, Y, Ma, L, Yang, H, Ru, H, Yang, X, et al. First insight into the molecular epidemiology of Mycobacterium tuberculosis isolates from the minority enclaves of southwestern China. Biomed Res Int. (2017) 2017:2505172. doi: 10.1155/2017/2505172
10. Meehan, CJ, Moris, P, Kohl, TA, Pečerska, J, Akter, S, Merker, M, et al. The relationship between transmission time and clustering methods in Mycobacterium tuberculosis epidemiology. EBioMedicine. (2018) 37:410–6. doi: 10.1016/j.ebiom.2018.10.013
11. Guerra-Assunção, JA, Crampin, AC, Houben, RM, Mzembe, T, Mallard, K, Coll, F, et al. Large-scale whole genome sequencing of M. tuberculosis provides insights into transmission in a high prevalence area. eLife. (2015) 4:4. doi: 10.7554/eLife.05166
12. Mijiti, P, Liu, C, Hong, C, Li, M, Tan, X, Zheng, K, et al. Implications for TB control among migrants in large cities in China: a prospective population-based genomic epidemiology study in Shenzhen. Emerg Microb Infect. (2024) 13:2287119. doi: 10.1080/22221751.2023.2287119
13. Larsen, MH, Biermann, K, Tandberg, S, Hsu, T, and Jacobs, WR Jr. Genetic manipulation of Mycobacterium tuberculosis. Curr Protoc Microbiol. (2007):10A.2.1–10A.2.21. doi: 10.1002/9780471729259.mc10a02s6
14. Andrews, S. FastQC: A quality control tool for high throughput sequence data [Computer software]. Babraham Bioinformatics. (2014). Available at: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
15. Coll, F, McNerney, R, Preston, MD, Guerra-Assunção, JA, Warry, A, Hill-Cawthorne, G, et al. Rapid determination of anti-tuberculosis drug resistance from whole-genome sequences. Genome Med. (2015) 7:51. doi: 10.1186/s13073-015-0164-0
16. Phelan, JE, O’Sullivan, DM, Machado, D, Ramos, J, Oppong, YEA, Campino, S, et al. Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs. Genome Med. (2019) 11:41. doi: 10.1186/s13073-019-0650-x
17. Letunic1, I, and Bork, P. Interactive tree of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. (2019) 47:W256–259. doi: 10.1093/nar/gkz239
18. Zhou, Y, Anthony, R, Wang, S, Ou, X, Liu, D, Zhao, Y, et al. The epidemic of multidrug resistant tuberculosis in China in historical and phylogenetic perspectives. J Infect. (2020) 80:444–53. doi: 10.1016/j.jinf.2019.11.022
19. Modlin, SJ, Robinhold, C, Morrissey, C, Mitchell, SN, Ramirez-Busby, SM, Shmaya, T, et al. Exact mapping of Illumina blind spots in the Mycobacterium tuberculosis genome reveals platform-wide and workflow-specific biases. Microb Genom. (2021) 7:mgen000465. doi: 10.1099/mgen.0.000465
20. Yang, C, Luo, T, Shen, X, Wu, J, Gan, M, Xu, P, et al. Transmission of multidrug-resistant Mycobacterium tuberculosis in Shanghai, China: a retrospective observational study using whole-genome sequencing and epidemiological investigation. Lancet Infect Dis. (2017) 17:275–84. doi: 10.1016/S1473-3099(16)30418-2
21. Quan, Z, Xu, J, Li, M, Cheng, C, Mijiti, P, Jiang, Q, et al. Transmission of tuberculosis in rural Henan, China: a prospective population-based genomic spatial epidemiological study. Emerg Microbes Infect. (2024) 13:2399273:10. doi: 10.1080/22221751.2024.2399273
22. Van der Roest, BR, Bootsma, MCJ, Fischer, EAJ, Gröschel, MI, Anthony, RM, de Zwaan, R, et al. Phylodynamic assessment of SNP distances from whole genome sequencing for determining Mycobacterium tuberculosis transmission. Sci Rep. (2025) 15:10694. doi: 10.1038/s41598-025-94646-2
23. Liu, Q, Ma, A, Wei, L, Pang, Y, Wu, B, Luo, T, et al. China’s tuberculosis epidemic stems from historical expansion of four strains of Mycobacterium tuberculosis. Nat Ecol Evol. (2018) 2:1982–92. doi: 10.1038/s41559-018-0680-6
24. Liu, M, Xu, P, Liao, X, Li, Q, Chen, W, Gao, Q, et al. Molecular epidemiology and drug-resistance of tuberculosis in Luodian revealed by whole genome sequencing. Infect Genet Evol. (2021) 93:104979. doi: 10.1016/j.meegid.2021.104979
25. Jiang, Q, Liu, Q, Ji, L, Li, J, Zeng, Y, Meng, L, et al. Citywide transmission of multidrug-resistant tuberculosis under China’s rapid urbanization: a retrospective population based genomic spatial epidemiological study. Clin Infect Dis. (2020) 71:142–51. doi: 10.1093/cid/ciz790
26. Fan, YF, Liu, DX, Chen, YW, Ou, XC, Mao, QZ, Yang, TT, et al. Inferring Mycobacterium Tuberculosis drug resistance and transmission using whole-genome sequencing in a high TB-burden setting in China. Biomed Environ Sci. (2024) 37:157–69. doi: 10.3967/bes2023.116
27. Liu, D, Huang, F, Li, Y, Mao, L, He, W, Wu, S, et al. Transmission characteristics in tuberculosis by WGS: nationwide cross-sectional surveillance in China. Emerg Microbes Infect. (2024) 13:2348505. doi: 10.1080/22221751.2024.2348505
28. Li, M, Lu, L, Jiang, Q, Jiang, Y, Yang, C, Li, J, et al. Genotypic and spatial analysis of transmission dynamics of tuberculosis in shanghai, China: a 10-year prospective population-based surveillance study. Lancet Reg Health – West Pac. (2023) 38:100833. doi: 10.1016/j.lanwpc.2023.100833
29. Cox, HS, Kubica, T, Doshetov, D, Kebede, Y, Rüsch-Gerdess, S, and Niemann, S. The Beijing genotype and drug resistant tuberculosis in the Aral Sea region of Central Asia. Respir Res. (2005) 6:134. doi: 10.1186/1465-9921-6-134
30. Liu, Z, Li, X, Xiong, H, Zhou, Q, Yi, H, Wu, K, et al. Genomic and spatial analysis reveal the transmission dynamics of tuberculosis in areas with high incidence of Zhejiang, China: a prospective cohort study. Infect Genet Evol. (2024) 121:105603. doi: 10.1016/j.meegid.2024.105603
Keywords: Mycobacterium tuberculosis , whole genome sequencing, high burden setting, cluster analysis, transmission pattern
Citation: Yang X, Ru H, Chen T, Chen L, Ni Q, Yan S, Yang R, Chen J and Xu L (2025) Transmission characteristics of tuberculosis in the high-TB-burden region of Southwest China. Front. Public Health. 13:1670514. doi: 10.3389/fpubh.2025.1670514
Edited by:
Vitali Sintchenko, The University of Sydney, AustraliaReviewed by:
Yu Pang, Capital Medical University, ChinaJizhou Wu, Nanjing Medical University, China
Copyright © 2025 Yang, Ru, Chen, Chen, Ni, Yan, Yang, Chen and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lin Xu, MTIzMzE2ODU5QHFxLmNvbQ==
†ORCID: Lin Xu, orcid.org/0000-0003-4512-347X
Haohao Ru