Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol., 28 November 2025

Sec. Thyroid Endocrinology

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1691788

Development and validation of a collagen signature to predict central lymph node metastasis in papillary thyroid cancer

Weisheng ChenWeisheng Chen1Junna GeJunna Ge1Tingyue LuoTingyue Luo2Shumin DongShumin Dong3Wei JiangWei Jiang1Cangui WuCangui Wu1Buning YeBuning Ye1Dongling ZhangDongling Zhang1Wanying HeWanying He1Jun Yan*Jun Yan1*Shangtong Lei*Shangtong Lei1*
  • 1Department of General Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong, China
  • 2Chronic Airways Diseases Laboratory, Department of Respiratory and Critical Care Medicine, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong, China
  • 3Department of Thyroid Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China

Background: Current clinicopathological risk factors lack the precision necessary for accurate prediction of central lymph node metastasis (CLNM) in patients with papillary thyroid cancer (PTC). Structural remodeling of the tumor microenvironment (TME), particularly collagen organization, may play a pivotal role in metastatic dissemination.

Objective: The objective of this study was to develop a collagen signature within the TME to predict CLNM in PTC and validate that the new model incorporating it into the assessment alongside clinicopathological risk factors would enhance the predictive accuracy.

Methods: In this retrospective study, we included 350 patients with classic PTC, all of whom underwent thyroidectomy with prophylactic central lymph node dissection. The cases were randomly assigned to a training cohort and a testing cohort with a 6:4 ratio. A total of 142 collagen features in the TME were extracted from second harmonic generation images of tumor specimens. We constructed a collagen signature using a least absolute shrinkage and selection operator (LASSO) regression model. Multivariate logistic regression was used to integrate the signature with clinicopathological variables and construct a nomogram.

Results: The predictive ability of collagen signature was also validated by AUC of 0.821 in training cohort and AUC of 0.793 in testing cohort. The collagen signature remained an independent predictor after adjustment for tumor size, capsular invasion, and tumor location in the multivariate analysis. Furthermore, the integrated model showed superior predictive performance compared to the clinicopathological model alone (0.842 vs. 0.679, p < 0.001). Decision curve analysis confirmed higher net clinical benefit across a wide range of thresholds.

Conclusions: The collagen signature within the TME represents a promising new biomarker that can effectively predict CLNM in PTC patients, potentially improving clinical decision-making and patient management.

Introduction

Papillary thyroid carcinoma (PTC) is the most common endocrine malignancy and frequently exhibits central lymph node metastasis (CLNM), with reported rates ranging from 40% to 90% (15). Accurate assessment of CLNM is essential for surgical planning, as inadequate or excessive lymph node dissection can respectively increase recurrence risk or operative morbidity (6). However, preoperative imaging, particularly ultrasound, often fails to detect occult metastases due to the deep anatomical location of central lymph nodes. Traditional risk stratification factors including age, tumor size, extrathyroidal extension, and histological subtypes, provide limited predictive accuracy, which underscores the need for novel biomarkers to guide individualized management (69).

Recent advances in molecular profiling, including BRAF mutation testing and circulating RNA signatures, have shown potential in predicting lymph node metastasis (10, 11). Nevertheless, these methods require further validation before they can be widely implemented in clinical practice. Additionally, collagen, the main component of the tumor microenvironment (TME), plays a critical role in tumor development and progression (12), and has emerged as an important predictive and prognostic biomarker in various malignancies, including gastric, breast, and colon cancers (1316).

Second harmonic generation (SHG) imaging is a label-free optical technique that visualizes and quantifies collagen architecture (17, 18).This quantitative method provides valuable information about collagen in complex TME and has been employed successfully in diagnosing several diseases (1921).

In this study, we utilized SHG imaging to quantify collagen changes in the TME of PTC and investigated the predictive utility of the resulting collagen signature for CLNM. This approach offers a powerful means to assess collagen characteristics and may serve as a robust predictive tool for lymph node metastasis in PTC patients.

Materials and methods

Patient selection and study design

This study was conducted at the Department of General Surgery, Nanfang Hospital of Southern Medical University, and included patients treated between January 2018 and June 2019. The inclusion criteria were as follows: age ≥18 years, histologically confirmed classic papillary thyroid cancer (PTC) after radical thyroid surgery, and the availability of both clinicopathological data and pathological specimens. Patients were excluded if they had previously undergone thyroid therapy, had a history of other malignant tumors, or were diagnosed with concurrent Hashimoto’s thyroiditis. In accordance with the thyroid cancer treatment guidelines of the Chinese Society of Clinical Oncology, all patients underwent either lobectomy or total thyroidectomy, accompanied by prophylactic central lymph node dissection (22, 23). The following clinicopathological characteristics were collected for analysis: gender, age at the time of surgery, body mass index (BMI), maximum tumor size, the number of tumor lesions, tumor location within the thyroid gland, the status of thyroid capsular invasion (TCI), the status of BRAF V600E mutation and ultrasound features. Maximum tumor size was measured as the largest diameter of the tumor in millimeters. The cutoff for tumor size (>10 mm) followed the 2015 ATA guideline definition of clinically significant PTC, applied a priori before statistical modeling. Tumor location was classified as upper versus middle-inferior according to craniocaudal thirds of the thyroid lobe. TCI was determined by histopathological examination. BRAF V600E mutation status was determined by PCR amplification and Sanger sequencing of exon 15 from formalin-fixed paraffin-embedded specimens. Preoperative ultrasound characteristics including microcalcification, margin definition, echogenicity, and shape ratio (height/width), were independently reviewed by two blinded radiologists.

The included cases were randomly assigned to a training cohort and a testing cohort with a 6:4 ratio. This study was approved by the Institutional Review Board of Nanfang Hospital of Southern Medical University (Approval No. NEFC-2021-217), and the research was conducted in line with the principles of the Declaration of Helsinki.

Selection of the regions of interest, multiphoton image acquisition, and collagen feature extraction

We obtained formalin-fixed, paraffin-embedded (FFPE) specimens from all patients; specimens were serially sectioned and stained with hematoxylin-eosin (H-E). Two pathologists who were blinded to the status of lymph node metastasis of the patients independently identified the region of the invasive margin of the thyroid cancer sample with a microscope at 200× magnification. Discrepancies were resolved through consensus with a senior pathologist. To ensure an accurate representation of each sample, five regions of interest (ROIs) were chosen along the invasive margin. These ROIs were equidistantly spread and had a field of view (FOV) of 1000 μm × 1000 μm per specimen. An unstained serial section was used for multiphoton imaging with a 20× objective (Figure 1). The multiphoton imaging system has been previously described (14, 24). The excitation wavelength used in this study was 800 nm.

Figure 1
Diagram of collagen signature calculation process. It includes H-E and SHG imaging for feature extraction into morphological, intensity, and texture categories. These features are then processed using LASSO logistic regression, with a graph showing coefficients versus log(lambda). The collagen signature is calculated as the sum of coefficients multiplied by collagen features.

Figure 1. Study protocol. Two pathologists independently identified the region of the invasive margin of the thyroid cancer sample, and five regions of interest with fields of view of 1000 μm×1000 μm per specimen that were equidistantly spread throughout the invasive margin were selected. An unstained serial section was used for multiphoton imaging, and the SHG images were transformed to gray-scale for feature extraction. LASSO logistic regression was conducted to select the collagen features with the greatest prognostic value. By combining the selected collagen features and the corresponding coefficients, the collagen signature of each patient was calculated with a standard linear formula. All histological and SHG images include scale bars.

Collagen features were extracted from SHG images using MATLAB 2015b. A total of 142 features were quantified, including morphological (n=8), intensity (n=6), and textural (n=128) parameters as previously reported (19, 20, 25). The morphological features included the following information about the collagen fibers: the area, number, length, width, straightness, crosslink density, crosslink space, and arrangement. The intensity features included the mean, variance, skewness, kurtosis, energy, and entropy of the collagen fibers. The texture features included the contrast, correlation, energy, and uniformity of the gray level co-occurrence matrix (GLCM) at five different pixel distances and in four different directions, and the mean and variance of the convolution of Gabor filter images at four scales and in six directions. Patients’ feature values were calculated by averaging the measurements of the five ROIs. All features were standardized using Z-score normalization based on the distribution of the training cohort, and these parameters were applied directly to the testing cohort to prevent data leakage.

Development and validation of the prediction model

Least-absolute shrinkage and selection operator (LASSO) logistic regression was conducted to select the collagen features in the training cohort with the most predictive value, using R (version 4.3.3) with the “glmnet” package in this study (26, 27). Five-fold cross-validation was used to determine the optimal tuning parameter λ. The status of CLNM of each patient was used in the LASSO model. By combining selected collagen features and LASSO logistic regression coefficients, the collagen signature of each patient was calculated with a standard linear formula. The area under the curve (AUC) of the receiver-operating characteristic (ROC) curve of the collagen signature was measured.

Univariate and multivariate logistic regression analyses were performed to evaluate the association between the collagen signature, seven clinicopathological factors, and CLNM in both cohorts. The odd ratio (OR) and 95% confidence interval (95% CI) were reported, with statistical significance defined as p < 0.05.

A nomogram integrating the collagen signature with independent clinicopathological predictors was constructed in the training cohort. Discrimination was assessed using the concordance index (C-index) with bootstrap validation. ROC curves and DeLong’s test were used to compare model performance across cohorts. Decision curve analysis (DCA) was performed to assess clinical utility. The models could include numerical variables alone (for example collagen signature), or categorical variables alone (for example TCI status) and both of them (for example new model) (28, 29). The clinicopathological model included only clinicopathological risk factors in the multivariate logistic regression.

To evaluate the clinical utility of our predictive model, we performed decision curve analysis (DCA). This method compares the net benefit of the model to that of standard treatment strategies across a range of threshold probabilities.

Subset analysis incorporating BRAF mutation and ultrasound features

A subset analysis was performed in 120 patients for whom both BRAF mutation and preoperative ultrasound data were available. The predictive performance of the new model, BRAF mutation, ultrasound features, were assessed using AUC with 95% confidence intervals. Model comparisons were conducted using the DeLong’s test.

Statistical analysis

All feature preprocessing, including aggregation, normalization parameter derivation, and LASSO feature selection with λ tuning, was conducted exclusively in the training cohort. The final coefficients were fixed and directly applied to the standardized testing cohort without refitting to avoid data leakage. A detailed workflow is provided in Supplementary Figure 1. ROC curve analysis was performed using the “pROC” package in R. AUCs with 95% CIs were computed using 1000 bootstrap resamples. Categorical variables were presented as frequencies and percentages. Chi-square tests (or Fisher’s exact tests, if applicable) were used to compare the differences between categorical variables. Statistical significance was determined by a p-value of less than 0.05. All statistical analyses were performed using SPSS for Windows, version 22.0 (SPSS, Chicago, IL, USA).

Results

Patients and clinicopathological characteristics

A total of 210 and 140 patients were randomly assigned to the training and testing cohorts, respectively. The CLNM rates in the training and testing cohorts were 41.0% and 42.1%, respectively (Table 1). The clinicopathological characteristics of the two cohorts showed no statistically significant differences.

Table 1
www.frontiersin.org

Table 1. Clinical characteristics of the patients in the training and testing cohorts.

Collagen signature construction

The procedure for the construction of the LASSO logistic regression model was presented in Figure 1. The coefficient profiles of the 142 collagen features in thyroid cancer were extracted from the LASSO regression. The optimal value log(λ) was -2.987 and the final model selected the 5 most powerful predictors from among all collagen features, which were the collagen crosslink density, collagen arrangement, mean of collagen intensity, contrast of 45° GLCM of the collagen fibers at direction 2, variance convolution of Gabor filter images on scale 2 at direction 5. Furthermore, the collagen signature of each sample was calculated with the linear formula. (Supplementary Formula 1).

Prediction model for CLNM

Figure 2 illustrates the ROC curves of the collagen signature in the training and testing cohorts. The collagen signature had a considerable predictive power, with AUCs of 0.821 (95% CI, 0.764-0.878) in the training cohort, and 0.793 (95% CI, 0.715-0.870) in the testing cohort, respectively. Univariate analyses demonstrated the collagen signature and tumor size, tumor location and TCI were significantly associated with CLNM (p < 0.001, p = 0.011, p < 0.001, and p = 0.002, in the training cohort; p < 0.001, p = 0.020, p=0.013, and p = 0.021, in the testing cohort; respectively, see in Supplementary Table 1). Multivariate analyses (Table 2) confirmed that the collagen signature, tumor size, tumor location, and TCI remained independent predictors of CLNM in both cohorts.

Figure 2
Two ROC (Receiver Operating Characteristic) curves compare collagen signatures. Panel A shows the training cohort with an AUC (Area Under the Curve) of 0.821, while Panel B presents the testing cohort with an AUC of 0.793. Both graphs plot sensitivity against specificity, with the diagonal line indicating random chance.

Figure 2. Predictive performance of the collagen signature in the training and testing cohorts. (A) The collagen signature predicted the CLNM risk with AUC of 0.821 in the training cohort. (B) The collagen signature predicted the CLNM risk with AUC of 0.793 in the testing cohort.

Table 2
www.frontiersin.org

Table 2. Multivariate Logistic regression analysis of the association of the collagen signature and clinical characteristics with CLNM in the training and testing cohort.

The nomogram for the prediction of CLNM was generated on the basis of the collagen signature, tumor size, tumor location and TCI status (Figure 3). The C-indexes of the nomograms of CLNM were 0.841 (95% CI, 0.812 -0.876) in the training cohort and 0.821 (95% CI, 0.793 -0.851) in the testing cohort. The nomogram demonstrated satisfactory discrimination and calibration in the two cohorts (Figure 4). The decision curve analysis showed that the new model provided greater net benefit across a wide range of threshold probabilities compared with the treat-all, treat-none, and clinicopathological strategies. (Figure 5).

Figure 3
Nomogram assessing the risk of central lymph node metastasis (CLNM) using factors like collagen signature, tumor location, maximum tumor size, and thyroid capsular invasion. Point scales provide totals correlated with risk probability. C-index values show model performance in training (0.841) and testing cohorts (0.821).

Figure 3. Nomogram of the new model for the prediction of CLNM. The C-indexes of the nomograms of CLNM were 0.841 (95% CI, 0.812 -0.876), 0.821 (95% CI, 0.793 -0.851) in the training and testing cohort, respectively.

Figure 4
Two graphs titled “Calibration Curve in Training Cohort” and “Calibration Curve in Testing Cohort” compare calibration. Both show predicted probability on the x-axis versus actual probability on the y-axis, with a grey line for ideal calibration and a black line for the new model’s performance. Each graph features a series of vertical lines along the x-axis indicating data frequency across probability bins.

Figure 4. Calibration curve analysis of the new model for the prediction of CLNM. (A) The new model demonstrated satisfactory calibration in the training cohort. (B) The new model demonstrated satisfactory calibration in the testing cohort.

Figure 5
Two decision curve analysis graphs compare standardized net benefit versus high risk threshold for a training cohort and a testing cohort. Both graphs include lines for a new model, clinicopathological model, all, and none. The new model shows higher benefit across thresholds in both the training (left) and testing (right) cohorts.

Figure 5. Decision curve analysis of the new model and clinicopathological model for the prediction of CLNM. (A) The new model offered a higher net benefit compared to ‘treat-all’, ‘treat-none’, and clinicopathological model strategies across a broad spectrum of threshold probabilities in the training cohort. (B) The new model offered a higher net benefit compared to ‘treat-all’, ‘treat-none’, and clinicopathological model strategies across a broad spectrum of threshold probabilities in the testing cohort. The new model combined the collagen signature and clinicopathological risk factors and the clinicopathological model included only clinicopathological risk factors.

The new model exhibited high sensitivity and negative predictive value (NPV) in both cohorts (Table 3, Figure 6). In the training cohort, the sensitivity was 88.4% (95% CI: 81.6% - 95.1%), specificity 70.2% (95% CI: 62.1% - 78.2%), PPV 67.3% (95% CI: 58.6% - 75.9%), NPV 89.7% (95% CI: 83.6% - 95.7%), and accuracy 77.6% (95% CI: 72.0% - 83.3%). In the testing cohort, the sensitivity was 84.7% (95% CI: 75.6% - 93.9%), specificity 64.2% (95% CI: 53.8% - 74.6%), PPV 63.3% (95% CI: 52.7% - 73.9%), NPV 85.2% (95% CI: 76.3% - 94.1%), and accuracy 72.9% (95% CI: 65.5% - 80.2%). The consistently high NPV across both cohorts indicates the model’s strong capability to rule out CLNM.

Table 3
www.frontiersin.org

Table 3. Performance Metrics of the new model.

Figure 6
Radar chart illustrating the prediction performance for central lymph node metastasis. It compares training and validation cohorts across five metrics: Sensitivity (training: 0.88, validation: 0.85), Specificity (0.70, 0.64), Accuracy (0.78, 0.73), NPV (0.90, 0.85), and PPV (0.67, 0.63). Training cohort is marked in blue and validation cohort in red.

Figure 6. Performance metrics of the new model in the training and testing cohorts.

Comparison between the new prediction model and the clinicopathological prediction model

The new model was compared with the clinicopathological model. The clinicopathological prediction model included the parameters that were statistically important in the multivariate logistic regression analysis, i.e., tumor size, tumor location and TCI. In the training cohort, the new model outperformed the clinicopathological model, with AUCs of 0.852 (95% CI, 0.802–0.902) versus 0.706 (95% CI, 0.637–0.775), respectively. The difference was statistically significant (DeLong’s test, p < 0.001) (Figure 7A). Similarly, in the testing cohort, the new model demonstrated superior discrimination, with AUCs of 0.842 (95% CI, 0.777–0.908) versus 0.679 (95% CI, 0.590–0.777) for the clinicopathological model (DeLong’s test, p < 0.001) (Figure 7B).

Figure 7
ROC curve graphs comparing model performances in training and testing cohorts. The left panel displays the training cohort with models including a new model (AUC 0.852), clinicopathological model, location, capsular invasion, and maximum tumor size. The right panel shows the testing cohort data with similar models, where the new model achieves an AUC of 0.842. Axes are labeled for sensitivity and specificity, and each line represents a different model.

Figure 7. Comparison of the predictive value for CLNM of the new model and the clinicopathological model. (A) Comparison of the predictive value of the two models for CLNM in the training cohort. (B) Comparison of the predictive value of the two models for CLNM in the testing cohort. The new model combined the collagen signature and clinicopathological risk factors and the clinicopathological model included only clinicopathological risk factors.

Subset analysis comparing BRAF mutation and ultrasound features with new model

In a subset of 120 patients with available molecular and ultrasound data, the BRAF V600E mutation was detected in 67 patients (55.8%). Among ultrasound characteristics, microcalcification (OR = 2.368, 95% CI 1.092–5.340, p = 0.032) and ill-defined margins (OR = 2.579, 95% CI 1.216–5.580, p = 0.014) were significantly associated with CLNM (Supplementary Table 2). These two variables were incorporated into an ultrasound radiomics model. The new model maintained superior predictive performance (AUC = 0.860) compared with BRAF mutation (AUC = 0.618, DeLong’s test p <0.001) or ultrasound radiomics model (AUC = 0.664, DeLong’s test p=0.001). (Supplementary Figure 2).

Discussion

The existing clinicopathological risk factors lack sufficient accuracy for identifying CLNM in patients with thyroid cancer. To address this gap, we conducted a retrospective analysis of 350 PTC cases and developed a collagen signature derived from five SHG-based features. The collagen signature showed robust and independent predictive power and significantly outperformed the clinicopathological model alone. These findings highlight the utility of collagen remodeling as a structural biomarker that complements existing clinical and molecular indicators for individualized risk assessment.

Extracellular matrix (ECM) remodeling represents a central component of TME reprogramming and is a key determinant of invasive behavior in PTC (30, 31). The five SHG-derived collagen features selected in our model capture distinct TME-derived stromal phenotypes that are biologically integral to metastatic progression and their directions of association are consistent with known ECM biology. Collagen crosslink density, which showed a positive association with CLNM, reflects LOX-mediated stiffening driven by BRAF–TGF-β1 signaling (32, 33). Increased crosslinking enhances tensile strength and promotes force transmission, enabling directional migration toward lymphatic vessels (34). In contrast, collagen arrangement demonstrated a negative coefficient, indicating that highly uniform and orderly fiber patterns are associated with lower metastatic risk. This is consistent with observations that invasive tumors often lose microdomain-level alignment and exhibit fragmented, reoriented, or partially disrupted collagen architectures, which permit more adaptable and plastic modes of invasion. Similarly, the mean collagen intensity was negatively associated with CLNM, supporting the concept that metastatic potential is not determined by bulk collagen abundance, but rather by its biomechanical organization; densely fibrotic yet structurally stable matrices often correspond to less aggressive phenotypes. The negative association of GLCM-contrast aligns with the known decrease in local textural sharpness at invasive fronts, where protease-mediated ECM degradation produces smoother SHG gradients. Finally, Gabor-variance, positively associated with CLNM, quantifies multi-scale orientation dispersion and the dynamic ECM rearrangements characteristic of active metastatic niches (34). Together, these features reflect complementary dimensions of ECM mechanics, including stiffness, microdomain disruption, alignment loss, diminished local contrast, and increased remodeling complexity, and these supports their biological relevance as microenvironmental markers of lymphatic invasion.

Preoperative ultrasound has limited sensitivity in accurately assessing the deep anatomical spaces of the central neck compartment. In our study, we observed a CLNM rate of 42.3% in cases where preoperative examinations did not reveal any evidence of lymph node metastasis, which aligns with existing literature reporting CLNM rates ranging from 30% to 52.3% (3537). Molecular predictors such as BRAF V600E mutations or circRNA panels, though informative, are costly and not universally available (10, 37, 38). Several deep learning models have shown promising potential in the radiomics of ultrasound and computed tomography, though further validation is still needed (39, 40). Compared to ultrasound radiomics models reporting AUCs of 0.70-0.85 (39), our collagen signature achieved comparable discriminative capacity (AUC = 0.852) while avoiding reliance on operator-dependent imaging protocols. Subset analysis further demonstrated that the new model outperformed BRAF mutation status and the ultrasound radiomics model, underscoring its complementary value. While genetic tests provide mechanistic insights and the clinicopathological factors including tumor size, capsular invasion, and location represent clinical manifestations of local aggressiveness and lymphatic spread potential, the collagen signature reflects tumor microenvironment remodeling. Their complementary inclusion enhances biological plausibility of the predictive model.

This study represents the first successful application of the multiphoton microscopy imaging system for predicting CLNM in PTC. The imaging system used in our research has a well-established track record and has been extensively validated in prior studies for its effectiveness in various biomedical applications (13, 19, 20). The SHG imaging protocol enabled real-time collagen analysis within 10 minutes per specimen, compatible with intraoperative frozen section workflows. This proof-of-concept study was conducted on FFPE tissues rather than frozen sections; thus intraoperative application remains technically challenging. However, rapid postoperative implementation is feasible, providing timely information to guide decisions on completion or prophylactic central neck dissection. Future optimization of real-time SHG imaging and automated feature extraction could enable intraoperative decision-making in routine surgical workflows.

This retrospective single-center study inevitably carries potential selection bias, and the internal random split does not fully substitute for external validation. CLNM rates could vary by iodine intake, ethnicity, and pathological practices, and our patient population may not represent global PTC heterogeneity. Future studies will incorporate multicenter external cohorts and predefined sample size calculations based on events-per-variable principles to confirm generalizability and predictive stability. As molecular data and ultrasound features were not all available for this retrospective cohort, future prospective studies will incorporate molecular, transcriptomic and ultrasound data to complement and validate the new model. Mechanistic validation through immunohistochemical or transcriptomic correlation with CAF or LOX markers would further strengthen the biological interpretation of the collagen signature.

Conclusions

This study establishes an integrative predictive framework that synergizes tumor microenvironment collagen signatures with key clinicopathological variables (tumor location, size, and capsular invasion) to refine risk stratification of CLNM in papillary thyroid carcinoma. Future implementation of this model could standardize surgical decision-making and optimize postoperative surveillance protocols.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Institutional Review Board of Nanfang Hospital of Southern Medical University (Approval no. NEFC-2021-217). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

WC: Conceptualization, Writing – review & editing, Writing – original draft. JG: Writing – review & editing, Writing – original draft, Conceptualization. TL: Investigation, Data curation, Writing – review & editing, Formal Analysis, Writing – original draft. SD: Formal Analysis, Writing – review & editing, Writing – original draft, Investigation, Data curation. WJ: Writing – review & editing, Software, Methodology, Writing – original draft, Investigation. CW: Visualization, Investigation, Writing – review & editing, Writing – original draft. BY: Writing – review & editing, Writing – original draft. DZ: Writing – review & editing, Writing – original draft. WH: Writing – original draft, Writing – review & editing. JY: Supervision, Resources, Writing – review & editing, Writing – original draft, Funding acquisition. SL: Writing – review & editing, Supervision, Writing – original draft, Resources, Project administration, Funding acquisition.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by grants from the National Natural Science Foundation of China (82373366, 82273360), Science and Technology Program of Guangzhou(2023A04J2393), Clinical Research Funding of Nanfang Hospital (2020CR011), High-tech, Major and Unique Clinical Technology Projects of Guangzhou (2023P-TS02), and Noncommunicable Chronic Diseases-National Science and Technology Major Project (2024ZD0525600).

Acknowledgments

We would like to express our gratitude to the Department of Pathology of Nanfang Hospital of Southern Medical University and the Central Laboratory of Southern Medical University for their professional support in sample processing, staining techniques, and multiphoton imaging. In addition, we would like to thank all the patients and their families who participated in this study for their trust and cooperation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1691788/full#supplementary-material

Abbreviations

PTC, Papillary thyroid carcinoma; CLNM, Central lymph node metastasis; TME, Tumor microenvironment; SHG, Second harmonic generation; ROI, Region of interest; LASSO, Least absolute shrinkage and selection operator; TCI, Thyroid capsular invasion; AUC, Area under the curve; DCA, Decision curve analysis; NPV, Negative predictive value; FFPE, Formalin-fixed paraffin-embedded; ECM, Extracellular matrix; CAF, Cancer-associated fibroblast.

References

1. Wada N, Duh QY, Sugino K, et al. Lymph node metastasis from 259 papillary thyroid microcarcinomas: frequency, pattern of occurrence and recurrence, and optimal strategy for neck dissection. Ann Surg. (2003) 237:399–407. doi: 10.1097/01.SLA.0000055273.58908.19

PubMed Abstract | Crossref Full Text | Google Scholar

2. Lundgren CI, Hall P, Dickman PW, and Zedenius J. Clinically significant prognostic factors for differentiated thyroid carcinoma: a population-based, nested case-control study. Cancer. (2006) 106:524–31. doi: 10.1002/cncr.21653

PubMed Abstract | Crossref Full Text | Google Scholar

3. Moo TA, McGill J, Allendorf J, Lee J, Fahey T 3rd, and Zarnegar R. Impact of prophylactic central neck lymph node dissection on early recurrence in papillary thyroid carcinoma. World J Surg. (2010) 34:1187–91. doi: 10.1007/s00268-010-0418-3

PubMed Abstract | Crossref Full Text | Google Scholar

4. Forest VI, Clark JR, Ebrahimi A, Cho EA, Sneddon L, Gao K, et al. Central compartment dissection in thyroid papillary carcinoma. Ann Surg. (2011) 253:123–30. doi: 10.1097/SLA.0b013e3181fc9644

PubMed Abstract | Crossref Full Text | Google Scholar

5. Liu FH, Kuo SF, Hsueh C, Chao TC, and Lin JD. Postoperative recurrence of papillary thyroid carcinoma with lymph node metastasis. J Surg Oncol. (2015) 112:149–54. doi: 10.1002/jso.23967

PubMed Abstract | Crossref Full Text | Google Scholar

6. Kim DH, Kim GJ, Kim SW, and Hwang SH. Predictive value of ipsilateral central lymph node metastasis for contralateral central lymph node metastasis in patients with thyroid cancer: Systematic review and meta-analysis. Head Neck. (2021) 43:3177–84. doi: 10.1002/hed.26787

PubMed Abstract | Crossref Full Text | Google Scholar

7. Zheng H, Lai V, Lu J, Kang JK, Chou J, Burman KD, et al. Clinical factors predictive of lymph node metastasis in thyroid cancer patients: A multivariate analysis. J Am Coll Surg. (2022) 234:691–700. doi: 10.1097/XCS.0000000000000107

PubMed Abstract | Crossref Full Text | Google Scholar

8. Liu C, Xiao C, Chen J, Li X, Feng Z, Gao Q, et al. Risk factor analysis for predicting cervical lymph node metastasis in papillary thyroid carcinoma: a study of 966 patients. BMC Cancer. (2019) 19:622. doi: 10.1186/s12885-019-5835-6

PubMed Abstract | Crossref Full Text | Google Scholar

9. Liu F, Han F, Lu L, Chen Y, Guo Z, and Yao J. Meta-analysis of prediction models for predicting lymph node metastasis in thyroid cancer. World J Surg Oncol. (2024) 22:278. doi: 10.1186/s12957-024-03566-4

PubMed Abstract | Crossref Full Text | Google Scholar

10. Ruiz EML, Niu T, Zerfaoui M, Kunnimalaiyaan M, Friedlander PL, Abdel-Mageed AB, et al. A novel gene panel for prediction of lymph-node metastasis and recurrence in patients with thyroid cancer. Surgery. (2020) 167:73–9. doi: 10.1016/j.surg.2019.06.058

PubMed Abstract | Crossref Full Text | Google Scholar

11. Yang W, Bai C, Zhang L, Li Z, Tian Y, Yang Z, et al. Correlation between serum circRNA and thyroid micropapillary carcinoma with cervical lymph node metastasis. Med (Baltimore). (2020) 99:e23255. doi: 10.1097/MD.0000000000023255

PubMed Abstract | Crossref Full Text | Google Scholar

12. Buchheit CL, Weigel KJ, and Schafer ZT. Cancer cell survival during detachment from the ECM: multiple barriers to tumour progression. Nat Rev Cancer. (2014) 14:632–41. doi: 10.1038/nrc3789

PubMed Abstract | Crossref Full Text | Google Scholar

13. Jiang W, Wang H, Chen W, Zhao Y, Yan B, Chen D, et al. Association of collagen deep learning classifier with prognosis and chemotherapy benefits in stage II-III colon cancer. Bioeng Transl Med. (2023) 8:e10526. doi: 10.1002/btm2.10526

PubMed Abstract | Crossref Full Text | Google Scholar

14. Chen W, Dong S, Liu X, Wang G, Xu S, Lei S, et al. Association of the collagen signature in the tumor microenvironment with recurrence and survival of patients with T4N0M0 colon cancer. Dis Colon Rectum. (2021) 64:563–75. doi: 10.1097/DCR.0000000000001907

PubMed Abstract | Crossref Full Text | Google Scholar

15. Xi G, Qiu L, Xu S, Guo W, Fu F, Kang D, et al. Computer-assisted quantification of tumor-associated collagen signatures to improve the prognosis prediction of breast cancer. BMC Med. (2021) 19:273. doi: 10.1186/s12916-021-02146-7

PubMed Abstract | Crossref Full Text | Google Scholar

16. Chen D, Chen G, Jiang W, Fu M, Liu W, Sui J, et al. Association of the collagen signature in the tumor microenvironment with lymph node metastasis in early gastric cancer. JAMA Surg. (2019) 154:e185249. doi: 10.1001/jamasurg.2018.5249

PubMed Abstract | Crossref Full Text | Google Scholar

17. Zipfel WR, Williams RM, and Webb WW. Nonlinear magic: multiphoton microscopy in the biosciences. Nat Biotechnol. (2003) 21:1369–77. doi: 10.1038/nbt899

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zipfel WR, Williams RM, Christie R, Nikitin AY, Hyman BT, and Webb WW. Live tissue intrinsic emission microscopy using multiphoton-excited native fluorescence and second harmonic generation. Proc Natl Acad Sci U S A. (2003) 100:7075–80. doi: 10.1073/pnas.0832308100

PubMed Abstract | Crossref Full Text | Google Scholar

19. Xu S, Kang CH, Gou X, Peng Q, Yan J, Zhuo S, et al. Quantification of liver fibrosis via second harmonic imaging of the Glisson's capsule from liver surface. J Biophotonics. (2016) 9:351–63. doi: 10.1002/jbio.201500001

PubMed Abstract | Crossref Full Text | Google Scholar

20. Xu S, Wang Y, Tai DCS, Wang S, Cheng CL, Peng Q, et al. qFibrosis: a fully-quantitative innovative method incorporating histological features to facilitate accurate fibrosis scoring in animal model and chronic hepatitis B patients. J Hepatol. (2014) 61:260–9. doi: 10.1016/j.jhep.2014.02.015

PubMed Abstract | Crossref Full Text | Google Scholar

21. Chen X, Chen L, Miao J, Huang X, Han X, Zheng L, et al. Prognostic significance of collagen signatures in pancreatic ductal adenocarcinoma obtained from second-harmonic generation imaging. BMC Cancer. (2024) 24:652. doi: 10.1186/s12885-024-12412-5

PubMed Abstract | Crossref Full Text | Google Scholar

22. Gao M, Ge M, Ji Q, Cheng R, Lu H, Guan H, et al. 2016 Chinese expert consensus and guidelines for the diagnosis and treatment of papillary thyroid microcarcinoma. Cancer Biol Med. (2017) 14:203–11. doi: 10.20892/j.issn.2095-3941.2017.0051

PubMed Abstract | Crossref Full Text | Google Scholar

23. Chinese Society of Clinical Oncology (CSCO) diagnosis and treatment guidelines for persistent/recurrent and metastatic differentiated thyroid cancer 2018 (English version). Chin J Cancer Res. (2019) 31:99–116. doi: 10.21147/j.issn.1000-9604.2019.01.06

PubMed Abstract | Crossref Full Text | Google Scholar

24. Yan J, Kang Y, Xu S, Ong LL, Zhuo S, Bunte RM, et al. In vivo label-free quantification of liver microcirculation using dual-modality microscopy. J BioMed Opt. (2014) 19:116006. doi: 10.1117/1.JBO.19.11.116006

PubMed Abstract | Crossref Full Text | Google Scholar

25. Yu X, Jiang W, Dong X, Yan B, Xu S, Lin Z, et al. Nomograms integrating the collagen signature and systemic immune-inflammation index for predicting prognosis in rectal cancer patients. BJS Open. (2024) 8:zrae014. doi: 10.1093/bjsopen/zrae014

PubMed Abstract | Crossref Full Text | Google Scholar

26. Mavaddat N, Michailidou K, Dennis J, Lush M, Fachal L, Lee A, et al. Polygenic risk scores for prediction of breast cancer and breast cancer subtypes. Am J Hum Genet. (2019) 104:21–34. doi: 10.1016/j.ajhg.2018.11.002

PubMed Abstract | Crossref Full Text | Google Scholar

27. Xiong P, Chen J, Zhang Y, Shu L, Shen Y, Gu Y, et al. Predictive modeling for eosinophilic chronic rhinosinusitis: Nomogram and four machine learning approaches. iScience. (2024) 27:108928. doi: 10.1016/j.isci.2024.108928

PubMed Abstract | Crossref Full Text | Google Scholar

28. Jiang W, Wang H, Dong X, Yu X, Zhao Y, Chen D, et al. Pathomics signature for prognosis and chemotherapy benefits in stage III colon cancer. JAMA Surg. (2024) 159:519–28. doi: 10.1001/jamasurg.2023.8015

PubMed Abstract | Crossref Full Text | Google Scholar

29. Chen D, Lai J, Cheng J, Fu M, Lin L, Chen F, et al. Predicting peritoneal recurrence in gastric cancer with serosal invasion using a pathomics nomogram. iScience. (2023) 26:106246. doi: 10.1016/j.isci.2023.106246

PubMed Abstract | Crossref Full Text | Google Scholar

30. Mason JA, Hagel KR, Hawk MA, and Schafer ZT. Metabolism during ECM detachment: achilles heel of cancer cells? Trends Cancer. (2017) 3:475–81. doi: 10.2478/jtim-2023-0105

PubMed Abstract | Crossref Full Text | Google Scholar

31. Liu N, Li D, Liu D, Liu Y, and Lei J. FOSL2 participates in renal fibrosis via SGK1-mediated epithelial-mesenchymal transition of proximal tubular epithelial cells. J Transl Int Med. (2023) 11:294–308. doi: 10.2478/jtim-2023-0105

PubMed Abstract | Crossref Full Text | Google Scholar

32. Minna E, Brich S, Todoerti K, Pilotti S, Collini P, Bonaldi E, et al. Cancer associated fibroblasts and senescent thyroid cells in the invasive front of thyroid carcinoma. Cancers (Basel). (2020) 12:112. doi: 10.3390/cancers12010112

PubMed Abstract | Crossref Full Text | Google Scholar

33. Li Y, Wang Y, Wu Q, and Hu B. Transforming growth factor β1 could influence thyroid nodule elasticity and also improve cervical lymph node metastasis in papillary thyroid carcinoma. Ultrasound Med Biol. (2015) 41:2866–72. doi: 10.1016/j.ultrasmedbio.2015.07.010

PubMed Abstract | Crossref Full Text | Google Scholar

34. Han W, Chen S, Yuan W, Fan Q, Tian J, Wang X, et al. Oriented collagen fibers direct tumor cell intravasation. Proc Natl Acad Sci U S A. (2016) 113:11208–13. doi: 10.1073/pnas.1610347113

PubMed Abstract | Crossref Full Text | Google Scholar

35. Yun M, Noh TW, Cho A, Choi YJ, Hong SW, Park CS, et al. Visually discernible [18F]fluorodeoxyglucose uptake in papillary thyroid microcarcinoma: a potential new risk factor. J Clin Endocrinol Metab. (2010) 95:3182–8. doi: 10.1210/jc.2009-2091

PubMed Abstract | Crossref Full Text | Google Scholar

36. Wang P, Dong Z, Zhao S, Su Y, Zhang J, Ma Y, et al. Trends of the prevalence rate of central lymph node metastasis and multifocality in patients with low-risk papillary thyroid carcinoma after delayed thyroid surgery. Front Endocrinol (Lausanne). (2024) 15:1349272. doi: 10.3389/fendo.2024.1349272

PubMed Abstract | Crossref Full Text | Google Scholar

37. Toraih EA, Ruiz E, Ning B, Tortelote GG, Hilliard S, Moroz K, et al. Chromatin-accessible miRNA regulons driving thyroid tumorigenesis and progression. J Am Coll Surg. (2023) 236:732–50. doi: 10.1097/XCS.0000000000000541

PubMed Abstract | Crossref Full Text | Google Scholar

38. Wang JR, Zafereo ME, Cabanillas ME, Wu CC, Xu L, Dai Y, et al. The association between thyroid differentiation score and survival outcomes in papillary thyroid carcinoma. J Clin Endocrinol Metab. (2025) 110:356–63. doi: 10.1210/clinem/dgae532

PubMed Abstract | Crossref Full Text | Google Scholar

39. Jia W, Cai Y, Wang S, and Wang J. Predictive value of an ultrasound-based radiomics model for central lymph node metastasis of papillary thyroid carcinoma. Int J Med Sci. (2024) 21:1701–9. doi: 10.7150/ijms.95022

PubMed Abstract | Crossref Full Text | Google Scholar

40. Li F, Pan D, He Y, Wu Y, Peng J, Li J, et al. Using ultrasound features and radiomics analysis to predict lymph node metastasis in patients with thyroid cancer. BMC Surg. (2020) 20:315. doi: 10.1186/s12893-020-00974-7

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: collagen signature, tumor microenvironment, papillary thyroid cancer, predictive model, lymph node metastasis

Citation: Chen W, Ge J, Luo T, Dong S, Jiang W, Wu C, Ye B, Zhang D, He W, Yan J and Lei S (2025) Development and validation of a collagen signature to predict central lymph node metastasis in papillary thyroid cancer. Front. Endocrinol. 16:1691788. doi: 10.3389/fendo.2025.1691788

Received: 27 August 2025; Accepted: 13 November 2025; Revised: 12 November 2025;
Published: 28 November 2025.

Edited by:

Weihang Li, Fourth Military Medical University, China

Reviewed by:

Mariano Francesco Caratozzolo, National Research Council (CNR), Italy
Xiangtai Zeng, First Affiliated Hospital of Gannan Medical University, China
Yukui Gao, First Affiliated Hospital of Wannan Medical College, China

Copyright © 2025 Chen, Ge, Luo, Dong, Jiang, Wu, Ye, Zhang, He, Yan and Lei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shangtong Lei, bGVpc2h0NzgxOTIwQDE2My5jb20=; Jun Yan, eWFuanVuZnVkYW5AMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.