ORIGINAL RESEARCH article

Front. Oncol., 11 December 2024

Sec. Cancer Imaging and Image-directed Interventions

Volume 14 - 2024 | https://doi.org/10.3389/fonc.2024.1433196

Prediction of soft tissue sarcoma grading using intratumoral habitats and a peritumoral radiomics nomogram: a multi-center preliminary study

  • 1. Department of Radiology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China

  • 2. Department of Operation Center, Women and Children’s Hospital, Qingdao University, Qingdao, Shandong, China

  • 3. Department of Radiology, The Puyang Oilfield General Hospital, Puyang, Henan, China

  • 4. Department of Radiology, The Third Hospital of Hebei Medical University, Shijiazhuang, Hebei, China

  • 5. Department of Research Collaboration, Research and Development (R&D) Center, Beijing Deepwise and League of Philosophy Doctor (PHD) Technology Co., Ltd, Beijing, China

  • 6. Department of Radiology, Peking University Third Hospital, Beijing, China

Article metrics

View details

6

Citations

2k

Views

1,1k

Downloads

Abstract

Background:

Accurate identification of pathologic grade before operation is helpful for guiding clinical treatment decisions and improving the prognosis for soft tissue sarcoma (STS).

Purpose:

To construct and assess a magnetic resonance imaging (MRI)-based radiomics nomogram incorporating intratumoral habitats (subregions of clusters of voxels containing similar features) and peritumoral features for the preoperative prediction of the pathological grade of STS.

Methods:

The MRI data of 145 patients with STS (74 low-grade and 71 high-grade) from 4 hospitals were retrospectively collected, including enhanced T1-weighted and fat-suppressed-T2-weighted sequences. The patients were divided into training cohort (n = 102) and validation cohort (n = 43). K-means clustering was used to divide intratumoral voxels into three habitats according to signal intensity. A number of radiomics features were extracted from tumor-related regions to construct radiomics prediction signatures for seven subgroups. Logistic regression analysis identified peritumoral edema as an independent risk factor. A nomogram was created by merging the best radiomics signature with the peritumoral edema. We evaluated the performance and clinical value of the model using area under the curve (AUC), calibration curves, and decision curve analysis.

Results:

A multi-layer perceptron classifier model based on intratumoral habitats and peritumoral features combined gave the best radiomics signature, with an AUC of 0.856 for the validation cohort. The AUC of the nomogram in the validation cohort was 0.868, which was superior to the radiomics signature and the clinical model established by peritumoral edema. The calibration curves and decision curve analyses revealed good calibration and a high clinical application value for this nomogram.

Conclusion:

The MRI-based nomogram is accurate and effective for predicting preoperative grading in patients with STS.

1 Introduction

Soft tissue sarcomas (STSs) are rare heterogeneous tumors that account for 1% of all tumors (1), and surgical resection is regarded as the primary therapeutic approach for localized STS. Recent studies reported that in addition to surgery, adjuvant radiotherapy and chemotherapy can improve the prognosis for high-grade (grade III) STS. In this respect, the possible adverse consequences of radiation and chemotherapy can be avoided by a preoperative diagnosis of low-grade illness (grades I and II) (2, 3). Accurate preoperative grading is also beneficial for the selection of neoadjuvant chemoradiotherapy (2, 4, 5). Histologic grade is considered to be an important factor affecting the prognosis for STS (4). Although the preoperative histological tumor grading of STS is largely dependent on needle biopsy, because of tumor heterogeneity (6), the initial biopsy-based pathology test may underestimate the actual grade (5, 7). Therefore, there is an urgent need to develop a non-invasive and reliable approach for determining STS grade before surgery, so that patients can receive more effective and targeted treatment.

Magnetic resonance imaging (MRI) is the most commonly used technique for the preoperative diagnosis and evaluation of STS due to its non-invasive nature and excellent soft tissue contrast resolution. Although STS can be diagnosed on MRI by an experienced radiologist, the STS grade is difficult to determine because of tumor heterogeneity (8). Radiomics is a noninvasive method that extracts markers to assist physicians in making judgments through the quantitative mining of features from medical images (9, 10). Considered a digital biopsy, radiomics enables a detailed description of tumor characteristics and spatial heterogeneity in multiple clinical settings (11, 12). In earlier research, biomarkers based on quantitative MRI radiomics features have been regarded as an effective tool to distinguish tumor grades (13, 14). STSs are highly heterogeneous (6), and their growth patterns, tumor heterogeneity, and grade have all been linked in numerous prior investigations (47). However, conventional radiomics analysis is usually performed on the whole tumor, thus ignoring regional phenotypic changes within the tumor (15).

Recently, an emerging approach of partitioning tumors into subregions (known as habitats) containing clusters of voxels with similar features has allowed for more effective quantification of intratumor heterogeneity and definition of tumor subregions that are relevant to tumor growth or invasiveness (1618). A study demonstrated that tumor habitat analysis has high value for predicting tumor grade (19). In addition, recent research demonstrated that the peritumoral microenvironment is valuable for the clinical evaluation of tumor-aggressive biological behavior (20). However, few studies have evaluated the value of both the tumor habitat and the peritumoral environment for accurately predicting the grade and invasive potential of tumors.

Thus, the aim of this study was to develop and validate a non-invasive MRI-based radiomics model combining intratumoral habitats and peritumoral microenvironments for the pretreatment differentiation of high-grade STS from low-grade STS.

2 Materials and methods

2.1 Study population

This retrospective investigation obtained ethical approval and written informed consent was not required. The inclusion and exclusion criteria are shown in Supplementary A1 in Supplementary Data Sheet 1. Consecutive patients with soft tissue sarcomas treated between January 2007 and July 2022 at the Puyang Oilfield General Hospital, the Third Hospital of Hebei Medical University, the Shandong Provincial Hospital affiliated with Shandong First Medical University, and the Affiliated Hospital of Qingdao University were collected. A total of 145 patients (56 ± 16, 55.2% male) meeting the criteria were enrolled in the study and were divided into two groups. There were 102 patients from the affiliated hospital of Qingdao University in the training cohort and 43 patients from the other three hospitals in the validation cohort. The pathological findings are shown in Supplementary Table S1.

2.2 MRI protocol

The MRI sequences and scanners are shown in Supplementary A2 in Supplementary Data Sheet 1. The parameters of the MRI sequences are listed in Supplementary Table S2.

2.3 Clinical information and collection of MRI morphological characteristics

Clinical information including age, sex, and the tumor-node-metastasis (TNM) staging were collected. The FNCLCC (Fédération Nationale des Centres de luttte contre le cancer) system scores for the tumor mitotic index, degree of differentiation, and degree of necrosis were added to give the tumor grade. Histopathological low grade was assigned to FNCLCC grades I and II, whereas histopathological high grade was assigned to grade III.

MRI morphological characteristics were assessed by two radiologists with more than 7 years of experience who were unaware of the pathological findings (Supplementary A3 in Supplementary Data Sheet 1).

2.4 Image preprocessing and region-of-interest segmentation

The MRI preprocessing and tumor region-of-interest (ROI) segmentation consisted of four main phases: MRI registration, bias-field correction, segmentation of tumor-associated regions, and spatial resampling. First, using 3D slicer software (v. 5.0.3, www.slicer.org), the CE-T1WI and FS-T2WI of each patient were subjected to 3D rigid transformation registration. Following this registration, bias-field unevenness was corrected using the Python N4-bias-field-correction function. Then, the ROI was accurately drawn, including the tumorous and peritumoral regions, as shown in Figure 1. Two primary radiologists manually segmented the tumor ROIs using ITK-SNAP software (v.3.8.0, http://www.itksnap.org). When there were disagreements over an ROI, it was reviewed and corrected by another senior expert radiologist, and the tumor area mask was formed. Then, Radiomics Intelligent Analysis Software (RIAS) was applied to expand the boundary of the tumor lesion mask 10 mm outward for each lesion to generate a peritumoral mask. The “tail sign” can be considered a sign of infiltration or fascial invasion and is an independent factor affecting patient prognosis. It had the same signal intensity as the main mass and had the same enhancement after the injection of gadolinium-based contrast material. Thus, the “tail sign” was segmented into the tumor area mask (21, 22). Large blood vessels, bone tissue, and air areas that had not been invaded in the peritumoral mask were removed manually. Finally, RIAS was used to resample all of the images and masks to an isotropic spatial voxel size of 1 × 1 × 1 mm3.

Figure 1

The signal intensity of the CE-T1WI and FS-T2WI was standardized using the normalizing technique for histogram intensity in Python before performing the intratumoral habitats analysis (23). The K-means clustering module in Python was used to cluster the voxels in the normalized CE-T1WI and FS-T2WI images into three clusters representing the functionally-coherent subregions of the STS intratumoral regions, i.e., the intratumoral habitats: a low-enhancement solid subregion with low signal intensity (Habitat 1), a high-signal intensity enhanced viable subregion (Habitat 2), and a low-activity subregion with intermediate signal intensity (Habitat 3).

2.5 Extraction of radiomics features

All radiomics feature extraction was performed using the PyRadiomics toolkit in Python. A total of 1906 radiomics features were extracted from the tumoral region masks and peritumoral masks for each sequence, with these including textural features, shape features, first-order features, and wavelet features. A further 93 radiomics features, including textural features and first-order features, were extracted from each of the three intratumoral habitats of each sequence. This resulted in final totals of 558 habitat features, 3812 tumor region features, and 3812 peritumoral features.

2.6 Image normalization and standardization method

The combat compensation method (24), which eliminates scanner and protocol influences while maintaining the salient features of texture patterns, was used to standardize the MR images and radiomics features. Then, all features were standardized to Z-scores according to the mean and standard deviation.

2.7 Feature selection

Minimum Redundancy Maximum Relevance (mRMR) was used to initially reduce the dimensionality of high-dimensional features, with this resulting in the retaining of 25 highly correlated and low-redundancy features. Then, to further reduce dimensionality and save the features with the greatest predictive ability, we employed the logistic regression method known as the least absolute shrinkage and selection operator (LASSO).

A total of seven subgroups of radiomics signatures were established, with these being the tumoral region signature subgroup, intratumoral habitats signature subgroup, peritumor signature subgroup, signature subgroup combining tumoral region and intratumoral habitats features (TH-combined signature), signature subgroup combining tumoral region and peritumor features (TP-combined signature), signature subgroup combining intratumoral habitats and peritumor features (HP-combined signature), and signature subgroup combining all radiomics features (THP-combined signature). The radiomics features selected for each predictive signature are shown in Supplementary Table S3.

2.8 Classifiers

Different radiomics signatures to predict STS grade were constructed and evaluated using the following 11 machine learning classifiers: logistic regression (LR), extremely randomized trees (ExtraTrees), support vector machine (SVM), NaiveBayes, K nearest neighbor (KNN), AdaBoost, random forest, eXtreme Gradient Boosting (XGBoost), Multi-Layer perceptron (MLP), GradientBoosting, and Light Gradient Boosting Machine (LightGBM). The classifiers were trained using 10-fold cross-validation applied to the training cohort, and the grade prediction performance of the classifiers was evaluated using area under the receiver operating characteristics (ROC) curve (AUC), accuracy, sensitivity, and specificity. The best machine-learning algorithm was determined to be the one with the highest AUC.

2.9 Adding the clinical model and constructing the nomogram

In the construction of the clinical model, the clinical information and MRI morphological characteristics connected to STS grading were first selected using univariate logistic regression. Subsequently, those clinical features with P < 0.05 were entered into a multivariate logistic regression, and peritumoral edema was selected to create clinical models. We then employed the Akaike information criterion-based likelihood ratio test to determine the factors related to STS grading. Finally, we selected the best radiomics signature and combined it with the peritumoral edema to build a nomogram. AUC and accuracy were used to evaluate the performance of the nomogram, radiomics signatures, and clinical model. Calibration curves and decision curve analysis (DCA) were conducted to assess the fit and clinical dependability of the models, respectively.

2.10 Statistical analysis

Statistical tests were conducted using SPSS 26.0 (IBM, New York, USA), Python (version 3.9.7, www.python.org), and R software 4.1.2 (https://www.r-project.org/). Continuous variables in the clinical data were analyzed using either independent sample t-tests or Mann-Whitney U tests, while categorical variables were analyzed using the chi-square test or Fisher’s exact test. The statistical significance level was set at P < 0.05 for all statistical tests.

3 Results

3.1 Clinical factors and modeling

The total 145 patients were divided into the training set and the external validation set. The clinical information and MRI characteristics of the patients with STS in the two sets are listed in Table 1. There were significant differences in age, depth, heterogeneous SI on FS-T2WI, and location (P < 0.05) between the two sets. Furthermore, no significant differences were observed in the remaining characteristics.

Table 1

Training cohortValidation cohortP
No. of patients10243
gradeLow47270.066
High5516
Clinical information
Age *56 ± 1647 ± 170.002
SexMale52280.118
Female5015
MRI features
DepthDeep31250.002
Superficial7118
NumberSingle80330.823
Many2210
Margin definitions at CE-T1WI<50%1090.192
50%–90%4717
≥90%4517
Tumor volume containing necrosis signalAreas without necrosis3080.031
1%–50%5620
>50% of tumor volume1615
Heterogeneous SI at T2WI<50%6312<0.001
≥50%3931
Peritumoral enhancement(+)50180.430
(-)5225
Peritumoral edemaNo2190.429
Limited7227
Large97
LocationLimb7718<0.001
Head and neck104
Internal trunk915
Trunk wall66
T-stage12170.749
23416
31810
42910
N-stage085350.778
1178
M-stage083350.997
1198

Patient clinical information and MRI characteristics in the training and validation cohorts.

No. of Patients: Total number of patients in each cohort; Grade: Tumor grade (Low vs. High); MRI Features: Various MRI characteristics such as depth, number, margin definitions, tumor volume containing necrosis signal, heterogeneous SI at T2WI, peritumoral enhancement, peritumoral edema, location, T-stage, N-stage, and M-stage; * Data are presented as mean ± standard deviation.

The results of the univariate and multivariate LR analysis showed that peritumoral edema was an independent predictor of STS grade. Table 2 shows the univariate and multivariate logistic regression results with P < 0.05. A clinical model was established with the inclusion of peritumoral edema. The AUC values for the clinical model applied to training and validation sets were 0.665 (95% CI: 0.582–0.749) and 0.613 (95% CI: 0.472–0.755), respectively.

Table 2

VariableUnivariateMultivariate
OR95% CIp valueOR95% CIp value
Age1.0100.985-1.0350.439
Sex1.3810.632-3.0170.418
Depth1.6600.709-3.8820.243
Number0.8180.318-2.1040.677
Margin definitions at CE-T1WI1.4450.784-2.6650.239
Tumor volume containing necrosis signal2.0541.093-3.8610.0251.5310.754-3.1060.238
Heterogeneous SI at T2WI2.8121.211-6.5290.0161.8150.721-4.5670.205
Peritumoral enhancement1.5990.730-3.5030.240
Peritumoral edema4.7551.868-12.104<0.0013.9701.497-10.5240.006
Location1.2540.832-1.8890.280
T stage1.4521.009-2.0900.045
N stage1.7080.579-5.0400.332
M stage1.5950.571-4.4520.373

Positive results of clinical information and MRI characteristics in univariate and multivariate logistic regression.

Variable: Clinical and MRI characteristics; Univariate: Odds ratio (OR), 95% confidence interval (CI), and p-value for each variable in univariate analysis; Multivariate: Same metrics for multivariate analysis, showing the adjusted effects of each variable.

3.2 Radiomics feature selection and radiomics signature performance

We initially identified 25 features using mRMR and then further screened these features using LASSO. We established seven radiomics signature subgroups and used 11 machine learning methods to construct separate signatures for each subgroup, resulting in a total of 77 radiomics signatures. The AUC and accuracy of these are shown in Table 3. For the validation set, the best signature and the optimal performance within each subgroup were as follows: the tumoral region signature built using the ExtraTrees classifier had a highest AUC value of 0.613; the intratumoral habitats signature built using the KNN classifier had a highest AUC value of 0.626; the peritumor signature built using the MLP classifier had a highest AUC value of 0.828; the TH-combined signature built using the AdaBoost classifier had a highest AUC value of 0.589; the TP-combined signature built using the LR classifier had a highest AUC value of 0.738; the HP-combined signature built using the MLP classifier had a highest AUC value of 0.856; and the THP-combined signature built using the MLP classifier had a highest AUC value of 0.657. In summary, we found that the HP-combined signature built by combining mRMR and LASSO with the MLP classifier had the highest predictive performance (for the training and validation sets the accuracy values were 0.873 and 0.814, respectively, and the AUC values were 0.923 and 0.856), and was therefore the model used in the subsequent studies. Eighteen features were selected by LASSO to establish the HP-combined signature (the best radiomics signature), as shown in Figure 2, with this including 12 peritumoral features and six habitat features.

Table 3

Training cohortValidation cohort
ACCAUC95% CIACCAUC95% CI
Tumor regionLR0.8920.9320.884 - 0.9880.5120.5810.405 - 0.757
ExtraTrees1.0001.0001.000 - 1.0000.6050.6130.446 - 0.781
SVM0.9220.9440.894 - 0.9940.6510.5830.408 - 0.759
NaiveBayes0.8820.9200.865 - 0.9740.6510.6020.427 - 0.777
KNN0.8140.9030.848 - 0.9570.5350.5910.425 - 0.758
AdaBoost0.9510.9890.976 - 1.0000.5810.5470.369 - 0.726
RandomForest0.9901.0000.999 - 1.0000.5580.5740.399 - 0.749
XGBoost1.0001.0001.000 - 1.0000.6740.4720.285 - 0.660
MLP0.8820.9160.862 - 0.9710.5810.5740.397 - 0.752
GradientBoosting0.9610.9960.990 - 1.0000.5580.4970.312 - 0.682
LightGBM0.8920.9400.898 - 0.9830.4650.4360.257 - 0.615
HabitatsLR0.8140.8310.7511 - 0.91160.5810.4950.317 - 0.674
ExtraTrees1.0001.0001.000 - 1.0000.4880.5680.398 - 0.739
SVM0.8040.8320.746 - 0.9190.6050.5690.398 - 0.741
NaiveBayes0.7750.8000.713 - 0.8860.5810.4540.277 - 0.631
KNN0.7650.8430.769 - 0.9160.5350.6260.464 - 0.789
AdaBoost0.8630.9310.886 - 0.9760.5350.5220.344 - 0.700
RandomForest0.9800.9970.991 - 1.0000.5350.6020.432 - 0.772
XGBoost1.0001.0001.000 - 1.0000.5810.6040.434 - 0.774
MLP0.7840.8310.751 - 0.9100.6280.6090.440 - 0.778
GradientBoosting0.9410.9770.955 - 0.1000.5810.5210.345 - 0.697
LightGBM0.8040.8690.798 - 0.9410.4880.5380.359 - 0.718
PeritumorLR0.8630.8920.829 - 0.9560.7640.7290.566 - 0.892
ExtraTrees1.0001.0001.000 - 1.0000.6280.7120.560 - 0.864
SVM0.8820.9600.928 - 0.9910.7910.7990.665 - 0.933
NaiveBayes0.7840.8210.735 - 0.9070.6740.7990.647 - 0.950
KNN0.7550.8540.787 - 0.9210.5120.6750.519 - 0.831
AdaBoost0.9020.9740.950 - 0.9990.6050.5870.403 - 0.770
RandomForest0.9901.0000.999 - 1.0000.6740.6700.505 - 0.835
XGBoost1.0001.0001.000 - 1.0000.6280.6410.475 - 0.808
MLP0.8430.9140.860 - 0.9680.7440.8280.719 - 0.947
GradientBoosting0.8820.9910.980 - 1.0000.6280.7780.639 - 0.917
LightGBM0.8040.9260.880 - 0.9730.6050.6090.434 - 0.784
Tumor region+ habitatsLR0.8920.9420.900 - 0.9840.4650.5300.352 - 0.709
ExtraTrees1.0001.0001.000 - 1.0000.4650.4700.293 - 0.647
SVM0.9120.9620.919 - 1.0000.5580.5860.410 - 0.761
NaiveBayes0.7940.9080.853 - 0.9630.6050.5230.346 - 0.701
KNN0.8630.9190.870 - 0.9690.5120.5730.399 - 0.747
AdaBoost0.9510.9870.972 - 1.0000.6050.5890.411 - 0.768
RandomForest0.9901.0001.000 - 1.0000.5120.4950.319 - 0.672
XGBoost1.0001.0001.000 - 1.0000.4880.4610.284 - 0.637
MLP0.8730.9400.898 - 0.9830.5350.5620.388 - 0.737
GradientBoosting0.9220.9940.986 - 1.0000.4190.5170.337 - 0.698
Tumor region+ peritumorLightGBM0.8530.9540.918 - 0.9890.5350.5790.405 - 0.753
LR0.8820.9610.930 - 0.9930.6980.7380.585 - 0.892
ExtraTrees1.0001.0001.000 - 1.0000.6740.6980.525 - 0.871
SVM0.9220.9690.934 - 1.0000.6510.7160.556 - 0.877
NaiveBayes0.8430.9050.848 - 0.9620.6280.6990.532 - 0.866
KNN0.8140.9100.859 - 0.9620.6510.6660.506 - 0.825
AdaBoost0.9410.9960.990 - 1.0000.6280.6130.437 - 0.790
RandomForest0.9801.0001.000 - 1.0000.6050.6150.445 - 0.784
XGBoost1.0001.0001.000 - 1.0000.6050.5660.383 - 0.749
MLP0.8630.9400.897 - 0.9820.6280.6880.525 - 0.851
GradientBoosting0.9710.9980.994 - 1.0000.5580.6010.426 - 0.776
LightGBM0.8730.9630.934 - 0.9930.5580.5460.367 - 0.726
Habitats+ peritumorLR0.8630.9120.857 - 0.9670.6740.6390.466 - 0.812
ExtraTrees1.0001.0001.000 - 1.0000.6050.7160.561 - 0.872
SVM0.9120.9560.915 - 0.9960.6740.7710.630 - 0.912
NaiveBayes0.8330.8670.794 - 0.9390.6510.6530.483 - 0.822
KNN0.8330.9120.862 - 0.9620.5810.6700.516 - 0.824
AdaBoost0.9120.9760.954 - 0.9980.6050.6820.522 - 0.842
RandomForest0.9901.0000.999 - 1.0000.7670.7650.615 - 0.915
XGBoost1.0001.0001.000 - 1.0000.6980.7040.541 - 0.866
MLP0.8730.9230.872 - 0.9740.8140.8560.739 - 0.974
GradientBoosting0.9710.9880.971 - 1.0000.6740.6190.448 - 0.790
LightGBM0.8820.9390.894 - 0.9840.6280.5900.417 - 0.764
Tumor region+ habitats+ peritumorLR0.8820.9500.912 - 0.9890.5350.6110.443 - 0.780
ExtraTrees1.0001.0001.000 - 1.0000.5350.5980.426 - 0.771
SVM0.8920.9760.954 - 0.9980.5810.6270.458 - 0.796
NaiveBayes0.7940.9080.853 - 0.9630.5350.5510.372 - 0.730
KNN0.8330.9180.869 - 0.9670.5810.5880.422 - 0.754
AdaBoost09610.9890.975 - 1.0000.6280.6090.435 - 0.783
RandomForest0.9901.0001.000 - 1.0000.4650.5060.334 - 0.678
XGBoost1.0001.0001.000 - 1.0000.5810.5460.371 - 0.722
MLP0.8630.9370.892 - 0.9820.6740.6570.490 - 0.824
GradientBoosting0.9510.9940.985 - 1.0000.5810.5960.420 - 0.772
LightGBM0.9020.9650.935 - 0.9960.5350.6030.433 - 0.773

The predictive performance of different radiomics machine learning signatures in the training and validation cohorts.

Accuracy (ACC), area under the curve (AUC), and 95% CI for each machine learning model in the training cohort; Validation Cohort: Same metrics for the validation cohort; Models: Various machine learning algorithms (e.g., LR, logistic regression; ExtraTrees, extremely randomized trees; SVM, support vector machine; KNN, K nearest neighbor; XGBoost, eXtreme Gradient Boosting; MLP, Multi-Layer perceptron; LightGBM, Light Gradient Boosting Machine).

Figure 2

3.3 Establishment of a radiomics nomogram

A nomogram (Figure 3A) combining the best radiomics signature with the clinical independent predictors was subsequently constructed. Table 4 shows the predictive performance of the radiomics signature, clinical model, and nomogram. The nomogram demonstrated superior performance than the top-performing machine learning signature and clinical model. Figures 3B–D illustrates the calibration curves and DCA of the nomogram. The calibration curves showed good calibration in the training set (Figure 3B) and confirmed favorable calibration in the validation set (Figure 3C), indicating that the nomogram discriminated well. The DCA (Figure 3D) demonstrated that the nomogram provided greater clinical application value than the radiomics signature and the clinical model. Consequently, the nomogram should achieve optimal performance in terms of clinical application.

Figure 3

Table 4

Training cohortValidation cohort
ACCAUC95% CISensitivitySpecificityPPVNPVACCAUC95% CISensitivitySpecificityPPVNPV
Habitats + peri (MLP)0.8730.9230.872 - 0.9740.9640.7660.8280.9470.8140.8560.739 - 0.9740.8750.7780.7000.913
Clinical model0.6670.6650.582 - 0.7490.9270.3620.6300.8100.5350.6130.472 - 0.7550.9380.2960.4410.889
Nomogram0.8920.9370.892 - 0.9810.9640.8090.8550.9500.8140.8680.758 - 0.9780.9380.7410.6820.952

Predictive performance of radiomics signature, clinical model, and nomogram in the training and validation cohorts.

Training Cohort: Accuracy (ACC), AUC, 95% CI, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for each model in the training cohort; Validation Cohort: Same metrics for the validation cohort; Models: Radiomics signature (Habitats + peri with MLP), clinical model, and nomogram.

4 Discussion

Knowledge of the STS histological grade of a tumor is essential for formulating a therapeutic strategy and is a key factor affecting the prognosis (4, 25). The results of our study showed that a signature based on the radiomics features of the intratumoral habitats and peritumoral microenvironments extracted from preoperative MRI could predict the grade of STS with high accuracy. By analyzing the radiomics features derived from tumorous regions, intratumoral habitats, and peritumoral areas, we established a total of 77 single or combined radiomics signatures from seven different subgroups. Among these, the HP-combined signature established by the MLP classifier combined with mRMR and LASSO feature screening methods yielded the best predictive performance. Finally, the nomogram showed better prediction performance than the radiomics signature and the clinical model based on peritumoral edema, and could thus provide valuable information to clinicians and patients and help guide clinical decision-making. The nomogram calibration curve demonstrated that both cohorts could benefit from the approach, and the DCA confirmed that the nomogram achieved the best clinically applicable performance.

Several qualitative MRI characteristics have previously been reported as potential imaging biomarkers for STS grading. Zhang et al. (26) found peritumoral hyperintensity to be an independent risk factor for predicting histopathological grade. Crombé et al. (4) confirmed that high-grade STS was linked to the MRI morphological characteristics of heterogeneity, necrosis, and peritumoral enhancement. In addition, some studies have shown that in addition to the general MRI features related to the prognosis of STS, some specific STS subtypes are independent prognostic factors for specific STS subtypes, such as the “Tail sign” of undifferentiated pleomorphic sarcoma, the “Water−like” appearance of myxofibrosarcoma, the “Triple sign” and absence of calcifications in synovial sarcoma, signal heterogeneity in myxoid liposarcoma (22). In our study, we included peritumoral edema to establish the clinical model. The prediction performance of the clinical model was poor, with an AUC value of only 0.665 for the training set and 0.613 for the external validation set. This result indicates that the MRI characteristics and the low-dimensional clinical data only capture a small portion of the relevant information present in the imaging data and omit a great deal of the lesion heterogeneity detail.

One key aspect of our study is that we analyzed intratumoral habitats. Zhou et al. proposed the concept of extracting quantitative features from distinct tumor sub-regions (27), and later studies showed that sub-regional radiomics analysis methods may better quantify the tumorous subregion related to tumorous growth or aggressiveness than conventional radiomics, and obtain prediction models with higher accuracy (17, 19). These studies suggest that intratumoral habitats may provide valuable clues for tumor prediction and prognosis. In our study, to obtain a model for predicting STS grade with high accuracy, we considered integrating high-throughput radiomics feature analysis with voxel-based habitat segmentation to predict STS grade. The results showed that among the optimal signatures for each subgroup, the predictive performance of the intratumoral habitats radiomics signature was better than that of the conventional tumor radiomics signature, and the predictive performance of the HP-combined signature was better than that of the TP-combined signature. These results demonstrate that the intratumoral clustered segmented habitats contain important information for STS grading and are capable of predicting STS grade with higher accuracy than conventional radiomics approaches based on the whole tumor.

Another key aspect of our study is that we analyzed the peritumoral area. Clinical evidence demonstrates that the heterogeneity of STS extends beyond the tumor itself, encompassing the surrounding area (28). Consequently, the surrounding environment of the tumor can provide valuable data for assessing the aggressive biological behavior of the tumor. Combinations of intratumoral and peritumoral radiomics features were successfully used in recent research to identify the histological categories of renal cell carcinoma (29) and to differentiate between benign and malignant pulmonary nodules (30, 31). Previous research also revealed that the peritumoral microenvironment provides supplementary information for predicting STS histopathological grade (26). Among the optimal signatures of each subgroup in our study, the peritumor signature outperformed any other single radiomics signature, and the TP-combined signature and HP-combined signature performed better than the single tumoral region signature and intratumoral habitats signature, respectively. These results demonstrate that peritumoral radiomics features provide added predictive value.

In our study, the best-performing HP-combined signature contained six habitat features and 12 peritumoral features, with “T1C 1 original glcm ClusterShade” contributing the most to the performance. This feature was extracted in Habitat 1, which is a low-enhancing solid subregion, and we deduce that this region may have a greater association with tumor grade than the other regions. In terms of feature types, Gray Level Co-occurrence Matrix (GLCM) texture features (classified as higher-order features) have demonstrated significant clinical impacts in both radiology and nuclear medicine (32, 33). Although the blood supply to STS tumors is reflected in the CE-T1WI signal strength, it is difficult to distinguish minor signal intensity changes within tissue on conventional MRI. Specifically, “T1C 1 original glcm ClusterShade” quantifies the skewness and uniformity of gray-scale variability that is imperceptible to visual inspection. Previous studies showed GLCM features to possess strong predictive capabilities for tumor grade and found that they played an indispensable role in the construction of radiomic signatures (34, 35).

Regarding the selection of features in this study, mRMR is a novel method for feature screening that screens radiomics features by employing more rational coefficients and reducing redundancy (36). In comparison, LASSO is a feature screening technique that prevents overfitting during signature construction (37). In this study, 11 machine learning methods (LR, ExtraTrees, SVM, NaiveBayes, KNN, AdaBoost, RandomForest, XGBoost, MLP, GradientBoosting, and LightGBM) were selected for investigation, and in combination with the seven subgroups they resulted in a total of 77 radiomics signatures. MLP is a supervised learning method that can learn nonlinear models in real-time, achieving high accuracy and good generalization ability (3840). The present results of this study showed that out of all the tested machine learning signatures, the HP-combined signature built by the MLP classifier combined with the popular mRMR and LASSO feature screening techniques demonstrated the highest predictive ability.

There are some limitations to this study. First, it should be noted that this study was conducted retrospectively, which means that although we had strict inclusion and exclusion criteria, there may still have been selection bias. Second, because of the relatively small sample size, the histological subtypes were not distributed evenly, which may have caused some statistical bias. Additionally, our data were collected from four different institutions utilizing similar but not identical scanners and processes. Therefore, to enhance the stability of features, as well as to account for any variations among them, the combat compensation method and a resampling methodology were employed. Finally, manual segmentation was applied to the work, which may lead to deviations. A recent study (41) demonstrated that automatic segmentation achieved favorable performance. This study is only a preliminary analysis, and further research in a larger prospective clinical series is needed to determine associations between patient tumor grade and spatial habitat analysis.

In summary, we developed a nomogram that can accurately and noninvasively predict STS grade before surgery so that patients can obtain more effective and targeted treatment. By enhancing surveillance and improving adjuvant clinical trial design, our predictive model may help close the gap between radiology and precision healthcare.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics statement

The studies involving humans were approved by The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China The Puyang Oilfield General Hospital, Puyang, Henan, China The Third Hospital of Hebei Medical University, Hebei, China the Shandong Provincial Hospital affiliated with Shandong First Medical University, Jinan, Shandong, China. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements. Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article because This retrospective investigation obtained ethical approval and written informed consent was not required.

Author contributions

BW: Data curation, Investigation, Writing – original draft. HG: Data curation, Investigation, Methodology, Writing – review & editing. MZ: Data curation, Investigation, Writing – review & editing. YH: Conceptualization, Investigation, Methodology, Writing – review & editing. LD: Formal Analysis, Methodology, Writing – review & editing. CH: Writing – review & editing. JX: Data curation, Investigation, Methodology, Validation, Writing – review & editing. HW: Conceptualization, Formal Analysis, Funding acquisition, Methodology, Project administration, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was funded by the Project Grant No. ZR2021MH159 and No. ZR2020MH286 supported by the Shandong Provincial Natural Science Foundation. This work was supported by the National Natural Science Foundation of China (Grant No. 82172035). This study was funded by the Clinical Medicine+X Project of the Affiliated Hospital of Qingdao University (Grant No. QDFY+ X2021015). This work was supported by the Medicine and Health Technology Development Program of Shandong Province (Grant No. 2019WS373).

Acknowledgments

We sincerely thank the Platform Onekey AI for Code consultation of this study. We thank Liwen Bianji (Edanz) (www.liwenbianji.cn) for editing the language of a draft of this manuscript.

Conflict of interest

Author CH was employed by the company Beijing Deepwise and League of Philosophy Doctor PHD Technology Co., Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1433196/full#supplementary-material

Abbreviations

STS, soft tissue sarcoma; MRI, magnetic resonance imaging; CE-T1WI, contrast enhanced fat-suppressed T1 weighted imaging; FS-T2WI, fat-suppressed T2 weighted imaging; TNM, the tumor-node-metastasis staging; FNCLCC, Fédération Nationale des Centres de Lutte Contre le Cancer; mRMR, Minimum Redundancy Maximum Relevance; LASSO, least absolute shrinkage and selection operator; TH-combined signature, signature combining tumoral region and intratumoral habitats features; TP-combined signature, signature combining tumoral region and peritumor features; HP-combined signature, signature combining intratumoral habitats and peritumor features; THP-combined signature, signature combining tumoral region, intratumoral habitats, and peritumor features; LR, Logistic regression; ExtraTrees, Extremely randomized trees; SVM, Support vector machine; KNN, K nearest neighbor; XGBoost, EXtreme Gradient Boosting; MLP, Multi-Layer perceptron; LightGBM, Light Gradient Boosting Machine; GLCM, Gray-level co-occurrence matrix.

References

  • 1

    ClarkMAFisherCJudsonIThomasJM. Soft-tissue sarcomas in adults. New Engl J Med. (2005) 353:701–11. doi: 10.1056/NEJMra041866

  • 2

    PasqualiSGronchiA. Neoadjuvant chemotherapy in soft tissue sarcomas: latest evidence and clinical implications. Ther Adv Med Oncol. (2017) 9:415–29. doi: 10.1177/1758834017705588

  • 3

    Sundby HallKBrulandØSBjerkehagenBZaikovaOEngellauJHagbergOet al. Adjuvant chemotherapy and postoperative radiotherapy in high-risk soft tissue sarcoma patients defined by biological risk factors-A Scandinavian Sarcoma Group study (SSG XX). Eur J Cancer. (2018) 99:7885. doi: 10.1016/j.ejca.2018.05.011

  • 4

    CrombéAMarcellinPJBuyXStoeckleEBrousteVItalianoAet al. Soft-tissue sarcomas: assessment of MRI features correlating with histologic grade and patient outcome. Radiology. (2019) 291:710–21. doi: 10.1148/radiol.2019181659

  • 5

    CoindreJM. Grading of soft tissue sarcomas: review and update. Arch Pathol Lab Med. (2006) 130:1448–53. doi: 10.5858/2006-130-1448-GOSTSR

  • 6

    LinXDavionSBertschECOmarINayarRLaskinWB. Federation Nationale des Centers de Lutte Contre le Cancer grading of soft tissue sarcomas on needle core biopsies using surrogate markers. Hum Pathol. (2016) 56:147–54. doi: 10.1016/j.humpath.2016.06.008

  • 7

    SchneiderNStraussDCSmithMJMiahABZaidiSBensonCet al. The adequacy of core biopsy in the assessment of smooth muscle neoplasms of soft tissues: implications for treatment and prognosis. Am J Surg Pathol. (2017) 41:923–31. doi: 10.1097/PAS.0000000000000867

  • 8

    FangSYangYXuNTuYYinZZhangYet al. An update in imaging evaluation of histopathological grade of soft tissue sarcomas using structural and quantitative imaging and radiomics. J Magn Reson Imaging: JMRI. (2022) 55:1357–75. doi: 10.1002/jmri.27954

  • 9

    GilliesRJKinahanPEHricakH. Radiomics: images are more than pictures, they are data. Radiology. (2016) 278:563–77. doi: 10.1148/radiol.2015151169

  • 10

    BeraKBramanNGuptaAVelchetiVMadabhushiA. Predicting cancer outcomes with radiomics and artificial intelligence in radiology. Nat Rev Clin Oncol. (2022) 19:132–46. doi: 10.1038/s41571-021-00560-7

  • 11

    AertsHJVelazquezERLeijenaarRTParmarCGrossmannPCarvalhoSet al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. (2014) 5:4006. doi: 10.1038/ncomms5006

  • 12

    SunRLimkinEJVakalopoulouMDercleLChampiatSHanSRet al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. Lancet Oncol. (2018) 19:1180–91. doi: 10.1016/S1470-2045(18)30413-3

  • 13

    AertsHJ. The potential of radiomic-based phenotyping in precision medicine: A review. JAMA Oncol. (2016) 2:1636–42. doi: 10.1001/jamaoncol.2016.2631

  • 14

    SongJShiJDongDFangMZhongWWangKet al. A new approach to predict progression-free survival in stage IV EGFR-mutant NSCLC patients with EGFR-TKI therapy. Clin Cancer Res. (2018) 24:3583–92. doi: 10.1158/1078-0432.CCR-17-2507

  • 15

    GatenbyRAGroveOGilliesRJ. Quantitative imaging in cancer evolution and ecology. Radiology. (2013) 269:815. doi: 10.1148/radiol.13122697

  • 16

    ParkJEKimHSKimNParkSYKimYHKimJH. Spatiotemporal heterogeneity in multiparametric physiologic MRI is associated with patient outcomes in IDH-wildtype glioblastoma. Clin Cancer Res. (2021) 27:237–45. doi: 10.1158/1078-0432.CCR-20-2156

  • 17

    WuJCaoGSunXLeeJRubinDLNapelSet al. Intratumoral spatial heterogeneity at perfusion MR imaging predicts recurrence-free survival in locally advanced breast cancer treated with neoadjuvant chemotherapy. Radiology. (2018) 288:2635. doi: 10.1148/radiol.2018172462

  • 18

    KimJRyuSYLeeSHLeeHYParkH. Clustering approach to identify intratumour heterogeneity combining FDG PET and diffusion-weighted MRI in lung adenocarcinoma. Eur Radiol. (2019) 29:468–75. doi: 10.1007/s00330-018-5590-0

  • 19

    FengSYinJ. Dynamic contrast-enhanced magnetic resonance imaging radiomics analysis based on intratumoral subregions for predicting luminal and nonluminal breast cancer. Quant Imaging Med Surg. (2023) 13:6735–49. doi: 10.21037/qims-22-1073

  • 20

    SemenzaGLRuvoloPP. Introduction to tumor microenvironment regulation of cancer cell survival, metastasis, inflammation, and immune surveillance. Biochim Biophys Acta. (2016) 1863:379–81. doi: 10.1016/j.bbamcr.2015.12.015

  • 21

    SpinnatoPClincaR. MRI tail sign in soft-tissue sarcoma. Radiology. (2021) 299:276. doi: 10.1148/radiol.2021203877

  • 22

    ScalasGParmeggianiAMartellaCTuzzatoGBianchiGFacchiniGet al. Magnetic resonance imaging of soft tissue sarcoma: features related to prognosis. Eur J Orthop Surg Traumatol: Orthop Traumatol. (2021) 31:1567–75. doi: 10.1007/s00590-021-03003-2

  • 23

    MadabhushiAUdupaJK. New methods of MR image intensity standardization via generalized scale. Med Phys. (2006) 33:3426–34. doi: 10.1118/1.2335487

  • 24

    FortinJPCullenNShelineYITaylorWDAselciogluICookPAet al. Harmonization of cortical thickness measurements across scanners and sites. NeuroImage. (2018) 167:104–20. doi: 10.1016/j.neuroimage.2017.11.024

  • 25

    ZhaoFAhlawatSFarahaniSJWeberKLMontgomeryEACarrinoJAet al. Can MR imaging be used to predict tumor grade in soft-tissue sarcoma? Radiology. (2014) 272:192201. doi: 10.1148/radiol.14131871

  • 26

    ZhangLYangYWangTChenXTangMDengJet al. Intratumoral and peritumoral MRI-based radiomics prediction of histopathological grade in soft tissue sarcomas: a two-center study. Cancer Imaging: Off Publ Int Cancer Imaging Society. (2023) 23:103. doi: 10.1186/s40644-023-00622-2

  • 27

    ZhouMHallLGoldgofDRussoRBalagurunathanYGilliesRet al. Radiologically defined ecological dynamics and clinical outcomes in glioblastoma multiforme: preliminary results. Transl Oncol. (2014) 7:513. doi: 10.1593/tlo.13730

  • 28

    HoefkensFDehandschutterCSomvilleJMeijndersPVan GestelD. Soft tissue sarcoma of the extremities: pending questions on surgery and radiotherapy. Radiat Oncol. (2016) 11:136. doi: 10.1186/s13014-016-0668-9

  • 29

    ZhouZQianXHuJMaXZhouSDaiYet al. CT-based peritumoral radiomics signatures for Malignancy grading of clear cell renal cell carcinoma. Abdominal Radiol (New York). (2021) 46:2690–8. doi: 10.1007/s00261-020-02890-z

  • 30

    Pérez-MoralesJTunaliIStringfieldOEschrichSABalagurunathanYGilliesRJet al. Peritumoral and intratumoral radiomic features predict survival outcomes among patients diagnosed in lung cancer screening. Sci Rep. (2020) 10:10528. doi: 10.1038/s41598-020-67378-8

  • 31

    DasSKFangKWXuLLiBZhangXYangHF. Integrative nomogram of intratumoral, peritumoral, and lymph node radiomic features for prediction of lymph node metastasis in cT1N0M0 lung adenocarcinomas. Sci Rep. (2021) 11:10829. doi: 10.1038/s41598-021-90367-4

  • 32

    GalavisPEHollensenCJallowNPaliwalBJerajR. Variability of textural features in FDG PET images due to different acquisition modes and reconstruction parameters. Acta Oncol (Stockholm Sweden). (2010) 49:1012–6. doi: 10.3109/0284186X.2010.498437

  • 33

    ShirvaikarMHuangNDongXN. The measurement of bone quality using gray level co-occurrence matrix textural features. J Med Imaging Health Inform. (2016) 6:1357–62. doi: 10.1166/jmihi.2016.1812

  • 34

    ZhangXXuXTianQLiBWuYYangZet al. Radiomics assessment of bladder cancer grade using texture features from diffusion-weighted imaging. J Magn Reson Imaging: JMRI. (2017) 46:1281–8. doi: 10.1002/jmri.25669

  • 35

    KandalgaonkarPSahuASajuACJoshiAMahajanAThakurMet al. Predicting IDH subtype of grade 4 astrocytoma and glioblastoma from tumor radiomic patterns extracted from multiparametric magnetic resonance images using a machine learning approach. Front Oncol. (2022) 12:879376. doi: 10.3389/fonc.2022.879376

  • 36

    PengHLongFDingC. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. (2005) 27:1226–38. doi: 10.1109/TPAMI.2005.159

  • 37

    GuiJLiH. Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data. Bioinf (Oxford England). (2005) 21:3001–8. doi: 10.1093/bioinformatics/bti422

  • 38

    RamkumarMShanmugarajaPAnusuyaVDhiyaneshB. Identifying cancer risks using spectral subset feature selection based on multi-layer perception neural network for premature treatment. Comput Methods Biomech Biomed Eng. (2023) 27:180416. doi: 10.1080/10255842.2023.2262662.

  • 39

    MakrisGMPouliakisASiristatidisCMargariNTerzakisEKoureasNet al. Image analysis and multi-layer perceptron artificial neural networks for the discrimination between benign and Malignant endometrial lesions. Diagn Cytopathol. (2017) 45:202–11. doi: 10.1002/dc.23649

  • 40

    SchmidhuberJ. Deep learning in neural networks: an overview. Neural Netw: Off J Int Neural Netw Society. (2015) 61:85117. doi: 10.1016/j.neunet.2014.09.003

  • 41

    GuoLShiPChenLChenCDingW. Pixel and region level information fusion in membership regularized fuzzy clustering for image segmentation. Inf Fusion. (2023) 92:479–97. doi: 10.1016/j.inffus.2022.12.008

Summary

Keywords

soft tissue sarcoma, habitats, nomogram, radiomics, grade

Citation

Wang B, Guo H, Zhang M, Huang Y, Duan L, Huang C, Xu J and Wang H (2024) Prediction of soft tissue sarcoma grading using intratumoral habitats and a peritumoral radiomics nomogram: a multi-center preliminary study. Front. Oncol. 14:1433196. doi: 10.3389/fonc.2024.1433196

Received

16 May 2024

Accepted

22 November 2024

Published

11 December 2024

Volume

14 - 2024

Edited by

Morgan Michalet, Institut du Cancer de Montpellier (ICM), France

Reviewed by

Paolo Spinnato, Rizzoli Orthopedic Institute (IRCCS), Italy

Yanfen Cui, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University, China

Updates

Copyright

*Correspondence: Jun Xu, ; Hexiang Wang,

†These authors have contributed equally to this work and share first authorship

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics