Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol., 07 August 2025

Sec. Diabetes: Molecular Mechanisms

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1588718

LC-MS-based conventional metabolomics combined with machine learning models to identify metabolic markers for the diagnosis of type I diabetes

Muhadasi Tuerxunyiming,&#x;Muhadasi Tuerxunyiming1,2†Qing Zhao&#x;Qing Zhao3†Qiaosheng HuQiaosheng Hu3Ping Zhu*Ping Zhu4*Shiting Zhu*Shiting Zhu5*
  • 1Zhejiang University School of Medicine, Hangzhou, China
  • 2School of Medicine, Hangzhou City University, Hangzhou, China
  • 3Endocrinology Department, Lianshui County People’s Hospital, Huai’an, Jiangsu, China
  • 4Endocrinology Department, The Affiliated Chuzhou Hospital of Traditional Chinese Medicine of Jiangsu College of Nursing, Huai’an, Jiangsu, China
  • 5Rehabilitation Medicine Department, Lianshui County People’s Hospital, Huai’an, Jiangsu, China

Background: Changes in certain metabolites are linked to an increased risk of type I diabetes (T1D), making metabolite analysis a valuable tool for T1D diagnosis and treatment. This study aimed to identify a metabolic signature linked with T1D.

Methods: Untargeted metabolomic profiling was performed using liquid chromatography–mass spectrometry (LC-MS) on peripheral blood samples from T1D patients (n = 45) and healthy controls (n = 40). Data preprocessing and quality control were conducted using MetaboAnalyst 4.0. Differential metabolites (DMs) were identified via Wilcoxon rank-sum test (P< 0.05), and key diagnostic markers were selected using least absolute shrinkage and selection operator (LASSO) regression. A streptozotocin (STZ)-induced diabetic rat model was used for in vivo validation.

Results: A total of 157 annotated metabolites were detected (58 in ESI− and 99 in ESI+ mode). Twenty-six DMs were identified, including 25 upregulated and 1 downregulated in the T1D group, mainly involving Acylcarnitines and xanthine metabolites. LASSO regression selected Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine as candidate markers. In the rat model, Hydroxyhexadecanoyl carnitine and Valerylcarnitine demonstrated strong diagnostic performance, with AUCs of 0.9383 (95% CI: 0.8786–0.9980) and 0.8395 (95% CI: 0.7451–0.9338), respectively (P< 0.01).

Conclusion: Hydroxyhexadecanoyl carnitine and Valerylcarnitine are closely linked to altered lipid oxidation in T1D and show strong potential as diagnostic biomarkers. These findings provide new insights into the metabolic basis of T1D and offer promising targets for early detection.

Introduction

Type I diabetes (T1D) is a chronic autoimmune disease characterized by the destruction of insulin-producing cells in the pancreas, leading to insufficient insulin production and the inability to regulate blood glucose levels properly (1, 2). T1D primarily affects children and adolescents, significantly impacting their quality of life and life expectancy. Globally, an estimated 1.9 million children under the age of 15 are affected by T1D, with approximately 77,000 new cases diagnosed each year, representing an annual increase of about 3% (3). Although the exact causes of T1D remain unclear, it is believed to result from a combination of genetic, environmental, infectious, and nutritional factors (4). Research has indicated that specific changes in metabolites may be linked to an elevated risk of developing T1D (57). Metabolite analysis has thus emerged as a valuable tool in both the diagnosis and treatment of T1D, prompting extensive efforts to clarify the relationship between T1D and metabolic changes.

Metabolomics is the study of small molecules produced by metabolic reactions within biological systems. Low molecular weight molecules found in human body fluids, such as blood or urine, can be measured using analytical chemistry techniques, including gas or liquid chromatography coupled with mass spectrometry (811). This approach is instrumental in studying disease-related metabolic markers and predicting diseases in clinical settings. For example, Liu et al. found that metabolites like triglycerides, amino acids, and small intermediate compounds were associated with type 2 diabetes (T2D) and could serve as predictors of future T2D onset (12). Similarly, Takayuki Teruya and colleagues identified 33 metabolic markers, 12 of which were enriched in erythrocytes, that were linked to dementia, confirming their association with disease progression (13). In obesity research, metabolomic analyses highlighted significant differences in gamma-glutamyl, branched-chain amino acid metabolism, and triacylglycerols in lipid profiles of obese patients (10). These examples demonstrate the value of metabolomics in uncovering disease-specific metabolic markers.

In this study, to explore the metabolic marker profiles of T1D, we conducted routine metabolomics analysis using liquid chromatography–mass spectrometry (LC-MS) technology. Peripheral blood samples were collected from pediatric T1D patients and healthy controls, and a machine learning model was applied to identify key metabolic markers.

Materials and methods

Objects

In this study, 45 pediatric patients diagnosed with T1D at Hangzhou City University were included, along with 40 healthy non-diabetic controls hospitalized for other reasons. The inclusion criteria for T1D participants were based on the 2020 American Diabetes Association (ADA) diagnostic guidelines, including fasting plasma glucose (FPG) ≥ 7.0 mmol/L and/or glycated hemoglobin (HbA1c) ≥ 6.5%, in combination with positive pancreatic autoantibodies (e.g., GADA, ICA, or IAA) and insulin dependence. Healthy controls were defined as individuals with FPG< 5.6 mmol/L, HbA1c< 5.7%, and no known history of diabetes, metabolic disorders, or autoimmune disease. The case and control groups were matched for age and sex. Inclusion criteria were based on the new diagnostic guidelines from the Diabetes Society of the Chinese Medical Association. Diagnosis required (a) plus two or more of the following: (b) ketonuria, (c) nocturia >3–4 times per night, (d) significant weight loss, (e) a first-degree relative with diabetes requiring insulin, or (f) a history of autoimmune disease. Patients with diabetic ketoacidosis were excluded. Peripheral blood samples were collected from both groups and stored at -80°C. Informed consent was obtained from all participants, and the study was approved by the Ethics Committee of Hangzhou City University.

Sample metabolites were extracted by mixing 25 μL of plasma with 300 μL of cold acetonitrile:methanol (5:4:1, v:v:v) and vortexing the mixture. The sample was then placed on ice for 30 minutes before being centrifuged at 12,000 rpm for 10 minutes at 4°C. The supernatant was collected and transferred to a suitable LC/MS vial for analysis.

LC-MS technology for metabolomic analysis

Metabolomic analysis of peripheral blood samples was conducted using LC-MS technology. Data were collected with a 4000 QTRAP triple quadrupole mass spectrometer (Applied Biosystems/Sciex), coupled to a multiplex LC system that included two 1200-series pumps (Agilent Technologies) and an HTS PAL autosampler (Leap Technologies) equipped with two injection ports and a column selector valve. The pumps were configured for hydrophilic interaction chromatography (HILIC) using a 150 × 2.1 mm Atlantis HILIC column (Waters) and the following mobile phases: mobile phase A consisted of 10 mM ammonium formate with 0.1% formic acid (v/v), and mobile phase B was acetonitrile with 0.1% formic acid (v/v). A 5 μL injection volume was utilized. The multiplex technique facilitated the measurement of 61 metabolite conversions across the two LC systems, with each sample being injected once into each system. The optimized gradient was programmed as follows: each column was eluted isocratically with 5% mobile phase A for one minute, followed by a linear gradient to 60% mobile phase A over 10 minutes.

For MS detection, electrospray ionization (ESI) and multiple reaction monitoring (MRM) scanning in positive ion mode were utilized. Each metabolite was detected by injecting a reference standard before sample analysis. The dwell time for each transition was set at 30 ms, with an iron spray voltage of 4.5 kV and an ion source temperature of 425°C. Internal standard peak areas were monitored for quality control, and individual samples with peak areas deviating from the group mean by more than two standard deviations were re-analyzed.

All sample labeling and LC-MS analysis were conducted in a blinded manner. Sample IDs were randomly coded prior to analysis to ensure that the analysts were unaware of the group assignments.

Quality control (QC) samples were generated by pooling equal volumes of all individual samples and were injected at regular intervals throughout the LC-MS run. Principal component analysis (PCA) of the QC samples showed tight clustering, indicating good instrument stability and reproducibility. The coefficients of variation (CVs) of the internal standards across QC injections were consistently below 15%.

Metabolite identification

MS raw data were preprocessed using MultiQuant 2.0 (SCIEX), with compounds identified by comparison to purified standards or library entries for frequently occurring unknown entities. All identified metabolites were categorized by superclass, class, and subclass. Detailed classifications and chemical/physical properties can be accessed from the Human Metabolome Database (HMDB; http://www.hmdb.ca/).

Metabolomic data analysis

The preprocessed metabolite data were imported into MetaboAnalyst 4.0 (http://www.metaboanalyst.ca) for normalization, principal component analysis, partial least squares discriminant analysis, and volcano plotting. Prior to statistical analysis, the dataset was normalized using log transformation and auto-scaling. Principal component analysis (PCA) was performed to assess potential batch effects, and no significant clustering by batch was observed, indicating the absence of systematic batch variation. Differential metabolites (DMs) were identified using Wilcoxon analysis with the criteria of p-value< 0.05 and |logFC| > 0.5. Additionally, MetaboAnalyst was utilized for functional enrichment and metabolic pathway analysis.

LASSO regression prediction model

After data preprocessing, a dichotomous predictive model was constructed using the LASSO regression method. In this model, differential metabolites served as independent variables, while the diagnosis of diabetes mellitus (0 or 1) was the dependent variable. This approach included variable selection to identify key biomarkers associated with diabetes mellitus. LASSO regression minimizes the sum of squared residuals while incorporating an L1 penalty term on the regression coefficients, defined as follows:

limβ0,β12ni=1n(yiβ0xiTb)2+λj=1p|βj|

In the LASSO regression model, nnn represents the number of samples, ppp denotes the number of independent variables, yiy_iyi​ is the value of the dependent variable for the iii-th sample, xix_ixi​ is the vector of independent variables for the iii-th sample, β0\beta_0β0​ is the intercept term, β\betaβ is the vector of regression coefficients, and λ\lambdaλ is the penalty coefficient.

Next, a plot of the regression coefficients against log⁡(λ)\log(\lambda) log(λ) is created. As log⁡(λ)\log(\lambda) log(λ) increases, the regression coefficients gradually decrease towards zero. Using the R package, the regression coefficients corresponding to the optimal alpha value are extracted with the “coef” function, and non-zero regression coefficients are filtered out to identify the key biomarkers.

T1D rat model

Twenty male Wistar rats (2–3 months old; 300–400 g) were used in this study. The rats were housed in temperature-controlled rooms (24 ± 1°C) with continuous access to food and water. The animal experiments were approved by Hangzhou City University. The rats were divided into two groups of 10. Type 1 diabetes was induced as previously described (14), with rats receiving streptozotocin (STZ, 55 mg/kg, intraperitoneally), while control rats received the same dose of saline via intraperitoneal injection. After STZ injection, weight and blood glucose levels were measured every two days for 4 or 8 weeks. Random blood glucose levels were determined using a portable glucometer (Omron, Shanghai, China) with calibrated test strips. Additionally, peripheral blood levels of glycosylated serum protein (GSP) and insulin were measured at weeks 4 and 8, respectively. Successful diabetes induction in rats was confirmed by measuring fasting blood glucose levels, with values exceeding 250 mg/dL indicating hyperglycemia. Additionally, signs such as polyuria, polydipsia, and weight loss were observed to support the diagnosis.

Enzyme-linked immunosorbent assay

Serum GSP levels and insulin were determined using specific assay kits. Fasting blood samples were collected from the rats, and the GSP assay kit and insulin ELISA kit (both from Shanghai Sangong Biotechnology, Shanghai, China) were used according to the manufacturer’s instructions. Results were recorded at an OD of 450 nm absorbance using a spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA).

Detection of peripheral blood metabolic markers

Peripheral blood samples from the rats were collected and analyzed for three metabolic markers: Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine, using LC-MS. An internal standard solution of D3-Hydroxyhexadecanoyl carnitine, D3-Propionylcarnitine, and D3-Valerylcarnitine at a concentration of 1 μg/mL was added to each serum sample (10 μL). Next, 500 μL of a cold acetonitrile and methanol aqueous solution was added to precipitate proteins, thoroughly mixed, and centrifuged at 3000 rpm for 10 minutes to remove the precipitate. The supernatant was then transferred to a new vial for LC-MS analysis. The chromatographic and mass spectrometry conditions were consistent with those used in previous metabolomics studies. The metabolites were identified by their m/z values as follows: Hydroxyhexadecanoyl carnitine (m/z 402.3 → 85.1), D3-Hydroxyhexadecanoyl carnitine (m/z 405.3 → 88.1), Propionylcarnitine (m/z 218.1 → 85.1), D3-Propionylcarnitine (m/z 221.1 → 88.1), Valerylcarnitine (m/z 246.1 → 85.1), and D3-Valerylcarnitine (m/z 249.1 → 88.1). The relative amounts of the three metabolic markers in each serum sample were calculated using the internal standard method.

Statistical analysis

To evaluate the validity and reliability of the three metabolic markers as diagnostic tools, ROC curve analysis was conducted using SPSS software version 26.0 (IBM Corporation, Chicago, Illinois, USA). The sensitivity, specificity, accuracy, and AUC (area under the curve) for each marker were calculated and statistically tested, with a significance level set at P< 0.05. LASSO regression was conducted using the glmnet package in R. The optimal λ value (log(λ) = −3.77) was selected based on the minimum mean squared error.

Additionally, all animal experiments were repeated three times, and an unpaired Student’s t-test was employed to compare differences between the two groups. All data are presented as mean ± standard deviation (SD), with P< 0.05 indicating statistical significance.

Results

Metabolite profiling

Peripheral blood samples were collected from T1D patients and normal control populations for metabolomic analysis using LC-MS to investigate the metabolic profile of T1D. A total of 157 known metabolites were identified, with 58 detected in ESI- mode and 99 in ESI+ mode. Representative LC-MS spectra of the key metabolites are provided in Supplementary Figure S1.

Based on metabolite superclasses, the majority of metabolites were categorized into fatty acids (63), organic acids (60), and nucleic acids (19) (Figure 1A). Further classification revealed that 41 metabolites fell under the amino acids and peptides class, which constituted the largest group. Additionally, there were significant distributions in fatty esters (38) and fatty acids (20) (Figure 1B).

Figure 1
Three pie charts labeled A, B, and C. Chart A shows a large portion of organic acids and fatty acyls. Chart B highlights amino acids and peptides, with significant portions of fatty esters and fatty acids. Chart C emphasizes amino acids and acyl carnitines, with unsaturated fatty acids also featured prominently. Each chart includes a detailed legend.

Figure 1. Metabolite profiling. (A) Pie chart of metabolite super classes. (B) Pie chart of metabolite main classes. (C) Pie chart of metabolite sub classes.

A more detailed analysis showed that approximately 39 metabolites belonged to the amino acids subclass, followed by 37 in the acyl carnitines subclass and 10 in the unsaturated fatty acids subclass (Figure 1C).

Differential metabolites in T1D patients versus normal controls

Wilcoxon analysis identified 26 significantly different metabolites (DMs) (P< 0.05, |logFC| > 0.5), of which 25 were up-regulated and one was down-regulated in T1D patient samples (Figure 2A). A complete table of all 26 differential metabolites, including fold changes and both raw and FDR-adjusted p-values, was presented in Supplementary Table S1. The Kolmogorov-Smirnov test revealed that these metabolites were primarily enriched in the amino acids, acyl carnitines, and xanthines classes (Figure 2B). Furthermore, when grouping DMs by metabolite classification, significant differential expression was observed between the acyl carnitines and xanthines classes in T1D patient samples (Figures 2C, D).

Figure 2
A four-panel composite showing different aspects of metabolite analysis:  A. A scatter plot displaying log-transformed fold change versus p-value, highlighting upregulated, downregulated, and non-significant metabolites.  B. A bar graph depicting calculated p-values for amino acids, acyl carnitines, and xanthines.  C. A bar chart showing significantly altered metabolic classes, xanthines and acyl carnitines, based on the number of metabolites.  D. A bubble plot illustrating metabolite classes grouped by fraction directional change and p-value, highlighting acyl carnitines and xanthines.

Figure 2. Differential metabolites (DMs) in T1D patients versus normal controls. (A) Volcano plot of differential metabolites (DMs) between T1D and healthy controls. (B) Class Enrichment of DMs by Kolmorogov-Smirnov test. (C) Class Enrichment of DMs displayed in chart. (D) Bubble plot of results grouped by metabolite class.

Enrichment analysis of differential metabolites

To explore the metabolic profile of T1D, DMs were subjected to pathway enrichment analysis. Gene Ontology (GO) analysis showed that these DMs were primarily associated with biological processes such as mineral absorption, the AMPK signaling pathway, and the sulfur relay system (Figure 3A). Notably, the AMPK pathway plays a key role in cellular energy homeostasis and is closely linked to insulin signaling and glucose uptake, suggesting a metabolic adaptation in T1D.

Figure 3
Two plots labeled A and B analyze different metabolic pathways. Plot A is a horizontal scatter plot showing Differential Abundance (DA) scores for pathways, with point sizes varying by count and colored by p-values using a gradient scale. Plot B is a bubble plot displaying pathways' Rich Factor against p-values, also sized by count and color-coded similarly. Key metabolic pathways include mineral absorption, AMPK signaling, and others. A gradient color legend for p-values and a size legend for count are present.

Figure 3. Enrichment analysis of differential metabolites. (A) GO analysis shows enrichment in biological processes such as mineral absorption, AMPK signaling, and the sulfur relay system, which are related to energy metabolism and oxidative stress in T1D. (B) KEGG pathway analysis reveals enrichment in glyoxylate and dicarboxylate metabolism, carbon metabolism, cysteine and methionine metabolism, and linoleic acid metabolism, indicating disturbances in mitochondrial function, amino acid, and lipid metabolism. Point size reflects the number of metabolites; color indicates statistical significance.

KEGG pathway enrichment analysis revealed that the differential metabolites were mainly involved in glyoxylate and dicarboxylate metabolism, carbon metabolism, cysteine and methionine metabolism, and linoleic acid metabolism (Figure 3B). These pathways are associated with mitochondrial energy production, oxidative stress regulation, and lipid metabolism—processes known to be dysregulated in T1D. Together, these findings suggest that metabolic alterations in T1D may reflect disrupted energy balance and redox homeostasis.

Metabolites associated with the development of T1D were identified through LASSO regression modeling

To identify metabolites associated with T1D, a dichotomous predictive model was constructed using LASSO regression. DMs served as independent variables, while diabetes diagnosis (0 or 1) was the dependent variable. The optimal alpha value was found to be -3.77, corresponding to a mean squared error (MSE) of 0.52, which was used to fit the LASSO regression model and determine the final regression coefficients (Figure 4A). The plot of regression coefficients against Log(λ) demonstrated that as Log(λ) increased, the coefficients gradually decreased, with some reaching zero (Figure 4B). Non-zero coefficients were identified as key metabolites. Ultimately, three key metabolites associated with T1D were identified: Hydroxyhexadecanoyl carnitine (P=0.026, log2 Fold Change=0.751), Propionylcarnitine (P=0.020, log2 Fold Change=0.795), and Valerylcarnitine (P=0.016, log2 Fold Change=0.805). A metabolic pathway map highlighting where Hydroxyhexadecanoyl carnitine and Valerylcarnitine participate in fatty acid β-oxidation and branched-chain amino acid metabolism was shown in Supplementary Figure S2. Notably, all three metabolites belonged to the Fatty esters main class and the Acyl carnitines subclass.

Figure 4
Chart A shows a sequence of red points aligned horizontally, with vertical black lines busily filling the right side, indicating variability or error. Chart B depicts two lines; the blue line labeled 'Subgroup' decreases from left to right, and the green line labeled 'λ' remains flat throughout. Both charts span a horizontal axis from -4 to 1.

Figure 4. Metabolites associated with the development of T1D were identified through LASSO regression modeling. (A) Lasso regression cross-validation plot. Vertical coordinate: model mean square error; Horizontal coordinate: logarithmic value of λ. (B) Plot of λ vs. model regression coefficients.

Validation of 3 T1D-related metabolic markers in a diabetic rat model

A T1D rat model was created to assess the diagnostic potential of metabolic markers through STZ injection. In the T1D group, GSP levels were significantly higher than in the normal control (NC) group at both 4- and 8-weeks post-induction (Figure 5A). Additionally, insulin levels were notably reduced in the T1D group compared to NC (Figure 5B). Persistently elevated blood glucose levels and decreased body weight confirmed the establishment of the diabetic rat model (Figures 5C, D).

Figure 5
Graphs showing various metrics in CN and T1DM groups. A: GSP levels are higher in T1DM than in CN at both 4 and 8 weeks. B: Insulin levels are lower in T1DM compared to CN at 4 and 8 weeks. C: Weight increases over time, higher in CN than T1DM at 4 and 8 weeks. D: Blood glucose is consistently higher in T1DM than CN across the weeks measured. Significant differences are marked with asterisks.

Figure 5. Construction of diabetic rat models. (A) Comparison of GSP levels in blood between CN and T1D rat models at 4, 8W. (B) Comparison of insulin levels in blood between CN and T1D rat models at 4, 8W. (C) Comparison of body weight between CN and T1D rat models at 4, 8W. (D) Comparison of blood glucose between CN and T1D rat models at 4, 8W.

The concentrations of Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine in peripheral blood were analyzed using ROC curve analysis. Results indicated that Hydroxyhexadecanoyl carnitine and Valerylcarnitine were significant diagnostic markers (P< 0.05), with areas under the ROC curve (AUC) of 0.9383 and 0.8395, respectively, both with 95% confidence intervals (CIs) (Figures 6A–C). These findings suggest that Hydroxyhexadecanoyl carnitine and Valerylcarnitine could serve as effective indicators for T1D diagnosis.

Figure 6
Panel A shows the ROC curve for Hydroxyhexadecanoyl carnitine, Panel B for Propionylcarnitine, and Panel C for Valerylcarnitine. Each graph plots sensitivity against one hundred percent minus specificity, with a reference diagonal line indicating random chance.

Figure 6. Validation of 3 T1D-related metabolic markers in a diabetic rat model. (A) ROC curve of Hydroxyhexadecanoyl carnitine in blood between CN and T1D rat models. (B) ROC curve of Propionylcarnitine in blood between CN and T1D rat models. (C) ROC curve of Valerylcarnitine in blood between CN and T1D rat models. AUC values are shown with 95% confidence intervals estimated via bootstrapping.

Discussion

Metabolomic profiling captures the cumulative effects of both endogenous physiological processes and exogenous influences (15, 16). In this study, we employ advanced untargeted metabolomic analysis to enhance our understanding of T1D disease mechanisms and to identify potential new diagnostic and therapeutic opportunities. The predominance of amino acids, acyl carnitines, and fatty acid subclasses in the metabolomic profile of T1D patients is biologically plausible, as these metabolites are closely linked to insulin regulation, mitochondrial function, and lipid metabolism—key pathways disrupted in type 1 diabetes. Alterations in amino acid metabolism may reflect impaired glucose utilization and increased protein catabolism, while elevated acyl carnitines and fatty acids suggest shifts toward fatty acid oxidation due to insulin deficiency.

In this study, we recruited 50 T1D patients and healthy controls, collecting peripheral blood samples for metabolite analysis using LC-MS technology. We identified a total of 157 known metabolites: 58 in ESI- mode and 99 in ESI+ mode. The metabolites were classified by superclass, class, and subclass, revealing that most belonged to the fatty acid superclass, amino acids and peptides class, and amino acid subclass. Previous studies have also detected a significant presence of fatty acid metabolites in serum samples (17, 18). This is likely due to the distribution of fatty acids in red blood cell (RBC) membranes, which allows for their detection in blood samples (19, 20). Additionally, amino acid metabolites such as arginine-d7, tryptophan-d5, and glutamine-d5 were identified in the blood of diabetic patients (21). These findings align with our results, highlighting the substantial presence of fatty acids and amino acid class compounds in blood samples.

Further analysis using Wilcoxon tests identified 26 significantly DMs in T1D patient samples, with 25 compounds showing up-regulation and one down-regulation. Notable differences were observed in the expression of metabolites from the acylcarnitines and xanthine classes in T1D patient samples.

Acylcarnitines, which are fatty acid metabolites, play crucial roles in various cellular energy metabolic pathways (22). Metabolic abnormalities have been linked to changes in acylcarnitine production and excretion, indicating disruptions in energy metabolism (22, 23). For instance, prior studies have reported elevated serum levels of acylcarnitine metabolites in patients with type 2 diabetes, correlating with an increased risk of developing T2D (24, 25). An increased proportion of Acylcarnitines in hepatic metabolites can lead to enhanced lipid oxidation in the liver (26).

Xanthine, a product of purine metabolism, is converted to uric acid through oxidation by xanthine oxidase (27). Research has shown that xanthine oxidoreductase activity, which catalyzes the oxidation of hypoxanthine to xanthine, is elevated in patients with T2D and correlates with arterial stiffness in these individuals (28). This suggests that higher levels of Xanthines may be present in diabetic patients. Additionally, significant increases in xanthine, hypoxanthine, and AMP levels have been documented in plasma and liver tissues of diabetic rat models, indicating their potential as biomarkers for assessing diabetes progression (29). Together, these findings suggest that Acylcarnitines and xanthine metabolites are present in blood samples from T1D patients and may be closely related to the pathogenesis of the disease.

Emerging algorithms, particularly machine learning methods like LASSO regression, hold great promise for biomarker discovery in disease contexts (30, 31). Many studies have successfully employed machine learning models for disease diagnosis, prognosis, and marker identification. For instance, Jun Liu et al. utilized LASSO regression to identify 24 metabolic markers associated with diabetes, constructing a comprehensive model that integrated fasting glucose, TRF, and metabolic markers to optimize predictive accuracy (12). Similarly, Shi applied a combination of algorithms, including VIP, time series, LASSO, and SVM-RFE, to screen 25 biomarkers related to aging in a training group (32). LASSO regression has also been effective in identifying critical risk factors for prediabetes and diabetes, contributing to the development of early warning models for these conditions (33).

In this study, the LASSO regression method successfully identified key metabolites associated with T1D, including Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine. Notably, all three metabolites are categorized within the fatty ester main class and the acylcarnitine subclass. This aligns with findings from Yue Sun et al., who noted that elevated levels of Valerylcarnitine (C5) are linked to an increased risk of diabetes mellitus (34). Hydroxyhexadecanoyl carnitine (also known as 3-hydroxypalmitoylcarnitine) has been identified at elevated levels in the plasma of individuals with type 2 diabetes, with increased concentrations associated with a heightened risk of diabetes-related cardiovascular complications (35). Furthermore, Propionylcarnitine has demonstrated protective effects against cardiac damage in diabetic patients by alleviating mitochondrial respiratory depression (36). However, it should be noted that Propionylcarnitine, though selected, did not yield strong ROC values, suggesting its individual predictive power may be limited.

These findings suggest that Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine are closely linked to diabetes development. The study’s validation in STZ-induced diabetic rats further confirmed the significance of Hydroxyhexadecanoyl carnitine and Valerylcarnitine as potential diagnostic metabolites for T1D. Ultimately, these metabolites could serve as valuable biomarkers for diagnosing T1D.

The successful validation of only two out of three metabolites in the animal model suggests that while Hydroxyhexadecanoyl carnitine and Valerylcarnitine show strong diagnostic potential for T1D, Propionylcarnitine may lack robustness or biological relevance in this context. This highlights the importance of experimental validation to refine biomarker panels and underscores the need for further studies to confirm the reliability and reproducibility of candidate markers across different models and conditions. Additionally, distinguishing metabolite changes specific to T1D from those caused by STZ-induced metabolic stress requires comparing profiles from STZ-treated non-diabetic controls or dose-variant models. Metabolites consistently altered in both human T1D samples and diabetic rats—but not in STZ-only controls—are more likely to reflect disease-specific changes rather than general toxicity or stress responses.

This study represents a significant advancement in proposing metabolic markers for the diagnosis of T1D; however, it does have limitations. First, the study’s findings need further validation through a larger cohort of clinical samples to confirm the diagnostic validity and risk assessment of the identified metabolic markers. Second, the specific mechanisms by which Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine regulate blood glucose levels remain unclear, highlighting the need for additional experimental evidence to elucidate these pathways. Third, while LASSO regression was used to identify potential metabolic markers, the model selection was based solely on the minimum mean squared error without applying formal k-fold cross-validation or using an external validation set. This lack of validation reduces confidence in the model’s generalizability and predictive performance. Furthermore, confidence intervals for the ROC AUC values were initially omitted, limiting interpretability and assessment of variability in marker performance. Although these intervals have now been added for selected metabolites, comprehensive model uncertainty remains unaddressed. Lastly, the small sample size further restricts the reliability of multivariate modeling and increases the risk of overfitting, which can compromise the reproducibility of the identified markers. Addressing these limitations will be essential for strengthening the reliability and applicability of these metabolic markers in clinical practice.

Conclusions

In summary, routine metabolomics analysis using LC-MS technology was conducted on peripheral blood samples from pediatric T1D patients and normal controls. A machine learning approach, specifically LASSO regression, was employed to identify key metabolic markers associated with T1D. Of the 26 differentially expressed metabolites, three-Hydroxyhexadecanoyl carnitine, Propionylcarnitine, and Valerylcarnitine-were found to be significantly linked to T1D development. These markers were further validated in a diabetic rat model, where Hydroxyhexadecanoyl carnitine and Valerylcarnitine exhibited superior diagnostic capabilities. These findings suggest that the presence of Hydroxyhexadecanoyl carnitine and Valerylcarnitine in peripheral blood could serve as effective clinical indicators for diagnosing T1D.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Lianshui County People’s Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. The animal study was approved by the study was approved by the Ethics Committee of Hangzhou City University. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

MT: Data curation, Validation, Writing – original draft, Writing – review & editing. QZ: Data curation, Validation, Writing – original draft, Writing – review & editing. QH: Data curation, Validation, Writing – original draft, Writing – review & editing. PZ: Data curation, Validation, Writing – original draft, Writing – review & editing. SZ: Data curation, Funding acquisition, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1588718/full#supplementary-material

References

1. DiMeglio LA, Evans-Molina C, and Oram RA. Type 1 diabetes. Lancet. (2018) 391:2449–62. doi: 10.1016/s0140-6736(18)31320-5

PubMed Abstract | Crossref Full Text | Google Scholar

2. Primavera M, Giannini C, and Chiarelli F. Prediction and prevention of type 1 diabetes. Front Endocrinol (Lausanne). (2020) 11:248. doi: 10.3389/fendo.2020.00248

PubMed Abstract | Crossref Full Text | Google Scholar

3. Desai S and Deshmukh A. Mapping of type 1 diabetes mellitus. Curr Diabetes Rev. (2020) 16:438–41. doi: 10.2174/1573399815666191004112647

PubMed Abstract | Crossref Full Text | Google Scholar

4. Redondo MJ and Morgan NG. Heterogeneity and endotypes in type 1 diabetes mellitus. Nat Rev Endocrinol. (2023) 19:542–54. doi: 10.1038/s41574-023-00853-0

PubMed Abstract | Crossref Full Text | Google Scholar

5. Pflueger M, Seppänen-Laakso T, Suortti T, Hyötyläinen T, Achenbach P, Bonifacio E, et al. Age- and islet autoimmunity-associated differences in amino acid and lipid metabolites in children at risk for type 1 diabetes. Diabetes. (2011) 60:2740–7. doi: 10.2337/db10-1652

PubMed Abstract | Crossref Full Text | Google Scholar

6. Sysi-Aho M, Ermolov A, Gopalacharyulu PV, Tripathi A, Seppänen-Laakso T, Maukonen J, et al. Metabolic regulation in progression to autoimmune diabetes. PloS Comput Biol. (2011) 7:e1002257. doi: 10.1371/journal.pcbi.1002257

PubMed Abstract | Crossref Full Text | Google Scholar

7. Arneth B, Arneth R, and Shams M. Metabolomics of type 1 and type 2 diabetes. Int J Mol Sci. (2019) 20. doi: 10.3390/ijms20102467

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zamboni N, Saghatelian A, and Patti GJ. Defining the metabolome: size, flux, and regulation. Mol Cell. (2015) 58:699–706. doi: 10.1016/j.molcel.2015.04.021

PubMed Abstract | Crossref Full Text | Google Scholar

9. Feng D, Yuan J, Liu Q, Liu L, Zhang X, Wu Y, et al. UPLC−MS/MS−based metabolomic characterization and comparison of pancreatic adenocarcinoma tissues using formalin−fixed, paraffin−embedded and optimal cutting temperature−embedded materials. Int J Oncol. (2019) 55:1249–60. doi: 10.3892/ijo.2019.4898

PubMed Abstract | Crossref Full Text | Google Scholar

10. Guo K, Savelieff MG, Rumora AE, Alakwaa FM, Callaghan BC, Hur J, et al. Plasma metabolomics and lipidomics differentiate obese individuals by peripheral neuropathy status. J Clin Endocrinol Metab. (2022) 107:1091–109. doi: 10.1210/clinem/dgab844

PubMed Abstract | Crossref Full Text | Google Scholar

11. Ding X, Qin J, Huang F, Feng F, and Luo L. The combination of machine learning and untargeted metabolomics identifies the lipid metabolism -related gene CH25H as a potential biomarker in asthma. Inflammation Res. (2023) 72:1099–119. doi: 10.1007/s00011-023-01732-0

PubMed Abstract | Crossref Full Text | Google Scholar

12. Liu J, Semiz S, van der Lee SJ, van der Spek A, Verhoeven A, van Klinken JB, et al. Metabolomics based markers predict type 2 diabetes in a 14-year follow-up study. Metabolomics. (2017) 13:104. doi: 10.1007/s11306-017-1239-2

PubMed Abstract | Crossref Full Text | Google Scholar

13. Teruya T, Chen YJ, Kondoh H, Fukuji Y, and Yanagida M. Whole-blood metabolomics of dementia patients reveal classes of disease-linked metabolites. Proc Natl Acad Sci U.S.A. (2021) 118. doi: 10.1073/pnas.2022857118

PubMed Abstract | Crossref Full Text | Google Scholar

14. Zhu Y, Devi S, Kumar M, Dahiya RS, and Tarkeshwer. Evaluation of gamma amino butyric acid (GABA) and glibenclamide combination therapy in streptozotocin induced diabetes. Endocr Metab Immune Disord Drug Targets. (2021) 21:2005–16. doi: 10.2174/1871530320666201208110945

PubMed Abstract | Crossref Full Text | Google Scholar

15. Johnson CH, Ivanisevic J, and Siuzdak G. Metabolomics: beyond biomarkers and towards mechanisms. Nat Rev Mol Cell Biol. (2016) 17:451–9. doi: 10.1038/nrm.2016.25

PubMed Abstract | Crossref Full Text | Google Scholar

16. McGrath CM and Young SP. Can metabolomic profiling predict response to therapy? Nat Rev Rheumatol. (2019) 15:129–30. doi: 10.1038/s41584-018-0136-z

PubMed Abstract | Crossref Full Text | Google Scholar

17. Kalhan SC, Guo L, Edmison J, Dasarathy S, McCullough AJ, Hanson RW, et al. Plasma metabolomic profile in nonalcoholic fatty liver disease. Metabolism. (2011) 60:404–13. doi: 10.1016/j.metabol.2010.03.006

PubMed Abstract | Crossref Full Text | Google Scholar

18. Liu J, Geng W, Sun H, Liu C, Huang F, Cao J, et al. Integrative metabolomic characterisation identifies altered portal vein serum metabolome contributing to human hepatocellular carcinoma. Gut. (2022) 71:1203–13. doi: 10.1136/gutjnl-2021-325189

PubMed Abstract | Crossref Full Text | Google Scholar

19. Hernández MC, Rojas P, Carrasco F, Basfi-Fer K, Valenzuela R, Codoceo J, et al. Fatty acid desaturation in red blood cell membranes of patients with type 2 diabetes is improved by zinc supplementation. J Trace Elem Med Biol. (2020) 62:126571. doi: 10.1016/j.jtemb.2020.126571

PubMed Abstract | Crossref Full Text | Google Scholar

20. Thomas T, Cendali F, Fu X, Gamboni F, Morrison EJ, Beirne J, et al. Fatty acid desaturase activity in mature red blood cells and implications for blood storage quality. Transfusion. (2021) 61:1867–83. doi: 10.1111/trf.16402

PubMed Abstract | Crossref Full Text | Google Scholar

21. Jendle J, Hyötyläinen T, Orešič M, and Nyström T. Pharmacometabolomic profiles in type 2 diabetic subjects treated with liraglutide or glimepiride. Cardiovasc Diabetol. (2021) 20:237. doi: 10.1186/s12933-021-01431-2

PubMed Abstract | Crossref Full Text | Google Scholar

22. Dambrova M, Makrecka-Kuka M, Kuka J, Vilskersts R, Nordberg D, Attwood MM, et al. Acylcarnitines: nomenclature, biomarkers, therapeutic potential, drug targets, and clinical trials. Pharmacol Rev. (2022) 74:506–51. doi: 10.1124/pharmrev.121.000408

PubMed Abstract | Crossref Full Text | Google Scholar

23. Jones LL, McDonald DA, and Borum PR. Acylcarnitines: role in brain. Prog Lipid Res. (2010) 49:61–75. doi: 10.1016/j.plipres.2009.08.004

PubMed Abstract | Crossref Full Text | Google Scholar

24. Guasch-Ferré M, Ruiz-Canela M, Li J, Zheng Y, Bulló M, Wang DD, et al. Plasma acylcarnitines and risk of type 2 diabetes in a mediterranean population at high cardiovascular risk. J Clin Endocrinol Metab. (2019) 104:1508–19. doi: 10.1210/jc.2018-01000

PubMed Abstract | Crossref Full Text | Google Scholar

25. Morze J, Wittenbecher C, Schwingshackl L, Danielewicz A, Rynkiewicz A, Hu FB, et al. Metabolomics and type 2 diabetes risk: an updated systematic review and meta-analysis of prospective cohort studies. Diabetes Care. (2022) 45:1013–24. doi: 10.2337/dc21-1705

PubMed Abstract | Crossref Full Text | Google Scholar

26. Shigiyama F, Kumashiro N, Tsuneoka Y, Igarashi H, Yoshikawa F, Kakehi S, et al. Mechanisms of sleep deprivation-induced hepatic steatosis and insulin resistance in mice. Am J Physiol Endocrinol Metab. (2018) 315:E848–e858. doi: 10.1152/ajpendo.00072.2018

PubMed Abstract | Crossref Full Text | Google Scholar

27. Yagi C, Kusunoki Y, Tsunoda T, Murase T, Nakamura T, Osugi K, et al. Xanthine oxidoreductase activity is correlated with hepatic steatosis. Sci Rep. (2022) 12:12282. doi: 10.1038/s41598-022-16688-0

PubMed Abstract | Crossref Full Text | Google Scholar

28. Ohira M, Tanaka S, Watanabe Y, Nakamura S, Oka R, Yamaguchi T, et al. Association of plasma xanthine oxidoreductase with arterial stiffness in type 2 diabetes with liver dysfunction. Am J Med Sci. (2022) 363:242–50. doi: 10.1016/j.amjms.2021.09.011

PubMed Abstract | Crossref Full Text | Google Scholar

29. Varadaiah YGC, Sivanesan S, Nayak SB, and Thirumalarao KR. Purine metabolites can indicate diabetes progression. Arch Physiol Biochem. (2022) 128:87–91. doi: 10.1080/13813455.2019.1663219

PubMed Abstract | Crossref Full Text | Google Scholar

30. Li Z and Sillanpää MJ. Overview of LASSO-related penalized regression methods for quantitative trait mapping and genomic selection. Theor Appl Genet. (2012) 125:419–35. doi: 10.1007/s00122-012-1892-9

PubMed Abstract | Crossref Full Text | Google Scholar

31. Alhamzawi R and Ali HTM. The Bayesian adaptive lasso regression. Math Biosci. (2018) 303:75–82. doi: 10.1016/j.mbs.2018.06.004

PubMed Abstract | Crossref Full Text | Google Scholar

32. Shi D, Tan Q, Ruan J, Tian Z, Wang X, Liu J, et al. Aging-related markers in rat urine revealed by dynamic metabolic profiling using machine learning. Aging (Albany NY). (2021) 13:14322–41. doi: 10.18632/aging.203046

PubMed Abstract | Crossref Full Text | Google Scholar

33. Ou Q, Jin W, Lin L, Lin D, Chen K, and Quan H. LASSO-based machine learning algorithm to predict the incidence of diabetes in different stages. Aging Male. (2023) 26:2205510. doi: 10.1080/13685538.2023.2205510

PubMed Abstract | Crossref Full Text | Google Scholar

34. Sun Y, Gao HY, Fan ZY, He Y, and Yan YX. Metabolomics signatures in type 2 diabetes: A systematic review and integrative analysis. J Clin Endocrinol Metab. (2020) 105. doi: 10.1210/clinem/dgz240

PubMed Abstract | Crossref Full Text | Google Scholar

35. Zhao S, Feng XF, Huang T, Luo HH, Chen JX, Zeng J, et al. The association between acylcarnitine metabolites and cardiovascular disease in chinese patients with type 2 diabetes mellitus. Front Endocrinol (Lausanne). (2020) 11:212. doi: 10.3389/fendo.2020.00212

PubMed Abstract | Crossref Full Text | Google Scholar

36. Broderick TL, Haloftis G, and Paulson DJ. L-propionylcarnitine enhancement of substrate oxidation and mitochondrial respiration in the diabetic rat heart. J Mol Cell Cardiol. (1996) 28:331–40. doi: 10.1006/jmcc.1996.0031

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: type 1 diabetes, metabolomics, LC-MS, metabolic markers, LASSO

Citation: Tuerxunyiming M, Zhao Q, Hu Q, Zhu P and Zhu S (2025) LC-MS-based conventional metabolomics combined with machine learning models to identify metabolic markers for the diagnosis of type I diabetes. Front. Endocrinol. 16:1588718. doi: 10.3389/fendo.2025.1588718

Received: 06 March 2025; Accepted: 07 July 2025;
Published: 07 August 2025.

Edited by:

Hisamitsu Ishihara, Nihon University, Japan

Reviewed by:

Satyaveni Malasala, University of South Carolina, United States
Yanjinlkham Munkhsaikhan, The University of Tokyo, Japan

Copyright © 2025 Tuerxunyiming, Zhao, Hu, Zhu and Zhu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shiting Zhu, emh1c2hpdGluZzA3QDEyNi5jb20=; Ping Zhu, MTUxODk1NjY3NjFAMTYzLmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.