- 1Tianjin Key Laboratory of Retinal Functions and Diseases, Tianjin Branch of National Clinical Research Center for Ocular Disease, Eye Institute and School of Optometry, Tianjin Medical University Eye Hospital, Tianjin, China
- 2Handan Eye Hospital (The Third Hospital of Handan), Handan, Hebei, China
- 3Haihe Lab of ITAI, Naikai University, Tianjin, China
Purpose: Early identification of children at risk for accelerated axial elongation is essential for implementing timely myopia control strategies. Quantitative parameters derived from color fundus photography (CFP) may capture subtle structural and microvascular features relevant to axial length (AL) progression, yet their predictive value remains insufficiently characterized. To develop and validate a machine learning–based model integrating CFP-derived quantitative biomarkers and clinical characteristics to predict 1-year AL progression in school-aged children.
Methods: This cohort study included 693 children aged 6–10 years from Tianjin, China. AL progression >0.2 mm over 1 year was defined as significant elongation. Baseline clinical variables and 144 quantitative CFP metrics were evaluated. Feature selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, logistic regression screening, and expert ophthalmologic assessment. Seven machine learning algorithms were developed using fivefold cross-validation, with hyperparameters optimized by grid search. Model performance was evaluated on an independent validation set using the area under the receiver operating characteristic (ROC) curve (AUC), F1 score, and other metrics. The best-performing model was interpreted using Shapley Additive Explanations (SHAP) and Local Interpretable Model-Agnostic Explanations (lime).
Results: Of the 693 included children, 457 (65.9%) exhibited AL progression >0.2 mm. LASSO regression selected 39 candidate variables, and 12 predictors were ultimately incorporated into the model construction. Among all algorithms, the Random Forest (RF) model achieved the best discrimination, with an AUC of 0.961 (95% CI: 0.933–0.984) and the highest F1 score. Decision curve analysis (DCA) demonstrated a favorable net benefit across clinically relevant thresholds. SHAP analysis indicated that retinal venous density, venous fractal dimension, presence of leopard-spot lesions, vascular fractal dimension, and inferior-region vascular density were among the most influential predictors of AL progression.
Conclusion: The RF model, which combines clinical characteristics with CFP-derived quantitative biomarkers, accurately predicts short-term AL progression in children. Retinal microvascular and fundus structural parameters significantly contributed to model performance, underscoring their potential as early indicators of myopic AL elongation.
Introduction
Myopia among children is a serious public health issue (Burton et al., 2021). By 2050, approximately 50% of the global population is expected to be nearsighted, with 10% suffering from high myopia (HM). This trend continues to grow (Holden et al., 2016; Pan et al., 2025). To address this public health challenge, early detection of children with myopia or pre-myopia is crucial for preventing the progression of myopia during childhood (Zhang S. et al., 2025; Cai et al., 2025). The assessment of axial length (AL) serves not only as a critical parameter for myopia progression but also explains the ocular elongation and biomechanical alterations associated with myopia, ultimately leading to irreversible pathological changes in the fundus of the eye (Cai et al., 2025; Liu et al., 2025; Tao et al., 2025). Currently, there are no biological markers for the progression of AL in early childhood myopia.
As the AL increases, changes in the shape of the eyeball also alter the fundus (Sui et al., 2024). In recent years, color fundus photography (CFP) has also played a role in detecting HM fundus changes (Zhu et al., 2022; Hayashi et al., 2010). Currently, CFP remains the diagnostic basis for retinal lesions in ophthalmic diagnosis and treatment. However, in the early stages of myopia, no retinal lesions are detected, and CFP appears indistinguishable from normal CFP. Recent studies have identified an association between the emergence of leopard spots and the progression of childhood myopia (Gong et al., 2024; Sameshima et al., 2024). Nevertheless, similar CFP biological parameters still fail to indicate the risk of early childhood AL progression adequately.
With the advancement of deep learning (DL) and machine learning (ML), artificial intelligence (AI) has demonstrated significant advantages in image recognition and disease prediction (Hu M. et al., 2025; Yii et al., 2025). Previous studies have utilized CFP in elderly individuals to predict AL length, showing a correlation between CFP and AL (Wang et al., 2024; Oh et al., 2023; Yang et al., 2024). Wang et al. investigated Sphere Equivalent (SE) and its progression using CFP (Wang Z. et al., 2025; He et al., 2024). Quantified CFP using segmentation technology provides more biologically interpretable parameters (He et al., 2024; Chen et al., 2024). We previously employed quantitative indicators from the Children’s CFP to perform binary classification prediction of myopia risk (Zhang X. et al., 2025). However, the predictive value of CFP in childhood AL growth has not been studied.
Therefore, this study aims to establish a predictive model for AL progression using quantitative baseline CFP and ocular parameters, and to conduct an interpretability study of the parameters incorporated into the model via artificial intelligence to identify potential biomarkers indicating AL growth in school-aged children.
Methods
Data collection
The study was approved by the Ethics Committee of Tianjin Medical University Eye Hospital and conducted in accordance with the principles outlined in the Declaration of Helsinki (2024KY-67). This cohort study was conducted in a randomly selected elementary school in Xi Qing District, Tianjin, from April 2024 to October 2025. Children aged 6–10 years were included in the study. Children were excluded from the study if their parents or legal guardians did not provide consent for their participation. Children with systemic diseases (e.g., congenital heart disease, ocular trauma, or ophthalmic diseases such as glaucoma, cataracts, infectious eye infections, and strabismus) were also excluded. A total of 865 children were included in the study. We defined 1-year AL progression ≤0.2 mm as the AL nonprogressive group and AL progression >0.2 mm as the myopic AL progressive group (Lanca et al., 2021; Li et al., 2023). No standardized myopia control interventions were implemented as part of the study protocol. All children underwent a complete ophthalmic examination, including slit-lamp biomicroscope, best-corrected visual acuity (BCVA), Intraocular pressure (IOP) (CT-1, Topcon, Japan), AL (Lenstar LS-900, Haag-Streit AG, Switzerland), and SER (KR-800, Topcon, Japan). All CFP images were examined using a 45° fundus camera (Canon Inc., 9-1 Kanagawa, Japan). Due to poor photo quality and loss to follow-up, 693 children were ultimately included in the study. To avoid bilateral correlation, this study included only one CFP per child for analysis (Hu X. et al., 2025). All biometric and refractive measurements corresponded to the same eye as the included CFP image. The research design process is shown in Figure 1.
Figure 1. The overall flowchart of the study, including data collection and data types, training and validation set partitioning (A), feature selection for predictive models (B), model construction and optimal parameter selection (C), performance comparison and evaluation of predictive models (D), and interpretability analysis of predictive models (E).
Generation and selection of CFP-based indicators
Briefly, CFP images were acquired using a standardized protocol, followed by automated object detection and semantic segmentation to identify retinal structures. Quantitative metrics were subsequently derived from segmented outputs and reviewed by experienced ophthalmologists. Object detection and semantic segmentation models were applied to identify and quantify key retinal characteristics, including tessellation patterns, optic disc and cup morphology, parapapillary atrophy, and vascular features. Image labeling followed a semi-automated, machine-assisted pipeline. Two board-certified ophthalmologists (each with at least 5 years of experience in fundus evaluation) independently reviewed and refined the annotations. The first clinician conducted preliminary edits, while the second, a senior reviewer with over 8 years of experience, provided final adjudication. Owing to their greater expertise, the adjudicator’s markings served as the reference standard. Quantitative disclosure techniques were detailed in our previous research (Zhang X. et al., 2025). Additional methodological details are available in Document S1.
This study included baseline information such as sex, age, cycloplegic SE, Diopter Sphere (DS), and AL. We obtained 144 quantitative indicators of CFP. The quantitative CFP variables were categorized into the following areas: (1) the presence of leopard spots and atrophic arcs; (2) vascular-related parameters, including the length, area, and angle of arterioles and venules, and vascular density (VD) and parameters in a particular area; (3) optic disc-related parameters, including morphologic quantification, relative optic disc position of the optic disc, and optic cup ratio of the optic disc; (4) the distance of essential locations in the fundus (macula, optic disc, and vascular arch). In addition to the quantitative metrics of CFP segmentation itself, there are quantitative metrics representing the regularity and curvature of the vessels, calculated using higher-order mathematics. For example, the vascular fractal dimension (VFD) is computed using the box-counting principle: cover the target area with square boxes of different sizes and count the number of boxes that cover at least part of the target area for each box size (Prasad, 2025). The smaller the box, the higher the number. A linear fit is made between the logarithm of the box size and the logarithm of the number of boxes, and the slope is the fractal dimension. Vessel density in each area is calculated as the area of vessels in that area/the overall area of the fundus (Guan et al., 2025). Retinal vascular curvature was defined as the integral of the squared curvature of the retinal vascular pathway, normalized by the total path length (Sekiryu et al., 2025). Additional methodological details are available in Document S1.
The sample data (subjects) were divided into a training set and a validation set (training set: validation set = 8:2). Due to the large number of variables after image quantization, the Least Absolute Shrinkage and Selection Operator (LASSO) was used to select features, which is a method to introduce L1 regularization, select features, and reduce dimensions by compressing coefficients, screening features with significant contributions, and eliminating redundant features (Song et al., 2026). We also employ single-factor and multiple-factor regression for variable screening. The final determination of predictive factors was based on the clinical selection of comprehensive ophthalmology specialists and the sample size and number of included predictors in the clinical prediction model (Riley et al., 2019).
Model development and validation
Model training is conducted on the designated training set, while model validation is performed on a separate, independent validation set. The two processes are mutually independent. Seven machine learning algorithms—extreme gradient boosting (XGBoost), support vector machine (SVM), Random Forest (RF), artificial neural network (ANN), decision tree, logistic regression (LR), and light gradient boosting machine (LightGBM)—were implemented to develop the predictive models. Model robustness was assessed using fivefold cross-validation. Hyperparameters for each algorithm were tuned via grid search conducted exclusively within the training folds to avoid information leakage. The configuration yielding the highest area under the receiver operating characteristic curve (AUC) was selected as the optimal model. For performance estimation, 95% confidence intervals for AUC were calculated in the validation cohort using 1,000 bootstrap resamples. The predictive models were trained on the designated training dataset, and the optimal model was subsequently evaluated on the validation cohort.
Model performance was quantified using the AUC, sensitivity, specificity, F1 score, recall, accuracy, Kappa, and Youden’s J. To further assess the clinical applicability of the models, decision curve analysis (DCA) and calibration curves were generated. DCA evaluates the net clinical benefit of a predictive model across a range of threshold probabilities, thereby demonstrating its value in guiding clinical decisions (Chalkou et al., 2023). We ultimately selected the optimal model by comprehensively evaluating both F1 scores and AUC. After choosing the optimal model, further examine its confusion matrix and calibration curve on the validation set. The calibration curve demonstrates the degree of agreement between predicted probabilities and actual outcomes, serving as an indicator for assessing model reliability and goodness-of-fit (Van Calster et al., 2019).
Model interpretation
Shapley Additive Explanations (SHAP) values were calculated to assess the magnitude and direction of each variable’s influence on the model’s final classification. Features with larger absolute SHAP values were interpreted as having a more substantial contribution to predictive performance. These importance metrics were used to elucidate the behavior of the optimal model (Bhandari et al., 2022). To complement the global explanation provided by SHAP, the Local Interpretable Model-Agnostic Explanations (lime) method was further applied, offering case-level insights into how specific predictions were generated (Zhao et al., 2025; Huang et al., 2024; Guan et al., 2024). The SHAP dependence plot demonstrates the effect of a single feature on the output of the best model. Model training was conducted using Python 3.4 (https://www.python.org).
Statistical analysis
Group differences were evaluated using Student’s t-test for normally distributed variables and the Wilcoxon rank-sum test for non-normal data. Distributional assumptions were assessed with the Kolmogorov–Smirnov or Shapiro–Wilk tests. Categorical variables were compared using the chi-squared or Fisher’s exact test, as appropriate. Univariable logistic regression was first applied to all candidate predictors, and variables with p < 0.05 were entered into a stepwise multivariable logistic regression to derive the final model. Continuous data are reported as mean ± standard deviation, and statistical significance was defined as p < 0.05 (two-tailed). Normally distributed continuous variables are presented as mean ± standard deviation, whereas non-normally distributed variables are summarized as median [interquartile range]. Distributional assumptions were assessed using the Shapiro–Wilk or Kolmogorov–Smirnov tests. All analyses were conducted in R (v4.4.2).
Results
A total of 693 children were included in the analysis, of whom 236 (34.1%) were classified in the AL ≤ 0.2 mm group and 457 (65.9%) in the AL > 0.2 mm group. Baseline characteristics are summarized in Table 1; Supplementary Table S2.
The distribution of sex differed significantly between groups (p = 0.001), with a higher proportion of females in the AL > 0.2 mm group. The prevalence of atrophic regions did not differ between groups, whereas leopard spot lesions were more frequently observed in eyes with AL progression >0.2 mm (p < 0.001). Age did not differ significantly between the two groups.
Refractive and biometric parameters demonstrated several between-group differences. Children in the AL ≤ 0.2 mm group had higher spherical equivalent (p < 0.001) and DS values (p < 0.001), while baseline AL was slightly greater in the AL > 0.2 mm group (p = 0.043).
Multiple vascular morphological features differed between groups. Significant differences were noted in average vascular branching angle (p = 0.048), vascular fractal dimension (p < 0.001), inferior disk region fractal dimension (p < 0.001), vascular density (p < 0.001), vascular coverage density at 0.5–1.0 PD (p < 0.001), average vascular density within 3 mm of the fovea (p = 0.047, p < 0.05), inferior-region vascular density (p < 0.001), retinal arterial density (p = 0.019, p < 0.05), retinal venous density (p < 0.001), venous fractal dimension (p < 0.001), arterial branching angle (p = 0.039, p < 0.05), and venous and arterial vessel area (both p < 0.05). Several measures of vascular area across parafoveal regions also differed significantly, including values within 1.0–1.5 PD (p = 0.039, p < 0.05) and 2.0–2.5 PD (p < 0.001). In contrast, most optic disc and cup parameters—including optic disc area, diameters, tilt angle, rim distances, and cup–disc ratios—showed no significant differences between groups. Arterial and venous diameter measures, curvature metrics, and various regional vascular parameters also did not differ significantly.
Table 2; Supplementary Table S3 present the results of univariate and multivariate regression analyses for the included variables. Following the multivariate regression analysis, six variables were considered for inclusion in the model. Figure 2 shows the variables and their quantities ultimately selected by the LASSO regression method. The LASSO model minimized binomial deviance when the log(λ) value reached the optimal penalty parameter (0.0037). LASSO selected 39 variables for inclusion in the model. After ophthalmologists determined the variables to be included based on variable screening results and comprehensive consideration of sample size and included variables, the following baseline information was ultimately incorporated: sex, DS, AL, and nine retinal quantitative parameters (Retinal Venous Density, Venous Fractal Dimension, Presence of Leopard Spot Lesions, Vascular Fractal Dimension, Average Vascular Density in the Inferior Region, Vascular Coverage Density between 0.5-1.0 PD, Arterial Vessel Area, Venous Vessel Area, and Vascular Area within 1.0–1.5 PD) were collectively incorporated into the model construction.
Figure 2. Selection of model variable parameters for AL progression. For each variable, parameter selection was performed using single-factor, multiple-factor regression and LASSO (A,B). AL: Axial Length.
Hyperparameter tuning using fivefold cross-validation yielded the optimal parameter sets for all candidate algorithms (Table 3). The decision tree performed best with a maximum depth of 5 and a minimum split size of 10. The RF model achieved its best performance with 150 estimators and two features per split. For gradient boosting methods, XGBoost performed optimally with a learning rate of 0.1, 100 estimators, and a maximum depth of 3, whereas LightGBM required 100 estimators, 31 leaves, and subsampling and column-sampling rates of 0.6. The SVM model showed optimal performance with a linear kernel and C = 0.1, and the ANN model achieved the best results with two hidden layers of 50 units and a tanh activation function.
Model performance metrics for the validation cohort are summarized in Table 4. The AUC values ranged from 0.852 (SVM) to 0.965 (LightGBM). In addition to AUC, accuracy, precision, sensitivity, specificity, F1 score, Cohen’s kappa, and Youden’s index, these metrics demonstrated variable performance across algorithms. The RF model showed a high AUC (0.961; 95% CI, 0.933–0.984) and the highest F1 score among all evaluated models; therefore, it was selected as the final prediction model.
Figure 3 presents the comparative performance of all models. The ROC curves for the seven algorithms are displayed in Figure 3A. Decision curve analysis for the validation set is shown in Figure 3B. The RF Model exhibits superior performance. The confusion matrix for the RF Model (Figure 3C) depicts classification outcomes, and the calibration curve (Figure 3D) shows the agreement between predicted and observed probabilities.
Figure 3. Performance of various predictive models on the AL progression validation set. (A) ROC curves for the ML models. (B) Seven models of the Decision Curve Analysis for the Validation Sets. (C) Confusion matrix for the best model (RF). (D) Calibration Curve for the best model (RF). AL: Axial Length; ML: machine learning; RF: Random Forest.
Feature attribution for the final RF model was assessed using SHAP (Figure 4). The mean absolute SHAP value plot (Figure 4A) displays the ranked contribution of each variable to the model output, with higher values indicating greater influence on the prediction. The SHAP summary plot (Figure 4B) presents the distribution of SHAP values for all features across the dataset, showing the range and dispersion of each feature’s effect on the predicted probability of AL progression. The color gradient reflects the relative magnitude of the underlying feature values. SHAP analysis indicates that the more essential features in the predictive model are the Retinal Venous Density, Venous Fractal Dimension, Presence of Leopard Spot Lesions, Vascular Fractal Dimension, and Average Vascular Density in the Inferior Region.
Figure 4. SHAP-based feature attribution for the RF model. (A) Mean absolute SHAP values rank feature importance. (B) SHAP summary plot illustrating the direction and magnitude of each feature’s effect (red: high value, blue: low value). (C,D) SHAP waterfall plots for representative Class 0 (AL progression <0.2 mm) and Class 1 (AL progression ≥0.2 mm) predictions, showing features driving risk upward (red) or downward (blue). The figure below attempts to plot the features from the previous figure along a straight line.
Instance-level interpretability is demonstrated in the waterfall plots (Figures 4C,D). For a representative case classified as no progression (AL ≤ 0.2 mm), Figure 4C shows how individual features cumulatively shift the model output toward a lower predicted risk. For a representative case classified as progression (AL > 0.2 mm), Figure 4D depicts the sequential contribution of features that increase the expected probability of progression. The supplemental linear arrangement of SHAP values provides an aligned visualization of feature effects for comparison across samples.
SHAP summary plots were generated to visualize the contribution of each feature to the RF model’s prediction of AL elongation. As shown in Figure 5, multiple demographic, structural, and retinal microvascular parameters significantly influenced the predicted probability of AL progression. The presence of leopard-spot lesions showed consistently positive SHAP values across observations. Vascular and venous fractal dimensions showed a broad distribution of SHAP values, with lower feature levels generally associated with higher positive contributions. Several regional vascular metrics, including vascular coverage density between 0.5 and 1.0 PD, average vascular density in the inferior region, venous vessel area, and vascular area within 1.0–1.5 PD, also displayed positive SHAP contributions at varying feature levels. Retinal venous density showed a wide range of SHAP values, with lower values corresponding to higher positive contributions in the model. Additional variables such as base AL, arterial vessel area, and sex demonstrated relatively more minor but discernible contributions.
Figure 5. SHAP summary plots illustrate the contribution of demographic, structural, and retinal microvascular features to the RF model predicting AL progression. Each point represents one individual eye. The horizontal axis represents the SHAP value, indicating the marginal contribution of each feature to the predicted probability of AL progression (greater than 0.2 mm). Positive SHAP values indicate an increased predicted risk of axial elongation, whereas negative values indicate a protective or risk-reducing effect. Points are color-coded according to the normalized feature value, with red representing higher feature values and blue representing lower feature values. The vertical dispersion of points for each variable reflects inter-individual variability and potential nonlinear effects that the model captures. Lower retinal venous density and reduced venous or overall vascular fractal dimensions are associated with predominantly positive SHAP values at lower feature levels, indicating that microvascular rarefaction markedly increases the predicted risk of AL progression. In contrast, higher vascular coverage density and increased vascular area within specific annular regions (0.5–1.0 PD and 1.0–1.5 PD from the optic disc) show positive SHAP contributions at higher feature values, suggesting a region-specific compensatory vascular response. The presence of leopard-spot lesions consistently demonstrates positive SHAP values across observations, highlighting a strong association with accelerated axial elongation. Sex and baseline AL exhibit relatively smaller but discernible contributions, indicating their supportive role within the multivariable prediction framework rather than dominant driving factors. AL, axial length; PD, papillary diameter; SHAP, Shapley Additive Explanations; RF, Random Forest.
Discussion
To balance statistical significance and clinical relevance, we integrated statistical variable screening with clinical expert judgment, selecting sex, DS, AL, and nine quantitative retinal parameters to enhance the model’s interpretability and clinical acceptability. The RF model was ultimately chosen as the optimal model due to its balanced performance, characterized by high AUC values, accuracy, recall, and F1 score. This model achieves a good balance between accurately identifying progressive cases and avoiding false negatives by identifying retinal microvascular features (particularly venous-related parameters and fractal dimension) and leopard spot lesions as key predictors of AL progression. These findings provide new insights and potential biomarkers to understand the mechanisms of AL progression in children and to develop effective intervention strategies. To our knowledge, this study represents an early attempt to construct a predictive model for short-term AL progression in children using quantitatively derived CFP features. In particular, the inclusion of venous-related microvascular parameters, such as venous density and venous fractal dimension, extends existing CFP-based research beyond traditional structural descriptors.
This study analyzed basic information, including sex, DS, and baseline AL. The results showed that sex was one of the factors related to myopia progression, which was consistent with previous studies. Lee SS et al. found that female participants exhibited faster myopia progression and greater AL elongation (Lee et al., 2022). Mingming C et al. also found that the myopia rate in females was significantly higher than in males (Mingming et al., 2025). Many scholars also include sex in the prediction model of myopia progression and the myopia prevention and control effect (Yii et al., 2025; Gopalakrishnan et al., 2024; Fan et al., 2022; Xu et al., 2025). Our study found that female sex was also a factor in the myopia progression of school-age children. Surprisingly, DS and baseline AL showed limited significance in logistic regression analysis. This may be attributed to the study’s screening-based data, which included children with hyperopia reserve or mild-to-moderate myopia, with a narrow age range (6–10 years). Consequently, these parameters had limited predictive value for AL progression, underscoring the critical importance of dynamic AL monitoring. Although sex differences were incorporated into the prediction model, sex-stratified analyses of baseline refractive and biometric parameters were not the primary focus of this study and warrant further investigation in dedicated longitudinal analyses.
In the final model, the importance of microvascular parameters such as retinal venous density, venous fractal dimension, and vascular fractal dimension even surpasses that of some traditional refractive biomarkers (such as baseline AL and SE). This finding further validates the conclusion by Zhao M et al. that morphological parameters of vascular microstructure could serve as a potential biomarker for monitoring the progression of AL (Zhao et al., 2022). Wang Q et al. found that the choroidal Sattler’s and Haller’s layer densities below the macular fovea are critical for monitoring changes in myopia progression (Wang Q. et al., 2025). This is like the conclusion that the average vascular density below the macula is significant in this study. In previous studies, our team also included vascular density as a relevant factor for high myopia in Chinese college students (Zhang et al., 2023). Many scholars have also shown that vascular density is an independent risk factor and an early predictor of myopia progression (Veselinović et al., 2025; Cheng et al., 2019). Fractal dimension serves as a quantitative measure of vascular network complexity, with its reduction typically indicating structural simplification or sparsification. Our study demonstrates that lower vascular and venous fractal dimensions are associated with an elevated risk of AL progression. Previous research has also reported that decreased vascular fractal dimension is strongly associated with reduced choroidal and retinal perfusion, which may constitute a critical pathological basis for sustained AL growth and fundus alterations (Furnon et al., 2023). This phenomenon indicates that during rapid AL elongation, the choroid and retina undergo abnormal or adaptive changes in blood supply, leading to remodeling of the microvascular network (Zhu et al., 2025; Chen et al., 2025). Studies also suggest that myopia acts as a protective factor against diabetic retinopathy (Wang et al., 2016; Fu et al., 2016), likely due to its reduced vascular fractal dimension. The simplified vascular architecture may partially preserve the morphological integrity of microvessels. Meanwhile, the leopard spot lesion, as an essential structural feature, was identified as a strong predictor, possibly related to choroidal thinning and scleral remodeling. The association between choroidal thickness and choroidal vascular density mediated by AL further supports the central role of posterior pole structural changes in myopia progression (Yang et al., 2025).
Interpretation of the SHAP values from the RF model revealed a coherent vascular mechanism underlying AL elongation. Recent research by Wang et al., which quantified CFP for the interpretable analysis of HM in adults, identified particular optic disc and atrophic areas, along with vascular diameter, as effective predictors (Zou et al., 2025). Our study focuses on early changes in childhood myopia and has discovered biomarkers associated with AL progression in children. Lower retinal venous density and reduced venous or overall vascular fractal dimensions were associated with strong positive SHAP values, indicating that microvascular rarefaction markedly increased the probability of AL progression. Such attenuation of the retinal venous network likely reflects early perfusion insufficiency and localized metabolic stress. These hemodynamic alterations may trigger remodeling of the choroid–sclera complex, reduce scleral stiffness, and predispose the eye to accelerated AL elongation. In contrast, higher vascular coverage or vascular area within specific perifoveal and mid-peripheral regions (e.g., 0.5–1.0 PD and 1.0–1.5 PD zones) showed positive SHAP contributions at higher feature values, suggesting a secondary, compensatory dilation response to hypoxia. Although such dilation may occur to enhance perfusion, it likely represents maladaptive vascular remodeling that further destabilizes ocular wall biomechanics. The substantial positive contribution of leopard-spot lesions further supports this interpretation, marking regions of chronic chorioretinal stress and structural vulnerability. Collectively, the model suggests that global venous rarefaction, combined with region-specific compensatory vascular dilation, creates an imbalance in retinal and scleral biomechanics that increases the likelihood of AL elongation. Based on the ranking of variable importance within the predictive model used in this study, we hypothesize that the density and complexity of early-stage vascular development in childhood AL progression may precede the emergence of leopard spots or contribute to their appearance. However, because leopard spot density was suboptimally quantified in early childhood, our study employed a binary classification (present or absent). This categorical variable may also alter the ranking of variable importance.
This study has the following limitations: First, due to the lack of external validation, the results may be influenced by specific equipment, raising questions about the model’s generalization ability. Future research should expand the sample size for external validation. Second, all CFP images were acquired using a single imaging device in one city, which may limit the generalizability of the findings to other populations or imaging systems. Hybrid models integrating raw CFP images and quantitative imageomics features may further enhance predictive performance while preserving interpretability. Additionally, potential unrecorded variations in routine refractive correction during follow-up may have influenced AL progression. Finally, although relative CFP-derived metrics were used to reduce AL–related magnification effects, no formal AL–based image rescaling was applied. Future studies incorporating Littmann or Bennett correction models may further improve measurement accuracy, particularly in populations with extreme AL elongation.
Conclusion
In this school-based cohort of children aged 6–10 years, a machine learning model incorporating quantitative CFP parameters and clinical characteristics demonstrated high accuracy in predicting 1-year AL progression. Retinal microvascular complexity, vessel density metrics, venous structural features, and the presence of leopard-spot lesions emerged as key contributors to model performance. The RF model showed superior discrimination and calibration compared with other algorithms. These findings indicate that CFP-derived biomarkers provide valuable information for identifying children at elevated risk for rapid axial elongation and may support earlier, individualized myopia control interventions. Future studies focusing on causal inference and growth trajectory modeling may further clarify the interaction between age, baseline AL, refractive error, and axial elongation rate.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Tianjin Medical University Eye Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
ZW: Visualization, Software, Investigation, Writing – original draft, Conceptualization, Resources, Methodology. FH: Writing – original draft, Methodology, Data curation, Validation. XZ: Validation, Supervision, Writing – original draft, Software, Investigation. JD: Writing – review and editing, Validation, Formal Analysis, Resources. JY: Writing – review and editing, Project administration, Resources. XX: Supervision, Resources, Writing – review and editing. ZL: Resources, Funding acquisition, Writing – review and editing, Visualization. BD: Funding acquisition, Resources, Project administration, Writing – review and editing. RW: Visualization, Investigation, Conceptualization, Supervision, Writing – review and editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study was supported by grants from the Tianjin Key Medical Discipline Construction Project (TJYXZDXK-3-004A-2).
Acknowledgments
This work was partially supported by EVision Technology (Beijing) Co., Ltd. We are grateful to Xingye ZW and Jie ZW for their technical support with image processing.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcell.2026.1753213/full#supplementary-material
References
Bhandari, M., Shahi, T. B., Siku, B., and Neupane, A. (2022). Explanatory classification of CXR images into COVID-19, pneumonia and tuberculosis using deep learning and XAI. Comput. Biology Medicine 150, 106156. doi:10.1016/j.compbiomed.2022.106156
Burton, M. J., Ramke, J., Marques, A. P., Bourne, R. R. A., Congdon, N., Jones, I., et al. (2021). The lancet global health commission on global eye health: vision beyond 2020. Lancet. Glob. Health 9, e489–e551. doi:10.1016/s2214-109x(20)30488-5
Cai, W., Hu, Y., Chen, X., Morgan, I. G., He, M., and Ding, X. (2025). Acceleration of myopic shifts in refraction and axial elongation begins in the premyopia stage. Investigative Ophthalmology and Visual Science 66, 33. doi:10.1167/iovs.66.13.33
Chalkou, K., Vickers, A. J., Pellegrini, F., Manca, A., and Salanti, G. (2023). Decision curve analysis for personalized treatment choice between multiple options. Med. Decision Making 43, 337–349. doi:10.1177/0272989x221143058
Chen, N., Zhu, Z., Yang, W., and Wang, Q. (2024). Progress in clinical research and applications of retinal vessel quantification technology based on fundus imaging. Front. Bioengineering Biotechnology 12, 1329263. doi:10.3389/fbioe.2024.1329263
Chen, Q., Song, K., Cun, Q., Yang, W., Tao, Y., Zhong, H., et al. (2025). Differences in macular structure and microvasculature between high myopia and non-high myopia. Front. Medicine 12, 1645015. doi:10.3389/fmed.2025.1645015
Cheng, D., Chen, Q., Wu, Y., Yu, X., Shen, M., Zhuang, X., et al. (2019). Deep perifoveal vessel density as an indicator of capillary loss in high myopia. Eye Lond. Engl. 33, 1961–1968. doi:10.1038/s41433-019-0573-1
Fan, Y., Yu, Z., Tang, T., Liu, X., Xu, Q., Peng, Z., et al. (2022). Machine learning algorithm improves accuracy of ortho-K lens fitting in vision shaping treatment. Contact Lens and Anterior Eye 45, 101474. doi:10.1016/j.clae.2021.101474
Fu, Y., Geng, D., Liu, H., and Che, H. (2016). Myopia and/or longer axial length are protective against diabetic retinopathy: a meta-analysis. Acta Ophthalmologica 94, 346–352. doi:10.1111/aos.12908
Furnon, L., Labarere, J., Trucco, E., Hogg, S., MacGillivray, T., and Chiquet, C. (2023). Lower fractal dimension of retinal vessel for patients with birdshot chorioretinopathy. Acta Ophthalmologica 101, 392–402. doi:10.1111/aos.15291
Gong, W., Wang, J., Deng, J., Chen, J., Zhu, Z., Seth, I., et al. (2024). Quantification of fundus tessellation reflects early myopic maculopathy in a large-scale population of children and adolescents. Transl. Vision Science and Technology 13, 22. doi:10.1167/tvst.13.6.22
Gopalakrishnan, A., Sivaraman, V., Hussaindeen, J. R., Swaminathan, M., Gentle, A., Armitage, J. A., et al. (2024). Ocular biometry percentile curves and their relation to myopia development in Indian children. J. Clinical Medicine 13, 2867. doi:10.3390/jcm13102867
Guan, C., Gong, A., Zhao, Y., Yin, C., Geng, L., Liu, L., et al. (2024). Interpretable machine learning model for new-onset atrial fibrillation prediction in critically ill patients: a multi-center study. Crit. Care London, Engl. 28, 349. doi:10.1186/s13054-024-05138-0
Guan, L., Yang, L., Sun, J., Liu, Y., Lu, Z., Feng, R., et al. (2025). Choroidal vascular alterations in diabetic retinopathy and their association with macular microvascular parameters. Sci. Reports 15, 29381. doi:10.1038/s41598-025-13414-4
Hayashi, K., Ohno-Matsui, K., Shimada, N., Moriyama, M., Kojima, A., Hayashi, W., et al. (2010). Long-term pattern of progression of myopic maculopathy: a natural history study. Ophthalmology 117, 1595–1611. doi:10.1016/j.ophtha.2009.11.003
He, H. L., Liu, Y. X., Liu, H., Zhang, X., Song, H., Xu, T. Z., et al. (2024). Deep learning-enabled vasculometry depicts phased lesion patterns in high myopia progression. Asia-Pacific Journal Ophthalmology Phila. Pa. 13, 100086. doi:10.1016/j.apjo.2024.100086
Holden, B. A., Fricke, T. R., Wilson, D. A., Jong, M., Naidoo, K. S., Sankaridurg, P., et al. (2016). Global prevalence of myopia and high myopia and temporal trends from 2000 through 2050. Ophthalmology 123, 1036–1042. doi:10.1016/j.ophtha.2016.01.006
Hu, M., Jiang, Y., Luo, Z., Lan, W., and Dai, W. (2025). Integrating biometric and multimodal imaging data for early prediction of myopia onset. Sci. Reports 15, 31416. doi:10.1038/s41598-025-15605-5
Hu, X., Wang, Z., Sun, B., Wan, X., Chang, F., Lin, W., et al. (2025). Orthokeratology for myopia control: a three-year longitudinal study on axial length modulation influenced by baseline age and spherical equivalent. BMC Ophthalmology 25, 466. doi:10.1186/s12886-025-04303-5
Huang, D., Gong, L., Wei, C., Wang, X., and Liang, Z. (2024). An explainable machine learning-based model to predict intensive care unit admission among patients with community-acquired pneumonia and connective tissue disease. Respir. Research 25, 246. doi:10.1186/s12931-024-02874-3
Lanca, C., Foo, L. L., Ang, M., Tan, C. S., Kathrani, B., Htoon, H. M., et al. (2021). Rapid myopic progression in childhood is associated with teenage high myopia. Investigative Ophthalmology and Visual Science 62, 17. doi:10.1167/iovs.62.4.17
Lee, S. S., Lingham, G., Sanfilippo, P. G., Hammond, C. J., Saw, S. M., Guggenheim, J. A., et al. (2022). Incidence and progression of myopia in early adulthood. JAMA Ophthalmology 140, 162–169. doi:10.1001/jamaophthalmol.2021.5067
Li, X., Hu, J., Peng, Z., Chen, S., Sun, L., Wang, K., et al. (2023). Association between choriocapillaris perfusion and axial elongation in children using defocus incorporated multiple segments (DIMS) spectacle lenses. Eye Lond. Engl. 37, 3847–3853. doi:10.1038/s41433-023-02629-2
Liu, S., He, X., Sankaridurg, P., Wang, J., Du, L., Zhang, B., et al. (2025). Axial elongation as a marker to identify obvious myopic shift in non-myopic eyes of Chinese children. Ophthalmic and Physiological Optics 45, 1435–1446. doi:10.1111/opo.13530
Mingming, C., Zhou, Y., Shaoli, L., Junting, L., Fangfang, C., Xinnan, Z., et al. (2025). Prevalence and associated factors of myopia and axial length-related visual impairment in children and adolescents aged 4-18 years. Sci. Reports 15, 35277. doi:10.1038/s41598-025-18400-4
Oh, R., Lee, E. K., Bae, K., Park, U. C., Yu, H. G., and Yoon, C. K. (2023). Deep learning-based prediction of axial length using ultra-widefield fundus photography. Korean Journal Ophthalmology KJO 37, 95–104. doi:10.3341/kjo.2022.0059
Pan, W., Saw, S. M., Wong, T. Y., Morgan, I., Yang, Z., and Lan, W. (2025). Prevalence and temporal trends in myopia and high myopia children in China: a systematic review and meta-analysis with projections from 2020 to 2050. Lancet Regional Health. West. Pac. 55, 101484. doi:10.1016/j.lanwpc.2025.101484
Prasad, B. (2025). Mathematical probe of nanoparticle aggregation in capillary-tissue system applying fractal model. Crit. Reviews Biomedical Engineering 53, 1–22. doi:10.1615/CritRevBiomedEng.2024055532
Riley, R. D., Snell, K. I., Ensor, J., Burke, D. L., Harrell, F. E., Jr., Moons, K. G., et al. (2019). Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes. Statistics Medicine 38, 1276–1296. doi:10.1002/sim.7992
Sameshima, S., Yamashita, T., Terasaki, H., Asaoka, R., Yoshihara, N., Kakiuchi, N., et al. (2024). Longitudinal changes of funduscopic optic disc size, color and cup-to-disc ratio in school children. Int. Journal Retina Vitreous 10, 51. doi:10.1186/s40942-024-00570-4
Sekiryu, T., Sugano, Y., Itagaki, K., Honjo, J., Kato, Y., and Mukai, R. (2025). Three-dimensional characteristics of choroidal vasculature associated with filling delay in central serous chorioretinopathy. Investigative Ophthalmology and Visual Science 66, 33. doi:10.1167/iovs.66.4.33
Song, T., Bu, M., Song, Q., Zhan, B., Liu, W., Hu, M., et al. (2026). Integrated bioinformatics and machine learning analysis identify CROT as a regulator of immunological features in idiopathic pulmonary fibrosis. Biochimica biophysica acta. Mol. basis Dis. 1872, 168100. doi:10.1016/j.bbadis.2025.168100
Sui, J., Li, H., Bai, Y., He, Q., Sun, Z., and Wei, R. (2024). Morphological characteristics of the foveal avascular zone in pathological myopia and its relationship with macular structure and microcirculation. Graefe's Archive Clinical Experimental Ophthalmology 262, 2121–2133. doi:10.1007/s00417-024-06403-2
Tao, Z., Wang, J., Lv, Z., Hu, G., Xu, Z., Chen, L., et al. (2025). Tracking myopia development through axial length progression: a retrospective longitudinal study. Ann. Medicine 57, 2563752. doi:10.1080/07853890.2025.2563752
Van Calster, B., McLernon, D. J., van Smeden, M., Wynants, L., and Steyerberg, E. W.Topic Group ‘Evaluating diagnostic tests and prediction models’ of the STRATOS initiative (2019). Calibration: the achilles heel of predictive analytics. BMC Medicine 17, 230. doi:10.1186/s12916-019-1466-7
Veselinović, M., Trenkić, M., Čanadanović, V., Jovanović, P., Veselinović, A., Petrović, M., et al. (2025). The significance of OCTA in studying vessel density and retinal thickness in individuals with myopia. Med. Kaunas. Lith. 61, 532. doi:10.3390/medicina61030532
Wang, X., Tang, L., Gao, L., Yang, Y., Cao, D., and Li, Y. (2016). Myopia and diabetic retinopathy: a systematic review and meta-analysis. Diabetes Research Clinical Practice 111, 1–9. doi:10.1016/j.diabres.2015.10.020
Wang, Y., Wei, R., Yang, D., Song, K., Shen, Y., Niu, L., et al. (2024). Development and validation of a deep learning model to predict axial length from ultra-wide field images. Eye Lond. Engl. 38, 1296–1300. doi:10.1038/s41433-023-02885-2
Wang, Z., Zou, H., Guo, Y., Sun, M., Zhao, X., Wang, Y., et al. (2025). Myopic-net: deep learning-based direct identification of myopia onset and progression. Transl. Vision Science and Technology 14, 38. doi:10.1167/tvst.14.8.38
Wang, Q., He, L. B., Li, S., Liu, H. L., and Liu, Y. (2025). Evaluating choroidal vascular changes in young adults with high myopia utilizing swept source optical coherence tomography angiography. Photodiagnosis Photodynamic Therapy 51, 104475. doi:10.1016/j.pdpdt.2025.104475
Xu, S., Ruan, Z., Wang, Y., Jiang, J., Zhao, F., Tang, X., et al. (2025). Establishment of myopia occurrence prediction model in children without myopia using cycloplegic refraction and prior axial length change. Ophthalmology 132, 1260–1272. doi:10.1016/j.ophtha.2025.06.010
Yang, C. N., Chen, W. L., Yeh, H. H., Chu, H. S., Wu, J. H., and Hsieh, Y. T. (2024). Convolutional neural network-based prediction of axial length using color fundus photography. Transl. Vision Science and Technology 13, 23. doi:10.1167/tvst.13.5.23
Yang, Z., Wu, Y., Gao, J., Hu, S., Wang, J., and Jing, R. (2025). Comparison of the peripapillary structure-vessel density relationship before and after axial length magnification correction using different methods. Clin. and Experimental Ophthalmology 53, 908–917. doi:10.1111/ceo.14585
Yii, F., MacCormick, I. J. C., Strang, N., Bernabeu, M. O., and MacGillivray, T. (2025). Fundus refraction offset as an individualized myopia biomarker. JAMA Ophthalmology 143, 597–606. doi:10.1001/jamaophthalmol.2025.1513
Zhang, W., Hou, X., Li, C., Wang, S., Liu, N., Zhang, Y., et al. (2023). Influencing factors associated with high myopia in Chinese college students. Front. Medicine 10, 1146291. doi:10.3389/fmed.2023.1146291
Zhang, S., Webber, A. L., and Dai, S. (2025). Myopia progression management: current trends and future directions. Clin. and Experimental Ophthalmology, ceo.70024. doi:10.1111/ceo.70024
Zhang, X., Wang, Z., Yu, J., Wang, J., Song, D., Zhang, B., et al. (2025). Predicting myopia risk using a machine learning model based on fundus imageomics. Sci. Rep. 15 (1), 43729. doi:10.1038/s41598-025-27848-3
Zhao, M., Lam, A. K., Ying, M. T., and Cheong, A. M. (2022). Hemodynamic and morphological changes of the central retinal artery in myopic eyes. Sci. Reports 12, 7104. doi:10.1038/s41598-022-11087-x
Zhao, X., Wang, Y., Li, J., Liu, W., Yang, Y., Qiao, Y., et al. (2025). A machine-learning-derived online prediction model for depression risk in COPD patients: a retrospective cohort study from CHARLS. J. Affective Disorders 377, 284–293. doi:10.1016/j.jad.2025.02.063
Zhu, Z., Hu, W., Chen, R., Xiong, R., Wang, W., Shang, X., et al. (2022). Retinal age gap as a predictive biomarker of stroke risk. BMC Medicine 20, 466. doi:10.1186/s12916-022-02620-w
Zhu, Q., Xing, X., Ma, L., Yuan, Y., and Song, E. (2025). Three-dimensional choroidal parameter differences in myopic anisometropia. BMC Ophthalmology 25, 583. doi:10.1186/s12886-025-04425-w
Keywords: axial length progression, color fundus photography, machine learning, pediatric myopia, retinal microvasculature, shap
Citation: Wang Z, Han F, Zhang X, Ding J, Yu J, Xie X, Li Z, Du B and Wei R (2026) Quantitative color fundus photography parameters as potential biomarkers of axial length progression: evidence from a machine learning cohort study. Front. Cell Dev. Biol. 14:1753213. doi: 10.3389/fcell.2026.1753213
Received: 24 November 2025; Accepted: 12 January 2026;
Published: 26 January 2026.
Edited by:
Guang-Yu Li, Jilin University, ChinaCopyright © 2026 Wang, Han, Zhang, Ding, Yu, Xie, Li, Du and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhiqing Li, ZHJ6aGlxaW5nbGlAMTYzLmNvbQ==; Bei Du, ZHViZWkxOTgyQDEyNi5jb20=; Ruihua Wei, cndlaUB0bXUuZWR1LmNu
†These authors have contributed equally to this work and share first authors
Feifei Han1†