Assessment of synthetic post-therapeutic OCT images using the generative adversarial network in patients with macular edema secondary to retinal vein occlusion

Feng, Shi; Yang, Jingyuan; Zhao, Xinyu; Zhao, Jianchun; Du, Yunfeng; Yu, Weihong; Ding, Dayong; Li, Xirong; Chen, Youxin

doi:10.3389/fcell.2025.1609567

ORIGINAL RESEARCH article

Front. Cell Dev. Biol., 04 June 2025

Sec. Molecular and Cellular Pathology

Volume 13 - 2025 | https://doi.org/10.3389/fcell.2025.1609567

This article is part of the Research TopicArtificial Intelligence Applications in Chronic Ocular Diseases, Volume IIView all 58 articles

Assessment of synthetic post-therapeutic OCT images using the generative adversarial network in patients with macular edema secondary to retinal vein occlusion

Shi Feng¹^†

Jingyuan Yang¹^†

Xinyu Zhao¹

Jianchun Zhao²

Yunfeng Du²

Weihong Yu¹

Dayong Ding²

Xirong Li³

Youxin Chen¹*

¹Ophthalmology Department, Peking Union Medical College Hospital, Beijing, China
²Vistel AI Lab, Visionary Intelligence Ltd, Beijing, China
³Key Lab of DEKE, Renmin University of China, Beijing, China

Aims: The aim of this study is to generate post-therapeutic optical coherence tomography (OCT) images based on pre-therapeutic OCT by using generative adversarial networks (GANs). The synthetic images enable us to predict the short-term therapeutic efficacy of intravitreal injection of anti-vascular endothelial growth factor (VEGF) in retinal vein occlusion (RVO) patients.

Methods: The study involved patients with RVO who received intravitreal anti-VEGF injection from 1 November 2018 to 30 November 2019. The OCT images taken before and shortly after treatment, with an interval of 4–8 weeks, were collected and randomly divided into the training set and test set at a ratio of approximately 3:1. The model is constructed based on the pix2pixHD algorithm, and synthetic OCT images are evaluated in terms of the picture quality, authenticity, the central retinal thickness (CRT), the maximal retinal thickness, the area of intraretinal cystoid fluid (IRC), and the area of subretinal fluid (SRF). Three supporting models, namely, the macular detection model, retinal stratification model, and lesion detection model, were constructed. Segmentation of macular location, retinal structure, and typical lesions were added to the input information. After verifying their accuracy, supporting models were used to detect the CRT, the maximal retinal thickness, IRC area, and SRF area of synthetic OCT images. The output predictive values are compared with real data according to the annotation on the real post-therapeutic OCT images.

Results: A total of 1,140 pairs of pre- and post-therapeutic OCT images obtained from 95 RVO eyes were included in the study, and 374 images were annotated. Of the synthetic images, 88% were considered to be qualified. The accuracy of discrimination of real versus synthetic OCT images was 0.56 and 0.44 for two retinal specialists, respectively. The accuracy to predict the treatment efficacy of CRT, the maximal retinal thickness, IRC area, and SRF area was 0.70, 0.70, 0.92, and 0.78, respectively.

Conclusion: Our study proves that the GAN is a reliable tool to predict the therapeutic efficacy of anti-VEGF injections in RVO patients. Evaluations conducted both qualitatively and quantitatively indicated that our model can generate high-quality post-therapeutic OCT images. Consequently, it has great potential in predicting the treatment efficacy and providing guidance to clinical decision-making.

1 Introduction

Retinal vein occlusion (RVO) is a significant cause of vision loss in elderly individuals worldwide and is the second-most common cause of vision loss due to retinal vascular disorders (Rogers et al., 2010; Romano et al., 2023). Central retinal vein occlusion (CRVO) is caused by blockage of the central retinal vein, usually due to thrombus formation; whereas branch retinal vein occlusion (BRVO) is caused by blockage of one of the branches of the central retinal vein, often occurring at arteriovenous crossings. Macular edema (ME) is a common complication of RVO, which severely affects central vision. ME is manifested as central retinal thickness (CRT) and the presence of intraretinal cystoid fluid (IRC) and subretinal fluid (SRF). The primary goal in the treatment of patients with RVO is to maintain central vision, with a particular focus on minimizing or preventing the formation of macular edema (Schmidt-Erfurth et al., 2019; Hogg et al., 2021). Current evidence suggests that intravitreal injection of anti-vascular endothelial growth factor (VEGF) is most efficient against ME-related visual impairment (Zhang, Liu, and Sang, 2022). Optical coherence tomography (OCT) images provide high-resolution image modality for quantifying retinal thickening and fluid accumulation and monitoring the treatment efficacy (Rayess et al., 2019). However, the response to anti-VEGF treatment shows significant heterogeneity in clinical practice (Tao et al., 2024; Wecker et al., 2017; Korobelnik et al., 2021), making it difficult to predict anatomic changes after anti-VEGF treatment.

Generative adversarial networks (GANs) involve a zero-sum competition between a generative model, which generates images, and a discriminative model, which evaluates whether the image came from real training data rather than being generated (Waisberg et al., 2025). It has been harnessed for image-to-image translation in ophthalmology to predict the responses to treatment by generating individualized post-therapeutic OCT images after anti-VEGF treatment for fundus diseases such as age-related macular degeneration (AMD) (Liu et al., 2020), RVO (Xu et al., 2022), and diabetic macular edema (DME) (Liu et al., 2023; Baek et al., 2024). Yet previous studies did not develop quantitative methods to assess the quality of synthetic images in an anatomical perspective. Herein, we established a series of supporting deep learning models to further evaluate the predictive performance of the GAN model so as to certify that our model can generate high-quality post-therapeutic OCT images.

2 Materials and methods

2.1 Study design and participants

Patients with RVO complicated by ME who underwent intravitreal anti-VEGF drug therapy at the Department of Ophthalmology, Peking Union Medical College Hospital, from November 2018 to November 2019, were retrospectively included in the study. The inclusion criteria were as follows: (1) patients diagnosed with ME secondary to RVO. The diagnosis was independently confirmed by at least two retinal specialists. (2) Patients who were administered intravitreal injections of anti-VEGF drugs. (3) Pre-therapeutic and post-therapeutic retinal OCT images were obtained within 4–8 weeks. The exclusion criteria were as follows: (1) a history of previous intraocular operation, laser photocoagulation, or intraocular injections of medications other than anti-VEGF agents. (2) A history of other ocular disorders, including glaucoma, pathological myopia, age-related macular degeneration, and other disorders involving systemic diseases.

This study was approved by the Clinical Research Ethical Committee of Peking Union Medical College Hospital, Chinese Academy of Medical Sciences (Project Number: S-K631). The research implementation adhered to the principles of the Declaration of Helsinki.

2.2 Dataset creation

2.2.1 Pairs of OCT images

Pre-therapeutic and post-therapeutic swept-source OCT (SS-OCT) images were captured in a 16-line 9-mm radial macula pattern by a Topcon Deep Range Imaging (DRI) OCT Triton device (Topcon, Tokyo, Japan) with a resolution of either 1,024 × 875 pixels or 1,024 × 992 pixels. The images before and after the treatment were scanned at the same location using the follow-up mode and were matched by two retinal specialists. Images with motion artifacts or insufficient quality for clinical assessment were excluded. The OCT image pairs were randomly split into the training set and test set at a ratio of approximately 3:1. Double checks were made to ensure that images from the same patient were not distributed to the training set and test set simultaneously. All OCT images included in the study were anonymized to protect patient privacy.

2.2.2 Annotation of OCT images

The OCT images were subjected to a secondary screening to identify those with the following characteristics for further image annotation: (1) OCT images with typical lesions of IRC and SRF. (2) OCT images with the maximum CRT measurement from the 16-line pre-therapeutic scans, along with its corresponding post-therapeutic image. Taking sample balance and training efficiency into consideration, no more than four pairs of OCT images were selected for each eye.

The following annotations were obtained to provide additional segmentation information: (1) normal retinal structures: upper limit of the inner limiting membrane (ILM), upper limit of the retinal pigment epithelium (RPE), and lower limit of the choroid. (2) Retinal thickness-related structures: central foveal position. (3) Major lesions: IRF and SRF.

The central foveal position was marked using a circular point, while the normal retinal structures, IRF and SRF, were annotated at the pixel level using multi-point polygon annotation. High-quality segmentation was accomplished by two retinal specialists, and the findings were cross-verified. Disagreements between specialists were resolved through consultation with a senior retinal specialist. Annotated OCT images were separated randomly into the training set and test sets at a ratio of 3:1. Three supporting models were developed based on annotated OCT images. No images of the same patient were simultaneously assigned to the two data sets.

2.3 Deep learning framework

All experiments were conducted with PyTorch deep learning framework (version 1.1.0) and Python (version 3.5) using the Linux operating system, Intel® Xeon® CPU E5-2680 v3 @2.50 GHz, and GeForce RTX 2080 Ti.

2.3.1 Image synthesis model training

The image synthesis model was constructed based on pix2pixHD, a variant of conditional GAN, which consists of a coarse-to-fine generator and multiscale discriminators to generate high-resolution images. The network framework of the pix2pixHD model is shown in Figure 1. Using pairs of real pre-therapeutic and post-therapeutic images as input images, the image synthesis model was trained to generate a synthetic post-therapeutic OCT image as output information. The discriminative model was employed to distinguish the real post-therapeutic OCT images from the synthetic fake post-therapeutic OCT images. After the termination of the training, the generator network is capable of translating any given pre-therapeutic OCT image into the synthetic post-therapeutic image.

Figure 1

Figure 1. A conceptual illustration of the pix2pixHD-based solution used in this study for generating post-therapeutic OCT images from pre-therapeutic OCT images.

2.3.2 Supporting model training and evaluation

Four anatomical parameters, namely, the central retinal thickness (CRT), the maximal retinal thickness, the area of IRC, and the area of SRF, were measured in order to comprehensively evaluate the synthetic images. Three supporting frameworks, the macular detection model, retinal stratification model, and lesion detection model, were constructed correspondingly.

2.3.2.1 Macular detection model training and evaluation

The high-resolution networks (HRNets) algorithm yielded satisfactory performance in preservation of accurate position and was, therefore, adopted to detect the position of macula. The model took OCT image pairs and abscissa of macular annotations from the training set as input information and output the accurate horizontal ordinate of the macular position.

The difference between the gold-standard label and the predicted values was calculated as the data sample, and a 95% confidence interval was computed. The accuracy of the model is calculated by the percentage of correctly identified macular positions.

2.3.2.2 Retinal stratification model training and evaluation

The U-Net algorithm gained wide application in the field of image segmentation and achieved better results even with relatively sparse annotation data due to the combination of the encoder and decoder. The retinal stratification model was constructed using U-Net and was trained based on OCT image pairs and segmentation information of ILM, RPE, and choroid annotated by retinal specialists. It ultimately outputs three categories, which were named class 0 (background), class 1 (from the upper limit of ILM to the upper limit of RPE), and class 2 (from the upper limit of RPE to the lower limit of choroid). The retinal stratification model has laid the foundation for precise measurement of CRT and maximal retinal thickness.

The performance of the test set in the trained retinal stratification model was compared with the results of two retinal specialists in terms of recall, precision, intersection over union (IOU), and Dice coefficient.

2.3.2.3 Lesion detection model training and evaluation

The U-Net algorithm was applied to develop the lesion detection model in order to achieve accurate measurement of IRC and SRF areas. OCT images from the training set and segmentation of IRC and SRF were inputted. Three categories, class 0 (background), class 1 (SRF), and class 2 (IRC), were outputted during the training process.

The constructed model recognized the OCT images of the test set and categorized each pixel point in the image into three classes. The lesions annotated by ophthalmologists were considered the ground truth labels. Subsequently, recall, precision, IOU, and Dice coefficient were calculated.

2.4 Evaluation of synthetic images

2.4.1 Quality of synthetic images

Synthetic images were assessed by two retinal specialists independently to determine whether they are qualified for clinical interpretation, such as retinal structure defects and repeated retinal structure. Notably, synthetic images with insufficient quality were excluded for further evaluation.

2.4.2 Authenticity of synthetic images

OCT images that adhered to the basic image regulation underwent evaluation of authenticity. All images were processed to remove irrelevant information, including contrast differences and pixel variations. The real post-therapeutic images and corresponding synthetic post-therapeutic images were simultaneously displayed to two retinal specialists in a random sequence without any mark, while the pre-therapeutic images were labeled for reference. The two retinal specialists were required to identify the synthetic images. The proportion of correct judgments made by the two doctors was then calculated separately.

2.4.3 Structural evaluations of synthetic images

The verified macular detection model, retinal stratification model, and lesion detection model were applied to recognize post-therapeutic OCT images generated from the GAN model, and predicted values of CRT, maximum retinal thickness, SRF area, and IRC area were obtained. The retinal thickness and area annotated by retinal specialists on real post-treatment images were regarded as the gold standard.

Quantitative evaluation of synthetic images was conducted by comparing the predicted values with the gold standard. If the predicted value of the synthetic post-therapeutic OCT image exceeded the gold standard value of the pre-therapeutic OCT image by 10%, it was recognized as an “increase.” If the gold standard value represented a 10% increase over the predicted value, it was regarded as a “decrease.” If the predictive value and gold standard measurements differed by ≤10% (absolute difference), two retinal specialists independently evaluated the OCT images and output data to determine the treatment trend. The absence of SRF and IRC both before and after the treatment was defined as “no change” (both 0). The gold standard value of the real post-therapeutic OCT image was also compared with that of the pre-therapeutic OCT image. The accuracy to predict the treatment efficacy was referred to as the proportion of synthetic post-therapeutic images that showed the same trend as the real post-therapeutic images, compared with the pre-therapeutic images.

2.5 Evaluation metrics and statistical analysis

Open-source Python library scipy.stats (version 0.14.0, The Scipy community) was applied for statistical analysis. For categorical variables, classification accuracy and the 95% confidence interval (CI) were calculated. The Shapiro test was used to determine whether the differences between the gold standard and the predicted values followed a normal distribution. A paired t-test was employed for significance analysis if the variables were identified as normally distributed. Otherwise, a Wilcoxon test was used for significance analysis. A p-value less than 0.05 was considered of significant differences.

3 Result

3.1 Dataset of synthetic images

According to the inclusion and exclusion criteria, the study finally included 726 pairs of OCT images for BRVO eyes and 307 pairs for CRVO eyes. A total of 570 pairs of OCT images from 49 cases of BRVO eyes and 226 pairs of OCT images from 21 cases of CRVO eyes were included in the training set. Additionally, 157 pairs of OCT images from 15 cases of BRVO eyes and 80 pairs of OCT images from eight cases of CRVO eyes were assigned into the test set.

3.2 Quality of synthetic images

Among 237 generated post-therapeutic OCT images, 29 images from 10 cases failed to meet the basic image regulation and were excluded from further evaluation. Therefore, 208 out of 237 images (87.86%) were identified as qualified for clinical interpretation. All 29 unqualified images were related to abrupt structural rupture, with 16 images showing retinal neuroepithelium discontinuity, 8 images showing retinal pigment epithelium discontinuity, and 5 images showing entire retinal discontinuity. Three cases of incompetent synthetic images are shown in Figure 2.

Figure 2

Figure 2. (A–C) illustrated three cases with image quality issues. A1–C1 represent pre-therapeutic OCT images, A2–C2 represent synthetic post-therapeutic OCT images, and A3–C3 represent the real post-therapeutic OCT images. RPE discontinuity is shown in A2 (green line), retinal neuroepithelium discontinuity is shown in B2 (yellow arrow), and the entire retinal discontinuity is shown in C2 (red arrow).

3.3 Authenticity of synthetic images

A total of 208 synthetic images and real images were shown to the retinal specialists to validate the authenticity. The rate to discriminate between synthetic and real images successfully was 0.56 (95% CI 0.47–0.65) for specialist 1 and 0.44 (95% CI 0.34–0.54) for specialist 2. It is challenging to distinguish between synthetic and real images, indicating that the GAN model has achieved eligible results. Six cases of pre-therapeutic, real post-therapeutic, and synthetic post-therapeutic OCT images are displayed in Figure 3.

Figure 3

Figure 3. (A–C) Three cases of pre-therapeutic, real post-therapeutic, and synthetic post-therapeutic OCT images. A1–C1 represent pre-therapeutic OCT images, A2–C2 represent synthetic post-therapeutic OCT images, and A3–C3 represent real post-therapeutic OCT images.

3.4 Verification of supporting models

3.4.1 Dataset pf supporting models

After a second screening, 280 OCT images from 64 eyes with BRVO and 94 OCT images from 29 eyes with CRVO were included in the annotation dataset. Less than four pairs of OCT images were selected for each eye. Two retinal disease specialists were assigned to annotate the OCT images, and there were no disagreements upon cross-validation. A total of 226 OCT images from 49 eyes with BRVO and 74 OCT images from 21 eyes with CRVO were involved in the training set. The test set consisted of 54 OCT images from 15 eyes with BRVO and 20 OCT images from 8 eyes with CRVO.

3.4.2 Verification of the macular detection model

The difference of the macula position between the synthetic image and the real image was calculated. The mean difference was 0.0204 ± 0.0604 for BRVO eyes, 0.0060 ± 0.0054 for CRVO eyes, and 0.0165 ± 0.0521 for RVO eyes. The accuracies of the macular detection model were 75.93% for BRVO eyes, 80.00% for CRVO eyes, and 77.03% for all RVO eyes. Overall, the model demonstrated relatively accurate detection of the macular center position. Figure 4 illustrates four cases of the detection.

Figure 4

Figure 4. (A–D) Four cases of macular detection. A1–D1 represent the macular position annotated by retinal specialists (red circle). A2–D2 represent the macular position detected by the model (green circle). A3–D3 show the overlap between the annotated and detected positions.

3.4.3 Verification of the retinal stratification model

Table 1 reports the performance of the retinal stratification model, and Figure 5 illustrates two examples of the stratification. Among them, class 1 represented the thickness of the retina, which was of great concern in our study. The recall, accuracy, IOU, and Dice coefficient of class 1 were 0.99, 0.99, 0.98, and 0.99, respectively, for BRVO, CRVO, and RVO eyes. Therefore, the retinal stratification model demonstrated accurate identification for retinal thickness and was eligible.

Table 1

Table 1. Performance of the retinal stratification model.

Figure 5

Figure 5. (A–B) Two cases of retinal stratification. A1–B1 represent the original OCT images. A2–B2 represent the retinal stratification annotated by retinal specialists. A3–B3 represent retinal stratification detected by the model. A4–B4 show the overlap between the annotated and detected stratification. Black indicates class 0, green indicates class 1, and red indicates class 2.

3.4.4 Verification of the lesion detection model

In the lesion detection model, class 1 (SRF) and 2 (IRC) represented the quantitative areas of lesions, which was highly indicative of treatment efficacy. As for eyes with BRVO, the recall, accuracy, IOU, and Dice coefficient were 0.87, 0.84, 0.76, and 0.85 for class 1 and 0.90, 0.66, 0.61, and 0.76 for class 2, respectively. When it came to eyes with CRVO, the recall, accuracy, IOU, and Dice coefficient were 0.68, 0.87, 0.62, and 0.76 for class 1 and 0.63, 0.94, 0.61, and 0.75 for class 2, respectively. Overall, the lesion detection model exhibited competent recognition capabilities for IRC and SRF lesions, meeting the requirements of the study. Table 2 presents the results of the lesion detection model, and Figure 6 displays two examples of the detection.

Table 2

Table 2. Performance of the lesion detection model.

Figure 6

Figure 6. (A–B) Two cases of lesion detection. A1–B1 represent the original OCT images. A2–B2 represent the lesions annotated by retinal specialists. A3–B3 represent the lesion detected by the model. A4–B4 show the overlap between the annotated and detected lesions. Black indicates class 0, green indicates class 1, and red indicates class 2.

3.5 Structural evaluations of synthetic images

3.5.1 Dataset of structural evaluation

The annotations made by two retinal specialists on real post-therapeutic OCT images were considered the gold standard. Therefore, only images with annotations could be used for structural evaluations. Since the evaluation required the application of three supporting models, the images from the training sets of the aforementioned models were not included in the test set to ensure the accuracy of evaluations. Therefore, the division of the structural evaluations dataset should be consistent with the above models. A total of 37 post-therapeutic OCT images generated by the GAN model were input into the macular detection model, retinal stratification model, and lesion detection model. The following parameters were outputted: (1) CRT: the macular detection model located the macular position, and the retinal stratification model identified the corresponding retinal thickness. (2) Maximal retinal thickness: the retinal stratification model recognized retinal thickness, and maximal thickness within a single OCT image was reported. (3) SRF area: the lesion detection model identified SRF lesions and outputted the specific area of the SRF. (4) IRC area: the lesion detection model identified IRC lesions and outputted the specific area of the IRC.

3.5.2 Evaluation of CRT

Evaluation of the predictive performance on the trend of CRT is shown in Table 3. The accuracy in predicting the trend of CRT changes in post-therapeutic OCT images was 0.70. There were no significant differences between the predicted values and the gold standard CRT for BRVO, CRVO, and RVO eyes, as shown in Table 9. The distribution of CRT is shown in Supplementary Figure S2. Bland–Altman analysis demonstrated that 89.2% predicted values were distributed within the 95% limits of agreement (LoA). This indicated that our model could predict changes in CRT after treatment with anti-VEGF agents with precision.

Table 3

Table 3. Evaluation of the predictive performance on the trend of CRT.

3.5.3 Evaluation of maximal retinal thickness

Evaluation of the predictive performance on the trend of maximal retinal thickness is shown in Table 4. The accuracy in predicting the trend of maximal retinal thickness changes in post-therapeutic OCT images was 0.70. There were no significant differences between the predicted values and the gold standard maximal retinal thickness for BRVO, CRVO, and RVO eyes, as shown in Table 9. The distribution of maximal retinal thickness is shown in Supplementary Figure S2. Bland–Altman analysis demonstrated that 94.6% predicted values were distributed within the 95% LoA. The predictive performance of our model on changes in maximal retinal thickness was qualified.

Table 4

Table 4. Evaluation of predictive performance on the trend of maximal retinal thickness.

3.5.4 Evaluation of the SRF area

The accuracy in predicting the presence of SRF in post-therapeutic OCT images was 0.86, as shown in Table 5. The specificity was 0.97 (95% CI 0.82–1), while the sensitivity was 0 (95% CI 0–0.60). Out of the 37 synthetic OCT images, only one revealed SRF. This suggested that our model had limited capability in predicting SRF after treatment, highly likely due to the limited dataset of SRF. Evaluation of the predictive performance on the trend of SRFarea is shown in Table 6. The accuracy in predicting the trend in SRF area changes in post-therapeutic OCT images was 0.92. It yielded satisfactory achievements in predicting a decrease or no change in SRF but made errors in predicting an increase in three cases. There were no significant differences between the predicted values and the gold standard for the SRF area in BRVO, CRVO, and RVO eyes, as shown in Table 9. The distribution of the SRF area is shown in Supplementary Figure S2. Bland–Altman analysis demonstrated that 97.3% predicted values were distributed within the 95% LoA.

Table 5

Table 5. Evaluation of predictive performance on the presence of SRF.

Table 6

Table 6. Evaluation of the predictive performance on the trend of the SRF area.

3.5.5 Evaluation of the IRC area

The accuracy in predicting the presence of IRC in post-therapeutic OCT images was 0.65, as shown in Table 7. The specificity was 0.86 (95% CI 0.64–0.96), but the sensitivity was 0.33 (95% CI 0.13–0.61), which was slightly superior compared with that for the SRF area. Evaluation of the predictive performance on the trend of IRCarea is shown in Table 8. The accuracy in predicting the trend of IRC area changes in post-therapeutic OCT images was 0.78. Among 37 cases; our model mistook an increase for a decrease or no change in six cases. The GAN model tended to underestimate the IRC area in post-therapeutic images. There were no significant differences between the predicted values and the gold standard for the IRC area in CRVO eyes, as depicted in Table 9. However, it failed to predict the IRC area accurately in BRVO and RVO eyes. The distribution of the IRC area is shown in Supplementary Figure S2. Bland–Altman analysis demonstrated that 94.6% predicted values were distributed within the 95% LoA.

Table 7

Table 7. Evaluation of predictive performance on the presence of IRC.

Table 8

Table 8. Evaluation of the predictive performance on the trend of IRC area.

Table 9

Table 9. Evaluation of the predictive performance on CRT, maximal retinal thickness, SRF area, and IRC area.

4 Discussion

Considering the high cost of anti-VEGF medications and the burden of frequent follow-ups, individualized prediction of treatment efficacy is of great significance in clinical practice. Predicting the efficacy of anti-VEGF treatment in RVO patients facilitates the development of personalized treatment plans, benefiting patients, society, and healthcare providers.

However, how to evaluate the accuracy of synthetic images remains controversial. Previous research on the evaluation of AI-generated images mostly involved qualitative assessments, and retinal specialists were requested to distinguish between real and synthetic images (Zheng et al., 2020). Xu et al. developed a GAN-based prediction model to generate short-term post-therapeutic OCT images of RVO patients. In addition to authenticity evaluation, they conducted a structural evaluation experiment by measuring the CRT of the synthetic images and real images. There was no statistical difference in CMT between the synthetic and the real images (Xu et al., 2022). We made exploratory attempts to make elaborate lesion labels and construct three deep learning models in order to quantitatively assess the quality of the synthetic images on a structural perspective. The macular detection model and retinal stratification model showed remarkable achievements, while the lesion detection model exhibited slightly lower recall and accuracy on recognizing SRF and IRC. Nevertheless, our model still surpassed the performance of the previous research on quantifying the IRC and SRF in RVO patients (Schlegl et al., 2018). Although these supporting models showed promising results, the application of more models inevitably introduced more errors into the structural evaluation of the GAN model. In the structural evaluation, regarding the specific fluid area, there was no significant difference between the predicted values and the gold standard. However, in terms of the trend and the classification of fluid presence or absence, the sensitivity and specificity of several indicators were slightly lower. This might be related to the small size of the test set and the inherent biases in the samples.

This study innovatively utilized the GAN algorithm to predict the efficacy of anti-VEGF treatment in RVO patients. It was found that 88% of the synthetic images were of sufficient quality for further clinical interpretation. Retinal specialists reported difficulty in distinguishing between real and synthetic images. As for CRVO patients, there were no significant differences between the predicted values and the gold standard on CRT, retinal maximum thickness, SRF area, and IRC area. Similarly, for BRVO patients, there were no significant difference in the prediction of CRT, retinal maximum thickness, and SRF area. The accuracy of predicting the trend in CRT, retinal maximum thickness, SRF area, and IRC area after treatment was 0.70, 0.70, 0.92, and 0.78, respectively. With more clinical data and algorithm optimization, the GAN model holds the promise of bringing new insights into the dilemma of treatment selection for RVO patients.

This study has several limitations. First, the small sample that only included Chinese patients and the lack of an external validation dataset restrict the potential for generalization. Second, the functional evaluation of the predictive performance is still absent. Visual acuity is also what ophthalmologists and patients are concerned about. Further research could be conducted to cover more prognostic factors to make the model more applicable in predicting the visual acuity. Third, the research only evaluated short-term treatment outcomes after treatment with a single dose of anti-VEGF drugs. Though researchers realized the long-term prediction for patients with DME, it remains unclear whether long-term efficacy predictions for patients with RVO can be made. Previous studies have found that for CRVO patients, the optimal treatment effect is achieved at 12 months after initiation, while BRVO patients recover to optimal visual acuity after 24 months of treatment. Visual acuity and central retinal thickness then remain stable at the same level until the end of follow-up (Arrigo et al., 2021). Therefore, predicting the optimal treatment effect for patients before the initiation of anti-VEGF treatment would be more meaningful. Additionally, it should be noted that our study did not investigate several prognostic biomarkers associated with visual function, such as disorganization of retinal inner layers (DRIL) (Horozoglu et al., 2023). DRIL is correlated with areas of ischemic damage and loss of flow in the superficial, middle, and deep capillary plexuses, highlighting its potential as a biomarker for ischemic injury in RVO (Zhu et al., 2021; Munk et al., 2024). Previous studies have found the association of DRIL with poorer visual outcomes (Yang et al., 2024) and recurrence (Costa et al., 2021) in RVO-associated ME. Further annotations are required to involve those prognostic biomarkers, constructing a more comprehensive deep learning model.

5 Conclusion

Herein, our results prove that the GAN is a reliable tool to predict the therapeutic efficacy of anti-VEGF injections in RVO patients. Innovative quantitative evaluations were conducted with the assistance of supporting deep learning models, confirming the generation of high-quality post-therapeutic OCT images. Consequently, it has great potential in predicting treatment efficacy and providing guidance to clinical decision-making.

Data availability statement

The datasets presented in this article are not readily available because it contains private data. Requests to access the datasets should be directed toZmVuZ3NoaUBwdW1jaC5jbg==.

Ethics statement

The studies involving humans were approved by Clinical Research Ethical Committee of Peking Union Medical College Hospital, Chinese Academy of Medical Sciences (Project Number: S-K631). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

SF: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Validation, Writing – original draft. JY: Methodology, Conceptualization, Writing – original draft, Data curation. XZ: Methodology, Writing – review and editing, Data curation. JZ: Data curation, Formal Analysis, Writing – review and editing. YD: Data curation, Formal Analysis, Writing – review and editing. WY: Methodology, Supervision, Writing – review and editing. DD: Data curation, Formal Analysis, Writing – review and editing. XL: Data curation, Formal Analysis, Writing – review and editing. YC: Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by: 1. National High Level Hospital Clinical Research Funding, China: 2022-PUMCH-B-101. 2. National Natural Science Foundation of China: 82271112. 3. National Natural Science Foundation of China: 82301239. 4. National Natural Science Foundation of China: 82301241. 5. Peking Union Medical College Hospital Young Reserve Talent Development Program (UHB12268).

Conflict of interest

Authors JZ, YD and DD were employed by company Visionary Intelligence Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcell.2025.1609567/full#supplementary-material

SUPPLEMENTARY FIGURE S1 | Workflow diagram of the synthetic OCT generation and evaluation.

SUPPLEMENTARY FIGURE S2 | Structural evaluations of synthetic images. A1–D1 show the scatterplots of CRT, maximal retinal thickness, SRF area, and IRC area. A2–D2 show the Bland–Altman analysis of CRT (mean and SD of difference were 1.95 and 47.59, respectively), maximal retinal thickness (mean and SD of difference were −0.35 and 29.08, respectively), SRF area (mean and SD of difference were 85.00 and 386.66, respectively), and IRC area (mean and SD of difference were 4085.10 and 9909.70, respectively). The horizontal axis represents the average of the predicted values and gold standard. The vertical axis represents the difference between them. Notably, the predicted values and gold standard were not the absolute value of the thickness and area but the output data of the deep learning model without the unit.

References

Arrigo, A., Crepaldi, A., Vigano, C., Aragona, E., Lattanzio, R., Scalia, G., et al. (2021). Real-Life management of central and branch retinal vein occlusion: a seven-year follow-up study. Thromb. Haemost. 121 (10), 1361–1366. doi:10.1055/s-0041-1725197

PubMed Abstract | CrossRef Full Text | Google Scholar

Baek, J., He, Y., Emamverdi, M., Mahmoudi, A., Nittala, M. G., Corradetti, G., et al. (2024). Prediction of long-term treatment outcomes for diabetic macular edema using a generative adversarial network. Transl. Vis. Sci. Technol. 13 (7), 4. doi:10.1167/tvst.13.7.4

PubMed Abstract | CrossRef Full Text | Google Scholar

Costa, J. V., Moura-Coelho, N., Abreu, A. C., Neves, P., Ornelas, M., and Furtado, M. J. (2021). Macular edema secondary to retinal vein occlusion in a real-life setting: a multicenter, nationwide, 3-year follow-up study. Graefes. Arch. Clin. Exp. Ophthalmol. 259 (2), 343–350. doi:10.1007/s00417-020-04932-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Hogg, H. D. J., Talks, S. J., Pearce, M., and Di Simplicio, S. (2021). Real-world visual and neovascularisation outcomes from anti-VEGF in central retinal vein occlusion. Ophthalmic Epidemiol. 28 (1), 70–76. doi:10.1080/09286586.2020.1792937

PubMed Abstract | CrossRef Full Text | Google Scholar

Horozoglu, F., Sener, H., Polat, O. A., Temizyurek, O., and Evereklioglu, C. (2023). Predictive impact of optical coherence tomography biomarkers in anti-vascular endothelial growth factor resistant macular edema treated with dexamethasone implant. Photodiagnosis Photodyn. Ther. 42, 103167. doi:10.1016/j.pdpdt.2022.103167

PubMed Abstract | CrossRef Full Text | Google Scholar

Korobelnik, J. F., Larsen, M., Eter, N., Bailey, C., Wolf, S., Schmelter, T., et al. (2021). Efficacy and safety of intravitreal aflibercept treat-and-extend for macular edema in central retinal vein occlusion: the CENTERA study. Am. J. Ophthalmol. 227, 106–115. doi:10.1016/j.ajo.2021.01.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, S., Hu, W., Xu, F., Chen, W., Liu, J., Yu, X., et al. (2023). Prediction of OCT images of short-term response to anti-VEGF treatment for diabetic macular edema using different generative adversarial networks. Photodiagnosis Photodyn. Ther. 41, 103272. doi:10.1016/j.pdpdt.2023.103272

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y., Yang, J., Zhou, Y., Wang, W., Zhao, J., Yu, W., et al. (2020). Prediction of OCT images of short-term response to anti-VEGF treatment for neovascular age-related macular degeneration using generative adversarial network. Br. J. Ophthalmol. 104 (12), 1735–1740. doi:10.1136/bjophthalmol-2019-315338

PubMed Abstract | CrossRef Full Text | Google Scholar

Munk, M. R., Ceklic, L., Stillenmunkes, R., Chaudhary, V., Waheed, N., Chhablani, J., et al. (2024). Integrated assessment of OCT, multimodal imaging, and cytokine markers for predicting treatment responses in retinal vein occlusion associated macular edema: a comparative review of anti-VEGF and steroid therapies. Diagn. (Basel) 14 (17), 1983. doi:10.3390/diagnostics14171983

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayess, N., Rahimy, E., Ying, G. S., Pefkianaki, M., Franklin, J., Regillo, C. D., et al. (2019). Baseline choroidal thickness as a short-term predictor of visual acuity improvement following antivascular endothelial growth factor therapy in branch retinal vein occlusion. Br. J. Ophthalmol. 103 (1), 55–59. doi:10.1136/bjophthalmol-2018-311898

PubMed Abstract | CrossRef Full Text | Google Scholar

Rogers, S., McIntosh, R. L., Cheung, N., Lim, L., Wang, J. J., Mitchell, P., et al. (2010). The prevalence of retinal vein occlusion: pooled data from population studies from the United States, Europe, Asia, and Australia. Ophthalmology 117 (2), 313–319 e1. doi:10.1016/j.ophtha.2009.07.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Romano, F., Lamanna, F., Gabrielle, P. H., Teo, K. Y. C., Battaglia Parodi, M., Iacono, P., et al. (2023). Update on retinal vein occlusion. Asia Pac J. Ophthalmol. (Phila) 12 (2), 196–210. doi:10.1097/APO.0000000000000598

PubMed Abstract | CrossRef Full Text | Google Scholar

Schlegl, T., Waldstein, S. M., Bogunovic, H., Endstrasser, F., Sadeghipour, A., Philip, A. M., et al. (2018). Fully automated detection and quantification of macular fluid in OCT using deep learning. Ophthalmology 125 (4), 549–558. doi:10.1016/j.ophtha.2017.10.031

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmidt-Erfurth, U., Garcia-Arumi, J., Gerendas, B. S., Midena, E., Sivaprasad, S., Tadayoni, R., et al. (2019). Guidelines for the management of retinal vein occlusion by the European society of retina specialists (EURETINA). Ophthalmologica 242 (3), 123–162. doi:10.1159/000502041

PubMed Abstract | CrossRef Full Text | Google Scholar

Tao, Y., Ge, L., Su, N., Li, M., Fan, W., Jiang, L., et al. (2024). Exploration on OCT biomarker candidate related to macular edema caused by diabetic retinopathy and retinal vein occlusion in SD-OCT images. Sci. Rep. 14 (1), 14317. doi:10.1038/s41598-024-63144-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Waisberg, E., Ong, J., Kamran, S. A., Masalkhi, M., Paladugu, P., Zaman, N., et al. (2025). Generative artificial intelligence in ophthalmology. Surv. Ophthalmol. 70 (1), 1–11. doi:10.1016/j.survophthal.2024.04.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Wecker, T., Ehlken, C., Buhler, A., Lange, C., Agostini, H., Bohringer, D., et al. (2017). Five-year visual acuity outcomes and injection patterns in patients with pro-re-nata treatments for AMD, DME, RVO and myopic CNV. Br. J. Ophthalmol. 101 (3), 353–359. doi:10.1136/bjophthalmol-2016-308668

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, F., Yu, X., Gao, Y., Ning, X., Huang, Z., Wei, M., et al. (2022). Predicting OCT images of short-term response to anti-VEGF treatment for retinal vein occlusion using generative adversarial network. Front. Bioeng. Biotechnol. 10, 914964. doi:10.3389/fbioe.2022.914964

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, T., Lu, Y., Zeng, F., Yu, R., Zou, C., Hu, R., et al. (2024). Prognosis and factors related to anti-VEGF therapy in patients with retinal vein occlusion and concomitant carotid artery disease. Sci. Rep. 14 (1), 24634. doi:10.1038/s41598-024-75604-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, W., Liu, Y., and Sang, A. (2022). Efficacy and effectiveness of anti-VEGF or steroids monotherapy versus combination treatment for macular edema secondary to retinal vein occlusion: a systematic review and meta-analysis. BMC Ophthalmol. 22 (1), 472. doi:10.1186/s12886-022-02682-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, C., Xie, X., Zhou, K., Chen, B., Chen, J., Ye, H., et al. (2020). Assessment of generative adversarial networks model for synthetic optical coherence tomography images of retinal disorders. Transl. Vis. Sci. Technol. 9 (2), 29. doi:10.1167/tvst.9.2.29

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, Z., Meng, Y., Kozak, I., Xie, M., Liang, Y., Yan, B., et al. (2021). Microvascular structure changes after intravitreal ranibizumab injection in retinal vein occlusion patients with and without macular ischemia. Front. Med. (Lausanne) 8, 737537. doi:10.3389/fmed.2021.737537

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: generative adversarial networks, retinal vein occlusion, anti-vascular endothelial growth factor, optical coherent tomography, therapeutic efficacy prediction

Citation: Feng S, Yang J, Zhao X, Zhao J, Du Y, Yu W, Ding D, Li X and Chen Y (2025) Assessment of synthetic post-therapeutic OCT images using the generative adversarial network in patients with macular edema secondary to retinal vein occlusion. Front. Cell Dev. Biol. 13:1609567. doi: 10.3389/fcell.2025.1609567

Received: 10 April 2025; Accepted: 14 May 2025;
Published: 04 June 2025.

Edited by:

Weihua Yang, Southern Medical University, China

Reviewed by:

Haoyu Chen, The Chinese University of Hong Kong, China
Jinhai Huang, Fudan University, China

Copyright © 2025 Feng, Yang, Zhao, Zhao, Du, Yu, Ding, Li and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Youxin Chen, Y2hlbnl4QHB1bWNoLmNu

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.