A Novel Approach for Predicting Atrial Fibrillation Recurrence After Ablation Using Deep Convolutional Neural Networks by Assessing Left Atrial Curved M-Mode Speckle-Tracking Images

Hwang, Yi-Ting; Lee, Hui-Ling; Lu, Cheng-Hui; Chang, Po-Cheng; Wo, Hung-Ta; Liu, Hao-Tien; Wen, Ming-Shien; Lin, Fen-Chiung; Chou, Chung-Chuan

doi:10.3389/fcvm.2020.605642

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 22 January 2021
Sec. Cardiovascular Imaging
Volume 7 - 2020 | https://doi.org/10.3389/fcvm.2020.605642

A Novel Approach for Predicting Atrial Fibrillation Recurrence After Ablation Using Deep Convolutional Neural Networks by Assessing Left Atrial Curved M-Mode Speckle-Tracking Images

Yi-Ting Hwang¹^†

Hui-Ling Lee²^†

Cheng-Hui Lu³

Po-Cheng Chang³

Hung-Ta Wo³

Hao-Tien Liu³

Ming-Shien Wen^3,4

Fen-Chiung Lin³

Chung-Chuan Chou^3,4^*

¹Department of Statistics, National Taipei University, Taipei, Taiwan
²Department of Anesthesia, Chang Gung Memorial Hospital, Taipei, Taiwan
³Division of Cardiology, Department of Internal Medicine, Chang Gung Memorial Hospital, Linkou, Taiwan
⁴Chang Gung University College of Medicine, Taoyuan, Taiwan

Aims: Curved M-mode images of global strain (GS) and strain rate (GSR) provide sufficiently detailed spatiotemporal information of deformation mechanics. This study investigated whether a deep convolutional neural network (CNN) could accurately classify these images in patients with atrial fibrillation (AF) who underwent radiofrequency catheter ablation (RFCA) with different outcomes.

Methods and Results: We retrospectively evaluated 606 consecutive patients who underwent RFCA for drug-refractory AF. Patients were divided into AF-free (n = 443) and AF-recurrent (n = 163) groups. Transthoracic echocardiography was performed within 24 h after RFCA. Left atrial curved M-mode speckle-tracking images were acquired from randomly selected 163 patients in AF-free group and 163 patients in AF-recurrent group as the dataset for deep CNN modeling. We used the ReLu activation function and repeatedly performed CNN model for 32 times to evaluate the stability of hyperparameters. Logistic regression models with the left atrial dimension, emptying fraction, and peak systolic GS as predictor variables were used for comparisons. Images from the apical 2-chamber (2-C) and 4-chamber (4-C) views had distinct features, leading to different CNN performance between settings; of them, the “4-C GS+4-C GSR” setting provided the highest performance index values. All four predictor variables used for logistic regression modeling were significant; however, none of them, individually or in any combined form, could outperform the optimal CNN model.

Conclusion: The novel approach using deep CNNs for learning features of left atrial curved M-mode speckle-tracking images seems to be optimal for classifying outcome status after AF ablation.

Introduction

Speckle-tracking echocardiography (STE) is an imaging modality for analyzing and tracking small segments of the myocardium to provide greater detail for assessing global and regional cardiac motion and function. Recently, STE has been applied for assessing left atrial (LA) function, and has been proven to be superior to LA size as a predictor of atrial fibrillation (AF) recurrence after radiofrequency catheter ablation (RFCA) (1–3). LA longitudinal global strain (GS) and GS rate (GSR) are usually determined based on the average of six segmental values per view. Inaba et al. reported that the mean peak systolic GSR was significantly lower in patients with persistent AF than in age-matched controls (4). In addition to reduced LA deformation, LA mechanical dispersion is also pronounced in AF patients, accessed by calculating the standard deviations of segmental GS and GSR values (5). Alternatively, the curved M-mode color images of GS and GSR provide detailed spatiotemporal information of LA deformation mechanics. However, using visual estimation to precisely differentiate these images in challenging.

Deep learning, a class of machine-learning algorithms using multiple layers to progressively extract higher level features from raw input, has become a powerful method of classifying several diseases (6). Through model training, convolutional neural network (CNN) can interpret and analyze various features within a dataset and use them to learn how to generate an output label. CNNs have proven successful in learning patterns in images to aid experts in image-based diagnosis and classification (7). In the present study, we (a) assessed whether supervised deep learning with CNNs can be used to analyze curved M-mode STE images in patients with AF who have undergone RFCA and (b) analyzed whether the predictions of a deep CNN model are better than those of conventional logistic regression models with the LA dimension (LAD), emptying fraction (LAEF), apical 2-chamber peak systolic GS (2-C GS), and 4-chamber peak systolic GS (4-C GS) as predictor parameters.

Materials and Methods

Study Population

In this study, we retrospectively evaluated 606 consecutive AF patients (462 paroxysmal AF) who had undergone RFCA for symptomatic AF refractory to antiarrhythmic drugs between July 2008 and July 2019 at our institution. We obtained the detailed medical history of all patients regarding AF and related cardiovascular and systemic conditions. On the basis of the outcomes of AF ablation, we divided patients into the following two groups: Group 1, no AF recurrence with no antiarrhythmic drugs (n = 443, 73.1%) and Group 2, including both recurrence of atrial tachyarrhythmia responsive to antiarrhythmic drugs (n = 93, 15.3%) or refractory to antiarrhythmic drugs (n = 70, 11.6%). The Institutional Review Board of Chang Gung Memorial Hospital approved the study protocol (IRB No. 202000829B0), and written informed consent was obtained from all patients.

Electrophysiological Study and RFCA

AF ablation was performed using a three-dimensional electroanatomical mapping system (CARTO, Biosense Webster, Diamond Bar, CA, USA) as previously reported (8). Briefly, all patients underwent RFCA under endotracheal intubation and general anesthesia. A 3.5-mm open-tip irrigated catheter (NaviStar Thermo-Cool, Biosense Webster) was percutaneously introduced through the right femoral vein for mapping and ablation. Circumferential pulmonary vein antral isolation with confirmation of entrance block was verified in all patients. If AF persisted or left atrial flutter occurred after pulmonary vein isolation, additional LA linear ablation was performed at the operator's discretion. External cardioversion was performed to restore sinus rhythm if RFCA failed to convert AF. Non-pulmonary vein triggers that reinitiated AF were ablated as necessary.

Echocardiography

Patients underwent transthoracic echocardiography within 24 h after ablation. All patients were in sinus rhythm during echocardiography. Two-dimensional (2-D) echocardiography was performed using a commercially available ultrasonography machine (Vivid 9, General Electric Medical Health, Waukesha, WI, USA) with a 2.5-MHz phased-array transducer. All echocardiographic measurements were obtained in accordance with the guidelines of the American Society of Echocardiography (9). The 2-D LA volume was measured from the apical 4-chamber (4-C) view. LAEF was determined as the difference between the maximum LA volume in ventricular systole and the minimum LA volume in ventricular diastole, divided by the maximum LA volume (10).

STE images of the left atrium obtained in apical 4-C and 2-chamber (2-C) views with a frame rate between 60 and 100 frames/s were captured and stored digitally for offline analysis of LA GS and GSR (EchoPac PC, GE Vingmed, Horton, Norway). Special care was taken during echocardiographic image acquisition to ensure adequate LA tracking and avoid interference with the pulmonary veins and LA appendage to measure LA GS and GSR. The endocardium of the LA wall was manually traced starting from the medial/septal to the lateral mitral annulus in the apical 4-C view and inferior to anterior mitral annulus in the apical 2-C view, and was tracked by the 2-D speckle-tracking software along the border (Figure 1A). The operator manually adjusted segments that were not tracked. STE determined regional changes in length and was expressed as a positive value for lengthening or as a negative value for shortening. LA peak longitudinal systolic GS was assessed as the average of six segmental values per view.

FIGURE 1

Figure 1. Examples of 2-D speckle-tracking echocardiography in two groups. (A) Echocardiograms of the apical 4-chamber view depicted the region of interest generated using speckle-tracking echocardiography software. The depicted left atrial wall was divided into six segments marked by different color. GS = −13.3% indicates peak systolic left atrial global strain = −13.3%. (B) Above: the longitudinal strain curves of six segments with different color labeled in (A); below: curved M-mode strain image representing the cyclic changes of strain at the region of interest along the time axis. Blue indicates positive values and red indicates negative values. The yellow lines separate the image into six portions labeled in (A). Red triangles indicate the period of recording, which started at the end of the T wave (conduit phase), followed by the contraction phase. (C) Examples of curved M-mode strain (left) and strain rate (right) images obtained from apical 4-chamber and 2-chamber views in two groups, including paroxysmal atrial fibrillation (PAF) and non-PAF patients. Images in Group 1 show deeper red in the contraction phase and more homogeneous pattern of color distribution, indicating a better mechanical deformation and synchrony of the left atrium than those in Group 2.

The curved M-mode images of GS and GSR in both apical 4-C and 2-C views were also generated using the software, providing a unidimensional view of GS and GSR which illustrated the change in length and the change in strain/sec of the depicted LA wall along the time axis, respectively (Figures 1B,C). Curved M-mode STE images represented the cyclic changes of strain and strain rate at the region of interest along the time axis, starting from the end of the T wave (conduit phase) to the contraction phase and then the reservoir phase.

Intraclass correlation coefficients were calculated to quantify the intra-observer and inter-observer variability of GS in 48 randomly selected patients, measured first by the same investigator on two separate occasions for intra-observer variability, and then by two independent investigators for inter-observe variability. The two investigators were blinded to each other's measurements and the outcome status of AF ablation. Repeat measurement was made at the same cardiac cycle of the same image for each patient to avoid inherent variability caused by different cycle lengths.

Follow-Up and Definition

Patients were followed up at 1 week, 1 month, 3 months, 6 months and every 3–6 months after RFCA and whenever required due to AF symptoms. Twelve-lead electrocardiograms and 24-h Holter ambulatory electrocardiograms were recorded after RFCA and when the patient experienced palpitation symptoms. Recurrence was defined as typical palpitation episodes for >30 s or atrial tachyarrhythmia on a 12-lead electrocardiogram, Holter monitoring, or pacemaker/implantable cardioverter-defibrillator interrogation records. Repeat RFCA as well as continuation of a previously ineffective antiarrhythmic drug were suggested to patients with AF recurrence.

Processing of Data Import

The curved M-mode RGB images of GS and GSR were extracted and standardized into portable network graphic images of 120 × 120 pixel. Because the classification ability was unclear, we used four combinations of the curved M-mode GS and GSR images: 2-C GS + 4-C GS (2S+4S), 2-C GSR + 4-C GSR (2SR+4SR), 4-C GS + 4-C GSR (4S+4SR), and 2-C GS + 2-C GSR (2S+2SR). Because of an imbalanced feature of the study sample, images from 163 randomly selected patients from Group 1 and 163 patients from Group 2 were used. In each replicate, 80% of the patient data were randomly selected as training data and the remaining were treated as testing data. The process of selecting the study samples is illustrated in Figure 2A.

FIGURE 2

Figure 2. Import data and convolutional neural network (CNN) flow chart. (A) There was a total of 326 cases from which were split with 208 cases as the training set, 52 cases as the validation set, and 66 cases as the test set in building the CNN model. (B) Architecture of the final CNN model. 2S, apical 2-chamber strain map; 2SR, apical 2-chamber strain rate map; 4S, apical 4-chamber strain map; 4SR, apical 4-chamber strain rate map.

Architecture of the CNN

The CNN was then used to classify the subjects. As shown in Figure 2B, the CNN architecture was set to have an input layer, K sets of convolutional layer and max-pooling layer (feature maps), one flatten layer, M fully connected layers, and an output layer. For stable performance of the CNN model, the hyperparameters in the convolutional layer and K were determined dynamically (11). The 2-D convolutional layer built in Keras was used to construct the feature maps of the images (12). The hyperparameters in the convolutional layer included filter size, kernel size, stride, and padding. These parameters were determined dynamically among the parameter settings given in Table 1. We used the ReLu activation function for the CNN model. To determine the stability of the hyperparameters, the CNN model was performed repeatedly for 32 times. The decision on the final setting for the hyperparameters was based on the variability of the accuracy from 32 runs. The optimizer used the Adam algorithm with a learning rate set to 0.001 (13). CNNs were implemented using TensorFlow version 2.0 (Google Brain, 273 Mountain View, CA, USA) and Keras version 2.2.4 software (GitHub, San Francisco, CA) using Python version 3.5 programming language (Python Software Foundation, Beaverton, OR).

TABLE 1

Table 1. Clinical and echocardiographic data of the study groups.

Statistical Methods

Numbers and percentages were used to summarize the basic characteristics of the study sample. Two independent-sample t-tests were conducted to evaluate the association between continuous covariates and groups. A chi-square test was performed to assess the association between discrete covariates and groups. Logistic regression models were used to access the predictive power for group classification. The predictor variables included LAD, LAEF, 2-C GS, and 4-C GS. To understand the predictive power of these variables, seven settings were considered, including four individual predictors, LAEF+LAD, 2-C GS+4-C GS, and all four variables. Logistic regression models were constructed using the same selected subjects for training and testing the CNN model. The model was performed using SAS (Version 9.4, SAS Institute Inc., Cary, NC, USA).

The diagnostic performance of the CNN and the logistic regression models were evaluated using the confusion matrix and the area under the receiver operating characteristic curve (AUC) (14). The confusion matrix was constructed using a cutoff value of 0.5 and used to compute sensitivity, specificity, and accuracy. For the CNN model, the box chart was used to display the sampling distribution of the performance indices for 32 runs. Furthermore, because of the relatively small sample used, the final performance of the CNN model was determined by combining the results from 32 runs, which was also used to illustrate the ability of the source of images to predict abnormal status. The index values were computed three ways. For each subject, an abnormal status for one of the sources (i.e., “4S” or “4SR”) was defined as when the average probability of an abnormal status for 32 runs exceeded 0.5; for two subjects, an abnormal status for two sources (i.e., “4S+4SR”) was defined as when the average probability of an abnormal status for 64 runs exceeded 0.5.

Results

Table 1 summarizes patients' baseline clinical characteristics. Group 1 patients were significantly younger than Group 2 patients. The proportions of men and paroxysmal AF were higher in Group 1. Group 1 also exhibited lower percentages of dyslipidemia, stroke, end-stage renal disease, rheumatic heart disease, and sick sinus syndrome; shorter AF duration; and fewer RFCAs. The echocardiographic data showed all four predictor variables used for logistic regression modeling were significant. Compared with Group 2, Group 1 demonstrated significantly smaller LAD, better LAEF, and more negative values of 2-C GS and 4-C GS.

There was excellent reproducibility of GS analysis. For intra-observer variability, the mean difference and intraclass correlation coefficient were 0.88 and 98.5%, respectively, for 4-C GS and 0.81 and 98.8%, respectively, for 2-C GS. For inter-observer variability, the mean difference and intraclass correlation coefficient were 1.30 and 97.0%, respectively, for 4-C GS and 0.99 and 98.6%, respectively, for 2-C GS.

Logistic Regression Models

Table 2 lists the model estimates of the logistic regression models. When only one predictor variable was included, all four variables were significantly associated with outcome status after AF ablation (Table 2A). Table 2B presents the estimates of the logistic regression models when more than one predictor variable was included. Both LAD and LAEF were significant when they coexisted in the model. When 2-C GS and 4-C GS were included, 2-C GS became less important. When all four predictor variables were controlled for, 2-C GS and 4-C GS became non-significant, whereas LAD was only significant at P = 0.046. Overall, LAEF was the most influential variable for predicting outcome status after AF ablation using logistic regression.

TABLE 2A

Table 2A. Estimates for the logistic regression models with only one predicting variable.

TABLE 2B

Table 2B. Estimates for the logistic regression models with more than one predicting variables.

Performance Indices of CNN Models on Assessing Curved M-Mode STE Images

The final settings for the hyperparameters are given in the last column of Table 3. The performance of classification algorithms was evaluated by computing the AUC, accuracy, specificity, and sensitivity. Figure 3 presents the box plot for the index values of 32 runs for the training and testing samples for four image settings, and the summarized results of statistics for all the performance indices were shown in Supplementary Table 1. Under the selected hyperparameters, the AUC derived from the training sample and testing sample was more than 0.8 for 2S+4S, 2SR+4SR, and 4S+4SR (the AUC derived from the testing sample for 2S+2SR was <0.8). The accuracy for 4S+4SR was more than 0.8, whereas that for 2S+4S and 2SR+4SR was lower than 0.8. The sensitivity and specificity derived from 4S+4SR were also higher than those for the other image settings. Overall, 4S+4SR had the best performance, whereas that of 2S+2SR was low when the testing sample was used.

TABLE 3

Table 3. Parameter ranges for the hyperparameters in the convolutional layers.

FIGURE 3

Figure 3. Comparisons of performance index values for 32 runs for four parameter settings. The 4S+4SR setting yielded the largest area under the receiver operating characteristic curve, accuracy, sensitivity, and specificity scores during training and testing. 2S, apical 2-chamber strain map; 2SR, apical 2-chamber strain rate map; 4S, apical 4-chamber strain map; 4SR, apical 4-chamber strain rate map.

Table 4 presents the performance index values constructed by combining the results obtained from 32 runs. For each type, the performance indices were constructed based on three sources of data. For example, for 2S+4S, the performance index values were computed using results obtained individually from 2S and 4S and then computed with the 2S+4S results. For the 2S+4S setting, the performance index values computed from the results obtained using 4S images were much higher than those using 2S images for the testing sample. Moreover, the accuracy and specificity computed from the results obtained using 4S images were slightly higher than those using 2S+4S for the testing sample. Similarly, for the 2SR+4SR setting, the performance index values computed from the results obtained using 4SR were much higher than those using 2SR for the testing sample. That is, distinct features were observed for images from apical 2-C and 4-C views. Furthermore, the performance index values computed from 4S+4SR were much higher than that from 2S+2SR.

TABLE 4

Table 4. Performance index values constructed from combining 32 runs.

Comparisons of Model Performance

Table 5 provides the performance index values for all models. The CNN model using 4S+4SR images had the best performance, surpassing the logistic regression model using four predicting variables in terms of accuracy and sensitivity. The third best performance was achieved by the CNN model when using 2S+4S or 2SR+4SR images. The logistic regression model based on LAEF had similar sensitivity and specificity, whereas the one based on 2-C GS+4-C GS had similarly higher sensitivity but lower specificity.

TABLE 5

Table 5. Performance indices for all models.

Discussion

The main finding of this study is that a deep CNN based on curved M-mode STE images (4S+4SR) achieved the highest prediction accuracy, sensitivity, and specificity compared with logistic regression models using LAEF, LAD, 2-C GS, or 4-C GS, individually or combined, as predictor parameters to assess outcome status after AF ablation.

STE Imaging and CNN Models

Analysis of LA function is essential for the evaluation of patients with AF, and STE allows identifying those patients who are prone to develop AF and is a marker of LA fibrosis (15). Quantification of LA function using STE images enables evaluation of LA dysfunction due to AF (4), and is able to predict rhythm outcomes after AF ablation (16). A meta-analysis by Ma et al. indicated that LA STE images can facilitate the identification of patients with a high risk of AF recurrence in patients with paroxysmal AF, with a weighted mean AUC of 0.798 (17). STE is a sensitive tool to measure ultrastructural changes that affect LA mechanics before LA enlargement. LA deformation capacity measurement by STE provides a comprehensive assessment of atrial function and is more helpful for identifying abnormal atrial substrate than conventional echocardiographic variables, thus helping in the prediction of post-RFCA AF recurrence in both paroxysmal and persistent AF patients (18).

Furthermore, Sarvari et al. reported that inhomogeneous timing of LA contraction is potentially a predictor of AF recurrence after ablation (5). The segmental dysfunction corresponds to the LA substrate, and LA mechanical and electrical dysfunction coexist in the early phase prior to LA enlargement (19). STE can accurately assess regional myocardial function and timing. Chao et al. reported that applying artificial intelligence algorithms to the STE radial strain of the left ventricle can assist in identifying cardiac resynchronization therapy responders (20). They used complex mathematical methods to compute the difference and standard deviation of time to peak stain detected in each of the six regional strain curves and to convert two strain curves on two ventricular walls into a sequence of phase-space points (a total of 15 pairs) for phase space reconstruction. The authors applied a support vector machine using peak-strain timing and phase space reconstruction as parameters to build classifiers with an average accuracy, sensitivity, and specificity of >90%. For the left atrium, because the GSR curves include three peaks [strain rate during systole (SR-S), early diastole (SR-E), and atrial contraction (SR-AC)] to assess LA reservoir, conduit, and contractile function, it would be much complicated to apply artificial intelligence algorithms on the LA GS and GSR curves to build classifiers for patients post-AF ablation with different rhythm outcomes.

Alternatively, the spatiotemporal information of deformation can be displayed in curved M-mode RGB images of GS and GSR, on which red or blue, deep or light, and patterns of color distribution provide information of the direction, strength, and homogenization of LA deformation properties, respectively. These images can be useful for assessing LA function. However, these images have not been widely used clinically because visual assessment is inherently subjective and prone to classification error. Artificial intelligence in cardiac imaging is a fast-moving field (21), and deep learning is a form of machine learning devised to mimic the way the visual system works (22). Deep CNN models were reported to correctly classify all types of echocardiogram recordings (M-mode, Doppler, still images, and videos) (23). Our findings indicate that a deep CNN can successfully incorporate spatiotemporal features from these RGB images into an overall assessment of LA deformation mechanics. This deep CNN model may also be applicable to left ventricular studies for identifying cardiac resynchronization therapy responders in patients with severe heart failure and left bundle branch block.

Previously, we reported that LAEF provides the optimal prognostic information regarding the risk stratification of AF patients undergoing RFCA (10). In this study, we placed the P-wave in the middle of the time axis by setting zero reference at the end of the T-wave corresponding to the onset of mitral valve opening for a better illustration of LA shortening on curved M-mode images of GS. Because the LA wall is longest at this point, LA strain values were negative, and a prominent red color can be seen in the middle of the images. As shown in Figure 1C, Group 1 had a deeper and more homogeneous red color zone in the middle of the map than Group 2, indicating more effective and synchronized LA shortening. Current strain software packages usually provide an electrocardiogram trigger as a zero reference, which is frequently situated at the upslope of the R-wave as a surrogate for end-diastole or at the onset of LA contraction. Both are the currently used methods reported in the latest European standardization documents (24, 25). Because the entire strain curve changes its amplitude depending on the definition of zero reference, LA GS values obtained by setting zero reference at the peak of the LA strain curve in this study would be much smaller than those obtained by setting zero reference at the nadir of the LA strain curve in the currently used method.

Logistic Regression Models and CNN Models

By incorporating the spatiotemporal information of LA deformation properties, this study demonstrated that the STE image-based CNN model (4S+4SR) had the best performance and surpassed even the logistic regression model using all four parameters. It implies that mechanical synchrony and empty fraction of the left atrium both are important predictors of post-RFCA AF recurrence, and also demonstrates the potential advantages of supervised deep learning with CNNs for image classification tasks. Why the apical 4-C view-acquired images provided better discriminating information than those acquired through the apical 2-C view for CNNs is unclear. A possible reason is that the apical 4-C view is the easiest and most reproducible to perform. To increase feasibility, the consensus of the European Association of Cardiovascular Imaging/American Society of Echocardiography/Industry Task Force to Standardize Deformation Imaging (24) recommends that using the LA longitudinal strain values obtained from a single non-foreshortened apical 4-C view is acceptable.

Tracing the LA outline manually is time consuming. Automated measurement of the left ventricular longitudinal strain is feasible (26). Using the TOMTEC automatic cardiac measurement software package and CNN models, the UCSF Echocardiography Laboratory reported that the automated global longitudinal strain values deviated from manual values by an absolute value of 1.4% (relative value of 7.5%) (27). Therefore, a future goal would be to achieve completely automatic generation and interpretation of curved M-mode speckle-tracking images of the left atrium to provide rapid and reproducible assessment of LA deformation properties.

This study has some limitations. A large sample size is required to achieve sufficient CNN model performance. Because our patient number was limited, the performance index of the deep CNN may have been underestimated in this study. Moreover, results in the test datasets were substantially worse than in the training, which suggests overfitting because of small training datasets. Even if we have repeated the algorithms 32 times and summarized the result in terms of 32 runs to reduce the random selection bias in the CNN modeling, increasing the size of training dataset with growing number of AF patients who have undergone RFCA is necessary to validate and strengthen the performance of the proposed CNN model. Finally, all results were derived from retrospective data. A prospective validation study is required to verify the reliability of the proposed CNN model before this approach is considered for clinical use.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by The Institutional Review Board of Chang Gung Memorial Hospital (IRB No. 202000829B0). The patients/participants provided their written informed consent to participate in this study.

Author Contributions

Y-TH designed and performed CNN, logistic regression modelling, and interpret data. H-LL analyzed strain and strain rate data, including off-line acquisition of cured M-mode images, and wrote the draft of the manuscript. P-CC, C-HL, H-TW, and H-TL collaborated in electrophysiological procedures, echocardiography, and data analyses. All of them contributed significantly to the execution of this study. M-SW is the senior investigator who helped design the study and participated in electrophysiological studies and ablation. F-CL is the senior investigator who helped with speckle-tracking data analysis. C-CC is the corresponding author who designed the study, interpreted the data, and revised the manuscript critically. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2020.605642/full#supplementary-material

References

1. Saraiva RM, Demirkol S, Buakhamsri A, Greenberg N, Popović ZB, Thomas JD, et al. Left atrial strain measured by two-dimensional speckle tracking represents a new tool to evaluate left atrial function. J Am Soc Echocardiography. (2010) 23:172–80. doi: 10.1016/j.echo.2009.11.003

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Hammerstingl C, Schwekendiek M, Momcilovic D, Schueler R, SINNING JM, Schrickel JW, et al. Left atrial deformation imaging with ultrasound based two-dimensional speckle-tracking predicts the rate of recurrence of paroxysmal and persistent atrial fibrillation after successful ablation procedures. J Cardiovasc Electrophysiol. (2012) 23:247–55. doi: 10.1111/j.1540-8167.2011.02177.x

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Montserrat S, Gabrielli L, Bijnens B, Borràs R, Berruezo A, Poyatos S, et al. Left atrial deformation predicts success of first and second percutaneous atrial fibrillation ablation. Heart Rhythm. (2015) 12:11–8. doi: 10.1016/j.hrthm.2014.08.032

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Inaba Y, Yuda S, Kobayashi N, Hashimoto A, Uno K, Nakata T, et al. Strain rate imaging for noninvasive functional quantification of the left atrium: comparative studies in controls and patients with atrial fibrillation. J Am Soc Echocardiography. (2005) 18:729–36. doi: 10.1016/j.echo.2004.12.005

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Sarvari SI Haugaa KH Stokke TM Ansari HZ Leren IS Hegbom F . Strain echocardiographic assessment of left atrial function predicts recurrence of atrial fibrillation. Europ Heart J Cardiovasc Imaging. (2016) 17:660–7. doi: 10.1093/ehjci/jev185

CrossRef Full Text | Google Scholar

6. Shen D, Wu G, Suk H-I. Deep learning in medical image analysis. Ann Rev Biomed Eng. (2017) 19:221–48. doi: 10.1146/annurev-bioeng-071516-044442

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Alsharqi M, Woodward W, Mumith J, Markham D, Upton R, Leeson P. Artificial intelligence and echocardiography. Echo Res Practice. (2018) 5:R115–R25. doi: 10.1530/ERP-18-0056

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Yang CH, Chou CC, Hung KC, Wen MS, Chang PC, Wo HT, et al. Comparisons of the underlying mechanisms of left atrial remodeling after repeat circumferential pulmonary vein isolation with or without additional left atrial linear ablation in patients with recurrent atrial fibrillation. Int J Cardiol. (2017) 228:449–55. doi: 10.1016/j.ijcard.2016.11.020

CrossRef Full Text | Google Scholar

9. Lang RM, Badano LP, Mor-Avi V, Afilalo J, Armstrong A, Ernande L, et al. Recommendations for cardiac chamber quantification by echocardiography in adults: an update from the American Society of Echocardiography and the European Association of Cardiovascular Imaging. Europ Heart J Cardiovasc Imaging. (2015) 16:233–71. doi: 10.1093/ehjci/jev014

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Chou C-C, Lee H-L, Chang P-C, Wo H-T, Wen M-S, Yeh S-J, et al. Left atrial emptying fraction predicts recurrence of atrial fibrillation after radiofrequency catheter ablation. PLoS ONE. (2018) 13:e0191196. doi: 10.1371/journal.pone.0191196

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Chen H-H, Hwang Y-T, Terng H-J. Using dynamic method to determine the hyper-parameters in convolutional neural network for the classification of lung nodules in CT Images. J Chinese Statist Assoc. (2020) 58:93–127.

Google Scholar

12. Zafar I, Tzanidou G, Burton R, Patel N, Araujo L. Hands-on Convolutional Neural Networks With TensorFlow: Solve Computer Vision Problems With Modeling in TensorFlow and Python. Birmingham, UK: Packt Publishing Ltd. (2018).

Google Scholar

13. Kingma DP, Adam JB. A method for stochastic optimization. arXiv [preprint]. (2014) arXiv:14126980.

PubMed Abstract | Google Scholar

14. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. (1988) 44:837–45. doi: 10.2307/2531595

CrossRef Full Text | Google Scholar

15. Pastore MC, De Carli G, Mandoli GE, D'Ascenzi F, Focardi M, Contorni F, et al. The prognostic role of speckle tracking echocardiography in clinical practice: evidence and reference values from the literature. Heart Fail Rev. (2020). doi: 10.1007/s10741-020-09945-9. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Kojima T, Kawasaki M, Tanaka R, Ono K, Hirose T, Iwama M, et al. Left atrial global and regional function in patients with paroxysmal atrial fibrillation has already been impaired before enlargement of left atrium: velocity vector imaging echocardiography study. Europ Heart J Cardiovasc Imaging. (2012) 13:227–34. doi: 10.1093/ejechocard/jer281

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Ma XX, Boldt LH, Zhang YL, Zhu MR, Hu B, Parwani A, et al. Clinical relevance of left atrial strain to predict recurrence of atrial fibrillation after catheter ablation: a meta-analysis. Echocardiography. (2016) 33:724–33. doi: 10.1111/echo.13184

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Ma X-X, Zhang Y-L, Hu B, Zhu M-R, Jiang W-J, Wang M, et al. The usefulness of global left atrial strain for predicting atrial fibrillation recurrence after catheter ablation in patients with persistent and paroxysmal atrial fibrillation. Arch Cardiovasc Dis. (2017) 110:447–55. doi: 10.1016/j.acvd.2016.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Chen Y, Li Z, Shen X, Wang W, Kang Y, Qiao Z, et al. Assessment of left atrial remodeling in paroxysmal atrial fibrillation with speckle tracking echocardiography: a study with an electrophysiological mapping system. Int J Cardiovasc Imaging. (2019) 35:451–9. doi: 10.1007/s10554-018-1470-6

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Chao P-K, Wang C-L, Chan H-L. An intelligent classifier for prognosis of cardiac resynchronization therapy based on speckle-tracking echocardiograms. Artificial Intell Med. (2012) 54:181–8. doi: 10.1016/j.artmed.2011.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Petersen SE, Abdulkareem M, Leiner T. Artificial intelligence will transform cardiac imaging–opportunities and challenges. Front Cardiovasc Med. (2019) 6:133. doi: 10.3389/fcvm.2019.00133

PubMed Abstract | CrossRef Full Text | Google Scholar

22. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. (2015) 521:436–44. doi: 10.1038/nature14539

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Madani A, Arnaout R, Mofrad M, Arnaout R. Fast and accurate view classification of echocardiograms using deep learning. NPJ Digital Med. (2018) 1:1–8. doi: 10.1038/s41746-017-0013-1

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Badano LP, Kolias TJ, Muraru D, Abraham TP, Aurigemma G, Edvardsen T, et al. Standardization of left atrial, right ventricular, and right atrial deformation imaging using two-dimensional speckle tracking echocardiography: a consensus document of the EACVI/ASE/Industry Task Force to standardize deformation imaging. Europ Heart J Cardiovasc Imaging. (2018) 19:591–600. doi: 10.1093/ehjci/jey042

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Voigt J-U, Malaescu G-G, Haugaa K, Badano LJEHJ-CI. How to do LA strain. Europ Heart J Cardiovasc Imaging. (2020) 21:715–7. doi: 10.1093/ehjci/jeaa091

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Knackstedt C, Bekkers SC, Schummers G, Schreckenberg M, Muraru D, Badano LP, et al. Fully automated versus standard tracking of left ventricular ejection fraction and longitudinal strain: the FAST-EFs multicenter study. J Am College Cardiol. (2015) 66:1456–66. doi: 10.1016/j.jacc.2015.07.052

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Zhang J, Gajjala S, Agrawal P, Tison GH, Hallock LA, Beussink-Nelson L, et al. Fully automated echocardiogram interpretation in clinical practice: feasibility and diagnostic accuracy. Circulation. (2018) 138:1623–35. doi: 10.1161/CIRCULATIONAHA.118.034338

CrossRef Full Text | Google Scholar

Keywords: atrial fibrillation, deep convolutional neural network, radiofrequency ablation, speckle tracking longitudinal strain, recurrence

Citation: Hwang Y-T, Lee H-L, Lu C-H, Chang P-C, Wo H-T, Liu H-T, Wen M-S, Lin F-C and Chou C-C (2021) A Novel Approach for Predicting Atrial Fibrillation Recurrence After Ablation Using Deep Convolutional Neural Networks by Assessing Left Atrial Curved M-Mode Speckle-Tracking Images. Front. Cardiovasc. Med. 7:605642. doi: 10.3389/fcvm.2020.605642

Received: 12 September 2020; Accepted: 31 December 2020;
Published: 22 January 2021.

Edited by:

Matteo Cameli, University of Siena, Italy

Reviewed by:

Maria Concetta Pastore, Università del Piemonte Orientale, Italy
Jolanda Sabatino, Royal Brompton Hospital, United Kingdom

Copyright © 2021 Hwang, Lee, Lu, Chang, Wo, Liu, Wen, Lin and Chou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chung-Chuan Chou, 2867@cgmh.org.tw; orcid.org/0000-0003-4465-0016

^†These authors have contributed equally to this work

ORIGINAL RESEARCH article

A Novel Approach for Predicting Atrial Fibrillation Recurrence After Ablation Using Deep Convolutional Neural Networks by Assessing Left Atrial Curved M-Mode Speckle-Tracking Images

Introduction

Materials and Methods

Study Population

Electrophysiological Study and RFCA

Echocardiography

Follow-Up and Definition

Processing of Data Import

Architecture of the CNN

Statistical Methods

Results

Logistic Regression Models

Performance Indices of CNN Models on Assessing Curved M-Mode STE Images

Comparisons of Model Performance

Discussion

STE Imaging and CNN Models

Logistic Regression Models and CNN Models

Data Availability Statement

Ethics Statement

Author Contributions

Conflict of Interest

Supplementary Material

References

People also looked at