MNL-Network: A Multi-Scale Non-local Network for Epilepsy Detection From EEG Signals

Zhang, Guokai; Yang, Le; Li, Boyang; Lu, Yiwen; Liu, Qinyuan; Zhao, Wei; Ren, Tianhe; Zhou, Junsheng; Wang, Shui-Hua; Che, Wenliang

doi:10.3389/fnins.2020.00870

ORIGINAL RESEARCH article

Front. Neurosci., 17 November 2020

Sec. Neuroprosthetics

Volume 14 - 2020 | https://doi.org/10.3389/fnins.2020.00870

This article is part of the Research TopicAdvanced Deep-Transfer-Leveraged Studies on Brain-Computer InterfacingView all 23 articles

MNL-Network: A Multi-Scale Non-local Network for Epilepsy Detection From EEG Signals

Guokai Zhang¹

Le Yang²

Boyang Li²

Yiwen Lu³

Qinyuan Liu³^*

Wei Zhao⁴

Tianhe Ren⁵

Junsheng Zhou⁵

Shui-Hua Wang^6,7^*

Wenliang Che⁸^*

¹School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai, China
²School of Software Engineering, Tongji University, Shanghai, China
³Department of Computer Science and Technology, Tongji University, Shanghai, China
⁴Chengyi University College, Jimei University, Xiamen, China
⁵School of Informatics, Xiamen University, Xiamen, China
⁶School of Architecture Building and Civil Engineering, Loughborough University, Loughborough, United Kingdom
⁷School of Mathematics and Actuarial Science, University of Leicester, Leicester, United Kingdom
⁸Department of Cardiology, Shanghai Tenth People's Hospital, Tongji University School of Medicine, Shanghai, China

Epilepsy is a prevalent neurological disorder that threatens human health in the world. The most commonly used method to detect epilepsy is using the electroencephalogram (EEG). However, epilepsy detection from the EEG is time-consuming and error-prone work because of the varying levels of experience we find in physicians. To tackle this challenge, in this paper, we propose a multi-scale non-local (MNL) network to achieve automatic EEG signal detection. Our MNL-Network is based on 1D convolution neural network involving two specific layers to improve the classification performance. One layer is named the signal pooling layer which incorporates three different sizes of 1D max-pooling layers to learn the multi-scale features from the EEG signal. The other one is called a multi-scale non-local layer, which calculates the correlation of different multi-scale extracted features and outputs the correlative encoded features to further enhance the classification performance. To evaluate the effectiveness of our model, we conduct experiments on the Bonn dataset. The experimental results demonstrate that our MNL-Network could achieve competitive results in the EEG classification task.

1. Introduction

As the center of cognitive processes and sensory stimuli, the brain controls the vital functions of the body and has a complicated information processing function (Türk and Özerdem, 2019). When the nervous system is active, the brain emits biopotential signals that can reflect dysfunction or disease. By magnifying and recording the spontaneous biological potential of the brain from the scalp through sophisticated electronic instruments, one can obtain electroencephalography (EEG) signals. Due to its excellent temporal resolution, easy implementation, and low cost, EEG has become one of the most effective techniques in monitoring the brain activity and diagnosing the neurological disorder (Ullah et al., 2018).

Epilepsy is a neurological disorder affecting about 50 million people around the world (Beghi et al., 2005; Megiddo et al., 2016). Epilepsy manifests in the form of seizures, which is an abnormal electrical activity that occurs temporarily in nerve cells (Bancaud, 1973). Since EEG can accurately record the intermittent slow waves, spikes, or irregular spikes during seizures by analyzing the wave morphology of EEG signals, one can give an explicit evaluation of the presence and level of epilepsy. Unfortunately, the detection of epilepsy from EEG requires signal records over a long-term period, which is a time-consuming and inefficient undertaking. Considering the shortage of professional doctors at present, it is therefore urgent and meaningful to detect epilepsy in an automatic way.

In recent years, the algorithms based on the hand-crafted feature engineering have shown great success in many medical image analysis fields (Jiang et al., 2017; Li et al., 2019; Xu et al., 2020); for EEG signal automatic detection tasks, some early attempts such as Gotman (1982) decomposed the EEG into elementary waves and detected the paroxysmal bursts of rhythmic activity. Furthermore, these works could detect the patterns specific to newborns and then give a warning to patients when a seizure is starting (Gotman, 1999). Recently, Gardner et al. proposed a Support Vector Machine (SVM)-based method in which seizure activity induced distributional changes in feature space that increased the empirical outlier fraction (Gardner et al., 2006). Moreover, an automatic epileptic seizure detection method was developed based on line length feature and artificial neural networks in Guo et al. (2010). After that, a different feature acquisition and classification technique in the diagnosis of both epilepsy and autism spectrum disorder (ASD) was developed by Ibrahim (Ibrahim et al., 2017). Lu's team used Kraskov Entropy based on the Hilbert Huang Transform (HHT) to obtain features. They used the Least Squares Version of Support Vector Machine (LS-SVM) for wavelet transformation (Lu et al., 2018). Although many hand-crafted feature algorithms have been proposed, it is still a challenging problem to identify epilepsy and non-epileptic EEG signals due to the noise and artifacts in the data as well as the inconsistency in seizure morphology of the epilepsy (Tao et al., 2017).

Recently, with the great success of deep learning in computer vision and data mining fields, considerable attention has been focused on the EEG signal classification task. Compared with the hand-crafted feature learning methods, the deep learning methods could generically learn stronger discriminative features with an end-to-end manner. For EEG classification, a Computer-Aided Diagnosis (CAD) system was developed in Acharya et al. (2017), which employed the Convolutional Neural Network (CNN) for analysis of EEG signals. In the follow-up study, the authors in Yuan et al. (2017) transformed EEG signals into EEG scalogram sequences using wavelet transformation, and they then obtained three different EEG features by using Global Principal Component Analysis (GPCA) (Vidal, 2016), Stacked Denoising Autoencoders (SDAE) (Vincent et al., 2010), and EEG segments. After that, the seizure detection was performed by combining all the obtained features and assigning them to the SVM classifier. As for the end-to-end feature learning, Türk et al. obtained two-dimensional (2D) frequency-time scalograms by applying continuous wavelet transform to EEG records, and they then used CNN to learn the properties of these scalogram images to classify the EEG signal (Türk and Özerdem, 2019).

Bhattacharyya et al. analyzed the underlying complexity and nonlinearity of EEG signals by computing a novel multi-scale entropy measure for the classification of seizure, seizure-free, and normal EEG signals (Abhijit et al., 2017). Hussein et al. transformed EEG data into a series of non-overlapping segments to reveal the correlation between consecutive data samples. The Long Short Term Memory (LSTM) network and the softmax classifier were exploited for classification to learn the high-level features of normal and seizure EEG models (Hussein et al., 2018). It should be noted that the majority of the automatic systems perform well in detecting binary epilepsy scenarios, but their performance degrades greatly in classifying the ternary case. To overcome this problem, Ullah et al. proposed an ensemble of pyramidal one-dimensional convolutional neural network (P-1D-CNN) models (Ullah et al., 2018), which could efficiently handle the small available data challenge in classifying the ternary case.

Despite some preliminary results that have been established in the literature, they ignore the multi-scale features which play an important role in the EEG classification task. For example, the long scale of the signal reflects more global representations of the EEG signal, and the short scale of signal embodies the information from the local EEG signal. Thus, those methods based on the single scale of the EEG signal could hinder the model from achieving a better performance due to the absence of multi-scale features. Moreover, the correlations of multi-scale signal features could also be an important factor in this classification task. The learned correlations of multi-scale features are capable of providing correlative dependencies of various lengths' EEG signals, which give more feature information to further improve the classification performance. Based on the discussion above, in this paper, we propose a Multi-scale Non-local (MNL) network to learn multi-scale and correlative features from the input EEG signals. The overview of our designed MNL-Network is illustrated in Figure 2. Different from the previous work that directly input the extracted features into the fully connected layer for classification, our MNL-Network developed a signal pooling layer to learn the multi-scale representations through different sizes of 1D max-pooling layers (Zhao et al., 2017) and then input these representations to a Non-local layer (Wang et al., 2018), which aims to encode more correlative features with multi-scale characteristic. To evaluate the performance of the proposed MNL-Network, we conduct comprehensive experiments on the public EEG Bonn dataset. The experimental results show the high classification accuracy of different EEG records, which convinces the effectiveness of our MNL-Network.

In the following section, we will first describe the experimental data in section 2.1. The detailed description of our proposed method is introduced from section 2.2 to section 2.4. The comparison results of different class combinations will be presented in section 3. Finally, we will give an overall discussion of our work in section 4.

2. Data and Methods

2.1. Data Description

Our experiments employ the EEG Bonn dataset (Andrzejak et al., 2001), which is public and widely used. There are five subsets/classes in this Bonn dataset, and they are denoted as set A, B, C, D, and E. Set A and B monitor the surface EEG records of healthy waking people with eyes open or closed. The other three sets, C, D, and E, are collected from epileptics. Set C it detects the records from the hippocampal formation of the brain during the seizure-free intervals. Set D is gained from the epileptogenic zone with the same intervals as set C, and set E only contains the records of the seizure activities. Each of the five sets is composed of 100 person signals of sampling rate 173.61 Hz and duration 23.6 s. Afterwards, the data samples were made into 4, 097 data points and then divided into 23 chunks for each person signal. Thus, the total record amount of the five sets could be 23 × 100 × 5 = 11, 500, and each set contains 2, 300 records. We show some samples of the different sets in Figure 1.

FIGURE 1

Figure 1. The data samples from the Bonn dataset.

2.2. Network Architecture

Recently, the deep convolutional neural network has achieved great success in computer vision and data analysis fields, and it has become the most rapidly developing technology in the machine learning domain. Compared with the traditional hand-defined feature learning methods, the CNN extracts highly sophisticated feature representations by an end-to-end learning mode, which could be more efficient and accurate. Since the EEG signal is a 1D time-series data, our main network is based on a 1D CNN, which mainly consists of the convolution layer, max-pooling layer, batch-normalization (BN) layer, and fully connected (FC) layer. The overview of our proposed network is illustrated in Figure 2. The network takes the EEG signal as the input and outputs the final EEG classification prediction result in an end-to-end manner. In order to accelerate the convergence of the network, we first use the z-score normalization to normalize the input EEG signal to [0, 1] range. Denoting the input signal data as s, the z-score normalization could be formulated as the following:

\begin{array}{l} s^{*} = \frac{s - μ}{θ} & (1) \end{array}

where μ is the mean value of s, the parameter of θ is the standard deviation of s, and the normalized data is s^*.

FIGURE 2

Figure 2. The overview of our MNL-Network. The main backbone is based on a 1D convolution neural network with two additional layers (i.e., signal pooling layer and multi-scale non-local layer), which are developed to learn multi-scale and correlative features from the EEG signals.

In our MNL-Network, the first layer is the convolution layer, which is generally used for filtering signals with fixed length to get discriminating features from the input. The filter number denotes the amounts of features that the kernel extracts. For reducing the complexities of the network, we use three convolution layers with the sizes 40, 20, and 10, respectively. Through using different kernels, discriminative categories of features are extracted and then fed into the next layer. Note that in the CNN, the deeper convolution layer usually extracts more high-level representations, while the lower one learns more tiny-detailed features. For learning more non-linear representations from the EEG signal, the Rectified Linear Unit (RELU) activation function is adopted with the form as follows:

\begin{array}{l} f (a) = m a x (0, a) & (2) \end{array}

where f(a) is the activation output of the input feature a. Since the BN layer can accelerate the learning process and maintain training stability, we add it after each convolution layer. After the BN layer, the max-pooling layer is followed to get the maximum signal value from the encoded features, and it is also used to down-sample or pool the input representation. The size of the max-pooling layer in our network is set as 2 with the stride 2. Before input the extracted features into the signal pooling layer, there are two max-pooling layers utilized to extract spatial information and enlarge the receptive field from the signal features. The detailed parameters of the MNL-Network are presented in Table 1.

TABLE 1

Table 1. The parameters of the MNL-Network.

For combining non-linear features from the previous layers, we use three FC layers, and the last FC layer is with a softmax function to output the prediction probability of each class. Mathematically, we denote the class labels as y_(i) ∈ {1, 2, ⋯ , C}, where the data samples have C classes totally. Given the normalized input data s^*, the softmax operation $h_{θ} (s^{*})$ could be formulated as the following:

\begin{array}{l} \begin{array}{l} h_{θ} (s^{*}) = (\begin{matrix} P (y = 1 | s^{*}; θ) \\ P (y = 2 | s^{*}; θ) \\ ⋮ \\ P (y = C | s^{*}; θ) \end{matrix}) \\ = \frac{1}{\sum_{j = 1}^{C} \exp (θ_{j}^{T} s^{*})} (\begin{matrix} \exp (θ_{1}^{T} s^{*}) \\ \exp (θ_{2}^{T} s^{*}) \\ ⋮ \\ \exp (θ_{C}^{T} s^{*}) \end{matrix}) \end{array} & (3) \end{array}

where θ₁, θ₂, ⋯ , θ_C are the parameters of the softmax operation.

2.3. Signal Pooling Layer

The EEG signal with different scales contains various multi-scale representations. However, the fixed size of the convolution or pooling layer could ignore the multi-scale features and thus hinder the model from achieving a higher classification performance. To address this challenge, we introduce a signal pooling layer that has two main parts: the multi-pooling part and the concatenation part to realize extracting the features from different scales. The detailed structure of the signal pooling layer is illustrated in Figure 3. Let x ∈ ℝ^w×c be the output features from the third convolution layer, w the length of x, and c indicate the channels of x. Then, we define the M(·)_p as the max-pooling operation with the size of p ∈ {1, 2, 4} and the stride as d with the value of 1; the output size of feature o could be calculated as Equation (4).

\begin{array}{l} o = (w - p) / d + 1 & (4) \end{array}

FIGURE 3

Figure 3. The architecture of the signal pooling layer. The M(x)₁, M(x)₂, M(x)₄ denote the max-pooling operations with the sizes of {1, 2, 4}, respectively, and the Z(M(x)₁), Z(M(x)₂), Z(M(x)₄) is the zero-padding layer with sizes of {1, 2, 4} that aim to reshape the feature map resolution to the same size.

For merging different sizes of the multi-scale features, we define the operator of Z(·) is the zero-padding layer with the left and right padding size of l and r, and the value of l, r is calculated as follows:

\begin{array}{l} l = ⌈ \frac{w - o}{4} ⌉ & (5) \end{array}

\begin{array}{l} r = w - o - l & (6) \end{array}

where ⌈·⌉ denotes the round up value operation. Specifically, for the input feature x, we first perform M(·)₁, M(·)₂, M(·)₄ parallelly to extract the multi-scale features. We then use the Z(·) to pad the features to the same size. Finally, a concatenation operation of Z(M(x)₁), Z(M(x)₂), Z(M(x)₄) is conducted before taking them into the multi-scale non-local layer. The final output feature x_ms of the signal pooling layer could be given as follows:

\begin{array}{l} x_{m s} = C o n c a t (Z (M {(x)}_{1}), Z (M {(x)}_{2}), Z (M {(x)}_{4})) & (7) \end{array}

where the Concat(·) represents the concatenation operation of different features.

2.4. Multi-Scale Non-local Layer

The long-range correlations of different scale EEG signals are of a vital importance in the epilepsy classification task. However, the traditional non-local method has been impeded by the lack of considering the multi-scale features. Thus, in this section, we introduce our designed multi-scale non-local layer, which could learn discriminative multi-scale EEG signal features in a non-local manner. The detailed structure of our multi-scale non-local layer is shown in Figure 4. Instead of using the hierarchical feature from the network, the input feature of the multi-scale non-local layer is extracted from the signal pooling layer, which contains more discriminative multi-scale representations. By measuring the correlations of different multi-scale features, the final category of the EEG signal could be predicted by learning similarities across different scopes.

FIGURE 4

Figure 4. The architecture of the multi-scale non-local layer. The x_ms is the extracted multi-scale features from the previous layer, ϕ, δ, and ϱ are the output features from Conv 1D, the variable of R denotes the similarity matrix of ϕ and δ, the ${\hat{R}}_{ϱ}$ is gained by multiplying R and ϱ, and the final output of the multi-scale non-local layer is ${\hat{x}}_{m s}$ .

Mathematically, we consider that the multi-scale input feature of the multi-scale non-local layer is $x_{m s} \in ℝ^{w \times c}$ , which is extracted from the previous signal pooling layer. Then, we use three 1D convolutions with the receptive field and filter size of one to transform $x_{m s} \in ℝ^{w \times c}$ to embedding space, the output features ϕ ∈ ℝ^w×ĉ, δ ∈ ℝ^w×ĉ, ϱ ∈ ℝ^w×ĉ could be defined as:

\begin{array}{l} ϕ = C o n v_{ϕ} (x_{m s}) & (8) \end{array}

\begin{array}{l} δ = C o n v_{δ} (x_{m s}) & (9) \end{array}

\begin{array}{l} ϱ = C o n v_{ϱ} (x_{m s}) & (10) \end{array}

where ĉ is the channel number of ϕ, δ, and ϱ. After that we flatten the three embeddings and use the matrix multiplication operation g(·, ·) between ϕ and δ to calculate the similarity matrix R, which could be formulated as the following:

\begin{array}{l} R = g (ϕ, δ) = ϕ^{T} \times δ & (11) \end{array}

Next, we apply the a softmax operation to h_η(·) to normalize the similarity matrix R and gain the attention weight matrix $\hat{R}$ =h_η(R); here, η is the parameters of softmax operation. Then, we perform a matrix multiplication between $\hat{R}$ and ϱ, which is formulated as the following:

\begin{array}{l} {\hat{R}}_{ϱ} = \hat{R} \times ϱ^{T} & (12) \end{array}

and the ${\hat{R}}_{ϱ} \in ℝ^{w \times ĉ}$ denotes the output feature. Afterward, a residual connection between $\hat{R}$ and ϱ is performed, and the final output ${\hat{x}}_{m s}$ of the multi-scale non-local layer is given as the following:

\begin{array}{l} {\hat{x}}_{m s} = C o n v_{m s} (C o n v_{m s} (\hat{R_{ϱ}}) + x_{m s}) & (13) \end{array}

2.5. Implementation Details

The proposed method is implemented by the Keras with 1 RTX 2070 GPU, and we use the cross-entropy as our loss function to train the model end-to-end. The parameters of the method are optimized by the Adam optimizer, the initial learning rate is set as 0.0005, and we reduce it by 0.1 after the validation accuracy is not improved after 10 epochs. All the training data is trained in a mini-batch size mode, and we set the mini-batch as 100 for each epoch. For each fold, we choose the best checkpoint on the validation accuracy as our final predicted model.

3. Experimental Results

3.1. Evaluation Metrics

For evaluation, well-known performance metrics, such as accuracy, precision, sensitivity, specificity and F1-score, are adopted. The definitions of these performance metrics are given below:

\begin{array}{l} A c c u r a c y = \frac{T P + T N}{T P + T N + F P + F N} & (14) \end{array}

\begin{array}{l} P r e c i s i o n = \frac{T P}{T P + F P} & (15) \end{array}

\begin{array}{l} S e n s i t i v i t y = \frac{T P}{T P + F N} & (16) \end{array}

\begin{array}{l} S p e c i f i c i t y = \frac{T N}{F P + T N} & (17) \end{array}

\begin{array}{l} F 1 - s c o r e = 2 \times \frac{P r e c i s i o n \times S e n s i t i v i t y}{P r e c i s i o n + S e n s i t i v i t y} & (18) \end{array}

where TP (true positives) are the number of the EEG records that are abnormal and actually identified as abnormal; TN (true negatives) are the number of the EEG records that are normal and actually identified as normal; FP (false positives) are the number of the EEG records that are normal but are actually predicted as abnormal; and FN (false negatives) are the number of the EEG records that are abnormal but are actually predicted as normal.

In order to ensure the system is tested over different categories of data, we used 10-fold cross-validation to evaluate all the data in system performance. That is, we randomly divided the 2,300 EEG signals of each class into ten non-overlapping folds. Then each fold, in turn, is used for testing while the other nine folds are used for training the model. We calculated the average values of accuracy, sensitivity, and specificity for 10-folds to get the average performance of the system.

3.2. The Performance of Double Classes Classification

In this section, we conduct the experiment by comparing the double classes classification performance. The combinations of different classes are A-E, B-E, C-E, D-E, AB-E, AC-E, AD-E, BC-E, BD-E, CD-E, ABC-E, ABD-E, BCD-E, and ABCD-E. The accuracy comparison result of k-10 testing is shown in Table 2, the best performance is achieved on the A-E classes classification with the performance of 99.93%, and the hardest classification is CD-E with the score of 98.54%. It could explain that the healthy waking with eyes open classes could have a big difference from the seizure epileptic, and thus it could achieve a higher classification performance. We also compare other metrics as shown in Table 3. The best overall performance is by classifying A-E classes, which further proves the reasons presented above. Overall, the accuracy performance of different double classes is all above 98%, which indicates that our proposed method could be very generalized in this classification task.

TABLE 2

Table 2. The k-10 accuracy performance of double classes classification.

TABLE 3

Table 3. The overall performance of double classes classification.

3.3. The k-10 Performance of Multiple Classes Classification

For a more comprehensive comparison, we conduct multiple classes classification in this section. The experimental combinations contain A-C-E, A-D-E, B-C-E, B-D-E, AB-CD-E, and A-B-C-D-E, separately. The accuracy performance of different multiple classes is illustrated in Table 4. Compared with the double classes classification, the multiple classes classification tends to be more difficult, and the overall accuracy is lower than the double classes. The reason behind this could be the multiple classes classification having a more complex data distribution than the double classes classification. The overall comparison result of other metrics is shown in Table 5. The best performance is gained by the B-D-E combination, and it achieves 98.62% accuracy, 97.14% sensitivity, 98.57% specificity, 97.14% precision, and an F1-score of 97.14%. Meanwhile, the result shows that the A-B-C-D-E five classes combination obtains the lowest performance, and it is mostly because the five classes combination has a more complicated data characteristic from each class data.

TABLE 4

Table 4. The k-10 performance of multiple classes classification.

TABLE 5

Table 5. The overall performance of multiple classes classification.

3.4. Compare With Other Methods

To further evaluate the effectiveness of our proposed network, we compare our method with other previous works. In Table 6, it shows the comparison result of different methods. Since there are multiple combinations of EEG classes, for simplicity, we use the A-E, B-E, C-E, D-E, AB-E, BC-E, CD-E, A-D-E, and A-B-C-D-E combinations to evaluate our MNL-Network performance. The comparison result demonstrates that our method could achieve competitive performance on double classed classification when compared with other previous works. The best performance of the double classes combination is achieved by A-E, and the reason could be that the class A and E have a large gap between each other, and some other combinations also gain high classification accuracy, which are all above 94%. Moreover, the five classes classification result is also reported in Table 6, our proposed method has achieved an accuracy classification performance of 94.01%, which is higher than the recent works. In particular, the overall CNN based methods usually have better performance than the traditional hand-crafted ones, which further proves that they can extract stronger discriminative representations thus could perform more prominently in the classification task.

TABLE 6

Table 6. The overall performance of double classes classification.

4. Conclusion

In this paper, we propose an automatic EEG signal detection network to help the physicians diagnose the epilepsy more efficiently. The whole architecture is based on the 1D convolution neural network, and two additional layers (signal pooling layer and multi-scale non-local layer) are proposed to learn the multi-scale and correlative information. Extensive comparative evaluations on the Bonn dataset are conducted, and they validate the effectiveness of our proposed method. In future works, we will explore the possibilities of incorporating reinforcement learning in this classification task.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://archive.ics.uci.edu/ml/datasets/Epileptic+Seizure+Recognition.

Author Contributions

GZ, LY, and BL conceived the idea and designed the algorithm. GZ conducted the experiments. YL wrote the initial paper. All the remaining authors contributed to refining the ideas and revised the manuscript.

Funding

This work was supported by the Fundamental Research Funds for the Central Universities NO.22120190211.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Abhijit, B., Ram, P., Abhay, U., and Acharya, U. (2017). Tunable-q wavelet transform based multiscale entropy measure for automated classification of epileptic EEG signals. Appl. Sci. 7:385. doi: 10.3390/app7040385

CrossRef Full Text | Google Scholar

Acharya, U. R., Oh, S. L., Hagiwara, Y., Tan, J. H., and Adeli, H. (2017). Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals. Comput. Biol. Med. 100, 270–278. doi: 10.1016/j.compbiomed.2017.09.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Andrzejak, R. G., Lehnertz, K., Mormann, F., Rieke, C., David, P., and Elger, C. E. (2001). Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. Phys. Rev. E 64:061907. doi: 10.1103/PhysRevE.64.061907

PubMed Abstract | CrossRef Full Text | Google Scholar

Aydemir, E., Tuncer, T., and Dogan, S. (2020). A tunable-q wavelet transform and quadruple symmetric pattern based EEG signal classification method. Med. Hypothes. 134:109519. doi: 10.1016/j.mehy.2019.109519

PubMed Abstract | CrossRef Full Text | Google Scholar

Bancaud, J. (1973). Clinical electroencephalography: L. g. kiloh, a. j. mccomas, j. w. osselton, butterworth, third edition, london 1972. Neuropsychologia 11:251. doi: 10.1016/0028-3932(73)90021-3

CrossRef Full Text

Beghi, E., Berg, A., Carpio, A., Forsgren, L., Hesdorffer, D. C., Hauser, W. A., et al. (2005). Comment on epileptic seizures and epilepsy: definitions proposed by the international league against epilepsy (ILAE) and the international bureau for epilepsy (IBE). Epilepsia 46:1698. doi: 10.1111/j.1528-1167.2005.00273_1.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Gardner, A. B., Krieger, A. M., Vachtsevanos, G., Litt, B., and Kaelbing, L. P. (2006). One-class novelty detection for seizure analysis from intracranial EEG. J. Mach. Learn. Res. 7, 1025–1044. doi: 10.5555/1248547.1248584

CrossRef Full Text | Google Scholar

Gotman, J. (1982). Automatic recognition of epileptic seizures in the EEG. Electroencephalogr. Clin. Neurophysiol. 54:540. doi: 10.1016/0013-4694(82)90038-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Gotman, J. (1999). Automatic detection of seizures and spikes. J. Clin. Neurophysiol. 16, 130–140. doi: 10.1097/00004691-199903000-00005

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, L., Rivero, D., Dorado, J., RabuñAl, J. R., and Pazos, A. (2010). Automatic epileptic seizure detection in EEGs based on line length feature and artificial neural networks. J. Neurosci. Methods 191, 101–109. doi: 10.1016/j.jneumeth.2010.05.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Hussein, R., Elgendi, M., Wang, Z. J., and Ward, R. K. (2018). Robust detection of epileptic seizures based on L1-penalized robust regression of EEG signals. Expert Syst. Appl. 104, 153–167. doi: 10.1016/j.eswa.2018.03.022

CrossRef Full Text | Google Scholar

Ibrahim, S., Djemal, R., and Alsuwailem, A. (2017). Electroencephalography (EEG) signal processing for epilepsy and autism spectrum disorder diagnosis. Biocybern. Biomed. Eng. 38, 16–26. doi: 10.1016/j.bbe.2017.08.006

CrossRef Full Text | Google Scholar

Jia, J., Goparaju, B., Song, J., Zhang, R., and Westover, M. B. (2017). Automated identification of epileptic seizures in EEG signals based on phase space representation and statistical features in the CEEMD domain. Biomed. Signal Process. Control 38, 148–157. doi: 10.1016/j.bspc.2017.05.015

CrossRef Full Text | Google Scholar

Jiang, Y., Wu, D., Deng, Z., Qian, P., Wang, J., Wang, G., et al. (2017). Seizure classification from EEG signals using transfer learning, semi-supervised learning and tsk fuzzy system. IEEE Trans. Neural Syst. Rehabil. Eng. 25, 2270–2284. doi: 10.1109/TNSRE.2017.2748388

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaya, Y., Uyar, M., Tekin, R., and Yildirim, S. (2014). 1d-local binary pattern based feature extraction for classification of epileptic EEG signals. Appl. Math. Comput. 243, 209–219. doi: 10.1016/j.amc.2014.05.128

CrossRef Full Text | Google Scholar

Lee, S.-H., Lim, J. S., Kim, J.-K., Yang, J., and Lee, Y. (2014). Classification of normal and epileptic seizure EEG signals using wavelet transform, phase-space reconstruction, and Euclidean distance. Comput. Methods Prog. Biomed. 116, 10–25. doi: 10.1016/j.cmpb.2014.04.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, W., Zhang, L., Qiao, L., and Shen, D. (2019). Toward a better estimation of functional brain network for mild cognitive impairment identification: a transfer learning view. IEEE J. Biomed. Health Inform. 24, 1160–1168. doi: 10.1101/684779

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, Y., Ma, Y., Chen, C., and Wang, Y. (2018). Classification of single-channel EEG signals for epileptic seizures detection based on hybrid features. Technol. Health Care 26(Suppl.), 337–346. doi: 10.3233/THC-174679

PubMed Abstract | CrossRef Full Text | Google Scholar

Megiddo, I., Colson, A., Chisholm, D., Dua, T., Nandi, A., and Laxminarayan, R. (2016). Health and economic benefits of public financing of epilepsy treatment in India: an agent-based simulation model. Epilepsia 57, 464–474. doi: 10.1111/epi.13294

PubMed Abstract | CrossRef Full Text | Google Scholar

Nigam, V. P., and Graupe, D. (2004). A neural-network-based detection of epilepsy. Neurol. Res. 26, 55–60. doi: 10.1179/016164104773026534

PubMed Abstract | CrossRef Full Text | Google Scholar

Polat, K., and Güneş, S. (2007). Classification of epileptiform EEG using a hybrid system based on decision tree classifier and fast Fourier transform. Appl. Math. Comput. 187, 1017–1026. doi: 10.1016/j.amc.2006.09.022

CrossRef Full Text | Google Scholar

Sharmila, A. E. A. (2016). DWT based detection of epileptic seizure from EEG signals using naive bayes and K-NN classifiers. IEEE Access 4, 7716–7727. doi: 10.1109/ACCESS.2016.2585661

PubMed Abstract | CrossRef Full Text | Google Scholar

Swami, P., Gandhi, T. K., Panigrahi, B. K., Tripathi, M., and Anand, S. (2016). A novel robust diagnostic model to detect seizures in electroencephalography. Expert Syst. Appl. 56, 116–130. doi: 10.1016/j.eswa.2016.02.040

CrossRef Full Text | Google Scholar

Tao, Z., Chen, W., and Li, M. (2017). AR based quadratic feature extraction in the VMD domain for the automated seizure detection of EEG using random forest classifier. Biomed. Signal Process. Control 31, 550–559. doi: 10.1016/j.bspc.2016.10.001

CrossRef Full Text | Google Scholar

Tawfik, N. S., Youssef, S. M., and Kholief, M. (2016). A hybrid automated detection of epileptic seizures in EEG records. Comput. Electr. Eng. 53, 177–190. doi: 10.1016/j.compeleceng.2015.09.001

CrossRef Full Text | Google Scholar

Tuncer, T., Dogan, S., and Akbal, E. (2019). A novel local senary pattern based epilepsy diagnosis system using EEG signals. Austral. Phys. Eng. Sci. Med. 42, 939–948. doi: 10.1007/s13246-019-00794-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Türk, Ö., and Özerdem, M. S. (2019). Epilepsy detection by using scalogram based convolutional neural network from EEG signals. Brain Sci. 9:115. doi: 10.3390/brainsci9050115

PubMed Abstract | CrossRef Full Text | Google Scholar

Ullah, I., Hussain, M., Aboalsamh, H., et al. (2018). An automated system for epilepsy detection using EEG brain signals based on deep learning approach. Expert Syst. Appl. 107, 61–71. doi: 10.1016/j.eswa.2018.04.021

CrossRef Full Text | Google Scholar

Vidal, R. (2016). Generalized Principal Component Analysis (GPCA). New York, NY: Springer. doi: 10.1007/978-0-387-87811-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., and Manzagol, P. A. (2010). Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11, 3371–3408. doi: 10.5555/1756006.1953039

CrossRef Full Text | Google Scholar

Wang, X., Girshick, R., Gupta, A., and He, K. (2018). “Non-local neural networks,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Salt Lake City, UT), 7794–7803. doi: 10.1109/CVPR.2018.00813

CrossRef Full Text | Google Scholar

Xu, X., Li, W., Mei, J., Tao, M., Wang, X., Zhao, Q., et al. (2020). Feature selection and combination of information in the functional brain connectome for discrimination of mild cognitive impairment and analyses of altered brain patterns. Front. Aging Neurosci. 12:28. doi: 10.3389/fnagi.2020.00028

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan, Y., Xun, G., Jia, K., and Zhang, A. (2017). “A novel wavelet-based model for eeg epileptic seizure detection using multi-context learning,” in 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), (Kansas City, MO), 694–699. doi: 10.1109/BIBM.2017.8217737

CrossRef Full Text | Google Scholar

Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017). “Pyramid scene parsing network,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (Honolulu, HI), 2881–2890. doi: 10.1109/CVPR.2017.660

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, W., Zhao, W., Wang, W., Jiang, X., Zhang, X., Peng, Y., et al. (2020). A novel deep neural network for robust detection of seizures using EEG signals. Comput. Math. Methods Med. 2020:9689821. doi: 10.1155/2020/9689821

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: convolution neural network, EEG, epilepsy, multi-scale, non-local, seizure, ictal, interictal

Citation: Zhang G, Yang L, Li B, Lu Y, Liu Q, Zhao W, Ren T, Zhou J, Wang S-H and Che W (2020) MNL-Network: A Multi-Scale Non-local Network for Epilepsy Detection From EEG Signals. Front. Neurosci. 14:870. doi: 10.3389/fnins.2020.00870

Received: 01 June 2020; Accepted: 27 July 2020;
Published: 17 November 2020.

Edited by:

Yizhang Jiang, Jiangnan University, China

Reviewed by:

Yuanpeng Zhang, Nantong University, China
Xuejing Lan, Guangzhou University, China

Copyright © 2020 Zhang, Yang, Li, Lu, Liu, Zhao, Ren, Zhou, Wang and Che. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qinyuan Liu, bGl1cXlAdG9uZ2ppLmVkdS5jbg==; Shui-Hua Wang, aHVpaHVhd2FuZ0BpZWVlLm9yZw==; Wenliang Che, Y2hld2VubGlhbmdAdG9uZ2ppLmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.