ORIGINAL RESEARCH article

Front. Neurosci., 04 September 2020

Sec. Neuroprosthetics

Volume 14 - 2020 | https://doi.org/10.3389/fnins.2020.00837

An Intelligence EEG Signal Recognition Method via Noise Insensitive TSK Fuzzy System Based on Interclass Competitive Learning

  • School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, China

Abstract

Epilepsy is an abnormal function disease of movement, consciousness, and nerve caused by abnormal discharge of brain neurons in the brain. EEG is currently a very important tool in the process of epilepsy research. In this paper, a novel noise-insensitive Takagi–Sugeno–Kang (TSK) fuzzy system based on interclass competitive learning is proposed for EEG signal recognition. First, a possibilistic clustering in Bayesian framework with interclass competitive learning called PCB-ICL is presented to determine antecedent parameters of fuzzy rules. Inherited by the possibilistic c-means clustering, PCB-ICL is noise insensitive. PCB-ICL learns cluster centers of different classes in a competitive relationship. The obtained clustering centers are attracted by the samples of the same class and also excluded by the samples of other classes and pushed away from the heterogeneous data. PCB-ICL uses the Metropolis–Hastings method to obtain the optimal clustering results in an alternating iterative strategy. Thus, the learned antecedent parameters have high interpretability. To further promote the noise insensitivity of rules, the asymmetric expectile term and Ho–Kashyap procedure are adopted to learn the consequent parameters of rules. Based on the above ideas, a TSK fuzzy system is proposed and is called PCB-ICL-TSK. Comprehensive experiments on real-world EEG data reveal that the proposed fuzzy system achieves the robust and effective performance for EEG signal recognition.

Introduction

Epilepsy occurs randomly and may occur multiple times in a day. In the case of epileptic seizures, the patients have a sudden physical convulsions and loss of consciousness, which bring great physical and psychological pain to patients (Ahmadlou and Adeli, 2011; Gummadavelli et al., 2018; Cury et al., 2019). Seizures will lead to brain cell death, affect brain function, and even threaten patients' lives in serious cases. The incidence of epilepsy is high, and the age range is very wide, including children, adolescents, and the elderly, but the incidence of children and adolescents is the highest. Both men and women are likely to have the disease, and men are more likely to have this disease than women. As an important clinical means of monitoring and diagnosing epilepsy, EEG provides a more rapid and stable low-cost and non-invasive technology in monitoring the brain activity of the cerebral cortex. It provides information that other physiological methods cannot provide. The specific waveforms such as spike, sharp, and complex wave can be reflected by EEG. Therefore, the prevention and treatment of epilepsy research for epilepsy patients is of great significance. In the process of diagnosis and treatment of epilepsy, EEG plays an irreplaceable role. Doctors usually judge the condition of patients by observing their EEG.

The traditional way to judge the EEG signal is not only inefficient, but also because of the difference of experts' subjective experience, the automatic detection of EEG signal is still one of the hot issues in biomedical research (Jiang et al., 2017a; Martinez-Vargas et al., 2017; Li et al., 2019). An automatic epilepsy detection method can help doctors improve the accuracy of epilepsy diagnosis and also greatly save time. The research of automatic epilepsy detection is of great value to the prevention, diagnosis, and treatment of epilepsy. At present, epilepsy can be detected by machine learning and data mining. Firstly, the effective feature information is extracted from EEG and preprocessed for data analysis; secondly, the preprocessed EEG data are sent to the classifier for analysis and detection of epileptic and non-epileptic EEG data. In the above implementation process, the key research is to design an effective prediction and discrimination method that can be applied to normal EEG signal and epileptic EEG signal. Many effective methods have been successfully applied to automatic epilepsy detection system, including extreme learning machine (ELM), artificial neural network, Bayesian linear discriminant analysis, support vector machine (SVM), and fuzzy system (Kabir and Zhang, 2016; Qi et al., 2017; Akhavan and Moradi, 2018; Truong et al., 2018; Hossain et al., 2019; Liu et al., 2019; Sreej and Samanta, 2019; Xia et al., 2020). The fuzzy system is a model constructed to deal with the thinking, analysis, reasoning, and decision-making processes in production and practice. It can directly translate natural language into computer language. Due to its ability to process uncertain and ambiguous information, it has a high degree of interpretability and strong learning ability (Juang et al., 2007; Gu et al., 2017a; Jiang et al., 2017b,c; Gu and Wang, 2018). However, the traditional fuzzy system has poor robustness and anti-interference ability, and the classification accuracy is not high in data noise scenarios. But in real life, the classification of noise data is widely used. For example, in actual application scenarios, due to differences in machine advices or scanning technology, such as different rotation angles and noise, the quality of medical images may vary greatly (Siuly and Li, 2015; Hussein et al., 2019; Razzak et al., 2019).

Based on key technology of fuzzy system modeling, this paper proposes a novel noise-insensitive Takagi–Sugeno–Kang (TSK) fuzzy system. How to determine the antecedent and consequent parameters is the key to modeling the noise-insensitive fuzzy system (Takagi and Sugeno, 1985; Jiang et al., 2015). For the antecedent part of fuzzy rules, clustering is one kind of a commonly used strategy, such as fuzzy c-means (FCM) clustering (Bezdek et al., 1984), fuzzy (c + p) clustering (Leski, 2015), Bayesian fuzzy clustering (BFC) (Glenn et al., 2015), and possibilistic c-means (PCM) clustering (Krishnapuram and Keller, 1993). However, FCM, fuzzy (c + p), and BFC are sensitive to noise and will lead to unsatisfactory partition in noisy scenarios. PCM inherits the practicability and flexibility of fuzzy clustering and greatly enhances the clustering performance of data with noise or outliers. However, the unsupervised nature of PCM makes it unable to use the class label information of samples, which easily causes the insufficient fuzzy space partition, thus further affecting the learning of antecedent parameters of fuzzy rules. The principle of antecedent parameter learning using PCM clustering is shown in Figure 1A. PCM clustering is directly used on whole datasets or on samples in each class, and then the antecedent parameters are learned using the obtained clustering results. Then the data samples are simply divided into several clusters, without fully taking advantage of the geometry of data and the label information of samples. In this case, in the data overlapping regions, the distance between clustering centers may be too small or the centers may overlap.

Figure 1

In this paper, we first propose a noise-insensitive possibilistic clustering in Bayesian framework with interclass competitive learning called PCB-ICL. Inherited by PCB, PCB-ICL is noise insensitive; meanwhile, different classes of cluster centers will produce a competitive relationship during the learning process. That is, in the sample overlapping area, the clustering centers are attracted by the samples of the same class and also excluded by the samples of other classes and pushed away from the heterogeneous data. The principle of antecedent parameter learning using PCB-ICL clustering is shown in Figure 1B. PCB-ICL integrates the competitive learning mechanism of clustering centers among different classes in the Bayesian framework. PCB-ICL considers the structure information of samples in the clustering procedure and realizes the competition between clustering centers among different classes. We obtain the antecedent part of fuzzy rules by performing PCB-ICL alternatively on each class samples. Then, a Ho–Kashyap procedure (Leski, 2003) with an asymmetric expectile term (Huang et al., 2014a,b) is adopted to estimate the consequent parameters of fuzzy rules. Due to the statistical characteristics of the asymmetric expectile term, it is insensitive to noise; so the asymmetric expectile term is used to measure the misclassification error. Based on the above idea, the TSK fuzzy system called PCB-ICL-TSK is developed, which learns antecedent parameters by PCB-ICL clustering and consequent parameters by the Ho–Kashyap procedure with an asymmetric expectile term. We apply the proposed algorithm on the Bonn EEG dataset, and the experimental results on several noisy classification tasks demonstrate that PCB-ICL-TSK can achieve satisfactory performance in EEG signal classification. The novelty of our study is as follows. (1) Both the PCB-ICL and Ho–Kashyap procedure with an asymmetric expectile term are insensitive to noise; thus, the obtained antecedent and consequent parameters are noise insensitive. (2) With the Bayesian framework, the clustering results of PCB-ICL are globally optimal. In addition, the competitive relationship strategy between cluster centers enhances the interpretability of the antecedents of fuzzy rules. (3) The experiments on real-word EEG datasets confirm the effectiveness of PCB-ICL-TSK.

The detailed chapters are arranged as follows. Section Backgrounds introduces the TSK fuzzy system and PCM clustering. Section Possibilistic Clustering in Bayesian With Interclass Competitive Learning explores PCB-ICL clustering. Section Noise-Insensitive TSK Fuzzy System via Interclass Competitive Learning explores the noise-insensitive TSK fuzzy system PCB-ICL-TSK. Section Experiment is experiments on noisy EEG data. Section Conclusion is the conclusion.

Backgrounds

Dataset

The epileptic EEG in the experiment is the Bonn dataset from Bonn University, Germany (Tzallas et al., 2009). The Bonn EEG dataset consists of five groups of data, namely, A to E, shown in Figure 2. Each group of data contains 100 EEG signal segments of 23.6 s, which were selected from continuous single-channel EEG recordings. The EEG signals were recorded under different conditions with five patients and five healthy volunteers. The basic information of groups A–E is shown in Table 1.

Figure 2

Table 1

GroupDescription
Healthy volunteersAEEG signals of healthy volunteers in an awakened state with eyes open
BEEG signals of healthy volunteers in an awakened state with eyes closed
PatientsCEEG signals of patients in hippocampal formation of the opposite hemisphere of the brain
DEEG signals of patients in the epileptogenic zone during periodic lulls
EEEG signals of patients during seizure activity

The basic information of EEG data groups of A–E.

TSK Fuzzy System

The most commonly used rule in the zero-order TSK fuzzy system can be represented by

Rule Rk: IF x1 is Ak,1 and x2 is Ak,2 and … and xd is Ak,d,

where x1, x2, …, xd are input variables, Ak,i is a fuzzy subset, and K is the number of fuzzy rules. For an input vector x, the output of the corresponding TSK fuzzy system is represented by

where the fuzzy membership μk(x) and the normalized fuzzy membership is

For the sample xi, we can rewrite it by

Generally, antecedent and consequent parameters of rules are determined separately. A popular way to estimate antecedent parameters is to use a certain fuzzy clustering method (Takagi and Sugeno, 1985; Gu et al., 2017b; Salgado et al., 2017). Then can be computed by

where the width parameter δk,i can be obtained by

where h is the scale parameter and uk,j is the fuzzy membership of the jth input sample xj belonging to the kth cluster.

Then the learning of consequent parameters can be represented by

Using the least square solution to minimize the squared loss, Equation (8) can be written by

where D = [l1d(x1)T, …, lNd(xN)T]T, the matrix H = diag(h1, h2, …, hN), hi = 1/|lid(xi)Tp−1| for lid(xi)Tp−1 < 0, and hi = 0 otherwise. τ is the regularization parameter. Using the Ho–Kashyap iterative method (Leski, 2003), p can be computed by

where I is the identify matrix.

PCM Clustering

PCM clustering is a probability clustering based on FCM. Based on the framework of possibility theory, PCM not only takes into account the general criteria of clustering with the minimum distance within one class and the maximum distance between classes but also emphasizes the principle of the maximum membership value to avoid ordinary solution problems. The objective function of PCM is

The closed solution of U and Y can be obtained by minimizing the objective function with respect to unc and yc by.

Possibilistic Clustering in Bayesian with Interclass Competitive Learning

Objective Function

A clustering method implements data partition with some certain degree of similarity. In the clustering process, the samples of one class will have a repulsive effect on the clustering center of other classes, especially in the overlapping regions of different classes of samples; the greater the overlap density, the greater the repulsive force. In these sample overlapping regions, clustering centers of different classes form the competitive learning relationship. On the one hand, the clustering centers are attracted by samples of this class; on the other hand, the clustering centers are excluded by different classes of samples and far away from the overlapping region. In this paper, this idea is embedded into PCM clustering. Based on the Bayesian framework, we propose the possibilistic clustering in Bayesian with interclass competitive learning.

Suppose a given binary classification dataset , in which and represent two class samples and ln ∈ {+1, −1} is the class label of the nth sample. Let the cluster number of one class samples be C1 and the cluster centers of the other class Z be priorly known , where the cluster number is C2. We suppose data X follows the normal distribution, and each sample xi has an independent probability distribution. The maximum posterior estimation of data and parameters in X1 is expressed by

where is the unknown cluster center matrix of one class sample. By taking the logarithm of Equation (14), the objective function of PCB-ICL method can be obtained as

From Equations (14) and (15), we can see that (1) the PCB-ICL method shows the competition relationship between clustering centers of different classes. Different from the traditional PCM clustering method, PCB-ICL not only considers the label information of samples but also considers the competition relationship between clustering centers, as shown in the first two items. On the premise that the clustering centers of the other class are priorly known, the clustering centers of the current class will inevitably have a competition relationship with these known clustering centers in the overlapping region. (2) Due to simultaneously utilizing the global distribution structure and the discrimination information of the samples, the obtained antecedent part of fuzzy rules by PCB-ICL can realize the clarity of fuzzy space partition and enhance the interpretability of the fuzzy rules.

Parameter Learning

To obtain the optimal fuzzy partition matrix U, the PCB-ICL method uses the Metropolis–Hastings method (Chib and Greenberg, 1995; Elvira et al., 2017) to construct a Markov chain to make p(U|X1, Y) stable. The conditional distribution p(U|X1, Y) is proportional to the joint distribution p(X1, U, Y) when the sample and clustering center are known and also is proportional to the conditional distribution p(U|X1, Y). Therefore, we only need compute p(xn, un|Y) of the sample xn:

Thus, the process of the ith iteration of the Markov chain is

  • 1) Generate a new state of un with a uniform distribution as

  • 2) The newly generated membership is accepted by the probability au as

    Then accepting au as the current state with probability un,

    where μ is a random number in [0, 1]. The distribution of the new state obtained by sampling is independent of the current sample, and the state is independent, so au does not need Hasting correction.

  • 3) Compare and , where Y* and are the optimal values of Y and un. If, is replaced by .

    When the matrix U is fixed, we use Metropolis–Hastings to sample the conditional distribution p(Y|X, U). In this case, p(Y|X, U) is proportional to the joint distribution p(X, U, Y). We estimate yc by using the Gaussian distribution as

where centers on the current value yc. σ is a positive number and is used to control the compactness of cluster centers. In the experiment, we empirically set σ to 10.

For the newly generated , it is independent of other clustering centers. Then the conditional distribution p(X, yc|U) is represented by

Similarly, the newly generated membership is accepted by the probability ay as

Since the Gaussian distribution is symmetric, ay does not need Hasting correction.

Finally, we compute p(X, U*, Y*) using Equation (15) and compare it with the current p(X, U, Y). If p(X, U, Y) > p(X, U*, Y*), the {U, Y} is replaced by {U*, Y*}.

Based on the above analysis, we give the procedure of the PCB-ICL method in Algorithm 1.

Algorithm 1

Input: Dataset X1 of one class, the number of clustering C, priorly known clustering center matrix Z of the other class;
Output: Fuzzy partition matrix U* and clustering center matrix Y*.
Step 1 Initiate ;
Step 2 Initiate , ∀c;
Step 3 Set = , ;
For iter = 1, 2, …, Niter
      For n = 1, 2, …, N
      Step 4 Sample using Equation (17) and accept it as un using Equations (18) and (19);
      Step 5 If , then = ;
      Endfor
      For c = 1, 2, …, C
      Step 6 Sample using Equation (20) and accept it as using Equations (21) and (22);
      Step 7 If > , then ;
      Endfor
Step 8 If p(X, U*, Y*) > p(X, U, Y), then U*← U, Y*← Y;
Endfor.

PCB-ICL method.

Noise-Insensitive TSK Fuzzy System via Interclass Competitive Learning

Antecedent Parameter Learning in PCB-ICL-TSK

In this section, we compute the antecedent parameters in PCB-ICL-TSK. The premise of PCB-ICL clustering in Algorithm 1 is that the clustering centers of other class are priorly known, which is obviously not feasible in practical application. To perform the fuzzy partition on the whole data set, we take the strategy of an alternating cycle to perform Algorithm 1 on different classes. In this case, the clustering results of one class influence the ones of the other class. Taking binary classification as an example, we perform Algorithm 1 on positive class X1 and negative class X2 alternately. The detailed fuzzy partition of the whole data is shown in Algorithm 2.

Algorithm 2

Input: Two class samples X1 and X2, the numbers of clustering C1 and C2 in two classes;
Output: Fuzzy partition matrix U(1)*, U(2)* and clustering center matrix Y(1)*, Y(2)*.

Step 1 Initiate in two classes;
Step 2 Initiate in two classes;
Step 3 Set , , , ;
iter = 0;
Do
      Step 4 Perform Algorithm 1 on X1;
      Step 5 Perform Algorithm 1 on X2;
iter = iter + 1;
Until Y(1)* is |Y(1)*(ν) – Y(1)*(ν – 1)| ≤ ε or iter > Niter

Fuzzy partition on the whole data.

The numbers of clustering in two classes are C1 and C2, and the cluster centers in two classes are Y1 and Y2, respectively. After applying Algorithm 2 on the whole data, the center matrix Y can be described by Y* = [Y(1)*; Y(2)*].

Consequent Parameter Learning in PCB-ICL-TSK

In this section, we compute the noise-insensitive consequent parameters in PCB-ICL-TSK. As discussed before, using the obtained the antecedent parameters, the dataset is represented as , where . Defining the vector , the consequent vector can be computed by

where the vector and w is the decision threshold. If we multiply Equation (23) by the class label, Equation (23) is represented as li(p*)Td(xi) ≥ 0 (i = 1, …, N). Then, the vector p* can be computed by

In particular, ε0 = 1 leads to the classical SVM. For simplicity, we set ε0 = 1, and Equation (24) can be written as li(p*)Td(xi) ≥ 1. Thus, Equation (24) can be written as

Denote the matrix D = [l1d(x1)T, l2d(x2)T, …, lNd(xN)T]T and the error vector e = D*p*1. Equation (25) can be rewritten as

where the matrix H = (λ/N)diag(h1, h2, …, hN), with hi = 0 for error ei ≥ 0 and 1 otherwise.

However, the misclassification error in Equation (24) is noise sensitive. To further improve the robustness of the TSK fuzzy system, we use the asymmetric expectile term, which is noise insensitive, especially to noise around the decision boundary. The weight hi of the ith sample can be expressed by

where hi is the q (lower) expectile parameter. Obviously, when q = 0, the loss term obtained in Equation (27) is equal to the hinge loss, and when q = 0.5, the loss term is equal to the l2 loss in Huang et al. (2014a,b).

At the same time, considering the regularization term, Equation (26) can be rewritten as

where τ is the regularization parameter. p*(k), H(k), and e(k) are the kth iteration of p*, H, and e, respectively.

The condition for optimality of Equation (28) in the kth iteration is obtained by setting dJ/dp* = 0:

where is the identity matrix with the last element on the main diagonal set to 0.

The consequent parameter learning in IB-TSK-FC on dataset X is shown in Algorithm 3.

Algorithm 3

Input: The dataset X; the number of clusters (C1 + C2); the cluster centers Y1, Y2 and the membership matrix U1, U2; the expectile parameter q; and the regularization parameter τ;
Output: Consequent parameters p0.

Step 1 Run Algorithm 2 to obtain the antecedent parameters;
Step 2 Compute the membership function d(xi)= by using Equations (5)–(7);
k = 0;
Do
Step 3 Obtain the parameters (p*)(k) using Equation (29);
Step 4 Compute the parameter e(k) using e(k) = D(p*)(k)1;
Step 5 Compute the parameter H(k+1) using Equation (27);
k = k + 1;
Until p* is convergence or k > kmax.

Learning algorithm for consequent parameters.

Experiment

Experimental Settings

The real-world EEG signals have characters of high dimensionality and instability. Feature extraction is a necessary stage before classification for EEG signal recognition. In general, time domain and frequency domain feature extractions are two types of feature extraction methods (Wen and Zhang, 2017). In our experiments, we extract EEG features using kernel principal component analysis (KPCA) and short-time Fourier transform (STFT) (Blanco et al., 1997). The former is the time domain feature extraction, and the latter is the frequency domain feature extraction. In the experiment, we design eight classification tasks, namely, four binary classification and four three-class classification tasks, as shown in Table 2. We corrupt the original datasets with different amounts of random noises at 5, 10, and 15% noise levels.

TABLE 2

TasksNumber of classesDatasets
T1Two classesA and C
T2A and E
T3B and D
T4B and E
T5Three classesA, C and E
T6A, D, and E
T7B, C, and E
T8B, D, and E

EEG classification tasks in the experiment.

The experimental environment in this study is a computer with Intel Core i3-3317U 3.40-GHz CPU and 8-GB RAM. To validate the performance of MST-TSK, we compare three fuzzy systems (FS-FCSVM; et al., 2007, ε-margin-TSK-FS; Leski, 2005, and IB-TSK-FC; Gu et al., 2017b) and two robust classification methods (CS-SVM; Iranmehr et al., 2019 and FRSVM-ANCH; Gu et al., 2019). The Gaussian kernel is used for two SVM methods. The parameter settings for all methods are listed in Table 3. All parameters are obtained by a 5-fold cross-validation strategy.

TABLE 3

MethodsParameter settings
ε-margin-TSK-FS FS-FCSVMNumber of rules ∈ {1, 2, …, 12}, regularization parameter ∈ {10−3, 10−2, …, …, 103}, scale parameter ∈ {0.42, 0.62, …, 32}, fuzzy index = 2
IB-TSK-FCModel sparsity parameter ∈ {1, 2, …, 6}, fuzzy index = 2, number of particles = 10, convergence thresholds = 10−3, convergence threshold = 10–3, strength parameter = 3
CS-SVMGaussian kernel parameter ∈ {10−2, 10−1, …, …, 102}, regularization parameter ∈ {10−3, 10−2, …, …, 103}
FRSVM-ANCHGaussian kernel parameter ∈ {10−2, 10−1, …, …, 102}, regularization parameter ∈ {10−3, 10−2, …, …, 103}, pinball loss parameter = 0.05
PCB-ICL-TSKFuzzy index = 2, number of rules ∈ {1, 2, …, 12}, convergence threshold = 10−3, strength parameter = 3, expectile parameter = 0.05, maximum number of iterations = 1,000 and 200 in Algorithms 1 and 2, respectively

Parameter settings for all methods in the experiment.

Classification Performance Comparison

In this section, eight EEG classification tasks are used to verify the classification performance of PCB-ICL-TSK. Tables 4, 5 show the experimental results of six classification algorithms using STFT and KPCA feature extraction methods at the 5% noise level. Tables 6, 7 show the experimental results of six classification methods using STFT and KPCA feature extraction methods at the 10% noise level. Tables 8, 9 show the experimental results of six classification methods using STFT and KPCA feature extraction methods at the 15% noise level. From the experimental results, it can be seen that the noise data seriously affect the classification performance of the method. During the learning process, considering the noise of the data is helpful to promote the classification performance. Therefore, the performances of FS-FCSVM, ε-margin-TSK-FS, and IB-TSK-FC are poor. CS-SVM, FRSVM-ANCH, and PCB-ICL-TSK are not sensitive to noise, and they can achieve good classification results. In particular, PCB-ICL-TSK shows excellent classification performance in different levels of noise occasions, and it reflects strong robustness. Since PCB-ICL-TSK uses the PCB-ICL and Ho–Kashyap procedure with an asymmetric expectile term to compute antecedent and consequent parameters of fuzzy rules, it is noise insensitive. In addition, in the Bayesian framework, PCB-ICL obtains global optimal clustering results, and the strategy of competitive relationship of clustering centers can enhance the interpretability of the antecedents of fuzzy rules.

TABLE 4

TasksFS-FCSVMε-margin-TSK-FSIB-TSK-FCCS-SVMFRSVM-ANCHPCB-ICL-TSK
Task 194.5694.6995.2296.2796.7396.74
Task 294.8894.8195.3496.1996.5196.62
Task 394.6494.5795.3996.4596.7096.69
Task 494.2194.3495.5796.5896.5496.67
Task 593.4693.2593.9896.2796.3296.29
Task 693.3793.4293.9695.8996.1496.12
Task 793.2593.3493.8695.7895.9796.04
Task 893.3893.2993.7595.8696.0996.10

The classification accuracy for the 5% noise level using STFT features.

The bold values indicate the best classification performance in the tasks.

TABLE 5

TasksFS-FCSVMε-margin-TSK-FSIB-TSK-FCCS-SVMFRSVM-ANCHPCB-ICL-TSK
Task 194.5394.7095.2396.2896.6396.75
Task 294.9294.7895.3996.2396.5096.59
Task 394.6294.5595.4096.4296.7296.72
Task 494.2394.3395.5696.6296.5196.65
Task 593.4793.2593.9596.2696.3396.31
Task 693.3893.4493.9795.9496.1196.19
Task 793.2393.3693.8595.7995.9596.01
Task 893.3793.3493.7695.8996.1096.11

The classification accuracy for the 5% noise level using KPCA features.

The bold values indicate the best classification performance in the tasks.

TABLE 6

TasksFS-FCSVMε-margin-TSK-FSIB-TSK-FCCS-SVMFRSVM-ANCHPCB-ICL-TSK
Task 192.7992.8493.4494.6495.0995.29
Task 293.1293.0193.5894.5795.2195.28
Task 392.7692.7693.5994.8695.3995.30
Task 492.4892.5293.7695.0394.9995.21
Task 591.6091.4592.1694.6394.6694.93
Task 691.6191.6592.2394.2394.7194.70
Task 791.4491.5892.0994.1894.6194.68
Task 891.5891.5591.9494.1994.5694.67

The classification accuracy for the 10% noise level using STFT features.

The bold values indicate the best classification performance in the tasks.

TABLE 7

TasksFS-FCSVMε-margin-TSK-FSIB-TSK-FCCS-SVMFRSVM-ANCHPCB-ICL-TSK
Task 192.7592.8093.3994.6195.2495.27
Task 293.1292.9893.5394.5695.2195.21
Task 392.7792.7293.6394.8595.1495.26
Task 492.4992.5893.8495.0894.9595.20
Task 591.6391.4292.2094.6694.7794.96
Task 691.5391.6692.2294.2494.5894.69
Task 791.4591.5292.1094.2494.5194.68
Task 891.6191.5891.8794.1994.5494.69

The classification accuracy for the 10% noise level using KPCA features.

The bold values indicate the best classification performance in the tasks.

TABLE 8

TasksFS-FCSVMε-margin-TSK-FSIB-TSK-FCCS-SVMFRSVM-ANCHPCB-ICL-TSK
Task 190.7390.7791.3992.9893.4893.88
Task 291.1390.9491.4992.8493.8593.90
Task 390.7090.7691.5393.1893.7093.88
Task 490.4890.5791.8493.3393.8593.84
Task 589.6889.3990.1992.9193.0793.57
Task 689.5589.7490.2092.5992.9293.33
Task 789.4989.4890.0592.4893.0493.28
Task 889.6289.6789.8592.5093.2893.32

The classification accuracy for the 15% noise level using STFT features.

The bold values indicate the best classification performance in the tasks.

TABLE 9

TasksFS-FCSVMε-margin-TSK-FSIB-TSK-FCCS-SVMFRSVM-ANCHPCB-ICL-TSK
Task 190.7390.7791.3992.9693.4793.87
Task 291.1390.9491.4992.8693.3493.82
Task 390.7090.7691.5393.2093.7393.90
Task 490.4890.5791.8493.3393.8293.82
Task 589.6889.3990.1992.9393.1293.49
Task 689.5589.7490.2092.5292.9193.32
Task 789.4989.4890.0592.4792.9993.24
Task 889.6289.6789.8592.4993.2093.22

The classification accuracy for the 15% noise level using KPCA features.

The bold values indicate the best classification performance in the tasks.

Interpretability Comparison

In this section, we compare the number of fuzzy rules of four fuzzy systems in Task 8. Figures 3, 4 show the number of fuzzy rules on the 5 and 15% noise levels for four fuzzy systems using KPCA features. Figures 5, 6 show the number of fuzzy rules on the 5 and 15% noise levels for four fuzzy systems using STFT features. From the results in Figures 36, compared with the three fuzzy systems, the number of fuzzy rules obtained by PCB-ICL-TSK is the least in all EEG classification tasks. It is known that for fuzzy systems, the interpretability of fuzzy rules is related to the number of fuzzy rules and the definition of fuzzy subsets. The fuzzy membership function obtained by PCB-ICL on Task 1 at the 5% noise level using KPCA features is shown in Figure 7. Because PCB-ICL clustering considers the influence of clustering centers of different classes in the process of clustering, that is, the competition relationship between different classes of clustering centers, PCB-ICL clustering can obtain clustering centers with a large interval, which guarantees the partition clarity of feature space and the classification accuracy of the obtained fuzzy system and the interpretation of rules.

Figure 3

Figure 4

Figure 5

Figure 6

Figure 7

Conclusion

The noise-insensitive PCB-ICL-TSK fuzzy system is proposed in this paper. In the learning of rule antecedent parameters, the proposed noise-insensitive PCB-ICL clustering based on the Bayesian probability model is used. PCB-ICL clustering considers the repulsion between different clustering centers, which can ensure the interpretability of the rule antecedent. PCB-ICL can learn the global optimal solution of clustering results by using the Markov model. PCB-ICL-TSK learns consequent parameters using the Ho–Kashyap procedure with an asymmetric expectile term. Thus, it not only has strong noise resistance but also has high classification performance. The experimental results of a real EEG dataset show that PCB-ICL-TSK has achieved satisfactory results in classification performance and high interpretability. Our future work is to further improve its practicability when the sample dimension is large.

Statements

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: The dataset analyzed for this study can be found in the Department of Epileptology University of Bonn [http://epileptologie-bonn.de/cms/upload/workgroup/lehnertz/eegdata.html].

Author contributions

TN and XG conceived and developed the theoretical framework of the manuscript. All authors carried out the experiment and data process and drafted the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 61806026 and by the Natural Science Foundation of Jiangsu Province under Grant BK20180956.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    AhmadlouM.AdeliH. (2011). Functional community analysis of brain: a new approach for EEG-based investigation of the brain pathology. Neuroimage58, 401408. 10.1016/j.neuroimage.2011.04.070

  • 2

    AkhavanA.MoradiM. H. (2018). Detection of concealed information using multichannel discriminative dictionary and spatial filter learning. IEEE Trans. Inform. Foren. Secur. 13, 26162627. 10.1109/TIFS.2018.2825940

  • 3

    BezdekJ.EhrlichR.FullW. (1984). FCM: the fuzzy c-means clustering algorithm. Comp. Geosci. 10, 191203. 10.1016/0098-3004(84)90020-7

  • 4

    BlancoS.KochenS.RossoO. A.SalgadoP. (1997). Applying time frequency analysis to seizure EEG activity. IEEE Eng. Med. Biol. Mag. 16, 6471. 10.1109/51.566156

  • 5

    ChibS.GreenbergE. (1995). Understanding the metropolis-hastings algorithm. Am. Stat. 49, 327335. 10.1080/00031305.1995.10476177

  • 6

    CuryC.MaurelP.GribonvalR.BarillotC. (2019). A sparse EEG-informed fMRI model for hybrid EEG-fMRI neurofeedback prediction. Front. Neurosci.13:1451. 10.3389/fnins.2019.01451

  • 7

    ElviraV.MíguezJ.DjurićP. M. (2017). Adapting the number of particles in sequential Monte Carlo methods through an online scheme for convergence assessment. IEEE Trans. Signal Process.65, 17811794. 10.1109/TSP.2016.2637324

  • 8

    GlennT. C.ZareA.GaderP. D. (2015). Bayesian fuzzy clustering. IEEE Trans. Fuzzy Syst. 23, 15451561. 10.1109/TFUZZ.2014.2370676

  • 9

    GuX.ChungF.WangS. (2017a). Bayesian Takagi-Sugeno-Kang fuzzy classifier. IEEE Trans. Fuzzy Syst. 25, 16551671. 10.1109/TFUZZ.2016.2617377

  • 10

    GuX.ChungF. L.IshibuchiH.WangS. (2017b). Imbalanced TSK fuzzy classifier by cross-class Bayesian fuzzy clustering and imbalance learning, IEEE Trans. Syst. Man Cybernet. Syst. 47, 20052020. 10.1109/TSMC.2016.2598270

  • 11

    GuX.NiT.FanY. (2019). A fast and robust support vector machine with anti-noise convex hull and its application in large-scale ncRNA data classification. IEEE Access. 7, 134730134741. 10.1109/ACCESS.2019.2941986

  • 12

    GuX.WangS. (2018). Bayesian Takagi-Sugeno-Kang Fuzzy model and its joint learning of structure identification and parameter estimation. IEEE Trans. Indust. Inform. 14, 53275337. 10.1109/TII.2018.2813977

  • 13

    GummadavelliA.ZaveriH. P.SpencerD. D.GerrardJ. L. (2018). Expanding brain-computer interfaces for controlling epilepsy networks: novel thalamic responsive neurostimulation in refractory epilepsy. Front. Neurosci.12:474. 10.3389/fnins.2018.00474

  • 14

    HossainM. S.AminS. U.AlsulaimanM.MuhammadG. (2019). Applying deep learning for epilepsy seizure detection and brain mapping visualization, ACM Trans. Multimed. Comput. Commun. Appl.15, 117. 10.1145/3241056

  • 15

    HuangX. L.ShiL.PelckmansbK.SuykensJ. A. K. (2014a). Asymmetric ν-tube support vector regression. Comput. Stat. Data Anal. 77, 371382. 10.1016/j.csda.2014.03.016

  • 16

    HuangX. L.ShiL.SuykensJ. A. K. (2014b). Support vector machine classifier with pinball loss. IEEE Trans. Pattern Anal. Mach. Intell. 36, 984997. 10.1109/TPAMI.2013.178

  • 17

    HusseinR.PalangiH.WardR. K.WangZ. J. (2019). Optimized deep neural network architecture for robust detection of epileptic seizures using EEG signals. Clin. Neurophysiol. 130, 2537. 10.1016/j.clinph.2018.10.010

  • 18

    IranmehrA.ShiraziH. M.VasconcelosN. (2019). Cost-sensitive support vector machines. Neurocomputing343, 5064. 10.1016/j.neucom.2018.11.099

  • 19

    JiangY.DengZ.ChungF.WangG.QianP.ChoiK. S.et al. (2017c). Recognition of epileptic EEG signals using a novel multiview TSK fuzzy system. IEEE Trans. Fuzzy Syst. 25, 320. 10.1109/TFUZZ.2016.2637405

  • 20

    JiangY.DengZ.ChungF.WangS. (2015). Multi-task TSK fuzzy system modeling using inter-task correlation information. Inform. Sci. 298, 512533. 10.1016/j.ins.2014.12.007

  • 21

    JiangY.DengZ.ChungF.WangS. (2017b). Realizing two-view TSK fuzzy classification system by using collaborative learning. IEEE Transac. Syst. Man Cybernet. Syst. 47, 145160. 10.1109/TSMC.2016.2577558

  • 22

    JiangY.WuD.DengZ.QianP.WangJ.WangG.et al. (2017a). Seizure classification from EEG signals using transfer learning, semi-supervised learning and TSK fuzzy system. IEEE Trans. Neural Syst. Rehabil. Eng. 25, 22702284. 10.1109/TNSRE.2017.2748388

  • 23

    JuangC. F.ChiuS. H.ShiuS. J. (2007). Fuzzy system learned through fuzzy clustering and support vector machine for human skin color segmentation. IEEE Trans. Syst. Man Cybernet. Part A Syst. Hum. 37, 10771087. 10.1109/TSMCA.2007.904579

  • 24

    KabirE.ZhangY. (2016). Epileptic seizure detection from EEG signals using logistic model trees. Brain Inform. 3, 93100. 10.1007/s40708-015-0030-2

  • 25

    KrishnapuramR.KellerJ. M. (1993). A possibilistic approach to clustering. IEEE Trans. Fuzzy Syst. 1, 98110. 10.1109/91.227387

  • 26

    LeskiJ. M. (2003). Ho-Kashyap classifier with generalization control. Pattern Recogn. Lett. 24, 22812290. 10.1016/S0167-8655(03)00054-0

  • 27

    LeskiJ. M. (2005). TSK-fuzzy modeling based on ε - insensitive learning. IEEE Trans. Fuzzy Syst. 13, 181193. 10.1109/TFUZZ.2004.840094

  • 28

    LeskiJ. M. (2015). Fuzzy (c+p)-means clustering and its application to a fuzzy rule-based classifier: towards good generalization and good interpretability. IEEE Trans. Fuzzy Syst. 23, 802812. 10.1109/TFUZZ.2014.2327995

  • 29

    LiX.YangH.YanJ.WangX.LiX.YuanY. (2019). Low-intensity pulsed ultrasound stimulation modulates the nonlinear dynamics of local field potentials in temporal lobe epilepsy. Front. Neurosci.13:287. 10.3389/fnins.2019.00287

  • 30

    LiuC. L.XiaoB.HsaioW. HTsengV. S. (2019). Epileptic seizure prediction with multi-view convolutional neural networks. IEEE Access. 7, 170352170361. 10.1109/ACCESS.2019.2955285

  • 31

    Martinez-VargasJ. D.StrobbeG.VonckK.Van MierloP.Castellanos-DominguezG. (2017). Improved localization of seizure onset zones using spatiotemporal constraints and time-varying source connectivity. Front. Neurosci.11:156. 10.3389/fnins.2017.00156

  • 32

    QiF.LiY.WuW. (2017). RSTFC: a novel algorithm for spatio-temporal filtering and classification of single-trial EEG. IEEE Trans. Neural Netw. Learn. Syst. 26, 30703082. 10.1109/TNNLS.2015.2402694

  • 33

    RazzakI.HameedI. A.XuG. D. (2019). Robust sparse representation and multiclass support matrix machines for the classification of motor imagery EEG signals. IEEE J. Transl. Eng. Health Med. 7, 21682372. 10.1109/JTEHM.2019.2942017

  • 34

    SalgadoC. M.ViegasJ. L.AzevedoC. S.FerreiraM. C.VieiraS. M.SousaJ. M. C. (2017). Takagi-Sugeno fuzzy modeling using mixed fuzzy clustering. IEEE Trans. Fuzzy Syst.25, 14171429. 10.1109/TFUZZ.2016.2639565

  • 35

    SiulyS.LiY. (2015). Designing a robust feature extraction method based on optimum allocation and principal component analysis for epileptic EEG signal classification. Comput. Methods Prog. Biomed. 119, 2942. 10.1016/j.cmpb.2015.01.002

  • 36

    SreejS. R.SamantaD. (2019). Classification of multiclass motor imagery EEG signal using sparsity approach. Neurocomputing368, 133145. 10.1016/j.neucom.2019.08.037

  • 37

    TakagiT.SugenoM. (1985). Fuzzy identification of systems and its application to modeling and control,. Trans. Syst. Man Cybernet.15, 116132. 10.1109/TSMC.1985.6313399

  • 38

    TruongN. D.NguyenA.KuhlmannD. L.BonyadiM. R.YangJ. W.IppolitoS.et al. (2018). Integer convolutional neural network for seizure detection. IEEE J. Emerg. Select. Top. Circuits Syst. 8, 849857. 10.1109/JETCAS.2018.2842761

  • 39

    TzallasA. T.TsipourasM. G.FotiadisI. D. (2009). Epileptic seizure detection in EEGs using time-frequency analysis. IEEE Trans. Inform. Technol. Biomed. 13, 703710. 10.1109/TITB.2009.2017939

  • 40

    WenT.ZhangZ. (2017). Effective and extensible feature extraction method using genetic algorithm-based frequency-domain feature search for epileptic EEG multiclassification. Medicine96:e6879. 10.1097/MD.0000000000006879

  • 41

    XiaK.NiT.YinH.ChenB. (2020). Cross-domain classification model with knowledge utilization maximization for recognition of epileptic EEG signals. IEEE/ACM Trans. Comput. Biol. Bioinformatics. 10.1109/TCBB.2020.2973978. [Epub ahead of print].

Summary

Keywords

noise insensitive, TSK fuzzy system, Bayesian framework, possibilistic clustering, Ho–Kashyap procedure, asymmetric expectile term

Citation

Ni T, Gu X and Zhang C (2020) An Intelligence EEG Signal Recognition Method via Noise Insensitive TSK Fuzzy System Based on Interclass Competitive Learning. Front. Neurosci. 14:837. doi: 10.3389/fnins.2020.00837

Received

21 June 2020

Accepted

20 July 2020

Published

04 September 2020

Volume

14 - 2020

Edited by

Mohammad Khosravi, Persian Gulf University, Iran

Reviewed by

Shan Zhong, Changshu Institute of Technology, China; Juan Yang, Suzhou University, China

Updates

Copyright

*Correspondence: Xiaoqing Gu

This article was submitted to Neuroprosthetics, a section of the journal Frontiers in Neuroscience

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics