EEG-based finger movement classification with intrinsic time-scale decomposition

Degirmenci, Murside; Yuce, Yilmaz Kemal; Perc, Matjaž; Isler, Yalcin

doi:10.3389/fnhum.2024.1362135

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 05 March 2024

Sec. Brain-Computer Interfaces

Volume 18 - 2024 | https://doi.org/10.3389/fnhum.2024.1362135

This article is part of the Research Topic: Recent Advancements in Brain-Computer Interfaces-based Limb Rehabilitation

Murside Degirmenci¹

Yilmaz Kemal Yuce²

Matjaž Perc³,⁴,⁵,⁶

Yalcin Isler⁷*

¹Department of Biomedical Technologies, Izmir Katip Celebi University, Izmir, Türkiye
²Department of Computer Engineering, Alanya Alaaddin Keykubat University, Alanya, Antalya, Türkiye
³Faculty of Natural Sciences and Mathematics, University of Maribor, Maribor, Slovenia
⁴Department of Medical Research, China Medical University Hospital, China Medical University, Taichung, Taiwan
⁵Complexity Science Hub Vienna, Vienna, Austria
⁶Department of Physics, Kyung Hee University, Seoul, Republic of Korea
⁷Department of Biomedical Engineering, Izmir Katip Celebi University, Izmir, Türkiye

Introduction: Brain-computer interfaces (BCIs) are systems that acquire the brain's electrical activity and provide control of external devices. Since electroencephalography (EEG) is the simplest non-invasive method to capture the brain's electrical activity, EEG-based BCIs are very popular designs. Aside from classifying the extremity movements, recent BCI studies have focused on accurately decoding the movements of individual fingers on the same hand by means of machine learning techniques. State-of-the-art studies have coded five finger movements while neglecting the brain's idle case (i.e., the state in which the brain is not performing any mental task). This omission may cause more false positives and dramatically degrade the classification performance and, thus, the performance of BCIs. This study aims to propose a more realistic system to decode the movements of five fingers and the no mental task (NoMT) case from EEG signals.

Methods: In this study, a novel praxis for feature extraction is utilized. Features for classification are extracted from Proper Rotation Components (PRCs) computed through Intrinsic Time-Scale Decomposition (ITD), which has recently been applied successfully to different biomedical signals. Subsequently, these features were fed to well-known classifiers and their different implementations to discriminate between these six classes. The highest classifier performances obtained in both subject-independent and subject-dependent cases were reported. In addition, ANOVA-based feature selection was examined to determine whether statistically significant features have an impact on the classifier performances.

Results: The Ensemble Learning classifier achieved the highest accuracy (55.0%) among the tested classifiers, and ANOVA-based feature selection increased the performance of classifiers on five-finger movement determination in EEG-based BCI systems.

Discussion: Compared with similar studies, the proposed praxis achieved a modest yet significant improvement in classification performance even though the number of classes was increased by one (i.e., NoMT).

1 Introduction

Neuroimaging covers various direct and indirect techniques used to visualize both the structure and the function of the nervous system. These methods include MRI (Magnetic Resonance Imaging), CT (Computed Tomography), PET (Positron Emission Tomography), and EEG (electroencephalography). Among them, aside from being non-invasive, EEG retains advantages over the others such as high temporal resolution, easy accessibility, and low cost. Since EEG can capture brain activity in real time with millisecond precision, it has become popular in neuroscience research, clinical diagnostics, and BCI (Brain-Computer Interface) systems. BCIs translate neural signals into commands for controlling external devices through software applications. Recently, researchers have begun analyzing EEG patterns linked to particular finger movements. BCIs engineered to decipher these patterns offer the prospect of individuals operating external devices or interfaces solely through brain activity, eliminating the need for physical muscle movements. This advancement holds promise for building prosthetic hands capable of individual finger control, managing numerous devices, facilitating neurorehabilitation, and extending into applications within the gaming and entertainment industries (Aricò et al., 2018). In the following subsections, after a literature review on both brain-computer interfaces and state-of-the-art finger movement classification studies, we state our aim, our contributions to the literature, and the structural organization of this article, respectively.

1.1 Brain-computer interfaces

BCIs are computer-assisted systems that record the brain's electrical signals using different brain monitoring techniques, analyze the signals on the interface, and convert them to specific commands to control external devices such as computers, wheelchairs, and prostheses without any physical movement (Belkacem et al., 2020). Consequently, BCI technology can help people suffering from various motor disabilities, such as stroke patients, to communicate with the outside world and indirectly perform motor functions (Wolpaw et al., 2002). Among several different neuroimaging modalities, electroencephalography (EEG) is widely used to capture brain activity. It is preferred for designing BCI systems because of its high temporal resolution, non-invasiveness, easy operation, relatively low cost, and portability (Vidal, 1977; Chen et al., 2015).

EEG-based BCI systems that exploit motor imagery signals associated with the movements of large body parts such as hands, feet, and tongue have been proposed to control assistive devices over the past several decades (Pfurtscheller and Neuper, 2001; Alazrai et al., 2019; Degirmenci et al., 2023). However, such systems offer only limited control dimensions for prosthetic devices; therefore, their potential to control more complex assistive devices is restricted (Sciaraffa et al., 2022). In the last decade, numerous research studies have examined the decoding of movements of fine body parts to improve such systems (Alazrai et al., 2019).

The decoding of the movements performed by various fingers of a hand may increase the control dimensions of EEG-based BCI systems. This, in turn, might enable subjects who use assistive devices to better carry out numerous skillful tasks. However, the decoding of finger movements (FM) within the same hand is considered a demanding research area among motor imagery signal analysis studies (Alazrai et al., 2019). Employing and analyzing different kinds of feature extraction methods, feature selection methods, and classification algorithms plays an important role in improving the efficiency of EEG-based BCI systems, which analyze FM and generate relevant commands from the recorded EEG data. In the literature, various feature extraction methods, feature reduction methods, and classification algorithms have been suggested for decoding FM. Different time-domain, frequency-domain, and spatial-domain EEG features have been calculated to predict FM in the past decade. The raw EEG time series (Kaya et al., 2018; Mwata-Velu et al., 2022; Zahra et al., 2022) and different amplitude-based and statistical EEG signal features (Degirmenci et al., 2024) were utilized to examine the effectiveness of the time domain. As for the spectral-domain features, different frequency-domain [Fourier transform (Kaya et al., 2018)] and time-frequency domain [Wavelet transform (Yahya et al., 2019), Short-time Fourier transform (Azizah et al., 2022), Empirical mode decomposition (Mwata-Velu et al., 2021)] representation algorithms and their various versions were investigated to classify FM. Common spatial pattern (Anam et al., 2019, 2020) and its different versions (Kato et al., 2020) are among the most frequently tested methods for spatial-domain analysis in FM classification. These different extracted features have been successfully classified using various machine learning algorithms.
However, it is a challenging scientific task to determine and choose the most efficient combination of these methods. Providing optimal and relevant features is important for improving classifier performance (Narin et al., 2014; Degirmenci et al., 2023). Therefore, the implementation of effective feature extraction and feature reduction methods is essential for facilitating the subsequent machine learning task.

1.2 State of the art for finger movement classification

In the last decade, various signal processing and classification methods have been applied to the classification of EEG signals for FM tasks, with reported accuracies of up to 91.70%.

Kaya et al. (2018) conducted a Support Vector Machine (SVM) based classification study to classify the five FM of a hand. They used a data set that they collected from a total of eight subjects who agreed to participate. They exploited the power of EEG subbands, Fourier Transform (FT) amplitudes, and EEG time series to represent 19-channel EEG signals as features. An average accuracy of 43.00% was obtained. Moreover, a subject-dependent classification study was also carried out, and the accuracies of the eight subjects were found to vary between 20.00 and 60.00%.

Anam et al. (2019) aimed to classify five FM in the subject-dependent condition using the EEG signals of four subjects. For this purpose, Common Spatial Pattern (CSP) based feature extraction was performed and the Random Forest (RF) algorithm was used for classification. The training accuracy was 100% for each subject, and the test accuracy ranged between 51.00 and 56.00%.

Azizah et al. (2022) carried out a subject-dependent FM classification study. They performed channel selection based on One vs. Rest Common Spatial Pattern (CSP-OVR), and four out of 19 EEG channels were identified as relevant channels in their study. They extracted spectrogram features from these selected channels. Their subject-dependent experimental results showed that the accuracy of classifications employing SVM ranged from 21.20 to 66.60%.

In the study conducted by Kato et al. (2020), a feature extraction process based on multi-class CSP and complex Fourier amplitudes was presented. They extracted features using 19 EEG channels for FM classification. According to their subject-dependent results, the training results of classifications carried out with the SVM algorithm were reported in the range of 23.90–58.30%.

Recently, deep learning approaches have attracted the attention of researchers in many different research areas such as disease detection from medical images (Narin and Isler, 2021), emotion recognition from biological signals (Ozdemir et al., 2021), and Electrocardiography (ECG) based arrhythmia detection (Degirmenci et al., 2022a) because these architectures provide improved classification performance. The main reason for this improvement is that the feature extraction and classification stages can be performed together in the hidden layers of deep learning structures. Considering these benefits, deep learning approaches have also been applied to the classification of FM and motor imagery tasks in the literature.

Mwata-Velu et al. (2021) performed a feature extraction process based on Empirical Mode Decomposition (EMD) using four effective EEG channels, which were selected from 19 EEG channels. They performed deep learning (BiLSTM) based subject-dependent classifications for the prediction of FM. Using EMD-based feature extraction and the deep learning structure, training accuracy values for eight subjects were in the range of 73.47–98.69%, and test accuracies were in the range of 66.00–76.13%.

In another study, Mwata-Velu et al. (2022) worked on the classification of EEG time series with a deep learning (EEGNet) structure. EEG signals of four subjects were used from a dataset that included EEG data of eight subjects, and four out of 19 EEG channels were selected for their study. In the subject-dependent analyses performed with four subjects, training accuracies were reported in the range of 80.10–91.70%.

Anam et al. (2020) proposed an FM classification model that uses CSP-based feature extraction and Autonomous Deep Learning (ADL) based classification. They used 19-channel EEG signals from four subjects for their experimental process. In the subject-dependent classifications, training performances ranged from 74.73 to 77.61%, and test performances ranged from 74.61 to 77.75%.

In another related paper, Zahra et al. (2022) evaluated the performance of a Convolutional Neural Network (CNN) based on an original study design. In their model, EEG time series were combined with sliding window (Dietterich, 2002) and noise enhancement (Mitaim and Kosko, 1998) methods to extract the features. They obtained the features from 19-channel EEG signals of eight subjects. They conducted a subject-independent FM classification and achieved a training accuracy of 57.50%.

Limbaga et al. (2022) carried out a CNN (EEGNet) based study for feature extraction and signal classification of five motor imagery classes of a hand. They reinforced their suggested model using a transfer learning approach through an EEG data set that includes 19-channel EEG signals of eight subjects. They reduced the number of EEG channels to 14 and utilized the EEG signals of only four subjects. In addition to this data set, they recorded EEG signals from a subject while the subject imagined five different hand positions. According to their subject-independent evaluations, they achieved an accuracy of 51.74% with the reinforced transfer learning model.

When the studies mentioned above, which aimed to classify the motor imagery tasks of FM of a hand, are examined, it is observed that performance remained relatively low in studies using all EEG channels and in subject-independent classification studies. The studies showed that performance improved with channel selection and subject-dependent classification. A possible cause of the low performance in FM classification is that the movements of the fingers of one hand are controlled from the same region of the motor cortex (Kaya et al., 2018). Kaya et al. (2018) investigated the event-related potential (ERP) curves of motor imagery tasks of other body limb movements together with motor imagery tasks of FM. They reported that the curves could not be clearly differentiated in the motor imagery tasks of FM. Therefore, there is a need to increase the classification performance by using effective feature extraction methods, feature selection methods, and classification algorithms for the classification of FM tasks.

1.3 The aim of the study

In 2007, an iterative signal decomposition technique known as intrinsic time-scale decomposition (ITD) was introduced to analyze nonlinear or non-stationary signals (Frei and Osorio, 2007). Recent studies have applied ITD-based approaches to the analysis of biomedical signals. ITD-based feature extraction has been used in various EEG-based studies for different objectives such as epilepsy detection (Martis et al., 2013; Degirmenci and Akan, 2020) and attention deficit hyperactivity disorder (ADHD) recognition (Karabiber Cura et al., 2023). Considering its ability to discriminate between different classes, we set out to explore whether ITD is also advantageous for classifying other biomedical signals.

In this study, therefore, we suggest a new praxis for the classification of FM tasks using ITD of EEG signals. The different modes, defined as Proper Rotation Components (PRCs), and their combinations are acquired through ITD. Various features are evaluated using the individual modes and their combinations. In addition to the ITD-based feature extraction process, the effectiveness of statistical significance-based feature selection (ANOVA) is also investigated. The extracted ITD-based features are classified by eight different machine learning algorithms (Decision Tree, Discriminant Analysis, Naive Bayes, K-Nearest Neighbors, Support Vector Machine, Ensemble Learning, Neural Networks, and Kernel Approximation). Different performance evaluation metrics are employed for the accurate evaluation of the outputs of the suggested study.

1.4 Contributions

The novel contributions of this research study are summarized as follows:

• The classification of EEG signals of FM tasks is presented using ITD signal decomposition and various feature extraction methods.

• Modes extracted by the ITD are utilized to evaluate several features, including Power, Mean, Sample Entropy, Higher-Order Spectral Moments (First Moment, Second Moment, Third Moment, Fourth Moment), and Hjorth Parameters (Activity, Mobility, Complexity).

• The first three modes ({PRC1}, {PRC2}, and {PRC3}) and their different combinations ({PRC1, PRC2}, {PRC1, PRC3}, {PRC2, PRC3}, and {PRC1, PRC2, PRC3}) are used for feature extraction, and the effectiveness of the individual modes and of their combinations is investigated separately with different machine learning algorithms.

• An appropriate and sustainable machine learning model is investigated for the proposed features to differentiate the FM tasks and improve classification performance (success rate) compared with the existing methods.

Finally, it must also be noted that this is the first study with a model that brings different combinations of PRCs extracted by ITD and various other features to classify FM tasks, to the best of our knowledge.

1.5 Paper organization

The rest of the paper is organized as follows: Section 2 presents the EEG dataset used in this study and the EEG signal analysis methods, namely ITD-based feature extraction, statistical significance-based feature selection, classifier algorithms, and performance evaluation metrics. Experimental results are given in Section 3, and the results of the proposed approaches are discussed in Section 4. The outcomes of the study are summarized in Section 5.

2 Materials and methods

This study design mainly consists of five stages that are described in Isler (2009). These are EEG Data Acquisition, ITD-based Feature Extraction, Feature Reduction, Classification, and Performance Evaluation. The processes performed in each stage were delineated with details in the sub-headings. Out of these five stages/steps, the first four stages constitute the proposed classification model. Figure 1 shows the block diagram for the proposed model with its stages/steps.

Figure 1. The block diagram of the study.

2.1 EEG dataset description

In this study, the large electroencephalographic motor imagery dataset for EEG-based BCIs presented by Kaya et al. (2018) is used. The dataset consists of motor imagery EEG signals that were recorded from 13 healthy subjects through 19 channels. The 19 EEG electrodes, together with two reference electrodes and the ground electrode, were placed according to the international 10/20 EEG electrode placement system. The researchers reported that they recorded the EEG signals using an EEG-1200 JE-912A system. They performed an individual motor imagery experiment based on the movements of 10 different body limbs for four different BCI interaction paradigms. Among these paradigms, Paradigm #1 (CLA), the classical left/right-hand motor imagery paradigm, includes three imageries: left-hand movement, right-hand movement, and one passive mental imagery in which subjects remained neutral and performed no motor imagery. Paradigm #2 (HaLT), the hand/leg/tongue motor imagery paradigm, contains six tasks; it extends the 3-state CLA paradigm with motor imagery tasks of right and left foot movement and tongue movement. Paradigm #3 (5F), the 5-finger motor imagery paradigm, includes imageries of the movements of the five fingers of one hand. During the tasks given for different fingers, subjects performed the corresponding imagery as a flexion of the relevant finger up or down. Finger movement imageries were coded as follows: Thumb (Class 1), Index finger (Class 2), Middle finger (Class 3), Ring finger (Class 4), and Pinkie finger (Class 5). Paradigm #4 (NoMT), which means "no imagery, visual stimuli only", is the case in which no visual stimulus is presented to the subjects and they passively watch the computer screen. In this study, we aimed to carry out a 6-class classification using the 5F and NoMT paradigms. During the recording of EEG signals, the action signal remained on the screen for 1 s to prompt the corresponding motor imagery.
At the end of the given time, the task was no longer shown on the screen, and there was a pause of 1.5–2.5 s until the next task. In this dataset, two different sampling frequencies, 200 and 1,000 Hz, were used for the experiments. EEG signals recorded with a 1,000 Hz sampling frequency were used in this study. During the recording of EEG signals acquired at 1,000 Hz, a 0.53–100 Hz band-pass filter was applied to the signals using hardware filters. In addition, a 50 Hz notch filter was applied to reduce electrical grid interference. Before performing the feature extraction and the following steps, to have a balanced distribution among the classes and provide an adjusted chance level (Galiotta et al., 2022), 100 samples of 1,000 Hz EEG signals were taken for each class of the 5F (five classes) and NoMT (one class) paradigms as the preprocessing stage. Hence, a total of 600 trials were used for each subject. After obtaining the 5F and NoMT EEG signals for each subject, each EEG segment was decomposed into a finite number of PRCs by applying ITD.
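The class balancing described above (100 trials for each of the six classes, 600 per subject) can be sketched as follows. This is only an illustration: the text does not say how the 100 trials per class were chosen, so the random draw, the function name `balanced_trial_indices`, and the use of label 6 for NoMT are assumptions.

```python
import numpy as np

def balanced_trial_indices(labels, n_per_class=100, seed=0):
    """Pick the same number of trials from every class (hypothetical helper)."""
    labels = np.asarray(labels)
    rng = np.random.default_rng(seed)
    picks = []
    for c in np.unique(labels):
        candidates = np.flatnonzero(labels == c)   # trial indices of class c
        # Assumption: trials are drawn at random without replacement.
        picks.append(rng.choice(candidates, size=n_per_class, replace=False))
    return np.sort(np.concatenate(picks))

# Toy label vector: classes 1-5 are the fingers, 6 stands for NoMT (assumed code).
labels = np.repeat([1, 2, 3, 4, 5, 6], [150, 130, 120, 140, 110, 200])
idx = balanced_trial_indices(labels)
chance_level = 1.0 / len(np.unique(labels[idx]))   # 1/6 for six balanced classes
```

With balanced classes, the theoretical chance level of the 6-class problem is 1/6, which is the reference against which the reported accuracies can be read.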

2.2 Intrinsic time-scale decomposition (ITD)

ITD was introduced by Frei and Osorio for precise time-frequency-energy (TFE) analysis of signals (Frei and Osorio, 2007). ITD decomposes a signal into (i) a sum of PRCs and (ii) a monotonic trend, without the need for laborious and ineffective sifting or splines. It is an iterative decomposition algorithm for the analysis of nonlinear and non-stationary signals that decomposes the original signal into a low-frequency component, known as the baseline signal (L_t), and high-frequency components, known as proper rotation components (H_t). ITD preserves precise temporal information (Frei and Osorio, 2007; Voznesensky and Kaplun, 2019; Degirmenci and Akan, 2020).

For the application of ITD, suppose there is an EEG signal X_t to be processed. To extract the low-frequency component (“baseline signal”) from the EEG signal, an operator 𝔏 is introduced and the remainder is the high-frequency component (“proper rotation”). Hence, the EEG signal X_t is defined as in Equation 1.

\begin{array}{l} X_{t} = 𝔏 X_{t} + (1 - 𝔏) X_{t} = L_{t} + H_{t} & (1) \end{array}

where the baseline signal is indicated as L_t = 𝔏X_t, and the proper rotation component is indicated as H_t = (1 − 𝔏)X_t. The extraction of the baseline and proper rotation components is explained in detail in the following three steps (Frei and Osorio, 2007; Martis et al., 2013; Voznesensky and Kaplun, 2019):

• Let X_t, t ≥ 0, be a real-valued signal, and let τ_k, k = 1, 2, ⋯ denote its local extrema. For brevity, X_k = X(τ_k) denotes the value of the signal at τ_k and L_k = L(τ_k) denotes the value of its baseline at τ_k.

• We assume that L_t and H_t have been defined over the interval [0, τ_k], and that X_t is available for [0, τ_{k+2}]. The baseline extraction operator 𝔏 is then defined as a piece-wise linear function of the signal on the interval (τ_k, τ_{k+1}] between two successive extrema, as given in Equations 2, 3.

\begin{array}{l} L_{t} = L_{k} + (\frac{L_{k + 1} - L_{k}}{X_{k + 1} - X_{k}}) (X_{t} - X_{k}), t ∈ (τ_{k}, τ_{k + 1}] & (2) \end{array}

where

\begin{array}{l} L_{k + 1} = α [X_{k} + (\frac{τ_{k + 1} - τ_{k}}{τ_{k + 2} - τ_{k}}) (X_{k + 2} - X_{k})] + (1 - α) X_{k + 1}, & (3) \end{array}

and 0 < α < 1 is typically set to $α = \frac{1}{2}$ . The baseline signal L_t is constructed in this way to preserve the monotonicity of X_t between extrema. Hence, the baseline signal is constructed as a linearly transformed contraction of the original signal in conformity with Equations 2, 3.

• Once the baseline signal is defined, the residual or high-frequency component, PRC is computed as defined in Equation 4.

\begin{array}{l} \mathcal{H} X_{t} = (1 - 𝔏) X_{t} = H_{t} = X_{t} - L_{t} & (4) \end{array}

Applying the decomposition repeatedly to the baseline yields D proper rotation components and a final baseline, from which the original signal X_t can be reconstructed using Equation 5.

\begin{array}{l} X_{t} = L_{t}^{D} + \sum_{j = 1}^{D} H_{t}^{j}, j = 1, \dots, D & (5) \end{array}

where D denotes the number of PRCs that are provided during ITD processing.
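The decomposition steps can be illustrated with a minimal NumPy sketch. It follows the Frei and Osorio construction, in which the baseline knots at the extrema come from Equation 3 and, between successive extrema, the baseline is an affine map of the signal with slope (L_{k+1} − L_k)/(X_{k+1} − X_k). The function names (`local_extrema`, `itd_step`, `itd`), the treatment of the two end points as extrema, and the end-point knot values are assumptions made for this illustration, not the authors' implementation.

```python
import numpy as np

def local_extrema(x):
    """Indices of interior local extrema plus both end points (boundary assumption)."""
    d = np.diff(x)
    # A strict sign change of the first difference marks an interior extremum;
    # flat plateaus are not detected in this simple sketch.
    interior = np.where(d[:-1] * d[1:] < 0)[0] + 1
    return np.concatenate(([0], interior, [len(x) - 1]))

def itd_step(x, alpha=0.5):
    """One ITD level: returns (baseline L, proper rotation H) with x = L + H."""
    x = np.asarray(x, dtype=float)
    tau = local_extrema(x)
    if len(tau) < 4:                      # too few extrema: treat x as the trend
        return x.copy(), np.zeros_like(x)
    xk = x[tau]
    Lk = np.empty_like(xk)
    # Baseline knots at interior extrema (Equation 3).
    Lk[1:-1] = alpha * (xk[:-2] + (tau[1:-1] - tau[:-2]) / (tau[2:] - tau[:-2])
                        * (xk[2:] - xk[:-2])) + (1 - alpha) * xk[1:-1]
    # Assumption: end-point knots are the average of the two nearest extrema values.
    Lk[0] = 0.5 * (xk[0] + xk[1])
    Lk[-1] = 0.5 * (xk[-2] + xk[-1])
    L = np.empty_like(x)
    for k in range(len(tau) - 1):
        a, b = tau[k], tau[k + 1]
        dx = xk[k + 1] - xk[k]
        if dx != 0:
            # Affine map of the signal between successive extrema.
            L[a:b + 1] = Lk[k] + (Lk[k + 1] - Lk[k]) / dx * (x[a:b + 1] - xk[k])
        else:
            # Fallback for equal extrema values: interpolate linearly in time.
            L[a:b + 1] = np.linspace(Lk[k], Lk[k + 1], b - a + 1)
    return L, x - L

def itd(x, n_prc=3, alpha=0.5):
    """Iterate on the baseline; returns (list of PRCs, final baseline)."""
    prcs, baseline = [], np.asarray(x, dtype=float)
    for _ in range(n_prc):
        baseline, h = itd_step(baseline, alpha)
        prcs.append(h)
    return prcs, baseline
```

For a 1-s segment `x` sampled at 1,000 Hz, `prcs, baseline = itd(x, n_prc=3)` would return PRC1 to PRC3, and `baseline + sum(prcs)` recovers `x`, mirroring the reconstruction in Equation 5.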

An exemplary motor imagery EEG signal decomposition conducted through the ITD algorithm is given in Figure 2A. To decide which PRCs to work with, the PRCs were examined in the frequency domain and their energy spectra were computed. Figure 2B provides an example of the energy spectra of the PRCs decomposed from an EEG signal. Figure 2B shows that the first PRC (i.e., PRC1) has the highest frequency content, while the fifth PRC (i.e., PRC5) exhibits the lowest frequency content. Hence, we selected the first three PRCs and their different combinations for our suggested feature extraction process because they contain the high-frequency content that best represents the signal characteristics of the original EEG. Various feature extraction methods are applied to the selected high-frequency PRCs (H_t) obtained through ITD. In our study design, seven different sets of high-frequency PRCs, namely PRC1, PRC2, and PRC3 individually and their different combinations [PRC1–PRC2, PRC1–PRC3, PRC2–PRC3, and PRC1–PRC2–PRC3 (denoted as PRC1-to-3)], are acquired and used to evaluate 10 features.
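The seven PRC sets are simply all non-empty subsets of the first three PRCs, which a short sketch can enumerate. How the features of a combined set are assembled is not spelled out in the text; the concatenation of per-PRC features used below, as well as the helper names `prc_sets` and `feature_names`, are assumptions for illustration.

```python
from itertools import combinations

PRCS = ("PRC1", "PRC2", "PRC3")
FEATURES = ("power", "mean", "sample_entropy",
            "moment1", "moment2", "moment3", "moment4",
            "activity", "mobility", "complexity")

def prc_sets(prcs=PRCS):
    """All non-empty subsets of the PRCs: 3 single sets, 3 pairs, 1 triple."""
    return [combo for r in range(1, len(prcs) + 1)
            for combo in combinations(prcs, r)]

def feature_names(prc_set, features=FEATURES):
    # Assumption: a combined set concatenates the 10 features of each member PRC.
    return [f"{p}_{f}" for p in prc_set for f in features]

for s in prc_sets():
    print(s, len(feature_names(s)))
```

Under this assumption, a single-PRC set gives 10 features per channel, a pair gives 20, and PRC1-to-3 gives 30.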

Figure 2. (A) PRCs extracted by the intrinsic time-scale decomposition (ITD) from a 1-s segment of EEG signals, (B) Energies of each PRC (the first five modes are given as examples).

2.3 ITD features

Following the extraction of the low-frequency baseline signal and the high-frequency PRCs by running the ITD algorithm, EEG signal properties including the power, mean value, sample entropy, higher-order spectral moments (first moment, second moment, third moment, and fourth moment), and Hjorth parameters (activity, mobility, and complexity) were computed from various combinations of PRCs. Their details are described below:

• The mean value was calculated from the time-domain samples of each of the 3 PRCs. It is defined as in Equation 6.

\begin{array}{l} μ = \frac{1}{N} \sum_{n = 0}^{N - 1} x [n] & (6) \end{array}

where a PRC is denoted as x[n], the mean value is denoted as μ, and the length of the PRC is denoted as N.

• The total power of PRCs was obtained using the spectrum of signals. The spectrum of PRCs was evaluated by implementing the periodogram method, which allows for analysis of the frequency content of a signal (Iscan et al., 2011; Karabiber Cura et al., 2023). From the definitions of the k-th frequency (Equation 7) and the power spectral density estimate of the k-th frequency component (Equation 8), the total power is defined as in Equation 9 (Iscan et al., 2011):

\begin{array}{l} w_{k} = \frac{2 π}{N} k, k = 0, 1, \dots, N - 1 & (7) \end{array}

\begin{array}{l} S (w_{k}) = \frac{1}{N} | X (w_{k}) |^{2} & (8) \end{array}

\begin{array}{l} S_{T} = \sum_{k = 0}^{N - 1} S (w_{k}) & (9) \end{array}

where S(w_k) indicates the power spectral density of the signal provided by the periodogram method, X(w_k) indicates the discrete Fourier transform of the PRC x[n], and S_T is the total power of the PRC. N in Equations 7–9 refers to the length of the corresponding signal.
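As a minimal sketch of Equations 7–9, the periodogram and total power of one PRC can be computed with NumPy's FFT. The function names `periodogram` and `total_power` are chosen for this illustration and are not taken from the source.

```python
import numpy as np

def periodogram(x):
    """Return (w_k, S(w_k)) for a 1-D signal, following Equations 7 and 8."""
    x = np.asarray(x, dtype=float)
    N = x.size
    w = 2.0 * np.pi * np.arange(N) / N          # Equation 7
    S = np.abs(np.fft.fft(x)) ** 2 / N          # Equation 8
    return w, S

def total_power(x):
    """Sum of the periodogram over all frequency bins (Equation 9)."""
    _, S = periodogram(x)
    return float(S.sum())
```

With this normalization, Parseval's relation implies that `total_power(x)` equals the sum of squared samples of `x`, which gives a quick sanity check on an implementation.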

• The higher order spectral moments (1st, 2nd, 3rd, and 4th) were computed from the spectrum of the signals, in the same way as the total power. These moments are defined as in Equations 10–13, respectively (Degirmenci et al., 2018):

\begin{array}{l} M_{1} = \sum_{k = 0}^{N - 1} {(w_{k})}^{1} S (w_{k}) & (10) \end{array}

\begin{array}{l} M_{2} = \sum_{k = 0}^{N - 1} {(w_{k})}^{2} S (w_{k}) & (11) \end{array}

\begin{array}{l} M_{3} = \sum_{k = 0}^{N - 1} {(w_{k})}^{3} S (w_{k}) & (12) \end{array}

\begin{array}{l} M_{4} = \sum_{k = 0}^{N - 1} {(w_{k})}^{4} S (w_{k}) & (13) \end{array}

Here, M₁, M₂, M₃, and M₄ represent the 1st, 2nd, 3rd, and 4th higher order spectral moments of the corresponding PRCs, respectively.
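Equations 10–13 share one pattern: each moment weights the periodogram by a power of the frequency w_k. The sketch below computes them in a single helper; the name `spectral_moments` and its `orders` argument are assumptions for illustration.

```python
import numpy as np

def spectral_moments(x, orders=(1, 2, 3, 4)):
    """Spectral moments M_p = sum_k (w_k)^p S(w_k) of a 1-D signal."""
    x = np.asarray(x, dtype=float)
    N = x.size
    w = 2.0 * np.pi * np.arange(N) / N          # frequencies w_k (Equation 7)
    S = np.abs(np.fft.fft(x)) ** 2 / N          # periodogram (Equation 8)
    return {p: float(np.sum(w ** p * S)) for p in orders}

# Example: M1..M4 of a short noisy segment.
rng = np.random.default_rng(2)
moments = spectral_moments(rng.standard_normal(128))
```

Passing `orders=(0,)` returns the total power S_T of Equation 9, since the zeroth-order weight is one for every bin.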

• Hjorth parameters, introduced by Hjorth (1970), are time-domain statistical features used in signal processing. These parameters include the Activity parameter (A_x), Mobility parameter (M_x), and Complexity parameter (C_x) of the signal. In the following mathematical equations for the Activity, Mobility, and Complexity parameters, y(n) indicates the auto-correlation function of one PRC after the ITD application, y(n) = [y_1, y_2, ⋯ , y_N], and N indicates the length of the signal.

The Activity parameter describes the power of the signal and can be evaluated using the variance of the signal amplitude. It is formulated in Equation 14 (Hjorth, 1970; Yu and Fang, 2022):

\begin{array}{l} A_{x} = \mathrm{var} (y (n)) = σ_{y}^{2} & (14) \end{array}

where σ_y denotes the standard deviation of y(n), which is given by Equation 15.

\begin{array}{l} σ_{y} = \sqrt{\frac{1}{N - 1} \sum_{n = 1}^{N} {[y (n) - μ]}^{2}} & (15) \end{array}

Here, the mean value of the signal is represented with μ.

The Mobility parameter is the ratio of the standard deviation of the first-order derivative of the signal to the standard deviation of the signal itself, and it can be evaluated using the slope of the signal. It is defined as in Equation 16.

\begin{array}{l} M_{x} = \sqrt{\frac{σ_{y^{'}}^{2}}{σ_{y}^{2}}} = \frac{σ_{y^{'}}}{σ_{y}} & (16) \end{array}

where $σ_{y^{'}}$ indicates the standard deviation of the first-order derivative of the signal.

The Complexity parameter indicates how similar the signal is to a pure sinusoid, and it is expressed as the ratio between the mobility of the first derivative of the signal and the mobility of the signal itself (Hjorth, 1970; Yu and Fang, 2022). The mathematical expression of the complexity parameter is given in Equation 17.

\begin{array}{l} C_{x} = \frac{M_{x} (y^{'} (t))}{M_{x} (y (t))} = \frac{M_{x} (\frac{d y (t)}{d t})}{M_{x} (y (t))} = \sqrt{\frac{\frac{σ_{y^{″}}^{2}}{σ_{y^{'}}^{2}}}{\frac{σ_{y^{'}}^{2}}{σ_{y}^{2}}}} & (17) \end{array}

Here, $σ_{y^{″}}$ denotes the standard deviation of the second-order derivative of the signal y(t).
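A compact way to read Equations 14–17 is as three variance ratios. The sketch below approximates the derivatives with first differences at unit sample spacing, which is an assumption of this illustration, as is the function name `hjorth_parameters`; the input can be whatever series the parameters are evaluated on.

```python
import numpy as np

def hjorth_parameters(y):
    """Return (activity, mobility, complexity) of a 1-D series."""
    y = np.asarray(y, dtype=float)
    dy = np.diff(y)                    # first-order derivative (finite difference)
    ddy = np.diff(dy)                  # second-order derivative
    var_y = np.var(y, ddof=1)          # sigma_y^2 with the 1/(N-1) normalization
    var_dy = np.var(dy, ddof=1)
    var_ddy = np.var(ddy, ddof=1)
    activity = var_y                               # Equation 14
    mobility = np.sqrt(var_dy / var_y)             # Equation 16
    complexity = np.sqrt(var_ddy / var_dy) / mobility   # Equation 17
    return float(activity), float(mobility), float(complexity)

# Example: a 10 Hz sinusoid sampled at 1,000 Hz for 1 s.
n = np.arange(1000)
a, m, c = hjorth_parameters(np.sin(2 * np.pi * 10 * n / 1000))
```

For a pure sinusoid the complexity is close to 1, which matches its interpretation as a measure of deviation from a sine shape; broadband signals give larger values.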

• The sample entropy indicates a time series complexity measure that represents the probability of a system generating new patterns. It can be defined as the embedding theory that utilizes the time series directly instead of probability values. The original time series is defined as L_t(i), i = 1, 2, ⋯ , N. The new vector sequences which each of size m, u(1) by u(N − m + 1) are created, and expressed as u(i) = {L_t(i), L_t(i + 1), ⋯ , L_t(i + m − 1)} (Higuchi, 1988; Martis et al., 2013). The defined length m indicates the embedding dimension. The distance d[u(i), u(j)] between vectors u(i), and u(j) is described in Equation 18 (Higuchi, 1988):

\begin{array}{l} d (u (i), u (j)) = \max_{0 \leq k \leq m - 1} {| L_{t} (i + k) - L_{t} (j + k) |} & (18) \end{array}

Here, k indexes the elements of the vectors. The probability of finding another vector within a distance r of vector u(i) is defined as in Equation 19 (Higuchi, 1988):

\begin{array}{l} C_{i}^{' m} (r) = \frac{1}{N - m + 1} \times \{\text{number of } j, j \neq i, j \leq N - m + 1, \text{ such that } d (u (i), u (j)) \leq r\} & (19) \end{array}

The average of these probabilities over all vectors is defined in Equation 20.

\begin{array}{l} \phi^{' m} (r) = {(N - m + 1)}^{- 1} \sum_{i = 1}^{N - m + 1} C_{i}^{' m} (r) & (20) \end{array}

Then, the sample entropy is described in Equation 21 (Martis et al., 2013):

\begin{array}{l} \mathrm{SampEn} (m, r, N) = - \ln [\frac{\phi^{' m + 1} (r)}{\phi^{' m} (r)}] & (21) \end{array}
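The Hjorth parameters (Equations 14 to 17) and the sample entropy (Equations 18 to 21) can be illustrated with a minimal Python sketch. It is not the implementation used in this study, which is not given in the text. The function names `hjorth_parameters` and `sample_entropy` are hypothetical, derivatives are approximated by first differences (an assumption), and the tolerance `r` is passed as an absolute value. The template counts follow Equations 19 and 20 as written, whereas many published implementations use N − m templates for both vector lengths.

```python
import math
from statistics import variance


def hjorth_parameters(y):
    """Return (activity, mobility, complexity) for a sequence y (len >= 4)."""
    # Assumption: derivatives are approximated by successive differences.
    dy = [b - a for a, b in zip(y, y[1:])]      # first difference, approximates y'
    ddy = [b - a for a, b in zip(dy, dy[1:])]   # second difference, approximates y''
    # statistics.variance uses the N - 1 denominator, as in Equation 15.
    var_y, var_dy, var_ddy = variance(y), variance(dy), variance(ddy)
    activity = var_y                                      # Equation 14
    mobility = math.sqrt(var_dy / var_y)                  # Equation 16
    complexity = math.sqrt(var_ddy / var_dy) / mobility   # Equation 17
    return activity, mobility, complexity


def _phi(series, dim, r):
    """Average fraction of other templates within distance r (Equations 18-20)."""
    n_vec = len(series) - dim + 1
    templates = [series[i:i + dim] for i in range(n_vec)]
    total = 0.0
    for i in range(n_vec):
        matches = 0
        for j in range(n_vec):
            if j == i:
                continue  # self-matches are excluded
            # Maximum absolute element-wise difference (Equation 18).
            dist = max(abs(a - b) for a, b in zip(templates[i], templates[j]))
            if dist <= r:
                matches += 1
        total += matches / n_vec   # Equation 19
    return total / n_vec           # Equation 20


def sample_entropy(series, m, r):
    """Sample entropy as the negative log of the (m + 1)-to-m match ratio."""
    phi_m = _phi(series, m, r)
    phi_m1 = _phi(series, m + 1, r)
    if phi_m == 0 or phi_m1 == 0:
        return math.inf  # convention chosen here when no matches are found
    return -math.log(phi_m1 / phi_m)


# Illustrative values only, not EEG data.
signal = [1, 3, 2, 5, 4, 6, 5, 8, 7, 9]
print(hjorth_parameters(signal))
print(sample_entropy(signal, m=2, r=1.0))
```

The double loop in `_phi` is quadratic in the number of samples, which is acceptable for short segments but would likely need vectorization for long recordings.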

2.4 Feature reduction using statistical significance (ANOVA)

Applying too many features to classifiers can unnecessarily complicate their implementation. Redundant information in EEG features can also degrade classification, a problem known as the curse of dimensionality (Hart et al., 2000). Trying different feature combinations one by one to find the most suitable classification imposes a heavy computational load (Narin et al., 2014). Feature reduction algorithms can be used instead of feature selection based on trying different combinations. The purpose of feature reduction is to find small subsets of features that provide the same or better classification performance (Yesilkaya et al., 2023). Using fewer data that retain the relevant features of motor imagery EEG signals is important to obtain good classifier performance without excessive computational load.

In this study, a feature reduction method based on statistical significance was applied to determine the relevant ITD features that best discriminate the FM imageries for each sample. The statistical significance-based feature selection method used in this study has also been applied in other BCI studies (Bulut et al., 2022; Degirmenci et al., 2022c, 2023). One-way analysis of variance (ANOVA), which tests whether the means of two or more groups differ, was used in this study. We preferred the ANOVA test among statistical significance-based feature selection methods because six motor imagery tasks in total, namely five FM imageries and the NoMT case, were to be classified. Thus, the effect of the ANOVA test-based feature selection method was investigated with ITD features. The statistical significance of all extracted EEG features was determined by calculating p-values. The statistical significance level (α) was set to 0.05, and the features with p-values below this level were selected as statistically significant features. In addition to the classifications performed without the feature selection process, the feature vector containing the selected statistically significant ITD features was also given to the classification algorithms as input data to differentiate FM imageries. The effectiveness of the ANOVA-based feature selection process was investigated by comparing the results of classifications with all features and with the selected statistically significant features.
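The selection rule described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the analysis code of this study: it uses NumPy and SciPy's `scipy.stats.f_oneway` in place of the Matlab environment, the function name `anova_select` is hypothetical, and the data are synthetic.

```python
import numpy as np
from scipy.stats import f_oneway


def anova_select(features, labels, alpha=0.05):
    """Return indices of feature columns whose one-way ANOVA p-value is below alpha."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    classes = np.unique(labels)
    selected = []
    for col in range(features.shape[1]):
        # One group of feature values per class (six classes in the study's setting).
        groups = [features[labels == c, col] for c in classes]
        _, p_value = f_oneway(*groups)
        if p_value < alpha:
            selected.append(col)
    return selected


# Synthetic example: column 0 depends on the class, column 1 is pure noise.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(6), 20)
informative = labels + rng.normal(0.0, 0.1, labels.size)
noise = rng.normal(0.0, 1.0, labels.size)
X = np.column_stack([informative, noise])
print(anova_select(X, labels))
```

Because each feature is tested independently at α = 0.05, a purely random feature can still pass the test occasionally, so the selected subset is best read as a filter rather than a guarantee of relevance.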

2.5 Classification

In this study, to differentiate FM imageries, the ITD-based EEG features were evaluated using eight well-known machine learning algorithms, namely Decision Tree (Tzallas et al., 2009; Sharma et al., 2022), Discriminant Analysis (Hart et al., 2000; Chakrabarti et al., 2003; Lotte et al., 2018), Naive Bayes (Hart et al., 2000; Miao et al., 2017), Support Vector Machine (Vapnik, 1999; Hart et al., 2000; Bascil et al., 2016), k-Nearest Neighbor (Hart et al., 2000; Isler, 2009; Tzallas et al., 2009), Ensemble Learning (Sayilgan et al., 2019, 2020, 2021a,b, 2022; Degirmenci et al., 2022b,c; Karabiber Cura et al., 2023), Neural Networks (Richard and Lippmann, 1991; Pan et al., 2012; Narin and Isler, 2021; Ozdemir et al., 2021; Degirmenci et al., 2022a), and Kernel Approximation (Maji et al., 2008; Lei et al., 2019). The classifiers and the corresponding algorithms adopted in this study are listed in Table 1. Each of these algorithms was implemented using the Classification Learner Toolbox, which is part of the Statistics and Machine Learning Toolbox available in the Matlab software package (Matlab, 2023). Since the technical details of these classifiers are well established, they are not explained here. For further details regarding the classifiers, the studies cited in the table can be consulted.

Table 1

Table 1. List of adopted classifiers with their implemented algorithms.

2.6 Performance evaluation

Training is defined as updating the classifier-specific parameters according to the available data. Testing is determining the performance of classifiers from the correct decisions made on previously unseen data. For this reason, the feature set was divided into two groups, training data (80%) and test data (20%), using the random splitting method (Hart et al., 2000).

In addition, during training, classifiers are expected to generalize rather than over-fit (or memorize) the available data. However, it may be difficult to make generalizations, especially when the size of the data is not large enough. Cross-validation (CV) is a method employed to evaluate the predictive performance of a model on data it has not processed (classified) before. Several cross-validation methods, including hold-out, leave-one-out, k-fold, and Monte-Carlo (MC) exist. All in all, hold-out (k equals 2) and leave-one-out (k equals the number of samples) methods are special cases of the k-fold method (Hart et al., 2000; Isler et al., 2015; Patro, 2021).

Differences between the k-fold and MC methods are emphasized in the recent literature: (a) k-fold uses each sample exactly once for validation, whereas MC may use a sample an arbitrary number of times (0 or more); (b) k-fold divides the data into k parts, whereas MC generates a large number of random splits; (c) k-fold yields an unbiased accuracy estimate with high variance, whereas MC yields a highly biased accuracy estimate with low variance. These differences cause a trade-off among CV methods (Patro, 2021). A recent study emphasizes that a large number of simulated data may cause over-fitting and that using independent data for extra validation is necessary (Labriffe et al., 2022).

Therefore, we preferred the k-fold CV method as in our similar studies (Isler, 2009; Isler and Kuntalp, 2009; Degirmenci et al., 2022a,b) and the recent literature (Anam et al., 2019, 2020; Kato et al., 2020; Mwata-Velu et al., 2021, 2022; Azizah et al., 2022; Zahra et al., 2022). Using k-fold cross-validation (CV), the training data set was divided into k equal-sized subsets. In each iteration, one subset was used as test data and the other k − 1 subsets were used as training data, and this classification process was repeated k times (Hart et al., 2000). According to Brownlee's article on the Machine Learning Mastery website (Brownlee, 2023), there is no general rule for choosing the k value, but as the k value increases, the bias decreases (Kuhn and Johnson, 2013). Additionally, it is stated that empirically selected values of 5 or 10 give a balanced bias-variance test error (James et al., 2013). The average classification performance of these iterations is defined as the training performance (Hart et al., 2000). In conclusion, k was set as 5 for this study as in similar studies.
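The 80%/20% random split followed by 5-fold CV on the training portion can be sketched with index bookkeeping alone. The helper names `train_test_split_indices` and `k_fold_indices`, the seed, and the sample count of 100 are hypothetical choices for illustration; the study itself relied on Matlab tooling.

```python
import random


def train_test_split_indices(n_samples, test_fraction=0.2, seed=0):
    """Randomly split sample indices into (train, test) lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = round(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]


def k_fold_indices(indices, k=5):
    """Yield (train, validation) index lists, one pair per fold."""
    # Every k-th index goes to the same fold, so fold sizes differ by at most one.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation


train_idx, test_idx = train_test_split_indices(100)
for fold, (tr, va) in enumerate(k_fold_indices(train_idx, k=5), start=1):
    print(f"fold {fold}: {len(tr)} training samples, {len(va)} validation samples")
```

Each training sample appears in exactly one validation fold, which is the property that distinguishes k-fold from Monte-Carlo resampling in point (a) above.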

The accuracy (ACC) performance criterion is used in this study to evaluate the performance of various machine learning algorithms. The mathematical expression of the accuracy performance criterion is given in Equation 22 (Hart et al., 2000).

\begin{array}{l} ACC = \frac{TP + TN}{TP + FN + TN + FP} & (22) \end{array}

Here, TP and TN indicate the numbers of samples correctly assigned to the positive and negative classes, respectively. In addition, FP and FN indicate the numbers of samples incorrectly assigned to the positive and negative classes, respectively.
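As a small worked illustration of Equation 22, the sketch below computes accuracy both from binary counts and, for a multi-class problem, as the fraction of correctly labeled samples. The function names and the example labels are hypothetical and are not taken from the study's data.

```python
def binary_accuracy(tp, tn, fp, fn):
    """Accuracy from binary confusion counts, as in Equation 22."""
    return (tp + tn) / (tp + fn + tn + fp)


def multiclass_accuracy(true_labels, predicted_labels):
    """Fraction of samples whose predicted label equals the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Made-up labels for illustration only.
truth = ["thumb", "index", "NoMT", "NoMT"]
guess = ["thumb", "middle", "NoMT", "index"]
print(multiclass_accuracy(truth, guess))
print(binary_accuracy(tp=40, tn=45, fp=5, fn=10))
```

For reference, if the six classes were balanced (an assumption, not stated in this section), uniform random guessing would give an accuracy of 1/6, about 16.67%.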

3 Results

The suggested methods were applied to EEG segments of 19-channel EEG signals collected from 8 subjects. Firstly, the ITD approach was used to decompose EEG signals into PRCs. Then the power, mean value, sample entropy, high-frequency moments (first moment, second moment, third moment, and fourth moment), and Hjorth parameters (activity, mobility, and complexity) were evaluated as features utilizing distinct combinations of PRCs. In the feature extraction process performed in this study, both the first three components (PRC1, PRC2, and PRC3) and their different combinations (PRCs1-2, PRCs1-3, PRCs2-3 and PRC1-to-3) were used and their effectiveness was investigated, individually. The same feature extraction process was also performed on EEG signals without any ITD approach to show the effectiveness of the ITD algorithm in FM classification. Additionally, the ANOVA-based feature selection process was carried out on the PRC1-to-3 feature set and its effectiveness was investigated. Finally, a variety of classifiers including Decision Tree, Discriminant Analysis, Naive Bayes, Support Vector Machine, k-Nearest Neighbor, Ensemble Learning, Neural Networks, and Kernel Approximation were used to classify FM imagery of EEG segments, and the experimental results of each were analyzed.

The classification performances of the ITD-based features computed using the different components and EEG-based features were evaluated to compare and analyze the effectiveness of the suggested ITD-based process. The classification performances of features acquired through our suggested ITD-based approaches with various classifiers are given in Tables 2–9. These classification performances were evaluated using both feature sets provided using single PRCs (PRC1, PRC2, and PRC3), their combinations (PRCs1-2, PRCs1-3, and PRC1-to-3), and ANOVA-selected PRC1-to-3 combination. In tables, EEG indicates that the feature set utilized in the classification step is generated using the EEG signal itself without applying ITD. Additionally, boldface characters show which feature set obtained the highest accuracy performance in subject-dependent and subject-independent analyses separately.

Table 2

Table 2. All components' performances were tested in this study using the Decision Tree classifier.

Decision Tree classification performances evaluated using ITD-based features are presented in Table 2. With respect to these results, the first three PRCs combined with an ANOVA-based feature selection process obtain the highest accuracy value of 44.17% in S4 (Subject E). The performance comparison of the ITD-based approach with the EEG-based case (without the ITD process), shows that the highest performance values were obtained with the use of ITD-based features in all subjects except S2 (Subject B). The results for S2 (Subject B) were further investigated and it was noticed that the highest accuracy value reported was 30.83% in classifications performed using both PRCs1-3 combination and EEG features with an ANOVA-based feature selection process.

Linear Discriminant Analysis classification performances evaluated using ITD-based features are presented in Table 3. When the results are compared, the first three PRCs combined with the ANOVA-based feature selection process obtain the highest accuracy value of 47.50% in S4 (Subject E). The performances of the ITD-based approach could not be compared with those of the EEG-based case (without the ITD process) because the results of the EEG-based case could not be computed. The EEG-based feature set could not be classified because it does not fit the Linear Discriminant Analysis classifier's parameters.

Table 3

Table 3. All components' performances were tested in this study using the Linear Discriminant Analysis classifier.

Naive Bayes classification performances evaluated using ITD-based features are presented in Table 4. According to these results, EEG features with an ANOVA-based feature selection process obtain the highest accuracy value of 40.00% in S4 (Subject E). The performances of the ITD-based approach were compared with the performances of the EEG-based case (without the ITD process), and the comparison shows that the highest performance values were obtained with the use of ITD-based features in all subjects except three. When the analyses performed for S3 (Subject C) were further investigated, it was found that the highest accuracy value was 34.17% in classifications performed using both the first three PRCs combination with ANOVA-based feature selection and EEG features with ANOVA-based feature selection.

Table 4

Table 4. All components' performances were tested in this study using the Naive Bayes classifier.

Support Vector Machine classification performances evaluated using ITD-based features are presented in Table 5. The results show that the first three PRCs combined, both with and without an ANOVA-based feature selection process, obtain the highest accuracy value of 49.17% in S4 (Subject E). The same highest accuracy value is also found for the first three PRCs in combination with the ANOVA-based feature selection process in S3 (Subject C). The performances of the ITD-based approach were compared with the performances of the EEG-based case (without the ITD process) and the comparison shows that the highest performance values were obtained with the use of ITD-based features in all subjects.

Table 5

Table 5. All components' performances were tested in this study using the Support Vector Machine classifier.

k-Nearest Neighbors classification performances acquired using ITD-based features are presented in Table 6. According to these results, the PRCs1-3 combination obtains the highest accuracy value of 46.67% in S3 (Subject C). The performances of the ITD-based approach were compared with the performances of the EEG-based case (without the ITD process) and the highest performance values were obtained with the use of ITD-based features in all subjects.

Table 6

Table 6. All components' performances were tested in this study using the k-Nearest Neighbors classifier.

Ensemble Learning classification performances evaluated using ITD-based features are presented in Table 7. With regard to these results, the first three PRCs combined with an ANOVA-based feature selection process obtained the highest accuracy value of 55.00% for S4 (Subject E). When the performances of the ITD-based approach were compared with the performances of the EEG-based case (without the ITD process), it was evident that the highest performance values were obtained with the use of ITD-based features in all subjects.

Table 7

Table 7. All components' performances were tested in this study using the Ensemble Learning classifier.

Neural Networks classification performances evaluated using ITD-based features are presented in Table 8. The results indicate that the first three PRCs combined with an ANOVA-based feature selection process achieved the highest accuracy value of 53.00% for S3 (Subject C). Comparison of the performances of the ITD-based approach with the performances of the EEG-based case (without the ITD process) shows that the highest performance values were realized with the use of ITD-based features in all subjects except S6 (Subject G) and S8 (Subject I). Further analyses performed for S6 (Subject G) showed that the highest accuracy value attained was 38.33% in classifications performed using both the first three PRCs' combination with ANOVA-based feature selection process and EEG features with ANOVA-based feature selection process. On the other hand, the analyses performed for S8 (Subject I) revealed that the highest accuracy value reached was 35.00% using EEG features with an ANOVA-based feature selection process.

Table 8

Table 8. All components' performances were tested in this study using the Neural Networks classifier.

Kernel Approximation classification performances evaluated using ITD-based features are presented in Table 9. In reference to the results, one can infer that the first three PRCs combination without an ANOVA-based feature selection process obtained the highest accuracy value of 40.83% in S4 (Subject E). The performances of the ITD-based approach and the performances of the EEG-based case (without the ITD process) were compared and it was apparent that the highest performance values were obtained with the use of ITD-based features in only S4 (Subject E) and S7 (Subject H). In S1 (Subject A), S3 (Subject C), S6 (Subject G), and S8 (Subject I), the highest performance values were obtained with the use of EEG-based features with or without an ANOVA-based feature selection process. In other subjects, the highest performance values were obtained with the use of both ITD-based features and EEG-based features.

Table 9

Table 9. All components' performances were tested in this study using the Kernel Approximation classifier.

4 Discussion

The observed results reveal that the ITD algorithm mostly yields a considerable improvement in classification performance when ITD-based approaches are compared with the EEG-based analysis conducted without the ITD algorithm. The highest accuracy values are obtained using the ITD algorithm for most classification algorithms, the exception being the Naive Bayes algorithm. Among the ITD-based feature sets, all PRCs and their combinations provide higher classification performance than the EEG case in most classifications, except the Naive Bayes and Kernel Approximation classifications. The classification performance of a single PRC is lower than that of the combinations. The most successful feature set is the combination of the first three PRCs (PRC1-to-3). In addition, the classification performance with PRC1-to-3 is further improved by the ANOVA-based feature selection process. The experimental results revealed that evaluating different components together provides the highest classification performance.

Next, the component-based and EEG-based classification accuracies in the Ensemble Learning classifier for subject-dependent and subject-independent cases have been investigated to reveal the efficacy of the proposed ITD-based method more accurately. The performances obtained by running Ensemble Learning on the feature sets generated from EEGs, single PRCs (PRC1, PRC2, and PRC3), and their combinations (PRCs1-2, PRCs1-3, and PRC1-to-3) are given in Figure 3. The results reveal that the ITD algorithm provides a significant improvement in terms of accuracy performance compared to the classification performed without using the algorithm. Additionally, the combinations of different components achieved the highest classification performance for subject-dependent and subject-independent cases. Moreover, the ANOVA-selected combination of the first three PRCs (PRC1-to-3) achieved the highest classification performance in the analyses for all subjects except S1 (Subject A).

Figure 3

Figure 3. The component-based classification accuracies in Ensemble Learning classifier for all subjects.

The classification performances of ITD-based features from different PRCs with and without the ANOVA-based feature selection process were compared to provide more accurate information about the performance of the suggested ANOVA-selected ITD features. The classification accuracies for the PRC1-to-3 combination and ANOVA-selected PRC1-to-3 combination achieved by the Ensemble Learning classifier are presented in Figure 4. It can be seen that the ANOVA-selected PRC1-to-3 combination achieved higher classification accuracies than the PRC1-to-3 combination for both subject-dependent and subject-independent cases. The observed results reveal that the suggested statistical significance-based feature reduction process yields noticeable differences and improves the classifiers' performance.

Figure 4

Figure 4. Comparison of accuracy values evaluated using PRCs1-to-3 features and ANOVA-selected PRCs1-to-3 features as regards Ensemble Learning classifier.

The results of our study are compared to the state-of-the-art studies, which conducted FM classification based on EEG signals. Table 10 presents a comparison of the suggested study to relevant prior studies. Clearly, both subject-dependent (Kaya et al., 2018; Anam et al., 2019, 2020; Kato et al., 2020; Mwata-Velu et al., 2021, 2022; Azizah et al., 2022) and subject-independent (Kaya et al., 2018; Zahra et al., 2022) studies were conducted for FM classification in literature. In general, the highest performance values were achieved in subject-dependent classification as in our study. An important distinction between studies regarding FM classification was the number of subjects. In some studies (Anam et al., 2019, 2020; Mwata-Velu et al., 2022), classification was computed over the EEG data of four subjects. In contrast, some studies computed and reported using data from eight subjects. As an example of four-subject studies, Anam et al. (2019) reports on the analysis of the data of only four subjects and the classification performance varied between 51.00 and 56.00%. To make a meaningful comparison between the results of Anam et al. (2019) and our study, the sample sizes must be equal. Hence, we think that the two results are incomparable. In Anam et al. (2020), in addition to working with only four subjects, classification was carried out with deep learning structures. Despite the fact that the hidden layers in deep learning structures create a significant amount of workload and necessitate a significant amount of time for training, the reported classification performance in all subjects was not as high as expected (over 90.00%). 
In another study (Zahra et al., 2022), another deep learning-based classification with very high training time was adopted, with the same drawbacks as the previous study (Anam et al., 2020). Although a significant improvement in performance was achieved, the sample size of this study (i.e., only four subjects) and the number of EEG channels (i.e., only four channels) were limited when compared with the sample size and number of EEG channels in our study. Thus, a comparison between the results of this study (Anam et al., 2020) and ours would not be meaningful. On the other hand, some of these prior studies (Mwata-Velu et al., 2021, 2022; Azizah et al., 2022) performed channel reduction. In these studies, four out of all 19 channels were defined as effective channels and used for the feature extraction stage. Among these studies, although deep learning-based classification was performed in addition to channel reduction in Mwata-Velu et al. (2021), the performance values were only as high as 76%. In one of the studies of the same set (Mwata-Velu et al., 2022), EEG signals of 4 subjects were included, and deep learning-based classification was performed together with the channel reduction process. When their classification results are examined and compared, it is clear that the high performances were obtained under the study design limitations already noted. However, our study uses passive condition (NoMT case) EEG signals in addition to EEG signals of FM. Prior studies focused only on FM and classified them without considering the passive state of the subjects. The 6-class FM classification study we propose appears to be more suitable for real BCI design and applications. In this study, we used ITD-based features for FM classification. According to our experimental results, 55.00% is the highest accuracy achieved, using the pair of the ANOVA-selected combination of the first three PRCs and the Ensemble Learning classifier.

Table 10

Table 10. Comparison of classifier performances with the state-of-the-art studies for both subject-independent and subject-dependent cases from the literature.

There are a few aspects that distinguish this study from previous studies in this field. These distinguishing aspects, together with the contributions of this study to the literature, can be explained as follows:

• ITD-based feature extraction study is conducted for FM classification. The first three higher frequency components and their different combinations were evaluated and their success rates were investigated with respect to different classifiers separately. In addition to the ITD-based features, EEG-based features have been evaluated without ITD decomposition to analyze the impact of the suggested ITD-based process. The observed results reveal that the highest performance values are mostly achieved in ITD-based approaches. Among ITD approaches, the most successful feature set is the first three PRC combinations (PRCs1-to-3).

• Additionally, the statistical significance-based feature selection process was applied to the combination of the first three PRCs. It has been observed that the performance of the classifier increases further when this feature selection is applied. Thereby, in this study, the highest accuracy value was obtained by applying the combination of the first three modes to the Ensemble Learning classifier with ANOVA-based feature selection.

• To the best of our knowledge, our study presents the first approach in which different combinations of PRCs obtained through ITD decomposition and various features are utilized together to classify FM from EEG signals.

• We used the EEG signals of all subjects (eight subjects) and all channels (19 channels) of their EEG data in the analyses, thereby excluding study design limitations (e.g., number of channels) to perform effective comparisons.

• Furthermore, this study is advantageous in all its stages (ITD-based feature extraction, and classification) in terms of workload and does not contain any complexity in the classification stage as in deep learning structures.

• Finally, we carried out a 6-class classification of FM by including the NoMT condition in FM in order to realize a more realistic BCI design and application for paralyzed patients. Such a design choice is crucial since it does not exclude occurrences of the Midas Touch Problem (Velichkovsky et al., 1997), which is actually the misinterpreted intention of interactive action fired by the interface. In the case of BCI development, when NoMT is discarded, it might easily cause Midas Touch occurrences to become the source of false positives and cause classification performance to degrade dramatically.

5 Conclusion

The accurate decoding of FM is accepted as a challenging task because the fingers are smaller than other limbs such as arms and hands and their signals are noisy. As a result, it is a more complicated task to discriminate among FM. In this study, an ITD-based machine learning approach is proposed for rapid and accurate classification of FM by using multi-channel EEG signals. Nineteen-channel EEG data collected from eight subjects are used in our analysis. Firstly, the different modes are extracted from EEG signals using the ITD. Features such as power, mean, sample entropy, high-frequency moments (first moment, second moment, third moment, fourth moment), and Hjorth parameters (activity, mobility, complexity) are evaluated using the first three modes of EEG signals. The single modes and their different combinations are investigated separately in our study. Finally, FM classification with these extracted feature sets is performed using eight different machine-learning algorithms. Basically, we compared the performances of EEG-based features and the features extracted using the ITD algorithm. The experimental results reveal that the highest performance values are mostly (six out of eight classifier algorithms) acquired in ITD-based approaches. Additionally, the combinations of different modes mostly obtain the highest performance. Among all the different combinations, the combination of the first three modes forms the most successful feature set, and the highest accuracy values are achieved using this combination. The effectiveness of the ANOVA-based feature selection method is also investigated in this study. The results demonstrate that ANOVA-based feature selection improves the classifier performance by making it possible to find the more discriminatory and relevant features.
Among the classifier algorithms, the Ensemble Learning classifier appears to be the most successful classifier algorithm tested in this study. Therefore, in this study, the highest accuracy value of 55.00% is obtained in S4 (Subject E) by applying the combination of the first three modes to the Ensemble Learning classifier with ANOVA-based feature selection. The accuracy rates of subject-dependent analyses performed according to the Ensemble Learning classifier are found between 35.83 and 55.00% using the first three modes' combination (PRCs1-to-3) with ANOVA-based feature selection.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.nature.com/articles/sdata2018211.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants' legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

MD: Formal analysis, Investigation, Methodology, Validation, Writing—original draft. YY: Formal analysis, Investigation, Supervision, Writing—original draft, Writing—review & editing. MP: Writing—original draft, Writing—review & editing, Funding acquisition. YI: Conceptualization, Methodology, Supervision, Writing—original draft, Writing—review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. MP was supported by the Slovenian Research and Innovation Agency (Javna agencija za znanstvenoraziskovalno in inovacijsko dejavnost Republike Slovenije) (Grant No. P1-0403). This study was also supported by Izmir Katip Celebi University Scientific Research Council Agency as project number 2023-TDR-FEBE-0002 for MD's doctoral thesis studies. In addition, MD has a research fellowship from the Higher Education Institution 100/2000 Ph.D. scholarship and the 2211A general doctorate scholarship from the Scientific and Technological Research Council of Turkey (TUBITAK).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Alazrai, R., Alwanni, H., and Daoud, M. I. (2019). EEG-based BCI system for decoding finger movements within the same hand. Neurosci. Lett. 698, 113–120. doi: 10.1016/j.neulet.2018.12.045

Anam, K., Bukhori, S., Hanggara, F. S., and Pratama, M. (2020). “Subject-independent classification on brain-computer interface using autonomous deep learning for finger movement recognition,” in 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) (Montreal, QC), 447–450.

Anam, K., Nuh, M., and Al-Jumaily, A. (2019). “Comparison of EEG pattern recognition of motor imagery for finger movement classification,” in 6th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI) (Bandung), 24–27.

Aricò, P., Borghini, G., Di Flumeri, G., Sciaraffa, N., and Babiloni, F. (2018). Passive BCI beyond the lab: current trends and future directions. Physiol. Meas. 39:08TR02. doi: 10.1088/1361-6579/aad57e

Azizah, R. N., Zakaria, H., and Hermanto, B. R. (2022). Channels selection for pattern recognition of five fingers motor imagery electroencephalography signals. J. Phys. 2312:012019. doi: 10.1088/1742-6596/2312/1/012019

Bascil, M. S., Tesneli, A. Y., and Temurtas, F. (2016). Spectral feature extraction of EEG signals and pattern recognition during mental tasks of 2-D cursor movements for BCI using SVM and ANN. Aust. Phys. Eng. Sci. Med. 39, 665–676. doi: 10.1007/s13246-016-0462-x

Belkacem, A. N., Jamil, N., Palmer, J. A., Ouhbi, S., and Chen, C. (2020). Brain computer interfaces for improving the quality of life of older adults and elderly patients. Front. Neurosci. 14:692. doi: 10.3389/fnins.2020.00692

Brownlee, J. (2023). A Gentle Introduction to k-Fold Cross-Validation. Machine Learning Mastery. Available online at: https://machinelearningmastery.com/k-fold-cross-validation/ (accessed February 07, 2024).

Bulut, A., Ozturk, G., and Kaya, I. (2022). Classification of sleep stages via machine learning algorithms. J. Intell. Syst. Appl. 5, 66–70. doi: 10.54856/jiswa.202205210

Chakrabarti, S., Roy, S., and Soundalgekar, M. V. (2003). Fast and accurate text classification via multiple linear discriminant projections. VLDB J. 12, 170–185. doi: 10.1007/s00778-003-0098-9

Chen, X., Wang, Y., Nakanishi, M., Gao, X., Jung, T., and Gao, S. (2015). High-speed spelling with a noninvasive brain-computer interface. Proc. Nat. Acad. Sci. U. S. A. 112, E6058–E6067. doi: 10.1073/pnas.1508080112

Degirmenci, M., and Akan, A. (2020). “EEG based epileptic seizures detection using intrinsic time-scale decomposition,” in 2020 Medical Technologies Congress (TIPTEKNO) (Antalya), 1–4.

Degirmenci, M., Ozdemir, M. A., Izci, E., and Akan, A. (2022a). Arrhythmic heartbeat classification using 2D convolutional neural networks. Innovat. Res. BioMed. Eng. 43, 422–433. doi: 10.1016/j.irbm.2021.04.002

Degirmenci, M., Ozdemir, M. A., Sadighzadeh, R., and Akan, A. (2018). “Emotion recognition from EEG signals by using empirical mode decomposition,” in 2018 Medical Technologies National Congress (TIPTEKNO) (Magusa), 1–4.

Degirmenci, M., Yuce, Y. K., and Isler, Y. (2022b). Classification of multi-class motor imaginary tasks using poincare measurements extracted from EEG signals. J. Intell. Syst. Appl. 5, 74–78. doi: 10.54856/jiswa.202212204

Degirmenci, M., Yuce, Y. K., and Isler, Y. (2022c). “Motor imaginary task classification using statistically significant time-domain EEG features,” in 2022 30th Signal Processing and Communications Applications Conference (SIU) (Safranbolu: IEEE), 1–4.

Degirmenci, M., Yuce, Y. K., and Isler, Y. (2024). Classification of finger movements from statistically-significant time-domain EEG features. J. Fac. Eng. Archit. Gazi Univ. 39, 1597–1609. doi: 10.17341/gazimmfd.1241334

Degirmenci, M., Yuce, Y. K., Perc, M., and Isler, Y. (2023). Statistically significant features improve binary and multiple motor imagery tasks predictions from EEGs. Front. Hum. Neurosci. 17:1223307. doi: 10.3389/fnhum.2023.1223307

Dietterich, T. G. (2002). “Structural, syntactic, and statistical pattern recognition,” in Lecture Notes in Computer Science, Vol. 2396, eds T. Caelli, A. Amin, R. P.W. Duin, D. de Ridder, and M. Kamel (Berlin; Heidelberg: Springer). doi: 10.1007/3-540-70659-3_2

Frei, M. G., and Osorio, I. (2007). Intrinsic time-scale decomposition: time–frequency–energy analysis and real-time filtering of non-stationary signals. Proc. R. Soc. A Math. Phys. Eng. Sci. 463, 321–342. doi: 10.1098/rspa.2006.1761

Galiotta, V., Quattrociocchi, I., D'Ippolito, M., Schettini, F., Aricò, P., Sdoia, S., et al. (2022). EEG-based brain-computer interfaces for people with disorders of consciousness: features and applications. A systematic review. Front. Hum. Neurosci. 16:1040816. doi: 10.3389/fnhum.2022.1040816

Hart, P. E., Stork, D. G., and Duda, R. O. (2000). Pattern Classification, 2nd Edn. New York, NY: A Wiley-Interscience Publication.

Higuchi, T. (1988). Approach to an irregular time series on the basis of the fractal theory. Phys. D Nonlinear Phenomena 31, 277–283. doi: 10.1016/0167-2789(88)90081-4

Hjorth, B. (1970). EEG analysis based on time domain properties. Electroencephalogr. Clin. Neurophysiol. 29, 306–310. doi: 10.1016/0013-4694(70)90143-4

Iscan, Z., Dokur, Z., and Demiralp, T. (2011). Classification of electroencephalogram signals with combined time and frequency features. Expert Syst. Appl. 38, 10499–10505. doi: 10.1016/j.eswa.2011.02.110

Isler, Y. (2009). A Detailed Analysis of the Effects of Various Combinations of Heart Rate Variability Indices in Congestive Heart Failure (Ph.D. thesis). Dokuz Eylul University, Izmir.

Isler, Y., and Kuntalp, M. (2009). “Diagnosis of congestive heart failure patients using Poincare measures derived from ECG signals,” in 2009 14th National Biomedical Engineering Meeting (IEEE), 1–4.

Isler, Y., Narin, A., and Ozer, M. (2015). Comparison of the effects of cross-validation methods on determining performances of classifiers used in diagnosing congestive heart failure. Meas. Sci. Rev. 15, 196–201. doi: 10.1515/msr-2015-0027

James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013). An Introduction to Statistical Learning: with Applications in R. 1st Edn (Springer), 181–184.

Karabiber Cura, O., Kocaaslan Atli, S., and Akan, A. (2023). Attention deficit hyperactivity disorder recognition based on intrinsic time-scale decomposition of EEG signals. Biomed. Signal Process. Control 81:104512. doi: 10.1016/j.bspc.2022.104512

Kato, M., Kanoga, S., Hoshino, T., and Fukami, T. (2020). “Motor imagery classification of finger motions using multiclass CSP,” in 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) (IEEE), 2991–2994.

Kaya, M., Binli, M. K., Ozbay, E., Yanar, H., and Mishchenko, Y. (2018). A large electroencephalographic motor imagery dataset for electroencephalographic brain computer interfaces. Sci. Data 5, 1–16. doi: 10.1038/sdata.2018.211

Kuhn, M., and Johnson, K. (2013). Applied Predictive Modeling. 1st Edn. New York, NY: Springer, 70.

Labriffe, M., Woillard, J.-B., Debord, J., and Marquet, P. (2022). Machine learning algorithms to estimate everolimus exposure trained on simulated and patient pharmacokinetic profiles. Pharm. Syst. Pharmacol. 11, 1018–1028. doi: 10.1002/psp4.12810

Lei, D., Tang, J., Li, Z., and Wu, Y. (2019). Using low-rank approximations to speed up kernel logistic regression algorithm. IEEE Access 7, 84242–84252. doi: 10.1109/ACCESS.2019.2924542

Limbaga, N. J., Mallari, K. L., Yeung, N. R., and Monje, J. C. (2022). “Development of an EEG-based brain-controlled system for a virtual prosthetic hand,” in 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) (Las Vegas, NV), 1714–1717.

Lotte, F., Bougrain, L., Cichocki, A., Clerc, M., Congedo, M., Rakotomamonjy, A., et al. (2018). A review of classification algorithms for EEG-based brain-computer interfaces: a 10 year update. J. Neural Eng. 15:031005. doi: 10.1088/1741-2552/aab2f2

Maji, S., Berg, A. C., and Malik, J. (2008). “Classification using intersection kernel support vector machines is efficient,” in 2008 IEEE Conference on Computer Vision and Pattern Recognition (Anchorage, AK), 1–8.

Martis, R. J., Acharya, U. R., Tan, J. H., Petznick, A., Tong, L., Chua, C. K., et al. (2013). Application of intrinsic time-scale decomposition (ITD) to EEG signals for automated seizure prediction. Int. J. Neural Syst. 23:1350023. doi: 10.1142/S0129065713500238

Mathworks Matlab 2023b (2023). Train Classification Models in Classification Learner App. Book Chapter 23 in Statistics and Machine Learning Toolbox User's Guide, 23.1–23.22. Available online at: https://www.mathworks.com/help/pdf_doc/stats/stats.pdf (accessed February 07, 2024).

Miao, M., Zeng, H., Wang, A., Zhao, C., and Liu, F. (2017). Discriminative spatial-frequency-temporal feature extraction and classification of motor imagery EEG: an sparse regression and Weighted Naïve Bayesian Classifier-based approach. J. Neurosci. Methods 278, 13–24. doi: 10.1016/j.jneumeth.2016.12.010

Mitaim, S., and Kosko, B. (1998). Adaptive stochastic resonance. Proc. IEEE 86, 2152–2183. doi: 10.1109/5.726785

Mwata-Velu, T. Y., Avina-Cervantes, J. G., Cruz-Duarte, J. M., Rostro-Gonzalez, H., and Ruiz-Pinales, J. (2021). Imaginary finger movements decoding using empirical mode decomposition and a stacked BiLSTM architecture. Mathematics 9:3297. doi: 10.3390/math9243297

Mwata-Velu, T. Y., Avina-Cervantes, J. G., Ruiz-Pinales, J., Garcia-Calva, T. A., González-Barbosa, E. A., Hurtado-Ramos, J. B., et al. (2022). Improving motor imagery EEG classification based on channel selection using a deep learning architecture. Mathematics 10:2302. doi: 10.3390/math10132302

Narin, A., and Isler, Y. (2021). Detection of new coronavirus disease from chest x-ray images using pre-trained convolutional neural networks. J. Fac. Eng. Archit. Gazi Univ. 36, 2095–2107. doi: 10.17341/gazimmfd.827921

Narin, A., Isler, Y., and Ozer, M. (2014). Investigating the performance improvement of HRV Indices in CHF using feature selection methods based on backward elimination and statistical significance. Comput. Biol. Med. 45, 72–79. doi: 10.1016/j.compbiomed.2013.11.016

Ozdemir, M. A., Degirmenci, M., Izci, E., and Akan, A. (2021). EEG-based emotion recognition with deep convolutional neural networks. Biomed. Eng. 66, 43–57. doi: 10.1515/bmt-2019-0306

Pan, S., Iplikci, S., Warwick, K., and Aziz, T. Z. (2012). Parkinson's disease tremor classification-A comparison between support vector machines and neural networks. Expert Syst. Appl. 39, 10764–10771. doi: 10.1016/j.eswa.2012.02.189

Patro, R. (2021). Cross-Validation: K Fold vs Monte Carlo. Towards Data Science. Available online at: https://towardsdatascience.com/cross-validation-k-fold-vs-monte-carlo-e54df2fc179b (accessed February 07, 2024).

Pfurtscheller, G., and Neuper, C. (2001). Motor imagery and direct brain-computer communication. Proc. IEEE 89, 1123–1134. doi: 10.1109/5.939829

Richard, M. D., and Lippmann, R. P. (1991). Neural network classifiers estimate Bayesian a posteriori probabilities. Neural Comput. 3, 461–483. doi: 10.1162/neco.1991.3.4.461

Sayilgan, E., Yuce, Y. K., and Isler, Y. (2019). Prediction of evoking frequency from steady-state visual evoked frequency. Nat. Eng. Sci. 4, 91–99.

Sayilgan, E., Yuce, Y. K., and Isler, Y. (2020). Determining gaze information from steady-state visually-evoked potentials. Karaelmas Sci. Eng. J. 10, 151–157. doi: 10.7212/zkufbd.v10i2.1588

Sayilgan, E., Yuce, Y. K., and Isler, Y. (2021a). Evaluation of mother wavelets on steady-state visually-evoked potentials for triple-command brain-computer interfaces. Turk. J. Elect. Eng. Comp. Sci. 29, 2263–2279. doi: 10.3906/elk-2010-26

Sayilgan, E., Yuce, Y. K., and Isler, Y. (2021b). Evaluation of wavelet features selected via statistical evidence from steady-state visually-evoked potentials to predict the stimulating frequency. J. Fac. Eng. Archit. Gazi Univ. 36, 593–605. doi: 10.17341/gazimmfd.664583

Sayilgan, E., Yuce, Y. K., and Isler, Y. (2022). Investigating the effect of flickering frequency pair and mother wavelet selection in steady-state visually-evoked potentials on two-command brain-computer interfaces. Innovat. Res. BioMed. Eng. 43, 594–603. doi: 10.1016/j.irbm.2022.04.006

Sciaraffa, N., Di Flumeri, G., Germano, D., Giorgi, A., Di Florio, A., Borghini, G., et al. (2022). Evaluation of a new lightweight EEG technology for translational applications of passive brain-computer interfaces. Front. Hum. Neurosci. 16:901387. doi: 10.3389/fnhum.2022.901387

Sharma, R., Kim, M., and Gupta, A. (2022). Motor imagery classification in brain-machine interface with machine learning algorithms: classical approach to multi-layer perceptron model. Biomed. Signal Process. Control 71:103101. doi: 10.1016/j.bspc.2021.103101

Tzallas, A. T., Tsipouras, M. G., and Fotiadis, D. I. (2009). Epileptic seizure detection in EEGs using time–frequency analysis. IEEE Transact. Inf. Technol. Biomed. 13, 703–710. doi: 10.1109/TITB.2009.2017939

Vapnik, V. (1999). The Nature of Statistical Learning Theory. 2nd Edn. New York, NY: Springer.

Velichkovsky, B., Sprenger, A., and Unema, P. (1997). “Towards gaze-mediated interaction: collecting solutions of the ‘Midas touch problem’,” in Human-Computer Interaction INTERACT'97, IFIP, The International Federation for Information Processing (Boston, MA: Springer).

Vidal, J. J. (1977). Real-time detection of brain events in EEG. Proc. IEEE 65, 633–641. doi: 10.1109/PROC.1977.10542

Voznesensky, A., and Kaplun, D. (2019). Adaptive signal processing algorithms based on EMD and ITD. IEEE Access 7, 171313–171321. doi: 10.1109/ACCESS.2019.2956077

Wolpaw, J. R., Birbaumer, N., McFarland, D. J., Pfurtscheller, G., and Vaughan, T. M. (2002). Brain–computer interfaces for communication and control. Clin. Neurophysiol. 113, 767–791. doi: 10.1016/S1388-2457(02)00057-3

Yahya, N., Musa, H., Ong, Z. Y., and Elamvazuthi, I. (2019). Classification of motor functions from electroencephalogram (EEG) signals based on an integrated method comprised of common spatial pattern and wavelet transform framework. Sensors 19:4878. doi: 10.3390/s19224878

Yesilkaya, B., Sayilgan, E., Yuce, Y. K., Perc, M., and Isler, Y. (2023). Principal component analysis and manifold learning techniques for the design of brain-computer interfaces based on steady-state visually evoked potentials. J. Comput. Sci. 68:102000. doi: 10.1016/j.jocs.2023.102000

Yu, M., and Fang, M. (2022). Feature extraction of rolling bearing multiple faults based on correlation coefficient and Hjorth parameter. ISA Trans. 129, 442–458. doi: 10.1016/j.isatra.2022.02.015

Zahra, H. N., Zakaria, H., and Hermanto, B. R. (2022). Exploration of pattern recognition methods for motor imagery EEG signal with convolutional neural network approach. J. Phys. 2312:012064. doi: 10.1088/1742-6596/2312/1/012064

Keywords: brain-computer interfaces (BCIs), electroencephalogram (EEG), feature reduction, machine learning, finger movements (FM) classification, intrinsic time-scale decomposition (ITD)

Citation: Degirmenci M, Yuce YK, Perc M and Isler Y (2024) EEG-based finger movement classification with intrinsic time-scale decomposition. Front. Hum. Neurosci. 18:1362135. doi: 10.3389/fnhum.2024.1362135

Received: 27 December 2023; Accepted: 15 February 2024; Published: 05 March 2024.

Edited by:

Kang Hao Cheong, Singapore University of Technology and Design, Singapore

Reviewed by:

Kun Li, Hebei University of Technology, China
Lipeng Pan, Northwest A&F University, China

Copyright © 2024 Degirmenci, Yuce, Perc and Isler. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yalcin Isler
