Comparison of the ERP-Based BCI Performance Among Chromatic (RGB) Semitransparent Face Patterns

Li, Shurui; Jin, Jing; Daly, Ian; Zuo, Cili; Wang, Xingyu; Cichocki, Andrzej

doi:10.3389/fnins.2020.00054

ORIGINAL RESEARCH article

Front. Neurosci., 31 January 2020

Sec. Neural Technology

Volume 14 - 2020 | https://doi.org/10.3389/fnins.2020.00054

This article is part of the Research Topic Datasets for Brain-Computer Interface Applications View all 16 articles

Comparison of the ERP-Based BCI Performance Among Chromatic (RGB) Semitransparent Face Patterns

$\r\nShurui Li$ Shurui Li¹

Jing Jin^1*

Ian Daly²

Cili Zuo¹

Xingyu Wang¹

Andrzej Cichocki^3,4,5

¹Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai, China
²Brain-Computer Interfacing and Neural Engineering Laboratory, School of Computer Science and Electronic Engineering, University of Essex, Colchester, United Kingdom
³Skolkowo Institute of Science and Technology, Moscow, Russia
⁴Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland
⁵Department of Informatics, Nicolaus Copernicus University, Toruń, Poland

Objective: Previous studies have shown that combing with color properties may be used as part of the display presented to BCI users in order to improve performance. Build on this, we explored the effects of combinations of face stimuli with three primary colors (RGB) on BCI performance which is assessed by classification accuracy and information transfer rate (ITR). Furthermore, we analyzed the waveforms of three patterns.

Methods: We compared three patterns in which semitransparent face is overlaid three primary colors as stimuli: red semitransparent face (RSF), green semitransparent face (GSF), and blue semitransparent face (BSF). Bayesian linear discriminant analysis (BLDA) was used to construct the individual classifier model. In addition, a Repeated-measures ANOVA (RM-ANOVA) and Bonferroni correction were chosen for statistical analysis.

Results: The results indicated that the RSF pattern achieved the highest online averaged accuracy with 93.89%, followed by the GSF pattern with 87.78%, while the lowest performance was caused by the BSF pattern with an accuracy of 81.39%. Furthermore, significant differences in classification accuracy and ITR were found between RSF and GSF (p < 0.05) and between RSF and BSF patterns (p < 0.05).

Conclusion: The semitransparent faces colored red (RSF) pattern yielded the best performance of the three patterns. The proposed patterns based on ERP-BCI system have a clinically significant impact by increasing communication speed and accuracy of the P300-speller for patients with severe motor impairment.

Introduction

Brain-computer interface (BCI) systems enable their users to achieve direct communication with others or the outside environment by brain activity alone, independent of muscle control. There are many potential user groups for BCI systems, including, but not limited to, individuals living with amyotrophic lateral sclerosis (ALS) who are in the locked-in state (LIS).

The brain activity used to control a BCI can be measured using different signal acquisition approaches such as electroencephalogram (EEG), magnetoencephalography (MEG), functional magnetic resonance imaging (fMRI), electrocorticogram (ECoG), or near infrared spectroscopy (NIRS) (Vidal, 1973, 1977; Wolpaw et al., 2002). Since EEG signals are recorded via non-invasive electrodes placed on the surface of the scalp, EEG-based BCI systems are very commonly used. Three key signal components of the EEG are frequently used for BCI control: event-related potentials (ERPs), steady-state visual evoked potentials (SSVEP), and motor imagery (MI) (Sutton et al., 1965; Coles and Rugg, 1995). The focus of the present study is the ERP-based BCI.

The P300 speller, is a visual ERP-based BCI system, that can elicit a P300 ERP component using an Oddball paradigm. The P300 potential is the largest positive deflection with a latency around 300 ms after the oddball stimulus onset, and is associated with various cognitive processes such as attention, working memory, and executive function (Van Dinteren et al., 2014). In addition, P300-based BCI systems can evoke P100, N200, and N400 components. The P300 speller was originally described by Farwell and Donchin (1988). In this study, participants were requested to watch a screen displaying a 6 × 6 matrix containing 26 letters and 10 digits. They were asked to focus on the rare target stimuli and ignore the common non-target stimuli. Stimuli were flashed (highlighted) in a row-column pattern (RCP). However, the RCP results in the adjacency-distraction and double-flash problems, which can cause false positive P300 ERPs during flashes of non-target stimuli that are adjacent to the target. Thus, some researchers investigated ways to avoid this issue, and strengthen the performance of the P300 BCI system.

For example, Takano et al. identified that the color of the stimuli could influence P300-speller system performance. They replaced the white/gray flicker matrix with a green/blue flicker matrix and found that the chromatic stimulus improved the performance of the P300-speller system (Takano et al., 2009). Jin et al. (2012) proposed a set of stimuli patterns that made use of images of the face with different emotional content and degrees of movement, including neutral faces, smiling faces, shaking neutral faces, and shaking smiling faces. The results revealed that BCIs that make use of face-based stimuli paradigms are superior to the traditional RCP. Kaufmann et al. (2011) attempted to overlay characters used in a P300 speller with semitransparent images of familiar faces. This resulted in a higher classification accuracy by evoking N170 and N400 ERPs. The N170 is a negative voltage deflection occurring approximately 200 ms after stimulus onset, which is generally related to motion of the stimuli (Jin et al., 2015), speech processing (Niznikiewicz and Squires, 1996), and vocabulary selection (Kutas and Hillyard, 1980). The N400 component occurs at 300-500 ms post-stimulus, and is connected with face recognition (Kaufmann et al., 2011) and language understanding (Johnson and Hamm, 2000). The influences produced by stimuli have also been reflected in other factors, such as, but not limited to, the inter-stimulus intervals (Sellers et al., 2006), stimulus intensity (Cass and Polich, 1997), and stimulus motion (Sutton et al., 1965; Martens et al., 2009). A large number of works have attempted to design optimal paradigms based on face stimuli to improve the performance of BCI systems. For example, Li et al. (2015) observed that compared with a paradigm that only used semitransparent famous faces, the green semitransparent famous face paradigm could lead to improved classification performance. Based on this, we further explore the performance differences between red semitransparent face (RSF), green semitransparent face (GSF), and blue semitransparent face (BSF) patterns. In addition, Guo et al. (2019) investigated how red, green, and blue (RGB) colors may be used as stimuli in a new layout of flash patterns based on single character presentation. They reported that the red stimuli paradigm yielded the best performance. Thus, we hypothesize that faces, that are colored red, can produce a higher classification accuracy compared to patterns that combine red, green, and blue colors with faces.

Although a large number of works have attempted to design optimal paradigms to improve the performance of BCI systems, there are scarce studies on the pattern of chromatic difference and face combination. In our new patterns, the flashing row or column in the BCI display grid is overlaid with semitransparent faces that are colored red, green, or blue and we compare the effect of these three new spelling patterns on BCI performance. In addition, we investigate the ERP waveforms induced by the proposed “red semitransparent face” (RSF), “green semitransparent face” (GSF), and “blue semitransparent face” (BSF) patterns and evaluate the classification performance among the three patterns.

Materials and Methods

Participants

Twelve healthy participants (S1–S12, five females and seven males, aged 22–25 years, mean 24 years old) with normal or corrected to normal vision volunteered for the current study. All participants’ native language is Mandarin Chinese, and they are familiar with the Western characters used in the display. They are all right-handed and had normal color vision. Before the experiment began, all participants provided informed consent via a process which the local ethics committee approved. Two participants’ data was abandoned because the accuracy of three patterns were all lower than 60%. According to Kubler et al. (2004), these two participants may be described as “BCI-illiterate.” Four of the ten participants (S1, S3, S6, and S7) had participated in a BCI experiment previously. All participants were informed of the whole experimental process in advance.

Experimental Design

A 20-inch LCD monitor (Lenovo LS2023WC) with standard RGB gamut and 1600 × 900 resolution was used for stimuli presentation. Its maximum luminance was set to 200cd/m². In the experiment, we instructed participants sit approximately 105 cm in front of the display, which was 30 cm tall (visual angle: 16.3°) and 48 cm wide (visual angle: 25.7°) in a quiet laboratory which was relatively dim with the optic intensity of the environment approximately 40lx. Participants were asked to relax themselves and avoid unnecessary movement throughout the experiment. The graphical interface of the BCI was developed using “Qt Designer 4.8” software. The semitransparent images of faces, painted with three primary colors, red (255,0,0), green (0,255,0), and blue (0,0,255), were selected as stimuli, as shown in the Figure 1, and the transparency was set to 50%. The stimulus onset asynchrony (SOA) was set to 250 ms, and the stimulus interval was set to 100 ms throughout all stages of the experiment.

FIGURE 1

Figure 1. The experimental pattern. (A) Character matrix; (B) Red semitransparent face (RSF) pattern; (C) Green semitransparent face (GSF) pattern; (D) Blue semitransparent face (BSF) pattern; (E) the legend of the three stimuli. Note that in order to avoid copyright infringement, faces are portrayed with censor boxes. (During the experiment censor boxes were not presented). In addition, (B–D) presented the fifth flash.

Figure 1A shows the interface of the 6 × 6 spelling matrix before the experiment began; it contains 26 letters and 10 digits. The parameters of the three patterns including background color, the appearance and distance of characters and the stimuli style remain the same throughout the experiment. In Figure 1B, the pattern showed a semitransparent face colored red as the stimulus covered the characters. For the sake of convenience, we refer to this as the RSF pattern. Figure 1C shows the semitransparent face colored green as the stimulus covered the characters. This is referred to as the GSF pattern. Figure 1D shows the semitransparent face colored blue as the stimulus covered the characters. This is called the BSF pattern. In addition, Figures 1B–D presented the fifth flash.

In the current study, three patterns were presented to participants in sequence. During the experiment, participants were requested to silently count the number of times target characters flashed. The stimulus presentation pattern is based on binomial coefficients (Jin et al., 2010, 2014a). The formulation is C(n,k) = n!/k!(n−k)!, 0≤k≤n, where n refers to the number of flashes per trial and k refers to the number of flashes per trial for an element in the matrix. In this study, the combination of C(12,2) was used to represent the 12-flash pattern. Table 1 describes the coding of the stimulus sequence in the 12-flash pattern with 36 flash pattern pairs. The locations in Table 1 correspond to the locations of the 36 characters in Figure 1A. Specifically, the first pair (1,4) in Table 1 means the first and the fourth flash will cover character “A”. During the offline and online block – for each of the three patterns – the presentation sequences for each stimulus are consistent with Table 1.

TABLE 1

Table 1. The coding of stimulus sequence of the 12-flash pattern.

The flow diagram of the experiment is shown in Figure 2. Each pattern was presented during both offline and online blocks. The offline block included three runs. In each run participants were asked to attempt to spell five targets without any break. After each offline run, participants had 3–5 min rest. Moreover, each target needed to be presented in 16 trials before it can be identified, and each trial consisted of 12 stimuli flashes. In the offline block, no feedback was displayed to the participants. The online block contained one run, which included a spelling task with 36 targets, each of which contained n trials, where n was decided by online adaptive strategy (Jin et al., 2011) for each target. Before each run began, the prompt box over the character indicated the target character.

FIGURE 2

Figure 2. The flow diagram of the whole experiment.

Given the order that the three patterns were tested in could affect the performance, we kept split the participants into three, uniformly sized, groups. Each group was presented the three patterns in a different order. Table 2 lists the order of presentation of the three patterns for all 12 participants. Specifically, participants S1, S4, S5, and S8 attempted to use the RSF pattern, followed by the GSF pattern, and then the BSF pattern. Participants S2, S3, S6, and S11 used the GSF pattern, BSF pattern, and then the RSF pattern, Finally, participants S7, S9, S10, and S12 used the BSF pattern, RSF pattern, and then the GSF pattern (see Table 2).

TABLE 2

Table 2. The order of patterns for 12 participants.

Stimulus Consistency

We prepared the interface composed of a black background and white characters, which was used to show a traditional P300 speller interface (Farwell and Donchin, 1988). In order to ensure the consistency of the color lightness and saturation across the three stimuli, we referred to G. Saravanan’s study (Saravanan et al., 2016) which transformed RGB values to the Hue, Saturation, and Luminance (HLS) color scale. The conversion formula is expressed in the following equation.

R^{'} = R / 255; G^{'} = G / 255; B^{'} = B / 255 (1)

C_{m a x} = M A X (R^{'}, G^{'}, B^{'}) (2)

C_{m i n} = M I N (R^{'}, G^{'}, B^{'}) (3)

Δ = C_{m a x} - C_{m i n} (4)

The HSL values can be calculated by the following formula.

H = {\begin{matrix} 0^{°}, & Δ = 0 \\ 60^{°} \times (\frac{G^{'} - B^{'}}{Δ} + 0), & C_{m a x} = R^{'} \\ 60^{°} \times (\frac{B^{'} - R^{'}}{Δ} + 2), & C_{m a x} = G^{'} \\ 60^{°} \times (\frac{R^{'} - G^{'}}{Δ} + 4), & C_{m a x} = B^{'} \end{matrix} (5)

S = {\begin{matrix} 0, Δ = 0 \\ \frac{Δ}{1 - | 2 L - 1 |}, o t h e r \end{matrix} (6)

L = (C_{m a x} + C_{m i n}) / 2 (7)

In this work, we calculated the corresponding values of hue, saturation, and luminance of the three stimuli. The three stimuli refer to the red (255,0,0), green (0,255,0), and blue (0,0,255) colors. It is noteworthy that the background of the interface was black with white characters and the three stimuli were consistent in saturation and luminance while differing in hue. This is shown in Table 3.

TABLE 3

Table 3. The corresponding HSL value of the three stimuli.

Electroencephalogram Acquisition

These EEG signals were recorded with g.USBamp and g.EEGcap systems (Guger Technologies, Graz, Austria). The sample rate of the amplifier was set as 256 Hz, the sensitivity value was100μV, and a third-order Butterworth band-pass filter was applied from 0.1 to 30 Hz (Munssinger et al., 2010; Halder et al., 2016). In this paper, we chose 16 electrode positions, based on the international 10–20 system (Jin et al., 2014a), which were positioned over areas of the brain associated with vision. These electrodes were Fz, F3, F4, FC1, FC2, C3, Cz, C4, P3, Pz, P4, P7, P8, O1, Oz, and O2. The ground electrode was placed at position FPz, while the reference electrode was placed on the right mastoid (R) (Jin et al., 2010, 2012, 2015). According to Petten and Kutas (1988), the use of the right mastoid reference leads to conclusions which are somewhat similar to those with the average of left and right mastoids. The electrode impedance was kept below 5 kΩ in the experiment (Munssinger et al., 2010). Figure 3 shows the configuration of the selected electrode positions.

FIGURE 3

Figure 3. The configuration of the selected electrode positions from the 10–20 system.

Feature Extraction and Classification

After completing the offline block, feature extraction is used to reduce dimensionality and hence computation time. Extracted features were used to construct the individual classifier model, which was applied during the online block. A band pass filter was applied to filter the EEG between 1 and 30 Hz to reduce high frequency noise. The filtering algorithm we applied was a third-order Butterworth filter. In order to eliminate the impact of electrical noise, the IIR notch filter of 50 Hz was also applied. In order to decrease dimensionality of the data and complexity of the classification model, the filtered EEG data was down-sampled from 256 to 36.6 Hz by taking every 7th sample.

The first 800 ms of EEG after stimulus presentation was extracted from each channel. This resulted in a feature vector of size 16 × 29, where 16 is the number of channels we used and 29 is the number of sample points recorded on each channel after down-sampling. Moreover, we used winsorizing to remove ocular artifacts by filtering amplitudes which were less than or greater than 10 and 90% of the amplitude distribution across the feature set (Jin et al., 2014b).

In this study, we applied Bayesian linear discriminant analysis (BLDA) to construct the individual classifier model which was used during the online block. Due to its regularization, it can avoid the problem of overfitting of high-dimensional data or noise interference. Hoffmann et al. (2008) first proposed BLDA and applied it to the P300-based BCI system effectively. In addition, after constructing the model, the score per flash was obtained. Within one trial, that is twelve flashes, the target flash should achieve the highest mark.

In accordance with widely used standardized metrics for assessing BCI performance, the classification accuracy and information transfer rate (ITR) are applied to assess the performance of our BCI. The ITR is defined as:

B = l o g_{2} N + A c c * l o g_{2} A c c + (1 - A c c) * l o g_{2} \frac{1 - A c c}{N - 1} (8)

I T R = B * \frac{60}{T} (9)

where N represents the total number of targets, Acc denotes the classification accuracy, and T represents the time performing each trial.

Online Adaptive System Setting

In order to improve system performance, an adaptive strategy was used with the online spelling system (Jin et al., 2011). In the online spelling system, the number of trials used to select each character is related to the classifier output after each trial. Specifically, when the classifier recognized the same character on two successive trials, no new flashes are needed and the recognized character is presented on the screen as feedback to the BCI user. If the number of trials needed to recognize a character reaches 16 without any pair of consecutive trials recognizing the same character, the classifier will automatically choose the target recognized in the final trial. For example, suppose that “A” is the target character which the classifier recognized in the first trial. If the character “A” was recognized again in the second trial, the final output will be “A”. We can describe this process via cha(n) = cha(n−1)[cpsbreak](1 < n≤16).

Statistical Analysis

The One-Sample Ryan-Joiner test based on the correction of Shapiro-Wilk was used to analyze whether the samples were normally distributed. A Repeated-measures ANOVA (RM-ANOVA) was chosen to evaluate the effect of stimuli pattern. Mauchly’s test of sphericity was first used to check the data meets the assumptions of the RM-ANOVA. If the assumption was broken, Greenhouse-Geisser correction was performed to adjust the degrees of freedom. Finally, we applied Bonferroni multiple comparisons correction in post hoc tests (Kathner et al., 2015). The alpha level was set to 0.05 after Bonferroni correction.

Results

ERP Analysis

Figure 4 illustrates the grand averaged ERP amplitudes in response to the target stimuli for ten participants over 16 electrodes across the three patterns, after applying baseline correction with a 100 ms pre-stimulus baseline. In Figure 4, the three colors of the curves in each channel illustrate the three different kinds of patterns respectively. Four color blocks lie around the peak point, which represents four types of potentials including the vertex positive potential (VPP), the N200, P300, and N400 potentials. We selected the latency of the potentials as the peak point with the range (min −10 ms, max + 10 ms). As we can see in Figure 4, the VPP components exist in frontal and central sites while the N200 and P300 components are centered over parietal and occipital areas. In addition, the peak amplitude of BSF curve performed lower than RSF and GSF curves (see Figure 4). According to studies of W. D. Wright (Gregory, 1973) and Fuortes (Fuortes et al., 1973), the human eye is composed of three color-sensitive cone-cell types (red, green, and blue). These three cone types have different responses for different stimulus wavelengths. Red cones are more sensitive to red color, green cones are more sensitive to green color. Among the three cone types, the red-cone presents the best response followed closely by the green-cone, with the blue cones having the lowest response, which may cause the difference. Furthermore, according to a RM-ANOVA, the P300 amplitude evoked by the RSF pattern is significantly larger than the other two patterns (p < 0.05) on parietal and occipital sites, corresponding to electrode P3, P7, Pz, P8, O1, Oz, and O₂.

FIGURE 4

Figure 4. The grand averaged ERP amplitudes of targets for 10 participants over 16 electrodes among the three patterns.

Figure 5 shows the signed R-squared value maps from 0 to 800 ms for ten participants over 16 electrodes for each of the three patterns, which reflects the difference between the target and non-target stimuli over 16 channels. In order to show the difference among R-square map for RSF, GSF, and BSF patterns, the additional three R-square maps for the differences between RSF and GSF pattern, between RSF and BSF pattern and between GSF and BSF have also shown in Figure 5. The R-squared values of the ERPs evaluate the separation between target and non-target signals. The formula is given as:

FIGURE 5

Figure 5. The signed R-squared value maps from 0 to 800 ms for 10 participants over 16 electrodes for each of the three patterns and for the differences of the three patterns.

r^{2} = {(\frac{\sqrt{N_{1} N_{2}}}{N_{1} + N_{2}} \cdot \frac{m e a n (X_{1}) - m e a n (X_{2})}{s t d (X_{1} \cup X_{2})})}^{2} (10)

where X₁ and X₂ refer to the features of class 1 and class 2 respectively, and N₁ and N₂ are the number of corresponding samples. In Figure 5, the darker the color, the more distinct the features.

Classification Accuracy and Bit Rate

Figure 6 illustrates the classification accuracy and raw bit rates for each of the three patterns, which were overlapped and averaged from all trials for the ten participants based on the offline data. This valued were acquired from 15-fold cross-validation. As shown in Figure 6, the RSF pattern achieved the best offline accuracy and bit rate by averaging 16 trials. This pattern also used required the fewest the least trials to attain an accuracy of 100%. Figure 7 depicts the classification accuracy based on offline single trials, which shows no significant differences across the three patterns.

FIGURE 6

Figure 6. Classification accuracy and raw bit rate based on the offline data.

FIGURE 7

Figure 7. The classification accuracy based on offline single trial classification.

In order to observe the differences between the N200, VPP, P300, and N400 ERP components between the three patterns, we chose channel P8 for measuring the N200, Cz for measuring the VPP, Pz for measuring the P300 and Cz for measuring the N400 (Farwell and Donchin, 1988; Jeffreys and Tukmachi, 1992; Duncan et al., 2009). The selected channels generally cover the highest ERP amplitude of the corresponding component.

Table 4 describes the averaged amplitudes of the VPP on channel Cz, N200 on channel P8, P300 on channel Pz and N400 on channel Cz from the peak point ± 10 ms for the ten participants. The averaged values of VPP, P300, and N400 are largest when the RSF pattern is used, and the stability of the P300 during presentation of the RSF pattern is better than that the other patterns.

TABLE 4

Table 4. The averaged amplitudes from each ERP peak point ± 10 ms of all participant.

Figure 8 presents the averaged contributions of the N200, P300, and N400 components to the offline classification accuracy for the ten participants. The N200 had a latency of 150–300 ms after stimulation, the P300 had a latency of 300–450 ms, and the N400 had a latency of 350–600 ms (Zhou et al., 2016). The result of the three patterns all delineated N200 and P300 played a pivotal role in offline classification. Moreover, the N400 potential has positive effect on the offline classification accuracy.

FIGURE 8

Figure 8. The contributions on N200, P300, and N400 time windows on the classification accuracy.

Online Analysis

Table 5 shows the online accuracies, bit rates, and the averaged numbers of trials for participants S1–S10 for each of the three patterns. The calculated p-values indicate the significance of the difference between each pair of accuracies. Our one-way RM-ANOVA shows a significant effect of the factor “color” on the online accuracy (F(1.30,11.65) = 8.87,p < 0.05,eta² = 0.50) and bit rate (F(1.11,10.02) = 9.25.p < 0.05,eta² = 0.51). The online accuracy of the RSF pattern was significantly higher than that of the GSF pattern (t = 3.24,p < 0.05,df = 9) and the BSF pattern (t = 4.39,p < 0.05,df = 9). In addition, the bit rate of the RSF pattern was significantly higher than that of the GSF pattern (t = 5.77,p < 0.05,df = 9) and the BSF pattern (t = 3.93,p < 0.05,df = 9). However, there are no significant differences in the number of average trials needed for the classification across the three patterns. A boxplot of online accuracies is illustrated in Figure 9.

TABLE 5

Table 5. Online accuracies, bit rates, and average trials analysis results.

FIGURE 9

Figure 9. Boxplot of online classification accuracies, bit rates, and numbers of trials used to construct the averaged ERPs.

Participants’ Feedback

At the end of the whole experiment, every participant was asked to grade their perception of the tiredness and difficulty of each pattern. Tiredness and difficulty were each given a rating between 1 and 3. A score of 1 corresponded to a little, a score of 2 medium, and a score of 3 quite a lot of tiredness or difficulty. The questions were asked in Mandarin Chinese. For the sake of distinguishing the differences among three patterns, a non-parametric Friedman test was applied to reveal the differences in feedback. Table 6 delineates the feedback of all participants among three patterns. No significant difference (χ² = 5.034,p > 0.05) was found between the patterns in terms of difficulty or tiredness (χ²=0.636,p > 0.05).

TABLE 6

Table 6. The feedback of all participants for each of the three patterns.

Discussion

ERP-based BCI systems have been widely investigated over many years and some researchers have designed novel stimulus paradigms to optimize system performance. Previous work has indicated that familiar faces, colored green, may be used as a part of the ERP-based BCI display pattern to achieve higher performance than other display patterns, such as the familiar face pattern based in P300-speller BCI system (Li et al., 2015). Therefore, we evaluated how this paradigm was influenced by other colors (red, green, and blue).

Related studies have indicated that, when familiar faces are used as stimuli, they may strongly elicit several ERPs, including the VPP, N200, P300, and N400 components. Cheng et al. (2017) reported that the semitransparent face pattern can evoke larger N200 components, which can contribute to improving classification accuracy. Eimer (2000) revealed that familiar faces could elicit an N400 in parietal and central cortical areas. In addition, the VPP component remarkably increase for face-related stimuli over frontal and central sites (Zhang et al., 2012). Among the three patterns evaluated in this study, we found all the ERP components, shown in Figure 4. Moreover, we can see from Figure 8 that the P300, N200, and the N400 all contribute to the classification accuracy. The results also indicate that the RSF pattern could elicit larger P300 potentials on parietal and occipital areas.

Generally, the performance of a BCI can be evaluated by online accuracy and ITR. The results listed in Table 5 indicate that the RSF pattern achieved the highest online averaged accuracy of 93.89%, followed by the GSF pattern with 87.78%, while the lowest accuracy was achieved with the BSF pattern (81.39%). Four of the participants using the RSF pattern obtained 100% online accuracy. Furthermore, the online accuracy achieved with the RSF is significantly higher than that achieved with the GSF pattern (p < 0.05) and the BSF pattern (p < 0.05). In addition, significant differences in bit rate were found between the RSF and GSF patterns (p < 0.05) and between RSF and BSF patterns (p < 0.05). The averaged bit rate of the RSF pattern was 38.45 bit/min, and the bit rate of the GSF pattern was 33.71 bit/min, while the bit rate of the BSF was 28.76 bit/min. Due to the averaged presentation order of the three patterns for all participants, the effect caused by the order of pattern presentation can be ignored. Consequently, we may conclude that the RSF pattern yielded the best performance of the three patterns.

In order to further explain the findings, it is necessary to consider relevant psychological and physiological studies. Research has shown that long-wavelength colors (e.g., red and yellow) are more arousing than short-wavelength colors (e.g., blue and green) (Wilson, 1966). In our experiment, each face stimulus was presented for more than half an hour, which may induce some effects on the emotions of the participants. Additionally, an association has been reported between colors and physiological indices of cognition. For instance, the color red is frequently associated with fire and blood which can lead to excitement and fear (Kaiser, 1984; Camgöz et al., 2004). Sorokowski and Szmajke (2011) found that red could improve performance in a target-hitting task. This result indicated that participants attempting to hit a red moving objects can achieve better performance than participants attempting to hit blue or black targets.

In previous studies, the green/blue chromatic flicker as a visual stimulus yielded an 80.6% online accuracy (Takano et al., 2009). Li et al. (2015) proposed that a translucent green familiar face spelling paradigm could achieve an 86.1% averaged online accuracy. This SSVEP-based BCI system used LEDs of four different colors (red, green, blue, and yellow) flickering at four distinct frequencies (8, 11, 13, and 15 Hz) (Mouli et al., 2013). It was observed that the red color obtained the highest accuracy and bit rate in most frequencies. Therefore, a novel spelling pattern that combines chromatic difference (RGB) with semitransparent faces resulted in consistency and efficiency in online BCI performance and offline ERP waveform detection.

Conclusion

In the present work, we combined chromatic difference (RGB) with semitransparent face stimuli to explore the performance of different colored stimuli patterns in an ERP based BCI system. The results demonstrated that the RSF pattern yielded the best averaged online accuracy and ITR. In future work, we will attempt to train offline models using neural networks to boost the classification performance. In addition, according to Xu’s study (Xu et al., 2018), a new BCI speller based on miniature asymmetric visual evoked potentials (aVEPs) could reduce visual fatigue for users. This demonstrates the feasibility to implement an efficient BCI system. We will further explore the effect of color preference on system performance and take user-friendliness into account to improve the usability of BCI systems. This may have a clinically significant impact by increasing communication speed and accuracy of the P300-speller for patients with severe motor impairment.

Data Availability Statement

The datasets generated for this study are available on request to the corresponding author.

Ethics Statement

The studies involving human participants were reviewed and approved by the Ethics Committee of East China University of Science and Technology. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

SL was the main author to raise the idea of the manuscript, design the whole experiment, and collect the original dataset. All authors contributed to the manuscript revision, read, and approved the submitted version.

Funding

This work was supported by the National Key Research and Development Program 2017YFB13003002. This work was also supported in part by the Grant National Natural Science Foundation of China, under Grant Nos. 61573142, 61773164, and 91420302, the programme of Introducing Talents of Discipline to Universities (the 111 Project) under Grant B17017, and the “ShuGuang” project supported by Shanghai Municipal Education Commission and Shanghai Education Development Foundation under Grant 19SG25.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Camgöz, N., Yener, C., and Güvenç, D. (2004). Effects of hue, saturation, and brightness: part 2: attention. Color Res. Appl. 29, 20–28. doi: 10.1002/col.10214

CrossRef Full Text | Google Scholar

Cass, M., and Polich, J. (1997). P300 from a single-stimulus paradigm: auditory intensity and tone frequency effects. Biol. Psychol. 46, 51–65. doi: 10.1016/s0301-0511(96)05233-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, J., Jin, J., and Wang, X. Y. (2017). Comparison of the BCI performance between the semitransparent face pattern and the traditional face pattern. Comput. Intell. Neurosci. 2017:1323985. doi: 10.1155/2017/1323985

PubMed Abstract | CrossRef Full Text | Google Scholar

Coles, M. G., and Rugg, M. D. (1995). Event-Related Brain Potentials: An Introduction. Oxford: Oxford University Press.

Google Scholar

Duncan, C. C., Barry, R. J., Connolly, J. F., Fischer, C., Michie, P. T., Naatanen, R., et al. (2009). Event-related potentials in clinical research: guidelines for eliciting, recording, and quantifying mismatch negativity, P300, and N400. Clin. Neurophysiol. 120, 1883–1908. doi: 10.1016/j.clinph.2009.07.045

PubMed Abstract | CrossRef Full Text | Google Scholar

Eimer, M. (2000). Event-related brain potentials distinguish processing stages involved in face perception and recognition. Clin. Neurophysiol. 111, 694–705. doi: 10.1016/s1388-2457(99)00285-0

CrossRef Full Text | Google Scholar

Farwell, L. A., and Donchin, E. (1988). Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroencephalogr. Clin. Neurophysiol. 70, 510–523. doi: 10.1016/0013-4694(88)90149-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Fuortes, M. G., Schwartz, E., and Simon, E. (1973). Colour-dependence of cone responses in the turtle retina. J. Physiol. 234, 199–216. doi: 10.1113/jphysiol.1973.sp010341

PubMed Abstract | CrossRef Full Text | Google Scholar

Gregory, R. L. (1973). Eye and Brain: The Psychology of Seeing. New York, NY: McGraw-Hill.

Google Scholar

Guo, M., Jin, J., Jiao, Y., Wang, X., and Cichocki, A. (2019). Investigation of visual stimulus with various colors and the layout for the oddball paradigm in ERP-based BCI. Front. Comput. Neurosci. 13:24. doi: 10.3389/fncom.2019.00024

PubMed Abstract | CrossRef Full Text | Google Scholar

Halder, S., Takano, K., Ora, H., Onishi, A., Utsumi, K., and Kansaku, K. (2016). An evaluation of training with an auditory P300 brain-computer interface for the Japanese hiragana syllabary. Front. Neurosci. 10:446. doi: 10.3389/fnins.2016.00446

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoffmann, U., Vesin, J.-M., Ebrahimi, T., and Diserens, K. (2008). An efficient P300-based brain–computer interface for disabled subjects. J. Neurosci. Methods 167, 115–125. doi: 10.1016/j.jneumeth.2007.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Jeffreys, D., and Tukmachi, E. (1992). The vertex-positive scalp potential evoked by faces and by objects. Exp. Brain Res. 91, 340–350.

PubMed Abstract | Google Scholar

Jin, J., Allison, B. Z., Kaufmann, T., Kubler, A., Zhang, Y., Wang, X. Y., et al. (2012). The changing face of P300 BCIs: a comparison of stimulus changes in a P300 BCI involving faces, emotion, and movement. PLoS One 7:e49688. doi: 10.1371/journal.pone.0049688

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, J., Allison, B. Z., Sellers, E. W., Brunner, C., Horki, P., Wang, X. Y., et al. (2011). An adaptive P300-based control system. J. Neural Eng. 8:036006. doi: 10.1088/1741-2560/8/3/036006

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, J., Allison, B. Z., Zhang, Y., Wang, X. Y., and Cichocki, A. (2014a). An ERP-based BCI using an oddball paradigm with different faces and reduced errors in critical functions. Int. J. Neural Syst. 24:1450027. doi: 10.1142/S0129065714500270

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, J., Daly, I., Zhang, Y., Wang, X. Y., and Cichocki, A. (2014b). An optimized ERP brain-computer interface based on facial expression changes. J. Neural Eng. 11:036004. doi: 10.1088/1741-2560/11/3/036004

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, J., Horki, P., Brunner, C., Wang, X. Y., Neuper, C., and Pfurtscheller, G. (2010). A new P300 stimulus presentation pattern for EEG-based spelling systems. Biomed. Tech. 55, 203–210. doi: 10.1515/BMT.2010.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, J., Sellers, E. W., Zhou, S. J., Zhang, Y., Wang, X. Y., and Cichocki, A. (2015). A P300 brain-computer interface based on a modification of the mismatch negativity paradigm. Int. J. Neural Syst. 25:1550011. doi: 10.1142/S0129065715500112

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, B. W., and Hamm, J. P. (2000). High-density mapping in an N400 paradigm: evidence for bilateral temporal lobe generators. Clin. Neurophysiol. 111, 532–545. doi: 10.1016/s1388-2457(99)00270-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaiser, P. K. (1984). Physiological response to color: a critical review. Color Res. Appl. 9, 29–36. doi: 10.1002/col.5080090106

CrossRef Full Text | Google Scholar

Kathner, I., Kubler, A., and Halder, S. (2015). Rapid P300 brain-computer interface communication with a head-mounted display. Front. Neurosci. 9:207. doi: 10.3389/fnins.2015.00207

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaufmann, T., Schulz, S. M., Grunzinger, C., and Kubler, A. (2011). Flashing characters with famous faces improves ERP-based brain-computer interface performance. J. Neural Eng. 8:056016. doi: 10.1088/1741-2560/8/5/056016

PubMed Abstract | CrossRef Full Text | Google Scholar

Kubler, A., Neumann, N., Wilhelm, B., Hinterberger, T., and Birbaumer, N. (2004). Brain-computer predictability of brain-computer communication. J. Psychophysiol. 18, 121–129.

Google Scholar

Kutas, M., and Hillyard, S. A. (1980). Reading senseless sentences: brain potentials reflect semantic incongruity. Science 207, 203–205. doi: 10.1126/science.7350657

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Q., Liu, S., Li, J., and Bai, O. (2015). Use of a green familiar faces paradigm improves P300-speller brain-computer interface performance. PLoS One 10:e0130325. doi: 10.1371/journal.pone.0130325

PubMed Abstract | CrossRef Full Text | Google Scholar

Martens, S. M. M., Hill, N. J., Farquhar, J., and Scholkopf, B. (2009). Overlap and refractory effects in a brain-computer interface speller based on the visual P300 event-related potential. J. Neural Eng. 6:026003. doi: 10.1088/1741-2560/6/2/026003

PubMed Abstract | CrossRef Full Text | Google Scholar

Mouli, S., Palaniappan, R., Sillitoe, I. P., and Gan, J. Q. (2013). “Performance analysis of multi-frequency SSVEP-BCI using clear and frosted colour LED stimuli,” in Proceedings of the 13th IEEE International Conference on Bioinformatics and Bioengineering (Bibe), Chania.

Google Scholar

Munssinger, J. I., Halder, S., Kleih, S. C., Furdea, A., Raco, V., Hosle, A., et al. (2010). Brain painting: first evaluation of a new brain-computer interface application with ALS-patients and healthy volunteers. Front. Neurosci. 4:182. doi: 10.3389/fnins.2010.00182

PubMed Abstract | CrossRef Full Text | Google Scholar

Niznikiewicz, M., and Squires, N. K. (1996). Phonological processing and the role of strategy in silent reading: behavioral and electrophysiological evidence. Brain Lang. 52, 342–364. doi: 10.1006/brln.1996.0016

PubMed Abstract | CrossRef Full Text | Google Scholar

Petten, C. V., and Kutas, M. (1988). The use of event-related potentials in the study of brain asymmetries. Int. J. Neurosci. 39, 91–99. doi: 10.3109/00207458808985695

PubMed Abstract | CrossRef Full Text | Google Scholar

Saravanan, G., Yamuna, G., and Nandhini, S. (2016). “Real Time implementation of RGB to HSV/HSI/HSL and its reverse color space models,” in Proceedings of the International Conference on Communication and Signal, Vol. 1(Jakarta: ICCSP), 462–466.

Google Scholar

Sellers, E. W., Krusienski, D. J., McFarland, D. J., Vaughan, T. M., and Wolpaw, J. R. (2006). A P300 event-related potential brain-computer interface (BCI): the effects of matrix size and inter stimulus interval on performance. Biol. Psychol. 73, 242–252. doi: 10.1016/j.biopsycho.2006.04.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Sorokowski, P., and Szmajke, A. (2011). The influence of the “Red Win” effect in sports: a hypothesis of erroneous perception of opponents dressed in red-Preliminary test. Hum. Mov. 12, 367–373.

Google Scholar

Sutton, S., Braren, M., Zubin, J., and John, E. (1965). Evoked-potential correlates of stimulus uncertainty. Science 150, 1187–1188. doi: 10.1126/science.150.3700.1187

PubMed Abstract | CrossRef Full Text | Google Scholar

Takano, K., Komatsu, T., Hata, N., Nakajima, Y., and Kansaku, K. (2009). Visual stimuli for the P300 brain-computer interface: a comparison of white/gray and green/blue flicker matrices. Clin. Neurophysiol. 120, 1562–1566. doi: 10.1016/j.clinph.2009.06.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Dinteren, R., Arns, M., Jongsma, M. L. A., and Kessels, R. P. C. (2014). Combined frontal and parietal P300 amplitudes indicate compensated cognitive processing across the lifespan. Front. Aging Neurosci. 6:294. doi: 10.3389/fnagi.2014.00294

PubMed Abstract | CrossRef Full Text | Google Scholar

Vidal, J. J. (1973). Toward direct brain-computer communication. Annu. Rev. Biophys. Bioeng. 2, 157–180. doi: 10.1146/annurev.bb.02.060173.001105

CrossRef Full Text | Google Scholar

Vidal, J. J. (1977). Real-time detection of brain events in EEG. Proc. IEEE 65, 633–641. doi: 10.1109/proc.1977.10542

CrossRef Full Text | Google Scholar

Wilson, G. D. (1966). Arousal properties of red versus green. Percept. Mot. Skills 23(3 Pt 1), 947–949. doi: 10.2466/pms.1966.23.3.947

CrossRef Full Text | Google Scholar

Wolpaw, J. R., Birbaumer, N., McFarland, D. J., Pfurtscheller, G., and Vaughan, T. M. (2002). Brain-computer interfaces for communication and control. Clin. Neurophysiol. 113, 767–791.

PubMed Abstract | Google Scholar

Xu, M. P., Xiao, X. L., Wang, Y. J., Qi, H. Z., Jung, T. P., and Ming, D. (2018). A brain-computer interface based on miniature-event-related potentials induced by very small lateral visual stimuli. IEEE Trans. Biomed. Eng. 65, 1166–1175. doi: 10.1109/Tbme.2018.2799661

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., Zhao, Q. B., Jin, J., Wang, X. Y., and Cichocki, A. (2012). A novel BCI based on ERP components sensitive to configural processing of human faces. J. Neural Eng. 9:026018. doi: 10.1088/1741-2560/9/2/026018

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, S. J., Jin, J., Daly, I., Wang, X. Y., and Cichocki, A. (2016). Optimizing the face paradigm of BCI system by modified mismatch negative paradigm. Front. Neurosci. 10:444. doi: 10.3389/fnins.2016.00444

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: brain-computer interface, ERP, chromatic stimuli, semitransparent face, visual stimuli

Citation: Li S, Jin J, Daly I, Zuo C, Wang X and Cichocki A (2020) Comparison of the ERP-Based BCI Performance Among Chromatic (RGB) Semitransparent Face Patterns. Front. Neurosci. 14:54. doi: 10.3389/fnins.2020.00054

Received: 25 September 2019; Accepted: 14 January 2020;
Published: 31 January 2020.

Edited by:

Hans-Eckhardt Schaefer, University of Stuttgart, Germany

Reviewed by:

Zhen Liang, Kyoto University, Japan
Minpeng Xu, Tianjin University, China

Copyright © 2020 Li, Jin, Daly, Zuo, Wang and Cichocki. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jing Jin, jinjingat@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.