Bilingual Contexts Modulate the Inhibitory Control Network

The present functional magnetic resonance imaging (fMRI) study investigated influences of language contexts on inhibitory control and the underlying neural processes. Thirty Cantonese–Mandarin–English trilingual speakers, who were highly proficient in Cantonese (L1) and Mandarin (L2), and moderately proficient in English (L3), performed a picture-naming task in three dual-language contexts (L1-L2, L2-L3, and L1-L3). After each of the three naming tasks, participants performed a flanker task, measuring contextual effects on the inhibitory control system. Behavioral results showed a typical flanker effect in the L2-L3 and L1-L3 condition, but not in the L1-L2 condition, which indicates contextual facilitation on inhibitory control performance by the L1-L2 context. Whole brain analysis of the fMRI data acquired during the flanker tasks showed more neural activations in the right prefrontal cortex and subcortical areas in the L2-L3 and L1-L3 condition on one hand as compared to the L1-L2 condition on the other hand, suggesting greater involvement of the cognitive control areas when participants were performing the flanker task in L2-L3 and L1-L3 contexts. Effective connectivity analyses displayed a cortical-subcortical-cerebellar circuitry for inhibitory control in the trilinguals. However, contrary to the right-lateralized network in the L1-L2 condition, functional networks for inhibitory control in the L2-L3 and L1-L3 condition are less integrated and more left-lateralized. These findings provide a novel perspective for investigating the interaction between bilingualism (multilingualism) and inhibitory control by demonstrating instant behavioral effects and neural plasticity as a function of changes in global language contexts.


INTRODUCTION
Bilingualism is a form of "mental juggler" (Kroll and Bialystok, 2013), as speaking one language often involves simultaneous access to the non-target language in the brain (Dijkstra and Van Heuven, 1998;Green, 1998;Bialystok, 2007;Thierry and Wu, 2007;Thierry, 2010, 2017;De Groot, 2012). Therefore, for bilingual speakers, managing two languages requires constantly selecting words in the intended language and suppressing activations of the non-target language, a routine that necessitates the engagement of inhibitory control. As a result, the experience of using multiple languages might enhance bilinguals' performance in non-linguistic domains.
Previous studies have shown that bilinguals are less distracted, as compared to monolinguals, when performing inhibitory control tasks, including the Simon task (Bialystok et al., 2004;Martin-Rhee and Bialystok, 2008), the Stroop task , and the flanker task (Costa et al., 2008). Moreover, interpreting training has been shown to improve inhibitory control processes (Dong and Zhong, 2017).
The past decade has witnessed a dramatic increase in the use of neuroimaging techniques such as functional magnetic resonance imaging (fMRI) to study the neural system of bilingual language control and the effects of second language acquisition on inhibitory control (e.g., Bialystok et al., 2005;Luk et al., 2010;Weissberger et al., 2015). Abutalebi and Green (2007) proposed a brain network for language control during bilingual speech production. The network includes the left prefrontal cortex, anterior cingulate cortex (ACC), basal ganglia, and inferior parietal lobule. Abutalebi and Green's (2007) hypothesis is that this neural network is dedicated to the selection and temporal sequencing of language representations during bilingual word production, and the pipeline works in the following order: The left basal-ganglia and ACC modulate the neural activity levels in the left prefrontal cortex, which influences neural activity in the inferior parietal cortex. Each of these areas has implications in distinct cognitive processes: The prefrontal cortex inhibits the non-target language and corrects errors; the ACC monitors conflicts and detects errors; the basal ganglia, especially the caudate nuclei, supervises the language selection and lexical access; the inferior parietal lobule, as a key region for working memory, serves a goal maintenance function. Abutalebi and Green (2008) further clarified distinct contributions of the left and right supramarginal gyri (SMG) in the inferior parietal lobules: The Left SMG is responsible for bias selection away from the language not in use; on the contrary, the right SMG is responsible for bias selection toward the language in use. This neural network for bilingual language control has been testified in a number of studies. In an fMRI study on German-Italian-English trilinguals, Abutalebi et al. (2013a,b) showed that language-switching directions influenced brain activation levels in the caudate nuclei, while activation levels of the supplementary motor area (SMA)/ACC did not vary as the function of language proficiency, which suggests a domain-general role for SMA/ACC in control tasks. Using meta-analysis approaches, Luk et al. (2012) showed that 10 neuroimaging studies involving language switching reported significant and reliable neural responses in the following brain regions: The left inferior frontal gyrus (IFG), left middle temporal gyrus (MTG), left middle frontal gyrus (MFG), right precentral gyrus, right superior temporal gyrus (STG), midline pre-supplementary motor area, and bilateral caudate nuclei. Taken together, it is clear that the prefrontal cortex and caudate nuclei are highly involved in the regulation of bilingual speeches. These brain areas have also been reported in studies of non-linguistic cognitive tasks (e.g., Luk et al., 2010). However, how these areas are connected with one another as part of the control network remains unclear.
Taking the advantage of the functional connectivity approach, researchers have attempted to identify the interactions between the language control network and the cognitive control network during L2 acquisition. For example, Ghazi Saidi et al. (2013) in an L2 vocabulary training study showed that the language processing network and cognitive control network were highly integrated at the initial stage of vocabulary learning, but as the learning proceeded and the vocabularies are being consolidated, this integration decreased. Grant et al. (2015) expanded this line of research. Instead of lab-based short-term vocabulary training, they studied neural adaptations in the development of L2 processing by examining a group of classroom Spanish L2 learners who were native English speakers over the course of one academic year. Their results show that with increased L2 experience, the overall activations in the control areas such as the ACC decreases while its connectivities with semantically related regions such as the MTG increases. The authors claim that the ability to utilize cognitive control mechanisms to regulate access to the L2 representations is a more critical issue in the beginning, relative to the latter stage, of L2 acquisition. Taken together, these studies suggest an important role of the cognitive control network in early L2 acquisition.
One possibility is that the high demand and long-term practice of language control, which involves inhibitory control, during in L2 acquisition, that allows bilinguals outperforming monolinguals in several cognitive control tasks. However, participants' background variables, such as socioeconomic status and ethnic origins (Sabbagh et al., 2006;Morton and Harper, 2007;Li et al., 2013), also seem to matter for the cognitive advantage in bilinguals. It is also possible that language processing contexts account for some of the variances (e.g., Wu and Thierry, 2013). Studies of neural plasticity on high temporal scales (Fields, 2005;Delekate et al., 2011;Bercury and Macklin, 2015) support the notion that different global language contexts (single or dual-language contexts) may lead to distinguished neural activation patterns during target word selection (Green, 2011). Green and Abutalebi (2013) proposed an adaptive control hypothesis: Language control processes adapt to the recurrent demands of the interactional context. For example, in a dual-language context, in which both languages are used (but to different speakers), language processing engages the control network comprising bilaterally inferior frontal and parietal cortices, the ACC/pre-SMA, basal ganglia, and thalamus (Abutalebi and Green, 2016). In a dense code-switching context, however, speakers routinely interleave their languages in the course of a single utterance and adapt words from one of language in the context of the other language. The neural network of language control would rely more on a cerebellarprefrontal connection as compared to the dual-language context because, in a dense code-switching context, language control involves higher demands for opportunistic planning (Abutalebi and Green, 2016).
Although the adaptive control hypothesis is a recent theory on the neural mechanisms of bilingual control, there has been increasing interest in the influence of language contexts on nonlinguistic executive functions, such as inhibitory control. In a study using event-related potentials (ERPs), Wu and Thierry (2013) examined effects of immediate changes in language processing contexts on executive function in a group of early Welsh-English bilinguals. The cognitive control performance of these participants was measured using a modified version of the classic flanker task, in which participants were instructed to press a button to indicate the direction of an arrow presented within an array of flankers (arrows pointing to the same or the opposite direction). Critically, a word is presented before the flanker trial to implicitly prime a language context. The contextual words were either in Welsh (L1), English (L2), or both languages in separated blocks. The results showed higher accuracy rates when bilingual participants performed incongruent trials of the flanker task in the dual-language context as compared to singlelanguage contexts. The P300 amplitude was also reduced in the dual-language, as compared to the single-language context, indicating less flanker interference effect in the Welsh-English context. Therefore, the authors claimed that changes in language processing contexts could modulate non-linguistic cognitive control in bilinguals.
In a further exploration, Liu et al. (2016) examined the effect of language contexts on cognitive control in a group of Chinese-English bilinguals. Unlike the highly proficient Welsh-English bilinguals in Wu and Thierry (2013), participants in Liu et al. (2016) were native speakers of Chinese who have a moderate level of proficiency in English. All participants performed an antisaccade task, which measures response inhibition (or response suppression), interference inhibition (or inhibitory control), and task switching, three key subcomponents of executive functions (Bialystok et al., 2006). Response suppression refers to the ability to withhold an inappropriate response (e.g., triggered by a habitual cue), as is most classically established in the go/no-go paradigm. Inhibitory control refers to the process when multiple sources of information (e.g., the printing color and the word meaning in the classic Stroop paradigm) are competing for attention which needs to be drawn to the target attribute of the stimulus. Task switching refers to the ability to alter between two tasks that require different cognitive processes and responses. The critical difference between response suppression and inhibitory control is that the former taps onto the process of response execution, whereas the latter mainly measures the control of selective attention. In Liu et al. (2016), Chinese-English participants performed an antisaccade task in the pre-test and then complete a cued digit-naming task involving both Chinese and English. Following the naming task, the participants performed the same antisaccade task again in the post-test. The results showed that the bilingual naming task enhanced response suppression, impeded the inhibitory control, and made no influence on the performance of task switching. Therefore, the authors suggest that moderate proficient bilinguals may rely heavily on response suppression when making speech production in two languages. As a consequence, the bilingual naming task improved their performance in the antisaccade task. Meanwhile, because of the limited cognitive resources and more involvement of response suppression, inhibitory control might have been allocated with less cognitive resources when moderate proficient bilinguals name digits using alternating languages, explaining the decreased performance in inhibitory control. Task switching involves a different mechanism from response suppression and inhibitory control and was not influenced by the bilingual context.
To reconcile discrepancies in previous studies, the present study explores the effect of language contexts on the neurocognitive mechanism of inhibitory control in a group of Cantonese-Mandarin-English trilinguals, who were highly proficient in Cantonese (L1) 1 and Mandarin (L2), and moderate proficient in English (L3). One possibility is that the discrepancies between Wu and Thierry (2013) and Liu et al. (2016) are not necessarily contradictory; they might arise as a result of differences in the participants' language background. The Welsh-English bilingual participants in Wu and Thierry (2013) were highly proficient in both languages; in contrast, the Chinese-English bilinguals in Liu et al. (2016) were intermediate learners of English. Bilinguals with high and low levels of L2 proficiency might adopt different processing strategies during speech production and, therefore, have incomparable implications for executive functions. In the same vein, age of L2 acquisition could also explain discrepancies between the two studies. Early and late bilinguals might engage different cognitive and neural mechanisms during language processing, so that the effect of language context on executive control might not be comparable between the two types of bilinguals. Finally, it is worth noting that unlike Chinese and English, Welsh and English are both alphabetical languages. Switching between two languages with more similarities in linguistic structures might engage different executive components as compared to switching between two languages that differ more radically.
To verify that language contexts may exert different effects on the inhibitory control of bilinguals with different language backgrounds, the present study examined trilingual speakers while they performed a flanker task (Luk et al., 2010) following picture naming in different dual-language contexts: the L1-L2 context, the L2-L3 context, and the L1-L3 context. Withinsubject comparisons of their performance in the flanker task following the three contexts will provide a more confident answer to the modulation effect of language context on inhibitory control. It is our hypothesis that in the L1-L2 context (Cantonese-Mandarin), as in Wu and Thierry (2013), bilingual context would facilitate inhibitory control performance; in the L2-L3 (Mandarin-English) and the L1-L3 (Cantonese-English) contexts, as in Liu et al. (2016), the bilingual context would have no beneficial effect on inhibitory control.
The second goal of the current study is to examine how dual-language contexts modulate the functional brain network for inhibitory control, which is usually right-lateralized. For this purpose, effective connectivity analyses, following a recently developed procedure for valid group modeling, namely Group Interactive Multiple Model Estimation (GIMME, Gates and Molenaar, 2012) was performed to identify causal relationships between key brain regions that subserve inhibitory control in different dual-language contexts. If dual-language contexts do not influence inhibitory control process, participants should display a typical flanker effect and comparable brain activation patterns as well as common functional brain network when performing the flanker task. If dual-language contexts do modulates inhibitory control, it is our hypothesis that the L1-L2 context would elicit a right-lateralized network for inhibitory control, while the L1-L3 and L2-L3 contexts might engage a less typical inhibitory control network, because of the more demanding task on linguistic processing and language control in the L2-L3 and L1-L3 contexts, relative to the L1-L2 context.

Participants
Thirty students (10 males; age range 18-25) were recruited from the Guangdong University of Foreign Studies in Guangzhou, a city with a large Cantonese-Mandarin bilingual community. All participants were highly proficient early bilinguals of Cantonese (first language, L1) and Mandarin (second language, L2): They were raised up in a Cantonese family and have acquired Mandarin since early childhood. At the time of testing, participants use both languages on a regular basis.
All participants were late learners of English (third language, L3) in the mainstream classroom and had a moderate level of proficiency. They started to learn English at an average age of 7.4 (±1.82). According to their self-report, English and Mandarin were used as the main instruction languages in their English class (English usage: 52% ± 0.22; Mandarin usage: 40% ± 0.2; Cantonese usage: 7% ± 0.11), implying considerable experiences of switching and translation between English and Mandarin as a result of English learning.
As shown in Table 1, to assess the participants' linguistic knowledge and background variables in each of their three languages, we asked them to complete the following measures: (1) responses to the Language History Questionnaire (LHQ 2.0; Li et al., 2014) including the age of language acquisition (AoA), usage habits, switching frequency, and language abilities, (2) vocabulary knowledge in each language as examined through naming accuracy rates in a picture naming task (48 out of the 96 high-frequency non-living objects were selected as the stimuli from the battery of Snodgrass and Vanderwart (1980) and matched between languages, and (3) the Oxford Quick Placement Test (2001) as measurements of their English proficiency.
Based on language experience, usage habits, and language proficiency, the participants in the current study were characterized as (1) highly proficient in Cantonese and Mandarin, and with extensive experiences of switching between the two languages during conversations (i.e., in the L1-L2 context), and (2) moderately proficient in English but with more Mandarin-English switching experience (i.e., in the L2-L3 context) than Cantonese-English switching experience (i.e., in the L1-L3 context). All participants were right-handed as measured by the handedness inventory (Snyder and Harris, 1993). Written informed consent was obtained from all participants prior to the experiment. The Human Research Ethics Committee for Non-Clinical Faculties at the School of Psychology of South China Normal University approved this study. All subjects gave written informed consent in accordance with the Declaration of Helsinki.

Cognitive Assessments
Before the fMRI session, all participants received a battery of behavioral tests that are designed to measure their non-verbal intelligence (the Raven's Standard Progressive Matrices; Raven et al., 1988) and working memory (the odd-even sequencing task, an adaptation of number-sequencing subtest form the WAIS-III; Wechsler, 1997) as shown in Table 1.

fMRI Procedure
Participants completed six event-related fMRI runs, each lasting 6 min and 36 s. As shown in Figure 1B, every picture-naming run was presented prior to a flanker run. The order of the three dual-language contexts (i.e., L1-L2, L2-L3, and L1-L3) was counterbalanced between participants in a Latin square design.
The Picture Naming Task: As shown in Figure 1A, picturenaming tasks in the three dual-language contexts followed  the same paradigm, each involving two different languages. During the picture-naming task, participants named pictures in two languages alternatingly, with 24 pictures per language. The stimuli were randomly selected from 96 black and white drawings for concrete non-living objects (e.g., piano) in the UCSD International Picture Naming Project (IPNP) picture database 2 (Bates et al., 2003). All stimuli corresponded to high frequency words in both Chinese (Liu et al., 2011) and English (Brysbaert and New, 2009), and were matched for word frequency (t 95 = 0.4, p = 0.69) between the two languages. To ensure the familiarity of the object names, we asked an independent group of 35 individuals (age range: 18-21) from the same population to rate the familiarity of the object names on a 5-point Likert scale (1 = very infrequent; 5 = very frequent). There were no significant differences between the levels of familiarities of object names in the three languages [L1 = 4.0 ± 0.62, L2 = 4.13 ± 0.71, L3 = 3.94 ± 0.74; F (2,54) = 2.65, ps > 0.05].
In each trial of the picture naming task, a frame was presented for 500 ms and then a picture of an object appeared in the center of the frame for 3000 ms, followed by a blank screen of 500 ms. The color of the frame (blue, red, and green) served as the naming cue (blue for Cantonese, red for Mandarin, and green for English). Participants were instructed to covertly name the picture in the target language within 3000 ms. The 48 picturenaming trials were presented in a pseudo-random order with a jittered inter-stimulus interval (min = 2000 ms, max = 12000 ms) optimized with OptSeq2 (Dale, 1999). 3 During the inter-trial interval, a central fixation cross was presented. Due to equipment limitation and to minimize head movements during naming, naming responses were collected outside of the MRI scanner. As an orientation procedure, we informed participants that they would later complete a behavioral test related to the naming task inside the scanner. We collected behavioral data with the same task outside of the scanner 2 weeks after the fMRI sessions (e.g., Zou et al., 2012;Li et al., 2013).
The Flanker Task: Immediately following each picture-naming task, participants were scanned in a flanker task session, to examine the influence of language context on their inhibitory control process. During the flanker task (Luk et al., 2010), participants responded to the direction of a red chevron (i.e., the target) surrounded by other black chevrons (i.e., the flankers). As shown in Figure 1A, in congruent trials, flanker chevrons point in the same direction as the target, whereas in incongruent trials, flankers pointed in the opposite direction to the target. Twentyfour congruent trials and 24 incongruent trials were randomly presented during each flanker task scanning session, with jittered inter-stimulus intervals (min = 2000 ms, max = 12000 ms). Each trial began with the presentation of a red fixation for 500 ms, followed by the stimulus for 3000 ms, and then a blank buffer of 500 ms.

MRI Acquisition
MRI images were acquired on a 3-T scanner (Siemens Trio Tim) equipped with a 12-channel phased-array head coil at the South China Normal University, using a T2 * -weighted gradientecho EPI sequence (TE = 30 ms; TR = 2s; flip angle = 90 • ; slices = 32; matrix size = 64 × 64; FoV = 192 mm × 192 mm; thickness = 4 mm). Participants lay supine in the scanner and viewed the visual stimuli via a back-projection mirror, while their heads were immobilized with cushions. For each run, the functional scanning was always preceded by 6 s of dummy scans (fixation) to ensure tissue steady-state magnetization. Highresolution (1 mm × 1 mm × 1 mm) anatomical images were acquired using a T1-weighted, 3D inversion-recovery gradientecho (MP-RAGE) sequence.

Whole Brain Activations
The fMRI data were preprocessed using the Statistical Parametric Mapping (SPM) software running under MATLAB (SPM12; Wellcome Department of Imaging Neuroscience, University College London). 4 All three flanker runs followed the same data processing procedure. The first three scans (dummy scans) of the 198 volumes collected were discarded to allow for T1 equilibration. The remaining 195 volumes were then realigned to the first volume for head-motion correction, co-registered to the individual anatomical images and then to the EPI template in SPM12 based on the Montreal Neurological Institute (MNI) stereotactic space, and resampled into 3 mm × 3 mm × 3 mm cubic voxels. The head motion and rotation of all participants were less than 3 mm of displacement or 3 • of rotation.
For each participant, functional images collected from each flanker run were grouped into congruent and incongruent conditions. Individual brain activations corresponding to congruent or incongruent conditions (in contrast to fixation) were analyzed using general linear model (GLM) and were entered into the second level of group analysis to show the neural correlates underlying inhibitory control.
AlphaSim program in the REST (Song et al., 2011) software was used to correct for multiple comparisons in SPM (10,000 interactions). All the brain activations reported below survived an FWE-corrected cluster-level threshold of p < 0.05 (single voxel: p < 0.001, number of voxels > 12) (Woo et al., 2014) and were in the MNI coordinate space.
To identify activation changes in those regions between congruent and incongruent conditions following the three different dual-language contexts, the present study sorted time series of the 12 ROIs by experimental conditions (e.g., congruent and incongruent). The averaged time course signals across all trials of the congruent/incongruent conditions were converted to percentage signal changes (PSC) using the formula (signalbaseline)/baseline × 100 for each time point, and the baseline constant was the mean signal of the fixation baseline (e.g., Li et al., 2013). The averaged PSC values for each condition in every context were considered as representative activation level of each ROI for every participant.

Effective Connectivity
To examine the influence of dual-language context on functional brain connectivity of inhibitory control process in the trilinguals, we made use of recent advances in connectivity modeling (extended unified structural equation modeling, euSEM) Hillary et al., 2011Hillary et al., , 2014Gates and Molenaar, 2012;Yang et al., 2015) and a recently developed Group Iterative Multiple Model Estimation (GIMME), an automatic and freely distributed MATLAB-based program 6 .
The euSEM approach has provided a flexible and efficient method for analyzing the causal interactions of brain regions for cognitive functions, as has previously been applied in Grant et al. (2015) and Yang et al. (2015). The procedure for using the euSEM in the current study is consistent with Yang et al. (2015), but with two experimental conditions in every language context, namely congruent and incongruent conditions of the flanker task. As with other SEM-based approaches, GIMME works from individual-level correlation matrices. The covariance matrices used for the euSEM analysis include the ROI time series at time t (contemporaneous series, where each "t" is a single brain volume or TR) and the same ROI time series at the next time t + 1 (lagged series). For the euSEM analysis, the covariance matrices also include two time series of the effects of the task inputs (congruent and incongruent) for both time t and t + 1, convolved with a canonical hemodynamic response function. In addition, the bilinear series can be used to measure the influence of task inputs on the relationship between ROIs by examining time series of each ROI at each time t multiplied by the convolved task input series at time t. The model selections at the group and individual levels are implemented in the following steps. First, Lagrange Multiplier equivalents (i.e., modification indices; Sörbom, 1989) are used to identify which effects (including connections among ROIs, the direct and bilinear effects), if freed, optimally improve model fit across all individuals. The probability of detecting an effect across all individuals was set at 75%; selection of this criterion was informed by empirical and simulated studies on the likelihood of detecting a true effect should it exist in a given sample (e.g., Hillary et al., 2011Hillary et al., , 2014Gates and Molenaar, 2012;Yang and Li, 2012;Yang et al., 2015). The program iterates until the 75% criterion is met. Second, the model is pruned by eliminating connections that are no longer significant for 75% of the group after other connections are freed. Third, individual-level models are estimated in a semi-confirmatory manner. All connections freed in the group model (described in the two steps above) are freed at the individual level. The automatic search procedure within LISREL (Cziráky, 2004) then iteratively frees connections that optimally improve model fit, according to the Lagrange Multiplier equivalents (Gates et al., 2010). Finally, the model is pruned by eliminating individual-level connections that become non-significant after other individual-level connections are freed, and a confirmatory model is fitted. Model fit parameters, that were found to demonstrate reliability in simulation studies (e.g., Gates et al., 2010) and fMRI studies (e.g., Hillary et al., 2014), were chosen a priori so that two of the following four criteria were satisfied in the final model: confirmatory fit index (CFI) ≥ 0.9; non-normed fit index (NNFI) ≥ 0.9.

Behavioral Results
As shown in Figure 2, participants performed more quickly in the congruent condition as compared to the incongruent condition, displaying a typical flanker interference effect (Eriksen and Eriksen, 1974) in all three dual-language contexts (ps < 0.001). However, there were no significant differences in response times between the congruent or incongruent trials following the three dual-language contexts (ps > 0.05).
For accuracy rates, the flanker interference effect (higher accuracy rates in the congruent as compared to the incongruent trials) was found following the L2-L3 context (two-sample t-test, t 29 = 2.27, p < 0.05) and the L1-L3 context (t 29 = 2.32, p < 0.05), but not the L1-L2 context (two-sample t-test, t 29 = 1.14, p = 0.26), suggesting a facilitation effect of the L1-L2 context on inhibitory control process.

Whole-Brain Analysis
As shown in Table 2, in the L1-L2 context, incongruent trials elicited additional brain activations in the left inferior parietal lobe. In the L2-L3 context, incongruent conditions involved extra neural responses in the right inferior frontal gyrus and left supramarginal gyrus. In the L1-L3 context, both congruent and incongruent trials evoked neural activities in the right prefrontal cortex, right insula, and subcortical areas. See Supplementary Figure S1 for brain activations during the naming tasks.
No significant differences of neural activations were found between congruent and incongruent conditions in the L1-L2 and L2-L3 contexts. In the L1-L3 context, incongruent condition, as compared to the congruent condition, showed more neural responses in the bilateral inferior occipital gyri (Brodmann area 19,or BA 19), right middle occipital gyrus and bilateral middle temporal gyri (MTG). It is well-known that the medial temporal lobe (MTL) is the hub for declarative memory and keeps semantic representation (Squire et al., 2004). MTG might be a multimodal semantic processing hub, storing long-term conceptual knowledge, processing lexico-semantic information, and fulfilling semantic integration, especially in the L2 lexical processing (Rodríguez-Fornells et al., 2009). The stronger activation of the bilateral MTG in incongruent trials following L1-L3 context might imply competition of cognitive resources FIGURE 2 | Reaction times (bars; left axis) and accuracy rates (lines; right axis) in the flanker task for the Cantonese-Mandarin-English trilinguals in the L1-L2, L2-L3, and L1-L3 contexts. Reaction times for the congruent condition (C) and incongruent condition (I) were significantly different in the three contexts. For accuracy rate, there was no significant difference between congruent and incongruent conditions when the flanker task was presented in the L1-L2 context. The asterisks indicate significant differences ( * * p < 0.001; * p < 0.05). Error bars depict SEM in reaction time data.
between inhibitory control task and the demanding semantic processing in L1-L3 context. As illustrated in Figure 3A, duallanguage contexts (i.e., the L2-L3 and L1-L3 contexts) involving a moderate proficient language (L3) displayed increased brain activity in the right prefrontal cortex, bilateral insula and inferior parietal lobules, as well as subcortical areas, particularly the bilateral caudate and putamen, as compared to the L1-L2 context.

ROI Analyses
The

Connectivity Analysis
An extended unified Structural Equation Model (euSEM) analysis was conducted on the fMRI data of the flanker tasks following the L1-L2, L2-L3 and L1-L3 contexts. All group maps (Figure 4) had an excellent fit to the data for roughly 97-100% of the participants, depending on the measure (Brown, 2006). Specifically, in the L1-L2 context, the Comparative Fit Index (CFI) evaluated the model fit as excellent for 100% of the participants' data, while Standardized ROOT Mean Square Residual (SRMR) and the Root Mean Square Error of Approximation (RMSEA) showed excellent fit for 97% of the data. In L2-L3 context, CFI, SRMR and the RMSEA results indicated an excellent fit for 100% of the data, as is the same in the L1-L3 context. FIGURE 4 | Connectivity maps for inhibitory control process in three dual-language contexts (L1-L2; L2-L3; L1-L3). Significant contemporaneous and lagged relationships at group-level were shown, and auto-regressive lagged connections are omitted for clarity. Connection strength is denoted by beta coefficients, reflected here as the color the line. The solid line indicates contemporaneous relationship, namely area X at time T influences brain activation of area Y at time T; dotted line illustrates lagged relationship, namely area X at time T influences brain activation of area Y at time T + 1. As shown in Figure 4, the cortical-subcortical-cerebellar network for inhibitory control following the three dual-language contexts shared common connections: The right INS and thalamus strongly influence their left homologous areas, implying the right-dominant network for inhibitory control; the right IFG feeds to the right MFG, suggesting that the right IFG is highly engaged in bottom-up process of inhibitory control; the right thalamus influences brain activations in the right caudate in a lagged relationship, implying the key role of right thalamus in the communications between cortical and subcortical areas. In all the three contexts, right INS influences right IFG and left SMG. Not surprisingly, the left and right SMG are connected to each other as for the bilateral caudate nuclei.
Obviously, in all three dual-language contexts, inhibitory control relied on collaborations between a frontal-parietal network, a cortical-thalamic-striato pathway and bilateral cerebellum. However, dual-language contexts influence inhibitory control network in the trilinguals. To be specific, inhibitory control in the L1-L2 context recruits an efficient and right lateralized network: The right INS as the hub of the network feeds to the right IFG, modulating brain activation in the right MFG indirectly and activating the right SMG; the right THA, as a mediator of frontal-thalamic-striato pathway, receiving positive information from the right INS, feeds to the right CN; right THA is also a key relay station for cortico-cerebellar pathway, forwarding information from the right INS to the left cerebellum, which sends strong and positive influence to the right cerebellum; right SMG forward information from the right IFG to the left SMG, which passes information to the right insula, completing the frontal-parietal sub-network.
When the dual-language context involves a moderate proficient language (L3), the functional brain network for inhibitory control in the same group of participants changed immediately. Specifically, in the L2-L3 context, the frontalparietal sub-network runs in the reverse pipeline: The right IFG actively influences the right INS, which as a hub sends direct and positive information to the left SMG, feeding directly to the right SMG; the right IFG receives feedback from the right SMG and connects to the right MFG. As in the L1-L2 context, right INS influences bilateral caudate via THA in a lagged relationship. The left cerebellum receives inputs directly from the right SMG and feeds to the right cerebellum. Interestingly, the right CN activates left CN in the L2-L3 context, as is the same in the condition of the L1-L3 context. L1-L3 context involves a moderate proficient language and engages less switching experience between the two languages. Compared with the L1-L2 and L2-L3 contexts, inhibition control in the L1-L3 context relies on a more left-lateralized and less integrated network, compared with the networks in the other two dual-language contexts. In L1-L3 context, the left SMG works as the hub of the inhibitory control network: The right MFG takes the lead and sends orders directly to the left SMG, which relays to the right INS, right SMG, right THA, and right cerebellum. The right THA, as in the other two conditions, forward positive information to the right CN in a lagged relationship, thus completing the cortico-thalamic-striato pathway. For the cerebellar components, the right cerebellum receives weak influence from SMG on the contralateral side and feeds to the left cerebellum.
Taken together, the cortical-subcortical-cerebellar network for inhibitory control involves a frontal-parietal sub-network, cortico-thalamic-striato pathway, and bilateral cerebellum. However, this neural pattern can be modulated by language contexts on a short timescale. In dual-language contexts with intensive code switching between two proficient languages (i.e., the L1-L2 context), the inhibitory control process seems to be facilitated and rely on a right-lateralized control network. When the dual-language context involves a less proficient language, especially when the two languages are radically different from each other, the inhibitory control process relies on a more left-lateralized and less integrated neural network: In the L2-L3 context, the right IFG feeds to right INS and receives feedback from the right SMG; in the L1-L3 context, the right MFG is highly engaged and the whole network relies on the left SMG to modulate brain activations in the cortical, subcortical, and cerebellar areas.

DISCUSSION
The current study examines the dynamic influences of language contexts on inhibitory control in trilinguals. We explored the neural correlates and functional brain networks activated while Cantonese-Mandarin-English trilingual speakers performed the flanker tasks in three dual-language contexts. As bilingual language processing engages inhibition of the non-target language (Dijkstra and Van Heuven, 1998;Green, 1998;Linck et al., 2008;Green and Abutalebi, 2013;Ventura-Campos et al., 2013;Abutalebi and Green, 2016) and acquiring a second language facilitates the development of cognitive control (Linck et al., 2009;Hosoda et al., 2013;Grant et al., 2015), we expected a facilitatory effect of dual-language contexts on participants' cognitive control performance (e.g., Wu and Thierry, 2013). Our participants were highly proficient in L1 and L2, but moderately proficient in L3. They frequently switch between L1 and L2 in everyday life, but not between L1 and L3 or L2 and L3. We, therefore, expected to observe significant facilitatory effect of the L1-L2 context as compared to the L2-L3 and L1-L3 contexts on inhibitory control.
The results showed the classic flanker effect in the L2-L3 and L1-L3 contexts, but not in the L1-L2 context. Consistent with results in a previous study (Wu and Thierry, 2013), the effect of contextual priming was observed in accuracy rates but not reaction times, suggesting that independent cognitive mechanisms might account for flanker effects in the two types of measurements. Previous studies (e.g., Luk et al., 2010) using a similar flanker task showed that in the incongruent condition, bilinguals activated a widespread set of brain regions, including the fusiform gyri, inferior frontal gyri, supplementary motor area, inferior parietal regions, and subcortical areas. In the present study, those brain regions for inhibitory control failed to show significant activations in incongruent condition (when compared to the congruent condition) in the L1-L2 context. Moreover, none of the ROIs showed significant interference effects in the L1-L2 context in terms of their BOLD signal change ( Figure 3B). These findings suggest that the neural efficiency of the inhibitory control network was enhanced in the L1-L2 context, reducing the classic flanker effect in both the behavioral and the neural anatomical level (e.g., Jäncke et al., 2000;Stevens et al., 2007). Effective connectivity analysis shows the following pattern of results: The right insula functions as the hub of the frontal-parietal network, feeding to the right IFG, which mediates the right MFG and bilateral SMG; the right MFG and bilateral SMG then send information to the right THA, which positively modulates brain activations of bilateral caudate nuclei in a lagged relationship and directly influences the cerebellar pathway. The important role of the right insula in inhibitory control has been well documented. In an event-related fMRI study, Garavan et al. (1999) showed that performing a response inhibition task activated the right hemisphere, including the right MFG, IFG, insula, and inferior parietal lobule. A more recent study dissociated the functional role of the right IFG and insula in inhibitory control and suggests that the right insula is particularly important for detecting behaviorally salient events, while the right IFG is more involved in implementing inhibitory control (Cai et al., 2014). Meanwhile, it is interesting to note that the right frontal-insular cortex has been implicated in switching between central-executive and default-mode networks (Sridharan et al., 2008).
In the L2-L3 context, behavioral results showed significant flanker effects in both reaction times and accuracy rates (see Figure 2). Analysis of the neuroimaging data showed that the flanker effect was associated with brain activations in the right inferior frontal gyrus, bilateral insula, left rolandic operculum, bilateral supramarginal gyrus, and right thalamus. This pattern of activations is highly consistent with neural mechanisms underlying typical flanker effects (e.g., Luk et al., 2010). ROI analyses showed that the flanker effect was associated with activations in the right IFG and left cerebellum, where no such significant activations were found in the L1-L2 context, suggesting a priming effect on the inhibitory control when the dual-language context involves two highly proficient languages. Connectivity map shows a similar frontal-parietal network, but in the reverse relationship. To be specific, in the L1-L2 context, the right IFG sends feedback to the right MFG and completes the frontal-parietal circuitry via the right SMG and left SMG; in the L2-L3 context, flanker task involves more engagement of the IFG: The IFG passes positive influences to the right INS, forwarding information to the left SMG and the right THA; the right SMG receives signals from the left SMG and sends feedback to the IFG, completing the frontal-parietal loop.
In the L1-L3 context, the flanker effect is associated with a different neural network, in which the left SMG is the hub. As illustrated in Figure 4, the right MFG influences the left SMG, which influences the right INS, the right IFG, and finally the right MFG, completing a frontal-insula-parietal network without the right SMG; meanwhile, the left SMG directly modulates brain activations in the right THA, which communicates with the right CN in a lagged relationship as in the other two contexts; furthermore, the left SMG feeds to the right cerebellum, which connects to the left cerebellum. This distinction of the neural network in the L1-L3 context, as compared to the L1-L2 and the L2-L3 contexts, is further supplemented by increased activations in the right MFG, right IFG, bilateral basal ganglia and cerebellum (see Table 2). ROI analyses also showed that activations in the right MFG and right cerebellum were associated with the flanker effect.
Research in neuropsychology and cognitive neuroscience has established the role of the left supramarginal gyrus in the inferior parietal lobule in second language acquisition. Neuroimaging data showed that early bilingualism is associated with increased gray matter density in the left inferior parietal lobe (Mechelli et al., 2004). In addition, researchers have found that the lateral inferior parietal cortex contributes to attentional focalization and target detection in both auditory and visual modalities, indicating its involvement in domaingeneral attentional processes (e.g., Green et al., 2006;Shomstein and Yantis, 2006). As summarized by Della Rosa et al. (2013), second language acquisition might tune this attentional control area into a "multilingual talent area" as phonological storage and attentional control functions were also subserved by this left inferior parietal lobe. The right SMG, as suggested by Abutalebi and Green (2008), was particularly involved in language selection in conversations that involve multiple languages. As illustrated in Figure 4, in the L1-L2 context, the right SMG is influenced by the right IFG in a top-down control process, while in the L2-L3 context, it sends feedback to the right IFG, forming a bottom-up stream.
It is worth noticing that in the L1-L2 context, consistent with the adaptive control hypothesis (Abutalebi and Green, 2016), the right IFG feeds to the right insula, which influences the right thalamus, thus modulating subcortical areas such as the caudate and connecting the cerebellum. The thalamus has been reported to directly connect to the regions of basal ganglia (Smith et al., 2011) and has reciprocal structure connections with the cerebellum as a relay station (e.g., Glickstein and Doron, 2008). The left caudate and putamen might be more involved in verb interference effects (Abutalebi and Green, 2008;Ali et al., 2010), while the right homologous areas play a more important role in inhibitory control. Based on our results of trilinguals, dual-language contexts modulate the involvement of inhibitory control areas and their interactions.

CONCLUSION
The finding that dual-language contexts lead to functional reorganizations of the inhibitory control network not only reconciles discrepancies in previous studies (e.g., Wu and Thierry, 2013;Liu et al., 2016), but also provides a novel perspective for investigating the interplay between language control and non-linguistic cognitive processes. To fully understand the nature of the neural mechanisms subserving non-linguistic skills (e.g., executive functions), researchers have to consider the influences of processing contexts. Results of the current study provides empirical evidence in favor of the adaptive control hypothesis (Green and Abutalebi, 2013), which suggests that interactional contexts (e.g., single-language, dual-language, or frequent-switching) modulate language control processes by adaptive changes in the neural regions and circuits associated with specific control processes. Critically, our results showed that the cognitive system and its underlying neural network are highly plastic, allowing quick development of functional reconfigurations. Short-term language engagement, in the form of contextual priming, can instantly rewire the related brain mechanisms. This finding sheds new light on therapy training programs for individuals with minor cognitive impairment (MCI). Whether or not L2 proficiency, age of acquisition, and cross-language similarities (e.g., alphabetical or non-alphabetical) distinctly contribute to the modulation effects of bilingual contexts requires further exploration.

AUTHOR CONTRIBUTIONS
JYa, JYe, RW, KZ, and YW designed the study. JYa and JYe acquired and analyzed the data. JYa, RW, KZ, and YW wrote the manuscript.

FUNDING
This project is supported by National Natural Science Foundation of China (Grant Nos. 31500924