Original Research ARTICLE
The Neural Correlates of the Interaction between Semantic and Phonological Processing for Chinese Character Reading
- 1Key Lab for Behavior & Cognitive Neuroscience of Shaanxi Province, School of Psychology, Shaanxi Normal University, Xi’an, China
- 2Department of Psychology and Linguistics, University of Southern California, Los Angeles, CA, USA
Visual word recognition involves mappings among orthographic, phonological, and semantic codes. In alphabetic languages, it is hard to disentangle the effects of these codes, because orthographically well-formed words are typically pronounceable, confounding orthographic and phonological processes, and orthographic cues to meaning are rare, and where they occur are morphological, confounding orthographic and semantic processes. In Chinese character recognition, it is possible to explore orthography to phonology (O-P) and orthography to semantics (O-S) processes independently by taking advantage of the distinct phonetic and semantic components in Chinese phonograms. We analyzed data from an fMRI experiment using lexical decision for Chinese characters to explore the sensitivity of areas associated with character recognition to orthographic, phonological, and semantic processing. First, a correlation approach was used to identify regions associated with reaction time, frequency, consistency and visual complexity. Then, these ROIs were examined for their responses to stimuli with different types of information available. These results revealed two neural pathways, one for O-S processing relying on left middle temporal gyrus and angular gyrus, and the other for O-P processing relying on inferior frontal gyrus and insula. The two neural routes form a shared neural network both for real and pseudo-characters, and their cooperative division of labor reflects the neural basis for processing different types of characters. Results are broadly consistent with findings from alphabetic languages, as predicted by reading models that assume the same general architecture for logographic and alphabetic scripts.
Visual word recognition involves mappings among orthographic, phonological, and semantic codes. Reading aloud is thought as the cooperative division of labor between orthography-to-phonology and orthography-to-semantics processes (Harm and Seidenberg, 2004; Carreiras et al., 2014). This view has received the support of a considerable amount of evidence from studies of behavior (Strain et al., 1995), neuropsychology (Woollams et al., 2007) and computational modeling (Harm and Seidenberg, 2004), as well as a small amount of neural evidence from alphabetic languages (Frost et al., 2005; Graves et al., 2010; Boukrina and Graves, 2013). In alphabetic writing systems, orthography (O) and phonology (P) are highly confounded, O-P mappings are overwhelmingly regular, and so it’s hard to know whether these interactions are due to the functioning of the neural networks involved in reading, or due to unavoidable correlations in stimulus materials. Nonetheless, a number of recent studies have suggested that the functional organization of the reading system in the brain is generally similar across alphabetic and logographic writing systems (Nakamura et al., 2012; Rueckl et al., 2015; Wang et al., 2015).
In the literature of fMRI studies of word reading from alphabetic languages, a small amount of neural evidence has showed the interaction between the neural routes of orthography-to-phonology (O-P) and orthography-to-semantics (O-S) processing. For example, Frost et al. (2005) found that highly imageable words reduced the activation in left inferior frontal gyrus (IFG) for O-P processing, and correspondingly increased the activation at left middle temporal gyrus (MTG) and angular gyrus (AG) for semantic processing. The result indicates a trade-off between phonology and semantic processing in word reading. In a recent study, Boukrina and Graves (2013) used an effective connectivity algorithm to show that areas supporting semantic processing [e.g., inferior temporal sulcus (ITS)] interacted with phonological areas [e.g., posterior superior temporal gyrus (pSTG)]. The connectivity from ITS to pSTG changed as a function of word properties, in which the connectivity emerged for high- compared to low-imageability words, and for low-consistency words under certain conditions.
The neural basis of O-P processing is consistent in many studies of alphabetic languages. It relies on connections from the ventral occipitotemporal (vOT) region to the temporoparietal junction (Pugh et al., 2000; Price, 2012). In a meta-analysis of 35 neuroimaging studies (Jobard et al., 2003), grapheme–phoneme conversion was proposed to rely on left pSTG, supramarginal gyrus, and the opercular part of IFG. This neural route is in agreement with the findings of the most recent meta-analysis (Cattinelli et al., 2013; Taylor et al., 2013) and the literature review (Carreiras et al., 2014).
On the contrary, the neural basis of O-S processing is far from clear. Price (2012) reviewed the literature and suggested an O-S route as the connection from inferior occipital (iO) to MTG via vOT. This route is in agreement with previous meta-analysis (Jobard et al., 2003). However, it is not consistent with the finding from a work of the effective connectivity (Richardson et al., 2011). Using dynamic causal modeling, Richardson et al. (2011) tested the sub-routes of the connection from the visual cortex to the left temporal lobe. They suggested that O-S processing relies on the link from iO to anterior superior temporal sulcus (STS) via vOT.
The lack of the consensus of the neural basis of O-S processing may be because it is hard to investigate in alphabetic word reading. Alphabetic writing systems represent the phonological structure of the language more or less directly and componentially. Pronunciations can be computed directly from spelling, resulting in a relatively limited role for semantics in reading aloud (Raman and Baluch, 2001). Thus, unlike the consistent findings of the neural basis of O-P processing, the cortical substrate of semantic processing consists of extending left-brain areas in anterior temporal lobule (ATL), MTG, AG, and IFG (see reviews, Binder et al., 2009; Carreiras et al., 2014).
Considering the difficulty of testing the neural basis of O-S processing, prior studies mainly examined the neural correlates of lexico-semantic processing. To do this, researchers commonly manipulated the stimulus property and the task demand. Typically, semantic tasks (e.g., meaning relatedness judgment) recruited more activation at semantic regions than a phonological task (e.g., rhyming judgment, Booth et al., 2006; Vigneau et al., 2006). Stimulus properties are also manipulated to examine additional lexico-semantic processing for particular types of word. For example, according to the connectionist approach, if both the frequency and spelling–sound consistency is low, word reading requires additional input via a semantically mediated pathway (Plaut et al., 1996; Harm and Seidenberg, 2004). Consequently, low frequency irregular/inconsistent words recruited more activation at left AG and MTG for lexico-semantic processing (Taylor et al., 2013). Imageability is another stimuli manipulation on the semantic attributes of the whole word. High imageable word facilitated recognition, particularly for words that depend on semantics for correct naming (Strain et al., 1995), and recruited more activation at left MTG and AG for semantic processing than low imageability words (Frost et al., 2005). These studies suggest a neural pathway for semantic processing contributed to the word reading, and its interaction with the neural pathway for phonological processing. But research on alphabetic writing systems cannot disambiguate lexico-semantic from orthography-to-semantic processing.
The O-S processing can be test in Chinese character reading since the unique logographic properties of the Chinese writing system. In modern Chinese, more than 80% characters are phonograms (Zhu, 1988). Phonograms consist of two components, a phonetic indicating the character’s pronunciation, and a semantic radical indicating the character’s semantic category. The pronunciation of the phonetic influenced Chinese character reading by showing the regularity and consistency effect (Hue, 1992; Lee et al., 2005; Yang et al., 2009). The meaning of some phonetics, as well as the semantic component, was also found to influence processing of Chinese characters (Feldman and Siok, 1999; Zhou and Marslen-Wilson, 1999). Thus, semantic processing has been proposed to interact with phonological processing in the studies of patients (Han et al., 2005; Bi et al., 2007), children (Shu et al., 2005), and computer modeling (Yang et al., 2013).
However, only a few neuroimaging studies in Chinese have addressed the neural basis of O-S processing and its interaction with O-P processing. Based on the task manipulation, the semantic processing in Chinese reading relied on the left MTG and anterior ventral IFG (Wu et al., 2012). The left MTG associated with meaning association judgment than rhyme judgment (Booth et al., 2006), and anterior ventral IFG was more active for meaning judgment than a perceptual control task (Chou et al., 2009). To our knowledge, no existing fMRI study has examined the cortical substrate for processing of semantic cues in the Chinese orthography. Furthermore, the neural network of character reading is more uncertain when considering the language differences. Although, pSTG was consistently active for O-P processing in alphabetic word reading (Paulesu et al., 2000), it was null or deactivated for Chinese character either in naming (Tan et al., 2001a) or lexical decision (LD) task (Yang et al., 2012), whereas, it was active in Chinese during continuous reading (Wang et al., 2015) and semantic judgment tasks (Rueckl et al., 2015). That is, the task demands might influence the previous findings of the Chinese reading network.
In the current study, we reanalyze the data from an fMRI study of Chinese character reading (Yang et al., 2012) to investigate the independent contributions of orthographic, phonological, and semantic processing. Rather than examining semantic or phonological processing manipulated via instructions, we can show it in the context of a single task. To do this, we first used a stimulus correlation approach (Graves et al., 2010) to identify reading network overlaps with alphabetic languages in correlation with reaction time (RT), word frequency, visual complex, and orthography-to-phonology consistency. The parametric approach can use one data set to reveal distinct brain areas those are correlated with highly intercorrelated variables (Hauk et al., 2008). Then, ROIs were examined for their responses to stimuli with different types of information available. In this way, we can investigate the relative contribution of the orthographic, phonological, and semantic processing on character reading. Based on data from alphabetic languages and task manipulations in Chinese fMRI studies, we predicted that the orthographic processing mainly relied on the visual cortex, and orthography-to-semantics processing would rely on the activation of posterior MTG, and inferior parietal cortex (mainly located at the angular and the supramarginal gyrus). The orthography-to-phonology processing would rely on IFG, but not on the posterior STS due to the absence of grapheme-to-phoneme corresponds for Chinese characters. The findings will shed a light on the general architecture for reading of logographic and alphabetic scripts.
Materials and Methods
Data from 16 participants (18–25 years) that participated in a previously published study (Yang et al., 2012) were reanalyzed in the current study. Written informed consent was obtained from each participant, and the study was approved by the Institutional Review Board of the State Key Laboratory of Cognitive Neuroscience and Learning at Beijing Normal University. A complete description of the methods is described in the initial publication. Here, we provide a concise description of the aspects of the data most relevant for the new analyses. There is no overlap between the goal of the present study, which focused on the interaction among brain regions in LD task, and the prior study, which characterized activation of brain regions driven by stimulus properties and tasks demands.
Sixteen participants performed LD task on six types of word-like stimuli: real characters (RW), pseudo-characters containing Phonology plus Semantics (PS), Orthography plus Semantics (OS) or Only Orthography (OO), and two types of orthotactic violations: the reversed radicals (RR) condition, created by reversing the position of components in the OO pseudo-characters, and nonsense strokes (NN) condition, created by randomizing the individual strokes in RR stimuli. Ninety additional real characters (Filler) were included to balance the number of “word” responses in LD task. Here, we analyzed the real characters both including 30 RW and 90 fillers.
By comparing the neural difference between two types of pseudo-character, we can reveal three processing pathways: PS > OS for phonological processing, OS > OO for semantic processing, and OO > RR for orthographic processing. If a region is active both for PS > OS and for OO > RR, its function may be related to orthography-to-phonology mapping. In the same way, the neural basis of orthography-to-semantics mapping can be identified by combining the activity for OS > OO and OO > RR.
Participants were familiarized with the LD task and symbol detection (SD) task, then lay comfortably in the scanner and viewed stimuli via rear projection during the tasks. Participants performed SD task first. Both tasks were run using fast random interval event-related designs. On each trial, a 200 ms fixation cross was presented, followed by a stimulus presented for 500 ms, followed by a randomly jittered inter-trial interval (mean of 5.3 s, range from 1 to 14 s). Stimulus presentation was controlled, and response time and accuracy were recorded using E-Prime software.
In the LD task, participants were asked to respond by button press with their right index finger to real characters, and with their middle finger to all non-character stimuli. This task was completed in two consecutive runs of 135 trials (15 trials for each of the six critical conditions and 45 filler trials). The data of SD task was not included in this study.
Functional and anatomical images were collected using a 3T Siemens Magnetom Trio Timsyngo MR system, with a 12-channel head coil in the State Key Laboratory of Cognitive Neuroscience and Learning of Beijing Normal University. Functional images were collected using a gradient-recalled-echo echo-planar imaging (EPI) sequence sensitive to the BOLD signal. Forty-one axial slices were collected with the following parameters: TR = 2500 ms, TE = 30 ms, flip angle = 90°, FOV = 20 cm, matrix = 64 × 64, 3 mm thickness, yielding a voxel size of 3.125 mm × 3.125 mm × 3 mm, interleaved slices with no gap. The LD task was accomplished in two runs of 332 volumes (13 m, 50 s) including four TRs of rest at the beginning and end of each run.
Following the acquisition of functional data, high resolution T1-weighted anatomical reference images were obtained using a 3D magnetization prepared rapid acquisition gradient echo (MPRAGE) sequence, TR = 2530 ms, TE = 3.45 ms, flip angle = 7°, FoV = 25.6 cm, matrix = 256 × 256 with 1 mm thick sagittal slices.
MRI Data Analysis
Functional data were analyzed using AFNI (Cox, 1996); program names appearing in parentheses below are part of the AFNI suite. Cortical surface models were created with FreeSurfer (available at: http://surfer.nmr.mgh.harvard.edu/), and functional data projected into anatomical surface space using SUMA (AFNI and SUMA are available at http://afni.nimh.nih.gov/afni, Saad et al., 2004; Argall et al., 2006).
After reconstructing 3D AFNI datasets from 2D images (to3d), the anatomical and functional datasets for each participant were co-registered using positioning information from the scanner. The first three volumes were discarded, and functional datasets preprocessed to correct slice timing (3dTshift) and head movements (3dvolreg), reduce extreme values (3dDespike) and detrend linear and quadratic drifts (3dDetrend) from the time series of each run, with no smoothing or filtering.
General Linear Models and Contrasts
Preprocessed data for 2 runs of the LD task were analyzed in a general linear model (GLM) including six regressors of no interest (Six estimates of head movement from motion correction from 3dVolreg). The six experimental regressors were hypothetical hemodynamic response functions (HRFs) constructed by convolving the stimuli presenting time of each condition (Real, PS, OS, OO, RR, and NN) with a model HRF (waver). The contrasts were created by comparing real characters and combined pseudo-characters (PS, OR, OO, and RR) to NN condition.
Stimulus Correlation Analysis
To test for stimulus property-related activation, a real character-related time serial dataset was synthesized by selecting sub-brick and matrix column for real characters from initial GLM (3dSynthesize). Resulting time serial dataset was analyzed in separately GLMs including two regressors: one binary variable for onsets of trials and another continuous, mean-centered values of each stimulus property: RT in the LD task, orthography-to-phonology consistency degree, frequency and number of strokes. The frequency was from a corpus of Modern Chinese Frequency Dictionary (1986). The consistency degree was computed as the summed frequency of friends divided by the summed frequency of all family members (including the character itself, as in Shu et al., 2003). The correlation matrix of stimulus properties is given in Table 1. Frequency is significantly correlated to RT and strokes number, but not to consistency. There is no correlation among consistency, RT and strokes number.
Group analyses were conducted for each contrast or correlation by comparing the mean coefficients from all participants to zero for each node in the standard surface (3dttest). The resulting surface was mapped to an AFNI volume based on a mesh of averaged brain (3dSurf2Vol), resulting in maps with t statistics for each voxel for each contrast. The volume datasets were then converted to Talairach space (@auto_tlrc, using the N27 template) at 2 mm × 2 mm × 2 mm resolution. Activation maps and regions reported as active in tables were obtained by first thresholding individual voxels at p < 0.005 (uncorrected), and then applying a subsequent cluster-size threshold (at least 50 voxels) based on Monte Carlo simulation (AlphaSim), resulting in a corrected threshold of p < 0.05.
Surface Reconstruction and Projection of Functional Data into Surface Space
Surface-based spatial normalization of anatomical and functional data was accomplished using Freesurfer (Fischl et al., 1999) and SUMA (Argall et al., 2006). Anatomical data were reconstructed (to3d), and a surface model for each participant was made with Freesurfer: cortical meshes were extracted from the structural volumes, then inflated to a sphere and registered anatomically. Using the surface atlas, an averaged subject was created by averaging surfaces, curvatures, and volumes from all subjects. The averaged surface was converted into SUMA and was then put into a standard mesh on the SUMA surfaces. The standard mesh was then converted to a volume and transformed (using @auto_tlrc, to the N27 template) to Talairach space (Talairach and Tournoux, 1988) for visualization and reference purposes. Functional data were normalized by transforming volumes resulting from AFNI analyses into surface representations using the standardized surfaces, and computing averages over surfaces.
Region of Interest (ROI) Analysis
To test the function of specific regions sensitive to real characters’ properties, nine ROIs were identified in the left hemisphere by combining areas significantly correlated with parametric variables: six ROIs were highly correlated to frequency, including three ROIs with positive correlations, in temporoparietal junction, MTG and Inferior Parietal Lobule, and three ROIs with negative correlations in pre-central/middle frontal gyrus, insula, and fusiform gyrus; Two ROIs were highly correlated to number of strokes for positive region at inferior occipital gyrus and negative region at AG; One ROI was negatively correlated with consistency degree at IFG. In each ROI, the mean beta value was computed and compared across four types of pseudo-characters.
Overlapping and Distinct Regions for Real and Pseudo-characters
To identify the neural networks for reading Chinese characters, we first analyzed the GLM contrasts for both Real and Pseudo-character against the nonsense strokes (NN condition) baseline. We subsequently conducted a conjunction analysis to examine whether the neural networks were driven by stimulus type. The results were presented in brain maps (Figure 1), and the clusters were listed in Table 2.
FIGURE 1. The contrasts of Real > NN (A) and Pseudo > NN (B) showed a similar neural network for lexical decision (LD) task. The conjunction map (C) showed pseudo-characters activated more regions at right hemisphere and more spatial extending regions at left hemisphere than real characters.
TABLE 2. Brain regions activated more for real characters and pseudo-characters contrasted to stroke patterns (NN).
As viewed side-by-side, the contrasts of Real > NN (Figure 1A) and Pseudo > NN (Figure 1B) showed a similar neural network in the LD task. As shown in Figure 1C, the conjunction map revealed overlapping regions in bilateral intraparietal sulcus (IPS) and insula, and in left hemispheric areas including IFG/MFG, precentral gyrus and fusiform gyrus (FFG). The shared regions for two types of stimuli are largely consistent with the previous contrast of task vs. rest in numerous studies of Chinese character recognition (Peng et al., 2004; Liu et al., 2007; Wang et al., 2011) and meta-analysis (Bolger et al., 2005; Tan et al., 2005; Wu et al., 2012).
The distinct regions observed only for the contrast of Pseudo > NN, not for Real > NN, were mainly located in bilateral IPS, right FFG, right IFG, and right MFG. Also, a greater spatial extent of activation for pseudo-characters was observed in these shared regions.
Brain Regions Sensitive to Stimulus Properties
Using a multi-parametric approach, we explored the brain regions sensitive to visual/orthographic, phonological, and lexical processing in reading Chinese characters. We characterized stimulus-related regions by computing the correlations with BOLD signal level for RT and stimulus properties of characters: character frequency, consistency degree, and the number of strokes. The multi-parametric approach was generally consistent with the networks of Chinese character reading identified in the contrasts described above. Maps of these correlations are shown in Figure 2.
FIGURE 2. Brain areas showing a significant correlation between neural activity and reaction time in LD task, frequency, consistency degree, and stroke numbers.
Correlations between BOLD signal and RT were observed in the Chinese reading network. Positive correlations with RT indicate increasing activity for items with slow responses, or decreasing activity for fast items. As shown in Figure 2 and Table 3, slow items were associated with increased BOLD signal (positive correlation) in bilateral regions including insula, IFG, IPS, and FFG, as well as in left regions at middle occipital gyrus (MOG), IPS, MFG, and pre/post-central gyrus. These findings are consistent with RT-related regions in English word naming (Binder et al., 2005; Graves et al., 2010). In addition, slow items were associated with decreased activity (negative correlation) in left MFG, right anterior MTG, and bilateral areas of precuneus, anterior cingulate cortex (ACC) and AG.
Positive correlations for frequency indicated the increasing BOLD signal intensity for high frequency characters. As shown in Figure 2 and Table 4, these regions included clusters along the midline in bilateral precuneus and ACC, and bilateral clusters in AG, supramarginal gyrus (SMG), and MTG. Negative correlations for word frequency were mainly found in bilateral portions of insula and MFG, and in portions of left-lateralized precentral gyrus, FFG, and precuneus.
TABLE 4. Brain regions showing significant correlations with BOLD signal for stimuli properties: word frequency, consistency, and the number of strokes.
Correlations with consistency degree (ratio of homophones in the same phonetic family) were negative, including bilateral IFG and insula, indicating increasing neural activity in those regions for characters with more enemies than friends (See Table 4). This is consistent with prior studies of English word reading (Graves et al., 2010).
Positive correlations with the number of strokes (See Table 4) mainly occurred in right calcarine sulcus for processing of additional visual input, as well as bilateral parahippocampal gyrus, anterior and posterior cingulate. Negative correlations were in bilateral AG and precuneus, indicating more deactivation for more complex visual stimuli.
ROIs Response for Sublexical Properties
In order to further examine the function of brain regions in the reading network identified both from GLM and multi-parametric approach, we conducted further analyses of ROIs identified in those analyses. Nine ROIs were selected to compare activation levels among four types of pseudo-characters (Figure 3). ANOVAs revealed a significant main effect of the types of pseudo-character for those ROIs (except for the inferior occipital gyrus IOG, F(3,45) = 1.70, p = 0.18), indicating that although these regions were selected via a lexicality contrast, they are also modulated by the sublexical information.
FIGURE 3. Map of nine ROIs (A) identified from correlation analysis, with bar graphs (B) illustrating patterns of percent signal change for four types of pseudo-characters.
Post hoc comparisons were conducted among four types of pseudo-characters in each ROI. The results are presented in Table 5. Based on contrasts designed to reveal phonological (PS > OS), semantic (OS > OO), and orthographic (OO > RR) processing, three distinct activation patterns emerged so that these ROIs were summarized as three groups engaging in O-P, O-S, and orthographic processing, respectively. The first group of ROIs located in frontal cortex (IFG and MFG) and insula. These ROIs showed a monotonic effect of sublexical information (insula, F(3,45) = 27.82, p < 0.01; IFG, F(3,45) = 17.46, p < 0.01; and MFG, F(3,45) = 17.07, p < 0.01), in which the PS (Phonology plus Semantics) condition evoked the strongest activities and the RR (Reversed Radical) condition evoked the weakest. Both for IFG and insula, their activities were stronger for PS than for OS, and stronger for OO than for RR, indicating that they are sensitive to O-P processing. However, for MFG, its activity was incremental from RR to PS and only OS > RR and PS > OS reached significant.
The second group of ROIs located at MTG and temporoparietal junction (SMG and AG). All these ROIs showed similar stimuli effects (MTG, F(3,45) = 8.12, p < 0.01; SMG, F(3,45) = 12.81, p < 0.01; AG, F(3,45) = 16.44, p < 0.01). Their activities were stronger for OS > OO, but not for PS > OS, indicating that they are sensitive to sublexical semantic processing but not to phonological processing. Only in AG, a significantly stronger effect was found for OO > RR, indicating a role in O-S processing.
The third group ROIs was related to orthographic processing (FFG and IPL). There was a significant stimuli effect both for FFG (F(3,45) = 6.23, p < 0.01) and for IPL (F(3,45) = 3.92, p < 0.05). Only a significant difference was observed between the strongest and weakest activated condition (FFG, PS > RR; IPL, OS > RR). No significant difference was observed for other pairs of conditions. It indicated FFG and IPL were not sensitive to O-P or O-S processing.
Sub-Regions in Ventral Inferior Frontal Cortex and Insula
From ROI analysis, we observed a similar pattern at IFG and insula. It might result from the activity of overlapping voxels drawn from these two adjacent ROIs. The ROI labeled as IFG for consistency shared some voxels with the ROI labeled as insula for frequency. To test the function of sub-regions, we created a conjunction map (shown in Figure 4A) by selecting voxels specific for the consistency degree, specific for the frequency and common for both. As shown in Figure 4B, ROI analysis revealed two types of pattern among pseudo-characters in three sub-regions, multiple comparison results were listed in Table 5.
FIGURE 4. Brain map (A) of conjunction analysis on inferior frontal cortex correlated to frequency (blue), consistency (red) and both (yellow), with the bar graphs (B) illustrating patterns of percent signal change for four types of pseudo-characters.
For common and frequency specific sub-regions in insula, we observed a similar stimulus effect, in which the BOLD intensity was greater for PS > OS and OO > RR indicating a function of O-P processing. The difference between OS and OO condition did not reach significant indicating this region was not sensitive to O-S processing.
As for the sub-region specific for consistency degree at IFG, we observed a different stimulus effect. The PS condition evoked higher activation than three other conditions, and there was no difference among OS, OO, and RR conditions. This pattern indicated left IFG was only sensitive to phonological processing.
The main goal of the current study was to investigate the neural networks involved in Chinese character reading and the independent contribution of orthographic, phonological, and semantic processing. Consistent with prior studies of reading in alphabetic languages, we identified two neural pathways in the left hemisphere. One pathway relied on left posterior middle temporal gyrus (pMTG) and AG for semantic processing, and the other pathway relied on left IFG and insula for phonological processing. The cooperative division of labor of these two neural pathways implicates the neural basis for processing different types of characters. The findings are consistent with reading models that assume the same general architecture for logographic and alphabetic scripts.
Neural Correlates for Semantic Processing
A significant finding of the present study is the neural basis of semantic processing in Chinese character reading. Since the mapping from orthography to meaning is arbitrary in alphabetic words, the logographic properties of the Chinese writing system provide an opportunity to directly examine the neural basis of semantic processing. Pseudo-characters with semantic cues were associated with stronger activity in left MTG and AG. This is potentially related to lexico-semantic processing, as semantic cues in Chinese character provide probabilistic information about the whole character. Our result is consistent with previous findings that these regions were active more for semantic tasks than phonological tasks (Price et al., 1997; Booth et al., 2006), and high, compared to low imageability words (Frost et al., 2005).
But, the engagement of left AG might go beyond the lexico-semantic processing. In current study, the left AG was not only active for lexico-semantic (OS > OO) processing, but also for orthographic (OO > RR) processing (see Figure 3). The result indicated a function of AG for mapping from orthography to semantics. Prior studies have shown that AG plays a role in complex information integration and knowledge retrieval (Binder et al., 2009; Bonner et al., 2013). For Chinese O-S mapping, we might suggest that AG plays a role in combining the perceptual information, integrating the meaning, and manipulating the relevant information in the semantic system (Seghier, 2013; Price et al., 2015).
Thus, the neural basis of semantic processing in reading Chinese characters relied on the involvement of left MTG and AG. The MTG associated more strongly with lexico-semantic processing, whereas, the AG associated with integration of semantic cues and other sources of information.
Neural Correlates for Phonological Processing
Another interesting finding is the neural correlates of the phonological processing in Chinese character reading. We found a correlation with consistency effects and selectivity for pseudo-characters including phonological information in left IFG and insula. This finding is consistent with previous studies that these two regions are sensitive to consistency in Chinese (Lee et al., 2004, 2010). It is also consistent with findings in alphabetic languages (e.g., Fiez et al., 1999; Graves et al., 2010) indicating left IFG and insula could be involved in phonological processing across languages. The IFG and insula can be understood as contributing to phonological processing in the context of multiple different models of word recognition. As Taylor et al. (2013) point out in a meta-analysis of FMRI research on reading in alphabetic languages, the IFG is more strongly activated for both pseudowords and irregular words. This is consistent with the “triangle” model, in which computing phonology for both pseudowords and words with inconsistent spelling-to-sound correspondences generates a high degree of ambiguity in phonological representations. Similarly, Taylor et al. (2013) assert that in a dual-route framework, the IFG may be thought of as a phonological buffer, where ambiguities arising during grapheme-to-phoneme conversion (e.g., Rastle and Coltheart, 1999) or from computation between addressed and assembled phonological representations may play out.
The role of the insula in visual word recognition is less well-understood, although, it is engaged in visual word reading both in English (Fiebach et al., 2002) and in Chinese (Kuo et al., 2004), and is proposed to be sensitive to phonological processing (particularly sublexical spelling-sound translation, Borowsky et al., 2006). In addition, some researchers have suggested that insula is responsive to cognitive processing load (Zaccarella and Friederici, 2015) because of its increasing activity with the increasing processing demands (Bahlmann et al., 2008). This explanation is consistent with the view of insula as a function of general processing demand in English word reading (Graves et al., 2010). Consistent with this view, we found activity in the insula was strongly correlated with RT, and, further, showed effects of both phonological (PS > OS) and orthographic (OO > RR, see Figure 3) structure in the pseudo-characters.
Lack of Phonological Effects in Posterior Superior Temporal Cortex
The left pSTG is a core region for sublexical phonological processing in alphabetic languages (Paulesu et al., 2000; Pugh et al., 2000), but was not found to be engaged in Chinese phonological processing in our study. A common explanation is that the pSTG is specific to alphabetic languages, and plays a specific role in “assembled phonology” (e.g., Paulesu et al., 2000; Tan et al., 2005). Chinese orthography is extremely opaque, and in some models, is assumed to lack assembled phonological processes entirely (Coltheart et al., 2001; Perfetti et al., 2005). The finding is compatible with the relatively weak (or null) activation in previous Chinese fMRI studies (Tan et al., 2005; Liu et al., 2007). This is also consistent with the finding that a patient with a lesion in the left STG showed phonological dyslexia in English but intact reading in Japanese (Wydell and Kondo, 2015). On the other hand, another possibility is that the LD task itself puts minimal demands on O-P processing, and may not be optimal for observing activity in pSTG. We have observed activity in this region for Chinese reading under naturalistic contexts (Wang et al., 2015), and in a working memory task (Wang et al., 2011; Yang et al., 2011). Furthermore, Rueckl et al. (2015) recently reported activity in this area for Chinese during a semantic decision task, that did not differ from three alphabetic languages (Spanish, English, and Hebrew).
Middle Frontal Gyrus and Meta-linguistic Processing Demands
It is also worth mentioning the engagement of left MFG in Chinese character reading. Although, this region was proposed as a typical Chinese-specific region for reading (Siok et al., 2004), its function was a long-debated issue in prior studies. The left MFG has been linked to the semantic access (Tan et al., 2000; Luke et al., 2002), addressed phonology (Tan et al., 2005), or integrating the visual spatial analysis and the semantic (or phonological) analysis (Tan et al., 2001b). In our data, left MFG was highly correlated with frequency, but not consistency. Its function seems unrelated to phonological processing for Chinese characters, but it was sensitive to sublexical phonological (PS > OS) processing for pseudo-characters. These results reflect that the processing demand drove the activity of the left MFG. This notion is in concordance with the view that left MFG was linked to working memory (Liu et al., 2007) for retaining the character form until both meaning and pronunciation was retrieved. It hence can explain the observation that left MFG was active in alphabetic word processing under particular task demands (Nakamura et al., 2012), and in naturalistic reading (Wang et al., 2015).
The Interaction between Semantic and Phonological Processing
In addition to patterns of activity that were apparently specific to O-P and O-S processing, we also found evidence for interactions between semantic and phonological processing. For example, areas supporting phonological processing (IFG, insula) showed a gradient activation from RR to PS pseudo-characters. Meanwhile, areas supporting semantic processing (MTG, AG) were deactivated gradually from RR to PS. A similar trade-off effect has been found in English word reading (Frost et al., 2005), and has been interpreted in terms of the connectionist triangle model (Harm and Seidenberg, 2004) of reading (Carreiras et al., 2014).
Consistent with our findings, some patient studies have shown interactions between semantic and phonological processing in reading logographic characters. Bi et al. (2007) presented a case study of a patient with semantic deficits following a left temporal lobe ischemic damage, whose reading performance depended on whether he knew the meaning of the target character: he could correctly name those characters for which he could also provide a definition, whereas, for those characters he could not understand, his reading performance was poor overall. Nonetheless, for characters he could not understand, his performance was better for regular-consistent than irregular-inconsistent characters, indicating that phonological processing was engaged while the semantic processing was unavailable. This interaction is consistent with findings in neuropsychological studies both in alphabetic (Woollams, 2014) and logographic scripts (e.g., see review of Chinese, Weekes et al., 2006; review of Japanese, Sato, 2015), in which damage to visual/orthographic, phonological, and semantic processing abilities are the root causes of different types of acquired dyslexia. This is consistent with connectionist approaches to understanding reading in general, and with our own model of reading Chinese characters (Yang et al., 2009, 2013).
The current study is the first application of a multi-parametric approach in investigating the neural network for Chinese character reading. The findings were consistent with studies in alphabetic word reading using the same approach (Graves et al., 2010). One difficulty of the multi-parametric approach is to avoid the intercorrelation among variables. Our analysis used data from a previous study (Yang et al., 2012) that manipulated stimulus properties relevant to the differential contribution of orthographic, phonological, and semantic properties to character recognition, so that phonological consistency was not correlated with other variables. However, frequency was significantly correlated with number of strokes and, unavoidably in a LD experiment, with RT. Further study using multi-parametric approach should deliberately select stimuli to avoid intercorrelation among variables, and include more semantic factors such as imageability.
In this work, we sought to determine the cortical substrate of the relationship between orthography-to-phonology and orthography-to-semantics processing in visual word reading. We found a shared neural network both for real and pseudo-characters, and evidence for cooperative division of labor between semantic and phonological processing. The finding provided neural evidence for the connectionist model that assumes a general framework for reading both alphabetic words (Seidenberg, 2011; Carreiras et al., 2014) and Chinese characters (Yang et al., 2009, 2013).
XW, JZ, and JY designed research; XW, RZ, and JY analyzed data; XW and JY interpreted data; XW, JZ, and JY wrote the paper.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This work was supported by “the Fundamental Research Funds for the Central Universities” (GK201402056 to XW), the Natural Science Foundation of China (31400966 to XW and 31171077 to JY), the Humanities and Social Sciences Foundation of the Ministry of Education (14YJC190019 to XW) and the National Institutes of Health (P01 HD001994, P01 HD070837 and R01 HD06579 to JZ).
Bi, Y., Han, Z., Weekes, B., and Shu, H. (2007). The interaction between semantic and the nonsemantic systems in reading: evidence from Chinese. Neuropsychologia 45, 2660–2673. doi: 10.1016/j.neuropsychologia.2007.02.007
Binder, J. R., Desai, R. H., Graves, W. W., and Conant, L. L. (2009). Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. Cereb. Cortex 19, 2767–2796. doi: 10.1093/cercor/bhp055
Binder, J. R., Medler, D. A., Desai, R., Conant, L. L., and Liebenthal, E. (2005). Some neurophysiological constraints on models of word naming. Neuroimage 27, 677–693. doi: 10.1016/j.neuroimage.2005.04.029
Bolger, D. J., Perfetti, C. A., and Schneider, W. (2005). Cross-cultural effect on the brain revisited: universal structures plus writing system variation. Hum. Brain Mapp. 25, 92–104. doi: 10.1002/hbm.20124
Booth, J. R., Lu, D., Burman, D. D., Chou, T. L., Jin, Z., Peng, D. L., et al. (2006). Specialization of phonological and semantic processing in Chinese word reading. Brain Res. 1071, 197–207. doi: 10.1016/j.brainres.2005.11.097
Borowsky, R., Cummine, J., Owen, W., Friesen, C., Shih, F., and Sarty, G. (2006). fMRI of ventral and dorsal processing streams in basic reading processes: insular sensitivity to phonology. Brain Topogr. 18, 233–239. doi: 10.1007/s10548-006-0001-2
Cattinelli, I., Borghese, N. A., Gallucci, M., and Paulesu, E. (2013). Reading the reading brain: a new meta-analysis of functional imaging data on reading. J. Neurolinguist. 26, 214–238. doi: 10.1016/j.jneuroling.2012.08.001
Chou, T. L., Chen, C. W., Wu, M. Y., and Booth, J. R. (2009). The role of inferior frontal gyrus and inferior parietal lobule in semantic processing of Chinese characters. Exp. Brain Res. 198, 465–475. doi: 10.1007/s00221-009-1942-y
Coltheart, M., Rastle, K., Perry, C., Langdon, R., and Ziegler, J. (2001). DRC: a dual route cascaded model of visual word recognition and reading aloud. Psychol. Rev. 108, 204–256. doi: 10.1037/0033-295X.108.1.204
Fiebach, C. J., Friederici, A. D., Muller, K., and von Cramon, D. Y. (2002). fMRI evidence for dual routes to the mental lexicon in visual word recognition. J. Cogn. Neurosci. 14, 11–23. doi: 10.1162/089892902317205285
Fiez, J. A., Balota, D. A., Raichle, M. E., and Petersen, S. E. (1999). Effects of lexicality, frequency, and spelling-to-sound consistency on the functional anatomy of reading. Neuron 24, 205–218. doi: 10.1016/S0896-6273(00)80833-8
Fischl, B., Sereno, M. I., and Dale, A. M. (1999). Cortical surface-based analysis. II: inflation, flattening, and a surface-based coordinate system. Neuroimage 9, 195–207. doi: 10.1006/nimg.1998.0396
Frost, S. J., Mencl, W. E., Sandak, R., Moore, D. L., Rueckl, J. G., Katz, L., et al. (2005). A functional magnetic resonance imaging study of the tradeoff between semantics and phonology in reading aloud. Neuroreport 16, 621–624. doi: 10.1097/00001756-200504250-00021
Graves, W. W., Desai, R., Humphries, C., Seidenberg, M. S., and Binder, J. R. (2010). Neural systems for reading aloud: a multiparametric approach. Cereb. Cortex 20, 1799–1815. doi: 10.1093/cercor/bhp245
Han, Z., Bi, Y., Shu, H., and Weekes, B. S. (2005). The interaction between semantic and sublexical routes in reading: converging evidence from Chinese. Brain Lang. 95, 235–236. doi: 10.1016/j.bandl.2005.07.120
Harm, M. W., and Seidenberg, M. S. (2004). Computing the meanings of words in reading: cooperative division of labor between visual and phonological processes. Psychol. Rev. 111, 662–720. doi: 10.1037/0033-295X.111.3.662
Hauk, O., Davis, M. H., and Pulvermüller, F. (2008). Modulation of brain activity by multiple lexical and word form variables in visual word recognition: a parametric fMRI study. Neuroimage 42, 1185–1195. doi: 10.1016/j.neuroimage.2008.05.054
Jobard, G., Crivello, F., and Tzourio-Mazoyer, N. (2003). Evaluation of the dual route theory of reading: a metanalysis of 35 neuroimaging studies. Neuroimage 20, 693–712. doi: 10.1016/S1053-8119(03)00343-4
Kuo, W. J., Yeh, T. C., Lee, J. R., Chen, L. F., Lee, P. L., Chen, S. S., et al. (2004). Orthographic and phonological processing of Chinese characters: an fMRI study. Neuroimage 21, 1721–1731. doi: 10.1016/j.neuroimage.2003.12.007
Lee, C.-Y., Huang, H.-W., Kuo, W.-J., Tsai, J.-L., and Tzeng, J. L. O. (2010). Cognitive and neural basis of the consistency and lexicality effects in reading Chinese. J. Neurolinguist. 23, 10–27. doi: 10.1016/j.jneuroling.2009.07.003
Lee, C.-Y., Tsai, J.-L., Kuo, W.-J., Yeh, T.-C., Wu, Y.-T., Ho, L.-T., et al. (2004). Neuronal correlates of consistency and frequency effects on Chinese character naming: an event-related fMRI study. Neuroimage 23, 1235–1245. doi: 10.1016/j.neuroimage.2004.07.064
Luke, K.-K., Liu, H.-L., Wai, Y.-Y., Wan, Y.-L., and Tan, L. H. (2002). Functional anatomy of syntactic and semantic processing in language comprehension. Hum. Brain Mapp. 16, 133–145. doi: 10.1002/hbm.10029
Nakamura, K., Kuo, W. J., Pegado, F., Cohen, L., Tzeng, O. J., and Dehaene, S. (2012). Universal brain systems for recognizing word shapes and handwriting gestures during reading. Proc. Natl. Acad. Sci. U.S.A. 109, 20762–20767. doi: 10.1073/pnas.1217749109
Peng, D. L., Ding, G. S., Perry, C., Xu, D., Jin, Z., Luo, Q., et al. (2004). fMRI evidence for the automatic phonological activation of briefly presented words. Cogn. Brain Res. 20, 156–164. doi: 10.1016/j.cogbrainres.2004.02.006
Perfetti, C. A., Liu, Y., and Tan, L. H. (2005). The lexical constituency model: some implications of research on Chinese for general theories of reading. Psychol. Rev. 112, 43–59. doi: 10.1037/0033-295X.112.1.43
Plaut, D. C., McClelland, J. L., Seidenberg, M. S., and Patterson, K. (1996). Understanding normal and impaired word reading: computational principles in quasi-regular domains. Psychol. Rev. 103, 56–115. doi: 10.1037/0033-295X.103.1.56
Price, A. R., Bonner, M. F., Peelle, J. E., and Grossman, M. (2015). Converging evidence for the neuroanatomic basis of combinatorial semantics in the angular gyrus. J. Neurosci. 35, 3276–3284. doi: 10.1523/JNEUROSCI.3446-14.2015
Pugh, K. R., Mencl, W. E., Jenner, A. R., Katz, L., Frost, S. J., Lee, J. R., et al. (2000). Neurobiological studies of reading and reading disability. J. Commun. Disord. 34, 479–492. doi: 10.1016/S0021-9924(01)00060-0
Richardson, F. M., Seghier, M. L., Leff, A. P., Thomas, M. S., and Price, C. J. (2011). Multiple routes from occipital to temporal cortices during reading. J. Neurosci. 31, 8239–8247. doi: 10.1523/JNEUROSCI.6519-10.2011
Rueckl, J. G., Paz-Alonso, P. M., Molfese, P. J., Kuo, W.-J., Bick, A., Frost, S. J., et al. (2015). Universal brain signature of proficient reading: evidence from four contrasting languages. Proc. Natl. Acad. Sci. U.S.A. 112, 15510–15515. doi: 10.1073/pnas.1509321112
Saad, Z., Reynolds, R., Cox, R. J., Argall, B., and Japee, S. (2004). SUMA: an interface for surface-based intra- and inter-subject analysis with AFNI. Paper Presented at the International Symposium on Biomedical Imaging, Washington, DC.
Sato, H. (2015). Do different orthographies share the same mechanisms of reading? A review of research on and models for Japanese acquired dyslexia. Aphasiology 29, 1189–1218. doi: 10.1080/02687038.2015.1034084
Seidenberg, M. (2011). “Reading in different writing systems: one architecture, multiple solutions,” in Dyslexia Across Language: Orthography and the Gene-Brain-Behavior Link, eds P. McCardle, J. Ren, and O. Tzeng (Baltimore, MD: Paul Brooke Publishing), 151–174.
Tan, L. H., Laird, A. R., Li, K., and Fox, P. T. (2005). Neuroanatomical correlates of phonological processing of Chinese characters and alphabetic words: a meta-analysis. Hum. Brain Mapp. 25, 83–91. doi: 10.1002/hbm.20134
Tan, L. H., Spinks, J. A., Gao, J. H., Liu, H. L., Perfetti, C. A., Xiong, J., et al. (2000). Brain activation in the processing of Chinese characters and words: a functional MRI study. Hum. Brain Mapp. 10, 16–27. doi: 10.1002/(SICI)1097-0193(200005)10:1<16::AID-HBM30>3.0.CO;2-M
Taylor, J. S. H., Rastle, K., and Davis, M. H. (2013). Can cognitive models explain brain activation during word and pseudoword reading? A meta-analysis of 36 neuroimaging studies. Psychol. Bull. 139, 766–791. doi: 10.1037/a0030266
Vigneau, M., Beaucousin, V., Herve, P. Y., Duffau, H., Crivello, F., Houde, O., et al. (2006). Meta-analyzing left hemisphere language areas: phonology, semantics, and sentence processing. Neuroimage 30, 1414–1432. doi: 10.1016/j.neuroimage.2005.11.002
Wang, X., Yang, J., Shu, H., and Zevin, J. D. (2011). Left fusiform BOLD responses are inversely related to word-likeness in a one-back task. Neuroimage 55, 1346–1356. doi: 10.1016/j.neuroimage.2010.12.062
Wang, X., Yang, J., Yang, J., Mencl, W. E., Shu, H., and Zevin, J. D. (2015). Language differences in the brain network for reading in naturalistic story reading and lexical decision. PLoS ONE 10:e0124388. doi: 10.1371/journal.pone.0124388
Woollams, A. M., Ralph, M. A., Plaut, D. C., and Patterson, K. (2007). SD-squared: on the association between semantic dementia and surface dyslexia. Psychol. Rev. 114, 316–339. doi: 10.1037/0033-295X.114.2.316
Wu, C.-Y., Ho, M.-H. R., and Chen, S.-H. A. (2012). A meta-analysis of fMRI studies on Chinese orthographic, phonological, and semantic processing. Neuroimage 63, 381–391. doi: 10.1016/j.neuroimage.2012.06.047
Yang, J., McCandliss, B. D., Shu, H., and Zevin, J. D. (2009). Simulating language-specific and language-general effects in a statistical learning model of Chinese reading. J. Mem. Lang. 61, 238–257. doi: 10.1016/j.jml.2009.05.001
Yang, J., Shu, H., McCandliss, B. D., and Zevin, J. D. (2013). Orthographic influences on division of labor in learning to read Chinese and English: insights from computational modeling. Biling. (Camb. Engl.) 16, 354–366. doi: 10.1017/S1366728912000296
Yang, J., Wang, X., Shu, H., and Zevin, J. D. (2012). Task by stimulus interactions in brain responses during Chinese character processing. Neuroimage 60, 979–990. doi: 10.1016/j.neuroimage.2012.01.036
Zaccarella, E., and Friederici, A. D. (2015). Reflections of word processing in the insular cortex: a sub-regional parcellation based functional assessment. Brain Lang. 142, 1–7. doi: 10.1016/j.bandl.2014.12.006
Zhu, X. (1988). “Analysis of cueing function of phonetic components in modern Chinese,” in Proceedings of the Symposium on the Chinese Language and Characters, ed. X. Yuan (Bejing: Guang Ming Daily Press), 85–99.
Keywords: visual word reading, fMRI, stimulus correlation approach, Chinese, connectionist model
Citation: Wang X, Zhao R, Zevin JD and Yang J (2016) The Neural Correlates of the Interaction between Semantic and Phonological Processing for Chinese Character Reading. Front. Psychol. 7:947. doi: 10.3389/fpsyg.2016.00947
Received: 17 December 2015; Accepted: 08 June 2016;
Published: 23 June 2016.
Edited by:Denise Hsien Wu, National Central University, Taiwan
Reviewed by:Olaf Hauk, MRC Cognition and Brain Sciences Unit, UK
Lorenzo Magrassi, University of Pavia, Italy
Copyright © 2016 Wang, Zhao, Zevin and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.