“I Can Read These Colors.” Orthographic Manipulations and the Development of the Color-Word Stroop

The color-word Stroop is a popular measure in psychological assessments. Evidence suggests that Stroop performance relies heavily on reading, an ability that improves over childhood. One way to influence reading proficiency is by orthographic manipulations. To determine the degree of interference posed by orthographic manipulations with development, in addition to standard color-Words (purple) we manipulated letter-positions: First/last letter in correct place (prulpe) and Scrambled (ulrpep). We tested children 7–16 years (n = 128) and adults (n = 23). Analyses showed that Word- and First/last-incongruent were qualitatively similar, whereas Word-congruent was different than other conditions. Results suggest that for children and adults, performance was hindered the most for incongruent and incorrectly spelled words and was most facilitated when words were congruent with the ink color and correctly spelled. Implications on visual word recognition and reading are discussed.


INTRODUCTION
The color-word Stroop task (Stroop, 1935) is a widely used measure that has been theorized to be an index of executive functioning such as interference control (e.g., van Mourik et al., 2005), selective attention and cognitive flexibility (e.g., Homack and Riccio, 2004;Charchat-Fichman and Oliveira, 2009), and response inhibition (Pocklington and Maybery, 2006). The Stroop task requires an individual to identify the ink color of stimuli as quickly as possible. Typically, participants are asked to name the color of the ink, of a list of "X"s (color-baseline condition) or the color of the ink of congruent color-words (i.e., the word red written in red ink; congruent condition). In the Stroop condition, the color of the ink is incongruent with the written word (i.e., the word blue written in red ink). Research consistently finds that it takes longer to name the color of the ink in the incongruent, Stroop condition. Many versions of the Stroop have been designed, such as the number Stroop and the emotional Stroop (MacLeod, 1991). When considering only the prototypical color-word Stroop, relative to the hundreds of adult studies, investigations over early development are scarce. Learning to read is a key contributor for detecting this effect, and as the color-word Stroop contains words it lends itself to orthographic manipulations. The main purpose of this study was to examine the effects of orthographic manipulation (i.e., changing letter-positions in color-words) on interference elicited by the Stroop task developmentally. Comalli et al. (1962) were the first to use the Stroop with children and adults ranging from 7 to 80 years old (N = 235). Using 100-item cards they showed (a) colored rectangles (colorbaseline), (b) color-words in black ink, and (c) color-words written in incongruent colors. Participants became progressively faster in responding to the three conditions as a function of age, but they were slowest on the incongruent colors. A large body of clinical and experimental research uses the color-word Stroop, such as in detecting deficits in inhibition in individuals with attention deficit disorder (e.g., Homack and Riccio, 2004;Schwartz and Verhaeghen, 2008 for meta-analyses). The majority of the studies using the color-word Stroop are individual difference rather than developmental studies. We found relatively few reports that examined three or more age groups of typically developing children and adolescence using the Stroop (Comalli et al., 1962;Schiller, 1966;Berninger et al., 1991;Armengol, 2002;Leon-Carrion et al., 2004;Pritchard and Neumann, 2004;Peru et al., 2006;Charchat-Fichman and Oliveira, 2009;Polderman et al., 2009), overall showing a negative relation between age and performance on the Stoop (i.e., as age increases, response times decrease).
Inhibitory control, assessed with measures other than the Stroop (e.g., Stop signal), also shows a protracted development from childhood to adulthood (Williams et al., 1999;Bedard et al., 2002;Davidson et al., 2006), although some suggest it develops very early (by grade 2; Schachar and Logan, 1990;Christ et al., 2001). It appears, however, that the rate at which inhibition develops changes as a function of age (Luna and Sweeney, 2004;Best et al., 2009; for reviews). Specifically, improvements in inhibitory abilities are easily detected in pre-school children (Montgomery and Koeltzow, 2010) yet improvements are also reported for middle-school children and adolescents (Leon-Carrion et al., 2004;Luna and Sweeney, 2004), with 13-year-olds still not attaining complete adult levels (Davidson et al., 2006). On average, younger children (6-8 years) are about 50 ms slower in stopping a prepotent response than older children (9-12) who in turn are about 30 ms slower than adolescents (13-17 years; Williams et al., 1999). The latter results are consistent with neuroimaging findings showing that the pre-frontal cortex, an area highly correlated with executive functions, continues to develop through childhood and adolescence (Kolb and Whishaw, 2003) and this protracted maturation is reflected in the development of inhibitory abilities (Luna, 2009). Meta-analysis evidence verifies that the pre-frontal cortex plays a key role on the Stroop performance in adults (Laird et al., 2005).
Developmental functional magnetic resonance imaging (fMRI) studies using the Stroop (Adleman et al., 2002;Marsh et al., 2006) have used a sub-vocal response modality, which the authors acknowledged was a limitation in their study due to lack of task compliance assessment during scanning (Adleman et al., 2002). Sub-vocal responding may also increase voluntary or involuntary movements that could compromise the quality of brain images. Therefore, with a future aim to study the brain correlates of orthographic effects in the Stroop, we designed our protocol by modifying the Stroop paradigm to be compatible for use with fMRI and incorporated a speeded, manual response.
Apart from presentation and response modality modifications, the Stroop paradigm has been adapted widely to investigate the effect of interference in many domains and in different contexts (MacLeod, 1991(MacLeod, , 2005 for comprehensive reviews). Past studies modified the Stroop by altering pronounceability of non-words (e.g., "hrwd" and "swal") and the meaning of words in relation to their color (e.g.,"carrot"and"chair"); these were found to affect the intensity of interference (e.g., longer responses to "carrot" when written in incongruent ink color; MacLeod, 1991). Also, using only certain letters of the color word (e.g., the first letter; Regan, 1978 or the first three letters; McCown and Arnoult, 1981) were enough to elicit interference in adults; comparable investigations were not completed with children. Relevant developmental work was performed by Berninger et al. (1991), who showed children in grades 2, 4, and 6, color-words in which either two letters of the word (e.g., green, "en" printed in red) or single-letter combinations (e.g., green, "r" printed in red) were printed in incongruent colors, as well as whole words (e.g., green printed in red). The authors observed that students' responses were slowest in the following order: word > single-letter > two-letter cluster. Berninger et al. (1991) did not include stimuli with transposed letters (i.e., students viewed the whole-word spelled correctly). We are not aware of any studies that directly manipulated orthography of the color-words in the Stroop to examine age-related effects.
The ability to read is clearly a component for observing the Stroop effect, as children under the age of six do not experience this effect (e.g., Comalli et al., 1962;Peru et al., 2006), but at the age of seven this effect is observed (e.g., Comalli et al., 1962;Armengol, 2002;Peru et al., 2006). Learning to read is a critical achievement for children, which requires concurrent coordination of semantic, phonological, and orthographic features (Ehri, 2005). According to phase theory (e.g., Ehri, 1995Ehri, , 2005 all words, via appropriate practice, are read through sight. Sight word reading, as it is referred to, undergoes four successive phases: pre-alphabetic, partial, full, and consolidated alphabetic phases (Ehri, 1995(Ehri, , 2005. Using various measures of reading development (e.g., test of alphabetical knowledge, vocabulary, and reading comprehension), Vellutino et al. (2007) proposed a comprehensive model of reading proficiency in younger (grades 2-3) and older readers (grades 6-7), showing the multifaceted aspects of reading. Across development, reading becomes increasing automatic in grades 1-5 (Paris, 2005), with practiced words attaining mastery sooner than others (Ehri, 2005). Reading skills follow a sigmoid (S-) growth function; learning begins slowly, followed first by a sharp learning curve and then by slow improvements toward a plateau (Paris, 2005). Specifically, children read about 50 words correctly per minute when they start to read (e.g., grade 1; 5-6 years) and improve by about 13 more words per minute, per year, up to grade 5 (10-11 years; Paris, 2005). Overall, reading is a complex ability that is typically achieved, via practice, in the first decade of life.
Intricate processes that underlie reading ultimately become automatic. Adult research clearly shows that letter position in a word has an effect on its readability (Grainger and Van Heuven, 2003). Grainger and Whitney (2004) wrote "Does the huamn mnid raed wrods as a wlohe?"; by summarizing research on this topic they explained that printed words are encoded in a special way, making reference to studies examining two phenomena: (a) relative-position priming and (b) transposition priming. Primes that either retain their position pattern (e.g., "mthr" prime for "mother") or have adjacent letters transposed (e.g., "mohter" prime for "mother") lead to the targets being processed faster. Although, letter position has been manipulated to study its effect on inhibition in adults using the Stroop (Regan, 1978;McCown and Arnoult, 1981), there are no reports of such effects in children and adolescence.
Here we investigated interference based on orthographic manipulations in the Stroop across development. Specifically, we examined (a) orthographic effects on interference elicited by the Stroop and (b) age effects on performance as they relate to the different orthographic manipulations. As letter position affects readability of a word, we anticipated that it would, in turn, affect interference experienced in the color-word Stroop, in children and adults. We included whole color-words, words that retained the position of the first and last letters and scrambled color-words in congruent and incongruent trials. We expected that words that retained the position of the first and last letters would elicit more interference than the scrambled words. In addition, we wanted to validate the parameters of our protocol (e.g., stimulus presentation intervals and manual response) to confirm that we could successfully detect the interference effects and in turn establish its suitability for neuroimaging methods.

PARTICIPANTS
We present data from 151 participants. Children were recruited from Toronto public schools, enrolled in mainstream classes, from grades 2 (7-8 years), 4 (9-10 years), 6 (11-12 years), 8 (13-14 years), and 10 (15-16 years), and adults (n = 23, ages 19-30 years) were recruited from the community ( Table 1). None of the participants had any history of neurological or psychiatric disorders. All school-aged participants were recruited from the classrooms and their teachers confirmed verbally that none of those included in this study had reading difficulties, dyslexia, or learning disabilities. All participants provided informed consent; for the children, this included consent from the child's parent. The Research Ethics Board at the Hospital for Sick Children approved all procedures.

MATERIALS AND METHOD
Four colors were chosen for this task. Criteria for color selection were based on the color-word length and how commonplace the color was. Orange, yellow, purple, and white were selected as they contained five or more letters, which allowed flexibility in manipulating the orthography and generating the stimuli. Also, we carefully selected the hues such that the colors were easily recognizable and distinguishable by the participants. Participants were first asked to read four color-words (orange, yellow, purple, and white) printed in black ink to verify proficiency in reading these words and to name the color of rectangular blocks to verify proficiency in identifying the colors. All participants were able to accurately read and name colors. We used a computerized, speeded manual response protocol. To familiarize participants with the timing of the task and location of the four color buttons on the keyboard they completed a 16trial training session. Training stimuli were presented for 1500 ms with an inter-stimulus interval of 500 ms. Participants responded successfully to training: 97% made two or fewer errors.
We used three word-type manipulations: (a) Word, (b) First/last letter in place, and (c) Scrambled (Figure 1). Task conditions consisted of color-words written in either congruent (e.g., yellow written in yellow ink) or incongruent (yellow written in purple ink) color. In the First/last condition, the first and last letters of the color word were kept in place while the middle letters were scrambled and the words were either congruent (e.g., ylloew written in yellow ink) or incongruent (e.g., yleolw written in purple ink) with ink color. The Scrambled condition consisted of scrambledcongruent (e.g., wlyloe written in yellow ink) and incongruent (e.g., wylleo written in purple ink) color-word pairings, which was added to account for the visual presentation of letters arranged in a non-word format. The Color-baseline condition consisted of a line "x"s printed in the same four colors. Stimuli were presented on a gray background. Care was taken to ensure each color appeared www.frontiersin.org with equal frequency across the conditions and that stimuli would not positively or negatively prime the subsequent stimulus, which was a key reason for using a four alternative force choice key press task. Stimuli were presented for 1350 ms with an inter-stimulus interval of 300 ms.
Each of the six conditions, plus Color-baseline, consisted of two blocks of 10 trials pseudo-randomly presented resulting in a total of 140 trials. Participants were instructed to respond to ink color of stimuli as quickly as possible while maintaining accuracy by pressing colored keys on a standard keyboard; we used colored stickers on the relevant keys to remove demands on memory. Using Presentation software (Neurobehavioral Systems), we recorded both accuracy and RTs.

DATA SCREENING AND ANALYSES
Prior to analyses, scores were examined through SPSS programs for accuracy of data entry, missing values, and the assumptions of univariate and multivariate analyses. Pairwise linearity was checked using scatterplots and found to be satisfactory.
Trials were coded as incorrect if the participant failed to respond or provided an incorrect response. The dependent variable was the average RT per item (in milliseconds). Individual RT trials were based on trimmed raw data (i.e., excluded if RT was less than 200 ms or greater than 3 SD from the mean). Eight participants [six in grade 2 (7-8 years, 4 females) and two in grade 4 (9-10 years, 2 males)] were found to be outliers and were not included in our sample or in analyses, as they performed at chance level (i.e., below 60% correct). Statistical tests were performed on data from 151 participants. Age effects were tested using multivariate analyses of variance, in which age was treated as a categorical variable. To test the orthographic effects of interference among conditions we conducted planned contrasts with Bonferroni multiple comparison control. Structural equation modeling (SEM) and correlational methods were conducted to examine the relation of age with interference in each condition; these analyses treated age as a continuous variable.

AGE EFFECTS
A MANOVA assessed RTs across age groups on a linear combination of performance in Color-baseline and incongruent and congruent trials for all three conditions (i.e., Word, First/Last, and Scrambled; Figure 2; Table 1). By forming linear combinations of dependent variables, this test identifies differences among the age groups. A significant effect was found, Wilk's Λ = 0.35, F (35, 587) = 4.78, p < 0.0001, multivariate η 2 = 0.19. Table 2 summarizes significant post hoc age group differences. Specifically, Color-baseline, Word-congruent trials and Scrambled-congruent trials showed the same developmental patterns. Most differences in RT were observed earlier in development; performance of grade 10 children did not differ from that of adults.

WORD-TYPE DIFFERENCES AMONG INCONGRUENT TRIALS
To determine RT differences among word-type conditions we performed a series of contrasts, collapsed across groups ( Table 1). Word-incongruent RTs and First/last-incongruent RTs were marginally different (t = 1.94, DF = 150, p = 0.054, partial η 2 = 0.03). Word-incongruent RTs and Scrambled-incongruent RTs yielded a significant difference (t = 6.63, DF = 150, p < 0.0001, partial η 2 = 0.24). This contrast yielded a large effect size, as did the contrast between First/last-incongruent RTs and Scrambled-incongruent RTs (t = 5.72, DF = 150, p < 0.0001, partial η 2 = 0.18). These results suggest that on average participants required significantly more time to complete incongruent Word and First/Last than Scrambled trials.

RELATIONS AMONG RT SCORES AND AGE
We examined the relations between the various scores and age ( Table 3). All scores remained significant even after controlling for the effects of age and RT to Color-baseline trials. Together these findings suggest that age and Color-baseline RT (i.e., responding to a stimulus that only included x's) do not fully account for the relations among the scores on the congruent and incongruent trials.
Thus, a path model was used to determine qualitative differences in performance among word-types ( Figure 3A). We hypothesized that shared variance between Word-incongruent and First/last-incongruent would load onto an incongruent factor. www.frontiersin.org Scrambled-incongruent, the three congruent sets of trials and Color-baseline were hypothesized to load significantly onto a congruent factor. Age was a directly linked to both factors and their error terms were correlated. Using maximum likelihood estimate this model yielded a good fit to the data, as shown by a nonsignificant chi-square value, χ 2 (18, N = 151) = 28.30, p = 0.059, root mean squared error of approximation (RMSEA) = 0.06, comparative fit index (CFI) = 0.99, and normed fit index (NFI) = 0.98. Standardized factor loadings for the indicator variables are presented in Figure 3A and were significant at p < 0.001.
An alternative model B was also tested to assess whether Wordcongruent was better positioned as a factor on its own as all age groups produced faster RTs on Word-congruent, suggesting that the condition might be qualitatively different (i.e., facilitating; congruent word would speed up the identification of the ink color). Model B depicted in Figure 3B posited a three factor model. This alternative model also yielded a very good fit to the data as the chisquare was non-significant, χ 2 (16, N = 151) = 22.30, p = 0.14; RMSEA = 0.05; CFI = 0.99, and NFI = 0.98. Positioning Wordcongruent as a facilitating construct appeared to improve the fit of the model. Therefore, a chi-square difference test was conducted, comparing model A with model B. The chi-square for this model was equal to 28.30 − 22.3 = 6.00 which, with a 2 DF was significant (p = 0.05). Interestingly, age was significantly linked to all three constructs; however, age accounted for the least amount of variance in the Word-congruent condition.

DISCUSSION
This study determined the extent to which orthographic manipulations influence interference control across development. We manipulated color-word orthography in a Stroop task and examined performances in ages 7-30 years. There were three main findings: (a) Age was a significant predictor for all factors, incongruent, congruent, and facilitating. A novel age-related finding was that unlike younger age groups, late adolescent's behavioral performance was adult-like, cautioning against averaging over age ranges including children and adolescents. (b) Performances on Word-incongruent and First/last-incongruent trials were qualitatively similar, suggesting that children, like adults, attempt to read pseudo-color-words with the first and last letter in place. This suggests that children detected the wrong spelling in color-words and their performance was delayed as they strived to recover from the incongruent ink color, similar to what they experienced with correctly spelled color-words. (c) Performance on Word-congruent was different from performances on First/last-congruent, Scrambled-congruent, Scrambled-incongruent, and Color-baseline, which were all qualitatively similar. This is in agreement of the hypothesis that Word-congruent is facilitating, which we showed to be facilitating for children as well.

AGE EFFECTS
We examined the effects of age on task performance in children and young adults. Children in grade 2 (7-8 years-olds), the youngest age group, were significantly slower than grade 8s (13-14 years) and older for Word-incongruent and grade 6s (11-12 years) and older for First/last-incongruent; suggesting a sharper decrease in response time for First/last-incongruent as a function of age ( Table 2). RT differences were not observed for children in grades 4 (9-10 years), 6, and 8 for Word-incongruent; however, children in grades 4 and 8 differed for First/last-incongruent.
Results of age group differences on the congruent trials revealed that Word-congruent and Scrambled-congruent forms echoed the developmental pattern found in the Color-baseline. We highlight Frontiers in Psychology | Developmental Psychology that adults and students in grade 10 (15-16 years) exhibited comparable response times. In the developmental literature reviewed, only one study reported normative data for late adolescence (ages 15-17; Leon-Carrion et al., 2004). Despite the lack of normative data, particularly during the adolescent years, some clinical studies average over large age ranges (e.g., Reeve and Schandler, 2001;White et al., 2001;Favre et al., 2009;Peterson et al., 2009). We showed that adolescents' performance was adult-like; thus, we recommend against averaging over large age groups of children, particularly when younger children are included in the same age group as late adolescents (e.g., 15-16 years). In the path analyses, age was positioned as a predictor for all constructs and these were found to be significant (Figure 3). Specifically, age accounted for slightly more variance in the Incongruent factor and for the least amount for variance for the Wordcongruent. Previous research on the Stroop documented that children became progressively faster as they responded verbally to stimuli (Comalli et al., 1962;Schiller, 1966;Berninger et al., 1991;Armengol, 2002;Leon-Carrion et al., 2004;Pritchard and Neumann, 2004;Peru et al., 2006;Charchat-Fichman and Oliveira, 2009;Polderman et al., 2009), and this was what we also found with speeded manual responses (Figure 2; Table 3). Adult studies suggest that greater interference is sometimes observed with vocal compared to manual responses (White, 1969;Redding and Gerjets, 1977;MacLeod, 1991). Although the response times we observed were much faster (i.e., under 1 s, Figure 2) than those requiring verbal response (Comalli et al., 1962;Schiller, 1966;Berninger et al., 1991;Armengol, 2002;Leon-Carrion et al., 2004;Pritchard and Neumann, 2004;Peru et al., 2006;Charchat-Fichman and Oliveira, 2009;Polderman et al., 2009), relations with age were strong; we showed that age shared approximately 33% of the variance with all conditions (Table 3). Inter-correlations with conditions were stronger, showing greater common variance ranging from 53 to 74%. We also accounted for the variance of age; however correlations remained significant among conditions, albeit the strength of the relations decreased ( Table 3, upper diagonal-top value). This suggests that age alone cannot account for the variance shared among conditions. As response times improve with age regardless of task, then response time to Color-Baseline condition could account for these relations. When the correlations controlled for responses to Color-baseline (i.e., controlled for the ubiquitous agerelated decreases in RTs), the strength of the relations decreased, but the outcome remained significant ( Table 3, upper diagonalbottom value). Significant partial correlations may be attributed to individual differences and related executive processing or working memory. Working memory, the ability to hold and manipulate information for a short time, improves with age. Particularly, research shows that working memory capacity is better assessed by measures that contain task-irrelevant features (Arsalidou et al., 2010), thus likely contributes to the performance changes we observed.
Overall, it appears that responses to the Stroop task, linked as it is to executive functions such as inhibition, continue to develop throughout middle-childhood and adolescence (Comalli et al., 1962;Williams et al., 1999;Bedard et al., 2002;Luna and Sweeney, 2004;Peru et al., 2006;Best et al., 2009). Although the traditional response modality in the Stroop is vocal, this poses limitations when applied with imaging technologies that are susceptible to movement artifacts. Sub-vocal responses used previously in developmental fMRI studies with children preclude assessment of task compliance or performance during scanning (Adleman et al., 2002;Marsh et al., 2006). Our data show that speeded manual responses accurately capture performance trajectories in children.

EFFECTS OF ORTHOGRAPHIC MANIPULATIONS
To assess the effects of orthography, we used three word-type conditions: whole color-words, color-words with first and last letters in place and scrambled color-words; all had both congruent (ink color consistent) and incongruent (ink color inconsistent) trials. For incongruent trials, RTs were affected by word-type, such that Word > First/last > Scrambled (Figure 2; Table 1); the largest effect size was observed when Word was compared to Scrambled, suggesting that the Scrambled-incongruent was the most different of the incongruent trials. For congruent trials, response times on the Scrambled and First/last-congruent were not significantly different; however these trials differed significantly from Wordcongruent, with moderate effect sizes. In agreement with previous results (MacLeod, 1991), this suggests that Word-congruent trials may be facilitating. Children, as adults, experienced the least interference for Word-congruent. The highest interference was experienced during Word-incongruent, although First/last-incongruent had very similar performance curves.
Path analyses showed that Word and First/last-incongruent trials were qualitatively different from the rest of the trials, and loaded onto the same Incongruent factor (Figure 3). This suggests that our participants, all experienced interference when the first and last letters retained the correct position in the colorword. As Stroop interference is produced by the conflict between the tendency to read the color-word and naming the ink color, these data suggest that children as young as seven were "reading" the pseudo-color-words with the first and last letter in the correct place. This may also suggest that children recognized that these words were spelled wrong, and in turn experienced similar incongruence effects observed with correctly spelled color-words; the Scrambled-incongruent condition did not elicit this effect. Research, primarily based on adults, showed that letter position has an effect on the readability of words (Grainger and Van Heuven, 2003;Grainger and Whitney, 2004). Adult Stroop studies demonstrated that retaining the first letter interferes more than retaining the middle or last two letters of color-words (Singer et al., 1975). A similar finding was observed by Regan (1978) who showed that the first letter of color word could cause interference. Even if the first letter of a non-color-word matches the color-word, interference is generated in adults (e.g., Marmurek et al., 2006). Although we have not come across a study that examined this effect in children, developmental studies that manipulated letter-position in reading tasks, emphasize primarily its relation to lexical stress in the process of learning (Bowman and Treiman, 2002;Perea and Estevez, 2008;Ktori and Pitchford, 2009;Arciuli et al., 2010). These findings were linked to the work of Ehri (1995) on the phases of reading development, which suggests that ultimately all words become automatic and are read through sight. In the case of the current experiment, if the children were familiar with the color-words and were not trying to read them, we would not observe interference either with www.frontiersin.org the whole word or the words with first/last letter in place. Our data suggest that at 7-years of age (grade 2) children were attempting to read using similar whole-word cues, and experienced Stroop incongruence effects as older children and adults, giving support to the sight word reading hypothesis (Ehri, 1995(Ehri, , 2005. The path analyses also showed that Word-congruent trials were qualitatively different from all the other trials. The model that accounted for Word-congruent as a separate entity ( Figure 3B) had a better fit to the data than the one that allowed for Wordcongruent to load onto the Congruent factor ( Figure 3A). This is consistent with the notion that response times are facilitated when the distractor color-word is the same as the ink color (MacLeod, 1991(MacLeod, , 2005 for review). Usually, Stroop facilitation scores are calculated by subtracting RTs to Color-baseline from congruent conditions (Regan, 1978). Adult studies occasionally report Stroop facilitation scores to represent this effect (e.g., Stirling, 1979); however, these data are scarce developmentally. In a study with a small sample size -9-13 years old children (n = 11) -a significant difference in Stroop facilitation was observed compared to adults, but not interference (Wright and Wanley, 2003). In a larger sample (11 year olds, n = 80; adults, n = 70) an effect of facilitation (comparing congruent vs. neutral condition) was only observed in children, not adults (Fagot et al., 2009). The only large developmental study that mentioned Stroop facilitation was by Charchat-Fichman and Oliveira (2009); however, they did not report facilitation scores in their sample. For completeness we report difference scores on facilitation ( Table 1). The youngest children experience the least facilitation and these scores appear more adult-like by about grade 6 ( Table 1).
Our findings are consistent with research that shows that children do not rely merely on rote memorization, but also rely on letter positions in reading (Bowman and Treiman, 2002;Peressotti et al., 2010). Adopting a multiple orthographicphonological approach of teaching children to read had been found to facilitate learning, particularly in the early years (Hart et al., 1997). Brain research shows that visual word recognition elicits activity in the left fusiform gyrus, which is particularly affected by orthographic structure (Binder et al., 2006), and assimilates features during recognition of visual stimuli (Allison et al., 1994;Starrfelt and Gerlach, 2007;Arsalidou and Taylor, 2011). Thus, as children become expert readers, the fusiform gyrus may become more efficient or specialized. Even a year or two of practicing reading elicits a predisposition to reading words as a whole, as the First/last effect was present in the youngest children tested.

CONCLUSION
Our primary finding indicates that children as young as seven can experience interference from words that only retain the position of first and last letters in color-words. This suggests that children process color-words as a whole, as is evident from the rate with which they can control irrelevant cues as they mature. Although performance trajectories were similar, and predicted by age, the underlying mechanisms for processing incongruent and congruent materials were qualitatively different. Characterizing congruency between color-word and ink color as facilitating generated a stronger model for predicting performance on this task and its relation with age. Our findings contribute to the understanding of the developmental relation among inhibition, interference control, orthography, and reading. The speeded, manual responses required in our protocol make it appropriate for use with neuroimaging technologies. Future work examining the brain correlates of orthographic manipulations will elucidate the brain mechanisms that underlie these relations over childhood.