Skip to main content


Front. Neurosci., 21 December 2012
Sec. Auditory Cognitive Neuroscience
Volume 3 - 2012 |

Effects of a school-based instrumental music program on verbal and visual memory in primary school children: a longitudinal study

  • 1Department of Music, Carl von Ossietzky University, Oldenburg, Germany
  • 2Department of Psychology, Goethe University, Frankfurt am Main, Germany

This study examined the effects of a school-based instrumental training program on the development of verbal and visual memory skills in primary school children. Participants either took part in a music program with weekly 45 min sessions of instrumental lessons in small groups at school, or they received extended natural science training. A third group of children did not receive additional training. Each child completed verbal and visual memory tests three times over a period of 18 months. Significant Group by Time interactions were found in the measures of verbal memory. Children in the music group showed greater improvements than children in the control groups after controlling for children’s socio-economic background, age, and IQ. No differences between groups were found in the visual memory tests. These findings are consistent with and extend previous research by suggesting that children receiving music training may benefit from improvements in their verbal memory skills.


Previous research has examined whether and how musical abilities might influence other cognitive functions in cross-sectional (e.g., Cheek and Smith, 1999; Butzlaff, 2000; Schellenberg, 2011) as well as in longitudinal studies (Gardiner et al., 1996; Hyde et al., 2009; Schellenberg, 2009; Rickard et al., 2010; Moreno et al., 2011b). In particular, researchers have debated the benefits of music training on verbal and visual memory (Chan et al., 1998; Costa-Giomi, 1999; Kilgour et al., 2000; Ho et al., 2003; Jakobson et al., 2003; Forgeard et al., 2008; Piro and Ortiz, 2009; Rickard et al., 2010; Moreno et al., 2011a). Some studies suggest that music training is beneficial for verbal memory, but less so for visual memory, while other studies have produced conflicting evidence (Gardiner et al., 1996; Costa-Giomi, 1999; Jakobson et al., 2003). The present study was designed to assess some of these issues further within an extended longitudinal framework. In particular, verbal and visual memory skills of three cohorts of primary school children, one of which received instrumental training, were assessed three times over one and a half school years.

Musicians and non-musicians differ with regard to specific aspects of their brain structures and functions (Schlaug et al., 1995, 2009; Münte et al., 2002; Gaser and Schlaug, 2003; Moreno et al., 2009). For example, music training has been presumed to affect the development of the planum temporale (PT), a structure within the secondary auditory cortex. Whereas the left PT primarily mediates verbal memory (e.g., Frisk and Milner, 1990), the right PT primarily serves visual memory functions (e.g., Saykin et al., 1992). Based on the assumption of an asymmetric enhancement of the PT in response to music training, Ho et al. (2003) compared 90 male children between the age of 6 and 15 years who were learning an instrument (for 1–5 years) with schoolmates who had not received music training. Groups were matched in terms of age, education level, socio-economic background, and intelligence. Children in the music group achieved significantly higher scores in verbal but not in visual memory tasks. The music group learned approximately 20% more words from a 16-word list as compared to the non-music group. Moreover, children in the music group showed better verbal retention abilities (10-min delayed, 30-min delayed) than children in the control group. No significant differences were observed in the visual memory tests scores, however. Duration of music training and verbal learning performances correlated positively in this study, even after controlling for age and education level.

In a follow-up longitudinal study 1 year later, Ho et al. (2003) compared those children from the same cohort, who had just begun or continued their music training (for 1 year) with those who discontinued music training (at least 9 months prior to data collection). Children in the beginner and advanced music training groups significantly increased their verbal learning and retention performances, whereas no such increase was observed in the group who discontinued their music training. Remarkably, the verbal memory performances in the discontinued group remained stable at least 9 months after terminating the training program. Taken together, these findings indicate long-term enhancing effects of music training on verbal memory performances. Moreover, they support the hypothesis that musical training improves verbal rather than visual memory functions.

Although in the investigations referred to so far, no effects on visual memory were found, other studies reported improved visual memory performance for individuals who received music training (Rauscher et al., 1995; Bilhartz et al., 1999; Costa-Giomi, 1999; Hetland, 2000; Brochard et al., 2004; Sluming et al., 2007; Stoesz et al., 2007; Jakobson et al., 2008).

One reason for these different findings in the literature could be differences in methodological details. For example, studies on visual memory differ in demographic variables such as the participant’s age as well as in the duration and level of music training (from children with 1–5 years of training, Ho et al., 2003, to highly trained adults, Brochard et al., 2004). There is also inconsistency with respect to visual performance tasks across investigations. Whereas reaction time measures were used in one study (Brochard et al., 2004), Brandler and Rammsayer (2003), Ho et al. (2003) as well as Chan et al. (1998) examined recall accuracy. Moreover, participants in the latter study could have benefited from prior expertise with regard to the processing and recognition of complex visual designs by using an ideographic language system for reading and writing. Therefore, the fact that Chinese participants in the studies of Ho et al. and Chan et al. are highly skilled in processing complex visual designs may have inhibited positive effects of formal music training on visual memory.

Whereas studies cited so far were mainly based on private instrumental music lessons, only a few investigators looked at school-based music programs (e.g., Gardiner et al., 1996; Rauscher and Zupan, 2000; Rickard et al., 2010). For example, Gardiner et al. (1996) showed that children with a lower score on literacy and mathematics tests at baseline achieved similar scores on reading tests after 1 year of visual arts and music training at school as compared to control groups. They even outperformed controls in terms of mathematic achievement. However, the impact of music education remains unclear as music and art training were confounded in this study. Moreover, a non-musical comparison group with a different intervention was missing, which leaves open the possibility of a Hawthorne Effect. In a longitudinal study, Rickard et al. (2010) investigated a total of 142 children from nine regional state primary schools using tests of verbal learning and immediate verbal recall. Out of the entire group, 82 children participated in an enhanced school-based music program; the remaining 66 children served as controls and received standard music education. Participants were tested three times within the first 2 years of the study. In the third year a subset of the control sample (N = 44) was tested again. Children received either juggling training (n = 20) or continued standard music classes (n = 24) for about 1 year. Despite the differences among the time-displaced juggling, the music training group and the control group in terms of age, sample sizes, and duration of the training program, results showed that verbal learning and immediate verbal recall scores significantly increased after 1 year of school-based music training, whereas no such increase was found in the control group and in the juggling group. Nevertheless, these advantages disappeared in the second year and no effect was measured for the delayed verbal memory test at each time point. Moreover, no effects were observed in the visual recall test. However, results show a significantly enhanced visual perceptual score in the second year of the study for the music training group. In sum, mounting evidence suggests transfer effects from music training to non-musical cognitive benefits like verbal memory abilities (Strait and Kraus, 2011).

The present study examined the effects of a school-based music program on verbal and visual memory of primary school children over a period of 18 months. We hypothesized that children who participated in a school-based music program would significantly improve their verbal memory abilities (Verbal Learning, Verbal Delayed Recall, and Verbal Recognition values) over time as compared to their peers who received training in an unrelated area (natural sciences) or received curriculum-based music lessons only. In light of the conflicting results in previous work, we expected no differential effects on visual memory tasks between treatment and control groups.

Materials and Methods

Schools and Participants

The sizes of groups of participants needed was calculated with the software program G*Power (Faul et al., 2007). According to this program, a total sample size of 54 participants was sufficient to obtain a medium effect size of Cohen’s f = 0.25 (Cohen, 1988) as a result from a repeated measures analysis of variance [ANOVA; within-between interactions; α-level: 0.05, Power (1 − β): 0.95, correlations among repeated measurements: 0.50]. However, a total of 73 children (mean age = 7.73 years; SD = 0.84, 37 male, 36 female) participated in this study. They were recruited from seven primary schools located in different parts of Germany. The 25 children (11 male, 14 female) in the music group received instrumental instruction through a special program established in primary schools in Germany. Children in this group were randomly chosen from six different classes out of three primary schools. Natural science group children (14 male, 11 female) were recruited from six different classes out of two primary schools that emphasized natural science skills. Finally, a total of 23 children drawn from another four classes in two primary schools with no additional training, served as controls.

Participants provided demographic information and were evaluated in terms of IQ, socio-economic background, and musical background, to determine any systematic differences in these variables prior to the intervention at group level (see Table 1).


Table 1. Means (and SD) and test statistics of measures of the Musical Training (MT), Natural Science Training (NST), and No Training (CG) Groups at baseline.

The study was approved by the respective institutional review boards of the universities of Frankfurt and Oldenburg, Germany, and met ethical requirements for recruiting participants. Additional written informed consent for participation in the current study was obtained from school administrations, parents, and children. All schools participating in the music program needed to belong to a specific catchment area previously marked by the program administering foundation1. Furthermore, teachers from primary schools and music colleges were trained to administer pre-defined program standards. The allocation of children to the intervention groups (music and natural science) was not random. However, the selection of participants in the music group from several different cohorts ensured a wide distribution of socio-economic background variables and a reduction of systematic influences at school and class levels. Children in the control group participated in a natural science training program with the objective of promotion mathematics and general studies in primary schools. Hence math and general studies lessons at school were extended by the implementation of new educational standards consistently evaluated by a research program of the Leibnitz Institute for Science and Mathematics Education (IPN) in Kiel, Germany.


The music group children received weekly lessons for 45 min on musical instruments of their choice (guitar, violin, cello, flute, trumpet, clarinet, and drums). Depending on available resources, lessons were organized in different group sizes with a maximum of five. Thus the specific features of the instructional setting were subject to variation between schools. Nevertheless, the music program was entirely delivered by professionally trained instrumental teachers from public music schools. Standard elements of this training included singing, rhythm (clapping and percussion), and pitch identification exercises from first grade on. The instrumental instruction started from the second and was continued to the end of the fourth grade. Individual practice time was collected from the children, but due to high percentages of missing values as well as inconsistencies in the children’s self-reports, we refrained from using this information in the analyses. Nevertheless, parental responses indicated a certain amount of practice at home of each of the participating children, although the intensity of practice may have varied with respect to the specific domain.

Children in the natural science group received enhanced education in mathematics and general studies not confined to extra lessons but related to the school curriculum as a whole. Children in the second control group did not obtain additional training at school nor private instrumental lessons. Children in the natural science and in the no training groups still received curriculum-based music lessons at school. Assessments of additional extra-curricular school activities showed no significant differences among the three groups. Some uncertainty remains with respect to musical experiences of the children prior to the study. According to information provided by parents of the music group children, 12 children had participated in early music education programs. No such information was obtained from parents of children in the other groups.


The German adaptation of Cattell’s Culture Free Intelligence Test (Cattell, 1961) by Weiß (2006, CFT-20R) was administered at T1 to measure fluid intelligence using its four subtests: series, classifications, matrices, and typologies. According to Weiß (2006) these subtests correlate highly with the “g”-factor of intelligence (r = 0.78 to 0.83). Results were assessed by raw means for each of the four subtests and standardized IQ scores adapted for age (M = 100, SD = 15). Socio-economic background was determined by standardized variables related to parental education, income, and the number of books in their homes. The latter is considered as a strong indicator for cultural practice (Bos et al., 2005; see Table 1). This information was acquired through questionnaires and telephone interviews with the parents. Musical background was assessed particularly with respect to extra-curricular activities including instrumental training in the control group just to make sure that no participants in the control groups (natural science training and no training) received any instrumental music training during the period of the study.

Verbal memory

To assess verbal memory performance, participants completed the German adaptation of Rey’s Auditory Verbal Learning Test (RVLMT; Rey, 1941) by Helmstaedter et al. (2001, verbal memory tests; VLMT). The VLMT was applied to assess Verbal Learning, Verbal Delayed Recall, and Verbal Recognition abilities. For each of five trials, participants were asked to listen and recall a list of 15 different words directly after hearing (Verbal Learning), followed by a 25-min delayed recall without read out (Verbal Delayed Recall). Finally, participants were requested to identify those 15 previously presented words out of a list of 30 words (Verbal Recognition). Retest-reliabilities reported in the manual range from rtt = 0.81 (Verbal Recognition), rtt = 0.82 (Verbal Learning) to rtt = 0.87 (Verbal Delayed Recall).

Visual memory

The standardized and computerized Corsi Block Test and Matrix Span Test by Hasselhorn et al. (2012) were used to access visual scribe and visual cache memory (for an overview to visual scribe and cache memory, see Logie et al., 1990). Both tests only minimally, if at all, evoke auditory recoding processes. Retest-reliabilities for both subtests ranged from rtt = 0.66 for 5- to 8-year-old children to rtt = 0.67 for the 9- to 12-year-old children.

The starting sequence of each subtest depended on the age of the subject. Eight-year-old children received a longer starting sequence span than the 7-year-old children. Both subtests based on an adaptive procedure with increasing (for a correct reproduction task) and decreasing (for an incorrect reproduction task) span tasks. Within these subtests the procedure was repeated until the last of the 10 sequences was finished.

In the Corsi Block Test children were asked to memorize and reproduce the path of a “smiley face” that moves randomly through an array of nine squares depicted on the computer screen. If the reproduction was correct for two trials, the movement of the “smiley” was increased by one step (to a maximum of nine) and the path between the “smiley” movements from square to square would get longer than before. If the recall failed for two trails the movement was decreased by one (to a minimum of two).

The Matrix Span Test displayed a pattern on a small chessboard with 16 fields, which disappeared after 4 s. Afterward children were asked to tag the remembered pattern on the touch screen. If they failed for two trials, the number of fields in the pattern was decreased by one (up to a minimum of two fields). Otherwise it was increased by one until the maximum of nine fields was reached.


Participants were tested individually in their classrooms in the course of one and a half hours. Assistants who were blind to the type of training (music, natural science, or no training) ran the data collection. The VLMT and the visual memory tests (Corsi Block Test and Matrix Span Test) were included in a larger research procedure encompassing an IQ test as well as standardized and demographic questionnaires. The IQ test and the standardized questionnaires had been conducted a day prior to the verbal and visual memory tests. The order of the verbal and visual memory tests was as follows: first, the Verbal Learning abilities were assessed. During the required break of at least 25 min after the Verbal Learning subtest, both tests for the visual memory abilities were administered (first the Corsi Block Test followed by the Matrix Span Test). Finally, children received the Verbal Delayed Recall and the Verbal Recognition tests. The order of all tests was the same for each child at all three points of measurement in order to avoid novelty effects.

Baseline measurements were conducted at the beginning of the school year (T1; October, 2009); the second measurement was taken at the beginning of the following school year (T2; October, 2010). The final measurement was completed at the end of the same school year (T3; July, 2011).


Table 1 shows the means, standard deviations, and test statistics for age, education, CFT-IQ scores, and socio-economic status for groups at the beginning of the study. Mean age and CFT-IQs differed significantly between groups at the beginning of the study. Children in the musical group were younger [95% CI (7.09, 7.55)] than children in the control group [95% CI (7.79, 8.65)]. They also earned higher CFT-IQ scores [95% CI (109, 120)] than children in the natural science group [95% CI (96.69, 107)]. There was no significant difference for IQ between music and control group children as well as between natural science and control group children. All groups were similar at the beginning of the study in terms of fathers’ and mothers’ educational level, parental income, and cultural practice. Gender was well balanced across groups (χ2 = 0.75; p > 0.69). Varying sample sizes are subject to missing data.

Repeated measures ANOVA were performed for the dependent measures of the verbal memory (Verbal Learning, Verbal Delayed Recall, and Verbal Recognition Tests) and the visual memory tasks (Corsi Block Test and Matrix Span Test). The quasi-experimental design was a mixed model, with Group as the between-subjects factor and Time as the within-subjects factor. Because of significant differences observed in the basic demographic variables (see Table 1), age and IQ were included as covariates within this model. Preconditions for conducting ANCOVAs were tested (normality and Mauchly’s test of sphericity) and were met in all cases except the Mauchly’s test, where the sphericity was significant for the verbal learning test and the verbal recognition test. Therefore, in the respective analyses the Greenhouse–Geisser correction for p-values and degrees of freedom was adjusted to calculate F-values. Mean values and standard deviations for all verbal and visual memory subtests were reported in Table 2.


Table 2. Means (and SD) of verbal and visual memory assessment data for music, natural science, and control groups at three time points (T1–T3).

Figure 1 reveals that across one and a half years, children in the music group showed a greater increase on every measure of the verbal memory than the natural science and the control group. Repeated measures ANCOVA revealed significant Time by Group effects for Verbal Learning [F(3.63, 123.4) = 5.19, p = 0.001, ηp2=0.13], Verbal Delayed Recall [F(4, 136) = 3.17, p = 0.016, ηp2=0.09] and for Verbal Recognition [F(3.05, 103.56) = 2.72, p = 0.048, ηp2=0.07] test scores. However, no main effects on time were found for any verbal memory measures (all Fs ≤ 1.95, p ≥ 0.15).


Figure 1. Mean performance of the Verbal Learning (A), the Verbal Delayed Recall (B), and the Verbal Recognition Test (C) for experimental and control groups at baseline (T1), after 1 year (T2) and after one and a half years (T3) of music, natural science, or non-additional training (N = 73). Error flags indicate Standard Errors of Means (SEM).

Post hoc analyses with pairwise comparisons at each time point of measurement showed that whereas groups did not differ at T1 [F(2, 72) = 1.48, p = 0.24, n.s.] the music group [95% CI (46.64, 53.77)] scored higher than the natural science group [95% CI (40.28, 47.41)] in the Verbal Learning Test at T2 [F(2, 72) = 3.62, p = 0.032, ηp2=0.09]. Furthermore, the music group [95% CI (57.27, 63.53)] outperformed natural science group [95% CI (43.99, 50.25)] and control group [95% CI (51.52, 58.04)] at T3 [F(2, 72) = 18.03, p < 0.001, ηp2=0.34]. Similarly, the music group [95% CI (12.05, 13.55)] showed superior performance in Verbal Delayed Recall at T3 [F(2, 72) = 8.91, p < 0.001, ηp2=0.20] compared to the natural science group [95% CI (9.85, 11.35)] and the control group [95% CI (10.57, 12.13)]. Finally, the music group [95% CI (13.76, 14.88)] scored higher than the natural science group [95% CI (12.20, 13.32)] and the control group [95% CI (12.33, 13.49)] in the Verbal Recognition Test at T3 [F(2, 72) = 9.43, p < 0.001, ηp2=0.21]. The observed effects in the music group were large: effect sizes of the improvement for all verbal memory measures at T3 were large (for an overview of effect size benchmarks, see Cohen, 1988).

Subsequent comparison of means for within subject effects with adjusted alpha level (p = 0.05/2 = 0.025) showed that the music group significantly improved their performances in the Verbal Learning [t(24) = 5.71, p < 0.001, d = 1.16], Verbal Delayed Recall [t(24) = 5.66, p < 0.001, d = 1.27], and Verbal Recognition [t(24) = 3.09, p = 0.005, d = 0.67] test scores from T1 to T2, and from T2 to T3 [Verbal Learning t(24) = 6.43, p < 0.001, d = 1.35; Verbal Delayed Recall t(24) = 4.92, p < 0.001, d = 1.06, and Verbal Recognition t(24) = 3.01, p = 0.006, d = 0.65]. In contrast, participants in the natural science group only improved their verbal memory performances in the Verbal Delayed Recall score [t(24) = 2.46, p = 0.021, d = 0.49] from T1 to T2. The control group showed significant increases for the Verbal Learning Score from T1 to T2 [t(22) = 5.14, p < 0.001, d = 1.07] and from T2 to T3 [t(22) = 4.22, p < 0.001, d = 0.88] notwithstanding that the increases were smaller than in the music group. Finally, the control group significantly improved their performances in the Verbal Delayed Recall score only from T2 to T3 [t(22) = 3.33, p < 0.05, d = 0.70].

For the visual memory tests, no interaction or main effects were found across the study (Corsi Block Test: all Fs ≤ 2.74; p ≥ 0.07; Matrix Span Test: all Fs ≤ 1.26; p ≥ 0.29). The respective values for these memory tests are displayed in Figure 2.


Figure 2. Mean scores of the Corsi Block Test (A) and the Matrix Span Test (B) for the experimental and control groups (N = 73) at baseline (T1), after 1 year (T2) and after one and a half years (T3) of music, natural science, or non-additional training. Error flags indicate Standard Errors of Means (SEM).


This study was designed to investigate the effects of a school-based music training program on the development of verbal and visual memory skills in primary school children. We hypothesized that over a period of one and a half years, musically trained children would significantly outperform both a comparison group of children obtaining enhanced natural science training and a control group without specific training, with respect to measures of verbal memory performance. Furthermore, we did not expect to find any differences between groups in their performances concerning visual memory abilities. Both of these hypotheses were confirmed according to the present results.

First of all, although performance levels in memory tests were similar at baseline, children receiving music instruction scored significantly higher in tests addressing verbal memory as compared to controls, after 18 months of instrumental training. Specifically, music-trained children showed significant increases in the Verbal Learning Test score over the whole testing period, whereas no such improvement was observed in the natural science and the non-training groups. These patterns of changes prevailed after adjustment of the model to account for influences of individual IQ and age. In addition, the music training group scored significantly higher in the Verbal Delayed Recall Test and in the Verbal Recognition Test after 18 months and thus outperformed the natural science and the non-training group. These results suggest that musically trained children developed more efficient short-term (verbal learning skills) and long-term (verbal delayed recall and verbal recognition skills) memory strategies as compared to their peers in the two control groups. The large effect sizes observed in the music group for all verbal memory measures at the last time point (T3) strengthen our interpretation. Moreover, musically trained children continuously improved their performances over time (from T1 to T2 and from T2 to T3) in each of the three verbal memory measures, whereas such an increase was found neither in the natural science nor in the non-training group.

The findings obtained here are consistent with previous quasi-experimental research (Chan et al., 1998; Kilgour et al., 2000; Ho et al., 2003; Jakobson et al., 2008) which showed that musically trained children significantly improved their verbal memory abilities as compared to control groups. The current study extends these findings by showing the same effect on verbal memory performances during a school-based music program within a longitudinal design. It is noteworthy that effect sizes for the three verbal memory tests in the present study were higher than expected from prior research (e.g., Ho et al., 2003; Rickard et al., 2010). Hence the large effect sizes strengthen our confidence in the relationship between instrumental music training and verbal memory abilities examined in our study.

Differences in Culture, Research Methods, and Musical Experiences

Jakobson et al. (2008) suggested that the lack of effects in visual memory performance as observed by Chan et al. (1998) and Ho et al. (2003) could be explained by cultural differences. Both investigators worked with native Asian participants, who are trained in an ideographic reading and writing system. This training was found to be associated with improved memory for abstract designs but not for words (e.g., Flaherty, 2000). Therefore it was suggested that formal music training may have provided no additional benefit for visual memory performances in these groups. However, such cultural influences appear unlikely with respect to the present findings, because our participants were trained in an alphabetical rather than ideographic language system. The present results are in line with Rickard et al. (2010), who found no significant improvement in simple visual memory abilities (visual recall) for music-trained children in another Western (Australian) sample of participants.

Without denying possible cultural influences on certain domains of cognitive performance, we suggest that the discrepancy in the results reported for the relationship between formal music training and verbal memory on the one hand, and between formal music training and visual memory on the other, lie in individual music training histories as well as in the different performance measures for visual memory abilities. For example, Brandler and Rammsayer (2003), Chan et al. (1998), Jakobson et al. (2008), and Kilgour et al. (2000) examined highly trained adults (from 10 to 14 years of formal music training) whereas Ho et al. (2003), Brochard et al. (2004), and Rickard et al. (2010) investigated children with relatively restricted music training experiences (to a maximum of 5 years). Moreover, visual memory in prior research had been assessed by a variety of measures including reaction time (Brochard et al., 2004), accuracy of recall (Chan et al., 1998; Ho et al., 2003; Rickard et al., 2010), topographical memory skills (Brandler and Rammsayer, 2003), visual perception ability (Brochard et al., 2004; Rickard et al., 2010), and visual-auditory memory (Moreno et al., 2011b).

Private Music Lessons vs. School-Based Music Training

Another factor, which could explain diverging findings between the current results and previous investigations for visual memory abilities of musically trained individuals (e.g., Jakobson et al., 2008), might be the nature of a school-based music training program, and the associated instructional setting. Typically, group-based training appears to place stronger emphasis on listening skills, because all members of the group need to monitor and respond to acoustic information from different sources (teacher, fellow students). By contrast, fewer demands should arise with respect to score reading, which must accommodate the skills of the weaker score readers. Therefore, specific advantages for visual cognition might only occur in more intensive private music lessons (see Rickard et al., 2010).

Importantly, the implementation of an alternative school-based natural science training program as a comparison group should have accounted for novelty effects in the target group (controlling for paying more attention and additional training to the treatment group). Hence, a school-based music training program suggests similar effects on verbal memory abilities as has been shown for private instrumental instruction in previous work.

Neurocognitive Mechanisms of Verbal Memory Effects

The most likely interpretation of our findings is that musically trained children develop particularly efficient memory strategies for materials (here: words), which are represented as auditory codes. Playing music requires continued monitoring of meaningful chunks of information. Rather than individual notes, these chunks entail clusters of notes that are combined into meaningful melodic gestures and phrases under the influence of Gestalt principles as well as syntactic rules that are common to Western tonal music (Lerdahl and Jackendoff, 1983). Therefore, the basic units on which cognitive processes operate in the case of musical materials are “word-like” clusters of notes, which comprise melodies that may extend well beyond the limits of working memory. Although musical motifs do not entail denotative meaning as do words (Koelsch et al., 2004), they do have a quasi-syllabic structure, and, notably, they are based on a temporal framework that entails metric structures (weak and strong beats in music are paralleled by stresses on syllables in language; see Patel and Daniele, 2003). Such timing properties are thought to be important in the representation of materials stored in working memory (Jakobson et al., 2003). Neuropsychological studies suggest a correlation between variables related to the years of music training, the age of onset of music training and the amount of practice on the one hand, and the degree of structural or functional enhancement in specific brain areas, on the other (Pantev et al., 2001; Bengtsson et al., 2005). For example, the studies by Chan et al. (1998) were based on the hypothesis that music training affects the neuroplasticity in auditory regions of the temporal cortex, especially in the left PT (Schlaug et al., 1995), which is still consistent with enhanced auditory temporal processing ability (Jakobson et al., 2003). Williamson et al. (2010) maintained that short-term memory for tones has several similarities to verbal memory. The authors suggest that verbal and musical sounds are processed in the auditory short-term memory. In the papers by Kraus and Chandrasekaran (2010) and Strait and Kraus (2011) the authors argued that music is a powerful tool for modeling neural structures and functions via general auditory processing (including the processing of speech). Hence it might be possible that musicians would also gain an advantage in their verbal memory performances.

However, further research will be necessary to clarify the conditions under which music training might affect verbal and visual memory in music and non-music groups.

Instrumental Training and Child Literacy

Previous research showed that music training might promote cognitive mechanisms that underlie child literacy and language development (Besson et al., 2011; Moreno et al., 2011a; Strait and Kraus, 2011) as well as reading abilities (Anvari et al., 2002; Moreno et al., 2011b). Such mechanisms include auditory cognitive skills such as verbal or auditory working memory (Tierney et al., 2008; Kraus and Chandrasekaran, 2010; Williamson et al., 2010; Roden et al., in press). Supporting the interpretation of common processes in music and speech, Bidelman et al. (2009) investigated the influence of domain-specific experience of music and language on the encoding of music (musical intervals) and on speech (lexical tones) pitch processing. Brainstem frequency-following responses in native Chinese, English musicians, and English non-musicians were recorded while the participants listened to homologs of musical intervals and lexical tones. Results showed that pitch experience in music or language could be transferred from one domain to another. Compared to the group of English non-musicians, both Chinese and English musicians showed a higher pitch tracking accuracy and pitch strength. However, some interesting difference was found between the Chinese and English musicians: the Chinese group more focused on the rapid changes of pitch that were similar in Mandarin Chinese language, whereas English musicians were more sensitive to the parts of the stimuli similar to the notes of musical scale. Hence, the authors argue that pitch encoding in the auditory brainstem is domain-general for music and language but different in pitch extraction mechanisms. Further studies presented more evidence for a relationship between musical abilities and phonological processing (Anvari et al., 2002; Moreno et al., 2009; Dege and Schwarzer, 2011). Along those lines we found that music-trained children showed greater improvements in tasks addressing the phonological loop (word span test and non-word recall test) than non-music children over a period of one and a half year (Roden et al., in press).

In sum, prior results argue for a positive transfer effect from musical expertise onto speech and language processing. The three reported measures of the auditory verbal memory test used in the present study extend the above findings by showing that children receiving music lessons demonstrated larger improvements of verbal memory skills than the control group children, even after IQ was controlled. Our results provide additional converging support to recent findings due to the fact that children in the music group did not differ on average from the control groups on any verbal memory measure at the beginning of our study (T1), and that there was no significant difference observed in the visual memory tests at any time point (T1, T2, and T3). They further extend the previous finding of Tierney et al. (2008), showing that musical experience may affect short-term verbal memory abilities (immediate verbal rehearsal processes) as well as long-term verbal memory skills (delayed verbal recall and verbal recognition) in children.

Limitations and Conclusion

Considering the range from medium to high effect sizes of all interactions for the verbal memory tests used in the present study, these results suggest some validation of a transfer effect from music to more specific cognitive domains. Nevertheless this interpretation must be treated with caution. The quasi-experimental design of this study suggests that participant groups were self-selected rather than constituted by chance. Experimental studies based on completely randomized trails should provide more persuasive evidence for the benefits of music training. However, the inclusion of two comparison groups and three time points of measurements may have diluted some of the self-selection bias. Although the same tests were used for each child at all three points of measurement, it is unlikely that practice effects would account for our results. In particular, no main effects on time were found for any verbal or visual memory measures. Therefore, the interaction effects suggesting benefits for the music group in the verbal memory measures could be attributed to effects of the intervention rather than of practicing.

There are some further limitations to be considered in the present study. Firstly, the size of the group of children receiving music training was limited to five, whereas the natural science training involved all children in one school-class. However, at T2 and T3 the group size for music training increased by teaching the third and fourth graders together. Secondly, the individual practice time for children in the music group showed at home is uncertain due to high percentages of missing values as well as inconsistencies in the children’s self-reports. In addition, the available data concerning the participation in teaching and learning activities at home was insufficient to clarify any bias on our reported results across groups. Nevertheless, it is unlikely that such activities would only affect the verbal memory and not the visual memory skills. Similarly, it is possible that the general learning activity in the three groups differed systematically, thus providing a bias that has been undetected so far. Nevertheless, we observed no differences at baseline in the verbal and visual memory tests between groups, which indicate, that the bias of individual practice time or learning activities could not be responsible for the reported development in the verbal memory measures. Finally, to approximate exposure to instruction outside of the classroom, we checked the extra-curricular activities of each of the children in the three groups and we did not find any differences among the groups.

Taken together, school-based instrumental training was shown here to affect memory for verbal rather than visual materials. The study corroborates and extends notions of transfer effects of cognitive skills from the music to the linguistic domain. These findings are consistent with previous experimental and quasi-experimental work (Chan et al., 1998; Ho et al., 2003; Rickard et al., 2010). To our knowledge, the present data extend these findings in that they are the first to show that components of the verbal memory (verbal learning, verbal delayed recall and verbal recognition) in primary school children increases over a time period of 18 months in response to a school-based music training program. Although the mechanisms that drive these effects remain unclear, the findings reinforce the more general notions of beneficial effects of music education for children at primary schools, and, possibly, pre-schools in terms of their cognitive development, in general and language acquisition in particular.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


This study is supported by a grant from the German Federal Ministry of Education and Research, which was awarded to the second and third authors (No. 01KJ0807). The founding source had no involvement on the study design, the collection, analysis and interpretation of data, the writing of the report, or the decision to submit the article for publication. We thank all our student assistance involved in the study for their support during data acquisition and processing. Particularly we like to thank all children, parents, teachers, and administrators for their support. Furthermore, we want to thank Andrea Halpern and Janani Dhinakaran for proofreading.


  1. ^The musical training program is guided by a foundation called “Jedem Kind ein Instrument” (JeKi), [An Instrument for Every Child]. For further information, please visit


Anvari, S. H., Trainor, L. J., Woodside, J., and Levy, B. A. (2002). Relations among musical skills, phonological processing, and early reading ability in preschool children. J. Exp. Child. Psychol. 83, 111–130.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Bengtsson, S. L., Nagy, Z., Skare, S., Forsman, L., Forssberg, H., and Ullén, F. (2005). Extensive piano practicing has regionally specific effects on white matter development. Nat. Neurosci. 8, 1148–1150.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Besson, M., Chobert, J., and Marie, C. (2011). Transfer of training between music and speech: common processing, attention, and memory. Front. Psychol. 2:94. doi:10.3389/fpsyg.2011.00094

CrossRef Full Text

Bidelman, G. M., Gandour, J. T., and Krishnan, A. (2009). Cross-domain effects of music and language experience on the representation of pitch in the human auditory brainstem. J. Cogn. Neurosci. 29, 13165–13171.

Bilhartz, T. D., Bruhn, R. A., and Olson, J. E. (1999). The effect of early music training on child cognitive development. J. Appl. Dev. Psychol. 20, 615–636.

CrossRef Full Text

Bos, W., Lankes, E.-M., Prenzel, M., Schwippert, K., Valtin, R., Voss, A., et al. (2005). IGLU: Skalenhandbuch zur Dokumentation der Erhebungsinstrumente [Progress in International Reading Literacy Study]. Münster: Waxmann.

Brandler, S., and Rammsayer, T. H. (2003). Differences in mental abilities between musicians and non-musicians. Psychol. Music 31, 123–138.

CrossRef Full Text

Brochard, R., Dufour, A., and Despres, O. (2004). Effect of musical expertise on visuospatial abilities: evidence from reaction times and mental imagery. Brain Cogn. 54, 103–109.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Butzlaff, R. (2000). Can music be used to teach reading? J. Aesthet. Educ. 34, 167–178.

CrossRef Full Text

Cattell, R. B. (1961). Culture Free Intelligence Test, Scale 3. Champaign, IL: Institute for Personality and Ability Testing.

Chan, A. S., Ho, Y. C., and Cheung, M. C. (1998). Music training improves verbal memory. Nature 396, 128–128.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Cheek, J. M., and Smith, L. R. (1999). Music training and mathematics achievement. Adolescence 34, 759–761.

Pubmed Abstract | Pubmed Full Text

Cohen, J. (1988). Statistical Power Analysis for the Behavioral Science, 2nd Edn. Hillsdale, NJ: Lawrence Erlbaum.

Costa-Giomi, E. (1999). Effects of three years of piano instruction on children’s cognitive development. J. Res. Music Educ. 47, 198–212.

CrossRef Full Text

Dege, F., and Schwarzer, G. (2011). The effect of a music program on phonological awareness in preschoolers. Front. Psychol. 2:124. doi:10.3389/fpsyg.2011.00124

CrossRef Full Text

Faul, F., Erdfelder, E., Lang, A.-G., and Buchner, A. (2007). G*Power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods 39, 175–191.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Flaherty, M. (2000). Memory in the deaf: a cross-cultural study in English and Japanese. Am. Ann. Deaf 145, 237–244.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Forgeard, M., Winner, E., Norton, A., and Schlaug, G. (2008). Practicing a musical instrument in childhood is associated with enhanced verbal ability and nonverbal reasoning. PLoS ONE 3:e3566. doi:10.1371/journal.pone.0003566

CrossRef Full Text

Frisk, V., and Milner, B. (1990). The roll of the left hippocampal region in the acquisition and retention of story content. Neuropsychologia 28, 349–359.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gardiner, M. F., Fox, A., Knowles, F., and Jeffrey, D. (1996). Learning improved by arts training. Nature 381, 284–284.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gaser, C., and Schlaug, G. (2003). Brain structures differ between musicians and non-musicians. J. Neurosci. 23, 9240–9245.

Pubmed Abstract | Pubmed Full Text

Hasselhorn, M., Schumann-Hengsteler, R., Gronauer, J., Grube, D., Mähler, C., Schmid, I., et al. (2012). Arbeitsgedächtnistestbatterie für Kinder im Alter von 5-12 Jahren (AGTB 5-12) [Working Memory Test Battery for Children from 5-12 Years of Age (AGTB 5-12)]. Göttingen: Hogrefe.

Helmstaedter, C., Lendt, M., and Lux, S. (2001). Verbaler Lern und Merkfähigkeitstest (VLMT) [Auditory Verbal Learning Test]. Göttingen: Hogrefe.

Hetland, L. (2000). Listening to music enhances spatial-temporal reasoning: evidence for the “Mozart Effect.” J. Aesthet. Educ. 34, 105–148.

CrossRef Full Text

Ho, Y. C., Cheung, M. C., and Chan, A. S. (2003). Music training improves verbal but not visual memory: cross-sectional and longitudinal explorations in children. Neuropsychology 17, 439–450.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Hyde, K. L., Lerch, J., Norton, A., Forgeard, M., Winner, E., Evans, A. C., et al. (2009). The effects of musical training on structural brain development a longitudinal study. Ann. N. Y. Acad. Sci. 1169, 182–186.

Pubmed Abstract | Pubmed Full Text

Jakobson, L. S., Cuddy, L. L., and Kilgour, A. R. (2003). Time tagging: a key to musicians’ superior memory. Music Percept. 20, 307–313.

CrossRef Full Text

Jakobson, L. S., Lewycky, S. T., Kilgour, A. R., and Stoesz, B. M. (2008). Memory for verbal and visual material in highly trained musicians. Music Percept. 26, 41–55.

CrossRef Full Text

Kilgour, A. R., Jakobson, L. S., and Cuddy, L. L. (2000). Music training and rate of presentation as mediators of text and song recall. Mem. Cognit. 28, 700–710.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Koelsch, S., Kasper, E., Sammler, D., Schulze, K., Gunter, T., and Friederici, A. D. (2004). Music, language and meaning: brain signatures of semantic processing. Nat. Neurosci. 7, 302–307.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kraus, N., and Chandrasekaran, B. (2010). Music training for the development of auditory skills. Nat. Rev. Neurosci. 11, 599–605.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Lerdahl, F., and Jackendoff, R. (1983). A Generative Theory of Tonal Music. Cambridge, MA: MIT Press.

Logie, R. H., Zucco, G. M., and Baddeley, A. D. (1990). Interferences with visual short-term memory. Acta Psychol. (Amst.) 75, 55–74.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Moreno, S., Bialystok, E., Barac, R., Schellenberg, E. G., Cepeda, N. J., and Chau, T. (2011a). Short-term music training enhances verbal intelligence and executive function. Psychol. Sci. 22, 1425–1433.

CrossRef Full Text

Moreno, S., Friesen, D., and Bialystok, E. (2011b). Effect of music training on promoting preliteracy skills: preliminary causal evidence. Music Percept. 29, 165–172.

CrossRef Full Text

Moreno, S., Marques, C., Santos, A., Santos, M., Castro, S. L., and Besson, M. (2009). Musical training influences linguistic abilities in 8-year-old children: more evidence for brain plasticity. Cereb. Cortex 19, 712–723.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Münte, T. F., Altenmuller, E., and Jancke, L. (2002). The musician’s brain as a model of neuroplasticity. Nat. Rev. Neurosci. 3, 473–478.

Pubmed Abstract | Pubmed Full Text

Pantev, C., Engelien, A., Candia, V., and Elbert, T. (2001). Representational cortex in musicians: plastic alterations in response to musical practice. Ann. N. Y. Acad. Sci. 930, 300–314.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Patel, A. D., and Daniele, J. R. (2003). An empirical comparison of rhythm in language and music. Cognition 87, B35–B45.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Piro, J. M., and Ortiz, C. (2009). The effect of piano lessons on the vocabulary and verbal sequencing skills of primary grade students. Psychol. Music 37, 325–347.

CrossRef Full Text

Rauscher, F. H., Shaw, G. L., and Ky, K. N. (1995). Listening to Mozart enhances spatial-temporal reasoning: towards a neurophysiological basis. Neurosci. Lett. 185, 44–47.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Rauscher, F. H., and Zupan, M. A. (2000). Classroom keyboard instruction improves kindergarten children’s spatial-temporal performance: a field experiment. Early Child. Res. Q. 15, 215–228.

CrossRef Full Text

Rey, A. (1941). L’examen psychologique dans les cas d’encéphalopathie traumatique [The Rey Auditory Verbal Learning Test, RVLMT]. Arch. Psychol. (Geneve) 28, 21.

Rickard, N. S., Vasquez, J. T., Murphy, F., Gill, A., and Toukhsati, S. R. (2010). Benefits of a classroom based music program on verbal memory of primary school children: a longitudinal study. Aust. J. Music Educ. 1, 36–47.

Roden, I., Grube, D., Bongard, S., and Kreutz, G. (in press). Does music training enhance working memory (performance)? Findings from a quasi-experimental longitudinal study. Psychology of Music.

Saykin, A. J., Robinson, L. J., and Stafiniak, P. (1992). “Neuropsychological changes after anterior temporal lobectomy: acute effects on memory, language, and music,” in The Neuropsychology of Epilepsy, ed. T. L. Bennett (New York: Plenum Press), 263–290.

Schellenberg, E. G. (2011). Examining the association between music lessons and intelligence. Br. J. Psychol. 102, 283–302.

Pubmed Abstract | Pubmed Full Text

Schellenberg, G. (2009). “Musikunterricht, geistige Fähigkeiten und Sozialkompetenzen: Schlussfolgerungen und Unklarheiten [Music lessons, intellectual abilities, and social skills: conclusions and confusions],” in Pauken mit Trompeten? Lassen sich Lernstrategien, Lernmotivation und soziale Kompetenzen durch Musikunterricht fördern? [Drilling with trumpets? Is it possible to enhance learning strategies, learning motivation and social competence with music lessons?], ed. (Bonn/Berlin: BMBF), 114–124.

Schlaug, G., Forgeard, M., Zhu, L., Norton, A., Norton, A., and Winner, E. (2009). Training-induced neuroplasticity in young children. Ann. N. Y. Acad. Sci. 1169, 205–208.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Schlaug, G., Jancke, L., Huang, Y. X., and Steinmetz, H. (1995). In vivo evidence of structural brain asymmetry in musicians. Science 267, 699–701.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Sluming, V., Brooks, J., Howard, M., Downes, J. J., and Roberts, N. (2007). Broca’s area supports enhanced visuospatial cognition in orchestral musicians. J. Neurosci. 27, 3799–3806.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Stoesz, B. M., Jakobson, L. J., Kilgour, A. R., and Lewycky, S. T. (2007). Local processing advantage in musicians: evidence from disembedding and constructional tasks. Music Percept. 25, 153–165.

CrossRef Full Text

Strait, D., and Kraus, N. (2011). Playing music for a smarter ear: cognitive, perceptual and neurobiological evidence. Music Percept. 29, 133–146.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tierney, A., Bergeson, T. R., and Pisoni, D. B. (2008). Effects of early musical experience on auditory sequence memory. Empir. Musicol. Rev. 3, 178–186.

Pubmed Abstract | Pubmed Full Text

Weiß, R. H. (2006). Grundintelligenztest Skala 2 (CFT 20-R). Revision (4. Auflage) [Culture Fair Intelligence Test: CFT 20-R]. Göttingen: Hogrefe.

Williamson, V. J., Baddeley, A. D., and Hitch, G. J. (2010). Musicians’ and nonmusicians’ short-term memory for verbal and musical sequences: comparing phonological similarity and pitch proximity. Mem. Cognit. 38, 163–175.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keywords: music lessons, music training, verbal memory, visual memory, cognitive development, children, longitudinal study

Citation: Roden I, Kreutz G and Bongard S (2012) Effects of a school-based instrumental music program on verbal and visual memory in primary school children: a longitudinal study. Front. Psychology 3:572. doi: 10.3389/fpsyg.2012.00572

Received: 05 October 2012; Accepted: 04 December 2012;
Published online: 21 December 2012.

Edited by:

Claude Alain, Rotman Research Institute, Canada

Reviewed by:

Theodor Rueber, Bonn University Hospital, Germany
Shinya Fujii, Harvard Medical School, USA

Copyright: © 2012 Roden, Kreutz and Bongard. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: Ingo Roden, Department of Music, Carl von Ossietzky University, Ammerländer Heerstraße 114-118, 26111 Oldenburg, Germany. e-mail: