BRIEF RESEARCH REPORT article

Front. Psychol., 03 August 2021

Sec. Educational Psychology

Volume 12 - 2021 | https://doi.org/10.3389/fpsyg.2021.668139

The Contribution of Word-, Sentence-, and Discourse-Level Abilities on Writing Performance: A 3-Year Longitudinal Study

  • 1. Portucalense Institute for Human Development (INPP), Department of Psychology and Education, Universidade Portucalense, Porto, Portugal

  • 2. Faculty of Psychology and Education Sciences, University of Porto, Porto, Portugal

  • 3. Department of Education and Psychology, University of Aveiro, Aveiro, Portugal

  • 4. Centro de Linguística da Universidade Nova de Lisboa, Universidade Nova de Lisboa, Lisboa, Portugal

Abstract

Writing is a foundational skill throughout school grades. This study analyzed the development of different levels of written language (word, sentence, and discourse) and explored the relationship between these levels and writing performance. About 95 Portuguese students from two cohorts—Grades 4–7 (n = 47) 6–9 (n = 48)—were asked to produce a descriptive text two times, with a 3-year interval. The produced texts were used to assess spelling, syntactic correctness and complexity, and descriptive discourse as well as text length and quality. The main results showed that there were improvements from Grades 4 to 7 and 6 to 9 in word- and sentence-level skills, along with increases in some dimensions of the descriptive discourse. Moreover, the older cohort performed better than the younger cohort in terms of spelling, syntactic complexity, and text quality, but not in terms of syntactic correctness, one dimension of the descriptive discourse, and text length. Regression analyses showed that writing performance was predicted by word and sentence levels in the younger cohort only, and by discourse-level variables in both cohorts. Overall, despite indicating a generalized growth in writing skills throughout schooling, this study also highlighted the areas that may need additional attention from teachers, mainly in terms of the descriptive features.

Introduction

Writing is a complex skill (Dockrell et al., 2014). It requires the production of legible letters following conventional spellings, to produce words that are organized into sentences and form a coherent written text, expressing the writer's ideas (Abbott et al., 2010). Given the complexity of writing, research into the development of this ability throughout schooling is particularly relevant to understand the trajectories of learning and, based on these, to provide educational guidelines to foster writing skills. Much of extant research provides cross-sectional comparisons (Lerkkanen et al., 2004), and a few studies provide a longitudinal analysis of writing development (Berninger et al., 2010; Jagaiah et al., 2020), mainly with long gaps between the measurement points. These may help to better gauge the development of writing, given its long learning curve. This was the goal of the present study in which we examined the development of different levels of written language and their contribution to writing performance in two cohorts of Portuguese students (from Grades 4 to 7 and 6 to 9). These grades were chosen as they represent critical transitions for Portuguese students, from the first cycle of basic education (Grades 1–4) to the second one (Grades 5–6), and from the second cycle to the third cycle (Grades 7–9).

Levels of language, an analytic tool, is used to understand the complexity of oral and written language (Berninger and Garvey, 1982) based on the analysis of words, sentences, and discourse. In the word level, spelling could be defined as the ability to retrieve, assemble, and select orthographic symbols (Abbott and Berninger, 1993). Large-span cross-sectional studies found that spelling errors (an indicator of spelling skill) decreased throughout schooling, for example, from Grades 2 to 5 (Alves and Limpo, 2015), Grades 2 to 6 (Llaurado and Dockrell, 2020; Magalhães et al., 2020), and Grades 1 to 9 (Bahr et al., 2012). However, a few studies examined the type of errors produced, which can inform about the spelling difficulties of students in each language. Portuguese is a romance language with a simple syllabic structure and orthographic complexities and inconsistencies, classified as an intermediate depth orthography (Seymour et al., 2003; see more details about the Portuguese spelling system in Supplementary Material, section 1). Among several error categorization systems (Treiman et al., 2019), phonological, orthographic, and morphological assessment of spelling (POMAS) seems to be particularly useful, given its specificity of analysis and theoretical support. Grounded on the triple word form theory (Bahr et al., 2012), POMAS codes misspellings into three categories: phonological, orthographic, and morphological. Findings from POMAS revealed that from Grades 1 to 9 there was a decrease in phonological errors coupled with an increase in morphological ones, with most errors across grades being orthographic (Bahr et al., 2012). Despite not assessing morphological errors, Magalhães et al. (2020) found a similar pattern in Portuguese children from Grades 2, 4, and 6. The authors also found that stress marks errors—largely underexplored in Portuguese studies—were present equally in the assessed grades.

Writing a text also requires sentence-level abilities as children need to convert their ideas into sentences. Two key sentence-level measures are syntactic complexity and correctness (Dockrell et al., 2014). One of the most frequent measures of syntactic complexity is clause length (Jagaiah et al., 2020), which is the mean number of words per clause (Berman and Slobin, 1994). Syntactic correctness can be measured through the correctness of word sequences, defined as two contiguous syntactically and semantically acceptable writing units (Videen et al., 1982). A systematic review on Grades 1–12 concluded that syntactic complexity increased throughout schooling (Jagaiah et al., 2020). Similar evidence was found for syntactic correctness. In Grades 3–5, Dockrell et al. (2014) found that younger students produced significantly less correct word sequences than older ones. Likewise, similar findings were found by Malecki and Jewell (2003) showed that several indicators of syntactic correctness consistently increased from early elementary (Grades 1–2) to elementary grades (Grades 3–5), and from elementary to middle grades (Grades 6–8).

By serving specific communicative goals and functions, writing a text requires discourse-related knowledge concerning the structural features of each genre (Berman and Nir-sagiv, 2007; Graham et al., 2013; Dockrell et al., 2014). Like word and sentence levels, discourse-level abilities seem to increase throughout schooling, with students progressively producing texts with more and more genre-specific features. Tolchinsky (2019) found that descriptiveness (i.e., degree to which descriptive texts include the representative features of this genre) increased from Grades 1 to 4. Berman and Nir-sagiv (2007) found similar increases across grades in narrative and expository writing in older samples (Grades 4, 7, and 11 and University).

In addition to improvements in word, sentence, and discourse levels throughout schooling, research has shown that mastering these levels is important for writing performance, assessed in terms of the quality and amount of writing (Berninger, 2012). Both in primary and middle grades, a few studies found that better writing performance is predicted by (a) higher spelling skills [Grades 1–3 in Graham et al. (1997); Grades 3–6 in Abbott et al. (2010); and Grades 7–8 in Limpo et al. (2017)], (b) greater sentence-level abilities [Grades 2 and 3 in Arfé et al. (2016); Grades 3–5 and 5–7 in Beers and Nagy (2011); and Grades 7–8 in Limpo et al. (2017)]; and (c) more genre-related knowledge, including descriptive texts (Beers and Nagy, 2011; Tolchinsky, 2019). Providing stronger evidence on these links, meta-analyses showed that interventions promoting the writing levels of students improved the overall writing performance [Grades 1–6 in Graham et al. (2012) and Grades 4–12 in Graham and Perin (2007)].

Present Study

As most findings surveyed above came from cross-sectional studies, it seems crucial to complement them with longitudinal findings to bring new inputs about writing development. To that end, the following research questions were addressed in a two-cohort sample of Portuguese students: Which are the developmental trajectories of word, sentence, and discourse levels of written language? Moreover, to which degree do these levels predict writing performance? Word, sentence, and discourse levels were measured through spelling, syntactic complexity/correctness, and descriptiveness, whereas writing performance was measured via text length and writing quality. The younger and older cohort were assessed at Grades 4 and 7 and 6 and 9, respectively. Based on the previous research, we expected a skill increase in all levels and an association between these and writing performance. We also hypothesized that the older cohort would show better writing skills than the younger cohort.

Method

Participants and Design

Participants were 101 students from Grades 4 to 9 and enrolled in a cluster of public schools located in urban middle-class neighborhoods from the Center of Portugal. Among these, six children were dropped from the analyses based on the following criteria: four had special education needs and two were identified as extreme outliers in one of the variables under analysis (viz., morphological misspellings per 100 words, which lied more than 3.0 times the interquartile range above the third quartile). All analyses were then based on the data from 95 students and were divided into two cohorts that were assessed twice (T1–T2), with a 3-year gap. The younger cohort was composed of 47 students in Grade 4 (51% girls) with an average age of 9.28 years at T1 (SD = 0.45). The older cohort included 48 students in Grade 6 (62% girls) with an average age of 11.35 years at T1 (SD = 0.53).

Procedure

After the formal agreement from the principal of the school cluster, permission was given to contact the teachers of Grade 4 classes (a total of five) and Grade 6 classes (a total of four). After being explained about the goals and procedures of the study, including the possibility to withdraw at any moment, all teachers and students agreed to participate. In group, students were asked to produce a descriptive text in response to the prompt “Please describe your school,” which has been successfully used in previous research (e.g., Berninger et al., 2009; Dockrell et al., 2014). The full administration procedure lasted for 50 min. This is the typical duration of writing tasks in the participating schools, also used in prior studies (e.g., Llaurado and Dockrell, 2020). The exact same procedure was followed for both cohorts and testing moments. To potentiate the engagement of students, they were told that the best texts would be posted at the school webpage.

Measures

Further details on the measures described below can be found in Supplementary Material, sections 2 and 3.

Word-Level Measures

Based on POMAS (Bahr et al., 2012), misspellings were counted separately by category: phonological errors, orthographic errors, morphological errors, stress marks, and illegible errors. However, the illegible errors were ignored as they were negligible (below 1%). This measure was re-scored by a second judge in the written products of 20% of the pupils at both T1 and T2. Reliability was good for all misspelling types, as indicated by the intraclass correlation coefficient (ICC) for single measures (>0.80).

Sentence-Level Measures

The sentence-level measures included the clause length and percentage of incorrect word sequences. Clause length was computed by averaging the number of words per clause, employing the computerized language analysis (CLAN) software (MacWhinney, 2000). The percentage of incorrect word sequences was calculated by examining the total number of incorrect sequences divided by the total number of sequences. Based on the scoring of 20% of the measures by a second judge, we concluded that reliability was high (ICC for individual measures >0.94).

Discourse-Level Measures

To measure descriptiveness (i.e., the presence of features typical of the descriptive text), we followed the taxonomy proposed by Adam (2001), including anchoring, aspectualization, relation, and subthematization categories. This taxonomy was used in previous studies, which relied on a dichotomous scale to indicate the presence or absence of the category [Grades 4–6 in Moura et al. (2015); Grade 3 in Pereira and Gonçalves (2017)]. Because our sample was older and we were concerned that this dichotomic coding would lead to ceiling effects, we added a new level indicating the presence and elaboration of information in each category. Thus, we used a three-point scale from 0 to 2, with the highest scores indicating a higher degree of discourse elaboration [for a similar coding scheme in argumentative texts, see Limpo and Alves (2013)]. Two independent judges scored these dimensions across all texts. Disagreements were solved through a discussion.

Writing Performance

Two measures were used: text length and text quality. Text length was measured through the total number of words provided by CLAN. Text quality was assessed using a holistic scale ranging from 1 (low quality) to 7 (high quality), on creativity, coherence, syntax, and vocabulary (Alves et al., 2016). To avoid transcription biases on quality assessment, texts were previously typed, and misspellings were corrected (Berninger and Swanson, 1994). Two independent judges rated the text quality of all texts produced. Inter-reliability was high, as measured by the ICC for average measures, which was 0.91 at T1 and T2.

Results

Excepting one measure, we confirmed that our study revealed no distributional problems, as the absolute values of these indexes of skewness and kurtosis did not exceed 3.0 and 10.0, respectively (Kline, 2005). We found a ceiling effect in the descriptive dimension of aspectualization in the younger cohort at T2. Thus, this variable was not included in the analyses. Descriptive statistics for variables are presented on Table 1.

Table 1

Younger cohortOlder cohort
T1 (Grade 4)T2 (Grade 7)T1 (Grade 6)T2 (Grade 9)
MSDMSDMSDMSD
Word level (per 100 words)
Phonological misspellings0.971.350.500.790.640.890.170.34
Orthographic misspellings1.771.450.980.991.111.420.400.55
Morphological misspellings0.911.010.660.781.021.100.510.59
Stress mark misspellings2.842.461.711.841.881.650.800.88
Sentence level
Incorrect word sequences (%)6.593.154.002.635.892.944.723.38
Clause length5.110.845.230.835.370.956.071.13
Discourse level
Anchoring0.600.580.280.450.270.450.170.43
Sub thematization1.110.731.600.581.290.621.690.47
Relationship0.130.340.340.520.270.540.290.50
Writing performance
Text length141.4943.61169.0650.30137.4449.66177.0451.14
Text quality3.201.144.671.183.901.325.041.07

Means and SD for all variables by cohort and time.

Cohort and Time Differences at the Word Level

To examine whether the type of misspellings varied across cohort and time, we conducted a 2 (cohort [younger, older]) ×2 (time [T1, T2]) × 4 (misspellings type [phonological, orthographic, morphological, stress mark]) ANOVA, with repeated measures on the two last factors. The results revealed a main effect of time, [F(1, 279) = 64.30, p < 0.001, η = 0.41], a main effect of misspellings type, [F(1, 279) = 37.67, p < 0.001, η = 0.39], and a main effect of cohort, [F(1, 93) = 10.76, p < 0.001, η = 0.10]. Additionally, there were significant two-way interactions of misspellings type with both cohort, [F(3, 279) = 4.86, p = 0.003, η = 0.05], and time, [F(3, 279) = 5.97, p = 0.001, η = 0.06]. These were decomposed with simple-effect analyses followed up by pairwise comparisons with Bonferroni adjustments, for misspellings type × cohort and misspellings type × time. Overall, the results showed that stress mark errors were the most frequent and that, except morphological errors, the younger cohort produced more misspellings of all types than the older one (for complete results, see Supplementary Material, section 4).

Cohort and Time Differences at the Sentence Level

To examine whether the percentage of incorrect word sequences and clause length varied across cohort and time, we conducted two 2 (cohort [younger, older]) × 2 (time [T1, T2]) ANOVAs. For incorrect word sequences, the results revealed a main effect of time, [F(1, 93) = 24.21, p < 0.001, η = 0.21], with a decrease from T1 to T2. For clause length, the results showed a main effect of time, [F(1, 93) = 9.14, p = 0.003, η = 0.09], a main effect of cohort, [F(1, 93) = 15.85, p < 0.001, η = 0.15], and an interaction between the two, [F(1, 93) = 4.56, p = 0.04, η = 0.05]. Simple-effect analyses showed that clause length increased across time only for the older cohort, [F(1, 93) = 13.45, p < 0.001, η = 0.13], and that clause length differed between the cohorts at T2 only, [F(1, 93) = 17.11, p < 0.001, η = 0.16]. The older cohort produced longer clauses than the younger one.

Cohort and Time Differences at the Discourse Level

To examine whether descriptive dimensions varied across cohort and time, we conducted a 2 (cohort [younger, older]) × 2 (time [T1, T2]) × 3 (descriptive dimensions [anchoring, sub-thematization, relationship]) ANOVA, with repeated measures on the two last factors. The results revealed a main effect of time, [F(1, 186) = 6.38, p = 0.01, η = 0.06] and a main effect of descriptive dimensions, [F(2, 186) = 293.42, p < 0.001, η = 0.76]. In addition, we found significant two-way interactions of descriptive dimensions with cohort, [F(2, 186) = 5.90, p = 0.003, η = 0.06], and with time, [F(2, 186) = 22.25, p < 0.001, η = 0.19]. These were decomposed with simple effects analyses followed up by pairwise comparisons with Bonferroni adjustments, for descriptive dimensions × cohort and descriptive dimensions × time. In general, the findings showed different performances between dimensions, with higher results in sub-thematization, which increased from T1 to T2 in both cohorts (for complete results, see Supplementary Material, section 4).

Cohort and Time Differences in Writing Performance

To examine whether text length and text quality varied across cohort and time, we conducted two 2 (cohort [younger, older]) × 2 (time [T1, T2]) ANOVAs. For text length, the results revealed a main effect of time, [F(1, 93) = 30.63, p < 0.001, η = 0.25], with pupils writing longer texts at T2 than T1. For text quality, the results showed a main effect of cohort, [F(1, 93) = 6.99, p = 0.01, η = 0.07], with the older cohort producing better texts than the younger one; and a main effect of time, [F(1, 93) = 94.80, p < 0.001, η = 0.51], with pupils writing better texts at T2 than T1.

Contribution of Written Language Levels to Writing Performance

Table 2 presents the bivariate correlations between all variables at T1 and T2 for both cohorts. In general, we found that (a) for the younger cohort, writing performance was associated with word and discourse variables at T1, but only with discourse variables at T2 and (b) for the older cohort, text length was associated with sentence-level variables and text quality with spelling-level, sentence-level, and discourse-level variables at T1, whereas at T2 only discourse level was related to writing performance.

Table 2

Time 1Time 2
12345678910111234567891011
Time 1
1. Phonological misspellings-0.050.070.330.240.33−0.01−0.06−0.003−0.14−0.120.320.250.030.03−0.04−0.170.09−0.12−0.12−0.100.12
2. Orthographic misspellings0.33-−0.080.240.08−0.070.01−0.080.190.02−0.150.030.22−0.010.460.11−0.09−0.030.210.080.020.01
3. Morphological misspellings0.060.23-0.190.160.050.040.160.230.030.120.310.190.210.160.30−0.24−0.30−0.01−0.01−0.14−0.17
4. Stress mark misspellings0.580.380.31-0.110.05−0.08−0.040.06−0.13−0.110.400.400.100.42−0.010.02−0.16−0.27−0.050.05−0.05
5. Incorrect word sequences0.260.180.160.06-0.220.080.14−0.040.130.11−0.02−0.110.080.060.240.06−0.02−0.140.20−0.14−0.09
6. Clause length0.200.16−0.01−0.040.27-−0.030.16−0.030.020.170.120.150.240.090.220.13−0.070.060.170.010.17
7. Anchoring−0.24−0.010.07−0.230.08−0.11-0.17−0.140.150.010.120.100.18−0.02−0.06−0.04−0.02−0.10−0.070.120.02
8. Sub thematization−0.33−0.32−0.12−0.19−0.40−0.110.26-0.140.460.48−0.02−0.05−0.030.13−0.0010.060.130.03−0.070.170.30
9. Relationship0.100.20−0.01−0.220.150.230.05−0.15-0.150.24−0.13−0.16−0.020.000.16−0.16−0.11−0.16−0.060.02−0.02
10. Text length−0.31−0.160.16−0.29−0.020.200.410.400.14-0.78−0.17−0.23−0.03−0.11−0.01−0.080.27−0.040.220.270.33
11. Text quality−0.32−0.20−0.13−0.48−0.17−0.180.490.430.160.60-−0.14−0.21−0.06−0.10−0.04−0.020.070.020.030.220.32
Time 2
1. Phonological misspellings0.210.35−0.110.230.210.14−0.05−0.320.24−0.22−0.16-0.630.350.270.07−0.20−0.11−0.15−0.02−0.01−0.02
2. Orthographic misspellings0.300.620.290.450.15−0.14−0.05−0.370.13−0.15−0.130.44-0.080.190.10−0.06−0.11−0.090.02−0.17−0.13
3. Morphological misspellings−0.120.16−0.11−0.010.10−0.30−0.04−0.22−0.14−0.27−0.190.290.36-0.390.320.130.050.040.020.14−0.12
4. Stress mark misspellings0.310.190.170.560.09−0.210.002−0.07−0.04−0.20−0.280.150.400.24-0.100.04−0.06−0.002−0.200.200.001
5. Incorrect word sequences−0.050.320.270.240.26−0.110.160.190.02−0.08−0.090.100.450.340.18-0.23−0.15−0.320.25−0.20−0.22
6. Clause length−0.09−0.04−0.10−0.07−0.22−0.15−0.19−0.040.040.040.220.050.110.160.040.10-−0.02−0.030.170.09−0.11
7. Anchoring−0.03−0.14−0.020.040.20−0.010.27−0.030.190.110.10−0.06−0.11−0.320.020.01−0.09-0.16−0.030.380.42
8. Sub thematization0.21−0.12−0.03−0.030.030.180.090.210.050.002−0.06−0.09−0.35−0.32−0.02−0.21−0.100.02-0.210.490.28
9. Relationship−0.22−0.11−0.07−0.07−0.16−0.02−0.110.19−0.010.030.03−0.14−0.21−0.26−0.20−0.02−0.060.050.18-0.170.04
10. Text length0.03−0.11−0.020.02−0.060.0040.350.260.060.260.27−0.100.04−0.100.240.320.360.090.190.08-0.62
11. Text quality−0.08−0.08−0.07−0.17−0.32−0.070.300.330.050.380.46−0.20−0.160.35−0.08−0.060.310.070.380.380.56-

Bivariate correlations for all variables at T1 and T2 for the younger cohort (below the diagonal) and the older cohort (above the diagonal).

Correlations above |0.30| are significant at an alpha level of 0.05 and signaled in bold.

To test the contribution of the three levels of language to writing performance, we conducted a set of stepwise regression analyses to predict text length and quality at T1 and T2, separately by cohort (final model estimates are presented on Tables 3, 4, for text length and text quality, respectively). For predicting text length and quality at T1, we progressively introduced word-, sentence-, and discourse-level variables at T1 step-by-step. For predicting text length and quality at T2, we introduced text length and quality at T1 as a first step, followed by a step-by-step inclusion of writing levels at T2.

Table 3

Younger cohortOlder cohort
BSEbtpBSEbtp
PREDICTING T1 TEXT LENGTH
Word-level predictors
Phonological misspellings−4.655.62−0.14−0.830.41−5.858.93−0.10−0.660.52
Orthographic misspellings−3.254.47−0.11−0.730.471.405.250.040.270.79
Morphological misspellings10.225.970.241.710.10−2.936.86−0.07−0.430.67
Stress mark misspellings−1.323.34−0.07−0.400.70−2.864.68−0.10−0.610.55
Sentence-level predictors
Incorrect word sequences0.102.040.010.050.962.122.530.130.840.41
Clause length14.357.110.282.020.05−1.418.02−0.03−0.180.86
Discourse-level predictors
Anchoring21.3210.540.282.020.058.3616.020.080.520.61
Sub thematization18.099.040.302.000.0533.6412.000.422.800.01
Relationship15.8918.200.120.870.3911.2814.010.120.810.43
PREDICTING T2 TEXT LENGTH
T1 Text length0.320.150.282.150.040.230.140.231.710.10
Word-level predictors
Phonological misspellings−1.589.03−0.03−0.180.8637.9026.290.261.440.16
Orthographic misspellings−5.588.67−0.11−0.640.52−21.9114.90−0.23−1.470.15
Morphological misspellings−10.3110.17−0.16−1.010.32−4.5213.05−0.05−0.350.73
Stress mark misspellings8.673.790.322.290.0314.537.740.251.880.07
Sentence-level predictors
Incorrect word sequences7.602.750.402.770.01−1.232.22−0.08−0.550.58
Clause length22.237.550.372.940.017.135.840.161.220.23
Discourse-level predictors
Anchoring0.4714.670.000.030.9831.3815.000.262.090.04
Sub thematization18.6511.990.211.560.1347.2015.290.433.09<0.001
Relationship5.6812.450.060.460.658.7814.320.090.610.54

Parameter estimates of the final regression models predicting text length at T1 and T2 for each cohort.

Table 4

Younger cohortOlder cohort
BSEbtpBSEbtp
PREDICTING T1 TEXT QUALITY
Word-level predictors
Phonological misspellings0.100.140.120.740.47−0.210.23−0.14−0.920.36
Orthographic misspellings0.020.110.020.150.88−0.130.13−0.14−0.990.33
Morphological misspellings0.000.150.000.030.98−0.010.18−0.01−0.050.96
Stress mark misspellings−0.190.08−0.40−2.280.03−0.030.12−0.04−0.280.78
Sentence-level predictors
Incorrect word sequences−0.020.05−0.07−0.490.630.040.060.090.600.56
Clause length−0.220.17−0.16−1.260.210.180.200.130.880.39
Discourse-level predictors
Anchoring0.660.260.332.560.02−0.100.41−0.03−0.240.81
Sub thematization0.450.220.292.040.050.870.310.412.840.01
Relationship0.430.450.130.970.340.520.360.211.460.15
PREDICTING T2 TEXT QUALITY
T1 text quality0.430.120.413.45<0.0010.230.110.292.150.04
Word-level predictors
Phonological misspellings−0.120.19−0.08−0.660.510.770.610.251.260.22
Orthographic misspellings0.100.180.080.540.60−0.320.35−0.16−0.920.36
Morphological misspellings−0.270.21−0.18−1.280.21−0.440.30−0.24−1.430.16
Stress mark misspellings0.070.080.110.880.380.150.180.120.810.42
Sentence-level predictors
Incorrect word sequences0.020.060.050.370.72−0.010.05−0.02−0.120.91
Clause length0.400.160.282.460.02−0.020.13−0.02−0.170.87
Discourse-level predictors
Anchoring−0.060.31−0.02−0.200.840.980.340.392.920.01
Sub thematization0.710.250.352.840.010.530.350.231.510.14
Relationship0.700.260.312.700.010.080.320.040.240.81

Parameter estimates of the final regression models predicting text quality at T1 and T2 for each cohort.

Predicting T1 Text Length

For the younger cohort, Steps 1 and 2 did not reach statistical significance, but the inclusion of discourse-level variables increased the amount of variance explained in text length for both the younger, Rchange = 0.19, [Fchange(3, 37) = 4.09, p = 0.01]. The final model with all predictors explained 43% of the variance in text length at T1, [F(9, 37) = 3.04, p = 0.01]. Significant and independent predictors were clause length (b = 0.28) and the descriptive dimensions of anchoring (b = 0.28) and sub-thematization (b = 0.30). The first steps and the final model were, however, not significant for the older cohort, R2 = 0.26, [F(9, 38) = 1.52, p = 0.18].

Predicting T2 Text Length

For the younger cohort, Step 1 with text length at T1 and Step 2 with word-level predictors did not reach statistical significance. However, the inclusion of sentence-level predictors at Step 3 increased the amount of variance explained in text length, R = 0.26, [Fchange(2, 39) = 8.70, p = 0.001]. When Step 4 was added, there was no significant increase. Together, text length at T1 and the three levels of written language at T2 explained 47% of the text length at T2, [F(10, 36) = 3.19, p = 0.01]. Significant and independent predictors were T1 text length (b = 0.28), stress mark errors (b = 0.32), incorrect word sequences (b = 0.40), and clause length (b = 0.37). For the older cohort, there was no significant contribution of Steps 1–3, but the inclusion of descriptive dimensions significantly raised the amount of variance explained, R = 0.26, [Fchange(3, 37) = 6.43, p = 0.001]. The final model explained 49% of the variance in T2 text length, [F(10, 37) = 3.61, p = 0.02]. Significant and independent predictors were the descriptive dimensions of anchoring (b = 0.26) and sub-thematization (b = 0.43).

Predicting T1 Text Quality

For the younger cohort, Step 1 with word-level predictors made a significant contribution to text quality, R2 = 0.23, [F(4, 42) = 3.14, p = 0.02]. The inclusion of sentence-level predictors did not increase the amount of variance explained, but the inclusion of discourse-level predictors did, R = 0.22, [Fchange(3, 37) = 5.37, p = 0.004]. Together, the word-, sentence, and discourse-level predictors explained 50% of the variance in text quality at T1, [F(9, 37) = 4.05, p = 0.001]. The variables with significant and independent contributions were the number of stress mark errors per 100 words (b = −0.40) and the descriptive dimensions of anchoring (b = 0.33) and sub-thematization (b = 0.29). For the older cohort, none of the first steps nor the final model reached statistical significance, R2 = 0.32, [F(9, 38) = 1.99, p = 0.07].

Predicting T2 Text Quality

Step 1 of the analyses, including T1 text quality, proved significant for both the younger, R2 = 0.22, [F(1, 45) = 12.35, p = 0.001], and older cohorts, R2 = 0.08, [F(1, 46) = 5.25, p = 0.03]. Similar across the two cohorts, Steps 2 and 3 did not reach significance, but the inclusion of discourse-level predictors on Step 4 increased the amount of variance explained in text quality for both the younger, R = 0.20, [Fchange(3, 36) = 5.74, p = 0.003], and older cohorts, R = 0.21, [Fchange(3, 37) = 4.10, p = 0.01]. The final model for the younger cohort explained 58% of the variance in text quality, [F(10, 36) = 4.96, p < 0.001]. Significant and independent predictors were text quality at T1 (b = 0.41), clause length (b = 0.28), and the descriptive dimensions of sub-thematization (b = 0.35) and relationship (b = 0.31). The final model for the older cohort explained 37% of the variance in text quality at T2, [F(10, 37) = 2.22, p = 0.04]. Only text quality at T1 (b = 0.29) and the descriptive dimension of anchoring (b = 0.39) made a significant contribution.

Discussion

The first goal of the present study was to trace the developmental path of writing at word, sentence, and discourse levels. The overall results indicated that older students showed better performance at word and sentence levels than younger students, but mixed findings were found for the discourse level.

In line with prior studies (Alves and Limpo, 2015; Llaurado and Dockrell, 2020; Magalhães et al., 2020), we found a general decrease in misspellings from T1 to T2 and more misspellings in the younger than the older cohort. This finding is not surprising as students in higher grades had more years of formal instruction and therefore had more spelling knowledge and writing experience. Moreover, it has been suggested that in free writing older students may be better at selecting words they know how to spell correctly (Graham and Santangelo, 2014). A misspelling analysis revealed four noteworthy findings. First, the pattern of older students producing less misspellings than younger ones was not observed for morphological errors. This finding aligns well with the proposal of Bahr et al. (2012), who suggested that more morphological errors may occur in older students due to the use of more complex vocabulary. This may require advanced morphological (derivational) knowledge that takes time to growth (Nagy et al., 2006; Berninger et al., 2010). In the future, POMAS could be used to examine the development of misspellings beyond Grade 9. Second, phonological misspellings decreased from T1 to T2 in both cohorts, being the least frequent type of misspelling in the older cohort. This is in line with previous research (Bahr et al., 2012) and suggests that sound-based spellings are learned in the earliest phases of spelling development (Treiman and Bourassa, 2000). Third, stress mark errors were the most frequent misspelling in both timepoints and cohorts, also corroborating past findings with younger Portuguese students (Magalhães et al., 2020). Stress mark errors indicate poor lexical knowledge of stress and difficulties in prosodic and orthographic mapping (Defior et al., 2012). From an applied standpoint, this means that spelling instruction is not being entirely successful in fostering this kind of knowledge. Finally, after stress mark errors, orthographic misspellings were the most predominant errors in Grades 4, 6, and 7. This is a common finding in the field (Bahr et al., 2012; Magalhães et al., 2020; Mesquita et al., 2020), suggesting that the Portuguese orthographic complexities and inconsistencies take several years to be mastered. Future research seems to be needed for developing evidence-based practices for improving orthographic knowledge beyond primary grades.

In line with previous meta-analytic findings (Jagaiah et al., 2020), the sentence-level results showed longer clauses in the older than the younger cohort. Moreover, we found stronger time-related increases in the ability of students to craft complex sentences in the older cohort. These results indicate that improvements in syntactic complexity are more salient in older writers, as it has been proposed by scholars in the field (Hunt, 1970; Berninger et al., 2011). This may be related to teacher practices, with a sentence-related explicit instruction, including vocabulary and sentence expansion exercises only at later stages when students are more familiarized with complex genres (Connors, 2000). Our results showed a growth in syntactic correctness from Grades 4 to 7 and 6 to 9, with better performances in the older than the younger cohort. Similar findings have been reported in the field (Malecki and Jewell, 2003; Dockrell et al., 2014), indicating that the ability to craft syntactically correct sentences progresses throughout the primary and middle school. The measure of incorrect word sequences seems particularly sensitive to gauge that progress, including in older students (Espin et al., 2000; Weissenburger and Espin, 2005).

The analyses examining cohort and grade differences in the discourse level revealed three main findings. First, the younger cohort performed better than the older one in terms of anchoring. This unexpected finding may be related to the Portuguese curricula, which, in the initial grades, emphasize the use of titles and introductory sentences to contextualize the theme to readers (Buescu et al., 2015). Though vivid in late primary and early middle grades, these recommendations may be lost over the years. Second, we observed the anticipated increase from T1 to T2 in the ability of students to connect the prompt with other topics, with older students performing better than younger ones. Tolchinsky (2019) already suggested that older students tend to provide elaborated descriptions of topic-related aspects, whereas younger ones usually present lists of attributes, with a few efforts to articulate content. This progressive increase in ideas elaboration over time is common to other genres (Berman and Nir-sagiv, 2007; Beers and Nagy, 2009) and may be linked to progressive mastery of writing of students. Third, relationship was the most absent descriptive feature (either alone in Grade 4, or together with anchoring in Grades 6, 7, and 9). Clearly, ability of the students to relate and compare concepts was poor in all grades assessed, which is alarming, given the importance of this feature. Descriptive texts should establish links between sub-topics through comparisons or metaphors, which allow readers to form a picture in their minds (Adam, 2001). More research is needed to understand which factors underlie this poor performance and which strategies may be used to foster it.

The second goal of this study was to examine the contribution of word, sentence, and discourse levels to writing performance. However, considering the participants/predictors ratio, some caution is needed when interpreting these findings, which should be replicated in future studies with larger samples. Concerning word-level predictors, we found that more stress mark errors were associated with poorer texts at T1. This finding aligns with those of Magalhães et al. (2020) showing that in Grade 4, stress mark errors were reliable predictors of text quality. Regarding sentence-level predictors, we found the overall contribution of syntactic complexity to the amount and quality of students writing, confirming the importance of producing complex and good sentences to perform well in writing (Beers and Nagy, 2011; Limpo et al., 2017). Interestingly, the contribution of word- and sentence-level predictors to writing performance was only observed in the younger cohort, indicating that once students master a writing level, its role in writing performance diminishes (Graham, 2006). A striking finding involving both word- and sentence-level predictors at T2 in the younger cohort was that more stress mark errors, and more incorrect word sequences were associated with longer texts. Although this finding may be an artifact of the current study, it may also hint that by ignoring some aspects of writing, such as word stress and sentence correctness, students may be able to write more. Additional research is needed to replicate and explore these results.

Discourse-level variables were the most salient predictors of writing performance in both cohorts, confirming that the amount and quality of the writing of students is heavily dependent on their ability to follow genre-specific structures (Graham, 2006). This means that a powerful way to increase writing performance in a given genre is to improve student's knowledge about its underlying. Previous meta-analyses on the best methods to develop writing are in line with this conclusion (Graham and Perin, 2007; Graham et al., 2012). It should, however, be noted that our findings showed that not all descriptive categories contributed equally to writing performance. The more relevant category seems to be subthematization, that is, the ability of students to elaborate on the content. Experimental research is needed to examine the degree to which teaching of each of the descriptive categories results in better writing performance.

Implications for Applied Settings

The findings of the current study provide relevant hints for practice. Despite the general growth in writing, there seems to be room for improvement. In younger students, teachers may need to provide additional instruction in terms of syntactic complexity, whereas in older students, they may need to focus on stress mark and orthographic knowledge as well as on descriptive features, mainly, anchoring.

Conclusion

In sum, this study showed the overall growth in word and sentence levels and an increase in the discourse level only for sub-thematization. Moreover, whereas word and sentence level predicted writing performance only in the younger cohort, the discourse level was a relevant predictor in both cohorts. By helping us to understand the long-term curve of writing development, these findings provide hints for researchers to develop evidence-based practices tailored to the writing needs of students.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

LP was responsible for initial study design and data collection. LC coordinated data coding and the manuscript preparation. TL analyzed the data. All authors contributed to the design and conceptualization of the study, literature review, discussion, and wrote and reviewed the manuscript and approved its final version.

Funding

This project was supported by National Funds through FCT—Fundação para a Ciência e Tecnologia, I. P., under the project UIDB/00914/2020. Universidade Portucalense, Infante D. Henrique funded this work by supporting its publication fees.

Acknowledgments

The authors would like to thank Inês Soares de Moura for her collaboration in the data collection and Filipa Reis for data coding. We would also like to thank Protextos team (University of Aveiro) for the support in the design and discussion of instruments.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.668139/full#supplementary-material

References

  • 1

    AbbottR.BerningerV.FayolM. (2010). Longitudinal relationships of levels of language in writing and between writing and reading in grades 1 to 7. J. Educ. Psychol.102, 281298. 10.1037/a0019318

  • 2

    AbbottR. D.BerningerV. W. (1993). Structural equation modeling of relationships among developmental skills and writing skills in primary and intermediate grade writers. J. Educ. Psychol.85, 478508. 10.1037/0022-0663.85.3.478

  • 3

    AdamJ. M. (2001). Les Textes: Types et Prototypes: Récit, Description, Argumentation, Explication et Dialogue.Paris: Nathan.

  • 4

    AlvesR. A.LimpoT. (2015). Progress in written language bursts, pauses, transcription, and written composition across schooling. Sci. Stud. Read.19, 374391. 10.1080/10888438.2015.1059838

  • 5

    AlvesR. A.LimpoT.FidalgoR.CarvalhaisL.PereiraL. A.CastroS. L. (2016). The impact of promoting transcription on early text production: effects on bursts and pauses, levels of written language, and writing performance. J. Educ. Psychol.108, 665679. 10.1037/edu0000089

  • 6

    ArféB.DockrellJ. E.De BernardiB. (2016). The effect of language specific factors on early written composition: the role of spelling, oral language, and text generation skills in a shallow orthography. Read. Writ.29, 501527. 10.1007/s11145-015-9617-5

  • 7

    BahrR.SillimanE.BerningerV.DowM. (2012). Linguist pattern analysis of misspellings of typically developing writers in Grades 1-9. J. Speech Lang. Hear. Res.55,15871599. 10.1044/1092-4388(2012/10-0335)

  • 8

    BeersS. F.NagyW. E. (2009). Syntactic complexity as a predictor of adolescent writing quality: which measures? which genre?Read. Writ. Interdiscip. J.22, 185200. 10.1007/s11145-007-9107-5

  • 9

    BeersS. F.NagyW. E. (2011). Writing development in four genres from grades three to seven: syntactic complexity and genre differentiation. Read. Writ.24, 183202. 10.1007/s11145-010-9264-9

  • 10

    BermanR. A.Nir-sagivB. (2007). Comparing narrative and expository text construction across adolescence: a developmental paradox. Discourse Process.43, 79120. 10.1080/01638530709336894

  • 11

    BermanR. A.SlobinD. I.. (1994). Relating Events in Narrative: A Crosslinguistic Developmental Study. Mahwah, NJ: Erlbaum.

  • 12

    BerningerG.GarveyC. (1982). Tag constructions: structure and function in child discourse. J. Child Lang.9, 151168. 10.1017/S0305000900003688

  • 13

    BerningerV.AbbottR.SwansonH. L.LovittD.TrivediP.LinS.et al. (2010). Relationship of word- and sentence-level working memory to reading and writing in second, fourth, and sixth grade. Lang. Speech Hear. Serv. Sch.41, 179193. 10.1044/0161-1461(2009/08-0002)

  • 14

    BerningerV.SwansonH. L. (1994). Modifying Hayes and Flower's model of skilled writing to explain beginning and developing writing, in Children's Writing: Toward a Process Theory of Development of Skilled Writing, ed E. Butterfield (Greenwich, CT: JAI Press), 5781.

  • 15

    BerningerV. W. (2012). Past, Present, and Future Contributions of Cognitive Writing Research to Cognitive Psychology. London: Psychology Press.

  • 16

    BerningerV. W.AbbottR. D.AugsburgerA.GarciaN. (2009). Comparison of pen and keyboard transcription modes in children with and without learning disabilities. Learn. Disabil. Q.32, 123141. 10.2307/27740364

  • 17

    BerningerV. W.NagyW. E.BeersS. F. (2011). Child writers' construction and reconstruction of single sentences and construction of multi-sentence texts: contributions of syntax and transcription to translation. Read. Writ. Interdiscip. J.24, 151182. 10.1007/s11145-010-9262-y

  • 18

    BuescuH. C.MoraisJ.RochaM. R.MagalhãesV. F. (2015). Programa e metas curricularesde português do ensino básico [Program and Curriculum Goals of Portuguese in Basic Education]. Retrieved from the Directorate-General for Education of the Portuguese Ministry of Education and Science website: https://www.dge.mec.pt/sites/default/files/Basico/Metas/Portugues/pmcpeb_julho_2015.pdf (accessed May 17, 2021).

  • 19

    ConnorsR. J. (2000). The erasure of the sentence. Coll. Compost. Commun.52, 96128. 10.2307/358546

  • 20

    DefiorS.Gutiérrez-PalmaN.Cano-MarínM. J. (2012). Prosodic awareness skills and literacy acquisition in Spanish. J. Psycholinguist. Res.41, 285294. 10.1007/s10936-011-9192-0

  • 21

    DockrellJ. E.ConnellyV.WalterK.CrittenS. (2014). Assessing children's writing products: the role of curriculum-based measures. Br. Educ. Res. J.41, 575595. 10.1002/berj.3162

  • 22

    EspinC.ShinJ.DenoS. L.SkareS.RobinsonS.BennerB. (2000). Identifying indicators of written expression proficiency for middle school students. J. Spec. Educ.34, 140153. 10.1177/002246690003400303

  • 23

    GrahamS. (2006). Writing, in Handjournal of Educational Psychology, eds P. Alexander and P. Winne (Mahwah, NJ: Erlbaum), 457478.

  • 24

    GrahamS.BerningerV. W.AbbottR. D.AbbottS. P.WhitakerD. (1997). Role of mechanics in composing of elementary school students: a new methodological approach. J. Educ. Psychol.89, 170182. 10.1037/0022-0663.89.1.170

  • 25

    GrahamS.GillespieA.McKeownD. (2013). Writing: importance, development, and instruction. Read. Writ.26, 115. 10.1007/s11145-012-9395-2

  • 26

    GrahamS.McKeownD.KiuharaS.HarrisK. R. (2012). A meta-analysis of writing instruction for students in the elementary grades. J. Educ. Psychol.104, 879896. 10.1037/a0029185

  • 27

    GrahamS.PerinD. (2007). A meta-analysis of writing instruction for adolescent students. J. Educ. Psychol.99, 445476. 10.1037/0022-0663.99.3.445

  • 28

    GrahamS.SantangeloT. (2014). Does spelling instruction make students better spellers, readers, and writers? a meta-analytic review. Read. Writ. Interdiscip. J.27, 17031743. 10.1007/s11145-014-9517-0

  • 29

    HuntK. W. (1970). Syntactic maturity in school children and adults. Monogr. Soc. Res. Child Dev.35:67. 10.2307/1165818

  • 30

    JagaiahT.OlinghouseN. G.KearnsD. M. (2020). Syntactic complexity measures: variation by genre, grade-level, students' writing abilities, and writing quality. Read. Writ.33, 25772638. 10.1007/s11145-020-10057-x

  • 31

    KlineR. B. (2005). Principles and Practice of Structural Equation Modeling, 2nd Edn. New York, NY: The Guilford Press.

  • 32

    LerkkanenM.Rasku-PuttonenH.AunolaK.NurmiJ. (2004). The developmental dynamics of literacy skills during the first grade. Educ. Psychol.24, 793810. 10.1080/0144341042000271782

  • 33

    LimpoT.AlvesR. A. (2013). Modeling writing development: contribution of transcription and self-regulation to Portuguese students' text generation quality. J. Educ. Psychol.105, 401413. 10.1037/a0031391

  • 34

    LimpoT.AlvesR. A.ConnellyV. (2017). Examining the transcription-writing link: Effects of handwriting fluency and spelling accuracy on writing performance via planning and translating in middle grades. Learn. Individ. Differ.53, 2636. 10.1016/j.lindif.2016.11.004

  • 35

    LlauradoA.DockrellJ. (2020). The impact of orthography on text production in three languages: Catalan, English, and Spanish. Front. Psychol.11:878. 10.3389/fpsyg.2020.00878

  • 36

    MacWhinneyB. (2000). The Childes Project: Tools for Analyzing Talk. Mahwah, NJ: Erlbaum.

  • 37

    MagalhãesS.MesquitaA.FilipeM.VelosoA.CastroS. L.LimpoT. (2020). Spelling performance of portuguese children: comparison between grade level, misspelling type, and assessment task. Front. Psychol.11:547. 10.3389/fpsyg.2020.00547

  • 38

    MaleckiC.JewellJ. (2003). Developmental, gender, and practical considerations in scoring curriculum-based measurement writing probes. Psychol. Sch.40, 379390. 10.1002/pits.10096

  • 39

    MesquitaA.CarvalhaisL.LimpoT.CastroS. L. (2020). Portuguese spelling in primary grades: complexity, length, and lexicality effects. Read. Writ.33, 13251349. 10.1007/s11145-019-10012-5

  • 40

    MouraI.PereiraL. Á.CoimbraR. L. (2015). Ortografia e produção textual: estudo de caso no sistema de ensino português. Indagatio Didática7, 6286. 10.34624/id.v7i4.2893

  • 41

    NagyW.BerningerV. W.AbbottR. D. (2006). Contributions of morphology beyond phonology to literacy outcomes of upper elementary and middle-school students. J. Educ. Psychol.98, 134147. 10.1037/0022-0663.98.1.134

  • 42

    PereiraÁ.GonçalvesC. (2017). Aprender a escrever em contextos multilingues através da descrição de obras de arte. Signo Pensamiento36, 8499. Retrived from: https://revistas.javeriana.edu.co/index.php/signoypensamiento/article/view/21062 (accessed June 06, 2021).

  • 43

    SeymourP. H. K.AroM.ErskineJ. M.the COST Action A8 network (2003). Foundation literacy acquisition in European orthographies. Br. J. Psychol.94, 143174. 10.1348/000712603321661859

  • 44

    TolchinskyL. (2019). Evolving structure of descriptive texts and learner's abilities. J. Lit. Res.51, 293314. 10.1177/1086296X19858354

  • 45

    TreimanR.BourassaD. C. (2000). Children's written and oral spelling. Appl. Psycholinguist.21, 183204. 10.1017/S0142716400002022

  • 46

    TreimanR.CaravolasM.KesslerB. (2019). What methods of scoring young children's spelling best predict later spelling performance?J. Res. Read.42, 8096. 10.1111/1467-9817.12241

  • 47

    VideenJ.DenoS. L.MarstonD. (1982). Correct Word Sequences: A Valid Indicator of Proficiency in Written Expression, Vol. IRLD-RR-84. Minneapolis, MN: University of Minnesota, Institute for Research on Learning Disabilities.

  • 48

    WeissenburgerJ.EspinC. (2005). Curriculum-based measures of writing across grade levels. J. Sch. Psychol.43, 153169. 10.1016/j.jsp.2005.03.002

Summary

Keywords

writing performance, levels of language, grade level, longitudinal study, spelling, syntactic complexity measures, descriptive text

Citation

Carvalhais L, Limpo T and Pereira LÁ (2021) The Contribution of Word-, Sentence-, and Discourse-Level Abilities on Writing Performance: A 3-Year Longitudinal Study. Front. Psychol. 12:668139. doi: 10.3389/fpsyg.2021.668139

Received

15 February 2021

Accepted

22 June 2021

Published

03 August 2021

Volume

12 - 2021

Edited by

Kuniyoshi L. Sakai, The University of Tokyo, Japan

Reviewed by

Paz Suárez-Coalla, University of Oviedo, Spain; Gavin Bui, Hang Seng University of Hong Kong, China; Sarah Critten, Coventry University, United Kingdom

Updates

Copyright

*Correspondence: Lénia Carvalhais

This article was submitted to Educational Psychology, a section of the journal Frontiers in Psychology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics