Skip to main content

ORIGINAL RESEARCH article

Front. Psychol., 11 January 2023
Sec. Psychology of Language
This article is part of the Research Topic Eye-tracking While Reading for Psycholinguistic and Computational Models of Language Comprehension View all 12 articles

Segmented relations between online reading behaviors, text properties, and reader–text interactions: An eye-movement experiment

\r\nTao Gong,,,*Tao Gong1,2,3,4*Lan Shuai,Lan Shuai1,3
  • 1Haskins Laboratories, New Haven, CT, United States
  • 2School of Foreign Languages, Zhejiang University of Finance and Economics, Hangzhou, Zhejiang, China
  • 3Educational Testing Service, Princeton, NJ, United States
  • 4Google, New York, NY, United States

Purpose: To investigate relations between abilities of readers and properties of words during online sentence reading, we conducted a sentence reading eye-movements study on young adults of English monolinguals from the US, who exhibited a wide scope of individual differences in standard measures of language and literacy skills.

Method: We adopted mixed-effects regression models of gaze measures of early and late print processing stages from sentence onset to investigate possible associations between gaze measures, text properties, and skill measures. We also applied segmented linear regressions to detect the dynamics of identified associations.

Results: Our study reported significant associations between (a) gaze measures (first-pass reading time, total reading times, and first-pass regression probability) and (b) interactions of lexical properties (word length or position) and skill measures (vocabulary, oral reading fluency, decoding, and verbal working memory), and confirmed a segmented linear dynamics between gaze measures and lexical properties, which was influenced by skill measures.

Conclusion: This study extends the previous work on predictive effects of individual language and literacy skills on online reading behavior, enriches the existing methodology exploring the dynamics of associations between lexical properties and eye-movement measures, and stimulates future work investigating factors that shape such dynamics.

1. Introduction

Contemporary views of reading highlight connections among cognitive abilities of readers, properties of texts, reading comprehension, and online reading behavior. The simple view of reading (SVR) proposes that reading comprehension is a function of visual word recognition, decoding, and language comprehension, the first two of which are print-specific aspects of reading skill (Gough and Tunmer, 1986), and the latter is construed as an amodal (not limited to a particular module like reading or listening) aspect of language. However, how language and literacy skills relate to lexical properties (e.g., word frequency, length, predictability, and position in sentence) and online reading behavior remains implied, at best, in SVR. In addition, the self-teaching hypothesis (STH) (Share, 1995) proposes that decoding allows developing readers to transform unfamiliar printed letter strings into recognizable sounds from their spoken language. This process helps readers to internalize the orthographic features of new words. Although highlighting that decoding skill predicts development of reading comprehension, thus being necessary for a reader to learn all words, orthographically regular or not, STH does not state explicitly how decoding helps comprehension during online sentence reading. Furthermore, the lexical quality hypothesis (LQH) (Perfetti, 2007) and the verbal efficiency theory (VET) (Perfetti, 1985) advocate that what distinguishes good and poor readers is the ability to efficiently map orthographic forms to phonological representations, and ultimately to semantics. However, it is unclear how different aspects or levels of language and literacy skills influence reading processes.

Existing studies of individual differences in reading often focus on offline outcomes (e.g., reading comprehension), and these outcomes are in fact cumulative end products of various processes involved in meaning construction (Snow, 2002). Recent studies have begun to shift their attentions from reading outcomes to reading processes, as in moment-to-moment measures (e.g., eye-movements) of reading behavior (e.g., Traxler, 2007; Rayner, 2009b; Kuperman and Van Dyke, 2011; Radach and Kennedy, 2012; Rayner et al., 2012, 2015; Kuperman et al., 2018). Eye-movement patterns during reading are found to vary with lexical properties (Rayner and Duffy, 1986; Rayner, 1998; Joseph et al., 2013). In addition, eye-movement patterns also rely on cognitive capacities that support reading. The dynamics of information processing during reading is governed not only by lexical properties of the text (Radach and Kennedy, 2012), but also by knowledge and cognitive resources of the reader (Beck et al., 1982; Gough and Tunmer, 1986; Hoover and Gough, 1990; Catts et al., 2006). Reading comprehension emerges as a juxtaposition of the lexical properties and the skills, knowledge, and experience of the reader (Perfetti and Lesgold, 1977; Nelson Taylor and Perfetti, 2016; Kuperman et al., 2018).

Many previous studies have reported the “direct” roles of language and literacy skills in reading outcomes or predicting online reading behavior, but there lack enough investigations on whether those skills could also “indirectly” influence reading behavior through interactions with lexical properties, given that lexical properties are central to regulation of gaze behavior during connected text reading and influence of effortful lexical, syntactic, semantic, and pragmatic processing (Duffy et al., 1988; Rayner, 2009a). In addition, many studies have focused selectively on university students (there are exceptions though, e.g., one on participants of similar age and skill to those in our study (Kuperman and Van Dyke, 2011), and two on readers younger than those in our study (Joseph et al., 2013; Valle et al., 2013). University students often have a narrow range of language and literacy skills centered above average, which makes them insufficient to reveal the potentially much wider scope of individual differences in those skills and the general effects of such differences on online reading behavior (Henrich et al., 2010). Furthermore, through simple regression analyses, many existing studies only reported whether or not an online reading behavior is correlated with certain lexical properties and/or individual skills, yet there lack investigations on the dynamics of identified relations, e.g., does an identified correlation follow a simple linear relation, a nonlinear relation, or else? Given that there has been accumulated evidence informing us about what lexical properties or individual skills may or may not influence or be correlated with online reading behavior, it is time to further examine the dynamics of such causal or correlational relations concerning lexical properties, individual skills, and online reading behavior.

Noting these and given the dearth of research on how differences in basic reading skills and vocabulary exert influence on online reading at a sentence level, this study was designed to yield empirical data and inform relevant theories. Based on eye-movement measures and online reading process, this study aimed to investigate two research questions:

(a) Can interactions between language and literacy skills and lexical properties influence online reading behavior?

(b) What is the dynamics of the correlation between online reading behavior and lexical properties?

Answers to these questions will bring an intimate view of reading process at the levels of words, phrases, and larger units, and contribute to the research on how lexical properties and individual differences in language and literacy skills jointly affect online reading behavior.

Following a data-driven approach, this study centered on the skills concerned with reading process as gauged by online reading behaviors, and investigated how these skills interact with lexical properties during online reading. In view of the existing theories (e.g., SVR, STH, LQH, and VET), we focused on four skills: decoding, reading fluency, word knowledge (vocabulary), and working memory (see next section for details). Some of them were omitted in early studies [e.g., working memory was not included by Kuperman and Van Dyke (2011)]. In our study, all these skills were assessed by a battery of standard tests. In addition, as part of a general research program aimed at developing profiles of adolescent and young adult readers (aged from 16 to 25 years), especially those whose educational and occupational prospects might be constrained by their limited language and literacy skills, our study targeted on non-university students, who possess a much wider range of individual differences in these skills than would typically be found in university students (Braze et al., 2007). This enables a more detailed examination of the role of individual differences in reading behavior than would be possible with a more restricted range of differences (Peterson, 2001). Furthermore, after identifying significant interactions between lexical properties and individual skills on online reading behavior gauged by eye-movement measures, we further quantified the dynamics between the aspects involved in the significant interactions, i.e., the lexical properties and the eye-movement measures in participants with high and low levels of the skills. This data-driven analysis helps reveal how the skills influence online reading behavior via interactions with lexical properties.

In terms of methodology, we applied mixed-effects regression models on rich observations, and carefully controlled the family-wise errors, collinearity, and overfitting. This type of models can simultaneously address the main effects of the skill measures and their interactions with lexical properties in one model, and collectively reveal the key interactions with lexical properties. In addition, we designed a way to visualize the correlations between eye-movement measures and lexical properties under different levels of skills, and selected among popular regression models the “best” one to reflect the dynamics. A similar method was practiced to detect quantitative relations between decoding skills and comprehension scores in reading assessment (Wang et al., 2019).

Our study did not find significant effects attributable to individual skill differences, due primarily to the wider spans of the abilities in our participants than those of university students recruited in previous studies. Nonetheless, we identified significant interactions between lexical properties and individual skills, including: interactions between word length and verbal working memory and oral comprehension plus vocabulary in regulating first-pass reading time, interactions between word position and oral reading fluency and verbal working memory in shaping total reading times, and interaction between word position and decoding in adjusting first-pass regression probability. Our analysis revealed a segmented linear dynamics between lexical properties and eye-movement measures, which could be further manipulated by individual skills. All these findings reveal important predictability of those skills on online reading behavior, and contribute to theoretical discussions on how those skills regulate reading behavior at a sentence level through interactions with lexical properties. Note that more research is needed to better understand what factors shape the pivot points in the segmented linear curves.

2. Target skills and recent studies on them

Among various language and literacy skills, we focused on four of them.

Decoding is the ability to apply the orthography-to-phonology correspondence rules to pronounce written words. It is essential to translating print to spoken language, and includes, at least, the knowledge of letter patterns and letter-sound relationships, upon which all other reading skills are built (Share, 1995). SVH claims that decoding, together with listening comprehension, makes substantial contributions to variation in reading comprehension. Studies have revealed that reading comprehension differences are associated with decoding skill differences in children and adolescent readers (Shankweiler et al., 1999) and that the ability to retrieve phonological cues can predict individual differences in reading fluency (Barth et al., 2009). Studies of online reading processes have discovered that a high decoding skill enables a rapid access to a word’s orthographic form and its meaning, thus accelerating word naming speed (Manis and Freedman, 2001) and reflecting high text-level reading fluency and word-level recognition during connected text reading (Wolf et al., 2002).

Reading fluency is the ability to read connected text quickly, accurately, and with expression. Conventional measures like the Gray Oral Reading Test (Wiederholt and Bryant, 2001) assess oral reading fluency. Recent tests measure this skill through silent reading, e.g., the Silent Reading Efficiency and Comprehension Test (Wagner et al., 2010). Regardless of modality, reading fluency measures draw on important capacities to lexical access (Perfetti, 1985) and mediate reading comprehension (e.g., Tilstra et al., 2009; Macaruso and Shankweiler, 2010; Silverman et al., 2012). Longitudinal and corpus-based studies have shown that reading fluency is a reliable index of reading comprehension in students (Fuchs et al., 2001; Miller and Schwanenflugel, 2008; Reschly et al., 2009; Petscher and Kim, 2011) and it performs as well as or better than other reading comprehension tests as a predictor for higher stakes comprehension tasks (Baker et al., 2008; Marcotte and Hintze, 2009). Eye-movement studies have also revealed that phonemic awareness, a known predictor for early word recognition and decoding, contributes to reading fluency (Ashby et al., 2013).

Vocabulary is another key component of reading skills. Orally assessed vocabulary knowledge captures variance in reading comprehension, even if comprehension and decoding skill are accounted for (Braze et al., 2007; Tunmer and Chapman, 2012). Vocabulary breadth and depth, as well as semantic relatedness can predict individual differences in reading comprehension of fourth-grade students (Swart et al., 2016). Oral vocabulary makes an independent contribution to reading comprehension in grade school children (Ouellette and Beers, 2010) and young adult readers (Braze et al., 2007), and serves as a strong predictor for reading comprehension in typically developing Grades 1–3 students and dyslexic readers of Grades 4–5 (Chik et al., 2010). During sentence reading, high-vocabulary readers are found more likely to make online elaborative inferences than low-vocabulary ones (Calvo et al., 2003). Nelson Taylor and Perfetti (2016) report that: readers with greater knowledge of less common words tend to read faster and with greater accuracy in paragraph reading, and the amount of exposure to phonological and semantic constituents of words during training modulates re-reading behavior in this process.

Verbal working memory enables readers to hold on to verbal cues to comprehend lengthy or complex sentences, and thus facilitates readers’ abilities to derive compositional meanings of sentences. High working memory capacity can accelerate the time course of predictive inferences during sentence reading (Estevez and Calvo, 2000). Compared to readers with higher working memory capacity, those with lower capacity exhibit more difficulties (in terms of longer regression and total fixation time) in associating relative clauses with preceding fragments (Traxler, 2007), and spend more time re-reading ambiguous regions of texts (Clifton et al., 2003). Higher working memory capacity is also associated with higher reading fluency (with lower gaze durations and fewer look-backs from the final word of a sentence) (Calvo, 2004).

Motivated by previous studies on those skills, our study attempted to explore how reader-text interaction predicts reading patterns between good and poor readers differing in those skills.

In this line of research, existing studies often focus on identifying (by mixed-effects or generalized regression models, or machine learning models) the language and literacy skills that directly or indirectly (via interaction with lexical properties) cast important effects on reading process, but rarely touch upon the dynamics of any identified correlations between lexical or individual properties and reading process, e.g., whether and how the correlation between target skills and lexical properties change alongside the levels of the skills. For example, a recent study (Kuperman and Van Dyke, 2011) has shown that individual scores in rapid automatized letter naming (RAN) and word identification tests can supersede the effects of word length and frequency at early processing stages, and serve as stronger predictors than word frequency across eye-movement measures. However, family-wise Type-I error was not carefully controlled in the analyses (e.g., the same critical p value of 0.05 was used over 150 models involving multiple predictors that are correlated with each other), which weakens the claims that those skill measures are reliable predictors for online reading behavior.

Another study from the same group (Kuperman et al., 2018) incorporated more cognitive and linguistic skills, used sentence stimuli with increasing lexical, syntactic, and discourse complexity, and adopted random forest models to detect key predictors for eye-movement measures. This work analyzed the effects of lexical properties, individual skills, interactions between word length and those skills, and sentence complexity on eye-movements around words inside sentences, at the end of sentences, and whole passages. The analyses reported reading habit, vocabulary size, reading efficiency, vocabulary IQ, and rapid naming scores as key predictors on eye-movement patterns during online reading.

This data-driven approach fails to identify multiple factors having dominant and comparatively small yet still important effects. In a random forest model, extremely-high relative importance score of a predictor could mask the roles of other predictors. Since importance scores are relative to predictors, one random forest model cannot address all possible interactions between lexical properties and skill measures. In addition, the work indirectly examined the effects of interactions with word length: word length was segmented into long and short groups, and two random forest models were fitted respectively on the two groups to detect important skill measures whose effects exhibited different tendencies between the two models. The arbitrary, binary segmentation of word length groups presumes that if a skill measure has an influence on word length, the tendencies of the effect should be different on short and long words. This is not always the case; some factors may take effect on very long words, and others may trigger different reading patterns on very short words. A question on whether reading processes differ between individuals with high and low levels of skills is more meaningful than whether such processes differ between long and short words; in this sense, segmentation on skill levels is more informative than segmentation on lexical properties.

3. Materials and methods

The data in this study consisted of: (a) participants’ skill measure data obtained from a battery of standard psycho-educational tests; and (b) their eye-movement data gathered in a sentence reading experiment. The data were collected by trained research assistants. Informed consent was obtained from the participants of at least 18 years old; for those under 18, the participants provided assent and their parents or guardians signed written permissions. All participants were paid a proper remuneration for completing the protocols reported here together with the fMRI protocols reported elsewhere (Shankweiler et al., 2008; Braze et al., 2011). The procedures described here took ∼3.5 h; breaks were provided as needed.

3.1. Participants

Forty-five participants (age in 16–25 years, 27 females) were recruited from adult education centers, community college, and neighborhood-gathering places. Some participants had their secondary schooling interrupted but were then seeking a high school equivalency certificate or resuming work toward a regular high school diploma. At the time of experiment, most participants were enrolled in education programs (e.g., high school, adult school, or community college) (Braze et al., 2007, 2016). All participants were English monolinguals, and had normal or corrected-to-normal vision. They were prescreened to ensure the ability of reading simple sentences with comprehension. Data from one participant were excluded due to not completing all study components.

According to the power analysis in mixed-effects models (Brysbaert and Stevens, 2018), this number of sample size, together with the rich amount of eye-movement observations obtained during reading of multiple (72) sentences containing numerous (358) word types (see section 3.3 Materials and design), is sufficient to detect reliable significant factors.

3.2. Skill measures

Each participant was assessed in six domains of language and literacy skills, which served as the bases for analysis. Table 1 shows the raw (and normative wherever available) scores of each measure and a key to the labels of them. The domains and the tests used to measure them were:

TABLE 1
www.frontiersin.org

Table 1. Raw scores and keys of the skill measures over 44 participants.

(1) Vocabulary, assessed by the Peabody Picture Vocabulary Test-Revised (ppvt) (Dunn and Dunn, 1997) and the Wechsler Abbreviated Scales of Intelligence Expressive Vocabulary Test (wasi.v) (Psychological Corporation, 1999). Table 1 shows both raw and standard scores (normative sample mean = 100, SD = 15) of ppvt and both raw and t-scores (normative sample mean = 50, SD = 10) of wasi.v. Differences in word knowledge stem from (a) variations in language experience (in speech or print) and (b) differences in the ability to profit from it. Vocabulary is a good proxy for general, amodal, language ability of the community sample recruited in our study (Braze et al., 2016).

(2) Listening comprehension, assessed by the even-numbered items from the Reading Comprehension subtest of the Peabody Individual Achievement Test-Revised (piat.l) (Markwardt, 1998). Using the odd numbered items from this test for reading comprehension and the even numbered items for listening comprehension gives us a pair of tests well matched in task demand for both input modalities. Table 1 shows both raw and grade equivalent scores, the latter of which were calculated following Markwardt (1998) (see Braze et al., 2007 for details). Knowledge of vocabulary, compositional semantics, and syntax constitute the bases of oral language comprehension (Birch and Rayner, 1997; Frisson and McElree, 2008). The ability to understand language presented to the ear is a good indicator of general, amodal, language comprehension ability.

(3) Decoding, assessed by the Woodcock-Johnson-III Word Identification subtest (wid) (Woodcock et al., 2001) and the Woodcock-Johnson-III Test of Achievement Word Attack subtest (watt) (Woodcock et al., 2001). These are untimed tests for the ability to accurately pronounce printed words and non-words. Table 1 contains both raw and grade equivalent scores of the two measures.

(4) Reading comprehension, assessed by the odd numbered items from the Reading Comprehension subtest of the Peabody Individual Achievement Test-Revised (piat.r) (Markwardt, 1998) and the accuracies of the Passages 5, 7, and 9 from the Gray Oral Reading Test (gort.comp) (Wiederholt and Bryant, 2001). Calculation of grade equivalent scores of piat.r followed Braze et al. (2007). There were no standard scores of gort.comp, due to using only a subset of passages. Reading comprehension has been usefully thought of as the product of an individual’s facility with language and decoding skill (Gough and Tunmer, 1986).

(5) Oral reading fluency, assessed as the reading speed (words per minute) for Passages 5, 7, and 9 from the Gray Oral Reading Test (gort.wpm); the total number of words in these passages is 361 (Wiederholt and Bryant, 2001). There were no standard scores, since the measure was based on an abbreviated form of the Gray Oral Reading Test. Oral reading fluency consists of visual scanning, decoding, and high level language processing (Silverman et al., 2012).

(6) Verbal working memory, assessed by a listening version of the Sentence Span task (sspan.corr) (Daneman and Carpenter, 1980). This ensures non-confoundness with reading skills. Verbal working memory has been shown to account for differences in vocabulary growth independent of language exposure (Gathercole and Baddeley, 1989; Gathercole et al., 1999; Gupta, 2006).

These individual difference measures can be grouped into two sets: those explicitly linked to reading ability (reading comprehension, decoding skill, and oral reading fluency), and those not (listening comprehension, vocabulary, and verbal working memory) (Gough and Tunmer, 1986; Hoover and Gough, 1990). They tap into abilities equally important to comprehension, no matter whether the language input arrives by ear or by eye.

In addition to these domains, we also assessed print experience by a magazine title recognition checklist (MRT) and an author recognition checklist (ART) (cf., Stanovich and Cunningham, 1992) to gauge a person’s experience with language in printed form, which for literate individuals may well be a substantial part of their overall language experience, and visual working memory based on a computerized version of the Corsi Blocks task (corsi) (Corkin, 1974) implemented in Psyscope (Cohen et al., 1993). Given the fact ART and MRT only show high validity and reliability in proficient readers (e.g., university students) in their dominant language (McCarron and Kuperman, 2021), whereas our study is based upon participants having a wide span of reading skills, we excluded print experience in the regression analyses. In addition, compared to visual working memory, verbal working memory is more relevant to our sentence reading experiment, so we also excluded visual working memory in the regression analyses.

Prior to regression modeling, we examined the distributions of raw scores for deviations from normality. Several scores showed high skewness (absolute values over .5). To them, we applied Box-Cox transformations (Box and Cox, 1964) using the bcpower function in the R package car (Fox and Weisberg, 2011). All variables, transformed or not, were standardized by converting to Z-scores. Table 2 is the correlation table of the transformed and standardized measures (cf., Braze et al., 2007, 2016).

TABLE 2
www.frontiersin.org

Table 2. Correlations between the age and the 9 skill measures, after Box-Cox transformation (for ppvt, watt, piat.r, and gort.comp) and standardization.

To reduce collinearity and the total number of predictors in the regression models, we combined measures tapping into common latent constructs. This was done by (a) taking the average of the transformed and standardized scores, and then (b) converting the average scores back to Z-scores. Measures of vocabulary and listening comprehension were combined into a composite measure of oral comprehension plus vocabulary (oral.comp) (Tunmer and Chapman, 2012; Braze et al., 2016; Kukona et al., 2016). Composites were also derived for decoding (decod.comp) and reading comprehension (readcomp.comp). Table 3 shows the correlation table of the centered and transformed skill measures.

TABLE 3
www.frontiersin.org

Table 3. Correlations between the age and the 5 composite or independent measures.

It is not surprising that the correlation between reading comprehension and oral comprehension plus vocabulary is high, since oral knowledge is an important indicator of reading comprehension (see section 2. Target skills and recent studies on them). Table 4 shows the statistics of the regression models between reading comprehension and oral comprehension plus vocabulary, decoding, and both, respectively. Consistent with early findings (Braze et al., 2007), a combination of both skills largely explains the variation of reading comprehension: R2 of the model using decoding is .370, R2 of the model using oral comprehension plus vocabulary is .712, and multiple R2 of the regression model using both decoding and oral comprehension plus vocabulary is .738. Notably, we exclude reading comprehension from the list of predictors in the regression models.

TABLE 4
www.frontiersin.org

Table 4. Regression models targeting reading comprehension.

After these preprocessing stapes, the skill measures used in our regression analyses are: (a) oral comprehension plus vocabulary (oral.comp); (b) decoding (decod.comp); (c) oral reading fluency (gort.wpm); and (d) verbal working memory (sspan.corr).

3.3. Materials and design

Participants were asked to read 72 individual sentences while their eye-movements were recorded. Presentation order was pseudo-random across participants. These sentences were filler items in a study of comprehension process in young adults with limited literacy skills (Braze et al., 2006). All of the sentences were grammatical and transparent in meaning. The word types in them were carefully selected among high frequent words, and common names for persons, states, or holidays. The linguistic aspects of these sentences, such as part of speech or syntactic complexity, were carefully controlled. Supplementary Table 1 shows the complete list of the sentences. Many of these sentences were simple in terms of structure; forty-six stimuli sentences (over 79%) had no embedding structures, e.g., “Most of the students will be going to the class picnic next month.”; and the other 26 had one dependent clause, e.g., “The waiter had told the customer that the pies were fresh.” There were 503 unique word types (819 word tokens) in these sentences, an average of 11.375 word tokens per sentence (range = 11–16). Note that previous studies on university students involved sentences with increasing complexity in semantics and syntax (e.g., Kuperman and Van Dyke, 2011; Kuperman et al., 2018)), we leave the investigation of the relations between sentence complexity, reading skills, and online reading behavior for future work.

Before the experiment, we asked some individuals to evaluate the understandability of these sentences, based on a scale of 5, from “easy to understand” to “hard understand”. These individuals were recruited similarly as the experiment participants, but did not participate the experiment. All of them marked the filler sentences as “easy to understand”.

For each word in a sentence, we recorded its ordinal position in the sentence (note that the sentence initial and final words were excluded), its length in characters (LenW), and its frequency of occurrence per million words (FreqW). Word position is a context-dependent property, but word length and frequency are independent of sentence. Lexical frequencies were obtained from the Corpus of Contemporary American English (COCA).1 Frequency summaries for our materials exclude contractions (n = 2) and proper nouns (n = 23), both having no COCA frequencies. Possessive forms (n = 6) used the COCA frequencies of their uninflected forms. Analyses otherwise included all the remaining words found in the sentences. Most of the type frequencies showed skewed distributions, and thus log-transformed (base e). Following Kuperman and Van Dyke (2011, 2013) and other standard practice, we excluded words with a high likelihood of being skipped (i.e., highly-frequent and very short words).

Table 5 shows the lexical properties of the words contained in the sentences. Regression models targeting online reading indicators (gaze measures) at a word also included parameters for length and frequency of the previous and subsequent words. Differences between LenW and LenW–1 (or LenW+1) are due to the exclusion of sentence initial and final words in the current word set (see Eye-movement measures), so are differences between FreqW and FreqW–1 (or FreqW+1).

TABLE 5
www.frontiersin.org

Table 5. Lexical properties of the words in the sentence stimuli.

Prior to the analyses, we mean-centered lexical properties. Word length was measured in terms of number of characters. Log-transformed word frequency was standardized. Word frequencies were highly correlated with lengths of respective words: Pearson’s r between current word length and current word frequency was −0.731 (p < 0.001), −0.775 (p < 0.001) between previous word length and previous word frequency, and −0.752 (p < 0.001) between next word length and next word frequency. Following Kuperman and Van Dyke (2011), we residualized word frequencies against lengths of respective words. This was done by fitting a regression model for each of the three properties (previous, current, and next words) in which the frequency of the relevant word was predicted by its length. We took the residuals (distances between the observed and fitted values) of these models as the values of word frequency. The residualized frequencies remained strongly correlated with the original frequencies but orthogonal to the lengths of respective words: Pearson’s r between residualized and original frequencies was 0.697 (p < 0.001) for current word frequencies, 0.643 (p < 0.001) for previous word frequencies, and 0.665 (p < 0.001) for next word frequencies. The residualization (or orthogonalization) procedure does not change the result for the residualized variable, the overall explanatory power of the model, and any indices of model fit. Some scholars pointed out that such orthogonalization (Wurm and Fisicaro, 2014) could not be a useful remedy for collinearity; note that in our experiment, the significant factors reported by the regression analyses using the orthogonalized or unorthogonalized word frequency and word length values are the same.

3.4. Apparatus and procedure

During the test session, participants were instructed to read, one by one, a number of sentences, and to answer yes/no comprehension questions about the contents of the sentences just read (see Supplementary Table 1). Comprehension questions occurred immediately after some sentences on about a sixth of trials to ensure that participants stayed focused on the reading comprehension task throughout the session. The mean response accuracy to the comprehension questions was 0.913 (SD = 0.067).

Each sentence was presented on a single line vertically centered on a monitor, which was positioned approximately 64 centimeters from the participants’ eyes. The sentences were displayed in a monospace font (Bitstream MonoSpace 821) in black with a light background, at a screen resolution 1,280 × 1,024 and a refresh rate 85 Hz. Font size was set such that each character subtended about 17 minutes of visual arc. Participants wore an EyeLink II head-mounted eye tracker (SR Research), the sampling rate of which was set to 250 Hz. Before the test session, the accuracy of the eye tracker was calibrated based on a 9-point full-screen calibration. Over the course of the session, measurement accuracy was monitored, and if needed, the device was re-calibrated (this was rarely necessary). Data were collected binocularly. Our analyses were based primarily on the right eye data. The right eye data of one participant was problematic, and therefore, the left eye data of the participant were used.

In each trial, a fixation point appeared first at the position of the second character of the first word of the sentence (vertically centered on the screen and about 1.5 inches from the left edge). After fixating on this point, participants pressed a button to bring up a sentence and started to read it. Sentences would not show up if participants were not fixating on this point. After reading the whole sentence, participants clicked the button again. This prompted either the next trial or the display of a comprehension question. Participants gave answers to the comprehension questions by pressing the buttons denoting “yes” and “no,” respectively.

3.5. Eye-movement measures

We calculated the eye-movement measures using the in-house software (Braze, 2005), which served to tally gaze measures for each word. We removed fixations shorter than 50 ms, as well as blinks and instances of track-loss. We also excluded the sentence initial and final words from analysis, as a common practice (Kliegl et al., 2004). There remained a total of 15,733 eye-movement observations, covering 358 word types in 72 sentences. The volume of the data is comparable to other eye-tracking studies of individual differences. We focused on five informative, widely-used eye-movement measures (Rayner, 1998):

(1) First fixation duration, the duration of the initial fixation a reader makes on a region (word) during first-pass reading. It is typically considered to reflect early stage processes during lexical access (Inhoff, 1984).

(2) First-pass reading time (a.k.a. gaze duration), the summed duration of all fixations a reader makes on a word before fixating any subsequent word, and before gaze leaves the word for the first time, whether advancing to the next word or regressing to an earlier word. It is often considered to reflect sentence structure, parsing decisions (Rayner et al., 1983; Ferreira and Clifton, 1986), or predictability of words in context (Boston et al., 2008). First fixation duration and first-pass reading time are conditional upon a word receiving a first-pass reading. If a word was initially skipped and thus nominally accrued a zero value for these measures, then that data point was omitted from the following analyses, because we do not wish to infer from word-skipping that a word is not processed at all, or that its processing load is zero (Rayner and Pollatsek, 1989).

(3) Total reading time, the sum of all fixations falling again into the current word region. It reflects the integrative effect of both early and late stage processes during lexical access.

(4) Incidence of first-pass regression, coding for whether the eye-movement at the end of first pass reading moved back to a previous part of the sentence (= 1), or advanced to a subsequent word (= 0).

(5) Refixation incidence, being 1 if a word is refixated after the first-pass, or 0 otherwise.

Measures (1) – (3) are continuous, and (4) and (5) are binary (0/1) to capture possible effects on late stage of processing. Measures (4) and (5) are generally treated as indices of processing load associated with integration difficulty (Rayner et al., 1983, 1989).

Table 6 summarizes the gaze measures, which reflect different, but perhaps overlapping stages of word recognition, text comprehension, and integration during online sentence reading. First fixation duration and first-pass reading time reflect the early stages of print processing involving first encounter of a word by the reader following the default reading direction (left to right in English). By contrast, incidence of first-pass regression and refixation incidence reflect the later stages of print processing involving integration of word information with syntactic and/or discourse context or resolution of ambiguity whenever necessary (Vasishth et al., 2013). Total reading time is a cumulative index of “early” and “late” stages of processing. Individual differences in several components of skilled reading (e.g., decoding, oral reading fluency, vocabulary knowledge, working memory) may have different effects as gauged by these eye-movement measures (Kuperman and Van Dyke, 2011; Nelson Taylor and Perfetti, 2016).

TABLE 6
www.frontiersin.org

Table 6. Summary of the eye-movement measures.

In our dataset, 11,965 out of the total 15,733 eye-movement observations (76.050%) were first-pass eye-movements, and only 3,768 had distinct first fixation durations and first-pass reading times. This indicates that during first-pass reading, most words were fixated exactly once (many words in our simple stimuli sentences were short; see Table 5, over half of the words are shorter than 6 characters). Therefore, it is expected that if any factors can exert significant effects during first-pass reading, they might be captured mainly by first-pass reading time, not by first fixation duration. In addition, our stimuli sentences were simple in structure, which might not trigger many regressive eye-movements or second-pass reading in our participants. Therefore, incidence of first-pass regression and refixation incidence might not capture many significant effects, unlike previous studies involving more complex sentence stimuli (Kuperman and Van Dyke, 2011; Kuperman et al., 2018)).

3.6. Analytic approach

We conducted two types of statistical analysis.

First, we used linear and logistic mixed-effects regression models (Baayen, 2008; Quené and van den Bergh, 2008) with crossed random effects to analyze respectively the continuous and categorical eye-movement measures and identify interactions between lexical properties and reading related skills. Mixed-effects models allow for simultaneous consideration of multiple covariates, while keeping the between-participants and between-items variance under statistical control (Pinheiro and Bates, 2000; Baayen et al., 2008). Unlike the random forest models used in Kuperman et al. (2018), mixed-effects models can simultaneously address multiple factors having different scales of effect sizes and directly report significance of main effects and/or interactions.

We fit five mixed-effects regression models (Quené and van den Bergh, 2008) targeting the five eye-movement measures, respectively. To reflect the collinearity of a model, we reported the condition number kappa of the model and the maximum variance inflation factor (VIF) of all predictors in the model. A condition number kappa smaller than 10 and a VIF smaller than 5 typically indicate a low degree of collinearity (Kutner et al., 2004).

Each of the five models included 23 fixed effects, consisting of seven lexical properties, four composite and single skill measures, and 12 interactions between each of the skill measures and each of the lexical properties, namely word position in a sentence, word frequency and word length. This approach provides an integrative picture of the effects of multiple skill measures on eye-movement patterns. We controlled the family-wise Type I error probability by setting the critical p value for identifying significance as 0.05/23 ≈0.00217. Given this extremely strict setting of critical p value, we focused on both the significant (p < 0.00217) and marginally significant (p is close to 0.00217) factors.

Each model included the same random effect structure, consisting of two intercepts respectively for subject and for word nested under sentence, and one slope of word frequency for subject. In principle, the slope of word length for subject should also be added in each model. However, as shown above, word length was negatively correlated with word frequency, and post-hoc analyses revealed that the separate contributions of word length to the variation in the dependent variables was <1%. Therefore, we excluded this slope in the regression models. In addition, maximal random effect structures involving other types of slopes are theoretically desirable (Barr et al., 2013) and have been applied in recent individual difference studies (e.g., Protopapas and Kapnoula, 2016). However, we did not pursue such complicated models in consideration of practical constraints on model convergence (Bates et al., 2015a).

All the mixed-effects models were implemented using the R packages lme4 (Bates et al., 2015b) and lmerTest (Kuznetsova et al., 2017).

Second, after identifying significant interactions, we continued examining the dynamics of lexical properties and eye-movement measures in individuals having different levels of target skills. Very few existing studies have investigated such dynamics. Our approach proceeded as follows. Given a two-way interaction between a lexical property and a skill measure, we first divided the participants into a high and a low group based on the medium value of the skill measure to ensure the same number of participants in each group. Then, we plotted the eye-movement measure in each group against the lexical property. A cross-group comparation of the correlations between lexical properties and eye-movement measures could reveal the effects of individual skill on online reading behavior. Instead of binary groups, quartile or quintile groups were used in some studies (e.g., Protopapas and Kapnoula, 2016), given enough participants in each group for statistical analysis. To identify correlation, we first fit a nonlinear polynomial regression (loess) between the lexical property and the eye-movement measure as the baseline, and then, used widely-adopted regression models in psychological and educational research to quantify the pattern of the correlation. For simplicity, the current study only compared simple linear regression (or logistic regression) and segmented linear regression. For each model, lexical property was treated as an independent variable, and eye-movement measure a dependent one.

Models were compared based on Akaike information criterion (AIC) and mean squared error (MSE). AIC deals with the trade-off between the simplicity and goodness of fit of a model (Akaike, 1974), but AIC alone is less informative when multiple models have similarly high or low AICs (Burnham and Anderson, 2002). In this situation, MSE is referred to, which compromises variance and bias to minimize both (see Equation 1, where obsi is the observed essay score, prei is the predicted score from a model, and n is the number of data points). The best model that appropriately reflects the correlation between lexical property and eye-movement measure is the one having smaller AIC and MSE.

M S E = 1 n i = 1 n ( o b s i - p r e i ) 2 (1)

A recent study examining the correlation between typing speed and writing essay score has used a similar method to identify the dynamics of such correlation (Gong et al., 2022). In that study, additional models like logistic regression and ordinal categorical regression were used for model fitting, but the segmented linear regression remained the best fitting model.

In our study, the segmented regression was implemented using the R package segmented (Muggeo, 2008).

4. Results

The analyses were carried out in R 3.2.4 (R Core Team, 2013). The raw data, R codes, and the results can be found at: https://github.com/gtojty/IndDiff_EM.

All the regression models showed a low degree of collinearity; the kappas of these models were all below 10 and the VIFs of the independent factors in these models were all below 5. The significant main effects of lexical properties reported in these models are shown in Supplementary Table 2 and discussed in Supplementary material. No skill measures showed significant main effects on any eye-movement measures (their p values were all above. 00217), due primarily to the wide spans of the skill measures in our study.

Our study focuses on the interactions whose p values are smaller than (significant) or close to (marginally significant) the threshold. 00217. For the sake of completeness, Tables 710 list all the interactions between lexical properties and skill measures having p values below 0.05/5 = 0.01. Effect size (Cohen’s d) of each interaction was measured using the lme.dscore function in the R package EMAtools.2 Significant (and marginally significant) interactions are visualized in Figures 13. For each interaction, the correlation between the involved lexical property and eye-movement measure in the participants having high and low levels of the involved skill measure can be best described as a segmented linear relation. Below, we discuss these interactions identified in the regression models.

TABLE 7
www.frontiersin.org

Table 7. Interaction on first fixation duration.

TABLE 8
www.frontiersin.org

Table 8. Interactions on first-pass reading time.

TABLE 9
www.frontiersin.org

Table 9. Interactions on total reading time.

TABLE 10
www.frontiersin.org

Table 10. Interaction on incidence of first-pass regression.

FIGURE 1
www.frontiersin.org

Figure 1. Interactions between word length and oral comprehension plus vocabulary (A) and verbal working memory (B) on first-pass reading time. Word length is mean-centered. The two panels in each figure represent the high and low skill groups. The titles of the panels show the level ranges (within round or square brackets) of the skill measure in the two groups and the numbers of participants in these groups. In each panel, the blue line is the loess fitting curve and the shaded area is standard error. The black line is the linear regression fitting curve (“Lm”). “Int.” shows the interception (β0), and “Slope” the slope (β1). Numbers in square brackets are 95% confidence interval of the slope. The red line is the segmented linear regression fitting curve (“SegLm”). “Seg.Point” shows the pivot point at word length, below and above which the slopes of the curve are distinct (see “Slopes”). See Supplementary Table 3A for AIC and MSE of these models. The segmented linear models have the smallest AIC and MSE closest to that of the loess regressions.

FIGURE 2
www.frontiersin.org

Figure 2. Interactions between word position and oral fluency (A) and verbal working memory (B) on total reading time. Word position is mean-centered. The two panels in each figure represent the high and low skill groups. See Supplementary Table 3B for AIC and MSE of different models, which shows the segmented linear models have the smallest AIC and MSE closest to that of the loess regressions.

FIGURE 3
www.frontiersin.org

Figure 3. Interaction between word position and decoding on incidence of first-pass regression. Word position is mean-centered. The two panels represent the high and low decoding groups. See Supplementary Table 3C for AIC and MSE of different models, which shows the segmented linear models have the smallest AIC and MSE closest to that of the loess regressions. “Lm” here is logistic regression. Note that in the left panel, it seems that the loess regression fitting curve also has a pivot point near the lower bound of word position. Since it is much closer to the boundary, there are insufficient data points for the segmented linear model to identify it as a pivot point.

4.1. First fixation duration

Table 7 lists one interaction between word frequency and decoding skill in determining first fixation duration. Its p value is over .00217, so it is not marked as a significant interaction.

4.2. First-pass reading time

Table 8 shows two interactions on first-pass reading time whose p values are below .01. Given their p values are smaller than (or close to) .00217, they are marked significant (or marginally significant). Figure 1 illustrates these interactions by showing that the correlation between word length and first-pass reading time is contingent on oral comprehension plus vocabulary and verbal working memory.

Figure 1 shows that the sensitivity of first-pass reading time to word length is better described as a segmented linear relation than a simple linear relation: the segmented linear curves well match the baseline loess curve and have smaller AIC and MSE than the linear curve (see Supplementary Table 2). In each panel, the segmented linear curve shows a pivot value of word length, below which the slop of the fitting curve remains small, whereas above which the slope increases, indicating that the participants showed longer first-pass reading time when reading longer words. Between the two panels in each figure, the sensitivity of first-pass reading time to word length exhibits different tendencies.

In Figure 1A, compared to the poor readers having low levels of oral comprehension plus vocabulary (the right panel), for words of the same length, the good readers having high levels of that skill (the left panel) had shorter first-pass reading time. Also, the good readers showed smaller slopes in the segmented linear curve than the poor readers (i.e., 5.585 vs. 8.938 and 19.39 vs. 29.757), indicating that the good readers were less sensitive to word length. Finally, the pivot points of word length were similar in the poor (1.652) and good (1.276) readers.

In Figure 1B, similarly, compared to the good readers having high levels of verbal working memory, the poor readers having low levels of that skill spent relatively more time in reading long words, and for both long and short words, their first-pass reading times remained more sensitive to word length (shown by the slopes of the segmented linear curves, 26.938 vs. 21.662 and 8.051 vs. 5.974). Nonetheless, the pivot points of word length in the poor and good readers were similar (1.435 vs. 1.418).

4.3. Total reading time

Table 9 shows three interactions on total reading time whose p values are below .01, two of which are marked as marginally significant and visualized in Figure 2.

Figure 2 illustrates a segmented linear relation between total reading time and word position in a sentence. Total reading time drops when the participants read the first few words in a sentence, and then, increases when they read the latter words in a sentence. The negative and positive slopes of the segmented linear fitting curves clearly reflect this bifurcating tendency.

In Figure 2A, compared to the good readers having high levels of oral reading fluency, the total reading time of the poor readers having low levels of that skill is generally longer, and it is more sensitive to the beginning words in a sentence, as shown by the more negative slopes (−27.098 vs. −11.428) below the pivot points of word position. However, the smaller positive slopes (3.008 vs. 17.01) above the pivot points suggest that the total reading time of the poor readers is less sensitive to the latter words in a sentence. In addition, the pivot points of word position increases from −0.715 in the poor readers to 1.375 in the good readers.

In Figure 2B, compared to the good readers having high levels of verbal working memory, the total reading time of the poor readers having low levels of that skill is less sensitive to word position in a sentence, as shown by the smaller absolute slopes both below (−15.694 vs. −19.821) and above (8.387 vs. 10.615) the pivot points of word position. In addition, the pivot points in the two panels drop from 0.684 in the poor readers to 0.250 in the good readers.

A comparison of Figures 1, 2 reveals that verbal working memory casts its influence on first-pass reading time via interaction with word length and total reading time via interaction with word position. To be specific, compared to the poor readers having low levels of verbal working memory, the first-pass reading time of the good readers is less sensitive to word length, but their total reading time is more sensitive to word position.

4.4. Incidence of first-pass regression

Table 10 shows that the interaction between decoding and word position had a p value below .00217. Figure 3 visualizes this significant interaction.

Figure 3 shows a segmented linear relation between first-pass regression and word position in a sentence. The probability of regression during the first-pass reading starts to increase when the participants read the latter words in a sentence. Compared to the poor readers having low levels of decoding, the probability of regression during the first-pass reading of the good readers increases a lot on the latter words in a sentence, as shown by bigger slopes (.042 vs. .016) above the pivot points of word position. The pivot points of word position are similar in the poor (2.108) and good (2.537) readers.

4.5. Refixation incidence

No interactions on refixation incidence have p values below .01.

5. Discussion

5.1. Effects of interactions between language and literacy skills and lexical properties on online reading behavior

Previous studies have reported significant main effects of some of the language and literacy skills discussed in this paper, or bigger effect sizes of these skills than those of lexical properties (e.g., Kuperman and Van Dyke, 2011). However, in our analyses, main effects of skill measures never reach statistical significance, though those of lexical properties often do. The effect sizes of the skill measures are also smaller than those of lexical properties. This is because that our study focused on individuals with a much wider range of language and literacy skills; only those having the highest scores of the skill measures were comparable to university students (cf. Braze et al., 2007). Such wide range of individual differences in the skill measures could result in insignificance and low effect sizes of the measures on online reading behavior. These findings can enrich existing evidence and trigger revisits on the theoretical discussions of individual differences and their roles in reading process and outcome (comprehension) (Bennink and Spoelstra, 1979; Bleckley et al., 2003).

Although lacking direct influence on online reading behavior, some of the language and literacy related skills could significantly influence online reading behavior via interactions with lexical properties. Our study showed that oral comprehension, vocabulary, verbal working memory, oral reading fluency, and decoding could predict online reading patterns via interactions with word length or position in a sentence. We also compared the effects of the interactions involving these skills on online reading patterns between the good and poor readers with respect to these skills.

To be specific, oral comprehension and vocabulary interact with word length to predict first-pass reading time (see Figure 1A); readers with good oral comprehension skill and vocabulary knowledge could efficiently process words with various lengths, thus being less troubled by long words during first-pass reading. First-pass reading time arguably reflects the duration of lexical processing, including recognition of orthographic or phonological features of a word and retrieval of semantic information from memory once attention is allocated to the word (Inhoff, 1984). This finding contributes to recent discussions on whether vocabulary knowledge could influence reading comprehension over and above the effect of language comprehension including listening comprehension (Braze et al., 2007, 2016; Tunmer and Chapman, 2012; Protopapas et al., 2013). At the early stage of print processing vocabulary knowledge already helps good readers efficiently reduce first-pass reading time on words of various lengths.

Verbal working memory presumably affects the rate at which word information is assimilated during first-pass reading, especially on long words. As shown in Figure 1B, the first-pass reading time of the good readers with high levels of verbal working memory are less sensitive to word length than the poor readers. In addition, verbal working memory helps predict total reading time via interaction with word position (see Figure 2B). Total reading time reflects the integration of early and late processing during lexical access. Word position in a sentence is a context-dependent property. A general increase in total reading time on words toward the end of a sentence reflects so-called wrap-up effects (Rayner et al., 2000; Warren et al., 2009). In our study, such effects became more explicit in readers having high levels of verbal working memory; efficient verbal working memory reduces the processing time for the first few words of a sentence but induces more wrap-up effects towards the end of a sentence.

Oral reading fluency interacts with word position to predict total reading time (see Figure 2A); a high level of this skill is associated with a less sensitivity to the first few words in a sentence, but more sensitivity to latter words in a sentence, in line with the wrap-up effects. In addition, less fluent readers generally have more difficulty in processing individual words and integrating word semantics with context, and hence spend more time reading a few words of a sentence; by contrast, more fluent readers spend less time reading words in a sentence, especially those near the beginning or in the middle of a sentence. These findings are in line with and complement the existing theories on oral and/or silent reading fluency (Fuchs et al., 2001; Tilstra et al., 2009; Kim et al., 2011; Silverman et al., 2012; Ashby et al., 2013). Furthermore, as shown in Figure 2, there is no monotonic change of the correlation between word position and total reading time. This indicates that the effects of oral reading fluency and verbal working memory on regulating online reading patterns are complex, possibly also subject to other factors.

Decoding skill interacts with word position to predict probability of first-pass regression (see Figure 3); good decoders tended to have more regressive reading when reading words towards the end of a sentence, reflecting their sentence decoding processes. Early studies have reported the effects of decoding on early (first-pass reading time) and overall (total reading time) reading and re-reading probability (Kuperman and Van Dyke, 2011; Nash and Heath, 2011; Kuperman et al., 2018). In our study, the effect of decoding on re-reading probability was fulfilled via an interaction with word position. All these are in line with the claims that decoding skill is among the key factors in lexical access (Barth et al., 2009; Hulme and Snowling, 2012) and provide evidence for VET (Perfetti, 1985; Shankweiler and Crain, 1986) and LQH (Perfetti and Hart, 2002; Perfetti, 2007) by showing how decoding influences reading processes.

5.2. Segmented linear dynamics of the correlation between lexical properties and eye-movement measures

In addition to confirming that language and literacy skills can influence online reading behavior indirectly via interactions with lexical properties, our study further investigated the dynamics of the correlation between lexical properties and eye-movement measures regulated by particular individual skills. Our quantitative analyses revealed that such dynamics cannot be simply described as a linear relation; instead, many of the correlations follow a segmented linear relation, with at least two distinct slopes throughout the values of the relevant lexical properties. Some of the dynamics are monotonic (see Figure 1), with positive and increasing slopes around long words, whereas others are not (see Figures 2, 3), with a transition from a negative to a positive slope. The observed segmented linear relations suggest a complex effect of key language and literacy skills on regulating reading patterns via interactions with word length or position. Between the good and poor readers based on some skills, the durations of reading time are different, so are the sensitivity of reading time or regression probability to word length or position. In addition, the pivot values of word length or position in the segmented linear correlations indicate a transition of the degree of correlation. Note that in many cases, the pivot points are not close to the mean value 0, so arbitrary binary segmentation based on word length or position (Kuperman et al., 2018) cannot clearly reveal such dynamics. This dynamics echoes the effects of interactions between lexical properties and skill measures on online reading behavior: due to individual skills, the unimodal associations between eye-movement patterns and lexical properties are broken, the degrees of associations become different when the values of lexical properties are below or above the pivot points, and the high and low levels of the skills further influence the pivot lexical property values and the degrees of associations below and above the pivot values.

The observed dynamics in all these aspects can lead to more comprehensive theories on the dynamic relations between individual skills, text properties, and reading process. For example, some theories of reading (Perfetti and Hart, 2002) and empirical studies (Johnston and Kirby, 2006; Savage, 2006) have challenged the linear assumption between decoding and reading outcomes like reading comprehension. For example, Johnston and Kirby (2006) showed that naming speed, a measure of decoding skill, had its primary effect on less able readers. A recent study of reading assessment has shown distinct relations between decoding skill and comprehension scores between good and poor decoders in Grades 5 to 10 (Wang et al., 2019). Some eye-movement studies have revealed close relations between components of decoding (e.g., phonemic awareness) and other skills (e.g., reading fluency) (Barth et al., 2009; Ashby et al., 2013). Our study enriched the findings in this line of research by visualizing the segmented linear relations between lexical properties and online reading behavior, which are manipulated by individual differences in individual language and literacy skills. This study can also inspire more empirical studies to further investigate what factors help shape the slopes and pivot values in the segmented linear models.

6. Conclusion

This study investigated the eye-movement data of simple sentence reading from 44 young adults in high schools, adult education centers, community colleges, or neighborhood communities. A total of six domains of individual differences, plus age, were tested to assess their effects via themselves and interactions with lexical properties on online reading behavior. Three of these domains tap into components of reading ability: reading comprehension, decoding skill, and oral reading fluency. The other three tap into domains not reading specific: listening comprehension, vocabulary, and verbal working memory. By evaluating the effect of each domain while controlling for the others, we identified a series of interactions between properties of text (length and position) and skills of readers (oral comprehension, vocabulary, verbal working memory, oral reading fluency, and decoding), which manipulated both the early and late stages of online reading process as gauged by eye-movement measures (first-pass reading time, total reading time, and first-pass regression). We also visualize segmented linear dynamics of the effects of these interactions on online reading patterns. All these findings speak to the necessity of incorporating interactions between lexical properties and reading-related skills to enrich empirical evidence, extend and refine theories about reading outcomes and processes, and trigger new theories or hypotheses on how language and literacy skills interact with lexical properties to influence reading process.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://github.com/gtojty/IndDiff_EM.

Ethics statement

The studies involving human participants were reviewed and approved by Haskins Laboratories. Informed consent was obtained from the participants of at least 18 years old; for those under 18, the participants provided assent and their parents or guardians signed written permissions.

Author contributions

TG designed and implemented the experiment. Both author collected and analyzed the data and wrote and edited the manuscript.

Conflict of interest

TG was employed by Google.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2022.1006662/full#supplementary-material

Footnotes

  1. ^ http://corpus.byu.edu/coca/
  2. ^ https://cran.r-project.org/web/packages/EMAtools/index.html

References

Akaike, H. (1974). A new look at the statistical model identification. IEEE Trans. Automat. Contr. 19, 716–723. doi: 10.1109/TAC.1974.1100705

CrossRef Full Text | Google Scholar

Ashby, J., Dix, H., Bontrager, M., Dey, R., and Archer, A. (2013). Phonemic awareness contributes to text reading fluency: Evidence from eye movements. Sch. Psychol. Rev. 42, 157–170. doi: 10.1080/02796015.2013.12087482

CrossRef Full Text | Google Scholar

Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to statistics using R. Cambridge: Cambridge University Press. doi: 10.1017/CBO9780511801686

CrossRef Full Text | Google Scholar

Baayen, R. H., Davidson, D. J., and Bates, D. M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. J. Mem. Lang. 59, 390–412. doi: 10.1016/j.jml.2007.12.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Baker, S. K., Smolkowski, K., Katz, R., Fien, H., Seeley, J. R., Kame’enui, E. J., et al. (2008). Reading fluency as a predictor of reading proficiency in low-performing, high-poverty schools. Sch. Psychol. Rev. 37, 18–37. doi: 10.1080/02796015.2008.12087905

CrossRef Full Text | Google Scholar

Barr, D. J., Levy, R., Scheepers, C., and Tily, H. J. (2013). Random effects structure in mixed effects models: Keep it maximal. J. Mem. Lang. 68, 255–278. doi: 10.1016/j.jml.2012.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Barth, A., Catts, H., and Anthony, J. (2009). The component skills underlying reading fluency in adolescent readers: A latent variable analysis. Read. Writ. 22, 567–590. doi: 10.1007/s11145-008-9125-y

CrossRef Full Text | Google Scholar

Bates, D., Kliegl, R., Vasishth, S., and Baayen, R. H. (2015a). Parsimonious mixed models. arXiv [Preprint]. arXiv:1506.04967v1.

Google Scholar

Bates, D., Machler, M., Bolker, B., and Walker, S. (2015b). Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48. doi: 10.18637/jss.v067.i01

CrossRef Full Text | Google Scholar

Beck, I. L., Perfetti, C. A., and McKeown, M. G. (1982). Effects of long-term vocabulary instruction on lexical access and reading comprehension. J. Educ. Psychol. 74, 506–521. doi: 10.1037/0022-0663.74.4.506

CrossRef Full Text | Google Scholar

Bennink, C. D., and Spoelstra, T. (1979). Individual differences in field articulation as a factor in language comprehension. J. Res. Pers. 13, 480–489. doi: 10.1016/0092-6566(79)90010-2

CrossRef Full Text | Google Scholar

Birch, S. L., and Rayner, K. (1997). Linguistic focus affects eye movements during reading. Mem. Cogn. 25, 653–660. doi: 10.3758/BF03211306

PubMed Abstract | CrossRef Full Text | Google Scholar

Bleckley, M., Durso, F. T., Crutchfield, J. M., Engle, R. W., and Khanna, M. M. (2003). Individual differences in working memory capacity predict visual attention allocation. Psychon. Bull. Rev. 10, 884–889. doi: 10.3758/BF03196548

PubMed Abstract | CrossRef Full Text | Google Scholar

Boston, M. F., Hale, J., Kliegl, R., and Patil, U. (2008). Parsing costs as predictors of reading dificulty: An evaluation using the potsdam sentence corpus. J. Eye Mov. Res. 2, 1–12. doi: 10.16910/jemr.2.1.1

CrossRef Full Text | Google Scholar

Box, G. E. P., and Cox, D. R. (1964). An Analysis of Transformations. J. R. Stat. Soc. Series B Methodol. 26, 211–252. doi: 10.1111/j.2517-6161.1964.tb00553.x

CrossRef Full Text | Google Scholar

Braze, D. (2005). Fixation analysis (Version 3.01).

Google Scholar

Braze, D., Katz, L., Magnuson, J. S., Mencl, W. E., Tabor, W., Van Dyke, J. A., et al. (2016). Vocabulary does not complicate the simple view of reading. Read. Writ. 29, 435–451. doi: 10.1007/s11145-015-9608-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Braze, D., Mencl, W. E., Shankweiler, D. P., Tabor, W., and Schultz, A. (2006). “Skill-related differences in the online reading behavior of young adults: Evidence from eye-movements,” in Proceeding of the talk given at the 13th annual meeting of the society for the scientific study of reading, Vancouver.

Google Scholar

Braze, D., Mencl, W. E., Tabor, W., Pugh, K. R., Constable, R. T., Fulbright, R. K., et al. (2011). Unification of sentence processing via ear and eye: An fMRI study. Cortex 47, 416–431. doi: 10.1016/j.cortex.2009.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Braze, D., Tabor, W., Shankweiler, D. P., and Mencl, W. E. (2007). Speaking up for vocabulary: Reading skill differences in young adults. J. Learn. Disabil. 40, 226–243. doi: 10.1177/00222194070400030401

PubMed Abstract | CrossRef Full Text | Google Scholar

Brysbaert, M., and Stevens, M. (2018). Power analysis and effect size in mixed effects models: A tutorial. J. Cogn. 1:9. doi: 10.5334/joc.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Burnham, K. P., and Anderson, D. R. (2002). Model selection and multimodel inference: A practical information-theoretic approach, 2rd Edn. London: Springer-Verlag.

Google Scholar

Calvo, M. G. (2004). Relative contribution of vocabulary knowledge and working memory span to elaborative inferences in reading. Learn. Individ. Differ. 15, 53–65. doi: 10.1016/j.lindif.2004.07.002

CrossRef Full Text | Google Scholar

Calvo, M. G., Estevez, A., Dowens, M. G., and Calvo, M. G. (2003). Time course of elaborative inferences in reading as a function of prior vocabulary knowledge. Learn. Instr. 13, 611–631. doi: 10.1016/S0959-4752(02)00055-5

CrossRef Full Text | Google Scholar

Catts, H. W., Adlof, S. M., and Weismer, S. E. (2006). Language deficits in poor comprehenders: A case for the simple view of reading. J. Speech Lang. Hear. Res. 49, 278–293. doi: 10.1044/1092-4388(2006/023)

PubMed Abstract | CrossRef Full Text | Google Scholar

Chik, P., Ho, C., Yeung, P.-s, Wong, Y.-k, Chan, D., Chung, K., et al. (2010). Contribution of discourse and morphosyntax skills to reading comprehension in Chinese dyslexic and typically developing children. Ann. Dyslexia 62, 19–21. doi: 10.1007/s11881-011-0062-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Clifton, C., Traxler, M. J., Mohamed, M. T., Williams, R. S., Morris, R. K., and Rayner, K. (2003). The use of thematic role information in parsing: Syntactic processing autonomy revisited. J. Mem. Lang. 49, 317–334. doi: 10.1016/S0749-596X(03)00070-6

CrossRef Full Text | Google Scholar

Cohen, J. D., MacWhinney, B., Flatt, M., and Provost, J. (1993). PsyScope: An interactive graphic system for designing and controlling experiments in the psychology laboratory using Macintosh computers. Behav. Res. Methods Instr. Comput. 25, 257–271. doi: 10.3758/BF03204507

CrossRef Full Text | Google Scholar

Corkin, S. (1974). Serial-ordering deficits in inferior readers. Neuropsychologia 12, 347–354. doi: 10.1016/0028-3932(74)90050-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Daneman, M., and Carpenter, P. A. (1980). Individual differences in working memory and reading. J. Verbal Learn. Verbal Behav. 19, 450–466. doi: 10.1016/S0022-5371(80)90312-6

CrossRef Full Text | Google Scholar

Duffy, S. A., Morris, R. K., and Rayner, K. (1988). Lexical ambiguity and fixation times in reading. J. Mem. Lang. 27, 429–446. doi: 10.1016/0749-596X(88)90066-6

CrossRef Full Text | Google Scholar

Dunn, L. M., and Dunn, L. M. (1997). Peabody picture vocabulary test, 3rd Edn. Circle Pines, MN: American Guidance Service, Inc. doi: 10.1037/t15145-000

CrossRef Full Text | Google Scholar

Estevez, A., and Calvo, M. G. (2000). Working memory capacity and time course of predictive inferences. Memory 8, 51–61. doi: 10.1080/096582100387704

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferreira, F., and Clifton, C. (1986). The independence of syntactic processing. J. Mem. Lang. 25, 348–368. doi: 10.1016/0749-596X(86)90006-9

CrossRef Full Text | Google Scholar

Fox, J., and Weisberg, S. (2011). An R companion to applied regression, 2nd Edn. Thousand Oaks, CA: Sage.

Google Scholar

Frisson, S., and McElree, B. (2008). Complement coercion is not modulated by competition: Evidence from eye movements. J. Exp. Psychol. Learn. Mem. Cogn. 34, 1–11. doi: 10.1037/0278-7393.34.1.1

PubMed Abstract | CrossRef Full Text | Google Scholar

Fuchs, L. S., Fuchs, D., Hosp, M. K., and Jenkins, J. R. (2001). Oral reading fluency as an indicator of reading competence: A theoretical, empirical, and historical analysis. Sci. Stud. Read. 5, 239–256. doi: 10.1207/S1532799XSSR0503_3

CrossRef Full Text | Google Scholar

Gathercole, S. E., and Baddeley, A. D. (1989). Evaluation of the role of phonological STM in the development of vocabulary in children: A longitudinal study. J. Mem. Lang. 28, 200–213. doi: 10.1016/0749-596X(89)90044-2

CrossRef Full Text | Google Scholar

Gathercole, S. E., Service, E., Hitch, G. J., Adams, A. M., and Martin, A. J. (1999). Phonological short-term memory and vocabulary development: Further evidence on the nature of the relationship. Appl. Cogn. Psychol. 13, 65–77. doi: 10.1002/(SICI)1099-0720(199902)13:1<65::AID-ACP548>3.0.CO;2-O

CrossRef Full Text | Google Scholar

Gong, T., Zhang, M., and Li, C. (2022). Association of keyboarding fluency and writing performance in online-delivered assessment. Assess. Writ. 51:100575. doi: 10.1016/j.asw.2021.100575

CrossRef Full Text | Google Scholar

Gough, P. B., and Tunmer, W. E. (1986). Decoding, reading, and reading disability. Remedial Spec. Educ. 7, 6–10. doi: 10.1177/074193258600700104

CrossRef Full Text | Google Scholar

Gupta, P. (2006). Nonword repetition, phonological storage, and multiple determinations. Appl. Psycholinguist. 27, 564–568. doi: 10.1017/S0142716406260399

CrossRef Full Text | Google Scholar

Henrich, J., Heine, S. J., and Norenzayan, A. (2010). The weirdest people in the world? Behav. Brain Sci. 33, 61–83. doi: 10.1017/S0140525X0999152X

PubMed Abstract | CrossRef Full Text | Google Scholar

Hoover, W. A., and Gough, P. B. (1990). The simple view of reading. Read. Writ. 2, 127–160. doi: 10.1007/BF00401799

CrossRef Full Text | Google Scholar

Hulme, C., and Snowling, M. J. (2012). Learning to read: What we know and what we need to understand better. Child Dev. Perspect. 7, 1–5. doi: 10.1111/cdep.12005

PubMed Abstract | CrossRef Full Text | Google Scholar

Inhoff, A. W. (1984). Two Stages of word processing during eye fixations in the reading of prose. J. Verbal Learn. Verbal Behav. 23, 612–624. doi: 10.1016/S0022-5371(84)90382-7

CrossRef Full Text | Google Scholar

Johnston, T. C., and Kirby, J. R. (2006). The contribution of naming speed to the simple view of reading. Read. Writ. 19, 339–361. doi: 10.1007/s11145-005-4644-2

CrossRef Full Text | Google Scholar

Joseph, H. S. S. L., Nation, K., and Liversedge, S. P. (2013). Using eye movements to investigate word frequency effects in children’s sentence reading. Sch. Psychol. Rev. 42, 207–222. doi: 10.1080/02796015.2013.12087485

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, Y.-S., Wagner, R. K., and Foster, E. (2011). Relations among oral reading fluency, silent reading fluency, and reading comprehension: A latent variable study of first-grade readers. Sci. Stud. Read. 15, 338–362. doi: 10.1080/10888438.2010.493964

PubMed Abstract | CrossRef Full Text | Google Scholar

Kliegl, R., Grabner, E., Rolfs, M., and Engbert, R. (2004). Length, frequency, and predictability effects of words on eye movements in reading. Eur. J. Cogn. Psychol. 16, 262–284. doi: 10.1080/09541440340000213

CrossRef Full Text | Google Scholar

Kukona, A., Braze, D., Johns, C. L., Mencl, W. E., Van Dyke, J. A., Magnuson, J. S., et al. (2016). The real-time prediction and inhibition of linguistic outcomes: Effects of language and literacy skill. Acta Psychol. 171, 72–84. doi: 10.1016/j.actpsy.2016.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuperman, V., and Van Dyke, J. A. (2011). Effects of individual differences in verbal skills on eye-movement patterns during sentence reading. J. Mem. Lang. 65, 42–73. doi: 10.1016/j.jml.2011.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuperman, V., and Van Dyke, J. A. (2013). Reassessing word frequency as a determinant of word recognition for skilled and unskilled readers. J. Exp. Psychol. Hum. Percept. Perform. 39, 802–823. doi: 10.1037/a0030859

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuperman, V., Matsuki, K., and Van Dyke, J. A. (2018). Contributions of reader- and text-level characteristics to eye-movement patterns during passage reading. J. Exp. Psychol. Learn. Mem. Cogn. 44, 1687–1713. doi: 10.1037/xlm0000547

PubMed Abstract | CrossRef Full Text | Google Scholar

Kutner, M. H., Nachtsheim, C. J., and Neter, J. (2004). Applied linear regression models, 4th Edn. Burr Ridge: McGraw-Hill Irwin.

Google Scholar

Kuznetsova, A., Brockhoff, P. B., and Christiansen, R. H. B. (2017). lmerTest package: Tests in linear mixed-effects models. J. Stat. Softw. 82, 1–26. doi: 10.18637/jss.v082.i13

CrossRef Full Text | Google Scholar

Macaruso, P., and Shankweiler, D. P. (2010). Expanding the simple view of reading in accounting for reading skills in community college students. Read. Psychol. 31, 454–471. doi: 10.1080/02702710903241363

CrossRef Full Text | Google Scholar

Manis, F., and Freedman, L. (2001). “The relatiosnhip of naming to multiple reading measures in disabled and non-disabled normal readers,” in Dyslexia, fluency and the brain, ed. M. Wolf (Walmgate: York Press), 65–92.

Google Scholar

Marcotte, A. M., and Hintze, J. M. (2009). Incremental and predictive utility of formative assessment methods of reading comprehension. J. Sch. Psychol. 47, 315–335. doi: 10.1016/j.jsp.2009.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Markwardt, F. C. (1998). Peabody individual achievement test–revised. Circle Pines, MN: American Guidance Service, Inc. doi: 10.1037/t15139-000

CrossRef Full Text | Google Scholar

McCarron, S. P., and Kuperman, V. (2021). Is the author recognition test a useful metric for native and non-native English speakers? An item response theory analysis. Behav. Res. Methods 53, 2226–2237.

Google Scholar

Miller, J., and Schwanenflugel, P. (2008). A longitudinal study of the development of reading prosody as a dimension of oral reading fluency in early elementary school children. Read. Res. Q. 43, 336–354. doi: 10.1598/RRQ.43.4.2

PubMed Abstract | CrossRef Full Text | Google Scholar

Muggeo, V. (2008). Segmented: An r package to fit regression models with broken-line relationships. R. News 8, 20–25.

Google Scholar

Nash, H., and Heath, J. (2011). The role of vocabulary, working memory and inference making ability in reading comprehension in down syndrome. Res. Dev. Disabil. 32, 1782–1791. doi: 10.1016/j.ridd.2011.03.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Nelson Taylor, J., and Perfetti, C. A. (2016). Eye movements reveal readers’ lexical quality and reading experience. Read. Writ. 29, 1069–1103. doi: 10.1007/s11145-015-9616-6

CrossRef Full Text | Google Scholar

Ouellette, G., and Beers, A. (2010). A not-so-simple view of reading: How oral vocabulary and visual-word recognition complicate the story. Read. Writ. 23, 189–208. doi: 10.1007/s11145-008-9159-1

CrossRef Full Text | Google Scholar

Perfetti, C. A. (1985). Reading ability. New York, NY: Oxford University Press.

Google Scholar

Perfetti, C. A. (2007). Reading ability: Lexical quality to comprehension. Sci. Stud. Read. 11, 357–383. doi: 10.1080/10888430701530730

CrossRef Full Text | Google Scholar

Perfetti, C. A., and Hart, L. (2002). “The lexical quality hypothesis,” in Precursors of functional literacy, ed. L. Verhoeven (Philadelphia, PA: John Benjamins), 189–213. doi: 10.1075/swll.11.14per

PubMed Abstract | CrossRef Full Text | Google Scholar

Perfetti, C. A., and Lesgold, A. M. (1977). “Discourse comprehension and sources of individual differences,” in Cognitive processes in comprehension, eds M. A. Just and P. A. Carpenter (Hillsdale, NJ: Lawrence Erlbaum Assoc).

PubMed Abstract | Google Scholar

Peterson, R. A. (2001). On the use of college students in social science research: Insights from a second-order meta-analysis. J. Consum. Res. 28, 450–461. doi: 10.1086/323732

CrossRef Full Text | Google Scholar

Petscher, Y., and Kim, Y. S. (2011). The utility and accuracy of oral reading fluency score types in predicting reading comprehension. J. Sch. Psychol. 49, 107–129. doi: 10.1016/j.jsp.2010.09.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Pinheiro, J. C., and Bates, D. M. (2000). Mixed-effects models in S and S-PLUS. New York, NY: Springer-Verlag. doi: 10.1007/978-1-4419-0318-1

CrossRef Full Text | Google Scholar

Protopapas, A., and Kapnoula, E. C. (2016). Short-term and long-term effects on visual word recognition. J. Exp. Psychol. Learn. Mem. Cogn. 42, 542–565. doi: 10.1037/xlm0000191

PubMed Abstract | CrossRef Full Text | Google Scholar

Protopapas, A., Mouzaki, A., Sideridis, G. D., Kotsolakou, A., and Simos, P. G. (2013). The role of vocabulary in the context of the simple view of reading. Read. Writ. Q. 29, 168–202. doi: 10.1080/10573569.2013.758569

CrossRef Full Text | Google Scholar

Psychological Corporation (1999). Wechsler abbreviated scale of intelligence. San Antonio: Harcourt Brace & Co.

Google Scholar

Quené, H., and van den Bergh, H. (2008). Examples of mixed-effects modeling with crossed random effects and with binomial data. J. Mem. Lang. 59, 413–425. doi: 10.1016/j.jml.2008.02.002

CrossRef Full Text | Google Scholar

R Core Team (2013). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.

Google Scholar

Radach, R., and Kennedy, A. (2012). Eye movements in reading: Some theoretical context. Q. J. Exp. Psychol. 66, 429–452. doi: 10.1080/17470218.2012.750676

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K. (1998). Eye movements in reading and information processing: 20 years of research. Psychol. Bull. 124, 372–422. doi: 10.1037/0033-2909.124.3.372

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K. (2009b). Eye movements in reading: Models and data. J. Eye Mov. Res. 2, 1–10. doi: 10.16910/jemr.2.5.2

CrossRef Full Text | Google Scholar

Rayner, K. (2009a). Eye movements and attention in reading, scene perception, and visual search. Q. J. Exp. Psychol. 62, 1457–1506. doi: 10.1080/17470210902816461

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K., Abbott, M. J., and Plummer, P. (2015). “Individual differences in perceptual processing and eye movements in reading,” in Handbook of individual differences in reading: Reader, text and context, ed. P. Afflerbach (New York, NY: Informa UK Limited), 348–363.

Google Scholar

Rayner, K., and Duffy, S. A. (1986). Lexical complexity and fixation times in reading: Effects of word frequency, verb complexity, and lexical ambiguity. Mem. Cogn. 14, 191–201. doi: 10.3758/BF03197692

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K., and Pollatsek, A. (1989). The psychology of reading. Englewood Cliffs, NJ: Prentice Hall.

Google Scholar

Rayner, K., Carlson, M., and Frazier, L. (1983). The interaction of syntax and semantics during sentence processing: Eye movements in the analysis of semantically biased sentences. J. Verbal Learn. Verbal Behav. 22, 358–374. doi: 10.1016/S0022-5371(83)90236-0

CrossRef Full Text | Google Scholar

Rayner, K., Kambe, G., and Duffy, S. A. (2000). The effect of clause wrap-up on eye movements during reading. Q. J. Exp. Psychol. Hum. Exp. Psychol. 53a, 1061–1080. doi: 10.1080/713755934

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K., Pollatsek, A., Ashby, J., and Clifton, C. (2012). The psychology of reading. New York, NY: Psychology Press. doi: 10.4324/9780203155158

PubMed Abstract | CrossRef Full Text | Google Scholar

Rayner, K., Sereno, S. C., Morris, R. K., Schmauder, A. R., and Clifton, C. Jr. (1989). Eye movements and on-line language comprehension processes. Lang. Cogn. Process. 4, 21–49. doi: 10.1080/01690968908406362

CrossRef Full Text | Google Scholar

Reschly, A. L., Busch, T. W., Betts, J., Deno, S. L., and Long, J. D. (2009). Curriculum-based measurement oral reading as an indicator of reading achievement: A meta-analysis of the correlational evidence. J. Sch. Psychol. 47, 427–469. doi: 10.1016/j.jsp.2009.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Savage, R. (2006). Reading comprehension is not always the product of nonsense word decoding and linguistic comprehension: Evidence from teenagers who are extremely poor readers. Sci. Stud. Read. 10, 143–164. doi: 10.1207/s1532799xssr1002_2

CrossRef Full Text | Google Scholar

Shankweiler, D. P., and Crain, S. (1986). Language mechanisms and reading disorder: A modular approach. Cognition 24, 139–168. doi: 10.1016/0010-0277(86)90008-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Shankweiler, D. P., Lundquist, E., Katz, L., Stuebing, K. K., Fletcher, J. M., Brady, S., et al. (1999). Comprehension and decoding: Patterns of association in children with reading difficulties. Sci. Stud. Read. 3, 69–94. doi: 10.1207/s1532799xssr0301_4

CrossRef Full Text | Google Scholar

Shankweiler, D. P., Mencl, W. E., Braze, D., Tabor, W., Pugh, K. R., and Fulbright, R. K. (2008). Reading differences and brain: Cortical integration of speech and print in sentence processing varies with reader skill. Dev. Neuropsychol. 33, 745–776. doi: 10.1080/87565640802418688

PubMed Abstract | CrossRef Full Text | Google Scholar

Share, D. L. (1995). Phonological recoding and self-teaching: Sine qua non of reading acquisition. Cognition 55, 151–218. doi: 10.1016/0010-0277(94)00645-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Silverman, R. D., Speece, D. L., Harring, J. R., and Ritchey, K. D. (2012). Fluency has a role in the simple view of reading. Sci. Stud. Read. 17, 108–133. doi: 10.1080/10888438.2011.618153

CrossRef Full Text | Google Scholar

Snow, C. E. (2002). Reading for understanding: Toward a research and development program in reading comprehension. Santa Monica, CA: RAND.

Google Scholar

Stanovich, K. E., and Cunningham, A. E. (1992). Studying the consequences of literacy within a literate society: The cognitive correlates of print exposure. Mem. Cogn. 20, 51–68. doi: 10.3758/BF03208254

PubMed Abstract | CrossRef Full Text | Google Scholar

Swart, N. M., Muijselaar, M., Steenbeek-Planting, E. G., Droop, M., de Jong, P. F., and Verhoeven, L. (2016). Differential lexical predictors of reading comprehension in fourth graders. Read. Writ. 30, 489–507. doi: 10.1007/s11145-016-9686-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Tilstra, J., McMaster, K., Van den Broek, P., Kendeou, P., and Rapp, D. (2009). Simple but complex: Components of the simple view of reading across grade levels. J. Res. Read. 32, 383–401. doi: 10.1111/j.1467-9817.2009.01401.x

CrossRef Full Text | Google Scholar

Traxler, M. J. (2007). Working memory contributions to relative clause attachment processing: A hierarchical linear modeling analysis. Mem. Cognit. 35, 1107–1121. doi: 10.3758/BF03193482

PubMed Abstract | CrossRef Full Text | Google Scholar

Tunmer, W. E., and Chapman, J. W. (2012). The simple view of reading redux: Vocabulary knowledge and the independent components hypothesis. J. Learn. Disabil. 45, 453–466. doi: 10.1177/0022219411432685

PubMed Abstract | CrossRef Full Text | Google Scholar

Valle, A., Binder, K. S., Walsh, C. B., Nemier, C., and Bangs, K. E. (2013). Eye movements, prosody, and word frequency among average-and high-skilled second-grade readers. Sch. Psychol. Rev. 42, 171–190. doi: 10.1080/02796015.2013.12087483

CrossRef Full Text | Google Scholar

Vasishth, S., von, der Malsburg, T., and Engelmann, F. (2013). What eye movements can tell us about sentence comprehension. Wiley Interdiscip. Rev. Cogn. Sci. 4, 125–134. doi: 10.1002/wcs.1209

PubMed Abstract | CrossRef Full Text | Google Scholar

Wagner, R. K., Torgesen, J. K., Rashotte, C. A., and Pearson, N. A. (2010). Test of silent reading efficiency and comprehension. Austin, TX: Pro-Ed.

Google Scholar

Wang, Z., Sabatini, J., O’Reilly, T., and Weeks, J. (2019). Decoding and reading comprehension: A test of the decoding threshold hypothesis. J. Educ. Psychol. 111, 387–401. doi: 10.1037/edu0000302

CrossRef Full Text | Google Scholar

Warren, T., White, S. J., and Reichle, E. D. (2009). Investigating the causes of wrap-up effects: Evidence from eye movements and E-Z Reader. Cognition 111, 132–137. doi: 10.1016/j.cognition.2008.12.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Wiederholt, J. L., and Bryant, B. R. (2001). Gray oral reading test (GORT), 4th Edn. Austin, TX: Pro-Ed.

Google Scholar

Wolf, M., O’Rourke, G. A., Gidney, C., Lovett, M., Cirino, P., and Morris, R. K. (2002). The second deficit: An investigation of the independence of phonological and naming-speed deficits in developmental dyslexia. Read. Writ. 15, 43–72. doi: 10.1023/A:1013816320290

CrossRef Full Text | Google Scholar

Woodcock, R. W., McGrew, K. S., and Mather, N. (2001). Woodcock-johnson III tests of achievement. Itasca, IL: Riverside Publishing.

Google Scholar

Wurm, L. H., and Fisicaro, S. A. (2014). What residualizing predictors in regression analyses does (and what it does not do). J. Mem. Lang. 72, 37–48. doi: 10.1016/j.jml.2013.12.003

CrossRef Full Text | Google Scholar

Keywords: eye-movement, lexical properties, individual differences, mixed-effects regression, segmented linear regression

Citation: Gong T and Shuai L (2023) Segmented relations between online reading behaviors, text properties, and reader–text interactions: An eye-movement experiment. Front. Psychol. 13:1006662. doi: 10.3389/fpsyg.2022.1006662

Received: 29 July 2022; Accepted: 19 December 2022;
Published: 11 January 2023.

Edited by:

Marijan Palmovic, University of Zagreb, Croatia

Reviewed by:

Xiaolu Wang, Zhejiang University City College, China
Yuxia Wang, Shanghai Jiao Tong University, China

Copyright © 2023 Gong and Shuai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tao Gong, www.frontiersin.org gtojty@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.