Preference for Object Relative Clauses in Chinese Sentence Comprehension: Evidence From Online Self-Paced Reading Time

Xu, Kunyu; Duann, Jeng-Ren; Hung, Daisy L.; Wu, Denise H.

doi:10.3389/fpsyg.2019.02210

ORIGINAL RESEARCH article

Front. Psychol., 01 October 2019

Sec. Psychology of Language

Volume 10 - 2019 | https://doi.org/10.3389/fpsyg.2019.02210

Preference for Object Relative Clauses in Chinese Sentence Comprehension: Evidence From Online Self-Paced Reading Time

Kunyu Xu¹

Jeng-Ren Duann^1,2

Daisy L. Hung^1,3

Denise H. Wu^1*

¹Institute of Cognitive Neuroscience, National Central University, Taoyuan, Taiwan
²Institute for Neural Computation, University of California, San Diego, La Jolla, CA, United States
³College of Humanities and Social Sciences, Taipei Medical University, Taipei, Taiwan

Most prior studies have reported that subject-extracted relative clauses (SRCs) are easier to process than object-extracted relative clauses (ORCs). However, whether such an SRC preference is universal across different languages remains an open question. Several reports from Chinese have provided conflicting results; thus, in the present study, we conducted two self-paced reading experiments to examine the comprehension of Chinese relative clauses. The results demonstrated a clear ORC preference that Chinese ORCs were easier to comprehend than Chinese SRCs. These findings were most compatible with the prediction of the integration cost account, which claims that the processing difference between SRCs and ORCs arises at the point of dependency formation. The ORC preference in Chinese poses a challenge to the universality of the SRC preference assumed by the structural distance hypothesis and highlights the values of cross-linguistic research.

Introduction

Empirical cross-linguistic research on different aspects of sentence processing has provided evidence for the universality and specificity of linguistic processing mechanisms. Researchers investigating cross-linguistic syntactic processing have studied relative clause (RC) structures. Because most languages have RC sentences, such materials provide an opportunity to investigate the universality of the processing mechanisms across languages (Keenan and Comrie, 1977).

There are different types of RCs in languages. Based on the syntactic role of the head noun being modified by RCs, RCs are mainly classified into subject-extracted relative clauses (SRCs) and object-extracted relative clauses (ORCs). SRCs and ORCs differ minimally in the surface form but substantially in the syntactic structure. In the previous literature, research on a wide range of languages has frequently found that there is a preference for SRCs over ORCs when RCs contain full noun phrases (NPs), such as English (e.g., King and Just, 1991; King and Kutas, 1995; Stromswold et al., 1996; Gibson, 1998; Traxler et al., 2002; Grodner and Gibson, 2005), Dutch (e.g., Frazier, 1987; Mak et al., 2002), French (e.g., Holmes and O’Regan, 1981), and German (e.g., Mecklinger et al., 1995). In English, one exception to the general SRC advantage is when the RC contains a personal pronoun (e.g., ‘the people that you like’ vs. ‘the people that like you’). The less common preference for ORCs over SRCs in this case might be due to the higher frequency of the former than the latter structure (Reali and Christiansen, 2007). Findings against the SRC preference in some languages such as Chinese (e.g., Hsiao and Gibson, 2003; Chen et al., 2008; Gibson and Wu, 2013; Sung et al., 2016) and Basque (Carreiras et al., 2010) have also been reported. Even within the same language, such as Chinese, the preference for SRCs (e.g., Lin and Bever, 2006) or ORCs (as cited above) is still paradoxical as shown in conflicting results. Considering that Chinese is a language with the unique combination of subject-verb-object word order and a head-final property (e.g., Dryer, 1992; Lin, 2008; Wu et al., 2012), revealing its RC processing preference is important to examine the universality of the SRC preference that is dominant in the literature.

Theoretical Accounts for Relative Clause Processing

In parallel with the perplexing results observed in behavioral experiments, various theoretical accounts also make different predictions about Chinese RC processing preference.

Structural Distance Account

According to the structural distance account put forward by O’Grady (1997), the longer the structural distance between the filler and the gap is, the more complex the sentence is. During the parsing process, the structural distance is defined as the number of syntactic nodes/projections that intervene between the filler and the gap in the syntactic tree. As shown in Figure 1, in an SRC, the subject-gap position is within the inflection phrase (IP), whereas in an ORC the object-gap is embedded in the verb phrase (VP), which is deeper than the IP in the syntactic tree. Thus, there are more syntactic nodes (NP, NP, CP, IP, VP, NP) intervening between the gap e and the head noun (i.e., the student) in the ORC than the nodes (NP, NP, CP, IP, NP) in the SRC.

FIGURE 1

Figure 1. The syntactic structures of an example of English SRC and ORC sentences. As shown in the figure, there are more syntactic nodes (NP, NP, CP, IP, VP, NP) intervening between the gap e and the head noun (i.e., the student) in the ORC (B) than the nodes (NP, NP, CP, IP, NP) in the SRC (A), meaning that the English ORC is predicted to be more difficult to process than the English SRC by the structural distance account.

The structural distance account predicts that the sentences with a larger number of syntactic nodes are more difficult to process than the sentences with a smaller number of syntactic nodes in a syntactic tree. As a result, an SRC preference is predicted for English. Moreover, this account assumes that the underlying syntactic structure is universal across languages. Thus, a preference for SRCs over ORCs is hypothesized for all languages. Following this account, in a Chinese ORC the distance (N, NP, CP, IP, VP, N) between the head noun (i.e., tongxue) and the object gap e is greater, hence structurally deeper, than the distance (N, NP, CP, IP, N) between the same head noun and the subject gap e in a Chinese SRC as illustrated in Figure 2.

FIGURE 2

Figure 2. The syntactic structures of an example of Chinese SRC and ORC sentences. As shown in the figure, there are more syntactic nodes (N, NP, CP, IP, VP, N) between the head noun (i.e., tongxue) and the object gap e in the Chinese ORC (B), thus a deeper structure, than the nodes (N, NP, CP, IP, N) between the same head noun and the subject gap e in the Chinese SRC (A), hence a preference for Chinese SRCs over ORCs is hypothesized by the structural distance account.

Memory-Based Account

One prominent example among the memory-based accounts is the Dependency Locality Theory (DLT) proposed by Gibson (1998). According to the DLT, processing difficulty is mainly due to working memory constraints on sentence comprehension. Specifically, the DLT claims that a sentence is parsed based on two metrics: the storage cost and the integration cost. The storage cost is related to the process of maintaining temporarily incomplete dependencies during sentence processing. The integration cost, on the other hand, is related to the process of establishing connections between the incoming words and the current syntactic structure (Gibson, 1998). The processing difficulty is assumed to increase with a large number of new discourse references that intervene between the element that is currently being processed and the elements with which a syntactic dependency has to be built. For English, both the storage and the integration costs are greater in an ORC than in an SRC. Specifically, incomplete head-dependencies in an ORC are more distant than those in an SRC. Therefore, the storage demand to keep track of the syntactic heads that are needed to form a grammatical sentence is greater in the former than in the latter case. For syntactic integration, on the other hand, the object-subject-verb word order of English ORCs demands non-local integration (see also Gordon and Lowder, 2012), resulting in a greater integration cost in processing English ORCs than SRCs. That is, an SRC preference is hypothesized for English by both the storage and integration costs of the DLT.

Different from the prediction of an SRC preference for English, an ORC preference is predicted for Chinese according to the DLT. During the comprehension of the Chinese SRC, rènshi Zhāngsān de sījī (‘The driver who knew Zhangsan’), the three syntactic heads before the noun sījī (i.e., rènshi, Zhāngsān and de) need to be stored in working memory. On the other hand, only one predicted head is required during the comprehension of the Chinese ORC, Zhāngsān rènshi de sījī (‘The driver who Zhangsan knew’). Thus, a smaller number of temporarily incomplete dependences results in a lower storage cost in a Chinese ORC than a Chinese SRC. As for the integration cost, when the relativizer de and the head noun are encountered, a greater processing demand is required to complete the integration across a longer filler-gap distance in an SRC whose word order is non-canonical (object-verb-subject), compared with that in an ORC whose word order is canonical (subject-verb-object). Therefore, according to both the storage and integration costs of the DLT, SRCs should be more difficult to comprehend than ORCs in Chinese.

Experience/Frequency-Based Accounts

The experience-based account, proposed by Mitchell et al. (1995), suggests that the human sentence parser is experience-based, and the relative frequency of each type of relative clauses determines its relative processing difficulty. For English, some corpus statistics have found that SRCs with full NPs are more frequent than ORCs with full NPs; thus, the latter structure is more difficult to process than the former structure (Roland et al., 2007). MacDonald and Christiansen (2002) employed a Simple Recurrent Network (SRN) model to investigate the importance of learning or experience to RC comprehension. The results indicated that distributional constraints have effects on comprehension of SRCs and ORCs in English. That is, processing of SRCs (with full NPs) benefited from many simple transitive sentences that shared the overwhelmingly frequent word order (subject-verb-object), while processing of ORCs benefited almost exclusively from direct experience with ORCs themselves. Based on such results, Wells et al. (2009) further manipulated participants’ reading experience on RC construction and found that the reading pattern of RCs strikingly resembled the results from MacDonald and Christiansen’s (2002) computational model, showing reduced reading times more for ORCs than SRCs after equivalent exposure to the respective RC structures. Besides, Reali and Christiansen (2007) conducted a series of behavioral experiments to examine the role of experience in sentence processing by investigating the processing difference between pronominal SRCs and ORCs. They found that highly frequent pronominal ORCs, as revealed by a large-scale corpus analysis, were indeed easier to process than pronominal SRCs when the embedded pronoun was personal. These results together highlighted the contribution of experience to RC processing in mature readers and provided strong support for experience-based approaches (see Christiansen and Chater, 2016 for more details).

For Chinese, corpus studies have reported that Chinese SRCs are more frequent than Chinese ORCs, thus predicting an SRC advantage in Chinese (Pu, 2007; Vasishth et al., 2013). However, Hsiao and MacDonald (2013) adopted a similar SRN model as that employed in MacDonald and Christiansen (2002) and found that sentence difficulty might not be simply determined by sentence frequency. Specifically, they found that ORCs yielded fewer errors than SRCs did, especially at the head noun region. Although such results were not predicted by the frequency-based accounts, they were consistent with previous studies that reported faster reading times for ORCs than SRCs (e.g., Hsiao and Gibson, 2003; Gibson and Wu, 2013).

Previous Findings of the Processing Preference for Chinese Relative Clauses

The first study to explore the Chinese RC preference was conducted by Hsiao and Gibson (2003). They found that ORCs were easier to comprehend than SRCs when both types of RCs were at the subject-modifying position, particularly being reflected in the self-paced reading time of the embedded clause region. These results are regarded as evidence for the storage cost account of the DLT. On the contrary, Lin and Bever (2006) claimed that there was no significant preference between the reading of Chinese SRCs and ORCs at the subject-modifying position. Rather, they reported an SRC preference only at the object-modifying position. However, as Gibson and Wu (2013) pointed out, the SRC preference observed for object-modifying RCs might be due to local syntactic ambiguity (see similar ERP findings in Bulut et al., 2018). To overcome this potential problem, Gibson and Wu (2013) designed a disambiguating preceding context to minimize the garden path effect in sentences. Consistent with their earlier results (Hsiao and Gibson, 2003), SRCs were read significantly more slowly than ORCs, which again was in line with the prediction of the DLT. Vasishth et al. (2013) later attempted to replicate the results from Hsiao and Gibson (2003) and Gibson and Wu (2013) by conducting three self-paced reading experiments. Similar to the conflicting results in the literature, two of their experiments showed the SRC preference, while the other one presented the ORC preference. Based on a meta-analysis of 15 previous RC studies in Chinese (including their own experiments), Vasishth et al. (2013) claimed that the SRC preference was more dominant than the ORC preference. Thus, their results were against the predictions of the DLT but supported the experience-based account. In addition to syntactic structure and memory demands, the factor of thematic order was also found to affect Chinese RC processing. Lin (2014) employed similar materials with preceding disambiguating contexts as in Gibson and Wu (2013) and found that the comprehension of Chinese RCs was sensitive to the thematic role orders both in the preceding discourse context and in the subsequent RC. Specifically, the PATIENT-AGENT-action order of a passive sentence in the preceding discourse context (e.g., lìngyíwèi zhùhù zé bèi zhèwèi fángdōng chǎoxǐngle, ‘the other tenant was woken up by the landlord’) did not facilitate either the action-PATIENT-AGENT order of an SRC (e.g., chǎoxǐng fángdōng de zhùhù, ‘the tenant that woke up the landlord’) or the AGENT-action-PATIENT order of an ORC (e.g., fángdōng chǎoxǐng de zhùhù, ‘the tenant that the landlord woke up’). However, when the preceding discourse context (e.g., zhèwèi fángdōng zé chǎoxǐngle lìngyíwèi zhùhù, ‘the landlord then woke up the other tenant’) presented a thematic order consistent with the subsequent ORC (e.g., fángdōng chǎoxǐng de zhùhù, ‘the tenant that the landlord woke up’), the Chinese ORC was read faster than the Chinese SRC.

In addition to findings from mature readers, the processing of RCs during language acquisition can also shed light on the processing asymmetry between SRCs and ORCs. The results from English- and German-speaking children were consistent with previous studies in showing an advantage for SRCs over ORCs (Diessel and Tomasello, 2005). However, the disadvantage of ORCs was found to be mitigated and even eliminated when the subject was pronominal and the direct object was inanimate (Kidd et al., 2007). For Chinese, similar to the controversial findings from adults, the results from children were also mixed. Chang (1984) used an act-out task to test preschool Mandarin-speaking children on their comprehension of different types of RCs, but the results pointed to neither an SRC or ORC advantage. Hsu et al. (2009) then used an elicited production task and found that Chinese SRCs were easier to comprehend than ORCs for children. However, Su (2006) failed to identify a clear SRC advantage with the same task. Afterward, Chan et al. (2011) adopted a picture-pointing task and reported an ORC preference during children’s acquisition of RC structures. Despite the recent behavioral and developmental research on Chinese RC processing, the processing preference between SRCs and ORCs remains undetermined.

Neurophysiological Studies on Relative Clause Processing

The advances of techniques have allowed new approaches to examine the processing difficulty of SRCs and ORCs. Neurophysiological measurements provided the evidence in support of an SRC advantage in English by showing that the processing of both written and spoken ORC sentences elicited greater negative waveforms than SRC sentences at the ‘gap’ position of the sentence in single-word ERPs, while only the SRC but not ORC sentences elicited a slow frontal positivity at the multiword level, which was interpreted as an index of ease of processing or integration (King and Kutas, 1995; Muller et al., 1997). On the other hand, Weiss et al. (2005) examined large-scale oscillatory activity during the comprehension of English SRCs and ORCs by using EEG coherence analysis and also found a continuous higher coherence for ORCs than SRCs in the theta, beta and gamma frequency bands, which were suggested to be associated with memory processes, attentional effort and semantic-pragmatic integration, respectively. In contrast with the generally consistent findings of an SRC advantage in English, the relatively scarce evidence on Chinese RC processing seems to support an ORC advantage. The neurophysiological studies have identified greater ERP components elicited by SRCs than ORCs (Yang and Perfetti, 2006; Packard et al., 2010; Yang et al., 2010), though the time windows of the observed effects might vary in different studies (see Bulut et al., 2018, for a summary).

The Current Study

Following the earlier research on the RC processing preference, an increasing number of empirical studies have examined the factors that influence the reading of Chinese RCs, including thematic order (e.g., Gibson and Wu, 2013; Lin, 2014), animacy (e.g., Wu et al., 2012; He and Chen, 2013), relative frequency occurrence (e.g., Vasishth et al., 2013), discourse context (e.g., Yang and Perfetti, 2006; Gibson and Wu, 2013), and working memory (e.g., Hsiao and Gibson, 2003). Furthermore, the compounding effects caused by different aspects of language (i.e., syntax, semantics, and pragmatics) might reflect multiple mechanisms that contribute to the RC processing preference. In the present study, to determine the contribution of the syntactic structure to the RC processing, we focused on the basic form of Chinese RC sentences (as shown below) by employing the self-paced reading task. The self-paced reading task has been widely used to observe online reading time and offline performance to probe questions in sentence comprehension (e.g., Just et al., 1982; King and Just, 1991). As introduced above, the structured-based and experience-based accounts predict an SRC advantage in Chinese, while the DLT proposes an ORC advantage in Chinese. It is expected that the processing time for the embedded clause region of SRCs should be significantly longer than that of ORCs according to the storage cost account of the DLT. On the other hand, longer processing time on the relativizer and/or the head noun of SRCs than that of ORCs is predicted by the integration cost account of the DLT. We originally conducted three self-paced reading experiments to examine these predictions with subject- and object-modifying RCs separately. Because the reading preference of subject-modifying RCs predicted by the integration cost accounts such as the DLT would be identical to that resulting from ambiguity resolution, however, the conclusion from subject-modifying RCs would be less interpretable than that from object-modifying RCs. Therefore, we reported the findings from two experiments on object-modifying RCs in the main text below, while describing the results from one experiment on subject-modifying RCs in the Appendix.

Experiment 1

Experiment 1 was designed to investigate the processing preference between Chinese SRCs and ORCs at the object-modifying position.

Method

Participants

Forty-six native Chinese-speaking students (26 females, aged from 20 to 28 years) from National Central University participated in this experiment. All participants reported being right-handed with a normal or corrected-to-normal vision. The study was carried out in line with the recommendations of the Social and Behavioral Research Ethical Principles and Regulations of National Taiwan University. All participants gave written informed consent in accordance with the Declaration of Helsinki. The protocol was approved by the Research Ethics Committee of National Taiwan University.

Stimuli

Sixty-four pairs of Chinese SRCs and ORCs were constructed as the examples shown in (1a) and (1b). Each sentence contained two animate nouns or noun phrases, which were equally likely to be the patient or agent of the verb so that they were semantically reversible. Therefore, it was necessary to map thematic roles to the arguments for syntactic processing. Besides, a separate group of 20 students, who were naive to the purpose of the study and did not take part in any of the experiments, was asked to rate the naturalness of these sentences. The rating result showed that there was no significant difference in naturalness between the SRC and ORC conditions [t₁(19) = 1.311, p = 0.205; t₂(126) = 0.762, p = 0.448]. In addition to the target sentences, another 64 filler sentences with various structures (e.g., bàba zuótiān maile hìnduō haochī de shuiguo, ‘yesterday the father bought a lot of delicious fruits’) were also created as a control condition. All the pairs of SRCs and ORCs were evenly divided into two lists with an equal number of each type of sentences appearing in each list. Half of the participants received one list of the sentences, while the other half of the participants received the other list, so that one did not encounter both SRCs and ORCs in a pair. The order of the sentences in a sub-list was completely randomized across participants.

(1) The example of stimuli in Experiment 1

(a) Subject-extracted relative clause (SRC)

www.frontiersin.org

‘The photographer framed the driver who knew Zhangsan.’

(b) Object-extracted relative clause (ORC)

www.frontiersin.org

‘The photographer framed the driver who Zhangsan knew.’

Procedure

Each participant was individually tested in a quiet and appropriately illuminated room. The whole experiment included four sessions (each contained eight target sentences for each type), and between sessions, participants could take a break. For each trial, a fixation cross first appeared on the center of the screen until the participants pressed the space bar to read the following sentence. Each sentence was divided into six frames, each of which contained one to four characters, appearing in the center of the screen. Participants were instructed to carefully read the sentences, proceeding by pressing the space bar on a computer keyboard at their own pace. The amount of time participants spent on each frame was recorded as the time between key-presses (RT1, RT2, RT3, RT4, RT5, RT6, as shown in Figure 3). After the final word of each sentence, a true/false comprehension question about the preceding sentence appeared on the computer screen. Feedback was displayed immediately when participants made their responses by pressing ‘F’ or ‘J’ to indicate true or false, respectively. Among all the sentences, the correct answers for half of the comprehension questions were true, while the correct answers for the other half were false. Afterward, a reminder to prompt participants to press the space bar to proceed to the next trial was shown in the center of the screen. Prior to the formal experiment, a practice with eight sentences was conducted to make sure that participants were familiar with the procedure. Stimuli presentation and data collection were programed via Python¹.

FIGURE 3

Figure 3. The experiment paradigm of Experiment 1. Each sentence was divided into six frames, each of which contained one to four characters, appearing in the center of the screen. Participants were instructed to carefully read the sentences, proceeding by pressing the space bar on a computer keyboard at their own pace. The amount of time the participants spent on each frame was recorded as the time between key-presses (RT1, RT2, RT3, RT4, RT5, RT6). After the presentation of the sentence, the participants were asked to make a true/false judgment in response to the probe question to determine whether they comprehend the meaning of the sentence.

Statistical Analysis

The online reading time of each frame of the sentences with correct responses to the offline comprehension questions was analyzed with the R software (R Development Core Team, 2011) using a mixed-effects model. The model was fit with the lme4 package (Bates et al., 2014) and the lmerTest package (Kuznetsova et al., 2014). The formula in R was

reading time \sim sentence type + (1 | subject) + (1 | item)

The effect size of each significant difference, indicated by Cohen’s d, was further calculated based on the suggestions from Brysbaert and Stevens (2018). In addition to the word-by-word reading time, the total reading time of the two words in the RC structure was also obtained for further comparison. That is, the reading time of W3 + W4 was compared between the SRC and the ORC conditions (see similar practice in Gibson and Wu, 2013 and Vasishth et al., 2013). For the accuracy and reaction time of the offline comprehension questions, the conventional analysis of variance (ANOVA) was employed to determine the statistical relationship across sentence types.

Results

Offline Comprehension Question Performance

The mean accuracy and reaction time (only from accurate responses and also after removal of the outliers beyond two standard deviations around the mean for each condition) of each sentence type were depicted in Figure 4. Generally, the comprehension accuracy of all trials was 94.72%, and that of target sentences was 93.72%. The accuracy difference between different types of sentences was significant in the by-subject analysis but not in the by-item analysis [F₁(2,135) = 5.045, MSE = 29.050, p = 0.008; F₂(2, 189) = 2.553, MSE = 79.882, p = 0.081]. Pairwise comparisons with Bonferroni correction showed that accuracy of filler sentences (96.74%) was significantly higher than that of the SRCs (93.34%, p = 0.009), and marginally higher than that of the ORCs (94.08%, p = 0.060), but there was no significant difference between the SRC and the ORC conditions (p = 1.000).

FIGURE 4

Figure 4. The comprehension performance of the RCs and the filler sentences. The results showed that the accuracy in the filler sentences was significantly higher than that in the SRC condition, but only numerically higher than that in the ORC condition. Moreover, there was also no significant accuracy difference between the SRC and the ORC conditions. On the other hand, the reaction time in answering the probe question of the filler sentence was significantly faster than that in the SRC and the ORC conditions, but no significant difference of the reaction time was found between SRCs and ORCs. The error bar indicates the 95% confidence interval. ^∗p < 0.05.

For the reaction time, the ANOVA found a significant difference among the three sentence types [F₁(2,135) = 18.262, MSE = 0.259, p < 0.001; F₂(2,121.237) = 38.172, MSE = 0.216, p < 0.001). Pairwise comparisons with Bonferroni correction showed that answering questions following filler sentences (1408 ms) was significantly faster than answering questions following the SRC sentences (2004 ms) and the ORC sentences (1912 ms) in both the by-subject and the by-item analysis (all ps < 0.001). Although the mean reaction time was numerically faster in the ORC sentences than the SRC sentences, the difference between them was far from significance (ps > 0.685).

Online Reading Time Performance

Only those sentences whose comprehension questions were answered correctly were included for further analysis. The mean reading time of each frame and each type of target sentences after removal of outliers beyond two standard deviations around the mean in each condition (which included 5.3% of the raw data) was depicted in Figure 5 below. A mixed-effects model with sentence types as the fixed factor and subjects and items being modeled for random intercepts was applied to examine the significance of the effects of sentence types, different frames, and their interaction. The results showed that in addition to a significant main effect for frames (β = 0.04, SE = 0.002, t = 25.991, p < 0.001, Cohen’s d = 0.1014) and sentence types (β = −0.01, SE = 0.005, t = −2.737, p = 0.006, Cohen’s d = 0.0362), a significant interaction between sentence types and frames was also observed (β = 0.02, SE = 0.001, t = 19.11, p < 0.001, Cohen’s d = 0.0398). The significant reading preference for ORCs over SRCs was found in the embedded clause region (i.e., W3, β = −0.04, SE = 0.01, t = −3.065, p = 0.002, Cohen’s d = 0.0882; W4, β = −0.04, SE = 0.01, t = −2.693, p = 0.007, Cohen’s d = 0.0785). When combining the W3 and W4 as the whole one to represent the performance in the embedded clause region, we also observed a significant difference between SRCs and ORCs (i.e., W3 + W4, β = −0.04, SE = 0.01, t = −3.602, p = 0.0003, Cohen’s d = 0.0920). No statistical significance was observed in any other regions (W1, β = −0.0004, SE = 0.01, t = −0.045, p = 0.964; W2, β = −0.01, SE = 0.009, t = −1.183, p = 0.237; W5, β = 0.001, SE = 0.02, t = 0.088, p = 0.93; W6, β = −0.005, SE = 0.02, t = −0.271, p = 0.786). Further, the reading time differences between SRCs and ORCs in each session were depicted in Figure 6. The results indicated that no significant main effect was found in sessions (β = 0.02, SE = 0.02, t = 0.933, p = 0.351) and there was also no significant interaction between sentence types, frames and sessions (β = 0.002, SE = 0.003, t = 0.799, p = 0.424).

FIGURE 5

Figure 5. The reading time of each frame in the SRC and ORC conditions. Significantly longer reading time was found in the embedded clause region of the SRC condition compared with that of the ORC condition, which supports an ORC preference in Chinese. The error bar indicates the 95% confidence interval.

FIGURE 6

Figure 6. The reading time of each frame in the SRC and ORC conditions in each session. The results indicate that there is no significant preference difference across sessions, suggesting that the observed ORC preference does not develop due to exposure. The error bar indicates the 95% confidence interval.

Discussion

The results of Experiment 1 clearly showed a processing preference for ORCs over SRCs, which is in line with previous findings (e.g., Yang and Perfetti, 2006; Packard et al., 2010; Bulut et al., 2018). The observed ORC preference was also presented consistently across sessions, implying that the ORC preference might not develop due to exposure. Moreover, the results indicated a significantly faster reading time at the embedded clause region of the SRC condition compared to that of the ORC condition. One probable reason for this finding is the greater number of incomplete dependencies that need to be maintained in memory at the encounter of the embedded clause in the SRC than the ORC conditions, as proposed by the storage account of the DLT. Another potential reason might be due to the local ambiguity encountered at the embedded clause region, particularly in the SRC condition. Specifically, in the SRC condition, when participants encounter two verbs in successive W2 and W3 regions, there is temporarily ambiguous between an RC analysis and a main-clause analysis with a sequence of two verbs. On the other hand, in the ORC condition, only the interpretation of an RC analysis is activated. Thus, the additional demand is presumed to be required for this disambiguation in the SRC condition. It is not until the relativizer (i.e., de) in the SRC sentence is presented, which signals the reading of an RC structure, participants then realize that W3 is the embedded verb in an RC rather than the matrix verb. In other words, due to the cost for ambiguity resolution in the reading of SRCs, the significant difference of the reading time in SRCs and ORCs is more likely to be observed at the embedded clause region. To verify this speculation, we further conducted Experiment 2 to examine participants’ comprehension of SRCs and ORCs with an aspectual word ‘le’ added to the matrix verb.

Experiment 2

One may concern that the observed ORC advantage in Experiment 1 might be (at least partially) attributed to increased ambiguity, hence processing load, when encountering two successive verbs in the SRC sentences. To minimize this concern, an aspectual marker ‘le’ was added to the matrix verb of Chinese object-modifying RC sentences. Moreover, the number of target sentences was reduced from 64 to 24 pairs to minimize the syntactic priming/learning effect if any and further examine whether the preference was still there. Besides, we excluded the usage of proper names as the noun phrases in the target sentences to avoid any difference that might be caused by specific properties of different kinds of noun phrases.

Participants, Stimuli, and Procedure

A different group of 25 native Chinese-speaking students (10 females, aged from 21 to 27 years) from National Central University was recruited to determine the processing preference of Chinese RCs in this experiment. The same ethical standards as in Experiment 1 were applied. In this experiment, 24 pairs of RCs (24 SRCs, 24 ORCs) were created based on the materials of Experiment 1 (as shown in the example (2a) and (2b) below). The naturalness ratings from an independent group of 20 participants showed that there was no significant difference between the target sentences [t₁(19) = −1.143, p = 0.267, t₂(46) = 0.696, p = 0.490]. One might argue that some filler sentences with the canonical SVO structure, which shared the same word order with the ORC structure, might partially create the structural priming effect to drive the ORC advantage. To remove this potential concern, we selected 24 filler sentences with equal numbers of sentences starting with the Verb-Noun (VN) combination and sentences beginning with the Noun-Verb (NV) combination as a control condition. Among these 24 filler sentences, the first two words of eight sentences were the NV combination, which resembled the structure of ORCs, while the first two words of another eight filler sentences were the VN sequence, which resembled the structure of SRCs. The remaining eight filler sentences were with other various structures. The procedure and statistical analysis were the same as those described in Experiment 1.

(2) The examples of stimuli in Experiment 2

(a) Subject-extracted relative clause with ‘le’ (SRC)

www.frontiersin.org

‘The photographer framed the driver who attacked the principal.’

(b) Object-extracted relative clause with ‘le’ (ORC)

www.frontiersin.org

‘The photographer framed the driver who the principal attacked.’