A fan effect in anaphor processing: effects of multiple distractors

Research suggests that the presence of a non-referent from the same category as the referent interferes with anaphor resolution. In five experiments, the hypothesis that multiple non-referents would produce a cumulative interference effect (i.e., a fan effect) was examined. This hypothesis was supported in Experiments 1A and 1B, with subjects being less accurate and slower to recognize referents (1A) and non-referents (1B) as the number of potential referents increased from two to five. Surprisingly, the number of potential referents led to a decrease in anaphor reading times. The results of Experiments 2A and 2B replicated the probe-recognition results in a completely within-subjects design and ruled out the possibility that a speeded-reading strategy led to the fan-effect findings. The results of Experiment 3 provided evidence that subjects were resolving the anaphors. These results suggest that multiple non-referents do produce a cumulative interference effect; however, additional research is necessary to explore the effect on anaphor reading times.


INTRODUCTION
Many theorists have argued that language comprehension processes can be explained in large part by appealing to general memory processes (e.g., Lewis, 1996;Gerrig and McKoon, 1998;Myers and O'Brien, 1998;Lewis and Vasishth, 2005;van den Broek et al., 2005); this hypothesis has been widely supported by empirical evidence. For example, general theories of memory processes have been shown to provide explanations for linguistic tasks such as establishing common ground between multiple parties (Horton and Gerrig, 2005) and resolving anaphors (O'Brien et al., 1990;Almor, 1999). Anaphor comprehension (often called anaphor resolution) in particular appears to rely heavily upon memory to determine co-reference between an anaphor and antecedent. Even within a sentence, limitations on working memory capacity induce the need for retrieval of referents (McElree, 2000). There are also instances, such as pronouns that refer to implicit referents (Greene et al., 1994) and bridging inferences (Garrod and Sanford, 1981), where anaphors are resolved even though the intended referent has not been explicitly mentioned. Such processes clearly rely on memory to produce an acceptable referent. Further evidence for the relationship between memory and anaphor resolution is provided by the findings that many factors affecting memory also affect anaphor resolution, including distance and elaboration (O'Brien et al., 1990), salience of the anaphor (Klin et al., 2004), salience of the referent (Foraker and McElree, 2007), and frequency (van Gompel and Majid, 2004). In the research reported here, we focus on anaphor resolution across sentences. Nevertheless, models of retrieval processes both across (Myers and O'Brien, 1998) and within (e.g., Lewis and Vasishth, 2005) sentences have many commonalities, which we highlight below.
Of particular interest for the current research are studies that have examined the effects of multiple potential referents on anaphor resolution (e.g., Corbett and Chang, 1983;Corbett, 1984;Mason, 1997;Levine et al., 2000;Wiley et al., 2001;Badecker and Straub, 2002;Klin et al., 2004Klin et al., , 2006Ditman et al., 2007;Levine and Hagaman, 2008). In one of the first studies examining the effect of multiple potential referents, Corbett found longer reading time for an anaphoric noun phrase (e.g., the frozen vegetable) that included a category label when a text contained two members of that category (e.g., fresh corn and frozen asparagus) than when there was only a single category member (e.g., frozen asparagus). Badecker and Straub similarly found an increase in reading time shortly after subjects read reflexives when multiple gender-matched referents had been mentioned (e.g., John thought that Bill owed himself another opportunity to solve the problem). Levine et al. (see also Klin et al., 2004Klin et al., , 2006 found evidence suggesting that under some conditions anaphors (e.g., the dessert) appear not to be resolved at all when a text contains two potential referents from the same category (e.g., tart and cake), likely due to the increased difficulty in identifying a unique referent. The increased difficulty in processing anaphors in these studies suggests that readers engage in additional processing when a distractor (i.e., a non-referent) is present. Presumably this occurs because the both nouns are considered as potential referents, a process that is initiated by simple memory matching and that leads to retrieval-based interference. This explanation follows straightforwardly from global memory models (e.g., Ratcliff, 1978;Gillund and Shiffrin, 1984;Hintzman, 1986), which assume that stored memory representations that are related to a memory cue are activated in parallel and to the degree that they share features with the memory cue. Somewhat surprisingly, this additional processing appears to occur regardless of disambiguating material that should identify the proper referent, such as a prenominal adjective like frozen or the grammatical constraints that govern interpretation of reflexives (e.g., Reinhart, 1983). The reliability and time course of distractor interference, especially for within-sentence retrieval, is a matter of debate. Recent evidence is consistent with a very early role for grammatical constraints in retrieval. For example, Chow et al. (2014) were unable to replicate Badecker and Straub's results, and they found evidence that grammatical constraints prevent distractor interference (see also Dillon et al., 2013). Across sentence boundaries, some features, such as parallel structure (e.g., Josh criticized Paul. Then Marie insulted him.), may play an early role in limiting referent search (Chambers and Smyth, 1998). Nevertheless, for definite nounphrase anaphors like the dessert, reported findings suggest that retrieval processes rely on semantic matching between an anaphor and potential referents, with no evidence as yet indicating that there are grammatical constraints on this process.
Whereas results like those from Badecker and Straub (2002), Corbett (1984), and Klin and colleagues (Levine et al., 2000;Klin et al., 2004Klin et al., , 2006 illustrate indirectly that distractors are considered during anaphor resolution, direct evidence that distractors are activated during anaphor resolution comes from results reported by O'Brien et al. (1990). O'Brien et al. had subjects read passages with two potential antecedents (e.g., train and plane), which appeared early and late in a passage and were sometimes described elaborately. At the end of a passage, a sentence (e.g., Mark's neighbor asked him how he had traveled to his parent's) appeared that required retrieval of only one of the antecedents. Following this sentence, subjects had to name aloud one of the potential antecedent nouns. Relative to a no-anaphor control condition, referent nouns were named more quickly, replicating findings that suggest that referents are activated by anaphor resolution processes (e.g., Dell et al., 1983). Of perhaps greater interest was the finding that non-referent concepts were also activated relative to a control condition, especially when they were elaborated and appeared in the late position in the passage, between the anaphor and the correct antecedent. These results are consistent with the hypothesis that an anaphor acts like any other memory cue, activating related information in parallel. The finding that non-referent concepts were activated, especially when they occurred late and were elaborated, again fits very well with wellestablished findings from the memory literature that recency and elaboration lead to easier memory access.
Taken together, these studies demonstrate that people consider multiple potential referents when resolving anaphors, and further, that the resolution of the anaphor increases activation for the referent. However, studies involving distractors have typically been limited to situations with a single distractor. Therefore, the effect of additional distractors remains an open empirical question. A yet-stronger case that general memory processes govern anaphor resolution can be made if there is a cumulative effect of additional distractors. Both Myers and O'Brien's (1998) resonance model and Lewis and Vasishth's (2005) implementation of ACT-R (e.g., Anderson, 2005) as a theory of memory-retrieval in sentence-processing make similar predictions about the effect of multiple distractors. The resonance model states that elements in the mental representation resonate to signals from retrieval cues. In the case of anaphor resolution, the retrieval cue is the anaphor and the resonating elements are related items in the mental representation. Critically, the signal (i.e., resonance strength) of any item in the representation is divided among receiving elements, and only a subset of the elements with the strongest signal enter working memory (WM). Thus, the strength of a referent will be reduced in the presence of related distractors, reducing the probability that the correct referent will be selected into WM. Similarly, Lewis and Vasishth's model states that the activation that a chunk in memory will receive is reduced as there are more chunks in memory associated with a particular cue. Given the assumption that activation determines retrieval latency and the probability of the retrieval of a memory chunk, there should be greater difficulty in retrieving the correct referent with every additional distractor.
We can also draw on the memory literature to provide empirical guidance about the possible effects of multiple distractors. Specifically, research has shown that reading sentences that pair a person with multiple locations (or a location with multiple people) slows later recognition of the sentences (Anderson, 1974;Radvansky, 1998;Anderson and Reder, 1999). This result, known as the fan effect, is hypothesized to occur because of interference among competing associations in memory. Unlike the anaphor literature, which has focused on single distractors, the fan effect literature has explored situations with more than two associations and has demonstrated a cumulative effect, such that additional associations cause additional interference.
In the original demonstration of the fan effect (Anderson, 1974), subjects studied sentences in which a person was paired with a location (see 1-4 below).
(1) A hippie is in the park.
(2) A hippie is in the church.
(3) A policeman is in the park. (4) A sailor is in the park. Importantly, some people were associated with more than one location and some locations were associated with more than one person. For example, the sailor was associated only with the park (i.e., a fan of one), the hippie was associated with both the park and the church (i.e., a fan of two), and the park was associated with hippie, the policeman, and the sailor (i.e., a fan of three). Thus, the nouns varied in the number of associations with other nouns. After the study phase, subjects read another set of sentences, some of which were the same as those studied previously and some of which were novel pairings of people and locations that the subjects had not seen. For each sentence, subjects indicated whether it was the same as one they had read during the study phase or not. Consistent with the hypothesis that multiple associations interfere with one another, subjects were slower to recognize sentences with nouns that were associated with more nouns compared to sentences with nouns associated with fewer nouns. That is, subjects were slower to respond as the size of the noun's fan increased.
If anaphor resolution relies on general memory processes, and increasing the number of associations with a noun increases interference, then we can predict that increasing the total number of potential referents for an anaphor should also show a cumulative retrieval-interference effect (i.e., a fan effect). The present study tested this prediction across five experiments by exploring the effects of multiple distractors on anaphor resolution and the subsequent activation levels of referents and distractors. In particular, we used a probe recognition task after anaphor sentences to measure the relative activation of an anaphoric referent when there were a variable number of distractors. We also used the probe task to measure activation of those distractors as a function of the number of distractors. Our results demonstrate evidence of a fan effect in anaphor resolution.

EXPERIMENT 1A
In Experiment 1A, subjects read pairs of sentences. The first provided an antecedent and one or more distractors in a serial list, and the second included an anaphoric noun phrase that co-referred with the antecedent; these were followed by a probe recognition task that was used to measure the activation of the referent concept (see Table 1 for a sample passage and Appendix A in Supplementary Materials for a full list of experimental passages). In particular, the first sentence ended with a list of two, three, four, or five potential referents from the same taxonomic category, and the second sentence referred with a disambiguating adjective and categorical anaphor to a single item mentioned in the list. Following each sentence-pair, subjects completed a probe recognition task to measure the activation level of the referent following the anaphor. For example, the first sentence in the example in Table 1 describes a person looking through a toolbox with a number of tools in it. The last tool mentioned in the sentence, a saw, is the antecedent concept. The second sentence then describes the person fixing a table using the cutting tool. The latter noun phrase serves as an unambiguous reference to the entity introduced by the antecedent. After the second sentence was completed, the word saw was presented in an old-new recognition task, the correct response for which is "old." We assume that reaction time and accuracy in responding to the probes will reflect the ease or difficulty the subjects have in selecting the correct referent (cf. Dell et al., 1983;Levine et al., 2000) from the list of potential referents, including the distractors and the referent. We hypothesized that increasing the number of distractors would lead activation from the anaphor to spread among the referent and distractor concepts (Kintsch, 1988;Myers and O'Brien, 1998;Lewis and Vasishth, 2005). It was expected that the spread of activation from the anaphor to all conceptually-related potential referents would cause the referent to be less active following anaphor resolution as the number of distractors increased (i.e., a monotonic increasing trend in reaction time and decreasing trend in accuracy was expected), resulting in lower probe accuracy and longer probe recognition times. Additionally, this spread of activation should interfere with the selection of the appropriate referent during anaphor resolution, thus slowing reading of the reference sentence, replicating several findings (e.g., Corbett and Chang, 1983;Corbett, 1984;Mason, 1997;Badecker and Straub, 2002). Alternatively, it is possible that a backward, parallel-search process occurs such that the earlier-occurring distractors have little or no detectable impact on anaphor resolution (O'Brien, 1987). A backward, serial, self-terminating search would also predict no impact of early distractors on resolution of later referents. This latter strategy seems attractive especially in short passages with a list-like first sentence (cf. Townsend and Fifíc, 2004).

Subjects
Ninety-five students enrolled in a general psychology course at the University of Arkansas participated in the experiment to partially fulfill a research requirement. All subjects were native-English speakers. Informed consent was obtained from all subjects in this and all subsequent experiments.

Materials and design
There were 31 1 experimental sentence-pairs that appeared in one of four conditions (see Table 1). Each sentence-pair began with a list sentence that introduced a character by proper name (half stereotypically male, half stereotypically female) and ended in a list of either two, three, four, or five nouns from the same taxonomic category. The nouns were all single words, common, and were selected to be roughly equal in typicality as judged by the first author and several research assistants. Furthermore, each of the last two nouns in the list was able to be distinguished from the other nouns by means of an adjective (e.g., saws can be distinguished from the other tools in the list using the adjective cutting). The list sentence was followed by a reference sentence that unambiguously referred to the final item in the list using an adjective and a categorical anaphor (e.g., cutting tool) that was the same for all conditions. The anaphor always occurred three words prior to the end of the reference sentence to ensure that there was enough time for the anaphor to be resolved by the time the sentence was fully read (i.e., by the time the probe-word task was presented).
In addition, there were 68 filler sentence-pairs that each included a list sentence with two to five nouns but that were not limited by the same restrictions on nouns in the experimental lists (e.g., the nouns could be proper or multiple words). As with the experimental sentence pairs, the filler reference sentences also included a categorical anaphor modified by an adjective; however, the referent of the anaphor was not always completely unambiguous. Moreover, the referent of the anaphor was not always the last item in the list. These two features of the fillers were expected to encourage subjects to put forth more effort in resolving anaphors across all trials.
Each experimental and filler sentence-pair also had a corresponding recognition probe and comprehension question. Following the reference sentence, subjects completed a probe recognition task in which they indicated whether a word on the screen had occurred in the previous sentence-pair. For experimental sentence-pairs, the probe word was always the final noun from the list, which required a "yes" response. To ensure an equal number of "yes" and "no" responses across the experiment, the majority of the filler probe tasks presented a word that did not occur in the sentence pair and therefore required a "no" response. Other fillers presented a probe word that was not the final noun from the list, requiring a "yes." Finally, a comprehension question was presented following the probe recognition task, half of which required a "yes" response and half of which required a "no" response. Comprehension questions frequently, but not always, focused on correct resolution of the anaphor (e.g., Did Amelia use the saw?).
Subjects saw each experimental sentence-pair in one of the four conditions along with all filler sentence-pairs. Four counterbalanced lists were created with the following constraints: one quarter of the list sentences had two nouns, one quarter had three nouns, one quarter had four nouns, and one quarter had five nouns. Furthermore, a second set of materials 2 was created that reversed the order of the final two nouns in the list, such that final noun in the first set of materials (e.g., saw) became the penultimate noun and the formerly penultimate noun (e.g., hammer) became the final noun. This also required a change in the disambiguating adjective in the reference sentence (e.g., cutting changed to pounding) such that the referent of the categorical anaphor was always the final noun. The manipulation of these factors resulted in a design that was 4 (nouns: 2, 3, 4, 5) × 2 (noun order: order 1, order 2).

Procedure
The experiment began with three practice blocks of five trials each, which were intended to familiarize the subject with the yes/no response keys, the probe recognition task, and the comprehension question, respectively. For all practice trials, feedback about the correctness of subjects' responses was provided.
Subjects then began the experimental session. Subjects were instructed to read the sentences as they normally would for comprehension and to respond to the probe words as quickly and accurately as possible. Each trial consisted of a list sentence, a reference sentence, a probe word, and a comprehension question. At the beginning of each trial, subjects were given the instruction "PRESS THE SPACEBAR WHEN READY" centered on a computer monitor. When they pressed the spacebar, the list sentence appeared left-justified in the middle of the screen. Subjects pressed the spacebar to indicate when they had finished reading the list sentence, which removed the list sentence from the screen and replaced it with the reference sentence. Subjects pressed the spacebar again to indicate when they had finished reading the reference sentence, which removed the reference sentence from the screen and replaced it with a probe word in all capital letters in the center of the screen. Subjects used the left and right arrow keys labeled "Y" and "N" for yes and no, respectively, to respond to the probe task. This removed the probe word and replaced it with a comprehension question in the center of the screen; no feedback about correctness was provided for probes or questions. Subjects again used the yes and no keys to respond to the comprehension question, which ended the trial.
The experimental session consisted of 99 trials (31 experimental and 68 fillers) in three blocks of 25 trials and one block of 24 trials. The order of the blocks, as well as the order of the trials within each block, was randomized with the restriction that the first sentence-pair of each block was always a filler sentence-pair, to allow time for the subjects to fully return their attention to the task after a mandatory 10 s break between blocks. Subjects were free to take breaks between trials. The experiment lasted approximately 30 min. The procedure for this and all subsequent experiments were approved by the University of Arkansas Institutional Review Board.

Data exclusion and general analytic considerations
A subject's data were excluded from further analysis if they met any of the following criteria: (1) they had more than 30% of reading times less than 1000 ms or greater than 7500 ms; (2) they had lower than 70% probe recognition accuracy; (3) they had more than 30% of probe reaction times less than 500 ms or greater than 2500 ms; (4) they had no non-outlying probe recognition observations in at least one condition; or (5) they had less than 70% comprehension question accuracy. Based on these criteria, the data from eight subjects were excluded from further analysis. Additionally, two experimental items were removed from further analysis due to counterbalancing errors. Therefore, the reported analyses include 85 subjects and 29 items.
For all experiments reported in this paper, subject and item condition means were analyzed separately; a subscript of 1 indicates that subjects were treated as a random-effects variable, whereas a subscript of 2 indicates that items were treated as a random-effects variable. For all significance tests, an alpha level of 0.05 was used. Predictions about monotonic increasing and decreasing trends were tested using polynomial contrasts. For all repeated-measures effects with more than one numerator df, Huynh-Feldt adjusted p-values are reported to correct for sphericity violations. Effect-size measures that are reported are based on the subject analyses, and all within-subject standard errors in figures and tables were computed using the method recommended by Loftus and Masson (1994).

Comprehension
In general, the number of nouns did not affect comprehension (see Table 2 for comprehension results across all experiments). The linear trend was non-significant, F 1(1, 84) = 0.34, p = 0.56, F 2(1, 56) = 0.07, p = 0.79, with no significant higherorder trends. (See Appendix C in Supplementary Materials for the results of the noun-order factor in this experiment and Experiment 1B.)

Probe accuracy
Figure 1 presents mean probe word accuracy and reaction times along with mean reference-sentence reading times as a function of the number of referents. In general, accuracy decreased as the number of nouns in the list sentence increased. The linear trend was significant, F 1(1,84) = 9.63, p = 0.003, F 2(1,28) = 9.99, p = 0.004, η 2 p = 0.10, with no significant higher-order trends.

Probe reaction times (RT)
Only correct probes were analyzed. Outliers were first classified as RTs that were less than 400 ms or greater than 3000 ms.

FIGURE 1 | Experiment 1A antecedent probe reaction times and accuracies by noun condition (error bars indicate SE of the mean).
Remaining reaction times more extreme than 1.5 times the interquartile range above the 75th percentile or below the 25th percentile for each subject were classified as outliers (Tukey, 1977), resulting in 8.6% of the data being excluded from further analyses. In general, reaction time increased as the number of nouns in the list sentence increased (see Figure 1). The linear trend was significant, F 1(1, 84) = 8.03, p = 0.006, F 2(1,28) = 6.68, p = 0.02, η 2 p = 0.09, with no significant higher-order trends.

Reference-sentence reading times
Reference-sentence reading times were transformed to percharacter reading times by dividing the full-sentence reading time by the number of characters in the sentence, not counting spaces and punctuation (see Table 3). Outliers were first identified as trials with less than 15 ms/char or more than 150 ms/char. Outliers among the remaining reading times were then identified within each subject based on Tukey's (1977) criteria. 7.6% of the trials were excluded from further analysis. In general, reading time on the reference sentence decreased as the number of nouns in the list sentence increased. The linear trend was significant, F 1(1,84) = 19.55, p < 0.001, F 2(1,30) = 10.87, p = 0.003, η 2 p = 0.19, with no significant higher-order trends.

DISCUSSION
The results of the probe word analyses were consistent with the fan-effect hypothesis and generally favor models of anaphor resolution that posit a parallel-search mechanism in retrieval of the correct referent. As predicted, the presence of distractors interfered with the probe recognition task. Increasing the number of distractors in the list sentence decreased recognition accuracy and increased reaction times for referents, which suggests that the activation level of referents decreased as the number of distractors increased. The existing literature has shown via a variety of measures and paradigms that the presence of one distractor interferes with anaphor resolution (e.g., Corbett and Chang, 1983;Corbett, 1984;Mason, 1997;Levine et al., 2000;Wiley et al., 2001;Klin et al., 2004Klin et al., , 2006Ditman et al., 2007;Levine and Hagaman, 2008). The present results extend this finding by demonstrating a cumulative effect of distractors.
The effect of additional nouns on the subsequent referencesentence reading times, however, was unexpected. It was predicted, based on previous research (e.g., Corbett, 1984), that anaphor resolution would be slowed by the presence of distractors, resulting in longer full-sentence reading times as the number of distractors increased. However, the results were exactly the opposite, indicating that the subjects actually read the reference sentences more quickly as the number of distractors increased. Assuming this is not a Type I error, one plausible explanation for this result is that subjects adopted a strategy of speeding through the reference sentence to reduce the time between the referents and the probe recognition task when there were more distractors. A similar finding was reported by Van Dyke and McElree (2006), who had subjects reading sentences of variable complexity while holding or not holding a memory load and found that reading was faster for more-complex sentences with a memory load than without one. This speededreading strategy as a potential alternative explanation for the fan effect was explored in further detail in Experiments 2A and 2B; we defer discussion until the presentation of those experiments.

EXPERIMENT 1B
Experiment 1A established that referents were less active following anaphor resolution when there were more potential referents available in the discourse. Experiment 1B replicated Experiment 1A but used distractors as the probe words to test the effect of multiple distractors on the activation level of a distractor. As in Experiment 1A, it was hypothesized that additional distractors would decrease probe accuracy and slow probe recognition times. If anaphors act like any other cue to memory, the activation of the referent and distractors should be split (Kintsch, 1988;Myers and O'Brien, 1998;Lewis and Vasishth, 2005), resulting in less activation to go around (i.e., a fan effect) as there are more related concepts in the list sentence. Because the anaphor contains two cues (i.e., adjective plus noun) to retrieve the referent but only one (i.e., the noun) that matches the distractors, referents should become more active and experience less interference (i.e., a reduced fan effect) than distractors following anaphor resolution. Moreover, later items may overwrite or displace earlier items, leading to degraded representations of the referent and especially earlier-occurring distractors (Nairne, 1990;Lewis, 1996). We examine these predictions in a cross-experiment comparison after presenting the results of Experiment 1B, and then examine them more directly (i.e., in a completely within-subjects design) in Experiments 2A and 2B.

Subjects
Seventy-eight students enrolled in a general psychology course at the University of Arkansas participated in the experiment to partially fulfill a research requirement. All subjects were native-English speakers.

Materials, design, and procedure
Experiment 1B was identical to Experiment 1A except that the probe words in the probe recognition task for experimental trials were distractors (i.e., the penultimate word in the list).

Data exclusion and general analytic considerations
Based on the data exclusion criteria detailed in Experiment 1A, the data from eight subjects were excluded from further analysis. Sixteen more subjects were removed from further analysis for a systematic misunderstanding of the instructions. These subjects consistently responded "no" to distractors on the probe task when they should have been responding "yes." This pattern of responding suggests that these subjects were correctly identifying the correct referent of the anaphor, but misunderstanding that this identification was unrelated to the probe task. Therefore, the comprehension accuracy, probe accuracy, and reading time analyses included 54 subjects and 31 items.

Probe accuracy
Figure 2 presents mean probe word accuracy and reaction times along with mean reference-sentence reading times as a function of the number of referents. In general, accuracy decreased as the number of nouns in the list sentence increased. The linear trend was significant, F 1(1, 53) = 39.08, p < 0.001, F 2(1, 30) = 45.28, p < 0.001, η 2 p = 0.42, with no significant higher-order trends.

Probe reaction times
Based on outlier exclusion criteria, 9.6% of the data were excluded from further analyses. In general, reaction time increased as the number of nouns in the list sentence increased (see Figure 2). The linear trend was significant, F 1(1, 53) = 16.79, p < 0.001, F 2(1, 30) = 6.59, p = 0.01, η 2 p = 0.24. There was also an unexpected cubic trend, F 1(1, 53) = 12.81, p = 0.001, F 2(1, 30) = 3.13, p = 0.09. There was no theoretical expectation of this effect, and it did not appear in Experiment 1A, so we did not try to interpret it.

Reference-sentence reading times
Based on outlier exclusion criteria, 5.2% of the data were excluded from further analyses. In general, as in Experiment 1A, reading time (see Table 4) on the reference sentence decreased as  the number of nouns in the list sentence increased. The linear trend was significant, F 1(1,53) = 11.74, p = 0.001, F 2(1, 30) = 11.52, p = 0.002, η 2 p = 0.18, with no significant higher-order trends.

DISCUSSION
The probe word results were again consistent with the fan-effect hypothesis. As predicted, the presence of distractors interfered with the probe recognition task. Increasing the number of referents in the list sentence decreased recognition accuracy and increased reaction times for distractors similar to the effect found for referents in Experiment 1A. This result extends the findings of Experiment 1A to show that distractors also decrease in activation as the number of referents increases.
As in Experiment 1A, the reading-time results did not support the fan-effect hypotheses. Subjects again read the reference sentence more quickly as the number of distractors increased. This replication provides additional confidence that the unexpected results were not occurring due to chance. This issue was explored in further detail in Experiments 2A and 2B.

EXPERIMENTS 1A AND 1B COMBINED ANALYSIS
As noted in the introduction to Experiment 1B, the effect of fan size should be different for referents (Experiment 1A) and distractors (Experiment 1B). To compare the magnitude of the effect of the number of nouns on referents and distractors, an additional analysis was conducted for the probe reaction times from Experiments 1A and 1B. Probe reaction times for each subject in both experiments were first linearly regressed on the number of nouns (cf. Lorch and Myers, 1990), and the slopes were then examined in an independentsamples t-test with experiment (i.e., probe: referent vs. distractor) as a between-subjects variable. This analysis revealed a nonsignificant effect of probe in the expected direction, with a substantially smaller mean slope among subjects responding to referents in Experiment 1A (M slope = 15.2 ms/noun, SE = 5.4) than among subjects responding to distractors in Experiment 1B (M slope = 31.3 ms/noun, SE = 7.6), t (137) = 1.77, p = 0.08, d = 0.30.
A similar analysis performed on the accuracy data revealed a large and significant effect of probe, with a substantially smaller mean slope among subjects responding to referents in Experiment 1A (M slope = −0.014 accuracy/noun, SE = 0.0046) than among subjects responding to distractors in Experiment 1B (M slope = −0.052 accuracy/noun, SE = 0.0084), t (137) = 4.33, p < 0.001, d = 0.74. Although referents likely gained an advantage in both accuracy and speed of responding due to having appeared more recently than distractors, these analyses focused on the linear trends in which distance from the probe were equal. Therefore, these results provide evidence that the interference effect is greater for distractors than referents; this effect was tested more directly in Experiments 2A and 2B.

EXPERIMENT 2A
The procedure for Experiment 2A was modified from that in Experiments 1A and 1B such that subjects read the reference sentence one word at a time. This allowed for a more detailed analysis of the reading times, which was necessary to help understand the unexpected reference-sentence reading time results of Experiments 1A and 1B. The prediction that additional distractors should slow reading of the reference sentence was based on the hypothesis that multiple distractors would interfere with anaphor resolution. This means that the expected slowdown should occur specifically on the anaphor or immediately after the anaphor during spillover processing. According to this hypothesis, it was expected that there should be no difference in reading times on the reference-sentence until subjects reach the anaphor and post-anaphor regions, where they were expected to read more slowly as the number of distractors increased. However, if the results of Experiments 1A and 1B are reliable, then there should be longer reading times when there are more distractors at some point in the reference sentence prior to the anaphor.
In addition, Experiments 1A and 1B demonstrated that the presence of multiple distractors made recognition of both referents and distractors more difficult, as indexed by both reaction time and accuracy. Experiments 2A and 2B were designed to manipulate the probe word within subjects to address potential concerns about comparing results across experiments. Thus, in these experiments, probe word (referent vs. distractor) and number of distractors (two vs. five) were manipulated within subjects. The fan-effect hypothesis predicts that additional distractors would slow recognition and decrease accuracy for both referents and distractors. Moreover, to the extent that anaphor resolution focuses activation on the referent, thereby minimizing interference, the degree of interference should be greater for distractors than for referents.

Subjects
Seventy-five students enrolled in a general psychology course at the University of Arkansas participated in the experiment to partially fulfill a research requirement. All subjects were native-English speakers.

Materials and design
Thirty of the experimental materials from Experiment 1 were used and appeared in only the two-and five-noun list conditions. This also required some modification of the list length in the filler sentences to maintain an equal distribution of list lengths across the entire experiment. In addition, the probe words were manipulated within subjects, such that each subject saw an equal number of referent and distractor probes following experimental items.
Subjects saw each experimental sentence pair in one of the four conditions along with all filler sentence pairs. Four counterbalanced lists were created with the following constraints: approximately (i.e., 7 or 8 items) one quarter of the list sentences had two nouns followed by a referent probe, approximately one quarter had two nouns followed by a distractor probe, approximately one quarter had five nouns followed by a referent probe, and approximately one quarter had five nouns followed by a distractor probe. Because counterbalancing order did not have any important effects in Experiments 1A and 1B, order was no longer manipulated, resulting in a 2 (nouns: 2, 5) × 2 (probe word: referent, distractor) completely within-subjects design.

Procedure
The experiment was conducted using Linger (Rohde, 2003) to present the materials using a moving window (Just et al., 1982). Before starting the experiment, subjects completed three practice trials to familiarize themselves with the procedure. Each trial began with two rows of dashes, centered on the left-hand side of the screen, with each dash replacing a character or space in the sentences. Subjects pressed the spacebar to initially present the list sentence in its entirety. When they finished reading the list sentence, subjects pressed the spacebar again which replaced the list sentence with dashes and revealed the first word of the reference sentence. Subjects continued to press the spacebar to advance from one word to the next, with each press replacing the previous word with dashes and revealing the next word in the sentence. Pressing the spacebar after the final word of the reference sentence removed all of the dashes from the screen and presented a probe word in all capital letters in the center of the screen. Subjects responded to the probe word using the F key for yes and the J key for no. The response removed the probe word from the screen and replaced it with a comprehension question. Subjects again responded using the F and J keys, which advanced the screen to the next trial.
The experimental session consisted of 98 trials (30 experimental and 68 fillers) in two blocks of 49 trials each with the order of the trials completely randomized. Subjects were instructed to read the sentences as they normally would for comprehension and to respond to the probe words as quickly and accurately as possible. Subjects were free to take breaks between trials. The experiment lasted approximately 30 min.

Data exclusion and general analytic considerations
Based on the data exclusion criteria, the data from six subjects were excluded from further analysis. Therefore, the reported analyses included 69 subjects and 30 items.

Reference-sentence reading times
Outliers were first identified as words read for less than 150 ms or more than 700 ms; different criteria were used in this experiment to try to approximate in a per-word measure the per-character measures used in the previous experiments. Outliers among the remaining reading times were then identified within each subject based on Tukey's (1977) criteria. This resulted in 8.1% of the trials being excluded from further analysis 3 . The individual-word reading times were combined into three regions of three words each. The pre-anaphor region consisted of the three words prior to the anaphor; the anaphor region consisted of the three-word noun phrase involving the determiner, adjective, and anaphor (e.g., the cutting tool); and the post-anaphor region consisted of the three words following the anaphor. Although some items had more than three words prior to the anaphor noun phrase, the analysis was restricted to this point because there was a dramatic drop in the number of observations starting four words prior to the anaphor region. The post-anaphor region was always the final three words of the anaphor sentence. Thus, each region consisted of three words, making their reading times roughly comparable.

DISCUSSION
As predicted by the fan-effect hypothesis, and consistent with Experiments 1A and 1B, probe word accuracy was higher and responses were made faster in the two-noun condition than in the five-noun condition for both referents and distractors. Moreover, the cross-experiment interaction of number of nouns and probe type was replicated; the fan effect is larger for distractors. The 3 The pattern of results remained similar using a less-strict cutoff of 1500 ms/word. reading time results replicated those from Experiments 1A and 1B: subjects read the reference sentence faster in the five-noun condition than in the two-noun condition. However, measuring reading time per-word enabled a more detailed analysis of the reference-sentence reading times and revealed that the faster reading primarily occurred in the pre-anaphor region. Because this region was identical across conditions and made no reference to the list sentence, there is no theoretical reason to expect this difference based on anaphor resolution processes. Instead, these results support the speeded-reading explanation suggested in the discussion of Experiment 1A, that subjects may have adopted a particular strategy in order to mitigate the increased difficulty of the probe-word task in the five-noun condition by reaching the probe word task and comprehension questions more quickly. Furthermore, per-character reading times on the list sentence (see Appendix B in Supplementary Material ) increased as the number of nouns increased, suggesting that the speeded-reading strategy was adopted only on the reference sentence after subjects became aware of the increased difficulty imposed by the longer lists.

EXPERIMENT 2B
Because subjects appeared to be adjusting their reading speed to accommodate the difficulty of representing multiple referents, it was important to assess whether the probe word results were dependent on this apparent strategy. Experiment 2B was thus a replication of Experiment 2A using a fixed-rate presentation of the passages. By controlling the pace of reading, any effects found on the probe recognition task can be assumed to reflect processes that occurred independent of subjects' variable reading speed. Holding reading-rate constant was not expected to change the probe-word results, so it was expected that responses to both referents and distractors would be faster and more accurate when there were two referents in the list sentence than when there were five referents. Moreover, this experiment provided one more opportunity to examine the prediction that the effect of fan would be greater among distractors than among referents. In the accuracy data, the fan effect has been reliably much stronger for distractors than it has been among referents. In the reaction-time data, between Experiments 1A and 1B, this effect was significant only in a one-tailed test, and in Experiment 2A, the same pattern emerged but it was not reliable.

Subjects
Sixty-six students enrolled in a general psychology course at the University of Arkansas participated in the experiment to partially fulfill a research requirement. All subjects were native-English speakers.

Materials, design, and procedure
The materials, design, and procedure were identical to Experiment 2A except that the materials were presented at a fixed pace of 450 ms per word 4 .

Data exclusion and general analytic considerations
Based on the data exclusion criteria, the data from 10 subjects were excluded from further analysis. Four more subjects were removed from further analysis for a systematic misunderstanding of the instructions. Therefore, the analyses included 50 subjects and 30 items.

Probe reaction times
Based on outlier exclusion criteria, 8.7% of the data were excluded from further analyses. Like the accuracy results, reaction time tended to be shorter for referents than for distractors and when there were two nouns in the list sentence than when there were five. A 2 (nouns: 2, 5) × 2 (noun probed: referent, distractor) repeated-measures ANOVA showed that there was a significant main effect of the number of nouns, F 1(1, 49) = 25.8, p < 0.001, F 2(1, 29) = 19.4, p < 0.001, η 2 p = 0.35, as well as a significant main effect of the noun probed, F 1(1, 49) = 44.1, p < 0.001, F 2(1, 29) = 31.9, p < 0.001, η 2 p = 0.47. Once again, the pattern of means replicated the cross-experiment pattern seen in Experiments 1A and 1B as well as that seen in Experiment 2A, with the effect of number of nouns being larger for distractors than for referents. Despite this, there was not a significant interaction between the number of nouns in the sentence and the noun being probed, F 1(1, 49) = 0.70, p = 0.41, F 2(1, 29) = 2.03, p = 0.17. The effect of the number of nouns was significant among the referents, t 1(49) = 2.86, p = 0.006, t 2(29) = 2.54, p = 0.02, as well as among the distractors, t 1(49) = 4.55, p < 0.001, t 2(29) = 4.87, p < 0.001; this effect was numerically smaller for referents (94 ms, d = 0.40) than for distractors (130 ms, d = 0.64). The slopes corresponding to these effects, 31.2 ms/noun for referents and 43.2 ms/noun for the distractors, were substantially larger than the respective slopes seen in the previous experiments, possibly due to the change in the presentation of the passages to experimenter-paced.

DISCUSSION
The results confirmed the predictions of the fan-effect hypothesis, and the probe-word results were conceptually identical to Experiment 2A. Although subjects in the previous experiments seemed to be adopting a special strategy of reading the reference sentence more quickly when there were more distractors, the results of Experiment 2B indicate that this strategy was not necessary for the emergence of the probe-word results we had previously observed because subjects did not have the opportunity to employ it. The replication of the finding that the activation level of nouns decreases as the number of distractors increases therefore appears to be the result of a diffusion of activation to all potential referents.
However, this conclusion relies on the assumption that subjects were resolving the anaphor and that the anaphor processing affected the activation level of the referents. There is some evidence, however, that anaphor resolution may not always occur during reading (Greene et al., 1992;Levine et al., 2000;Klin et al., 2004Klin et al., , 2006Love and McKoon, 2011), making it possible that the present results could be occurring independent of anaphor resolution. The effect of nouns may have been caused by the increasing memory demands incurred as the number of referents increased regardless of whether the subjects attempted to resolve the anaphors. It is possible that as the amount of information in the subjects' mental representations increased, the probability of the correct referent being activated even by the probe word itself, independent of anaphor resolution processes, decreased, resulting in slower reaction times as the number of referents.

EXPERIMENT 3
Experiment 3 was designed to address the possibility that anaphors were not being resolved in the prior experiments. To do so, the reference sentence was modified such that it contained an anaphor or not (see Table 1), a manipulation that has been used many times in the anaphor resolution literature (e.g., Dell et al., 1983;Levine et al., 2000). As in Experiments 2A and 2B, there were either two nouns (i.e., a referent and one distractor) or five nouns (i.e., a referent and four distractors) in the list sentence that preceded the reference sentence. The referent was used as the probe word to provide an index of the activation of this concept at the end of the anaphor or no-anaphor sentence. According to the fan-effect hypothesis, it is activation from the anaphor as a memory cue that is divided among the referent and the distractors that is the source of the effect of the number of nouns. Thus, when there is an anaphor, the fan-effect hypothesis predicts an effect of the number of nouns like that seen in the previous experiments. Whatever pattern emerges for the effect of the number of nouns in the anaphor condition, because anaphor resolution involves reactivation of the correct referent (e.g., Dell et al., 1983), there should be an overall accuracy and reaction time advantage in the anaphor over the no-anaphor control condition.

Subjects
Seventy students enrolled in a general psychology course at the University of Arkansas participated in the experiment to partially fulfill a research requirement. All subjects were native-English speakers.

Materials and design
Experiment 3 used the same set of materials as Experiments 2A and 2B with the exception that the reference sentence was manipulated (see Table 1) such that it included an anaphor (i.e., Anaphor condition) or not (i.e., No Anaphor condition), while equating for length (i.e., the mean length for both the anaphor and no anaphor conditions was 61.5 characters). Finally, the probe words were limited to referents only, as in Experiment 1A. The manipulation of these factors resulted in a 2 (nouns: 2, 5) × 2 (reference: anaphor, no anaphor) completely within-subjects design.

Procedure
The procedure of Experiment 3 was identical to that of Experiments 1A and 1B, except that it included only 98 trials (30 experimental and 68 fillers), as in Experiments 2A and 2B.

Data exclusion and general analytic considerations
Based on outlier identification and comprehension and probe accuracy, the data from 5 subjects were excluded from further analysis. Therefore, the reported analyses included 65 subjects and 30 items.

Probe reaction times
Based on outlier exclusion criteria, 7.5% of the data were excluded from further analyses. Reaction times (see Table 6) tended to be faster in the anaphor condition than in the no anaphor condition and when there were two nouns in the list sentence than when there were five nouns. A 2 (nouns: 2, 5) × 2 (reference: anaphor, no anaphor) repeated measures ANOVA revealed that the main effect of nouns was non-significant, F 1(1, 64) = 1.26, p = 0.27, F 2(1, 29) = 0.24, p = 0.63. Because of the prediction of the fan-effect hypothesis, the effect of the number of nouns was examined for the anaphor condition. The noun-effect was 23 ms but was also not significant, t 1(64) = 1.26, p = 0.21, t 2(29) = 0.80, p = 0.43. The main effect of reference was nearly significant in the subjects analysis, F 1(1, 64) = 3.38, p = 0.07, η 2 p = 0.05, but nonsignificant in the items analysis, F 2(1, 29) = 2.32, p = 0.14, and the interaction between reference and nouns was not significant, F 1(1, 64) = 0.65, p = 0.42, F 2(1, 29) = 1.09, p = 0.31.

DISCUSSION
The results from Experiment 3 provided some evidence that subjects were in fact resolving the anaphors when reading the passages. Probe accuracy was better after reading a sentence with an anaphoric reference than after reading a sentence that did not make an anaphoric reference. Additional evidence that subjects were resolving the anaphors comes from the reading-time data. Controlling for length, the reference sentences were read more slowly when they contained an anaphor than when they did not, consistent with the hypothesis that subjects were engaging in additional processing to resolve the anaphor. This conclusion is tentative, though, as there were more explicit references 5 (e.g., pronouns, specifiers, definite noun phrases) to entities in the prior sentence in the reference sentences in the anaphor condition (M = 2.8, SD = 0.8) than in the no-anaphor condition (M = 1.6, SD = 0.8), t (29) = 6.27, p < 0.001. In most cases (25 of 30 passages), these additional references were not to any of the list items; excluding the five passages with a second reference to list items does not change the pattern of results for probe accuracy or reaction time reported above.
The results of this experiment's anaphor condition were less consistent with the fan-effect hypothesis than the results from prior experiments, although the general pattern of degraded recognition performance with more nouns persisted; we return to this issue in the General Discussion.

GENERAL DISCUSSION
Explanations of how anaphoric expressions are understood have frequently appealed to general memory processes. Consistent with theories of comprehension that place memory at their center (e.g., Kintsch, 1988;Myers and O'Brien, 1998;Lewis and Vasishth, 2005), anaphor resolution is more difficult when factors are present that make retrieving a unique item from memory more difficult, such as when there is similarity between a desired target and some distractor. Prior research that has produced findings that are consistent with this hypothesis (e.g., Corbett and Chang, 1983;Corbett, 1984;O'Brien, 1987;O'Brien et al., 1990;Greene et al., 1992;Levine et al., 2000;Badecker and Straub, 2002;Klin et al., 2004Klin et al., , 2006 have used stimuli with one distractor and one antecedent, and by a variety of measures anaphor resolution has been shown to be more difficult because of the distractor. In five experiments, we examined the hypothesis that a greater number of distractors would lead to a fan effect (Anderson, 1974) in anaphor resolution, that is, if with each additional distractor there would be additional difficulty in identifying the correct referent of the anaphor. We also examined the effect of additional distractors on the activation of those distractors. Our subjects read pairs of sentences, the first of which provided a variably-long list of concepts from the same taxonomic category and the second of which made unambiguous reference to one of the items in the list with an adjective-modified definite noun phrase; this was followed by a probe recognition task that should provide an index of how active the probed concept is in the text representation.
Collectively, the probe word results from the present experiments supported the hypothesis that distractors have a cumulative effect on antecedent activation levels. Although the effect of the distractors on reaction time varied in size and significance from experiment to experiment, it is overall a robust effect. The two-and five-noun conditions with a referent probe were present in Experiments 1A, 2A, 2B, and 3. The subject data from these four experiments were combined and submitted to a 2 (nouns: 2, 5) × 4 (Experiments: 1A, 2A, 2B, 3) mixed-factor ANOVA with repeated-measures on the first factor. The effect of nouns was significant, F (1, 265) = 15.56, p < 0.001, and the interaction was not, F (3, 265) = 1.18, p = 0.32, suggesting that there was not significant variability in the effect of nouns across experiments. Cohen's d for the effect of nouns was 0.24 (95% confidence interval: 0.12, 0.36; Smithson, 2003), demonstrating a small but reliable effect. Whereas previous research has shown that the presence of a single distractor interferes with the activation of the antecedent, the present research extends this finding by demonstrating that each additional distractor further reduces the activation level of the antecedent and other distractors. This effect is akin to a set size effect (Sternberg, 1966), with larger lists leading to longer reaction times; however, the difference in the size of the effect for referents and distractors suggests that an additional process related to anaphor resolution is also occurring.
The present results are conceptually similar to the fan effect where delayed recognition [i.e., the recognition task occurring after the presentation of all of the materials as in Anderson (1974)] slows as the number of facts associated with a noun increases. This effect is generally attributed to the reduction in the probability of the correct item in memory being activated at the time of retrieval, thus slowing responses. The present experiments demonstrate an earlier effect, with the number of distractors affecting the activation level of nouns immediately after each trial. In this case, the categorical anaphor (e.g., tool) acts as a retrieval cue, with activation being split among all of the concepts associated with the category (i.e., the referent and distractor[s]). Increasing the number of distractors should therefore increase the time required to resolve the anaphor. This increased retrieval time effect was not observed in the present experiments, although this was likely due to subjects adopting a speeded-reading strategy (see the discussion of the reading-time results below). As a consequence of multiple potential antecedents, activation should be divided among the concepts, limiting the activation for each one (see spreading activation theory; Collins and Loftus, 1975;Anderson, 1983). This prediction was supported by the slowed reaction times and the reduced accuracy resulting from increasing the number of distractors. The present results further demonstrate that activation does not spread equally to all category members when there is disambiguating information (e.g., an adjective modifier like cutting in the cutting tool). Increasing the number of nouns led to a consistently greater reduction in probe accuracy and increase in reaction time for distractors than for referents in the Experiments 1A and 1B combined analysis and Experiments 2A and 2B, suggesting that activation was spreading disproportionately to the referent.
We have framed the current results as primarily being an effect that occurs at the time of retrieval (i.e., upon reading the anaphor). It is possible that these effects are also influenced by encoding or storage interference. Upon reading multiple items with many shared features, like our list-sentence items, the mental representation of these items may be overwritten (Nairne, 1990) or degraded due to repeated reactivation by similar items (Estes, 1997). The methodology used in the current research does not allow for delineation between a storage-based and a retrievalbased explanation. Ferreting out the relative contributions of storage-and retrieval-interference processes would likely require careful parametric manipulation of feature overlap among distractors and the referent as well as precise control over not only timing of reading and probes but also time elapsed between storage and retrieval, as well as manipulation of serial position of distractors and referents. Attempting to work out these details is a promising avenue for future research.
Turning to the reading-time results, we found no evidence that additional distractors led to more difficulty processing anaphoric reference. By contrast, we consistently found that our subjects read faster as there were more distractors. We believe that this is the result of subjects adopting a speeded-reading strategy on difficult trials (i.e., trials with longer lists of nouns), which counteracted the predicted increase in anaphor reading time. This is similar to Van Dyke and McElree's (2006) finding that, while reading grammatically-complex sentences, subjects read faster and had worse comprehension while holding a memory load (i.e., a list of three words) than when not holding a memory load, suggesting a dual-task strategic trade-off. Our subjects also had lower comprehension with greater list length (see Table 2), suggesting that there was possibly a task demand that shifted attention somewhat from the comprehension aspect of the task to the memory aspect of the task. In no case, however, was comprehension lower than about 83%. Moreover, there is no theoretical reason to expect anaphor resolution to take less time as the number of candidate antecedents increases unless subjects were giving up on trying to identify the correct antecedent (Levine et al., 2000). There are a few arguments consistent with the notion that subjects were in fact resolving the anaphors in the current research. First, correctly answering a large majority of the comprehension questions required the anaphors to be resolved, which some have suggested is necessary to get subjects to resolve anaphors in anaphor resolution research (Foertsch and Gernsbacher, 1994). Second, some subjects, especially in Experiment 1B, spontaneously adopted the strategy of labeling distractors as new in the probe recognition task, which suggests that they had selected the referent as the "correct" answer and distractors as the "incorrect" answer to the probe task. Third, Experiment 3 provides tentative evidence that subjects were resolving the anaphor, even on five-noun trials. Given these arguments and findings, we believe that our subjects were resolving anaphors even when it was difficult to do so. Therefore, the speeded-reading strategy appears to be the most parsimonious explanation of these unexpected results. Furthermore, the fixed-pace presentation of the sentence in Experiment 2B prevented subjects from engaging in the speeded-reading strategy, demonstrating that the probe word effects do not rely on such a strategy. Future research should attempt to prevent the speeded-reading strategy while maintaining naturalistic reading (e.g., introducing a substantial delay between the passages and the probe task or eliminating the probe task entirely) in order to better evaluate the anaphor reading time hypothesis.
Finally, returning to the fan-effect hypothesis, the original explanation offered for the fan effect by Anderson (1974) was based on Anderson and Bower's (1973) theory of memory, which assumed that memory retrieval was based on search cues being used to identify, in parallel, matching elements in memory, which were then serially examined, resulting in an increase in reaction time with each additional matching element. In the former detail (i.e., a parallel matching), this theory is in the same family as other global-matching memory theories like those of Ratcliff (1978), Gillund andShiffrin (1984), andHintzman (1986), upon which memory-based text processing frameworks like Myers and O'Brien's (1998) resonance model are based. In this sense, the results of our experiments are confirmation of both theories of memory search and the hypothesis that at least some aspects of comprehension may be explained by general memory processes. However, other research into the fan effect has shown that there are circumstances under which there is no fan effect despite there being multiple associations with a single memory cue (Myers et al., 1984;Radvansky, 1998;Radvansky et al., 1998). Myers et al. found no fan effect when memory elements could be integrated causally. For example, reading the elements the doctor went to the racetrack, the doctor studied the odds, and the doctor made a selection may be readily integrated into a causally-coherent narrative representation about events occurring at a racetrack. Similarly, Radvansky and colleagues showed that the fan effect is reduced or even eliminated when potentially-competing memory elements can be readily integrated. One feature that makes elements easy to integrate is if they can occur at the same time (e.g., the grocer was folding a towel; the grocer was clearing his throat), whereas elements that are in different locations may not be integrated (e.g., the welcome mat is in the cocktail lounge; the welcome mat is in the office building). Radvansky et al. observed a fan-effect in recognition of hard-to-integrate elements, but not for easy-tointegrate elements. Given that there are boundary conditions for the fan-effect in memory experiments, a natural question to ask is if there are circumstances under which the search process in anaphor resolution might occur without interference. Across sentences, one such circumstance might be if the items in a list occur in more-naturalistic texts, allowing for an integrated situation model to be constructed, as suggested by both Myers et al. (1984) and Radvansky (1998;Radvansky et al., 1998). By contrast, within sentences, one condition that has been shown to limit the search for referents is when there are strong grammatical constraints on reference. Recent evidence from Dillon et al. (2013;see also Chow et al., 2014) suggests that syntactic principles may guide retrieval in a constrained manner for some linguistic dependencies, such as reflexives (but see Badecker and Straub, 2002;Kennison, 2003 andSturt, 2003, for further complexities), leading to retrieval without interference from distractors; syntactic constraints may play an especially critical role in directing the retrieval processes that occur within a sentence. These types of findings are representative of two distinct research literatures have arisen over the past few decades, one focused on retrieval across sentences, and the other focused on retrieval within sentences. Integration of these theories and findings holds out the promise of yet further integration of theories of memory and comprehension.

AUTHOR NOTE
Portions of these data were presented at the 52nd Annual Meeting of the Psychonomic Society, 2011, Seattle, Washington, and at the 22nd Annual Meeting of the Society for Text and Discourse, 2012, Montreal, Canada.