Persistence Conditions of Institutional Entities: Investigating Copredication Through a Forced-Choice Experiment

The conditions under which certain complex polysemous nominals can sustain coherent sense relations (informally, can “survive”) is investigated through a two-alternative forced choice experiment. Written scenarios were constructed which permitted copredication, through which multiple, semantically different sense types are associated with a single nominal. Participants were presented with two scenarios involving a polysemous nominal (e.g., bank, city) and had to select which scenario (and, hence, which combination of predicates) appeared to be the most prototypical, faithful realization of the nominal. In order to achieve this, an additional manipulation was added, such that the number of senses hosted by each forced choice was either equal (2 senses choice vs. 2 senses choice) or unequal (1 sense choice vs. 2/3 senses choice). In order to address certain concerns in the literature about prototypicality, a core question addressed was whether the institutional sense of the nominals strongly determined the option chosen by participants, or whether the number of senses more strongly predicted this. It was found that the best predictor of sense “survival” was not sense frequency, but rather sense complexity or approximation to the institutional sense.


INTRODUCTION
In a recent discussion on the nature of consciousness, Chomsky (2018, p. 38) considers "one of the most ancient problems of philosophy: How can we cross the same river twice?" These are problems of "identity" and "individuation." Relatedly, Collins (2017, p. 680) presents an example demonstrating how river can license copredication through it being a geographical feature, an abstract relation or a body of water: "The Nile runs the length of Egypt and it serves as the most important trade route in the region as well as the source of irrigation for nigh-on all of Egypt's crop production." This issue has been discussed mostly within the philosophical literature; witness Collins (2017, p. 686): "Thus, a group of people and a geographical area wildly dissociate in every conceivable sense save for them being referred to by London, say. We can kill the population of London, but not the area in south-east England. Equally, we can burn the city down while sparing the people, but rebuild the same city elsewhere, with a new population." It has been argued that these issues pertaining to the semantic diversity of apparently simple entities may relate to the theme of polysemy. Cognitive models of polysemy have suggested that vagueness, polysemy and homonymy represent "a cline of diminishing schematicity and increasing instances of salience" (Lewandowska-Tomaszczyk, 2007, p. 139), such that polysemous senses bear a lower degree of salience than homonymous meanings do, and vague words have an even lower degree of salience for a particular meaning. For instance, student is not polysemic with respect to the distinction between young student and old student ("I gave the book to a student but not to a student" cannot refer to an old and young student, respectively); it is vague and unmarked. In a similar way that one would typically say "Milton Keynes is close to London" and not "London is close to Milton Keynes, " due to proximation to a larger city being seen as a prototypical frame of geographical reference, it may be that there is an empirically detectable range of sense prototypicality in polysemy such that one sense may be seen as more essential to the polysemous nominal than others.
The main set of prototype effects are plotted in Table 1, loosely adapted from Lewandowska-Tomaszczyk (2007, p. 151) and modified to focus on the form of polysemy explored here. This shows how degrees of salience, clustering of senses, a lack of clear necessary and sufficient features, and variability in category membership are hallmarks of prototype effects. Table 1 plots effects relating to extensions and intensions, which are, respectively, entities referred to by the concepts and senses which the concepts are composed of. These effects can in turn be categorized based on whether they effect the salience of the sense (how clear and prominent it is in a given interpretation) or its discreteness (related to issues of demarcation).
As the framework depicted in the above table would predict, it has been found that people tend not to categorize objects using necessary and sufficient features, but rather do so by comparing their similarity to a prototype of the candidate category (Rosch, 1975(Rosch, , 1978. As such, when judging whether a cluster of senses in a given context constitutes a school or not, comprehenders will presumably compare this structure to their stored prototypical representation. Yet, the main theoretical difficulty in discussing this topic surrounds the issue of what precisely constitutes the prototype. As Gries (2015, p. 473-474) notes, the prototypical sense of a word could be the most frequent, most salient, most concrete, or the earliest acquired sense, and it is furthermore likely that the criteria of prototypicality differs across nominal classes. As such, there seems little fundamental connection between prototype structure and models of polysemy; rather, the prototype is a representation likely independent of semantic or pragmatic processes (Murphy, 2016) and, the criteria for being a prototype likely differs across word classes and conceptual domains. As Geeraerts (1989) also proposed, there are certain concepts (for example, INSTITUTION-related polysemous nominals) which may not be able to be defined through a set of necessary and sufficient features, and which exhibit a semantic structure which assumes the form of a set of clustered and overlapping interpretations. For example, "The school with large windows starts at 9 a.m. and has a strict headmaster and unruly students" contains a number of clustered senses being attributed to a single nominal. However, it may be the case that one of these senses is more salient and prototypical than the others, but intuition alone does not seem a powerful enough measure to expose this. As such, behavioral data is needed.
All of these effects are clearly present in a particular instantiation of polysemy known as copredication, whereby multiple, semantically different sense types are associated with a single nominal ("Lunch was delicious but was delayed"; "The newspaper that I held this morning has been sued"). Progressing on from recent research into the acceptability dynamics of copredication (Murphy, 2019(Murphy, , 2021a and possible lower-level accounts (Murphy, 2018(Murphy, , 2020, I will investigate what I will term the persistence conditions of copredication, making close contact with the nature of prototypical copredications. The main research question addressed will be: What are the conditions under which the identity of a given entity can survive? This will act as a refined, controlled version of the classical Ship of Theseus paradox and the river paradox of Heraclitus. Crucially, the very notion of persistence conditions more readily lends itself to forms of polysemy involving copredication, since the notion of copredication is rooted in a sense of semantic conflict and incompatibility, rendering the construction of scenarios involving some aspect of competition between senses feasible. In terms of focal predictions, the persistence conditions of these entities could be primarily determined by the number of senses being referred to in the discourse. This would suggest that sense number renders the ongoing representation of the entity salient, supporting its ultimate representational perseverance. Alternatively, one particular sense may more strongly predict how the object persists, such that, for example, the institutional sense of school determines its persistence, and not any other sense (e.g., a school might not be conceived as a building with an institution, but an institution with a building). One might relate these predictions to certain existing models of polysemy. For example, the Sense Enumeration Lexicon Hypothesis maintains that distinct senses of polysemous words like school are in fact represented as separate lexical entries, such that the persistence of any individual sense would be predicted not to be directly reliant upon any other sense, since these are lexically independent (supporting the sense number prediction) (Lehrer, 1990;Foraker and Murphy, 2012). On the other hand, the One Representation Hypothesis maintains, broadly speaking, that a word like school has multiple, underspecified representations connected to a single lexical entry, and would more directly be related to the sense type prediction (Frisson, 2009(Frisson, , 2015. More specifically, Löhr's (2021) distinction between rich and thin semantic representations of polysemy (both of which are in accord with the One Representation Hypothesis) are relevant here. The thin view maintains that one specific sense forms the core meaning, around which other senses are clustered, while the rich view sees multiple senses forming distinct contributions to lexical meaning whilst still maintaining the existence of a single lexical entry. For more extensive discussion of polysemy storage and processing models (see Carston, 2016Carston, , 2019Vicente, 2018;Ortega-Andrés and Vicente, 2019).

Materials
An online acceptability judgment experiment was carried out using Qualtrics 1 and sourcing participants from Prolific Academic (prolific.ac). The purpose of the experiment was to present participants with scenarios involving a nominal that licenses copredication (e.g., bank, city) and ask them to make a forced choice to determine which outcome of the scenario (and, hence, which combination of predicates) appears to be the most prototypical, faithful realization of the nominal. In order to achieve this, an additional manipulation was added, such that the number of senses hosted by each choice was either equal (2 senses choice vs. 2 senses choice) or unequal (1 sense choice vs. 2/3 senses choice). All participants saw all trials across both types.
A core question of interest was whether the INSTITUTION sense strongly determined the option chosen by participants, or whether the number of senses predicted this. Addressing this question required balancing the number of scenarios in which the institution sense appeared either in isolation or with other senses. As such, in three scenarios the institution sense was isolated, in three scenarios it appeared with other senses while a different sense (e.g., PHYSICAL) was isolated, and the remaining six scenarios contained an equal number of senses.
The scenarios involved a two-alternative forced choice between qualified (licensed/acceptable) copredications of differing types, with participants being asked to choose Which is the bank? or Which is the city? 12 narratives were constructed, 6 of which presented two options exhibiting an equal number of senses, and the other 6 presented two options with an unequal number of senses. The narratives themselves exhibited all of the senses in the choices for participants. The central question which arises here is: What determines the more popular choice amongst participants? One possibility is sense frequency, i.e., participants will choose whichever option hosts the most frequent sense. Another possibility is sense complexity, with the most semantically complex sense (the one able to host the greatest number of senses and which is related to the greatest number of "core knowledge systems" or cognitive modules; Carey, 2009) determining true or prototypical objecthood for a given nominal.
Narratives of the following type were constructed (see Supplementary Appendix for full list). In the experiment, each narrative was followed by a choice: "Which is the X?": Library: A library catches on fire and is shut down. A new building across the street with self-service machines is built to 1 qualtrics.com/uk help the public take out books. However, the employees of the original building protest and insist that the old library can simply be repaired, and refuse to hand over most of the books to the new building.
Choice: Old building-New building Senses: Physical, People-Physical, Process Village: The King of a medieval village becomes corrupt and so the folk stage a rebellion, burning down the whole place in the process. Taking the village's original architects and main political leaders with him, the King relocates to a new site to accurately rebuild it. The village's entire population, however, move to a different site and also bring with them one of the original architects who helps them rebuild.
Choice: King's site-People's site Senses: Physical, Polity-Physical, Populace Six fillers were also used to ensure participants were paying attention, with these narratives having clearer and more obvious answers. These were of the following kind: Sandwich: John decides to make a sandwich. He slices it in half, begins to eat the first half, but then finds the bread very hard and difficult to swallow. He decides to take the second half and blend it into a smoothie.

Procedure
Scenarios were presented as single paragraph blocks, over a white background. Scenarios were presented in a random order. Below each paragraph, two options were presented corresponding to either choice, and participants were tasked with selecting their choice. After the presentation of the two-alternative forced choice, the screen was refreshed and participants were tasked with selecting one option from a 1-5 confidence metric, introduced by the question "How confident are you about your choice?" This allows differentiation between cases in which participants strongly believed in their choice, and cases when they were more ambivalent. Finally, to ensure that participants paid attention, they were forced to explain their reasons for selecting either option.
Participants 79 native English speakers (average age: 36, range: 20-60, 60 female) took part in the experiment, sourced from Prolific Academic and having an approval rating on the site of at least 90%. Participants were paid £6 per hour, with the average finishing time being 10 min. This study was approved by the UCL Research Ethics Committee and participants consented immediately prior to the experimental procedure to their recording responses being used for academic research purposes.

Confidence Scores
The average confidence score for the fillers was 4.2, and was lower for the experimental items (3.6). Confidence scores for the experimental items ranged from 3.01 (city) to 4.06 (bank), suggesting that the responses can be taken as an accurate and genuine reflection of the participant's semantic intuitions. Confidence scores for experimental items can be found in Table 2.

Sense Type Score
Participants scored correctly on most fillers, and the minority of deviations (25/474 total responses) provided well-reasoned explanations. The results for the experimental narratives are plotted in Figure 1.
A multiple regression analysis was conducted using SPSS 24 with Sense Equality (Equal vs. Unequal) and Nominal (1-12 of the nominal list) as independent variables and Sense Type (Institution vs. Non-Institution) as the score. Sense Equality and Nominal significantly predicted Sense Type [F(2, 945) = 19.659, p < 0.001]. While Sense Equality added statistical significance to the prediction (p < 0.001), Nominal did not (p = 0.438). The average Sense Type score for the Unequal nominals was 0.59 (with 1 being 100% Institution sense chosen) and was 0.39 for the Equal nominals.

DISCUSSION
The results suggest that adding a greater degree of sense variability to each decision (e.g., Village #1 vs. Village #2) results in a more stable role for the institutional sense in determining objecthood. Participants were more willing to select a Non-Institution sense when the number of senses was matched across choices, but when the number of senses was unequal this seems to have exposed the possibility that the institutional sense was somehow more primary and essential, since when the number of senses is equal the (putative) primacy of the institutional sense would be harder for participants to detect, only becoming clear when they were forced to make a more stark choice between the institutional or non-institutional senses.
Comparing the present results to those from a previous frequency experiment (Murphy, 2019) allows us to determine whether the results were modulated by sense dominance. For village and town, PHYSICAL is the most frequent sense (based on results reported in Murphy, 2019Murphy, , 2021a, which initially appears to predict the present results, yet participants made their decision based on the PEOPLE sense, not PHYSICAL, since both options hosted a PHYSICAL sense, and indeed both these decisions may have been influenced more by the POINT OF ORIGIN than by PEOPLE. Either way, it is difficult to determine whether frequency was the determining factor. PHYSICAL is also the dominant sense of factory, and two new locations were part of this narrative so POINT OF ORIGIN played no role, and both choices also hosted a PHYSICAL sense, but as with village and town the PEOPLE sense appears to have determined the outcome; that is, the sense semantically closest to INSTITUTION (and also the most abstract sense) determined the outcome. While the participant's choices for company, school and university on average matched the dominant sense of these nominals (INSTITUTION), twice as many other nominals (province, city, library, church, shop, bank) deviated from this.
In conclusion, the best predictor of the results was not sense frequency, but rather sense complexity or approximation to INSTITUTION. It was reasonable to hypothesize that frequency would have had at least some influence, not least because the present experiment was explicitly oriented toward pitting two entities against each other, with prototypicality likely being one of the best guides for participants to make a judgment. Yet, even under these circumstances, sense frequency was not the determining factor.
It might also initially appear that persistence conditions could more easily be explored by giving participants stages of decomposition, such that parts of a city (its people, institutions, etc.) are removed one-by-one in a given scenario, with participants being asked "Is this a city?" every step of the way. However, since objects can exist with only a single polysemous sense persisting this proposal would likely result in all participants (rightfully) selecting "Yes" up to the point when the last sense remains. As such, not only would this design not speak to the question of core, essential senses of complex polysemous nominals, it would also likely result in all participants claiming that each nominal survived the full destructive process simply because such a process would necessarily be a "senseby-sense" level of destruction, since this is the only level of granularity one can operate at (with the only other possibility being manipulating pragmatic factors such as the context that the object denoted by the nominal found itself in, in which case we are back to the original design of the present experiment).
One potential objection to the present study is that it did not control for pragmatic factors, such as the tendency for participants to "side" with the underdog against a putative antagonist, encouraging them to choose "People's location" over "King's location" for the village narrative. However, this seems unlikely since both village and town contain a clear "antagonist" (the corrupt King representing the INSTITUTION sense and the violent gangs representing the POPULACE sense) but this did not predict responses, since participants on average chose the ("good") POPULACE-PHYSICAL senses for village but the ("bad") POPULACE-PHYSICAL senses for town, even when the PHYSICAL sense of town was described in the narrative as being doomed for demolition. Likewise, the majority of participants , where POLITY is subsumed as an institutional sense and narratives involving no explicit institutional sense were sorted such that the sense most approximate to INSTITUTION acted as an institutional proxy (e.g., POPULACE/EMPLOYEES is more semantically related to INSTITUTION than PHYSICAL). opted for the "antagonist" PHYSICAL-INSTITUTION senses for university rather than the PHYSICAL-POPULACE senses. The notion of POINT OF ORIGIN therefore seems to play a role in individuating institutional entities like town, as it also does for school, company and shop-but not for bank, seemingly due to FUNCTION trumping POINT OF ORIGIN. For Pustejovsky (1995), these two notions of FUNCTION and POINT OF ORIGIN are, respectively, referred to as the TELIC and AGENTIVE Qualia roles. It appears that, much as how Pustejovsky originally claimed, different Qualia roles are foregrounded across different nominals based on context-although, going somewhat beyond this, the present experiment also suggests that the relations of these Qualia roles might exhibit a more robust, generalizable structure or hierarchy, such that the telicity of bank is foregrounded not simply due to context/narrative, but because of the internal structure of the senses which compose the (highly) polysemous nominal. This hypothesis ties in with Lang and Maienborn's (2011, p. 719) interesting proposal that institutional nominals appear to have a common PURPOSE semantic feature (essentially Pustejovsky's TELIC role), with the senses enveloping schools and banks and shops ultimately being centered on a core lexical meaning: "[A] legal entity that organizes purposeful events to be performed and/or received by authorized groups of persons in specific locations." Nevertheless, it is remains possible that certain pragmatic factors were a confounding factor (e.g., narrative perspective; coherence relations), and future research should aim to more carefully examine this possibility.

CONCLUSION
This experiment has shown that specific semantic and pragmatic factors enter into judgments about the persistence of polysemous entities in complex scenarios. These results provide support for a version of the One Representation Hypothesis of polysemous lexical representations, through which different senses of polysemous words can be navigated around a mereologically focal and essential representation. As such, Löhr's (2021) thin view of polysemy representation appears to be supported, although the specific sense which forms the thin "core" seems to vary across nominals. Further research is required to more systematically relate Löhr's (2021) model to psycholinguistic concerns. The present findings do not directly support Vicente's (2017) claim that the INSTITUTION sense is necessary for the persistence of these nominal types, nor do they support (Arapini's, 2013(Arapini's, , 2015 belief that the multiple senses of institutional entities are "clustered in a symmetric structure" (2013, p. 35), since there is variability in spite of the very strong trend in INSTITUTION-dominance. While there may in fact be no such thing as a core, essential sense for any of the nominals discussed (with each nominal being a cluster of senses with pragmatic factors determining which one is brought to the fore), the results suggest that there is considerable variation in the level of INSTITUTION-dominance the sense-cluster of each nominal exhibits.
Future experiments involving a larger range of scenarios could introduce additional factors to test, such as sentence type/syntax, or the presence of coherence relations between components of the scenario. Addressing the issue of pragmatics, narrative frames could also be kept consistent across nominals; statistical power could be boosted by increasing the number of nominals; and all nominals could be presented under both equal/non-equal sense number conditions, to more directly test the extent to which variations across nominals impacts persistence.

DATA AVAILABILITY STATEMENT
All datasets generated for this study are included in the article/Supplementary Material.