Great apes can defer exchange: a replication with different results suggesting future oriented behavior

Osvath, Mathias; Persson, Tomas

doi:10.3389/fpsyg.2013.00698

ORIGINAL RESEARCH article

Front. Psychol., 02 October 2013

Sec. Comparative Psychology

Volume 4 - 2013 | https://doi.org/10.3389/fpsyg.2013.00698

Great apes can defer exchange: a replication with different results suggesting future oriented behavior

Mathias Osvath*

Tomas Persson

Department of Cognitive Science, Cognitive Zoology, Lund University, Lund, Sweden

The topic of cognitive foresight in non-human animals has received considerable attention in the last decade. The main questions concern whether the animals can prepare for upcoming situations which are, to various degrees, contextually or sensorially detached from the situation in which the preparations are made. Studies on great apes have focused on tool-related tasks, e.g., the ability to select a tool which is functional only in the future. Dufour and Sterck (2008), however, investigated whether chimpanzees were also able to prepare for a future exchange with a human: an object exchanged for a food item. The study included extensive training on the exchangeable item, which is traditionally not compatible with methods for studying planning abilities, as associative learning cannot be precluded. Nevertheless, despite this training, the chimpanzees could not solve the deferred exchange task. Given that great apes can plan for tool use, these results are puzzling. In addition, claims that great ape foresight is highly limited has been based on this study (Suddendorf and Corballis, 2010). Here we partly replicated Dufour and Sterck's study to discern whether temporally deferred and spatially displaced exchange tasks are beyond the capabilities of great apes. In addition to chimpanzees we tested orangutans. One condition followed the one used by Dufour and Sterck, in which the exchange items, functional only in the future, are placed at a location that freely allows for selections by the subjects. In order to test the possibility that the choice set-up could explain the negative results in Dufour and Sterck's study, our second condition followed a method used in the planning study by Osvath and Osvath (2008), where the subjects make a forced one-item-choice from a tray. We found that it is within the capabilities of chimpanzees and orangutans to perform deferred exchange in both conditions.

Introduction

The last decade has seen a number of studies on episodic-like memory and foresight in animals, primarily on corvids and great apes (e.g., Clayton and Dickinson, 1998; Raby et al., 2007; Osvath and Osvath, 2008; Martin-Ordas et al., 2010). The positive results in several of these studies suggest that the underlying cognitive system, to some extent, can be compared to the human episodic system. The human system provides the ability to remember events and cognitively construct potential future events from a subjective perspective, often including contextual elements such as “when,” “where,” and “what” information. The episodic system is usually contrasted with the semantic system, which is also declarative, but concerns general knowledge unrelated to an explicit event. (For a review on the episodic system see e.g., Szpunar, 2010).

Regardless of whether there is an episodic component associated with the future directed behaviors of the animals in question, it remains important to study such behaviors in detail. Future directed behaviors exhibited in such studies appear difficult to explain by merely associative learning of key stimuli, or by rigid mechanisms, such as fixed action patterns (e.g., Raby and Clayton, 2009; Osvath, 2010). Cognition underlying future directed behavior, which seem not purely governed by the law of effect or innate responses, epitomizes some of the hardest problems within cognitive science: how matter (i.e., the brain) can be about something that does not yet exist (i.e., the future). Nevertheless, we know with some certainty that many brains can do this, e.g., all those of normal adult humans.

Current methods in the research on animal cognitive foresight are influenced by views forwarded by Wolfgang Köhler in the 1920s. Köhler studied the cognition of chimpanzees, and described his observations of chimpanzees anticipating events that were “planned acts of the animal itself” (Köhler, 1921). In the cases Köhler studied, however, the rewards were always visible. That is, a key stimulus of the goal of the planning action was available for sensory feedback. Köhler argued that it would be an even bigger achievement if the chimpanzee could make preparations for events that were not yet within sight. Köhler suggested an experimental protocol for such a study: a two-room paradigm, in which one room contains a reward, and the other room holds the means of getting the reward. Access to the rooms is temporally separated.

Today, there exist only a few studies on the abilities of great apes to act toward future goals where the goals are outside the animal's current sensory scope. (Mulcahy and Call, 2006; Dufour and Sterck, 2008; Osvath and Osvath, 2008; Osvath, 2009; Osvath and Karvonen, 2012). (For studies on corvids see Correia et al., 2007; Raby et al., 2007; Cheke and Clayton, 2012, and on monkeys Bourjade et al., 2012; Dekleva et al., 2012). The experimental studies have roughly followed Köhler's protocol with the two-room paradigm. Two studies have focused on great ape abilities to select, transport and save tools that are useful only in a future context (Mulcahy and Call, 2006; Osvath and Osvath, 2008). An additional study, on chimpanzees, included the same tool-using paradigm, but also, and mainly, investigated conditions based on the ape having to select an item that after a delay of 1 h could be exchanged for a food reward from a human (Dufour and Sterck, 2008). This study relied heavily on training the animals to exchange a certain object type. It might therefore not be regarded as a planning study in a tradition in which an exclusive reliance on associative learning is precluded. This does not make the results less interesting, however.

The two studies solely based on tool use showed that chimpanzees, bonobos and orangutans are capable of selecting an appropriate tool well in advance of its use. Bonobos and orangutans could keep the tool overnight (14 h per trial) (Mulcahy and Call, 2006). One of the studies showed that orangutans and chimpanzees could disregard an immediate, but smaller, favorite reward in favor of the tool that offered the means of attaining a future, larger, reward (Osvath and Osvath, 2008). Furthermore, this study controlled for whether the apes behaved toward the tool as if it was merely a reinforced stimulus, concluding that associative learning alone could not explain the results (see also Osvath, 2010).

The study by Dufour and Sterck (2008), concentrating primarily on future exchange in chimpanzees, yielded puzzling results. In the tool-using condition, which was a replication of one part of the studies described above (i.e., Mulcahy and Call, 2006), the chimpanzees were successful. The subjects did, however, not succeed in selecting a heavily reinforced item that was usable in a future exchange with a human experimenter. Suddendorf and Corballis (2010) have forwarded that these results show that great ape foresight is surprisingly poor. Regardless of whether this is a correct assumption or not, there are at least two reasons why these results are noteworthy.

The first reason is that the results of Dufour and Sterck (2008) confirm that great apes do not merely rely on associative learning of a target item in future directed tasks. The item designated for the future exchange was reinforced a high number of times in an immediate context, i.e., in training on exchange. It was then tested and confirmed that the subjects understood its token status. Despite this extensive training in an immediate context, the chimpanzees failed to perform the exchange when a delay was introduced between the presentation of the item and the exchange event. That is, they did not “blindly” collect the objects with the most reinforcement history, which would be the prediction if associative learning of the target item alone explains the results in some of the above tests on future tool-use.

Secondly, the results might indicate that the task of deferred exchange represents a domain where future-directed cognition in chimpanzees is restricted. The authors of the study speculate that it might be a result of the cooperative nature of the task (“I give you what you want, and you give me what I want”), which is a context that in general has been suggested to be more cognitively demanding for chimpanzees than competitive contexts (e.g., Hare and Tomasello, 2004). Indeed, other studies suggest that chimpanzees can plan for agonistic encounters, and even plan for deception (Osvath, 2009; Osvath and Karvonen, 2012). The authors further consider that the exchange task might have further types of social complexity built into it that can be difficult for chimpanzees. For example, memories of the human exchange partner's reaction, e.g., as “unwilling” to give food, in cases where the chimpanzee failed to bring the correct item, might interfere with the memories of one's own choices that brought about this response. That is, it can be difficult for the animal to connect the events into a correct causal chain.

It is also important to mention that a recent study on brown capuchins (Cebus appella) and Tonkean macaques (Macaca tonkeana), using the same paradigm on future exchange, also yielded negative results (Bourjade et al., 2012). It is not clear, however, whether these results reflect the same factors that made the chimpanzees fail. Another recent study on monkeys, which to some extent was a replication of Osvath and Osvath (2008), showed that long-tailed macaques (Macaca fascicularis) would not select, transport and use a tool for a future purpose, not even if they received immediate cues of the reward, unless being subjected to extensive shaping (Dekleva et al., 2012). Thus, it may be that monkeys differ from great apes in their cognitive and/or learning systems underlying future-directed behavior.

Exchange tasks as such, when not deferred, usually pose few problems for great apes. The exchange of items with a human for food rewards typically develops spontaneously in chimpanzees (e.g., Hyatt and Hopkins, 1998; Brosnan and de Waal, 2005). Something that seems to require more explicit training is learning the relative values of exchangeable items (e.g., Brosnan and de Waal, 2005). Even when the reward differences are maximized (i.e., reward vs. no reward), the learning of differentially valued exchange items is not immediate (see e.g., training in Pelé et al., 2009). A further complicating factor is that an exchange situation with a social counterpart involves more than the value of items, such as judgments of the prospect of adequate reward. For example, chimpanzees typically hand out objects in an exchange only when solicited by a human, and not in the complete absence of one (Hyatt and Hopkins, 1998). That bartering is socially modulated is especially well-illustrated in studies of inequity aversion in capuchin monkeys (Cebus apella) (van Wolkenten et al., 2007) and chimpanzees (Brosnan et al., 2010), where a previously successfully exchanged token may be discarded, seemingly in protest, in response to the more favorable exchanges taking place between the experimenter and another subject. That the previous attempt to establish deferred exchange in chimpanzees failed might thus have its basis in social modulation, or the lack thereof, rather than the future directedness of the activities as such. A replication is therefore warranted.

In order to discern whether the ability for deferred exchange with humans is outside the cognitive scope of great apes, the current study aimed to replicate the main experiment of Dufour and Sterck (2008) with subjects with everyday experience of direct exchange with humans. We also added orangutans (Pongo abelii) to the pool of chimpanzees, in order to phylogenetically trace the abilities to the most distantly related great ape species to humans and chimpanzees, and to compare with the results in Osvath and Osvath (2008). Two of the subjects (one chimpanzee and one orangutan) in the current study participated also in that study. Two main conditions were used: (1) one-item-forced-selection, and (2) multiple-items-free-selection.

Condition (1), used in Experiment 1, followed the item selection procedure used in a study on planning for future tool-use by Osvath and Osvath (2008). The subject was offered to select only one of four items from a selection tray operated by a human experimenter. In the current study, one of these items had previously been reinforced in immediate exchange training. The three other objects served as distractors.

Condition (2), used in Experiment 2, was similar to the selection procedure used in Dufour and Sterck (2008), which in turn followed the procedure in Mulcahy and Call (2006). The four items were placed on the floor in a compartment, which was later opened to allow access for the subject to enter and collect any number of items. In this condition no humans were present during the selection opportunity.

This division of conditions was used because of the possibility that the different procedures might influence the results. That is, if the apes would succeed in condition (1) but not in condition (2), then the negative results in Dufour and Sterck (2008) might be explained by the method. One of the reasons for assuming a possible difference is that the performance of the apes in Osvath and Osvath (2008) seemed slightly better than in Mulcahy and Call (2006) in which multiple-items-free-selection was used. Arguably, the situation in which the animal gets one trial to choose only one item might be clearer or less distracting to the animal, than when faced with the opportunity to select several items during an extended time. The task can be said to be more structured, or “clean,” in condition (1). But despite these differences apes seem to be able to perform at a significant level when it comes to tool-using tasks in both conditions. So there likely are additional aspects of complexity when it comes to deferred exchange. One candidate factor is that the human who is present during the object choice in condition (1) represents a bartering partner, which might elicit an expectation of an exchange. Or, in a similar vein, the human constitutes a cue for the future situation where a human will also be present; so called cued recall (Berntsen et al., 2013) In condition (2), on the other hand, the presence of an object on its own, with its history of being functional in a social context, might not evoke the same actions without a triggering social cue. Alternatively or additionally, selecting in front of a human could be a form of explicit communication where the subject expresses a desire (for similar ideas on selecting for exchange in front of a human see Brosnan and de Waal, 2005). For these reasons we predicted that condition (1) would constitute a situation in which the apes had it easier to solve the task. Finally, irrespective of condition, perhaps physical contexts, like reward apparatuses, represent more concrete “futures” for an ape, than do the variable presence of humans.

Preference Testing

Before training was undertaken, subjects were tested in a selection procedure for their potential spontaneous preferences for the different items. This was done to make sure that the subject actually learned the value of the exchangeable item in the training phase. If the subject would have a spontaneous preference for the item designated for exchange, then it could superficially pass the learning criterion (see below). That is, the reason for the selection of the correct item could be the result of a spontaneous preference and not a response to training.

Materials and Methods

Ethics statement

All procedures were performed in compliance with relevant laws and institutional guidelines. Participation was voluntary and testing was approved by Uppsala regional ethics committee (approval no. C356/9). The Swedish Agricultural board (No. 31-2599/09) has approved Furuvik Zoo as a cognitive research facility on chimpanzees and orangutans.

Subjects

Two chimpanzees (Pan troglodytes) and two orangutans (Pongo abelii) participated in the study. Both chimpanzees were females, Manda and Maria-Magdalena. The orangutans consisted of one male, Naong, and one female, Dunja.

Subjects were tested at Lund University Primate Research Station in Furuvik Zoo. At the time of testing, the participants had experienced few previous experimental tests, and none requiring object exchange. One chimpanzee, Maria-Magdalena, and one orangutan, Naong, had previously taken part in a planning experiment involving selection of items from a tray (Osvath and Osvath, 2008), and the chimpanzee Manda had experience of choice procedures from participation in an object-choice task (Zlatev et al., 2013). In addition, Naong had extensive experience of various object-choice procedures, requiring the selection of items (unpublished). All subjects had frequent exchange experience outside of testing.

The individuals were tested in their caretaking compartments, as well as in larger indoor areas. No public visitors were present at the time of testing. No changes in feeding procedures were made and access to water was continuous. Some changes in indoor housing routines were made to minimize disturbance from group members.

Materials

Four different objects were used in the selection procedures. A piece of blue plastic rope, a piece of jute cloth, a wooden rod, and a strip of bent metal (see Figure 1). The metal was the object later chosen to serve as target item in the exchange tasks (see below). The items were placed equidistantly on a 60-centimeter wide selection tray.

FIGURE 1

Figure 1. Choice items used throughout the study. From left to right: wooden rod, jute cloth, plastic rope, metal strip (the exchangeable item). The ruler units are in centimetres.

Procedure

Each subject received 15 presentations of the tray baited with the four objects. The tray was slid toward the subject and was then retracted as soon as a selection had been made, thereby allowing the subject to choose only one item. A choice was scored when the animal touched or grabbed one of the objects. If the tray was retracted before the subject had managed to grasp the object, the one touched by the subject was handed over. The criterion for a “refusal” to select was when the subject entered the selection situation, looked at the items but did not attempt to touch any of them before leaving.

All subjects but the orangutan female had previous experience of this type of procedure. The order of the items was pseudo-randomized between trials. The experimenter handling the tray did not gaze at the items but slightly above or at the face of the ape.

Results

Naong selected the piece of rope 3 times, the wooden rod 4 times, the jute cloth 1 time, and the piece of metal 3 times; in 4 trials he refused to select. The other orangutan, Dunja, selected the wood 2 times and refused to select in the rest of the trials. Manda selected the wood 12 times, the rope 2 times, and the jute 1 time. Maria-Magdalena selected the wood 12 times, the rope 2 times and refused to select once. No spontaneous preference thus existed for the metal strip at the onset of training for any of the subjects.

Training

Procedure

Subjects and materials were identical to the preference testing described above. The four subjects were trained until able to reliably exchange the target item (the metal strip) in a direct setting with no time delay. On the initial trials only the target item was placed in the enclosure and experimenter pointed to this and requested the “grunka” (Swedish for “the thingamajig”). Later all items were placed simultaneously into the enclosure and, pointing if needed, the experimenter asked for the “grunka.” On successful exchanges the subject was rewarded with verbal praise and a food item consisting of approximately a fifth of a banana. If the subject handed the wrong item back, the experimenter again pointed to the target item and asked for the “grunka.” In later learning trials pointing was phased out when the subject collected the target item without coaxing.

Successful learning was corroborated in tests of learning, in which 4 out of 5 trials had to be correct, which was followed by a second test (a test of retention) also requiring 4 out of 5 successful exchanges. Tests of learning and first test of retention were given on two consecutive days (except for Dunja who, due to practical reasons related to housing, received her first test of retention later the same day as she met the learning criteria).

Additional retention tests, which functioned as warm ups, were given for each new day of participation in the experimental conditions. These did not always amount to a full 5 trials before testing started, depending on how well the subject performed and/or on how motivated the animal appeared on the particular day.

In the tests of learning and retention the subject was not cued by pointing or gaze toward a particular choice object, instead the experimenter looked directly at the ape or at a point beyond it. The items were presented in a different area than the exchange. Tests of learning and tests of retention, all took place by presenting the items onto the cage floor, i.e., they were not given on a selection tray. The exception was for the female orangutan subject (Dunja) who had little previous experience of choosing from a tray, and appeared to be wary toward the tray. Thus, she received additional training trials using the selection tray. By using a test of learning and a retention test with a set criterion (80% success), we could make sure that all the subjects could perform the task in an immediate setting.

Results

All subjects swiftly learned to hand out the target item at the expense of the other items in exchange for food (and verbal praise). The average number of trials required before we decided to test their learning against the criteria was 8.5 ± 3.9 (5–14 trials). All subjects met, and exceeded, the criteria in the subsequent test of learning: 5 out of 5 trials. All subjects also met, and exceeded, the criteria in their first test of retention: 5 out of 5 trials.