Complexity and Competition in Appetitive and Aversive Neural Circuits

Barberini, Crista  L.; Morrison, Sara  E.; Saez, Alex; Lau, Brian; Salzman, C.  Daniel

doi:10.3389/fnins.2012.00170

ORIGINAL RESEARCH article

Front. Neurosci., 26 November 2012

Sec. Decision Neuroscience

Volume 6 - 2012 | https://doi.org/10.3389/fnins.2012.00170

Complexity and Competition in Appetitive and Aversive Neural Circuits

CL
Crista L. Barberini ¹
SE
Sara E. Morrison ²
AS
Alex Saez ¹
BL
Brian Lau ³
CD
C. Daniel Salzman ^1,4,5,6,7^*

1. Department of Neuroscience, Columbia University New York, NY, USA
2. Department of Psychiatry and Behavioral Science, Albert Einstein College of Medicine Bronx, NY, USA
3. Centre de Recherche de l’Institut du Cerveau et de la Moelle Épinière Paris, France
4. Department of Psychiatry, Columbia University New York, NY, USA
5. Kavli Institute for Brain Sciences, Columbia University New York, NY, USA
6. W. M. Keck Center on Brain Plasticity and Cognition, Columbia University New York, NY, USA
7. New York State Psychiatric Institute New York, NY, USA

Article metrics

View details

Citations

8,2k

Views

2,3k

Downloads

Abstract

Decision-making often involves using sensory cues to predict possible rewarding or punishing reinforcement outcomes before selecting a course of action. Recent work has revealed complexity in how the brain learns to predict rewards and punishments. Analysis of neural signaling during and after learning in the amygdala and orbitofrontal cortex, two brain areas that process appetitive and aversive stimuli, reveals a dynamic relationship between appetitive and aversive circuits. Specifically, the relationship between signaling in appetitive and aversive circuits in these areas shifts as a function of learning. Furthermore, although appetitive and aversive circuits may often drive opposite behaviors – approaching or avoiding reinforcement depending upon its valence – these circuits can also drive similar behaviors, such as enhanced arousal or attention; these processes also may influence choice behavior. These data highlight the formidable challenges ahead in dissecting how appetitive and aversive neural circuits interact to produce a complex and nuanced range of behaviors.

The Importance of Learning to Predict Reinforcement for Punishment-Based Decision-Making

The decision-making process – arguably one of the most important “executive” functions of the brain – can be influenced by a variety of different types of information and motivators. Punishment-based decisions constitute an important subcategory that is common to a wide phylogenetic range, from nematodes to rodents to humans. Studies old and new have shown that punishment engages brain systems specialized for processing aversive information (Seymour et al., 2007). Historically, these systems have been studied most frequently in rodents, and this work has revealed many aspects of the neural mechanisms driving behavior elicited by the threat of aversive stimuli (Davis, 1992; LeDoux, 2000). In everyday life, however, decisions typically require integrating information about potential punishments and rewards, as well as myriad factors such as external environment and internal drives. This is especially true in primates, as they exhibit particularly complex behavioral repertoires.

Rewards and punishments are reinforcers with opposite valence (positive versus negative), and they often drive behavior in opposite directions – e.g., approaching a rewarding stimulus or avoiding a threat. Moreover, punishment-based decisions are often made in a context in which rewards and punishments are both possible consequences of an action; therefore, brain systems processing aversive information must interact with brain systems processing rewards – interactions that presumably underlie how punishments and rewards compete to drive behavior and decision-making. Scientists have long appreciated these facts and have often posited that appetitive and aversive systems operate in an “opponent” manner (Konorski, 1967; Solomon and Corbit, 1974; Dickinson and Dearing, 1979; Grossberg, 1984; Daw et al., 2002). However, appetitive and aversive stimuli also have certain common attributes – e.g., they are both usually more salient than non-reinforcing stimuli – and thus appetitive and aversive systems need not always act in opposition to each other. Rather, stimuli of both valences may mediate a number of processes, such as enhanced arousal or enhanced attention to stimuli predictive of reinforcement (Armony and Dolan, 2002; Anderson, 2005; Lang and Davis, 2006; Phelps et al., 2006; Brosch et al., 2008; Ilango et al., 2010; Pinkham et al., 2010; Anderson et al., 2011).

Punishment-based decisions are generally choices that are based on one or more prior experiences with an aversive outcome. Typically, an organism learns that a sensory cue predicts a possible negative outcome – e.g., the taste of spoiled food precedes illness – and later must decide what to do to avoid or defend against that outcome. Thus, learning to anticipate negative outcomes is an essential skill for subsequently being able to make optimal decisions in the face of possible punishment. This is also true for rewards: the adaptive response is to acquire the reward, rather than avoid it, but anticipation is critical in both cases.

Because accurately predicting reinforcement – whether punishment or reward – plays such a vital role in decision-making, our work has focused on understanding the neurophysiological processes whereby the brain comes to predict reinforcement as a result of learning. We have sought to understand where and how signals in the brain represent anticipated positive or negative outcomes, and whether those signals occur at a time and in a manner such that they could be used as input to decision-making processes. We have often referred to these signals as value signals. Although our published studies have not characterized these signals during an explicit decision-making task, the tasks we employed do provide measures that appear to co-vary with the amount and type of the reinforcement associated with a stimulus (Paton et al., 2006; Belova et al., 2007, 2008; Salzman et al., 2007; Morrison and Salzman, 2009, 2011; Morrison et al., 2011). We believe that the value of anticipated possible outcomes often drives behavior, and the estimation of value may be computed on-line during decision-making by taking into account expected potential reinforcement as well as a variety of internal variables (e.g., hunger or thirst) and external variables (e.g., how difficult a reward would be to acquire; Padoa-Schioppa, 2011). We refer to the circuits that process and generate appetitive and aversive reinforcement predictions as value processing circuits, although in some cases work remains to be done to understand how different internal and external variables impact representations of reinforcement predictions.

Where in the brain does processing about reinforcement predictions occur? Early work indicated that the amygdala, a key structure in the limbic system, plays a central role in processing one of the primary negative emotions, the fear elicited by a stimulus predicting aversive consequences. Seminal fear conditioning studies in rats found that both learning and memory of fearful events required an intact, functional amygdala (Davis, 1992; LeDoux, 2000; Maren and Quirk, 2004). Since then, it has become clear that the purview of the amygdala extends beyond fear to include other emotions, including positive ones (Holland and Gallagher, 1999; Baxter and Murray, 2002; Everitt et al., 2003; Paton et al., 2006; Belova et al., 2008; Morrison and Salzman, 2010; Salzman and Fusi, 2010). These results suggest that the amygdala may carry signals related to the computation of both positive and negative value.

How do amygdala signals come to impact behavior? The amygdala is heavily interconnected with many other areas of the brain, providing an array of anatomical pathways by which it can participate in learning and decision-making. It receives input from multiple sensory modalities (McDonald, 1998; Amaral et al., 2003; Freese and Amaral, 2005), which accords with the amygdala’s established role in associative learning; information from predictive sensory cues converges with input about reinforcing outcomes at the single cell level (e.g., Romanski et al., 1993). Furthermore, lesions of the amygdala impair reinforcer devaluation (Baxter and Murray, 2002; Izquierdo and Murray, 2007), indicating that the amygdala plays a role not only in learning reinforcement contingencies, but also in adjusting these representations as the value of associated reinforcement outcomes changes.

Although the amygdala participates in learning stimulus-reinforcement associations that in turn may be utilized and adjusted during decision-making, it does not act alone in these processes. The amygdala has reciprocal connections with orbitofrontal cortex (OFC; McDonald, 1991; Carmichael and Price, 1995; Stefanacci and Amaral, 2000, 2002; Ghashghaei et al., 2007), a cortical area thought to play a central role in value-based decisions (Padoa-Schioppa and Assad, 2006; Wallis, 2007; Padoa-Schioppa, 2011). OFC may be important for implementing executive or cognitive control over behavior, and endowing subjects with the ability to rationally analyze their options, as well as to tune their behavior to what is socially acceptable in the face of emotionally driven impulses (Damasio, 1994; Rolls, 1996; Bechara et al., 2000; Berlin et al., 2005; Ochsner and Gross, 2005). Part of this may be due to the fact that OFC seems to play a role in the simple ability to anticipate aversive stimuli or negative outcomes, as well as positive outcomes (Tremblay and Schultz, 2000; Roberts et al., 2004; Young et al., 2010).

In this paper, we review our efforts to understand the roles of the amygdala and OFC in acquiring representations of reinforcement contingencies. As we reviewed above, these representations may be critical substrates for reward-based and punishment-based decision-making. One of the striking findings in these investigations concerns the differential dynamics of processing that takes place in appetitive and aversive systems in amygdala and OFC. The amygdala appears to have evolved an aversive system that learns changes in reinforcement contingencies more rapidly than its counterpart in OFC; but, for appetitive networks, the time courses of learning in the two brain areas are reversed. Moreover, both single unit and local field potential (LFP) data point to complex interactions between amygdala and OFC that change as a function of learning. Although appetitive and aversive systems have been posited to act in an opponent manner, this complex pattern of interactions suggests that a more nuanced framework may be required to understand the relative contribution of these networks during learning and decision-making. Moreover, behavioral evidence indicates that appetitive and aversive stimuli can have a variety of effects on cognitive processes, some of which may be induced by stimuli of either valence. Altogether, these data suggest that appetitive and aversive systems may act in congruent and opponent fashions – even at the same time – and do not merely compete to determine the most valuable behavioral option during decision-making.

Positive and negative cells in the brain

We have focused on trying to understand neural circuits involved in punishment and aversive learning, and how these circuits may differ from and interact with circuits involved in rewards and appetitive learning. When we began our experiments several years ago, only a few studies had examined the neurophysiology of the amygdala in primates (Sanghera et al., 1979; Nishijo et al., 1988, 2008; Rolls, 2000; Sugase-Miyamoto and Richmond, 2005; Wilson and Rolls, 2005). Furthermore, no primate lab had undertaken simultaneous recordings in amygdala and OFC to understand dynamic interactions between the brain structures during learning.

Our experimental approach strove to disambiguate neural responses that might be related to the sensory properties of visual conditioned stimuli (CSs) from responses related to the reinforcement contingencies. To accomplish this, we used a mixed appetitive/aversive reversal learning paradigm. This paradigm combined a conditioning procedure with standard extracellular physiology in rhesus monkeys; we measured the physiological responses of individual neurons to CSs that signaled an impending positive or negative US. CSs were small fractal patterns, positive outcomes were small aliquots of water, and negative outcomes were brief airpuffs directed at the face (Paton et al., 2006; Belova et al., 2007, 2008; Morrison and Salzman, 2009, 2011; Morrison et al., 2011). In these experiments, one CS was initially paired with reward and another with an aversive stimulus (unconditioned stimuli, USs); then, without warning, we reversed the reinforcement contingences of the CSs. We recorded single neuron responses while monkeys learned the initial CS-US associations and their reversal. One major advantage of this approach was that reinforcements – particularly aversive “punishments” – were unavoidable, so we were able to unequivocally identify neural activity related to the anticipation of appetitive and aversive reinforcement.

In both the amygdala and OFC, we observed two populations of neurons that fired more for positive or negative outcomes, respectively, which we refer to as positive and negative value-coding cells. The response profiles for these two populations are shown in Figures 1A–D for OFC and in Figures 1E–H for the amygdala. Shortly after CS onset, both cell populations systematically fire differentially for CSs paired with positive or negative reinforcement. Reversing the reinforcement contingencies (Figures 1C,D,G,H for positive and negative cells, respectively) demonstrates that the differential firing is specifically related to the reinforcement contingencies and not other aspects of the CS, such as specific visual features. Note that after reversal, an image formerly associated with a reward now leads to a punishment, and vice-versa; after only a few trials of exposure to these new contingencies (Paton et al., 2006; Belova et al., 2007; Morrison et al., 2011), the neural response pattern shifts to reflect these changes, such that the response profiles look quite similar before and after reversal.

Figure 1

The encoding of reinforcement contingencies seems to reflect the overall motivational significance, or value, of a US associated with a CS, and not other types of information learned during conditioning. Several lines of evidence support this conclusion. First, neither amygdala nor OFC neurons encode motor responses elicited by USs on our task, indicating that neurons do not appear to represent the relationship between a CS and the motor response elicited by USs (Paton et al., 2006; Morrison and Salzman, 2009). Second, both OFC and amygdala neurons generally do not simply represent the relationship between a CS and the sensory qualities of a preferred US. Rather, we found that OFC and amygdala neurons respond in a graded manner to CSs predicting large rewards (LRs), small rewards (SRs), and negative outcomes; this means that a cell that prefers a CS associated with an aversive airpuff also responds differentially to CSs associated with water rewards, and thus encodes information about two types of outcomes. Moreover, since the outcomes include two modalities (taste and touch), it is unlikely that the neural response is primarily driven by a physical quality of one type of outcome, such as the strength or duration of the airpuff (Belova et al., 2008; Morrison and Salzman, 2009).

Third, positive and negative neurons often appear to track value in a consistent manner across the different sensory events in a trial – including the fixation point, CS, and US presentations – even though those stimuli differ in sensory modality. This has led us to suggest that amygdala and OFC neurons represent the overall value of the animals’ “state,” or situation (Belova et al., 2008; Morrison and Salzman, 2009, 2011). Finally, in an additional series of experiments that examined the representation of “relative” value in different contexts, amygdala neurons changed their firing rate in accordance with changes in the relative value of a CS, even when the absolute value (i.e., reward size) of the associated US does not change (Schoer et al., 2011). This phenomenon has also been observed in the OFC (Padoa-Schioppa, 2009; Schoer et al., 2009).

In contrast to the signals just described, there are doubtless other signals in the brain that encode the magnitude of single stimulus dimensions – e.g., the size or taste of specific rewards. However, these signals would not, in and of themselves, be sufficient to inform choices made between outcomes that were in different modalities.

Dynamics during learning

The neurons we describe provide a dynamic representation that changes rapidly during learning. Overall, during reversal learning, the change in the neural responses in both amygdala and OFC was on a timescale similar to changes in the monkey’s behavior. Behavioral metrics of the monkey’s expectation – anticipatory licking of the water tube preceding rewards and anticipatory “blinking” before aversive airpuffs – reversed within a few trials, indicating that monkeys learned the new associations quite rapidly (Paton et al., 2006; Morrison et al., 2011). Amygdala and OFC neural activity likewise began to change their responses to CSs within a few trials of a reversal in reinforcement contingencies (Paton et al., 2006; Belova et al., 2007; Morrison et al., 2011). This sequence of neural and behavioral changes indicates that the amygdala and OFC could be involved in the monkeys’ learning of new reinforcement contingencies.

Neuroscientists have long believed that the prefrontal cortex, and OFC in particular, drives reversal learning (Iversen and Mishkin, 1970; Thorpe et al., 1983; O’Doherty et al., 2001; Schoenbaum et al., 2002; Chudasama and Robbins, 2003; Fellows and Farah, 2003; Hornak et al., 2004; Izquierdo et al., 2004; Chamberlain et al., 2008; Hampshire et al., 2008; Ghahremani et al., 2010); but some have recently proposed that in fact representations in OFC may update more slowly upon reversal than those elsewhere (Schoenbaum et al., 1998, 2003; Saddoris et al., 2005). Because we recorded amygdala and OFC activity simultaneously, we were able to examine the dynamics of learning in positive and negative value-coding neurons in both amygdala and OFC in order to characterize their relative timing. We found that appetitive and aversive networks in OFC and amygdala exhibited different learning rates, and – surprisingly – that the direction of the difference depended on the valence preference of the cell populations in question. For positive cells, changes in OFC neural activity after reversal were largely complete many trials earlier than in the amygdala; for negative cells, the opposite was true (Figure 2). In each case, the faster-changing area was completing its transition around the time of the onset of changes in behavior; meanwhile the other, more slowly changing area did not complete the shift in firing pattern until many trials after the behavioral responses began to change. Thus, signals appropriate for driving behavioral learning are present in both brain structures, with the putative aversive system in the amygdala and appetitive system in OFC being particularly sensitive to changes in reinforcement contingencies. This finding may reflect the preservation across evolution of an aversive system in the amygdala that learns very quickly in order to avoid threats to life and limb.

Figure 2

During versus after learning

Despite the complex pattern of dynamics we observed during learning, once the new CS-US contingencies have been established, we found that both populations of OFC cells – positive value-coding and negative value-coding – predict reinforcement earlier in the trial than their counterparts in the amygdala (Figure 3). To demonstrate this, we examined trials after learning had taken place and determined the earliest point in the trial each area begins to significantly differentiate between images that predict reward and images that predict airpuff. For both positive and negative cell populations, OFC predicted reinforcement more rapidly after image onset. Thus, it appears that the relationship between single unit firing in the appetitive and aversive networks in the two brain areas evolves as a function of learning, with the OFC perhaps assuming a more primary role after learning.

Figure 3

We found further evidence of the evolving dynamic relationship between amygdala and OFC during learning by examining LFP data recorded during the reversal learning task. To do so, we applied Granger causality analysis, which measures the degree to which the past values of one neural signal predict the current values of another (Granger, 1969; Brovelli et al., 2004), to the simultaneously recorded LFPs in the amygdala and OFC. Remarkably, we found significant Granger causality in both directions that increased upon CS onset (Wilcoxon, p < 0.01; Figure 4A). Notably, during learning, Granger causality was stronger in the amygdala-to-OFC direction, but after learning, Granger causality was strongest in the OFC-to-amygdala direction (Figures 4B,C). This result is consistent with single unit data showing that, after reversal learning has occurred, OFC predicts reinforcement with a shorter latency after CS onset. This positions the OFC to be able to drive or modulate amygdala responses to value-laden CSs after learning. (Note, however, that the amygdala continues to be able to influence processing in OFC, just not as strongly as the reverse.).

Figure 4

Conflict within appetitive and aversive circuits

There is an additional level of complexity within appetitive and aversive circuits that has not received much attention on the physiological level, namely competition and conflict within these circuits. Our learning data suggest that the signals carried by different neural circuits may be updated at different rates in different brain areas. This suggests that these systems might at times conflict with each other. Another possible example of competition is that between executive areas – which allow us to evaluate potential outcomes on a practical and rational level – and limbic areas, which are more involved in emotional processing, and which might provide a value signal based more heavily on immediate sensory experience and emotion-laden associations. For example, the amygdala and OFC themselves may at times “recommend” different responses, the former mediating more emotionally driven responses and the latter more executive or cognitive behaviors (De Martino et al., 2006).

This phenomenon has been given some attention on the behavioral level (McNeil et al., 1982; Damasio et al., 1994; Kahneman and Tversky, 2000; Loewenstein et al., 2001; Greene and Haidt, 2002), and has also been examined using fMRI in humans (McClure et al., 2004, 2007; De Martino et al., 2006; Kable and Glimcher, 2007). However, few studies have examined appetitive and aversive circuits at the level of single cells during a decision-making task involving rewards and punishments. To best investigate the interactions between appetitive and aversive neural circuits, such a decision-making task should include conditions in which rewards and aversive stimuli must be weighed against each other in order to guide behavior. As a first step, we trained monkeys to perform a simple two-choice task involving rewards and aversive stimuli (described below). We discovered that, even on this simple task, behavioral choices appear to be influenced not only by the value of the reinforcement associated with cues, but also by the salience of cues.

We used a two-choice task in which monkeys selected visual targets by making a saccade to the target of their choice. Monkeys viewed two visual targets on each trial, each of which was a CS associated with a particular outcome. After maintaining fixation during a 900–1200 ms delay period, monkeys chose one of the two targets by foveating it, followed by delivery of the associated outcome (Figure 5A). There were four possible outcomes: a LR, a SR, no reinforcement (N), or a punishment (P), where rewards were small amounts of water and punishments were brief airpuffs directed at the face. The four CSs (one for each outcome; Figure 5B) were offered in all possible combinations, with the exception of two of the same kind. Trial conditions were pseudo-randomly interleaved, and counter-balanced for spatial configuration. The list of trial types is shown in Figure 5C. New sets of CSs were used in each session. Two independent stimulus sets were used, and trials drawing from the two sets were interleaved in pseudo-random order. In each session, a pair of locations on the monitor was chosen and used for the duration of the session. The locations varied, but each pair always straddled the fixation point. While monkeys were free to choose either target, they had to make a choice: incomplete trials were repeated until one or the other target was chosen.

Figure 5

If monkeys always chose the higher-value target, then plotting the percent of trials on which a CS was chosen, out of all trials on which that CS was offered, yields a straight line, since LR is always the higher-value target when presented, SR on 2/3 of trials when presented, N on 1/3 of trials, and P on no trials, as can be seen in the list of trial conditions (see Figure 5C). We will refer to this as “optimal” behavior. In Figure 6, two example sessions are shown. The first is a session in which a monkey chose the higher-value target most of the time, such that the plot of the number of times each target was chosen follows the optimal behavior line quite closely (Figure 6A). In the second example, however, the same monkey chose the punished target many times, and about as often as he chose the neutral (non-reinforced) target (Figure 6B).

Figure 6

The deviation from optimal behavior seen in Figure 6B is not due to an overall drop in performance, but to a change in behavior on a single trial type: the N-P stimulus pair. In Figure 7, a running local average of the proportion of trials on which the monkey chose the higher-value target is shown, broken down by trial type, for the same two sessions shown in Figure 6. When offered a choice between a reward and a punishment, the monkey reliably chose the reward (LR-P and SR-P trial types in Figures 7A,B). However, when offered a choice between no reinforcement and a punishment, in some sessions, the monkey chose punishment quite often (N-P trial type in Figure 7B). These two sessions are representative of the type of choice behavior we observed.

Figure 7

This choice pattern was perplexing to us at first. We noticed that sometimes monkeys avoided the punished target in a session, while other times he chose it over the neutral target a substantial fraction of the time. We checked and manipulated a number of parameters: did monkeys find the punishment aversive? Was it aversive enough? Did monkeys understand the CS-US contingencies? What we found, in two monkeys, was an abundance of evidence that subjects did understand the task contingencies, did find the airpuff aversive, and yet chose the punished target despite the aversive outcome they knew would follow. Evidence in support of the idea that the airpuff was indeed aversive included: visible frustration and displeasure upon airpuff delivery, defensive blinking behavior in anticipation of airpuff, statistically significant greater likelihood of breaking fixation on N-P trials, and willingness to work being clearly dependent on the strength or frequency of airpuff delivery, with increases in any of these variables quickly leading to the monkey’s refusing to work for the rest of the day. None of these were observed in relation to rewarding outcomes.

Over a period of training lasting several months, these patterns persisted. Figure 8 shows the performance across a series of sessions over a period of a few weeks in one monkey. The two example sessions shown in the previous figures are marked with asterisks. In Figure 8A, the percent of trials completed for N-P versus other trial types is displayed. On average, the monkey broke fixation before completing the trial more often on N-P trials than on other trial types – resulting in a lower percent of trials completed – which is indicative of that trial type being aversive, difficult, or both. (Note that the two sessions shown in Figures 6 and 7 are not representative of this overall pattern, having lower than average percent break-fixation trials). Figure 8B shows the percent of trials on which the monkey chose the N-target on N-P trials (dark gray bars, %N of NP) as compared to choosing the P target (light gray bars). What is apparent is that %N of NP varied day to day, and did not appear to plateau at a stable level, nor was there a trend in either direction as training progressed. Note that the selection of the punished target on N-P trials occurred during blocks in which, on all other interleaved trial types, the monkey chose the higher-value target nearly all of the time (Figure 8C). This same pattern was seen in other training periods for this monkey, as well as across all training periods in the second monkey.

Figure 8

On average, one monkey chose neutral CSs over punished CSs only slightly more than half the time. Figure 9A shows the distribution of %N of NP across all training sessions, including the subset shown in Figure 8. The mean was 62.2%, and was significantly greater than 50% (t-test, p < 0.0001). This was over a training period of 5 months, and after trying a host of manipulations to ensure that the monkey understood the task and the CS-US contingencies involved. Also, note that on interleaved trials, the monkey was choosing the higher-value target virtually all the time (Figure 9B). In the second monkey, the average %N of NP was very close to 50%, and was not statistically significant (mean, 50.4%, mean > 50%, t-test, p = 0.4409), even though that monkey was also trained extensively and exposed to the same set of task manipulations as the first monkey. However, his performance on other trial types was similarly very high (mean, 89.1% higher-value target chosen, mean > 50%, t-test, p < 0.0001).

Figure 9

While there are several possible explanations of this counter-intuitive behavior, we favor one explanation that fits with some of the other examples of neural systems in competition. In particular, we believe that on the N-P trial type, the salience and value of cues were in conflict, and this conflict pushed monkeys toward different choices. This was not true on any of the other trial types, in which the most salient CS on the screen was also the most valuable (whatever the highest level of reward was). On N-P trials, however, the N-target is more valuable than the P target (presumed zero value versus negative value), but the P target, by virtue of its association with an aversive airpuff, is very likely to be more salient. Thus the P target is chosen some of the time, even though it is not necessarily what monkeys prefer, due to a strong impulse to foveate – i.e., look at or orient toward – this highly salient, behaviorally relevant stimulus. Further evidence to support this idea is that monkeys were much more indecisive on N-P trials than on other trials: this was apparent in the percentage of break-fixation trials (Figure 8A), and in the observation that monkeys often looked quickly back and forth between targets, even though this behavior led to a greater number of incomplete trials. The monkeys did not do this on other trial types. As might be expected for trial types that are more difficult or less certain, monkeys exhibited greater spatial bias on N-P trials than on other trial types. The differences were modest: first monkey, 10.0% versus 1.6% bias, and second monkey, 8.3% versus 2.4% bias, for N-P and other trials, respectively, when measured across all sessions. (Bias is the percentage over 50% that a preferred spatial location is chosen; a 10% bias is equivalent to a location being chosen 60% of the time). While both differences were statistically significant (t-test, p < 0.0001 in both cases), the small magnitude indicates that other factors had a strong impact on the monkeys’ choices.

We suspected that the absence of a possible reward on N-P trials was having a major impact on the choice behavior of our monkeys. Therefore, we redesigned the task for the first monkey so that all outcomes included some level of reward, using as our set of possible outcomes: LR, SR, and a compound outcome of airpuff and SR (P + SR). For the compound outcome, the punishment was delivered first, followed by a short delay and then the SR. This change resulted in a substantial shift in the monkey’s behavior. Within a few training sessions, the monkey learned the new task and began consistently choosing the higher-value target most of the time on all trial types. At the beginning of each session, new CSs were introduced, and the monkey learned them within a small number of repetitions, and then chose the higher-value target virtually all of the time for the rest of the session. The monkey performed at this level consistently day after day: the average choice %SR on the trial type SR−[P + SR] was 84.9% (Figure 9C), which was significantly greater than 50% (t-test, p < 0.0001), and variations around this mean were much smaller than they had been in the first version of the task. As before, on all other trial types, which were interleaved, the monkey chose the higher-value target virtually all of the time (Figure 9D).

We have here, then, an example of counter-intuitive choice behavior that is robust and occurs when no reward is possible. As we mention above, we suspect that this is due to competition between the neural circuits processing value and salience; we would also speculate that the salience of negative outcomes only grows large enough to compete with value signals driving behavior when the value of the alternative outcome is small or zero (e.g., when a cue predicts no reinforcement). Clearly, this results in sub-optimal choice behavior. This is consistent with other studies that have noted sub-optimal performance in tasks where monkeys are forced to make a choice between outcomes and the greatest possible reward is very small or zero. For example, Peck et al. (2009) observed more incorrect choices on “neutral” as opposed to rewarded trials, and Amemori and Graybiel (2012) observed longer reaction times and more omission errors on a “reward–reward” control task when reward size was very low. Moreover, Amemori and Graybiel (2012) designed their main experimental task to include a SR for any choice because they found it necessary to “maintain motivation to perform the task.” The paradigm employed by Amemori and Graybiel differed from ours in a number of ways, including the use of a joystick movement operant response, limiting our ability to make a direct comparison of the behavior observed in the two tasks. On the other hand, our use of an eye movement operant response may have increased the efficacy by which representations of salience modulated behavior. There is good reason to believe that salience has privileged access to the oculomotor system (Bisley et al., 2004; Hasegawa et al., 2004), especially in the highly visually oriented primate, to promote rapid foveation of salient stimuli.

We suggest that our behavioral results may be an example of a competition between limbic and cortical circuits dedicated to emotional versus cognitive processing, respectively. This paradigm, in the macaque, may test the limits of the amount of cognitive control monkeys are able to exert over reflexive behaviors. While the monkey does succeed in overriding the impulse to look at the punished target some of the time, he does not do so all of the time. Humans, with their greater level of cognitive processing and control, would presumably have much less difficulty avoiding the punished target.

Summary and Challenges

To make a decision, we often must predict how particular stimuli or courses of action lead to rewards or punishments. The ability to make these predictions relies on our ability to learn through experience the relationship between stimuli and actions and positive and negative reinforcement. It is therefore important to understand the representation of aversive and appetitive outcomes in the brain, both during and after learning, in order to understand how these signals generate behavior. However, at the same time, it’s important to recognize that the impact of appetitive and aversive circuits is not limited to behavior that is specific to the valence of the looming reinforcement. Activation of appetitive and aversive circuits can also elicit valence non-specific responses, such as enhanced arousal or attention.

A number of the studies in our lab have been directed at trying to understand the nature of appetitive and aversive circuits in the brain. Although there hadn’t been a great deal of work examining aversive processing at the physiological level in non-human primates in the past, some older studies suggested that our approach would be fruitful (e.g. Nishijo et al., 1988; Mirenowicz and Schultz, 1996; Rolls, 2000; Yamada et al., 2004). Our neurophysiological studies have expanded on these initial findings to create a more detailed picture of appetitive and aversive circuits. Both the amygdala and OFC contain neurons that belong to each network: positive and negative value-coding neurons are present in both areas, and appear to encode the value of cues that signal imminent appetitive and aversive reinforcers, responding in a graded fashion to the value of CSs as well as USs. The dynamics of learning exhibited by appetitive and aversive networks in amygdala and OFC are surprisingly complex, with aversive systems updating faster during reversal learning in the amygdala than OFC, but vice-versa for appetitive networks (Morrison et al., 2011). This suggests that reversal learning is not merely driven by one brain area or the other. The complexity of the dynamics is also illustrated by the fact that the degree to which each area may influence the other is not fixed and instead evolves during the learning process (Morrison et al., 2011) and perhaps in other circumstances as well.

In addition to our neurophysiological findings, behavioral data indicates that the interactions between appetitive and aversive systems are complicated. In a paradigm that required monkeys to make decisions based on the value of stimuli, behavior was sub-optimal when monkeys had to choose between a cue associated with nothing and a cue associated with an airpuff. These results indicate that eye movement choice behavior may be influenced not just by the value of stimuli but also by their salience. It demonstrates that competition between appetitive and aversive networks may occur not only between the values encoded by the two systems but also by the extent to which the systems influence brain structures representing salience, and thereby perhaps generating enhanced attention and eye movements to salient targets.

The complexity of interactions between appetitive and aversive circuits is likely to remain an enduring problem for neuroscientists, but headway is being made. Notably, in our studies of the amygdala and OFC, we have failed to find evidence of anatomical segregation of appetitive and aversive networks (Morrison et al., 2011). Rather, appetitive and aversive networks appear to be anatomically intermingled. Anatomical segregation of these systems would make it easier to develop experimental approaches that can target manipulations of one system or the other to test their causal role in behavior. Fortunately, some recent studies have begun to identify areas where anatomical segregation exists. Two examples of segregation in aversive systems may be found in the work of Hikosaka and colleagues on the habenula (Matsumoto and Hikosaka, 2007, 2008, 2009), and Graybiel’s team in the ACC (Amemori and Graybiel, 2012). The habenula appears to encode negatively valenced stimuli in relation to expectation. The ACC contains networks belonging to appetitive and aversive networks, though there appears to be some anatomical segregation of the aversive network. Both areas are likely to be involved in value-driven decision-making and/or learning. In addition, in contrast to our findings in the monkey, anatomical segregation of appetitive and aversive processing has been observed in the OFC in human fMRI studies (Kim et al., 2006). Our recordings focused only on a restricted part of OFC, largely area 13, and it remains possible that recordings from a more extensive part of the OFC will reveal anatomical segregation of appetitive and aversive systems in the macaque. In general, anatomical segregations may provide more experimentally tractable opportunities for future studies to elucidate details concerning how each network operates.

Despite the anatomical segregation of some aspects of these networks, the challenges ahead are formidable. The amygdala and OFC are two structures intimately related to emotional processing, and these structures, among others, likely mediate the executive control of emotion. Moreover, the amygdala, through its extensive connections to sensory cortex, to the basal forebrain and to the prefrontal cortex is poised to influence cognitive processing. The neurophysiological data we have presented illustrates the complexity of interactions between appetitive and aversive networks. Further, the behavioral data presented suggests that conflict between appetitive and aversive networks extends beyond conflicts about value to conflicts between value and salience. Future studies must clarify how these conflicts are resolved in the brain.

Statements

Acknowledgments

This research was supported by grants from NIMH R01 MH082017, NIMH RC1 MH088458, NIDA R01 DA020656, NEI P30 EY019007, and the James S. McDonnell and Gatsby foundations. Sara E. Morrison received support from an NSF graduate fellowship and from an individual NIMH NRSA F31 MH081620. Brian Lau received support from NIMH institutional training grant T32 MH015144 and the Helen Hay Whitney Foundation. Alex Saez was supported by the Kavli Foundation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Authorization for Use of Experimental Animals

All experimental procedures were in accordance with the National Institutes of Health guidelines and were approved by the Institutional Animal Care and Use Committees at New York State Psychiatric Institute and Columbia University.

References

1
AmaralD. G.BehnieaH.KellyJ. L. (2003). Topographic organization of projections from the amygdala to the visual cortex in the macaque monkey. Neuroscience118, 1099–1120.10.1016/S0306-4522(02)01001-1
2
AmemoriK.GraybielA. M. (2012). Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making. Nat. Neurosci.15, 776–785.10.1038/nn.3088
3
AndersonA. K. (2005). Affective influences on the attentional dynamics supporting awareness. J. Exp. Psychol. Gen.134, 258–281.10.1037/0096-3445.134.2.258
4
AndersonB. A.LaurentP. A.YantisS. (2011). Value-driven attentional capture. Proc. Natl. Acad. Sci. U.S.A.108, 10367–10371.10.1073/pnas.1014885108
5
ArmonyJ. L.DolanR. J. (2002). Modulation of spatial attention by fear-conditioned stimuli: an event-related fMRI study. Neuropsychologia40, 814–826.10.1016/S0028-3932(01)00178-6
- CrossRef
- Google Scholar
6
BaxterM.MurrayE. A. (2002). The amygdala and reward. Nat. Rev. Neurosci.3, 563–573.10.1038/nrn875
7
BecharaA.DamasioH.DamasioA. R. (2000). Emotion, decision making and the orbitofrontal cortex. Cereb. Cortex10, 295–307.10.1093/cercor/10.3.295
8
BelovaM. A.PatonJ. J.MorrisonS. E.SalzmanC. D. (2007). Expectation modulates neural responses to pleasant and aversive stimuli in primate amygdala. Neuron55, 970–984.10.1016/j.neuron.2007.08.004
9
BelovaM. A.PatonJ. J.SalzmanC. D. (2008). Moment-to-moment tracking of state value in the amygdala. J. Neurosci.28, 10023–10030.10.1523/JNEUROSCI.1400-08.2008
10
BerlinH. A.RollsE. T.IversenS. D. (2005). Borderline personality disorder, impulsivity, and the orbitofrontal cortex. Am. J. Psychiatry162, 2360–2373.10.1176/appi.ajp.162.12.2360
11
BisleyJ. W.KrishnaB. S.GoldbergM. E. (2004). A rapid and precise on-response in posterior parietal cortex. J. Neurosci.24, 1833–1838.10.1523/JNEUROSCI.5007-03.2004
12
BroschT.SanderD.PourtoisG.SchererK. R. (2008). Beyond fear: rapid spatial orienting toward positive emotional stimuli. Psychol. Sci.19, 362–370.10.1111/j.1467-9280.2008.02094.x
13
BrovelliA.DingM.LedbergA.ChenY.NakamuraR.BresslerS. L. (2004). Beta oscillations in a large-scale sensorimotor cortical network: directional influences revealed by Granger causality. Proc. Natl. Acad. Sci. U.S.A.101, 9849–9854.10.1073/pnas.0308538101
14
CarmichaelS. T.PriceJ. L. (1995). Limbic connections of the orbital and medial prefrontal cortex in macaque monkeys. J. Comp. Neurol.363, 615–641.10.1002/cne.903630409
15
ChamberlainS. R.MenziesL.HampshireA.SucklingJ.FinebergN. A.Del CampoN.et al (2008). Orbitofrontal dysfunction in patients with obsessive-compulsive disorder and their unaffected relatives. Science321, 421–422.10.1126/science.1154433
16
ChudasamaY.RobbinsT. W. (2003). Dissociable contributions of the orbitofrontal and infralimbic cortex to pavlovian autoshaping and discrimination reversal learning: further evidence for the functional heterogeneity of the rodent frontal cortex. J. Neurosci.23, 8771–8780.
- Pubmed Abstract
- Google Scholar
17
DamasioA. R. (1994). Descartes Error. New York: G.P. Putnam.
- Google Scholar
18
DamasioH.GrabowskiT.FrankR.GalaburdaA. M.DamasioA. R. (1994). The return of Phineas Gage: clues about the brain from the skull of a famous patient. Science264, 1102–1105.10.1126/science.8178168
19
DavisM. (1992). “The role of the amygdala in conditioned fear,” in The Amygdala: Neurological Aspects of Emotion, Memory, and Mental Dysfunction, ed. AggletonJ. (Hoboken, NJ: John Wiley & Sons), 255–306.
- Google Scholar
20
DawN. D.KakadeS.DayanP. (2002). Opponent interactions between serotonin and dopamine. Neural Netw.15, 603–616.10.1016/S0893-6080(02)00052-7
21
De MartinoB.KumaranD.SeymourB.DolanR. J. (2006). Frames, biases, and rational decision-making in the human brain. Science313, 684–687.10.1126/science.1128356
22
DickinsonA.DearingM. F. (1979). “Appetitive-aversive interactions and inhibitory processes,” in Mechanisms of Learning and Motivation, eds DickinsonA.BoakesR. A. (Hillsdale, NJ: Erlbaum), 203–231.
- Google Scholar
23
EverittB. J.CardinalR. N.ParkinsonJ. A.RobbinsT. W. (2003). Appetitive behaviour: impact of amygdala-dependent mechanisms of emotional learning. Ann. N. Y. Acad. Sci.985, 233–250.10.1111/j.1749-6632.2003.tb07085.x
24
FellowsL. K.FarahM. J. (2003). Ventromedial frontal cortex mediates affective shifting in humans: evidence from a reversal learning paradigm. Brain126, 1830–1837.10.1093/brain/awg180
25
FreeseJ. L.AmaralD. G. (2005). The organization of projections from the amygdala to visual cortical areas TE and V1 in the macaque monkey. J. Comp. Neurol.486, 295–317.10.1002/cne.20520
26
GhahremaniD. G.MonterossoJ.JentschJ. D.BilderR. M.PoldrackR. A. (2010). Neural components underlying behavioral flexibility in human reversal learning. Cereb. Cortex20, 1843–1852.10.1093/cercor/bhp247
27
GhashghaeiH. T.HilgetagC. C.BarbasH. (2007). Sequence of information processing for emotions based on the anatomic dialogue between prefrontal cortex and amygdala. Neuroimage34, 905–923.10.1016/j.neuroimage.2006.09.046
28
GrangerC. W. J. (1969). Investigating causal relationships by econometric models and cross-spectral methods. Econometrica37, 424–438.10.2307/1913549
- CrossRef
- Google Scholar
29
GreeneJ.HaidtJ. (2002). How (and where) does moral judgment work?Trends Cogn. Sci. (Regul. Ed.)6, 517–523.10.1016/S1364-6613(02)02011-9
30
GrossbergA. (1984). Some normal and abnormal behavioral syndromes due to transmitter gating of opponent processes. Biol. Psychiatry19, 1075–1118.
- Pubmed Abstract
- Google Scholar
31
HampshireA.GruszkaA.FallonS. J.OwenA. M. (2008). Inefficiency in self-organized attentional switching in the normal aging population is associated with decreased activity in the ventrolateral prefrontal cortex. J. Cogn. Neurosci.20, 1670–1686.10.1162/jocn.2008.20115
32
HasegawaR. P.PetersonB. W.GoldbergM. E. (2004). Prefrontal neurons coding suppression of specific saccades. Neuron43, 415–425.10.1016/j.neuron.2004.07.013
33
HollandP. C.GallagherM. (1999). Amygdala circuitry in attentional and representational processes. Trends Cogn. Sci. (Regul. Ed.)3, 65–73.10.1016/S1364-6613(98)01271-6
34
HornakJ.O’DohertyJ.BramhamJ.RollsE. T.MorrisR. G.BullockP. R.et al (2004). Reward-related reversal learning after surgical excisions in orbito-frontal or dorsolateral prefrontal cortex in humans. J. Cogn. Neurosci.16, 463–478.10.1162/089892904322926791
35
IlangoA.WetzelW.ScheichH.OhlF. W. (2010). The combination of appetitive and aversive reinforcers and the nature of their interaction during auditory learning. Neuroscience166, 752–762.10.1016/j.neuroscience.2010.01.010
36
IversenS. D.MishkinM. (1970). Perseverative interference in monkeys following selective lesions of inferior prefrontal convexity. Exp. Brain Res.11, 376–386.10.1007/BF00237911
37
IzquierdoA.MurrayE. A. (2007). Selective bilateral amygdala lesions in rhesus monkeys fail to disrupt object reversal learning. J. Neurosci.27, 1054–1062.10.1523/JNEUROSCI.3616-06.2007
38
IzquierdoA.SudaR. K.MurrayE. A. (2004). Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency. J. Neurosci.24, 7540–7548.10.1523/JNEUROSCI.1921-04.2004
39
KableJ. W.GlimcherP. W. (2007). The neural correlates of subjective value during intertemporal choice. Nat. Neurosci.10, 1625–1633.10.1038/nn2007
40
KahnemanD.TverskyA. (2000). Choices, Values and Frames. New York: Cambridge University Press.
- Google Scholar
41
KimH.ShimogoS.O’DohertyJ. P. (2006). Is avoiding an aversive outcome rewarding? Neural substrates of avoidance learning in the human brain. PLoS Biol.4, e233.10.1371/journal.pbio.0040233
- CrossRef
- Google Scholar
42
KonorskiJ. (1967). Integrative Activity of the Brain: An Interdisciplinary Approach. Chicago, IL: University of Chicago Press.
- Google Scholar
43
LangP. J.DavisM. (2006). Emotion, motivation, and the brain: reflex foundations in animal and human research. Prog. Brain Res.156, 3–29.10.1016/S0079-6123(06)56001-7
44
LeDouxJ. E. (2000). Emotion circuits in the brain. Annu. Rev. Neurosci.23, 155–184.10.1146/annurev.neuro.23.1.155
45
LoewensteinG. F.WeberE. U.HseeC. K.WelchN. (2001). Risk as feelings. Psychol. Bull.127, 267–286.10.1037/0033-2909.127.2.267
46
MarenS.QuirkG. J. (2004). Neuronal signalling of fear memory. Nat. Rev. Neurosci.5, 844–852.10.1038/nrn1535
47
MatsumotoM.HikosakaO. (2007). Lateral habenula as a source of negative reward signals in dopamine neurons. Nature447, 1111–1115.10.1038/nature05860
48
MatsumotoM.HikosakaO. (2008). Negative motivational control of saccadic eye movement by the lateral habenula. Prog. Brain Res.171, 399–402.10.1016/S0079-6123(08)00658-4
49
MatsumotoM.HikosakaO. (2009). Representation of negative motivational value in the primate lateral habenula. Nat. Neurosci.12, 77–84.10.1038/nn.2233
50
McClureS. M.EricsonK. M.LaibsonD. I.LoewensteinG.CohenJ. D. (2007). Time discounting for primary rewards. J. Neurosci.27, 5796–5804.10.1523/JNEUROSCI.4246-06.2007
51
McClureS. M.LaibsonD. I.LoewensteinG.CohenJ. D. (2004). Separate neural systems value immediate and delayed monetary rewards. Science306, 503–507.10.1126/science.1100907
52
McDonaldA. J. (1991). Organization of amygdaloid projections to the prefrontal cortex and associated striatum in the rat. Neuroscience44, 1–44.10.1016/0306-4522(91)90248-M
53
McDonaldA. J. (1998). Cortical pathways to the mammalian amygdala. Prog. Neurobiol.55, 257–332.10.1016/S0301-0082(98)00003-3
54
McNeilB. J.PaukerS. G.SoxH. C.Jr.TverskyA. (1982). On the elicitation of preferences for alternative therapies. N. Engl. J. Med.306, 1259–1262.10.1056/NEJM198205273062103
55
MirenowiczJ.SchultzW. (1996). Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli. Nature379, 449–451.10.1038/379449a0
56
MorrisonS. E.SaezA.LauB.SalzmanC. D. (2011). Different time courses for learning-related changes in amygdala and orbitofrontal cortex. Neuron71, 1127–1140.10.1016/j.neuron.2011.07.016
57
MorrisonS. E.SalzmanC. D. (2009). The convergence of information about rewarding and aversive stimuli in single neurons. J. Neurosci.29, 11471–11483.10.1523/JNEUROSCI.1815-09.2009
58
MorrisonS. E.SalzmanC. D. (2010). Re-valuing the amygdala. Curr. Opin. Neurobiol.20, 221–230.10.1016/j.conb.2010.02.007
59
MorrisonS. E.SalzmanC. D. (2011). Representations of appetitive and aversive information in the primate orbitofrontal cortex. Ann. N. Y. Acad. Sci.1239, 59–70.10.1111/j.1749-6632.2011.06255.x
60
NishijoH.HoriE.TazumiT.OnoT. (2008). Neural correlates to both emotion and cognitive functions in the monkey amygdala. Behav. Brain Res.188, 14–23.10.1016/j.bbr.2007.10.013
61
NishijoH.OnoT.NishinoH. (1988). Single neuron responses in amygdala of alert monkey during complex sensory stimulation with affective significance. J. Neurosci.8, 3570–3583.
- Pubmed Abstract
- Google Scholar
62
OchsnerK. N.GrossJ. J. (2005). The cognitive control of emotion. Trends Cogn. Sci. (Reugul. Ed.)9, 242–249.10.1016/j.tics.2005.03.010
- CrossRef
- Google Scholar
63
O’DohertyJ.KringelbachM. L.RollsE. T.HornakJ.AndrewsC. (2001). Abstract reward and punishment representations in the human orbitofrontal cortex. Nat. Neurosci.4, 95–102.10.1038/82959
64
Padoa-SchioppaC. (2009). Range-adapting representation of economic value in the orbitofrontal cortex. J. Neurosci.29, 14004–14014.10.1523/JNEUROSCI.3751-09.2009
65
Padoa-SchioppaC. (2011). Neurobiology of economic choice: a good-based model. Annu. Rev. Neurosci.34, 333–359.10.1146/annurev-neuro-061010-113648
66
Padoa-SchioppaC.AssadJ. A. (2006). Neurons in the orbitofrontal cortex encode economic value. Nature441, 223–226.10.1038/nature04676
67
PatonJ. J.BelovaM. A.MorrisonS. E.SalzmanC. D. (2006). The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature439, 865–870.10.1038/nature04490
68
PeckC. J.JangrawD. C.SuzukiM.EfemR.GottliebJ. (2009). Reward modulates attention independently of action value in posterior parietal cortex. J. Neurosci.29, 11182–11191.10.1523/JNEUROSCI.1929-09.2009
69
PhelpsE. A.LingS.CarrascoM. (2006). Emotion facilitates perception and potentiates the perceptual benefits of attention. Psychol. Sci.17, 292–299.10.1111/j.1467-9280.2006.01701.x
70
PinkhamA. E.GriffinM.BaronR.SassonN. J.GurR. C. (2010). The face in the crowd effect: anger superiority when using real faces and multiple identities. Emotion10, 141–146.10.1037/a0017387
71
RobertsN. A.BeerJ. S.WernerK. H.ScabiniD.LevensS. M.KnightR. T.et al (2004). The impact of orbitofrontal prefrontal cortex damage on emotional activation to unanticipated and anticipated acoustic startle stimuli. Cogn. Affect. Behav. Neurosci.4, 307–316.10.3758/CABN.4.3.307
72
RollsE. (2000). “Neurophysiology and functions of the primate amygdala, and the neural basis of emotion,” in The Amygdala: A Functional Analysis, ed. AggletonJ. (New York: Oxford University Press), 447–478.
- Google Scholar
73
RollsE. T. (1996). The orbitofrontal cortex. Philos. Trans. R. Soc. Lond. B Biol. Sci.351, 1433–1443.10.1098/rstb.1996.0128
74
RomanskiL. M.ClugnetM. C.BordiF.LeDouxJ. E. (1993). Somatosensory and auditory convergence in the lateral nucleus of the amygdala. Behav. Neurosci.107, 444–450.10.1037/0735-7044.107.3.444
75
SaddorisM. P.GallagherM.SchoenbaumG. (2005). Rapid associative encoding in basolateral amygdala depends on connections with orbitofrontal cortex. Neuron46, 321–331.10.1016/j.neuron.2005.02.018
76
SalzmanC. D.FusiS. (2010). Emotion, cognition, and mental state representation in amygdala and prefrontal cortex. Annu. Rev. Neurosci.33, 173–202.10.1146/annurev.neuro.051508.135256
77
SalzmanC. D.PatonJ. J.BelovaM. A.MorrisonS. E. (2007). Flexible neural representations of value in the primate brain. Ann. N. Y. Acad. Sci.1121, 336–354.10.1196/annals.1401.034
78
SangheraM. K.RollsE. T.Roper-HallA. (1979). Visual responses of neurons in the dorsolateral amygdala of the alert monkey. Exp. Neurol.63, 610–626.10.1016/0014-4886(79)90175-4
79
SchoenbaumG.ChibaA.GallagherM. (1998). Orbitalfrontal cortex and basolateral amygdala encode expected outcomes during learning. Nat. Neurosci.1, 155–159.10.1038/407
80
SchoenbaumG.NugentS. L.SaddorisM. P.SetlowB. (2002). Orbitofrontal lesions in rats impair reversal but not acquisition of go, no-go odor discriminations. Neuroreport13, 885–890.10.1097/00001756-200205070-00030
81
SchoenbaumG.SetlowB.SaddorisM. P.GallagherM. (2003). Encoding predicted outcome and acquired value in orbitofrontal cortex during due sampling depends upon input from basolateral amygdala. Neuron39, 855–867.10.1016/S0896-6273(03)00474-4
82
SchoerR.PatonJ. J.SalzmanC. D. (2009). Activity of amygdala and orbitofrontal cortical neurons during contrast revaluation of reward predicting stimuli. Program No. 784.17. 2009 Neuroscience Meeting Planner. Chicago, IL: Society for Neuroscience Abstracts Online.
- Google Scholar
83
SchoerR.SaezA.SalzmanC. D. (2011). Amygdala neurons adaptively encode the motivational significance of conditioned stimuli in a relative manner. Program No. 515.16. 2011 Neuroscience Meeting Planner. Washington, DC: Society for Neuroscience Abstracts Online.
- Google Scholar
84
SeymourB.SingerT.DolanR. (2007). The neurobiology of punishment. Nat. Rev. Neurosci.8, 300–311.10.1038/nrn2119
85
SolomonR. L.CorbitJ. D. (1974). An opponent-process theory of motivation. 1. Temporal dynamics of affect. Psychol. Rev.81, 119–145.10.1037/h0036128
86
StefanacciL.AmaralD. G. (2000). Topographic organization of cortical inputs to the lateral nucleus of the macaque monkey amygdala: a retrograde tracing study. J. Comp. Neurol.421, 52–79.10.1002/(SICI)1096-9861(20000522)421:1<52::AID-CNE4>3.0.CO;2-O
87
StefanacciL.AmaralD. G. (2002). Some observations on cortical inputs to the macaque monkey amygdala: an anterograde tracing study. J. Comp. Neurol.451, 301–323.10.1002/cne.10339
88
Sugase-MiyamotoY.RichmondB. J. (2005). Neuronal signals in the monkey basolateral amygdala during reward schedules. J. Neurosci.25, 11071–11083.10.1523/JNEUROSCI.1796-05.2005
89
ThorpeS. J.RollsE. T.MaddisonS. (1983). The orbitofrontal cortex: neuronal activity in the behaving monkey. Exp. Brain Res.49, 93–115.10.1007/BF00235545
90
TremblayL.SchultzW. (2000). Reward-related neuronal activity during go-nogo task performance in primate orbitofrontal cortex. J. Neurophysiol.83, 1864–1876.
- Pubmed Abstract
- Google Scholar
91
WallisJ. D. (2007). Orbitofrontal cortex and its contribution to decision-making. Annu. Rev. Neurosci.30, 31–56.10.1146/annurev.neuro.30.051606.094334
92
WilsonF. A.RollsE. T. (2005). The primate amygdala and reinforcement: a dissociation between rule-based and associatively-mediated memory revealed in neuronal activity. Neuroscience133, 1061–1072.10.1016/j.neuroscience.2005.03.022
93
YamadaH.MatsumotoN.KimuraM. (2004). Tonically active neurons in the primate caudate nucleus and putamen differentially encode instructed motivational outcomes of action. J. Neurosci.24, 3500–3510.10.1523/JNEUROSCI.0068-04.2004
94
YoungL.BecharaA.TranelD.DamasioH.HauserM.DamasioA. (2010). Damage to ventromedial prefrontal cortex impairs judgment of harmful intent. Neuron65, 845–851.10.1016/j.neuron.2010.03.003

Summary

Keywords

amygdala, orbitofrontal cortex, value processing, reward, punishment

Citation

Barberini CL, Morrison SE, Saez A, Lau B and Salzman CD (2012) Complexity and Competition in Appetitive and Aversive Neural Circuits. Front. Neurosci. 6:170. doi: 10.3389/fnins.2012.00170

Received

30 October 2012

Accepted

04 November 2012

Published

26 November 2012

Volume

6 - 2012

Edited by

Philippe N. Tobler, University of Zurich, Switzerland

Reviewed by

Kae Nakamura, Kansai Medical University, Japan; Anton Ilango, National Institutes of Health, USA

This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: C. Daniel Salzman, Department of Neuroscience, Columbia University, 1051 Riverside Drive, Unit 87, New York, NY 10032, USA. e-mail: cds2005@columbia.edu

This article was submitted to Frontiers in Decision Neuroscience, a specialty of Frontiers in Neuroscience.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Decision Neuroscience

ORIGINAL RESEARCH article

Complexity and Competition in Appetitive and Aversive Neural Circuits

Abstract