Abstract
The traditional animal model of instrumental behavior has focused almost exclusively on structures within the cortico-striatal network and ignored the contributions of various thalamic nuclei despite large and specific connections with each of these structures. One possible reason for this is that the thalamus has been conventionally viewed as a mediator of general processes, such as attention, arousal and movement, that are not easily separated from more cognitive aspects of instrumental behavior. Recent research has, however, begun to separate these roles. Here we review the role of three thalamic nuclei in instrumental conditioning: the anterior thalamic nuclei (ANT), the mediodorsal thalamus (MD), and the parafascicular thalamic nucleus (PF). Early research suggested that ANT might regulate aspects of instrumental behavior but, on review, we suggest that the types of tasks used in these studies were more likely to recruit Pavlovian processes. Indeed, lesions of ANT have been found to have no effect on performance in instrumental free-operant tasks. By contrast, the MD has been found to play a specific and important role in the acquisition of goal-directed action. We propose this role is related to its connections with prelimbic cortex (PL) and present new data that directly implicate this circuit in the acquisition of goal-directed actions. Finally, we review evidence suggesting the PF, although not critical for the acquisition or performance of instrumental actions, plays a specific role in regulating action flexibility.
Introduction
The thalamus has been traditionally viewed as a sensory relay center, forming the interface between the sensory cortices and subcortical structures responsible for the execution of actions. In performing this role, several thalamic nuclei have been implicated in general processes such as arousal, attention, and voluntary movement. However, research within the last three decades has begun to focus more specifically on the role of the thalamus in instrumental conditioning. This has been driven, at least in part, by anatomical evidence that many thalamic nuclei have large and specific connections with the prefrontal cortex and dorsal striatum that now have well-established roles in the regulation of instrumental learning and performance.
In this review, we examine studies that have incorporated behavioral tasks in conjunction with various neural manipulations to assess the role of individual thalamic nuclei in instrumental learning and behavior. In particular we will consider the role of three thalamic nuclei: the anterior (ANT), mediodorsal (MD) and parafascicular (PF) thalamic nuclei, the latter often referred to, in humans and primates, as the centromedian-parafascicular complex. In evaluating whether any neural structure plays a role in some specific function, however, it is necessary to weigh not only the anatomical but also the functional evidence. In the case of instrumental conditioning, this means assessing the behavioral evidence that specific anatomical manipulations are influencing instrumental actions and not other forms of conditioned behavior. In the current context, the chief alternative to instrumental action is, of course, the Pavlovian conditioned response.
Pavlovian conditioning occurs through pairings of an initially neutral stimulus, the conditional stimulus (CS), and a biologically relevant unconditional stimulus (US). Across repeated pairings, the CS comes to elicit a specific set of conditioned responses (CRs) indicative of the animal’s expectancy of the impending US. By contrast, instrumental conditioning involves the animal learning to perform (or withhold) an action dependent on its consequences. Although the distinction between them may appear clear enough, in reality it is complicated by the fact that instrumental conditioning often takes place in the presence of stimuli any of which could form Pavlovian stimulus-outcome (S-O) relations. Under some circumstances, therefore, it might be difficult to separate whether it is the instrumental contingency or the Pavlovian S-O contingency that is guiding behavior.
There are two criteria that distinguish instrumental actions from Pavlovian CRs (cf. Dickinson and Balleine, ). Specifically, whereas an agent should be able to withhold and/or flexibly alter (e.g., reverse) the direction of an instrumental response to obtain an outcome (cf. Dickinson, ), the same is not true of Pavlovian CRs. Thus, whereas it is clear that rats are capable of withholding a lever press response to receive a pellet outcome (Davis and Bitterman, ) and, further, that this response can be bidirectional; a lever can be pushed either up or down to gain a reward (Bolles et al., ; Dickinson et al., 1996), Pavlovian CRs are not open to such adjustment (e.g., Hershberger, 1986), nor can the response be withheld during the stimulus to gain the reward (Sheffield, 1965; Williams and Williams, 1969; Holland, 1979).
These examples demonstrate that Pavlovian CRs are controlled by S-O relations. In contrast, evidence suggests that, in instrumental conditioning, development of the instrumental action can be controlled by two other distinct forms of learning process. Considerable evidence suggests that instrumental actions can be goal-directed and controlled by the encoding of specific response-outcome (R-O) relationships. Much of this evidence has been provided by outcome devaluation studies (e.g., Adams and Dickinson, ; Colwill and Rescorla, ; Dickinson and Balleine, ). In such studies animals are trained to perform a response for a particular outcome, the value of which is subsequently reduced by feeding it to satiety or repeatedly pairing it with lithium chloride to induce illness. The animal is then tested for its propensity to make that response under extinction (i.e., in the absence of feedback from outcome delivery). If the animal subsequently shows reduced performance of the response previously paired with the now devalued outcome, this can be taken as evidence that the response is goal-directed (Dickinson and Balleine, ) because it is governed by both: (1) a representation of the outcome as a “goal” and (2) a representation of the contingency between performance of the action and access to the outcome. The absence of feedback on test ensures that the second criterion is met because the animal can only rely on its prior knowledge of the R-O contingency to show the requisite reduction in performance.
Importantly, continuing performance on a lever after devaluation, as has been reported after extended training, demonstrates that performance is sometimes not guided by its relation with the outcome. Such demonstrations (Adams, ; Dickinson et al., ; Yin et al., 2004, 2006; Lingawi and Balleine, 2012) have been argued to reflect the behavioral development of habits. Habits are not guided by the R-O relation but, rather, reflect the role of the outcome as a reinforcer, strengthening the relation between prevailing stimuli (S) such as the context and the response (R). Behavioral and neurological evidence (Dickinson et al., ; Yin et al., 2004, 2005a,b) suggests that S-R and R-O relations are not mutually exclusive and develop in parallel with the influence over performance shifting across the course of training. Although the behavioral and neural processes that control habitual actions are important and of increasing interest, in this review we will refer primarily to goal-directed instrumental action whose performance is under the control of the R-O relation.
Finally, although the learning processes controlling the Pavlovian CR are distinct from those controlling instrumental actions, the latter actions can be influenced by specific retrieval-related effects of Pavlovian stimuli, an effect demonstrated using the Pavlovian-instrumental-transfer (PIT) paradigm. In such procedures, Pavlovian S-O and instrumental R-O relations are trained separately, and the ability of the Pavlovian stimuli to modulate instrumental performance is measured in an extinction test. The typical finding is that, on test, stimulus presentations promote responding on the instrumental action that was paired with the same outcome during training. For example, Colwill and Rescorla () showed that a tone that had been paired with pellets promoted the performance of instrumental actions that had also been paired with pellets, relative to actions that earned a different outcome during training. This specific PIT effect requires the ability of the animal to retrieve specific R-O relations based on the ability of the Pavlovian stimulus to evoke a representation of the outcome. As a consequence, this effect is often characterized in terms of the formation of an S-O → R process in which the stimulus based retrieval of a specific outcome causes the animal to retrieve its specific associated action (see Balleine and Ostlund, ; Balleine and O’Doherty, , for discussion).
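The S-O → R account of specific PIT can be illustrated with a minimal sketch. It assumes a toy design in which two stimuli and two levers were each trained separately with pellets or sucrose; the stimulus and lever names and the helper `pit_biased_response` are hypothetical and serve only to show the two-step retrieval, not to model any particular experiment.

```python
# Hypothetical training histories (illustrative names, not taken from any study).
STIMULUS_OUTCOME = {"tone": "pellets", "noise": "sucrose"}              # Pavlovian S-O
RESPONSE_OUTCOME = {"left_lever": "pellets", "right_lever": "sucrose"}  # instrumental R-O


def pit_biased_response(stimulus):
    """Return the response that shares its trained outcome with the stimulus."""
    # Step 1 (S -> O): the stimulus retrieves a representation of its outcome.
    outcome = STIMULUS_OUTCOME[stimulus]
    # Step 2 (O -> R): that outcome retrieves the action trained with it.
    matches = [r for r, o in RESPONSE_OUTCOME.items() if o == outcome]
    return matches[0] if matches else None


for stimulus in STIMULUS_OUTCOME:
    print(stimulus, "->", pit_biased_response(stimulus))
```

On this simplified account, removing either mapping should abolish the selective bias, which is why specific PIT is taken to depend on both S-O and R-O learning.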
In the remainder of the paper, we examine the aforementioned thalamic structures and their role in instrumental conditioning, focussing specifically on their role in goal-directed actions. With regard to the issues above, we will attempt to focus on actions that have been shown to be acquired and maintained by their contingent relationship to, and the value of, their consequences, rather than by antecedent stimuli. Where relevant, we will also point to issues of behavioral control that affect interpretation and may require clarification in future studies.
Anterior thalamic nuclei
Several studies spanning the late 1970s–early 2000s proposed that the anterior thalamic nuclei (ANT: see Figure 1B) play a role in the regulation of behavior in discrimination tasks involving instrumental responding. In particular, a series of experiments by Gabriel et al. (1977, 1983, 1989) found several lines of evidence to suggest ANT involvement in the learning that underlies performance in a series of avoidance and appetitive discrimination tasks in rabbits.
Figure 1
The earliest of these studies examined unit recordings from the anterior cingulate cortex (AC), the reciprocally connected ANT, or both, during an aversive avoidance task. In this task the presentation of a tone stimulus (S+) preceded the presentation of a footshock. Rabbits also received presentations of a different frequency tone stimulus (S−) that did not predict shock. This procedure was carried out in a running wheel and the consequence of the rabbit performing a wheel turn during the S+ presentation was avoidance of the footshock as well as termination of the S+. Similar responses during S− presentations also terminated the S−. Behaviorally, rabbits learnt this task relatively well, taking 4–5 sessions on average to reach a criterion of 9–10 responses to the S+ and 9–10 non-responses to the S− (Gabriel et al., 1977).
The first examination of the AC-ANT pathway using this task was conducted by Gabriel and colleagues (Gabriel et al., 1977). In this study it was found that neuronal activity increased from baseline in both the AC and ANT in the 15–25 ms following stimulus onset and decreased from 35–75 ms, then increased again at 75 ms where it continued until 200 ms when recording ceased. This response was greater in magnitude to the S+ than the S− in the first 100 ms and, as a consequence, the authors proposed that these differential neural responses reflected discrimination learning and that this information was used to evoke a behavioral response to the S+ over the S−. This experiment was one of the first attempts to apply a psychological function to a thalamic region that could be separated from the regulation of some other general function. In particular, the authors claimed that arousal and body orientation could not have influenced the results because these states should have been the same prior to the onset of both the S+ and the S− such that any differential responding to each stimulus could only be elicited as a result of their differential relationships with the footshock.
A later study (Gabriel et al., 1989) demonstrated a causal role for the ANT in regulating the underlying learning in the discriminative avoidance task outlined above. Gabriel et al. (1983) had previously shown that bilateral ANT lesions eliminated excitatory responses to the S+ in the cingulate cortex and, as such, they hypothesized that lesioning the ANT might also affect behavior in this task. They found that rabbits with combined lesions of the MD and ANT were unable to reach criterion, whereas rabbits with only MD lesions did not differ from controls. Rabbits with MD lesions did show some impairment relative to controls when their percentage of correct responses to the S+ relative to S− was considered, but rabbits with combined ANT/MD lesions were more impaired than either group (see Figure 1A). Again, the authors claimed that these differences could not be attributed to deficits in the general processes of orienting or autonomic responses to the stimuli as these were intact in lesioned animals. Further, the aversive footshock was argued to be similarly effective in all of the rabbits. As a consequence, the authors attributed the impaired performance to a deficit in learning.
There have been several follow-up studies replicating and expanding on these earlier effects. A notable example is that of Smith et al. (2002) who used an appetitively motivated discrimination task to examine the role of ANT and MD in appetitive conditioning. For this task, a water reward was given after head extension and oral contact with a spout following a tone S+ presentation whereas no reward was given after the (alternate frequency) tone S−. Rabbits with limbic thalamic lesions (spanning ANT and MD) were severely impaired in their acquisition of the task, but did eventually reach criterion. Further, cingulate cortical neurons developed discriminative neuronal responses (S+ > S−) in controls but not lesioned rabbits. These results were interpreted as implicating the limbic thalamic-AC pathway in associative learning more generally, rather than aversive avoidance learning specifically.
These and other studies (e.g., Sparenborg and Gabriel, 1992; Gabriel et al., 1995) represent, therefore, a significant body of work implicating the ANT-AC pathway and to a lesser extent the MD, in discrimination learning. It should be emphasized that the authors did not claim a role for this pathway in the regulation of instrumental behavior per se, but rather referred to their discrimination task as requiring an instrumental response. However, none of these studies included a specific test of the bidirectionality or omission of these responses, so it remains open to question as to whether they were actually instrumental or subject to other, particularly Pavlovian, contingencies. The head extension response in particular (Smith et al., 2002) seems an unlikely candidate for an instrumental response as it comprises a food approach behavior, which cannot be withheld or flexibly performed to achieve a desired outcome. The wheel turn response, on the other hand, comprises a better candidate for an instrumental response as it has been shown to be sensitive to omission (Wilson et al., 1987). Although to our knowledge bidirectional performance of wheel turning has not been demonstrated, it is not unreasonable to think that if a rabbit can turn a wheel in one direction to avoid a shock it could turn it in the opposite direction for the same outcome.
What is not clear from these studies, however, is the type of relation governing performance in these particular tasks. Because footshock occurred only in the presence of the S+, it is possible that in spite of its potentially instrumental nature, wheel turn responding in the presence of the S+ simply constituted a conditional response governed by S-O relations. Indeed, if wheel-turning might be considered a form of escape, which is an unconditional response appropriate to footshock, then this response could even be seen to fulfil Pavlov’s (1927) criterion of stimulus substitution. Even if we do accept that there was an instrumental contingency between wheel turning and shock avoidance, the fact that performance of this response only led to the desired outcome (i.e., footshock avoidance) in the presence of the S+ creates the possibility that it was under Pavlovian control in a manner similar to that observed during PIT. If this were the case it would again imply that the ANT was mediating performance through the regulation of S-O or S-O-R relations, rather than the R-O relation, as discussed previously. In order to separate these possibilities it would have been necessary to show that the wheel turn was governed by its contingency with the footshock avoidance, independent of the S-O contingency. For example, Grindley (1932) showed that guinea pigs that had learned to turn their head to the left or right every time a buzzer sounded to gain a carrot reward would readily reverse the direction of head turning when the instrumental contingency was reversed but the S-O relation between the buzzer and carrot remained constant. Likewise, if Gabriel et al. (1977, 1983, 1989) had shown that animals that had initially learned to turn the wheel in one direction to avoid shock could then learn to turn it in the opposite direction to avoid shock, independent of the continuing tone-shock contingency, this would suggest that the response was governed by the R-O, not the S-O contingency. Therefore, although elegant and among the first to assign a psychological function to a thalamic nucleus outside of general physiological functions, the research by Gabriel and colleagues leaves open the question of which type of relation governed behavior in these tasks and therefore which of these processes is regulated by the ANT.
A subsequent study by Corbit et al. (
Mediodorsal thalamus
A second thalamic candidate that has been examined within the literature for its role in the regulation of instrumental behaviors is the MD (see Figure 2E). As mentioned above, in some of the experiments conducted by Gabriel et al. (1989) and Smith et al. (2002) the ANT was not the only thalamic target: some of the manipulations also included the MD. Although the pattern of results seemed to suggest a greater deficit when the ANT and MD were both targeted than when the MD was targeted alone (Gabriel et al., 1989), rabbits with lesions of the MD alone did show some deficit relative to controls. However, because these tasks confound Pavlovian and instrumental processes, it is difficult to extract information about the involvement of the MD in regulating instrumental behavior from these results.
Figure 2

(A) Reproduced from Buchanan (
Another early attempt at examining the role of MD in instrumental conditioning also involved rabbits with MD lesions but examined performance in an eyeblink avoidance conditioning task (Buchanan,
In the same series of experiments they used to examine the ANT, Corbit et al. (
Ostlund and Balleine (2008) later re-examined the role of the MD in regulating instrumental performance. They again examined the effects of MD lesions but in this instance the lesions were performed after instrumental training. These post-training lesions produced a very different effect: although pre-training lesions abolished outcome devaluation, it was unaffected by post-training lesions, suggesting that the MD plays a role in the acquisition of goal-directed behaviors but not their expression. This finding suggests that the MD might play a role similar to that of the prelimbic cortex (PL), which has been similarly found to mediate the acquisition but not expression of goal-directed behavior (Ostlund and Balleine, 2005), but differentiates it from the posterior dorsomedial striatum (pDMS), which has been shown to mediate both acquisition and expression (Yin et al., 2005a,b).
In a second experiment, Ostlund and Balleine (2008) assessed PIT. In the Pavlovian training stage a tone stimulus was paired with pellets or sucrose and a white noise stimulus was paired with the alternate outcome. After eight days rats began the instrumental phase in which the left lever was paired with pellets or sucrose and the right lever paired with the other outcome. On test, the Pavlovian stimuli were presented while the rats were allowed to press both levers in the absence of outcome delivery. As is typically found, the Pavlovian cues biased performance towards the lever delivering the outcome predicted by the stimuli despite the rats never previously experiencing the stimuli and levers in the same session (Figure 2B). Rats with post-training MD lesions were unable to perform this task and pressed both levers equally during stimulus presentations. This result suggests that the MD not only governs reward guided actions but also stimulus guided actions, a result that offers some explanation as to why MD lesions, that impair goal-directed performance in the absence of explicit Pavlovian cues (Corbit et al.,
The last experiment in the series conducted by Ostlund and Balleine (2008) examined whether MD lesions affected performance during a Pavlovian contingency degradation task that employs alterations in the predictive S-O relationship. For this task the rats continued to receive the same S-O pairings received in previous Pavlovian training, but one of these outcomes was also delivered unpaired with any stimuli. This served to degrade the contingency between the stimulus and that outcome: Sham rats selectively reduced the time they spent in the magazine during presentations of that stimulus. Rats with MD lesions, on the other hand, reduced responding to both stimuli, suggestive of a specific deficit in the encoding of S-O relations.
Taken together, these experiments demonstrate the complex nature of the MD’s role in instrumental behavior. On the one hand, pre-training MD lesions impaired the acquisition of R-O contingencies and the selective degradation of one of these contingencies, suggesting that an intact MD is crucial for the acquisition of instrumental behaviors guided by R-O relations. On the other hand post-training MD lesions left outcome devaluation intact whilst impairing Pavlovian-to-Instrumental transfer and Pavlovian contingency degradation. Perhaps the simplest explanation for the multiple functions of the MD lies in the diverse connections it maintains with the frontal cortex. Connections between the MD and the prelimbic prefrontal cortex of the rat are, at least anatomically, the best studied (Groenewegen, 1988; Kuroda et al., 1993), but the MD also maintains strong connections with the orbitofrontal cortex (OFC) particularly its lateral regions (Krettek and Price, 1977). Recent studies have found that, whereas the prelimbic area is critical for the acquisition of goal-directed instrumental actions, it plays little if any role in appetitive Pavlovian conditioning or in the influence of Pavlovian cues on instrumental performance (Corbit and Balleine,
Prelimbic-mediodorsal thalamus interactions: the effect of disconnecting the thalamo-cortical pathway on goal-directed instrumental actions
The heavy interconnectedness of the MD and PL (Groenewegen, 1988) and their similar role in the acquisition of goal-directed instrumental actions led us to hypothesize that the encoding of the R-O contingency depends on the PL-MD pathway. In particular, we predicted that a functional disconnection of PL and MD would abolish goal-directed behavior. By contrast, we predicted that there would be no deficit in rats that received a functional PL/MD disconnection in outcome-induced reinstatement performance that tests the acquisition of O-R rather than R-O contingencies, particularly as bilateral PL lesions leave reinstatement unaffected (Ostlund and Balleine, 2005) as do MD lesions (Ostlund and Balleine, 2008).
Not only are the connections between PL and MD large and reciprocal, the PL projects to the MD in both the ipsilateral and contralateral hemispheres (Buchanan,
First we demonstrated the efficacy of lesioning the CC in severing these contralateral PL-MD projections. After this lesion had been made the retrograde tracer fluorogold (FG) was injected unilaterally into the MD of five Long-Evans rats. Brains were later examined for the extent of labeling in the PL in both hemispheres: that which was ipsilateral and that which was contralateral to FG injection. From Figure 3A it is clear that almost no FG labeling was observed in the PL contralateral to the MD injection site relative to that observed in a control rat that had no CC lesion. This suggests that the CC lesion was successful in severing contralateral projections between these structures. By contrast, it is also clear from this figure that ipsilateral projections were unaffected by the CC lesions: labeling in the hemisphere ipsilateral to the injection site looked similar in both lesioned rats and unlesioned control rats.
Figure 3

(A) Shows the extent of fluorogold labeling in prelimbic cortex (PL) after receiving an injection of retrograde tracer FG into MD and either electrolytic (Contra and Ipsi) or sham (control) lesions of corpus callosum (CC). Horizontal section (middle panel) shows injection site in MD as well as CC lesion. CC lesions did not affect ipsilateral projections (no difference in labeling in Ipsi and control, right panel) but were effective in disconnecting contralateral projections (very little labeling in Contra relative to control, left panel). (B) Mean (± SEM) lever presses per min for the control groups (Groups Ipsi and Sham) and Group Contra that suffered a functional PL-MD disconnection (i.e., CC lesion plus contralateral N-methyl-D-aspartate (NMDA)-induced lesions of PL and MD). For all statistical analyses Group Sham and Ipsi did not differ on any measure (all Fs < 1) and therefore were averaged across for further analysis. All rats linearly acquired lever press responding, F(1, 19) = 226.00, p = .00, and groups did not differ on acquisition, F(1, 19) = 2.194, p = .16. (C) Mean (± SEM) lever press responding per min during outcome devaluation testing. Groups did not differ in overall responding, F(1, 19) = 1.19, p = .29, but there was a main effect of devaluation (averaged over group), F(1, 19) = 18.54, p = .00. There was a significant interaction, F(1, 19) = 5.79, p = .026, suggesting that both the control groups responded selectively on the nondevalued lever relative to the devalued lever (simple effects: Group Sham, F(1, 19) = 10.08, p = .008, Group Ipsi, F(1, 19) = 14.76, p = .001) but that Group Contra responded equally on both levers (simple effect: F(1, 19) = .24, p = .63). (D) Mean (± SEM) lever press responding per min during outcome-induced reinstatement testing. There was a main effect of reinstatement, F(1, 19) = 105.38, but no group x reinstatement interaction, F(1, 19) = 3.88, p = .065. 
Although this interaction might be considered marginal, simple effects show that rats in each group pressed the reinstated lever more than the other lever on test, Group Sham, F(1, 19) = 54.31, p = .00, Group Ipsi, F(1, 19) = 39.6, p = .00, and Group Contra, F(1, 19) = 17.81, p = .00.
Once the efficacy of the CC lesion in severing these projections had been determined, 30 experimentally naïve Long-Evans rats received CC lesions combined with sham or excitotoxic PL and MD lesions with the aim of examining the effect of disconnecting these structures on outcome devaluation and outcome-induced reinstatement. Eight of these had misplaced lesions or damage that extended beyond the CC and thus were excluded, leaving twenty-two rats for analysis. There were three groups: Group Sham (n = 8), Group Ipsi (n = 7), and Group Contra (n = 7). Each rat in each group received a CC lesion. Rats in Group Ipsi received additional excitotoxic lesions of PL and MD in the same (ipsilateral) hemisphere such that these structures were disconnected in that hemisphere but an intact PL-MD pathway remained in the opposite hemisphere. Rats in Group Contra received additional excitotoxic lesions in alternate hemispheres such that the PL-MD pathway was disconnected in both. Therefore Groups Ipsi and Contra differed only in the hemispheric location but not the overall amount of damage. Rats in Group Sham controlled for the effects of receiving a CC lesion and received sham PL and MD lesions (in which the needle was inserted but no excitotoxin injected). Half of the sham lesions were given ipsilaterally and half were given contralaterally. In addition, the hemispheres in which damage occurred were counterbalanced within each group (i.e., left vs. right).
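The logic of this design can be made explicit with a minimal sketch. It assumes, as a simplification, that each hemisphere's PL sends one projection to the MD on the same side and one crossed projection to the MD on the opposite side, that a CC lesion removes every crossed projection, and that a lesion removes a structure completely. The function `intact_pl_md_routes` and the left/right assignments are hypothetical and purely illustrative.

```python
from itertools import product

HEMISPHERES = ("left", "right")


def intact_pl_md_routes(pl_lesion, md_lesion, cc_lesioned=True):
    """List the (PL side, MD side) projections that survive a lesion pattern.

    pl_lesion / md_lesion: hemisphere of the lesion, or None for a sham lesion.
    cc_lesioned: if True, crossed projections are treated as severed (assumption).
    """
    routes = []
    for pl_side, md_side in product(HEMISPHERES, repeat=2):
        crossed = pl_side != md_side
        if crossed and cc_lesioned:
            continue  # crossed projection removed by the CC lesion
        if pl_side == pl_lesion or md_side == md_lesion:
            continue  # origin or target structure has been lesioned
        routes.append((pl_side, md_side))
    return routes


# Example lesion placements for each group (hemispheres chosen arbitrarily).
groups = {"Sham": (None, None), "Ipsi": ("left", "left"), "Contra": ("left", "right")}
for name, (pl, md) in groups.items():
    print(name, intact_pl_md_routes(pl, md))

# Without the CC lesion, contralateral PL and MD lesions leave a crossed route.
print("Contra, CC intact:", intact_pl_md_routes("left", "right", cc_lesioned=False))
```

Under these assumptions Group Ipsi retains one intact pathway and Group Contra retains none, whereas the same contralateral lesions without a CC lesion would spare a crossed projection. This is the reason the CC lesion is needed for a full functional disconnection.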
For the next eight days rats received instrumental training. For half of the rats in each group the left lever earned pellets and the right lever earned sucrose. The remaining rats were trained on the opposite R-O contingencies. Acquisition of lever press responding is shown in Figure 3B. From this figure it is clear that all groups acquired lever press responding and that the groups did not differ (see figure caption for statistical analysis). Subsequent to lever press training rats were tested for knowledge of these contingencies. There were two tests, one with pellets and one with sucrose (counterbalanced). Prior to each test rats received free access to one of the outcomes to specifically satiate them on this outcome, thereby reducing its value relative to the non-prefed outcome (cf. Balleine and Dickinson,
Finally, we examined whether rats in each group would selectively reinstate responding on the lever that had been associated with a particular outcome during training. Specifically, after 15 min of extinction on both levers, rats received four reinstatement trials separated by 7 min of extinction in which a pellet or sucrose outcome was freely delivered and responding recorded for the next 2 min. Outcomes were delivered in the order: pellets, sucrose, sucrose, pellets. It was expected that pellet delivery would reinstate responding on the lever that had earned pellets during training, and similarly sucrose delivery would reinstate responding on the sucrose lever. Results are shown in Figure 3D. It is clear from this figure that all groups showed greater responding following outcome delivery and that this increase in responding was selective for the reinstated lever (i.e., reinstated > other, see figure caption for statistical analysis). Although bilateral lesions of either PL or MD have no effect on outcome-induced reinstatement, it was important to demonstrate that the functional disconnection of these structures also left reinstatement performance intact. This is because it rules out several potential explanations of the impairment in the outcome devaluation test, including a simple deficit in discriminating between levers and outcomes.
Together, these results show that disconnecting the PL-MD pathway creates a deficit in outcome devaluation performance whilst leaving outcome-induced reinstatement intact. The deficit observed during outcome-devaluation suggests that the MD does rely on inputs from the PL (or vice versa) for accurate performance in this task. Intact reinstatement suggests that this deficit was not a result of impaired discrimination, and the fact that there was no difference in lever-press acquisition suggests that the deficit in outcome devaluation performance cannot have resulted from a lack of opportunity to learn the R-O contingencies. Rather, the pattern of results suggests that this group suffered a specific deficit in using R-O contingencies to guide action selection such that they pressed both levers equally on test.
It is worth pointing out here that the success of the novel surgical technique involving electrolytically lesioning the CC in inducing a full functional disconnection of the PL and MD, as evidenced by the lack of FG labeling observed in the PL contralateral to the MD injection site as well as the behavioral deficit observed, could have wide-ranging implications. In particular, researchers who might have previously wished to examine the effect of disconnecting prefrontal cortical (and other cortical) structures from subcortical structures with which they share ipsilateral and contralateral connections now have a potentially viable technique with which to do so. For example, Hunt and Aggleton (1998) found that lesions of both regions produce similar deficits in shifting response rules during a radial arm maze task. Likewise, Balleine and Dickinson (
The parafascicular thalamic nucleus
The final region we consider for its role in instrumental behavior is the parafascicular thalamic nucleus (PF; see Figure 4C). The PF was one of the first thalamic regions to be assessed for its role in instrumental behavior. Delacour (
Figure 4. (A) Reproduced from Brown et al. (
The suggestion that the PF might regulate the flexibility of instrumental behavior was re-visited by Minamimoto et al. (2009) using primates as subjects. They recorded the responses of long latency facilitation (LLF) neurons in the centro-median parafascicular nuclear complex (CM-PF; the primate homologue of the PF) of two monkeys during a GO-NOGO task. This task requires monkeys to either respond (“GO”) or withhold responding (“NOGO”) to particular stimuli to receive a large or small water reward. LLF neurons in the CM-PF showed an interesting pattern of responding during the trial blocks when the GO response was paired with the large reward and the NOGO response paired with the small reward. Specifically, after several NOGO trials the likelihood of a GO trial increased, in parallel with the likely increase in the monkey’s expectation of a GO response. When the NOGO stimulus was then unexpectedly presented LLF activity increased, but only when a NOGO response was produced. When the response was not produced LLF activity remained weak or silent, indicating that the presentation of an unexpected stimulus alone was not sufficient for this increase in LLF activity. The authors interpreted this as showing that CM-PF LLF neurons drive a kind of “rebias” process that occurs when the animal expects to produce one response but quickly changes to another. This, like Delacour (
More recently, Brown et al. (
Although it is possible that this efflux in ACh did indeed reflect a facilitation of flexibility, this is not the only interpretation of these results. This is because behavioral flexibility implies an exclusively instrumental process, and it is particularly difficult to disentangle the Pavlovian and instrumental processes that might be employed during performance in T-maze tasks such as the one employed by Brown et al. (
A final point to be made about these experiments (Brown et al.,
More recently we have investigated whether the PF mediates behavioral flexibility via its afferents to the pDMS using unambiguous manipulations of the R-O contingency (Bradfield et al.,
These behavioral tasks were repeated in later experiments to test the effects of functionally disconnecting the PF-pDMS pathway. Prior to these experiments we had injected the retrograde tracer FG into the pDMS and evaluated labeling in the PF. This confirmed that the PF does project to the pDMS and that the pathway is entirely lateralised. Rats then received sham, ipsilateral, or contralateral PF/pDMS lesions. Rats with ipsilateral PF-pDMS lesions (group Ipsi) retained an intact PF-pDMS pathway in the opposing hemisphere, whereas contralateral PF and pDMS lesions (group Contra) ensured rats had no intact pathway in either hemisphere. Thus both the Sham and Ipsi groups controlled for the behavior of group Contra. Rats in group Contra showed the same pattern of results as rats with bilateral PF lesions. That is, they showed intact initial acquisition of R-O contingencies, but impaired contingency degradation and impaired acquisition of the reversed R-O contingencies. In contrast, Sham and Ipsi rats showed intact performance in all tasks. After the rats were sacrificed their brains were sectioned and examined for p-Ser240-244-S6rp intensity in choline acetyltransferase (ChAT) immunoreactive neurons in the non-lesioned pDMS. p-S6rp was recently shown to reflect the activation levels of CINs particularly well (Bertran-Gonzalez et al.,
Given that PF also innervates the aDMS, and that Brown et al. (
This role of the PF (via inputs to the pDMS) differs from that of the MD in instrumental behavior in that the former is necessary for interlacing new and existing R-O contingencies, whereas the latter is necessary for the initial acquisition of R-O contingencies. Thus these two thalamic regions play different but vital roles: whereas an intact MD is critical for a naïve animal to carry out various tasks to achieve an outcome, an intact PF is critical for animals to continue to perform these tasks when environmental contingencies change. Any animal that lacks either function would be at a distinct disadvantage.
The results regarding the PF are also consistent with another critical function: the regulation of what recent computational views of instrumental conditioning have referred to as “state prediction errors”. State prediction errors differ from reward prediction errors, which regulate learning during both Pavlovian and instrumental conditioning and for which the neural mechanisms have been reliably established (Schultz and Dickinson, 2000; Waelti et al., 2001; Steinberg et al., 2013). The idea of “state” prediction has arisen with the recent increase in popularity of computational models (e.g., Daw et al.,
Experimentally, state and reward prediction errors tend to co-occur and are difficult to separate behaviorally (Schoenbaum et al., 2013). One experiment that does separate them, however, is the reversal of existing R-O contingencies. Upon entering the initial state during the reversal phase of the experiment, the animal expects that pressing the left lever (for example) will lead it to the state in which pellets are delivered to the magazine. When the animal is surprisingly transitioned into a different state in which sucrose is delivered instead, a large state prediction error is generated. If, however, it is assumed that rats value pellets and sucrose equally, then reward prediction error is zero because there is no discrepancy between the actual and expected reward. Therefore, the inability of rats with a compromised PF-pDMS pathway to accurately learn the reversed contingencies is consistent with an inability of these rats to effectively encode state prediction error. To be more specific, it is consistent with an inability to encode a reduction in contingency learning as a result of state prediction error. This is because the performance of PF-pDMS-compromised rats on this task was indiscriminate (i.e., they pressed equally on both levers at test, refer to figure). If these rats were incapable of encoding an increase in learning as a result of state prediction error, they should show no evidence of having learned the new contingencies (e.g., “left lever surprisingly leads to the state in which sucrose is delivered”) and show greater responding on the now-devalued lever than the nondevalued lever on test.
If, however, these rats were specifically incapable of encoding a reduction in learning that resulted from state prediction error (e.g., “left lever no longer leads to the state in which pellets are delivered”) they would fail to unlearn the old contingencies whilst still learning about the new contingencies, and their performance would be confused between the two on test. That is, they should respond equally on the devalued and nondevalued levers, as observed.
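The reasoning about reversal can be illustrated with a small numerical sketch. Everything here is an assumption made for illustration and is not taken from the experiments: the update rule is a simple form commonly used in model-based reinforcement learning (state prediction error = 1 minus the estimated probability of the transition that actually occurred), and the learning rate, trial count, and the names `update_transitions` and `run_reversal` are hypothetical. The flag `can_reduce=False` stands in for an inability to encode the reduction in learning.

```python
def update_transitions(T, action, observed, lr=0.5, can_reduce=True):
    """Update transition estimates for one action after one trial.

    T: dict action -> dict outcome_state -> estimated transition probability.
    Returns the state prediction error for the observed transition.
    """
    spe = 1.0 - T[action][observed]
    for state in T[action]:
        if state == observed:
            T[action][state] += lr * spe      # increase: learn the new contingency
        elif can_reduce:
            T[action][state] *= (1.0 - lr)    # reduction: unlearn the old contingency
    return spe


def run_reversal(can_reduce, n_trials=10):
    """Left lever earned pellets in training; in reversal it earns sucrose."""
    T = {"left": {"pellets": 1.0, "sucrose": 0.0}}
    first_spe = None
    for trial in range(n_trials):
        spe = update_transitions(T, "left", "sucrose", can_reduce=can_reduce)
        if trial == 0:
            first_spe = spe
    return first_spe, T["left"]


# Assumed equal outcome values, so the first reversal trial carries no
# reward prediction error even though the state prediction error is maximal.
values = {"pellets": 1.0, "sucrose": 1.0}
rpe_first = values["sucrose"] - values["pellets"]

spe_intact, intact = run_reversal(can_reduce=True)
spe_impaired, impaired = run_reversal(can_reduce=False)
print(spe_intact, rpe_first)
print(intact)
print(impaired)
```

Under these assumptions the intact learner ends with the left lever linked almost exclusively to sucrose, whereas the learner that cannot reduce ends with the left lever linked strongly to both outcomes, which would leave choice after devaluation of either outcome indiscriminate.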
Contingency degradation results also support this conclusion. PF-pDMS compromised rats, unlike controls, failed to reduce their responding on the degraded lever. State prediction error contributes to the reduction in learning about the degraded lever-outcome contingency during contingency degradation. Specifically, there is a state prediction error when an outcome that was previously paired only with lever pressing is also delivered outside of the lever press contingency. When the outcome was dependent on lever pressing alone, the animal learned that only pressing the lever in the initial state would transition it to the next state in which a pellet (for example) is delivered to the magazine. During contingency degradation the animal is surprisingly transitioned to this state without pressing the lever, generating a state prediction error. This state prediction error triggers an increase in learning (that favours learning about context-outcome relations) but also a reduction in learning about the contingency between performing the lever press in the initial state and entering the “food delivered” state. It is this reduction that leads to decreased responding on the degraded lever. Thus, the fact that the PF-pDMS compromised rats do not decrease responding on the degraded lever throughout training is again consistent with an inability of those rats to process state prediction error in a manner that leads to a reduction in R-O contingency knowledge.
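What "degrading" a contingency means can be shown with a simple calculation. The sketch below uses the standard contingency measure ΔP = P(outcome | press) − P(outcome | no press), which is an illustrative choice and not a quantity reported in the text; the time-bin data and the function name `delta_p` are hypothetical.

```python
def delta_p(bins):
    """Contingency between pressing and outcome delivery.

    bins: list of (pressed, outcome) booleans, one pair per time bin.
    Assumes at least one bin with a press and one without.
    """
    with_press = [outcome for pressed, outcome in bins if pressed]
    without_press = [outcome for pressed, outcome in bins if not pressed]
    p_given_press = sum(with_press) / len(with_press)
    p_given_no_press = sum(without_press) / len(without_press)
    return p_given_press - p_given_no_press


# Hypothetical sessions of 20 time bins each.
# Nondegraded lever: the outcome only ever follows a press.
nondegraded = [(True, True)] * 2 + [(True, False)] * 8 + [(False, False)] * 10
# Degraded lever: the same outcome is also delivered, equally often, without a press.
degraded = [(True, True)] * 2 + [(True, False)] * 8 \
    + [(False, True)] * 2 + [(False, False)] * 8

dp_nondegraded = delta_p(nondegraded)
dp_degraded = delta_p(degraded)
print(dp_nondegraded, dp_degraded)
```

In this illustration the press still earns the outcome at the same rate in both sessions; what changes is that free deliveries remove the difference that pressing makes, which is the change a control animal is expected to detect and respond to by pressing less.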
It is important to mention that, although broadly consistent with this view, Schoenbaum et al. (2013) have developed an alternative interpretation of these results. In a similar fashion to our interpretation (Bradfield et al.,
Conclusion
Research regarding the role of various thalamic nuclei in instrumental behavior has increased in recent years. One of the earliest regions considered was the ANT. Although early indications appeared to suggest that ANT did indeed mediate instrumental behavior, careful examination of these tasks revealed that the learning processes governing behavior confound Pavlovian and instrumental processes. By contrast, when rats with ANT lesions were tested in free operant instrumental conditions they showed no deficits in a range of tasks (Corbit et al.,
Another region that has received attention for its role in regulating instrumental behavior is the MD. Again, early indications suggested a possible role for this region but did not employ tasks that clearly separate Pavlovian and instrumental relations. In contrast to the ANT, however, MD lesions were later found to affect performance in several free operant behavioral tasks, highlighting a specific role for this region in the regulation of goal-directed instrumental behavior (Corbit et al.,
Given that PL lesions affect the acquisition but not the expression of R-O contingencies in the same manner as MD lesions, we examined the effect of their disconnection in the current study. Because there are contralateral, as well as ipsilateral, connections between PL and MD, this required the adoption of a novel surgical technique that involved electrolytic lesions of the CC. Once the efficacy of this procedure in severing contralateral PL-MD connections had been established using the retrograde tracer FG, a functional disconnection of these structures was employed to examine the effect of this disconnection on various behavioral tasks. Specifically, all rats received Sham, ipsilateral or contralateral excitotoxic lesions of PL and MD in addition to a CC lesion. We found that outcome-induced reinstatement performance was intact in all groups, but that Group Contra showed a specific deficit in outcome devaluation testing. This suggests that the PL-MD pathway regulates learning of R-O contingencies in a manner that cannot be attributed to a deficit in discrimination or some other general process important to learning.
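The logic of this disconnection design can be laid out in a short sketch. It is a schematic under stated assumptions, not a model of the anatomy: each hemisphere is assumed to have one ipsilateral PL-MD pathway and one crossed pathway to the opposite hemisphere, crossed pathways are assumed to be severed by the CC lesion (as the text describes), and the choice of which side is lesioned, along with the function name `intact_pathways`, is hypothetical.

```python
from itertools import product

HEMISPHERES = ("left", "right")


def intact_pathways(pl_lesions, md_lesions, cc_lesioned):
    """List the (PL side, MD side) pathways that survive a set of lesions.

    pl_lesions / md_lesions: sets of hemispheres in which PL / MD is lesioned.
    cc_lesioned: True if the CC lesion has severed crossed pathways.
    """
    surviving = []
    for pl_side, md_side in product(HEMISPHERES, HEMISPHERES):
        crossed = pl_side != md_side
        if pl_side in pl_lesions or md_side in md_lesions:
            continue  # one end of this pathway has been lesioned
        if crossed and cc_lesioned:
            continue  # crossed pathway severed by the CC lesion
        surviving.append((pl_side, md_side))
    return surviving


# All groups receive the CC lesion; unilateral lesion sides are arbitrary here.
sham = intact_pathways(set(), set(), cc_lesioned=True)
ipsi = intact_pathways({"left"}, {"left"}, cc_lesioned=True)
contra = intact_pathways({"left"}, {"right"}, cc_lesioned=True)
# Without the CC lesion, contralateral lesions would spare a crossed pathway.
contra_no_cc = intact_pathways({"left"}, {"right"}, cc_lesioned=False)

print(sham, ipsi, contra, contra_no_cc)
```

Only the contralateral group with a CC lesion is left with no surviving pathway in this scheme, while the ipsilateral group keeps one intact hemisphere, which is why it serves as a control for the amount of tissue damage.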
Finally, the role of the PF was examined, in particular its role in the flexibility of instrumental behavior. Although several earlier studies implicated such a role for PF, again these tasks made it difficult to separate the influence of Pavlovian and instrumental processes. Our recent research by Bradfield et al. (
In summary, then, it is clear that there are multiple important and contrasting roles of various thalamic nuclei in the regulation of instrumental behavior. Given the wide connectivity of these nuclei with many striatal and cortical regions of interest, this is unsurprising. Future research will continue to uncover the specific role of these regions, particularly in the context of the complex interplay these regions enjoy with other structures in the brain.
Statements
Acknowledgments
The research reported in the manuscript was supported by grants to Bernard W. Balleine from the NIMH #MH56446, the NHMRC #633267, and a Laureate Fellowship from the Australian Research Council, #FL0992409.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
Adams, C. D. (1982). Variations in the sensitivity of instrumental responding to reinforcer devaluation. Q. J. Exp. Psychol. B 34, 77–98. doi: 10.1080/14640748208400878
Adams, C. D., and Dickinson, A. (1981). Instrumental responding following reinforcer devaluation. Q. J. Exp. Psychol. B 33, 109–122. doi: 10.1080/14640748108400816
Balleine, B. W., and Dickinson, A. (1998). Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology 37, 407–419. doi: 10.1016/s0028-3908(98)00033-1
Balleine, B. W., Leung, B. K., and Ostlund, S. B. (2011). The orbitofrontal cortex, predicted value, and choice. Ann. N Y Acad. Sci. 1239, 43–50. doi: 10.1111/j.1749-6632.2011.06270.x
Balleine, B. W., and O’Doherty, J. P. (2010). Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 35, 48–69. doi: 10.1038/npp.2009.131
Balleine, B. W., and Ostlund, S. B. (2007). Still at the choicepoint: action selection and initiation in instrumental conditioning. Ann. N Y Acad. Sci. 1104, 147–171. doi: 10.1196/annals.1390.006
Bertran-Gonzalez, J., Chieng, B. C., Laurent, V., Valjent, E., and Balleine, B. W. (2012). Striatal cholinergic interneurons display activity-related phosphorylation of ribosomal protein S6. PLoS One 7:e53195. doi: 10.1371/journal.pone.0053195
Bolles, R. C., Holtz, R., Dunn, T., and Hill, W. (1980). Comparisons of stimulus-learning and response learning in a punishment situation. Learn. Motiv. 11, 78–96. doi: 10.1016/0023-9690(80)90022-3
Bradfield, L. A., and Balleine, B. W. (2013). Hierarchical and binary relations compete for behavioural control during instrumental biconditional discrimination. J. Exp. Psychol. Anim. Behav. Process. 39, 2–13. doi: 10.1037/a0030941
Bradfield, L. A., Bertran-Gonzalez, J., Chieng, B., and Balleine, B. W. (2013). The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum. Neuron 79, 153–166. doi: 10.1016/j.neuron.2013.04.039
Brown, H. D., Baker, P. M., and Ragozzino, M. E. (2010). The parafascicular thalamic nucleus concomitantly influences behavioural flexibility and dorsomedial striatal acetylcholine output in rats. J. Neurosci. 30, 14390–14398. doi: 10.1523/jneurosci.2167-10.2010
Buchanan, S. L. (1994). Mediodorsal thalamic lesions impair acquisition of an eyeblink avoidance response in rabbits. Behav. Brain Res. 65, 173–179. doi: 10.1016/0166-4328(94)90103-1
Colwill, R. M., and Rescorla, R. A. (1985). Postconditioning devaluation of a reinforcer affects instrumental responding. J. Exp. Psychol. Anim. Behav. Process. 11, 120–132. doi: 10.1037//0097-7403.11.1.120
Colwill, R. M., and Rescorla, R. A. (1988). Relations between the discriminative stimulus and the reinforcer in instrumental learning. J. Exp. Psychol. Anim. Behav. Process. 14, 155–164.
Conejo, N. M., Gonzalez-Padro, H., Lopez, M., Cantora, R., and Arias, J. L. (2007). Induction of c-Fos expression in the mammillary bodies, anterior thalamus and dorsal hippocampus after fear conditioning. Brain Res. Bull. 74, 172–177. doi: 10.1016/j.brainresbull.2007.06.006
Corbit, L. H., and Balleine, B. W. (2003). The role of prelimbic cortex in instrumental conditioning. Behav. Brain Res. 146, 145–157. doi: 10.1016/j.bbr.2003.09.023
Corbit, L. H., Janak, P. H., and Balleine, B. W. (2007). General and outcome-specific forms of Pavlovian-instrumental transfer: the effect of shifts in motivational state and inactivation of the ventral tegmental area. Eur. J. Neurosci. 26, 3141–3149. doi: 10.1111/j.1460-9568.2007.05934.x
Corbit, L. H., Muir, J. L., and Balleine, B. W. (2003). Lesions of mediodorsal thalamus and anterior thalamic nuclei produce dissociable effects on instrumental conditioning in rats. Eur. J. Neurosci. 18, 1286–1294. doi: 10.1046/j.1460-9568.2003.02833.x
Coutureau, E., Marchand, A. R., and Di Scala, G. (2009). Goal-directed responding is sensitive to lesions to the prelimbic cortex or basolateral nucleus of the amygdala but not to their disconnection. Behav. Neurosci. 123, 443–448. doi: 10.1037/a0014818
Davis, J., and Bitterman, M. E. (1971). Differential reinforcement of other behaviour (DRO) – yoked-control comparison. J. Exp. Anal. Behav. 15, 237–241. doi: 10.1901/jeab.1971.15-237
Daw, N. D., Niv, Y., and Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704–1711. doi: 10.1038/nn1560
Delacour, J. (1969). Role of a medial thalamic structure in various types of instrumental defensive conditioning. Physiol. Behav. 4, 969–974. doi: 10.1016/0031-9384(69)90051-1
Dickinson, A. (1994). “Instrumental conditioning,” in Animal Learning and Cognition, ed. N. J. Mackintosh (San Diego, CA: Academic Press), 45–79.
Dickinson, A., and Balleine, B. (1994). Motivational control of goal-directed action. Anim. Learn. Behav. 22, 1–18. doi: 10.3758/bf03199951
Dickinson, A., Balleine, B., Watt, A., Gonzalez, F., and Boakes, R. A. (1995). Motivational control after extended instrumental training. Anim. Learn. Behav. 23, 197–206. doi: 10.3758/bf03199935
Dickinson, A., Campos, J., Varga, Z., and Balleine, B. W. (1996). Bidirectional control of instrumental conditioning. Q. J. Exp. Psychol. 49, 289–306. doi: 10.1080/713932637
Gabriel, M., Cuppernell, C., Shenker, J. I., Kubota, Y., Henzi, V., and Swanson, D. (1995). Mamillothalamic tract transection blocks anterior thalamic training-induced neuronal plasticity and impairs discriminative avoidance-behaviour in rabbits. J. Neurosci. 15, 1437–1445.
Gabriel, M., Lambert, R. W., Foster, K., Orona, E., Sparenborg, S., and Maiorca, R. R. (1983). Anterior thalamic lesions and neuronal-activity in the cingulate and retrosplenial cortices during discriminative avoidance-behaviour in rabbits. Behav. Neurosci. 97, 675–695. doi: 10.1037//0735-7044.97.5.675
Gabriel, M., Miller, J. D., and Saltwick, S. E. (1977). Unit-activity in cingulate cortex and anteroventral thalamus of rabbit during differential conditioning and reversal. J. Comp. Physiol. Psychol. 91, 423–433. doi: 10.1037/h0077321
Gabriel, M., Sparenborg, S., and Kubota, Y. (1989). Anterior medial thalamic lesions, discriminative avoidance-learning, and cingulate cortical neuronal-activity in rabbits. Exp. Brain Res. 76, 441–457. doi: 10.1007/bf00247901
Grindley, G. C. (1932). The formation of a simple habit in guinea pigs. Br. J. Psychol. 23, 127–147. doi: 10.1111/j.2044-8295.1932.tb00655.x
Groenewegen, H. J. (1988). Organisation of the afferent connections of the mediodorsal thalamic nucleus in the rat, related to the mediodorsal prefrontal topography. Neuroscience 24, 379–431. doi: 10.1016/0306-4522(88)90339-9
Hershberger, W. A. (1986). An approach through the looking-glass. Anim. Learn. Behav. 14, 443–451. doi: 10.3758/BF03200092
Holland, P. C. (1979). Differential effects of omission contingencies on various components of Pavlovian appetitive conditioned responding in rats. J. Exp. Psychol. Anim. Behav. Process. 5, 178–193. doi: 10.1037//0097-7403.5.2.178
Hunt, P. R., and Aggleton, J. P. (1998). An examination of the spatial working memory deficit following neurotoxic medial dorsal thalamic lesions in rats. Behav. Brain Res. 97, 129–141. doi: 10.1016/s0166-4328(98)00033-3
Krettek, J. E., and Price, J. L. (1977). Cortical projections of the mediodorsal nucleus and adjacent thalamic nuclei in the rat. J. Comp. Neurol. 171, 157–191. doi: 10.1002/cne.901710204
Kuroda, M., Murakami, K., Oda, S., Shinkai, M., and Kishi, K. (1993). Direct synaptic connections between thalamocortical axon terminals from the mediodorsal thalamic nucleus (MD) and corticothalamic neurons to MD in the prefrontal cortex. Brain Res. 612, 339–344. doi: 10.1016/0006-8993(93)91683-j
Lingawi, N. W., and Balleine, B. W. (2012). Amygdala central nucleus interacts with dorsolateral striatum to regulate the acquisition of habits. J. Neurosci. 32, 1073–1081. doi: 10.1523/jneurosci.4806-11.2012
Matsumoto, N., Minamimoto, T., Graybiel, A. M., and Kimura, M. (2001). Neurons in the thalamic CM-Pf complex supply striatal neurons with information about behaviorally significant sensory events. J. Neurophysiol. 85, 960–976.
Minamimoto, T., Hori, Y., and Kimura, M. (2009). Roles of the thalamic CM-PF complex-basal ganglia circuit in externally driven rebias of action. Brain Res. Bull. 78, 75–79. doi: 10.1016/j.brainresbull.2008.08.013
Negyessy, L., Hamori, J., and Bentivoglio, M. (1998). Contralateral cortical projection to the mediodorsal thalamic nucleus: origin and synaptic organisation in the rat. Neuroscience 84, 741–753. doi: 10.1016/s0306-4522(97)00559-9
Ostlund, S. B., and Balleine, B. W. (2005). Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning. J. Neurosci. 25, 7763–7770. doi: 10.1523/jneurosci.1921-05.2005
Ostlund, S. B., and Balleine, B. W. (2007). Orbitofrontal cortex mediates outcome encoding in Pavlovian but not instrumental conditioning. J. Neurosci. 27, 4819–4825. doi: 10.1523/jneurosci.5443-06.2007
Ostlund, S. B., and Balleine, B. W. (2008). Differential involvement of the BLA and MD in instrumental action selection. J. Neurosci. 28, 4398–4405. doi: 10.1523/jneurosci.5472-07.2008
Pavlov, I. P. (1927). Conditioned Reflexes: An Investigation of the Physiological Activity of the Cerebral Cortex. Oxford: Oxford University Press.
Paxinos, G., and Watson, C. (1998). The Rat Brain in Stereotaxic Co-Ordinates. 3rd Edn. New York: Elsevier.
Schoenbaum, G., and Roesch, M. (2005). Orbitofrontal cortex, associative learning, and expectancies. Neuron 47, 633–636. doi: 10.1016/j.neuron.2005.07.018
Schoenbaum, G., Stalnaker, T. A., and Niv, Y. (2013). How did the chicken cross the road? With her striatal interneurons of course. Neuron 79, 3–6. doi: 10.1016/j.neuron.2013.06.033
Schultz, W., and Dickinson, A. (2000). Neuronal coding of prediction errors. Ann. Rev. Neurosci. 23, 473–500. doi: 10.1146/annurev.neuro.23.1.473
Sheffield, F. D. (1965). “Relation between classical and instrumental conditioning,” in Classical Conditioning, ed. W. F. Prokasy (New York, NY: Appleton-Century-Crofts), 302–322.
Skinner, B. F. (1932). On the rate of formation of a conditioned reflex. J. Gen. Psychol. 7, 274–285. doi: 10.1080/00221309.1932.9918467
Smith, D. M., Freeman, J. H., Nicholson, D., and Gabriel, M. (2002). Limbic thalamic lesions, appetitively motivated discrimination learning, and training-induced neuronal activity in rabbits. J. Neurosci. 22, 8212–8221.
Sparenborg, S., and Gabriel, M. (1992). Local norepinephrine depletion and learning-related neuronal-activity in cingulate cortex and anterior thalamus of rabbits. Exp. Brain Res. 92, 267–285. doi: 10.1007/bf00227970
Steinberg, E. E., Keiflin, R., Boivin, J. R., Witten, I. B., Deisseroth, K., and Janak, P. H. (2013). A causal link between prediction errors, dopamine neurons and learning. Nat. Neurosci. 16, 966–973. doi: 10.1038/nn.3413
Waelti, P., Dickinson, A., and Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature 412, 43–48. doi: 10.1038/35083500
Warburton, E. C., and Aggleton, J. P. (1999). Differential deficits in the Morris water maze following cytotoxic lesions of the anterior thalamus and fornix transection. Behav. Brain Res. 98, 27–38. doi: 10.1016/s0166-4328(98)00047-3
Williams, D. R., and Williams, H. (1969). Auto-maintenance in the pigeon: sustained pecking despite contingent non-reinforcement. J. Exp. Anal. Behav. 12, 511–520. doi: 10.1901/jeab.1969.12-511
Wilson, P. N., Boakes, R. A., and Swan, J. (1987). Instrumental learning as a result of omission training on wheel running. Q. J. Exp. Psychol. B 39, 161–171. doi: 10.1080/14640748708402260
Yin, H. H., and Knowlton, B. J. (2002). Reinforcer devaluation abolishes conditioned place preference: evidence for stimulus-stimulus relations. Behav. Neurosci. 116, 174–177. doi: 10.1037/0735-7044.116.1.174
Yin, H. H., Knowlton, B. J., and Balleine, B. W. (2004). Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur. J. Neurosci. 19, 181–189. doi: 10.1111/j.1460-9568.2004.03095.x
Yin, H. H., Knowlton, B. J., and Balleine, B. W. (2005a). Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur. J. Neurosci. 22, 505–512. doi: 10.1111/j.1460-9568.2005.04219.x
Yin, H. H., Knowlton, B. J., and Balleine, B. W. (2006). Inactivation of dorsolateral striatum enhances sensitivity to changes in action-outcome contingency in instrumental conditioning. Behav. Brain Res. 166, 189–196. doi: 10.1016/j.bbr.2005.07.012
Yin, H. H., Ostlund, S. B., and Balleine, B. W. (2008). Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks. Eur. J. Neurosci. 28, 1437–1448. doi: 10.1111/j.1460-9568.2008.06422.x
Yin, H. H., Ostlund, S. B., Knowlton, B. J., and Balleine, B. W. (2005b). The role of the dorsomedial striatum in instrumental conditioning. Eur. J. Neurosci. 22, 513–523. doi: 10.1111/j.1460-9568.2005.04218.x
Keywords: anterior thalamic nuclei, mediodorsal thalamic nucleus, parafascicular thalamic nuclei, corticothalamic disconnection, prelimbic cortex, instrumental conditioning
Citation: Bradfield LA, Hart G and Balleine BW (2013) The role of the anterior, mediodorsal, and parafascicular thalamus in instrumental conditioning. Front. Syst. Neurosci. 7:51. doi: 10.3389/fnsys.2013.00051
Received: 29 April 2013; Accepted: 27 August 2013; Published: 09 October 2013.
Volume: 7 - 2013
Edited by: Yuri B. Saalmann, Princeton University, USA
Reviewed by: Björn Brembs, University of Regensburg, Germany; James W. Grau, Texas A&M University, USA
Copyright
© 2013 Bradfield, Hart and Balleine.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bernard W. Balleine, Behavioral Neuroscience Laboratory, Brain and Mind Research Institute, University of Sydney, Level 6, 94 Mallet Street, Camperdown, Sydney, NSW 2050, Australia e-mail: bernard.balleine@sydney.edu.au
This article was submitted to the journal Frontiers in Systems Neuroscience.