A Critical Review of Habit Learning and the Basal Ganglia

Seger, Carol A; Spiering, Brian J

doi:10.3389/fnsys.2011.00066

HYPOTHESIS AND THEORY article

Front. Syst. Neurosci., 30 August 2011

volume 5 - 2011 | https://doi.org/10.3389/fnsys.2011.00066

Carol A. Seger*
Brian J. Spiering

Department of Psychology, Colorado State University, Fort Collins, CO, USA

Abstract

The current paper briefly outlines the historical development of the concept of habit learning and discusses its relationship to the basal ganglia. Habit learning has been studied in many different fields of neuroscience using different species, tasks, and methodologies, and as a result it has taken on a wide range of definitions from these various perspectives. We identify five common, but not universal, definitional features of habit learning: that it is inflexible, slow or incremental, unconscious, automatic, and insensitive to reinforcer devaluation. For each of these features, we critically evaluate how it has been defined, its utility for research in both humans and non-human animals, and the evidence that it serves as an accurate description of basal ganglia function. In conclusion, we propose a multi-faceted approach to habit learning and its relationship to the basal ganglia, emphasizing the need for formal definitions that will provide directions for future research.

Introduction

The concept of habit learning has developed through the fruitful interaction of researchers in several intellectual domains, including animal learning, cognitive psychology, cognitive neuropsychology, and behavioral neuroscience. As a result, habit learning has taken on a variety of proposed definitions. In this paper, we will first describe the historical evolution of habit learning as a concept. We will then briefly describe the anatomical and functional roles of the basal ganglia that may underlie learning in general and habit learning in particular. Finally, we will revisit the defining features of habit learning and assess how well they characterize learning in the basal ganglia.

Historical Evolution of the Habit Learning Concept

The term habit was used, but not explicitly defined, by William James in the seminal Principles of Psychology (James, 1890). It was used on occasion by early researchers studying animal learning, in particular Hull (1934a,b) and Lashley (1930, 1950). “Habit” roughly corresponded to the resulting motor behavior (e.g., Lashley referred to the “maze running habit”), and habit learning to acquisition of these behaviors in an instrumental learning context.

Hippocampal research: Early definitions of habit learning

The earliest use of “habit learning” to refer to a specific form of learning came from researchers studying the effects of hippocampal damage in humans and non-human animals. By the late 1960s it was clear that hippocampal damage affected learning on many, but not all, tasks. Hirsh (1974) first used the term “habit learning” to describe a particular type of memory or learning system. He defined the habit system as that “responsible for the learning of which hippocampally ablated animals are capable” (1974, p. 421). Thus, from the beginning habit learning was defined negatively, in terms of what it was not (i.e., hippocampally based), rather than what it was. To Hirsh, the primary feature of hippocampal-based learning was contextual encoding (e.g., of the particular spatial and temporal context at encoding) and retrieval of information that was contextually sensitive. He argued in contrast that habit learning was similar to the stimulus–response (S–R) learning processes proposed by earlier learning researchers, and that these S–R associations were specifically insensitive to context.

Mishkin et al. (1984) extended Hirsh's concept of habit learning. Following Hirsh, they identified the features of habit learning as the opposite of those of hippocampally based learning. One set of features was “rapid” versus “slow” learning. Rapid learning was defined as one-trial learning, which required the hippocampus, whereas slow learning required repeated trials and was preserved in amnesia. They immediately related the “rapid”–“slow” distinction to the distinction posed by Hirsh, which they referred to as flexible (contextually sensitive in Hirsh) versus inflexible learning. They proposed that there is “a trade-off between short-term flexibility afforded by the memory system and long-term reliability afforded by the habit system” (p. 73). Finally, they argued that habits were a relatively primitive form of learning that should therefore appear earlier in ontogeny as well as phylogeny, which they supported with developmental evidence from their lab.

Mishkin and colleagues were also the first to propose a crucial role for the basal ganglia in habit learning. The basis for their argument, which they termed “admittedly speculative,” was the early development of the basal ganglia both in phylogeny and ontogeny, and the presence of widespread anatomical projections to the striatum from cortex that “provide a mechanism through which cortically processed sensory inputs could become associated with motor outputs generated in the pallidum and so yield the stimulus–response bonds that constitute habits” (p. 74).

Cognitive psychology: Habit as implicit and automatic

The field of cognitive psychology did not use the term habit learning, but from the late 1960s through the 1980s several concepts were developed in this field that later were incorporated into theories of habit learning. These distinctions included unconscious, or implicit, learning and memory (in contrast with conscious or explicit learning and memory), and automatic processing (in contrast with controlled processing). Both of these distinctions fall broadly within “dual process” theories of cognition that see one type of cognitive process as relatively unconscious, automatic, evolutionarily early, and similar across individuals, in contrast with a second type of cognitive process that is conscious, controlled, evolutionarily more recent, and subject to significant individual differences (see Evans, 2008 for a review).

Reber (1967) coined the term “implicit learning”; the concept was extended to “implicit memory” by Graf and Schacter (1985). The focus in both areas of research was on consciousness: identifying what could or could not be learned and/or retrieved without awareness. Implicit memory was defined as “when previous experiences facilitate performance on a task that does not require conscious or intentional recollection of those experiences” (Schacter, 1987, p. 501). Implicit memory tasks typically used priming paradigms in which improvement in accuracy and/or processing time was observed for repeated stimuli; priming was later divided into perceptual (repeated visual stimulus processing) and conceptual (repeated semantic processing) forms (Keane et al., 1991).

Seger (1994, p. 164) outlined three guidelines for implicit learning: (1) “the knowledge gained in implicit learning is not fully accessible to consciousness, in that subjects cannot provide a full … verbal account of what they have learned,” (2) “information [learned] … is more complex than a single simple association or frequency count,” and (3) “implicit learning does not involve processes of conscious hypothesis testing but is an incidental consequence of the type and amount of cognitive processing performed on the stimuli.” Implicit learning was studied using several different tasks, most often the serial reaction time task (which measures improvement in reaction time when responding to stimuli when presented in a repeating sequence in comparison with stimuli presented in random order), and the artificial grammar task (which measures the ability of subjects to discriminate letter strings that follow a complex sequential pattern determined by a finite state automaton, or artificial grammar, from those that violate the pattern).

Another influential concept from cognitive psychology was that of automaticity, originally developed by Shiffrin and Schneider (1977) to account for different forms of attentional scanning. The concepts of automatic and controlled processing were widely adopted across various domains within cognitive psychology. Shiffrin and Schneider (1977) gave multiple criteria for considering a process to be automatic, including (1) automatic processes are not constrained by short-term memory capacity limitations and do not require attention; (2) automatic processes are generally performed too quickly to be consciously accessible and once initiated are completed regardless of subjects’ intentions; (3) automatic processes require significant training, undergoing a gradual shift from controlled to automatic through the course of practice; and (4) automatic processes, once acquired, are difficult to modify. Criterion 1 led to an operational definition of automaticity as primary task performance not negatively affected by a parallel, short-term memory demanding task.

Through development of the process dissociation procedure, Jacoby (1991) related automaticity to implicit learning and memory. He argued that participants should be able to exert strategic control over conscious knowledge, and theorized that they should be able to control the behavioral expression of this knowledge in accordance with task instructions. Conversely, he argued that participants should be unable to exert strategic control over unconscious knowledge and theorized that they might have difficulty controlling the behavioral expression of this knowledge. The critical feature of the process dissociation procedure is that participants are asked to demonstrate knowledge via both “inclusion” and “exclusion” instructions. Inclusion instructions demand that participants produce behavior in accordance with a learned structure, while exclusion instructions demand that participants produce discordant behavior. This approach led to a different operational definition of automaticity. Automatic processing is measured by calculating the intrusion of the previously learned material into the exclusion condition (false positives); controlled processing is defined as the difference between performance in the inclusion condition and the automatic processing measure.
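The two measures described above can be illustrated with a minimal sketch. The function name, argument names, and example rates are hypothetical and chosen only for illustration. The first two quantities follow the verbal definition given in the text. The third is the independence-model estimate commonly associated with the procedure (inclusion = C + A(1 − C), exclusion = A(1 − C)), which is not spelled out in the text and is included here as an assumption.

```python
def process_dissociation(inclusion_rate: float, exclusion_rate: float) -> dict:
    """Estimate automatic and controlled contributions to performance.

    inclusion_rate: proportion of trials on which learned material is
        produced under inclusion instructions.
    exclusion_rate: proportion of trials on which learned material intrudes
        under exclusion instructions (false positives).
    """
    for rate in (inclusion_rate, exclusion_rate):
        if not 0.0 <= rate <= 1.0:
            raise ValueError("rates must be proportions between 0 and 1")

    # Measures as described verbally in the text.
    automatic = exclusion_rate
    controlled = inclusion_rate - exclusion_rate

    # Assumption: independence-model estimate of the automatic component,
    # A = E / (1 - C). Undefined when the controlled estimate equals 1.
    if controlled < 1.0:
        automatic_independence = exclusion_rate / (1.0 - controlled)
    else:
        automatic_independence = None

    return {
        "automatic": automatic,
        "controlled": controlled,
        "automatic_independence": automatic_independence,
    }


# Hypothetical data: 70% production under inclusion, 30% intrusion under exclusion.
estimates = process_dissociation(inclusion_rate=0.7, exclusion_rate=0.3)
print(estimates)
```

Under this sketch a participant whose exclusion performance shows no intrusions would receive an automatic estimate of zero, regardless of inclusion performance.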

Cognitive neuropsychology: Habit as a type of “non-declarative” memory

Larry Squire and colleagues integrated the approaches taken by researchers examining hippocampal lesions in non-human animals, researchers in cognitive psychology, and researchers studying human patients with amnesia. Their theory developed across time. Cohen and Squire (1980) initially defined procedural learning as “operations governed by rules or procedures” in contrast to hippocampally based learning, which they characterized as “operations that depend on specific, declarative, data-based material.” The term “procedural” was adapted from artificial intelligence research (Winograd, 1972; Anderson, 1982). Anderson's (1982) view was that all cognitive knowledge started by being represented declaratively, as individual “propositions,” and procedural knowledge was formed by the compilation of groups of propositions into procedures. Procedural knowledge accounted well for the tasks known to be preserved in amnesia at that time, including pursuit rotor (Corkin, 1968), mirror drawing (Milner, 1962), and mirror reading (Cohen and Squire, 1980).

During the 1980s and early 1990s amnesic subjects were shown to have intact learning across a large number of novel tasks, primarily drawn from the implicit memory and learning literatures. These included perceptual priming (Graf and Schacter, 1985), the serial reaction time task (Nissen and Bullemer, 1987), artificial grammar learning (Knowlton et al., 1992), category learning using the Posner dot pattern task (Knowlton and Squire, 1993), and some aspects of learning on the Tower of Hanoi task (Cohen, 1984). It soon became clear that the term “procedural” was insufficient to characterize all the different types of non-hippocampal learning and memory. Squire and Zola-Morgan (1988) created the term “non-declarative” and defined it as “a heterogeneous collection of abilities: motor skills, perceptual skills, and cognitive skills (these abilities and perhaps others are examples of procedural memory); as well as simple classical conditioning, adaptation level effects, priming, and other instances where experience alters performance independently of providing a basis for the conscious recollection of past events” (p. 171). Non-declarative memory thus incorporated the cognitive psychology distinction between implicit and explicit memory with the result that hippocampal-based declarative learning was now identified as memory that was accessible to consciousness, and the heterogeneous non-declarative memory systems as unconscious.

Squire and Zola-Morgan (1988, 1991) developed what was to become an often reprinted figure illustrating the types and subtypes of declarative and non-declarative memory (the 1991 version is shown in Figure 1). In Squire and Zola-Morgan (1988) the term “habit” is not used; instead, several different “skills” are described, including motor skills (pursuit rotor, Corkin, 1968; serial reaction time, Nissen and Bullemer, 1987; mirror drawing, Milner, 1962), perceptual skills (mirror reading; Cohen and Squire, 1980), and cognitive skills (Tower of Hanoi, Cohen, 1984; Hebb digits task, Brooks and Baddeley, 1976). By 1991, Squire and colleagues referred to this type of non-declarative memory as “skills and habits,” as shown in Figure 1. They noted the basal ganglia as one potential neural system involved in habits and skills, along with the cerebellum.

Figure 1

Animal learning: Habit learning as one form of instrumental conditioning

In the 1980s, Dickinson (1985) proposed separate “goal-directed behavior” and “habit” instrumental learning systems, distinguished by whether or not execution of the learned behavior is sensitive to the value of the reward. One typical manipulation is to devalue the reinforcer by satiating the animal before testing; the value of a food reward is greater when the animal is hungry than when it has recently fed. An animal will perform a habitual act to obtain food even when it has eaten to satiation. Dickinson contrasted habit with goal-directed behavior, which is sensitive to the motivational state of the animal. Subsequent neuroscience studies (Yin and Knowlton, 2006; Packard, 2009) found that the distinction between goal-directed and habitual learning corresponded with reliance on different parts of the basal ganglia: the rodent dorsomedial striatum (homologous to the primate anterior caudate nucleus) and the dorsolateral striatum (homologous to the primate posterior putamen), respectively.
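The logic of the devaluation test can be made concrete with a toy sketch. This is an illustration under simplifying assumptions, not Dickinson's formal account: the habitual response is assumed to depend only on a response strength stored during training, whereas the goal-directed response is assumed to scale with the current value of the outcome. All function names and numbers are hypothetical.

```python
def habitual_strength(trained_strength: float) -> float:
    """Habit: responding reflects strength acquired in training;
    the current value of the outcome is not consulted."""
    return trained_strength


def goal_directed_strength(p_outcome_given_action: float,
                           current_outcome_value: float) -> float:
    """Goal-directed: responding scales with the expected current value
    of the outcome that the action produces."""
    return p_outcome_given_action * current_outcome_value


# Arbitrary illustrative numbers.
trained_strength = 0.9          # response strength after extended training
p_outcome_given_action = 1.0    # lever press reliably produces food
value_when_hungry = 1.0
value_when_sated = 0.0          # reinforcer devalued by pre-feeding

for state, value in (("hungry", value_when_hungry), ("sated", value_when_sated)):
    habit = habitual_strength(trained_strength)
    goal = goal_directed_strength(p_outcome_given_action, value)
    print(f"{state}: habit={habit:.2f}, goal-directed={goal:.2f}")
```

In this sketch the habitual measure is unchanged by satiation while the goal-directed measure falls to zero, which is the behavioral signature the devaluation test is designed to detect.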

Graybiel (2008) recently offered a broad definition of habit learning. “First, habits (mannerisms, customs, rituals) are largely learned; in current terminology, they are acquired via experience-dependent plasticity. Second, habitual behaviors occur repeatedly over the course of days or years, and they can become remarkably fixed. Third, fully acquired habits are performed almost automatically, virtually non-consciously, allowing attention to be focused elsewhere. Fourth, habits tend to involve an ordered, structured action sequence that is prone to being elicited by a particular context or stimulus. And finally, habits can comprise cognitive expressions of routine (habits of thought) as well as motor expressions of routine” (Graybiel, 2008, p. 361). Like Squire's approach to non-declarative memory, Graybiel's definition brings together several features from previous work, including that habits are relatively automatic, and unconscious, and that habits can be inflexible and rigid (particularly well learned habits). Graybiel emphasizes two additional features of habit: first, that motor habits are sequential behaviors with complex structure, going beyond a simple concept of a “response,” and second, that habits can extend beyond motor behaviors to include cognitive processes.

The Basal Ganglia and Learning

The basal ganglia are a group of subcortical nuclei, including the striatum, globus pallidus, substantia nigra, and subthalamic nucleus in humans. The basal ganglia interact with cerebral cortex via corticostriatal loops, in which information projects from cortex to the striatum, to the basal ganglia output nuclei, to the thalamus, and from there back to cortex (Alexander et al., 1986; Seger, 2008). The functions of the basal ganglia are supported by three pathways from the striatum to the thalamus, termed the “direct,” “indirect,” and “hyperdirect” pathways (Frank, 2005; Cohen and Frank, 2009). Broadly, the three pathways together implement a balance between regulating tonic inhibition in cortex and selectively activating or gating particular representations. The representations that the basal ganglia act upon are determined by the region of cortex within each corticostriatal loop. Although projections are continuous and there are no firm dividing lines between loops, it is useful for practical purposes to identify functionally different loops. Our approach includes four distinct loops (Seger and Cincotta, 2005; Seger, 2008) and is illustrated in Figure 2. They are the motor loop, which connects motor and premotor cortices with the putamen; the executive loop, which connects lateral and medial prefrontal regions with the anterior caudate; the visual loop, which connects inferior temporal regions with the posterior caudate; and the motivational loop, which connects ventromedial prefrontal regions with the ventral striatum (including the nucleus accumbens and ventral caudate and putamen). Given the broad patterns of cortical projections to the basal ganglia, it is not surprising that the basal ganglia are associated with a large variety of functions, including motor control (Redgrave et al., 2010), cognitive coordination (Stocco et al., 2010), and emotional functions (Nakano et al., 2000).

Figure 2

The basal ganglia are involved in learning through a variety of inherent plasticity mechanisms. The best studied is N-methyl-D-aspartate (NMDA) modulated long-term potentiation (LTP) at the corticostriatal synapse. Corticostriatal synapses also receive dopaminergic input, and LTP is highly sensitive to the presence of dopamine (Pawlak and Kerr, 2008). Dopamine projections come from the midbrain, including the ventral tegmental area and portions of the substantia nigra. Some dopamine neuron activity is sensitive to reward expectation and is computationally well-described by reward prediction error (Schultz, 2002; Bromberg-Martin et al., 2010). This dopamine signal is well-suited to serve as a learning signal indicating the presence of unexpected rewards, so that the organism becomes more likely to repeat the behavior that led to the reward.

The basal ganglia are particularly important in learning the relationship between sensory information and motor responses on the basis of trial by trial feedback (Seger, 2008; Shohamy et al., 2008). Computational models of dopamine-mediated plasticity within the direct pathway (Ashby and Ennis, 2006), and across pathways (Frank, 2005; Cohen and Frank, 2009) do an excellent job of accounting for learning in this type of task. Convergent evidence from a variety of species and techniques supports the view that the basal ganglia are critical for learning in these tasks (Yin and Knowlton, 2006; Graybiel, 2008; Balleine et al., 2009; Packard, 2009; Seger, 2009). Most habit learning tasks follow the same stimulus–response–reward/feedback task structure (Seger, 2009), and thus it is reasonable to propose that the basal ganglia should be important in habit learning.

Reassessment of Habit Learning's Defining Characteristics

As the concept of habit learning developed, a number of different defining features were proposed. The following features were most commonly cited: inflexible, slow, unconscious, automatic, and insensitive to reinforcer devaluation. Here we revisit each of these defining criteria, asking the following questions about each: Why was it proposed? How precisely is it defined, and are there different definitions in use in different research areas? How accurately does this feature describe basal ganglia related learning? And, if relevant, how practical is the criterion for use with both human and non-human animals?

Inflexible

The characterization of habit learning as “inflexible” comes from Hirsh (1974), in contrast with flexible, context-dependent learning that was subserved by the hippocampus. Mishkin et al. (1984) also included inflexible in their definition of habit learning, as did Squire and colleagues in their development of the concept of non-declarative memory. Habit learning was independently characterized as inflexible by Dickinson (1985), who defined inflexibility in contrast to goal-directed behaviors, later shown to rely on prefrontal and dorsomedial striatal systems (Yin and Knowlton, 2006).

Neither “flexible” nor “inflexible” has been formally defined. The working definitions of these terms differ depending on whether habit learning is contrasted with the hippocampal or the prefrontal system. Within the hippocampal system, flexibility is often thought to be a consequence of individual memories formed by the hippocampus that can be applied to new situations. A task commonly thought to require hippocampally mediated flexibility is transitive inference, in which subjects are taught a set of ordinal relations, e.g., A > B, B > C, and then tested on whether they can infer that A > C (Eichenbaum and Fortin, 2009). Some researchers have argued that the basal ganglia are limited to learning the individual ordinal relations and cannot support transitive inference or related phenomena (Myers et al., 2003; Shohamy et al., 2006). However, other research has found an opposite pattern of results, in which transitive inference relies on corticostriatal dopaminergic systems and is actually enhanced when the hippocampus is inhibited (Frank et al., 2003). Similar findings of hippocampal independence on other tasks thought to reflect flexibility (e.g., novelty transfer, Driscoll et al., 2004) indicate that this concept needs to be reassessed. Research is currently underway in a number of labs to better characterize what specific computational roles are played in inference tasks (Moustafa et al., 2010; Shohamy and Adcock, 2010).
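The structure of the transitive inference task can be sketched as follows. The code only enumerates which test pairs follow logically from the trained pairs; it makes no claim about how the hippocampus or basal ganglia solve the task. The function name and data representation are hypothetical.

```python
from itertools import product


def inferable_pairs(premises):
    """Return ordered pairs (x, y), meaning x > y, that follow by
    transitivity from the trained premises but were never trained directly."""
    trained = set(premises)
    closure = set(premises)
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow inside the loop.
        for (a, b), (c, d) in product(list(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return sorted(closure - trained)


# Training pairs from the example in the text: A > B and B > C.
premises = [("A", "B"), ("B", "C")]
print(inferable_pairs(premises))
```

A learner restricted to the trained stimulus–response pairs has no stored answer for the pairs this function returns, which is why performance on them is used as a test of flexibility.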

Habit learning “inflexibility” is also defined in contrast with the sorts of flexibility enabled by executive functions subserved by the prefrontal cortex. In fact, executive functions were originally defined in clinical neuroscience as the ability to deal with novel or non-routine situations (Shallice, 1982). Prefrontal cortex enables flexible behavior through a variety of mechanisms involved in planning (setting goals, hypothesis formation, and testing), working memory (holding information online for several seconds), and cognitive control (the ability to execute plans in the face of distractions or other forms of interference; O'Reilly et al., 2010). Some have argued that the basal ganglia implement an inflexible learning process limited to past experience which then interacts with the flexible representations in prefrontal cortex.

Daw et al. (2005) argue that the basal ganglia select behaviors on the basis of the previous history of reinforcement, whereas the prefrontal cortex enables “model-based” control based on theories or strategies. Activity in the basal ganglia can be predicted by measures taken from reinforcement-learning modeling, specifically reward prediction (the estimate of the expected reward associated with choosing a particular behavior in the current state) and reward prediction error (the difference between the predicted and actually received reward). In this sense, the basal ganglia are inflexible because they are constrained to act in accordance with past reinforcement history. However, some studies have found patterns of basal ganglia activity that cannot be completely accounted for by reinforcement-learning models (Lopez-Paniagua and Seger, 2011).

One limitation of reinforcement-learning models is that they model the environment as a finite set of repeating states. In reality organisms face situations that vary continuously, and need to be able to generalize to similar but not identical situations. The basal ganglia are active in categorization tasks that require generalization to related but novel stimuli, indicating at least some flexibility (for review of some possible mechanisms, see Seger, 2008). It is unclear what the limits are to generalization in habit learning, and what role the basal ganglia may play in generalization.

Slow or incremental

Habit learning was first characterized as slow or incremental by Mishkin et al. (1984). As with “inflexible,” this criterion was defined on the basis of learning in hippocampally ablated animals, in which learning required multiple trials. In contrast, animals with an intact hippocampus can show one-trial learning. The terms “slow” and “incremental” are often interpreted as requiring hundreds or thousands of trials, but this is not well established. Standard approaches from cognitive psychology involve examining learning curves for accuracy and reaction time; habit learning can then potentially be considered complete when asymptote is achieved (see Figure 3, bottom section). Attempts to formalize learning rates come from reinforcement learning and state space modeling approaches. Reinforcement-learning approaches result in two common measures: reward prediction error (RPE), which is the measure of how unexpected the received reward is, and value, which is the expected reward associated with the current stimulus and associated action. When learning is fastest, RPE is highest and value changes rapidly. As a task is learned, RPE reduces to zero and value asymptotes toward its maximum (Figure 3, middle section).
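The relationship between reward prediction error and value across learning can be illustrated with a minimal simulation. It assumes a simple delta-rule update with a fixed learning rate and a constant reward on every trial; these choices, the parameter values, and the function name are illustrative assumptions rather than the specific models used in the studies cited here.

```python
def simulate_learning(n_trials: int = 50,
                      learning_rate: float = 0.2,
                      reward: float = 1.0):
    """Track value and reward prediction error for one stimulus-action pair."""
    value = 0.0                 # expected reward before any learning
    values, prediction_errors = [], []
    for _ in range(n_trials):
        rpe = reward - value    # how unexpected the received reward is
        values.append(value)
        prediction_errors.append(rpe)
        value += learning_rate * rpe   # assumed delta-rule update
    return values, prediction_errors


values, prediction_errors = simulate_learning()
for trial in (0, 5, 20, 49):
    print(f"trial {trial:2d}: value={values[trial]:.3f}, "
          f"RPE={prediction_errors[trial]:.3f}")
```

In this sketch the learning rate determines how many trials are needed to approach asymptote, so “slow” and “fast” learning correspond to different parameter values of the same process rather than to qualitatively different mechanisms.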

Figure 3

Determining whether basal ganglia dependent learning is slow will depend on the operational definition of slow. However, it should be noted that basal ganglia dependent learning tasks vary greatly in how many trials it takes for subjects to reach maximal performance. Cromer et al. (2011) found that activity in the head of the caudate reached asymptote after five trials in a rule-learning paradigm. Delgado et al. (2005) found greatest caudate activity in an fMRI study during early learning (the first 8 repetitions of each stimulus) in comparison with later learning. Notably, these results are all from the caudate nucleus. Some researchers argue that the putamen should primarily subserve habitual learning. Studies that examine putamen activity during learning often find slower increases than in the caudate. However, activity levels in the putamen often follow behavior: activity reaches its maximum as behavioral accuracy reaches asymptote (Brasted and Wise, 2004; Williams and Eskandar, 2006), or as reinforcement-learning measures of learning, e.g., reward prediction, reach their maximum (Seger et al., 2010). Regional differences in learning speed are discussed further in the Conclusion.

If habit learning is acquired gradually, then when is performance fully habitual? Some researchers have argued that habits continue to develop even beyond the point at which behavioral measures cease to change, i.e., after accuracy and reaction time reach their asymptotes. Grol et al. (2006) found continued practice-related change in basal ganglia activity during these time points. Helie et al. (2010) and Waldschmidt and Ashby (2011) examined learning-related changes long past the point at which accuracy reached asymptote, and found that basal ganglia activity continued to change and ultimately decreased to baseline levels. A more formal computational approach to measuring learning rates would be particularly helpful in this regard, as would theories that can account for different levels of expertise and their neural correlates.

Unconscious

Unconsciousness, as a defining feature of habit learning, stems from the inclusion of habit learning as a subtype of non-declarative memory by Squire and Zola-Morgan (1988, 1991). In their theory, declarative memory was accessible to consciousness, whereas non-declarative memory was not.

Consciousness can be difficult to define both on a practical and theoretical level. It is difficult to assess the degree of conscious access to knowledge in non-human animals, and impossible to assess verbalizable knowledge. Even with humans, there is debate about which measures of awareness are best for assessing whether there is conscious access to knowledge or not (Seth et al., 2008). Assessing awareness during task performance can affect the subject's strategic approach to the task, whereas assessing awareness after a task can easily miss information that might have been accessible to awareness during performance. On a more theoretical level, it is not always clear whether the relationship between awareness and learning is a necessary one; one logical possibility is that awareness is epiphenomenal and does not play a causal role in learning.

Recent research has found that the basal ganglia are involved in a wide variety of learning tasks, both tasks in which learning is inaccessible to consciousness (Pessiglione et al., 2008) and tasks in which subjects are aware of what they have learned, such as rule-learning tasks, arbitrary visuomotor learning tasks, and simple unstructured categorization tasks (Seger et al., 2011). Basal ganglia recruitment is similar for relatively simple categorization tasks associated with high levels of verbalizable knowledge and more complex tasks associated with little verbalizable knowledge (Seger, 2008). Thus, the basal ganglia do not seem to be exclusively associated with either conscious or unconscious learning. Furthermore, recent research has shown consciousness to be a less reliable sign of hippocampal involvement in memory than previously assumed. The hippocampus has been shown to be required in several implicit learning tasks. These include contextual cuing, in which subjects become faster at searching repeated stimulus arrays (Greene et al., 2007), and some sequential relationships in the serial reaction time task (Schendan et al., 2003; Ergorul and Eichenbaum, 2006; Wilkinson et al., 2009).

Automatic

The concept of automaticity was developed in cognitive psychology by Shiffrin and Schneider (1977). It is itself a complex concept with four main characteristics. Three of these characteristics have already been discussed as potential defining features of habit learning: that automatic performance is unconscious, that the knowledge applied automatically is rigid or inflexible, and that automatic processes are acquired slowly and incrementally. The remaining characteristic is that automatic processes do not require the limited capacity cognitive mechanisms involved in short-term memory and selective attention. This leads to an operational definition that automatic tasks should be able to be performed in a dual task situation along with a demanding task that requires short-term memory and selective attention processes.

Although this definition is on the surface clear, in practice it is hard to know whether a particular dual task actually monopolizes appropriate limited capacity cognitive mechanisms. Recently, the concept of controlled processing has undergone extensive revision; there is no longer support for a simple modal model of memory, with a single limited capacity short-term memory store, though evidence suggests there are some general purpose or shared resources (Lavie, 2010). The modern view of short-term, or working, memory and executive function includes qualitatively different short-term stores for different materials (Linden, 2007), and instead of a single attentional mechanism it includes a wide variety of cognitive control mechanisms (Banich et al., 2009; Braver et al., 2009). Learning in basal ganglia dependent tasks is often less affected by dual tasks than learning in comparison tasks (Zeithamova and Maddox, 2006).

Dual task independence is also problematic when considering whether the basal ganglia are involved in habit learning, because the basal ganglia are also important for executive functions involved in task switching and selection. Thus, in a dual task situation any basal ganglia activity could be due to demands on the basal ganglia for coordinating the dual tasks, rather than for either of the tasks individually (Poldrack et al., 2005). Nevertheless, some researchers have used dual task methodologies successfully, such as Foerde et al. (2006), who found greater reliance on the basal ganglia for classification learning during dual task conditions than during single task conditions. Interestingly, they found greater reliance on the putamen during dual task learning, which raises the possibility that dual tasks may load some corticostriatal networks more than others.

An alternative operational definition was proposed by Jacoby (1991): that an automatic process will be performed regardless of a person's intentions, and thus will affect performance on a task even when the subject is attempting not to be affected (an exclusion task in Jacoby's terminology). This operational definition has not often been used in examining the basal ganglia in habit learning, though some researchers studying motor sequence learning have found that the striatum is recruited during automatic performance and is affected by prefrontal cortical mechanisms when subjects attempt to suppress the automatic performance (Destrebecqz et al., 2005).
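The logic of the exclusion task can be sketched numerically. The example below assumes the standard independence equations of the process dissociation framework, in which inclusion performance reflects controlled plus automatic influences and exclusion performance reflects automatic influences that escape control. The function name and the example proportions are hypothetical.

```python
def process_dissociation(p_inclusion, p_exclusion):
    """Estimate controlled (C) and automatic (A) contributions.

    Assumes the independence equations of the process dissociation
    framework:
        inclusion = C + A * (1 - C)
        exclusion = A * (1 - C)
    so that C = inclusion - exclusion and A = exclusion / (1 - C).
    """
    controlled = p_inclusion - p_exclusion
    if controlled >= 1.0:
        raise ValueError("automatic estimate is undefined when C = 1")
    automatic = p_exclusion / (1.0 - controlled)
    return controlled, automatic


# Hypothetical proportions of trials on which the learned response appears.
controlled, automatic = process_dissociation(p_inclusion=0.8, p_exclusion=0.3)
```

Under these assumptions, a nonzero exclusion rate (responses produced despite the intention to withhold them) is what yields a nonzero automatic estimate.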

Reinforcer revaluation insensitivity

The requirement that habit learning be insensitive to reinforcer revaluation comes from the field of animal learning. It has the advantage of being well defined, and it is clear how to apply this criterion experimentally, at least with non-human animal subjects. It is also clear how this criterion relates to learning in basal ganglia dependent tasks. This criterion dissociates the dorsomedial from dorsolateral striatum, with only the latter involved in habitual action.

The criterion does have some practical disadvantages. First, it requires two manipulations: the subject's value for the reinforcer must be changed (typically via feeding to satiation), and the behavior must then be tested under conditions of extinction. It is unclear how effectively this procedure can be used with human subjects, who are more likely to notice that they are no longer being rewarded and to change their behavior strategically (though some studies with humans have been published; Valentin et al., 2007). Second, it is unclear how the shift to reinforcer value independence corresponds to other meaningful transitions in the development of expertise, such as reaching asymptotic behavioral performance or the emergence of dual task independence (Ashby et al., 2010; see also Figure 3).

Conclusion

As surveyed above, no single defining feature completely captures all the commonly held beliefs about habit learning. Furthermore, these features are not always compatible with one another. For example, the criteria of reinforcer devaluation insensitivity and dual task independence each imply that learning should be considered habitual at a different point in training. We draw three main conclusions from our examination of habit learning and the basal ganglia. First, we provide a taxonomy of criteria for habit learning and divide them into two primary classes. Second, we examine patterns within the different corticostriatal loops and argue that the loops differ in the degree to which they meet criteria for habit learning, with the motor loop qualifying on more criteria than the executive loop. Third, we argue that the basal ganglia and corticostriatal systems interact with other neural systems, and therefore that habit learning should not be assumed to exclusively require the basal ganglia; we also describe some ways in which these systems may interact.

The criteria for habit learning discussed above fall into two types. The first type consists of criteria that can apply at any stage of learning, early or late. In particular, the criteria that learning is unconscious and inflexible were traditionally meant to characterize learning at all stages. The second type is based on the view that habit learning develops across time and emerges as learning progresses. Various behavioral hallmarks of learning have been proposed. Figure 3 illustrates these hallmarks and indicates the approximate points in training at which they may be achieved. Broadly, these hallmarks can be divided into three subtypes. The first comprises those based on simple analyses of behavior such as accuracy and reaction time, in which learning is defined as habitual when the measure reaches asymptote, or when the task is “overlearned” via continued training past the point of asymptote. The second comprises those that apply computational modeling techniques to extract latent parameters thought to characterize learning. The most commonly used approach is from reinforcement learning, in which there are two relevant parameters: reward prediction error and value (the reward prediction itself). Learning can be considered habitual when prediction error approaches zero and reward prediction approaches asymptote. Both simple behavioral and model-based approaches provide potential operational definitions for the criterion of “slow or incremental.” The third comprises qualitative criteria that are achieved at some point in learning. These include reinforcer devaluation insensitivity, automaticity defined as dual task insensitivity, and automaticity defined as inability to consciously control habitual knowledge. In addition, the field of motor learning suggests a further possible criterion: the emergence of motor effector specificity. Motor learning begins with relatively abstract representations that are accessible to multiple motor effectors, but becomes specific to the trained effector across training (Abrahamse et al., 2010).
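The reinforcement-learning criterion described above can be illustrated with a simple delta-rule learner. This is a minimal sketch, not the model used in any of the cited studies: the function names, the learning rate, the deterministic reward, and the tolerance used to call learning "habitual" are all assumptions chosen for illustration.

```python
def simulate_delta_rule(n_trials=200, learning_rate=0.1, reward=1.0):
    """Track value (reward prediction) and reward prediction error.

    On each trial:
        prediction_error = reward - value
        value            = value + learning_rate * prediction_error
    """
    value = 0.0
    values, errors = [], []
    for _ in range(n_trials):
        prediction_error = reward - value
        value += learning_rate * prediction_error
        errors.append(prediction_error)
        values.append(value)
    return values, errors


def first_habitual_trial(errors, tolerance=0.05):
    """Index of the first trial whose prediction error is near zero.

    tolerance is a hypothetical cutoff; returns None if never reached.
    """
    for trial, error in enumerate(errors):
        if abs(error) < tolerance:
            return trial
    return None


values, errors = simulate_delta_rule()
habit_onset = first_habitual_trial(errors)
```

In this sketch the prediction error shrinks toward zero while value climbs toward the reward magnitude, so the two model-based hallmarks coincide. The trial at which learning counts as habitual depends entirely on the chosen tolerance, which is one reason such criteria need to be stated formally.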

The multiplicity, and at times incommensurability, of the different criteria for habit learning reinforces our belief that the field would benefit by moving towards more precise definitions of the various habit learning features. Formal mathematical or computational models will clarify exactly what is meant by slow and fast, flexible and inflexible learning and will allow for clear testable predictions. Formal models also have the advantage that they provide insight into potential underlying neural mechanisms; for example, reinforcement-learning modeling is particularly useful because it can be related to the firing patterns of dopaminergic neurons and the effects of dopamine on synaptic plasticity in the basal ganglia (Cohen and Frank, 2009; Moustafa and Gluck, 2011).

Another important lesson is that the basal ganglia are not a single unitary structure limited to a single cognitive domain. As described above, the basal ganglia and cortex interact in corticostriatal loops that implement different cognitive functions depending on the cortical regions involved. In Table 1 we summarize evidence for whether each corticostriatal loop meets criteria for being habitual. Broadly, regions participating in the motor loop (putamen and motor cortex) meet most criteria for habit learning, whereas regions participating in the visual loop are not well studied, and regions participating in the executive loop have a mixed pattern of results, meeting criteria for habit learning on some dimensions but failing to meet them on many more. The results summarized in Table 1 broadly support the arguments of researchers studying rodents that the putamen (rodent dorsolateral striatum) is the neural substrate for habit learning and that the caudate (rodent dorsomedial striatum) is involved in non-habitual goal-directed learning.

Table 1

Criterion	Executive	Visual	Motor
QUALITATIVE DIFFERENCES
Inflexible defined as reinforcement based¹	Mixed	?	Yes
Unconscious²	Mixed	?	Mixed
Automatic: dual task independent³	No	?	Yes
Automatic: PDP exclusion intrusions⁴	No	?	Yes
Reinforcer devaluation⁵	No	?	Yes
SLOW OR INCREMENTAL: COMPUTATIONAL MEASURES⁶
Reward prediction error	Strong	Weak	Weak
Value	Weak	Strong	Strong
SLOW OR INCREMENTAL: SIMPLE BEHAVIORAL MEASURES
Learning rate (slope)⁷	Strong	Weak	Weak
Learning asymptote	Weak	Strong	Strong
Changes beyond asymptote⁸	Mixed	?	Yes

Habit learning criteria within dorsal corticostriatal loops.

¹Daw et al. (2005) and Lopez-Paniagua and Seger (2011).

²Seger et al. (2011) and Pessiglione et al. (2008).

³Poldrack et al. (2005), Foerde et al. (2006), and Waldschmidt and Ashby (2011).

⁴Destrebecqz et al. (2005).

⁵Balleine et al. (2009) and Yin and Knowlton (2006).

⁶Haruno and Kawato (2006) and Seger et al. (2010).

⁷Brasted and Wise (2004), Williams and Eskandar (2006), and Seger et al. (2010).

⁸Grol et al. (2006), Helie et al. (2010), and Waldschmidt and Ashby (2011).

Finally, it is important to avoid equating the behaviorally defined habit learning system with the neurally defined basal ganglia system. Given the complexity of habit learning, it likely recruits a number of neural systems in healthy, intact organisms. Neuroimaging studies of skill and habit learning tasks typically find learning related plasticity in several neural systems (Poldrack and Gabrieli, 2001; Poldrack et al., 2005). Probably the most studied system is the medial temporal lobe. However, other neural systems such as the cerebellum also affect learning and interact with the basal ganglia system (Doyon et al., 2009). Exactly how these systems interact during habit learning is an open area of research. One approach is to postulate that habit learning and other systems learn independently and in parallel; the system that ultimately controls behavior is determined by competitive interactions between the systems (Ashby et al., 1998; Poldrack and Packard, 2003; Packard, 2009). Another approach assumes that initial learning is accomplished by a non-habit learning system, but that knowledge is transferred to the habit system across training (Ashby et al., 2010). When the basal ganglia and hippocampal systems are examined, some experimental results indicate antagonism, some cooperation, and some complete independence (see Seger and Miller, 2010, for a review). Among researchers studying human learning, an emerging view is that the hippocampus is recruited the first time a stimulus is seen in order to set up a memory representation of that stimulus, and that the basal ganglia can then utilize this representation when learning relations between the stimulus and the response (Meeter et al., 2008; Shohamy et al., 2008; Seger et al., 2011). With respect to the basal ganglia and prefrontal systems, the traditional view that the basal ganglia subserve habit learning led initially to arguments that cortical activity should precede activity in the basal ganglia. However, more recent theories argue that the basal ganglia are active primarily during learning, and that well established habits are represented cortically (Pasupathy and Miller, 2005; Seger and Cincotta, 2006; Ashby et al., 2007, 2010).

Statements

Acknowledgments

This research was supported by a grant from the National Institutes of Health (R01MH079182) to Carol A. Seger. We thank Kurt Braunlich for his contributions during the development of this manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Abrahamse, E. L., Jiménez, L., Verwey, W. B., and Clegg, B. A. (2010). Representing serial action and perception. Psychon. Bull. Rev. 17, 603–623. doi: 10.3758/PBR.17.5.603
Alexander, G. E., DeLong, M. R., and Strick, P. L. (1986). Parallel organization of functionally segregated circuits linking basal ganglia and cortex. Annu. Rev. Neurosci. 9, 357–381. doi: 10.1146/annurev.ne.09.030186.002041
Anderson, J. R. (1982). Acquisition of cognitive skill. Psychol. Rev. 89, 369–406. doi: 10.1037/0033-295X.89.4.369
Ashby, F. G., Alfonso-Reese, L. A., Turken, A. U., and Waldron, E. M. (1998). A neuropsychological theory of multiple systems in category learning. Psychol. Rev. 105, 442–481. doi: 10.1037/0033-295X.105.3.442
Ashby, F. G., Ennis, D. M., and Spiering, B. J. (2007). A neurobiological theory of automaticity in perceptual categorization. Psychol. Rev. 114, 632–656. doi: 10.1037/0033-295X.114.3.632
Ashby, F. G., and Ennis, J. M. (2006). The role of the basal ganglia in category learning. Psychol. Learn. Mem. 1–36. doi: 10.1037/0278-7393.32.2.416
Ashby, F. G., Turner, B. O., and Horvitz, J. C. (2010). Cortical and basal ganglia contributions to habit learning and automaticity. Trends Cogn. Sci. (Regul. Ed.) 14, 191–232. doi: 10.1016/j.tics.2010.02.001
Balleine, B. W., Liljeholm, M., and Ostlund, S. B. (2009). The integrative function of the basal ganglia in instrumental conditioning. Behav. Brain Res. 199, 43–52. doi: 10.1016/j.bbr.2008.10.034
Banich, M. T., Mackiewicz, K. L., Depue, B. E., Whitmer, A. J., Miller, G. A., and Heller, W. (2009). Cognitive control mechanisms, emotion and memory: a neural perspective with implications for psychopathology. Neurosci. Biobehav. Rev. 33, 613–630. doi: 10.1016/j.neubiorev.2008.09.010
Brasted, P. J., and Wise, S. P. (2004). Comparison of learning-related neuronal activity in the dorsal premotor cortex and striatum. Eur. J. Neurosci. 19, 721–740. doi: 10.1111/j.0953-816X.2003.03181.x
Braver, T. S., Paxton, J. L., Locke, H. S., and Barch, D. M. (2009). Flexible neural mechanisms of cognitive control within human prefrontal cortex. Proc. Natl. Acad. Sci. U.S.A. 106, 7351–7356. doi: 10.1073/pnas.0808187106
Bromberg-Martin, E. S., Matsumoto, M., and Hikosaka, O. (2010). Dopamine in motivational control: rewarding, aversive, and alerting. Neuron 68, 815–834. doi: 10.1016/j.neuron.2010.11.022
Brooks, D. N., and Baddeley, A. D. (1976). What can amnesic patients learn? Neuropsychologia 14, 111–122. doi: 10.1016/0028-3932(76)90012-9
Cohen, M. X., and Frank, M. J. (2009). Neurocomputational models of basal ganglia function in learning, memory and choice. Behav. Brain Res. 199, 141–156. doi: 10.1016/j.bbr.2008.09.029
Cohen, N. J. (1984). “Preserved learning capacity in amnesia: evidence for multiple memory systems,” in The Neuropsychology of Memory, eds N. Butters and L. Squire (New York: Guilford), 83–103.
Cohen, N. J., and Squire, L. R. (1980). Preserved learning and retention of pattern-analyzing skill in amnesia: dissociation of knowing how and knowing that. Science 210, 207–210. doi: 10.1126/science.7414331
Corkin, S. (1968). Acquisition of motor skill after bilateral medial temporal-lobe excision. Neuropsychologia 6, 255–265. doi: 10.1016/0028-3932(68)90024-9
Cromer, J. A., Machon, M., and Miller, E. K. (2011). Rapid association learning in the primate prefrontal cortex in the absence of behavioral reversals. J. Cogn. Neurosci. 23, 1823–1828. doi: 10.1162/jocn.2010.21555
Daw, N. D., Niv, Y., and Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704–1711. doi: 10.1038/nn1560
Delgado, M. R., Miller, M. M., Inati, S., and Phelps, E. A. (2005). An fMRI study of reward-related probability learning. Neuroimage 24, 862–873. doi: 10.1016/j.neuroimage.2004.10.002
Destrebecqz, A., Peigneux, P., Laureys, S., Degueldre, C., Del Fiore, G., Aerts, J., Luxen, A., Van Der Linden, M., Cleeremans, A., and Maquet, P. (2005). The neural correlates of implicit and explicit sequence learning: interacting networks revealed by the process dissociation procedure. Learn. Mem. 12, 480–490. doi: 10.1101/lm.95605
Dickinson, A. (1985). Actions and habits: the development of behavioural autonomy. Philos. Trans. R. Soc. Lond. B Biol. Sci. 308, 67–78. doi: 10.1098/rstb.1985.0010
Doyon, J., Bellec, P., Amsel, R., Penhune, V., Monchi, O., Carrier, J., Lehéricy, S., and Benali, H. (2009). Contributions of the basal ganglia and functionally related brain structures to motor learning. Behav. Brain Res. 199, 61–75. doi: 10.1016/j.bbr.2008.11.012
Driscoll, I., Sutherland, R. J., Prusky, G. T., and Rudy, J. W. (2004). Damage to the hippocampal formation does not disrupt representational flexibility as measured by a novelty transfer test. Behav. Neurosci. 118, 1427–1432. doi: 10.1037/0735-7044.118.6.1196
Eichenbaum, H., and Fortin, N. J. (2009). The neurobiology of memory based predictions. Philos. Trans. R. Soc. Lond. B Biol. Sci. 364, 1183–1191. doi: 10.1098/rstb.2008.0306
Ergorul, C., and Eichenbaum, H. (2006). Essential role of the hippocampal formation in rapid learning of higher-order sequential associations. J. Neurosci. 26, 4111–4117. doi: 10.1523/JNEUROSCI.0441-06.2006
Evans, J. S. (2008). Dual-processing accounts of reasoning, judgment, and social cognition. Annu. Rev. Psychol. 59, 255–278. doi: 10.1146/annurev.psych.59.103006.093629
Foerde, K., Knowlton, B. J., and Poldrack, R. A. (2006). Modulation of competing memory systems by distraction. Proc. Natl. Acad. Sci. U.S.A. 103, 11778–11783. doi: 10.1073/pnas.0602659103
Frank, M. J. (2005). Dynamic dopamine modulation in the basal ganglia: a neurocomputational account of cognitive deficits in medicated and non-medicated Parkinsonism. J. Cogn. Neurosci. 17, 51–72. doi: 10.1162/0898929052880093
Frank, M. J., Rudy, J. W., and O'Reilly, R. C. (2003). Transitivity, flexibility, conjunctive representations, and the hippocampus. II. A computational analysis. Hippocampus 13, 341–354. doi: 10.1002/hipo.10084
Graf, P., and Schacter, D. L. (1985). Implicit and explicit memory for new associations in normal and amnesic subjects. J. Exp. Psychol. Learn. Mem. Cogn. 11, 501–518. doi: 10.1037/0278-7393.11.3.501
Graybiel, A. M. (2008). Habits, rituals, and the evaluative brain. Annu. Rev. Neurosci. 31, 359–387. doi: 10.1146/annurev.neuro.29.051605.112851
Greene, A. J., Gross, W. L., Elsinger, C. L., and Rao, S. M. (2007). Hippocampal differentiation without recognition: an fMRI analysis of the contextual cueing task. Learn. Mem. 14, 548–553. doi: 10.1101/lm.609807
Grol, M. J., de Lange, F. P., Verstraten, F. A., Passingham, R. E., and Toni, I. (2006). Cerebral changes during performance of overlearned arbitrary visuomotor associations. J. Neurosci. 26, 117–125. doi: 10.1523/JNEUROSCI.2786-05.2006
Haruno, M., and Kawato, M. (2006). Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning. J. Neurophysiol. 95, 948–959. doi: 10.1152/jn.00382.2005
Helie, S., Roeder, J. L., and Ashby, F. G. (2010). Evidence for cortical automaticity in rule-based categorization. J. Neurosci. 30, 14225–14234. doi: 10.1523/JNEUROSCI.2393-10.2010
Hirsh, J. (1974). The hippocampus and contextual retrieval of information from memory: a theory. Behav. Biol. 12, 421–444. doi: 10.1016/S0091-6773(74)92231-7
Hull, C. L. (1934a). The concept of the habit-family hierarchy and maze learning: part I. Psychol. Rev. 41, 33–54. doi: 10.1037/h0072855
Hull, C. L. (1934b). The concept of the habit-family hierarchy and maze learning: part II. Psychol. Rev. 41, 134–152. doi: 10.1037/h0070758
Jacoby, L. L. (1991). A process dissociation framework: separating automatic from intentional uses of memory. J. Mem. Lang. 30, 513–541. doi: 10.1016/0749-596X(91)90025-F
James, W. (1890). Principles of Psychology. New York: Henry Holt.
Keane, M. M., Gabrieli, J. D., Fennema, A. C., Growdon, J. H., and Corkin, S. (1991). Evidence for a dissociation between perceptual and conceptual priming in Alzheimer's disease. Behav. Neurosci. 105, 326–342. doi: 10.1037/0735-7044.105.2.326
Knowlton, B. J., Ramus, S., and Squire, L. R. (1992). Intact artificial grammar learning in amnesia: dissociation of classification learning and explicit memory for specific instances. Psychol. Sci. 3, 172–179. doi: 10.1111/j.1467-9280.1992.tb00021.x
Knowlton, B. J., and Squire, L. R. (1993). The learning of categories: parallel brain systems for item memory and category knowledge. Science 262, 1747–1749. doi: 10.1126/science.8259522
Lashley, K. S. (1930). Basic neural mechanisms in behavior. Psychol. Rev. 37, 1–24. doi: 10.1037/h0074134
Lashley, K. S. (1950). “In search of the engram,” in Society of Experimental Biology Symposium, Vol. 4 (Cambridge: Cambridge University Press), 454–480.
Lavie, N. (2010). Attention, distraction, and cognitive control under load. Curr. Dir. Psychol. Sci. 19, 143–148. doi: 10.1177/0963721410370295
Linden, D. E. (2007). The working memory networks of the human brain. Neuroscientist 13, 257–267. doi: 10.1177/1073858406298480
Lopez-Paniagua, D., and Seger, C. A. (2011). Interactions within and between corticostriatal loops during component processes of category learning. J. Cogn. Neurosci. 23, 3068–3083. doi: 10.1162/jocn_a_00008
Meeter, M., Radics, G., Myers, C. E., Gluck, M. A., and Hopkins, R. O. (2008). Probabilistic categorization: how do normal participants and amnesic patients do it? Neurosci. Biobehav. Rev. 32, 237–248. doi: 10.1016/j.neubiorev.2007.11.001
Milner, B. (1962). “Les troubles de la memoire accompagnant des lesions hippocampiques bilaterales,” in Physiologie de l'Hippocampe (Paris: Centre National de la Recherche Scientifique), 257–272.
Mishkin, M., Malamut, B., and Bachevalier, J. (1984). “Memories and habits: two neural systems,” in Neurobiology of Learning and Memory, eds G. Lynch, J. L. McGaugh, and N. M. Weinberger (New York: Guilford), 65–67.
Moustafa, A. A., and Gluck, M. A. (2011). A neurocomputational model of dopamine and prefrontal-striatal interactions during multicue category learning by Parkinson patients. J. Cogn. Neurosci. 23, 151–167. doi: 10.1162/jocn.2010.21420
Moustafa, A. A., Keri, S., Herzallah, M. M., Myers, C. E., and Gluck, M. A. (2010). A neural model of hippocampal-striatal interactions in associative learning and transfer generalization in various neurological and psychiatric patients. Brain Cogn. 74, 132–144. doi: 10.1016/j.bandc.2010.07.013
Myers, C. E., Shohamy, D., Gluck, M. A., Grossman, S., Kluger, A., Ferris, S., Golomb, J., Schnirman, G., and Schwartz, R. (2003). Dissociating hippocampal versus basal ganglia contributions to learning and transfer. J. Cogn. Neurosci. 15, 185–193. doi: 10.1162/089892903321208123
Nakano, K., Kayahara, T., Tsutsumi, T., and Ushiro, H. (2000). Neural circuits and functional organization of the striatum. J. Neurol. 247, V1–V15. doi: 10.1007/PL00007778
Nissen, M. J., and Bullemer, P. (1987). Attentional requirements of learning: evidence from performance measures. Cogn. Psychol. 19, 1–32. doi: 10.1016/0010-0285(87)90002-8
O'Reilly, R. C., Herd, S. A., and Pauli, W. M. (2010). Computational models of cognitive control. Curr. Opin. Neurobiol. 20, 257–261. doi: 10.1016/j.conb.2010.01.008
Packard, M. G. (2009). Exhumed from thought: basal ganglia and response learning in the plus-maze. Behav. Brain Res. 199, 24–31. doi: 10.1016/j.bbr.2008.12.013
Pasupathy, A., and Miller, E. K. (2005). Different time courses of learning-related activity in the prefrontal cortex and striatum. Nature 433, 873–876. doi: 10.1038/nature03287
Pawlak, V., and Kerr, J. N. (2008). Dopamine receptor activation is required for corticostriatal spike-timing-dependent plasticity. J. Neurosci. 28, 2435–2446. doi: 10.1523/JNEUROSCI.4402-07.2008
Pessiglione, M., Petrovic, P., Daunizeau, J., Palminteri, S., Dolan, R. J., and Frith, C. D. (2008). Subliminal instrumental conditioning demonstrated in the human brain. Neuron 59, 561–567. doi: 10.1016/j.neuron.2008.07.005
Poldrack, R. A., and Gabrieli, J. D. (2001). Characterizing the neural mechanisms of skill learning and repetition priming: evidence from mirror reading. Brain 124, 67–82. doi: 10.1093/brain/124.1.67
Poldrack, R. A., and Packard, M. G. (2003). Competition among multiple memory systems: converging evidence from animal and human brain studies. Neuropsychologia 41, 245–251. doi: 10.1016/S0028-3932(02)00157-4
Poldrack, R. A., Sabb, F. W., Foerde, K., Tom, S. M., Asarnow, R. F., Bookheimer, S. Y., and Knowlton, B. J. (2005). The neural correlates of motor skill automaticity. J. Neurosci. 25, 5356–5364. doi: 10.1523/JNEUROSCI.3880-04.2005
Reber, A. S. (1967). Implicit learning of artificial grammars. J. Mem. Lang. 6, 855–863.
Redgrave, P., Rodriguez, M., Smith, Y., Rodriguez-Oroz, M. C., Lehericy, S., Bergman, H., Agid, Y., DeLong, M. R., and Obeso, J. A. (2010). Goal-directed and habitual control in the basal ganglia: implications for Parkinson's disease. Nat. Rev. Neurosci. 11, 760–772. doi: 10.1038/nrn2915
Schacter, D. L. (1987). Implicit memory: history and current status. J. Exp. Psychol. Learn. Mem. Cogn. 13, 501–518. doi: 10.1037/0278-7393.13.3.501
Schendan, H. E., Searl, M. M., Melrose, R. J., and Stern, C. E. (2003). An fMRI study of the role of the medial temporal lobe in implicit and explicit sequence learning. Neuron 37, 1013–1025. doi: 10.1016/S0896-6273(03)00123-5
Schultz, W. (2002). Getting formal with dopamine and reward. Neuron 36, 241–263. doi: 10.1016/S0896-6273(02)00967-4
Seger, C. A. (1994). Implicit learning. Psychol. Bull. 115, 163–196. doi: 10.1037/0033-2909.115.2.163
Seger, C. A. (2008). How do the basal ganglia contribute to categorization? Their roles in generalization, response selection, and learning via feedback. Neurosci. Biobehav. Rev. 32, 265–278. doi: 10.1016/j.neubiorev.2007.07.010
Seger, C. A. (2009). “The involvement of corticostriatal loops in learning across tasks, species, and methodologies,” in The Basal Ganglia IX, eds H. J. Groenewegen, P. Voorn, H. W. Berendse, A. B. Mulder, and A. R. Cools (New York: Springer-Verlag), 25–39.
Seger, C. A., and Cincotta, C. M. (2005). The roles of the caudate nucleus in human classification learning. J. Neurosci. 25, 2941–2951. doi: 10.1523/JNEUROSCI.3401-04.2005
Seger, C. A., and Cincotta, C. M. (2006). Dynamics of frontal, striatal, and hippocampal systems during rule learning. Cereb. Cortex 16, 1546–1555. doi: 10.1093/cercor/bhj092
Seger, C. A., Dennison, C. S., Lopez-Paniagua, D., Peterson, E. J., and Roark, A. A. (2011). Dissociating hippocampal and basal ganglia contributions to category learning using stimulus novelty and subjective judgments. Neuroimage 55, 1739–1753. doi: 10.1016/j.neuroimage.2011.01.026
Seger, C. A., and Miller, E. K. (2010). Category learning in the brain. Annu. Rev. Neurosci. 33, 203–219. doi: 10.1146/annurev.neuro.051508.135546
Seger, C. A., Peterson, E. J., Cincotta, C. M., Lopez-Paniagua, D., and Anderson, C. W. (2010). Dissociating the contributions of independent corticostriatal systems to visual categorization learning through the use of reinforcement learning modeling and Granger causality modeling. Neuroimage 50, 644–656. doi: 10.1016/j.neuroimage.2009.11.083
Seth, A. K., Dienes, Z., Cleeremans, A., Overgaard, M., and Pessoa, L. (2008). Measuring consciousness: relating behavioural and neurophysiological approaches. Trends Cogn. Sci. (Regul. Ed.) 12, 314–321. doi: 10.1016/j.tics.2008.04.008
Shallice, T. (1982). Specific impairments of planning. Philos. Trans. R. Soc. Lond. B Biol. Sci. 298, 199–209. doi: 10.1098/rstb.1982.0082
Shiffrin, W., and Schneider, R. M. (1977). Controlled and automatic human information processing: 1. Detection, search, and attention. Psychol. Rev. 84, 1–66. doi: 10.1037/0033-295X.84.1.1
Shohamy, D., and Adcock, R. A. (2010). Dopamine and adaptive memory. Trends Cogn. Sci. (Regul. Ed.) 14, 464–472. doi: 10.1016/j.tics.2010.08.002
Shohamy, D., Myers, C. E., Geghman, K. D., Sage, J., and Gluck, M. A. (2006). L-dopa impairs learning, but spares generalization, in Parkinson's disease. Neuropsychologia 44, 774–784. doi: 10.1016/j.neuropsychologia.2005.07.013
Shohamy, D., Myers, C. E., Kalanithi, J., and Gluck, M. A. (2008). Basal ganglia and dopamine contributions to probabilistic category learning. Neurosci. Biobehav. Rev. 32, 219–236. doi: 10.1016/j.neubiorev.2007.07.008
Squire, L. R., and Zola-Morgan, S. (1988). Memory: brain systems and behavior. Trends Neurosci. 11, 170–175. doi: 10.1016/0166-2236(88)90144-0
Squire, L. R., and Zola-Morgan, S. (1991). The medial temporal lobe memory system. Science 253, 1380–1386. doi: 10.1126/science.1896849
Stocco, A., Lebiere, C., and Anderson, J. R. (2010). Conditional routing of information to the cortex: a model of the basal ganglia's role in cognitive coordination. Psychol. Rev. 117, 541–574. doi: 10.1037/a0019077
Valentin, V. V., Dickinson, A., and O'Doherty, J. P. (2007). Determining the neural substrates of goal-directed learning in the human brain. J. Neurosci. 27, 4019–4026. doi: 10.1523/JNEUROSCI.0564-07.2007
Waldschmidt, J. G., and Ashby, F. G. (2011). Cortical and striatal contributions to automaticity in information-integration categorization. Neuroimage 56, 1791–1802. doi: 10.1016/j.neuroimage.2011.02.011
Wilkinson, L., Khan, Z., and Jahanshahi, M. (2009). The role of the basal ganglia and its cortical connections in sequence learning: evidence from implicit and explicit sequence learning in Parkinson's disease. Neuropsychologia 47, 2564–2573. doi: 10.1016/j.neuropsychologia.2009.05.003
Williams, Z. M., and Eskandar, E. N. (2006). Selective enhancement of associative learning by microstimulation of the anterior caudate. Nat. Neurosci. 9, 562–568. doi: 10.1038/nn1774
Winograd, T. (1972). Understanding natural language. Cogn. Psychol. 3, 1–191. doi: 10.1016/0010-0285(72)90002-3
Yin, H. H., and Knowlton, B. J. (2006). The role of the basal ganglia in habit formation. Nat. Rev. Neurosci. 7, 464–476. doi: 10.1038/nrn1919
Zeithamova, D., and Maddox, W. T. (2006). Dual-task interference in perceptual category learning. Mem. Cognit. 34, 387. doi: 10.3758/BF03193416

Keywords: basal ganglia, habit learning, automaticity, reward

Citation: Seger CA and Spiering BJ (2011) A Critical Review of Habit Learning and the Basal Ganglia. Front. Syst. Neurosci. 5:66. doi: 10.3389/fnsys.2011.00066

Received: 01 February 2011; Accepted: 01 August 2011; Published: 30 August 2011.

Volume: 5 - 2011

Edited by: Elizabeth Abercrombie, Rutgers-Newark: The State University of New Jersey, USA

Reviewed by: Christopher I. Petkov, Newcastle University, UK; Heiko J. Luhmann, Institut für Physiologie und Pathophysiologie, Germany

This is an open-access article subject to a non-exclusive license between the authors and Frontiers Media SA, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and other Frontiers conditions are complied with.

*Correspondence: Carol A. Seger, Department of Psychology, 1876 Campus Delivery, Colorado State University, Fort Collins, CO 80523, USA. e-mail: carol.seger@colostate.edu

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
