Nucleus Accumbens is Involved in Human Action Monitoring: Evidence from Invasive Electrophysiological Recordings

The Nucleus accumbens (Nacc) has been proposed to act as a limbic-motor interface. Here, using invasive intraoperative recordings in an awake patient suffering from obsessive-compulsive disease (OCD), we demonstrate that its activity is modulated by the quality of performance of the subject in a choice reaction time task designed to tap action monitoring processes. Action monitoring, that is, error detection and correction, is thought to be supported by a system involving the dopaminergic midbrain, the basal ganglia, and the medial prefrontal cortex. In surface electrophysiological recordings, action monitoring is indexed by an error-related negativity (ERN) appearing time-locked to the erroneous responses and emanating from the medial frontal cortex. In preoperative scalp recordings the patient's ERN was found to be significantly increased compared to a large (n = 83) normal sample, suggesting enhanced action monitoring processes. Intraoperatively, error-related modulations were obtained from the Nacc but not from a site 5 mm above. Importantly, cross-correlation analysis showed that error-related activity in the Nacc preceded surface activity by 40 ms. We propose that the Nacc is involved in action monitoring, possibly by using error signals from the dopaminergic midbrain to adjust the relative impact of limbic and prefrontal inputs on frontal control systems in order to optimize goal-directed behavior.


INTRODUCTION
We continuously evaluate our actions in order to detect and correct performance errors suggesting that we are endowed with a cognitive system that monitors and optimizes our behavior. Recordings of event-related brain potentials (ERPs) from the intact scalp have provided neurophysiological evidence for the existence of such a system in the human: An erroneous button press in a performance task elicits the "error-related" negativity (ERN or Ne) having a peak latency around 60 ms and a medial frontal scalp distribution. The ERN is believed to be generated in the anterior cingulate cortex (Falkenstein et al., 1990;Gehring et al., 1993;Luu and Tucker, 2001;Rodriguez-Fornells et al., 2002;Rollnik et al., 2004;Ullsperger and von Cramon, 2003). Electrophysiological fi ndings have been supported by functional neuroimaging results that corroborate a role of the medial prefrontal cortex (anterior cingulate cortex and adjacent pre-SMA) in the processing of performance errors (Botvinick et al., 2001;Carter et al., 1998;Fassbender et al., 2004;Hester et al., 2004;von Cramon, 2001, 2003). Importantly, the ERN has been proposed to be driven by reinforcement learning signals originating in the mesencephalic dopamine system (MDS; Holroyd and Coles, 2002). Briefl y, a monitoring system located in the basal ganglia is thought to evaluate internal "efference copies" of ongoing actions. From this information, the monitoring system predicts whether ongoing events will end in success or failure. When the monitoring system revises its predictions for the better, it induces a phasic increase in the activity of the MDS (see also, Barto, 1995;Houk et al., 1995;Montague et al., 1996). Conversely, if the system revises its predictions for the worse, as in performance errors, it induces a phasic decrease in the activity of the MDS. These positive and negative reward prediction error signals are carried by the MDS to various brain areas including the anterior cingulate cortex, where they are used to improve performance on the task at hand in line with principles of reinforcement learning (Schultz, 2006).
In addition to its projections to the medial prefrontal cortex the mesencephalic dopamine system also projects to the Nucleus accumbens (Nacc), which is considered to be an important motivational brain area (Kelley and Berridge, 2002) and therefore part of the brain's reward system. It has been implicated in the etiology of craving and obsession (Ikemoto and Panksepp, 1999;Redish and Johnson, 2007). We will consider the proposed functions of the Nacc in some detail in the discussion. Importantly, the Nacc has recently become the target of neurosurgical intervention (Kuhn et al., 2007;Schlaepfer et al., 2008;Sturm et al., 2003;van Kuyck et al., 2007). Similar to the approach routinely used for the subthalamic nucleus in Parkinson's disease, electrodes connected to a stimulator placed underneath the pectoral muscle are implanted in the Nacc for the relief of obsessive compulsive disorder (OCD), depression, and Tourette's syndrome. Operations may be carried out while the patient is fully awake and therefore able to perform psychological tasks during the procedure. Here we report invasive recordings carried out in a 39-year old male patient suffering from compulsions (checking, washing, repeating) and obsessions since age 11, who received deep brain stimulation via an electrode placed in the right Nacc. A standard form of the fl anker interference task was used, which has been employed widely to study action monitoring processes in humans (Riba et al., 2005;Rodriguez-Fornells et al., 2002;Rollnik et al., 2004). The motivation for using a fl anker task was two-fold. First, previous investigations using similar tasks and surface ERPs have found evidence for increased action monitoring in OCD patients (Gehring et al., 2000;Hajcak and Simons, 2002;Johannes et al., 2001Johannes et al., , 2002Nieuwenhuis et al., 2005;Santesso et al., 2006). Second, and more importantly, because of its dopaminergic innervation we hypothesized that we should be able to detect errorrelated electrophysiological activity in the Nacc. Such a fi nding would have implications for how the Nacc functions as a limbic-motor interface, as error-driven dopaminergic input to the Nacc might serve to change the weights given to contextual and emotional information coming from hippocampus and amygdale, respectively, and cognitive information coming from prefrontal cortex in the selection of goal-directed behaviours (Grace, 2000).

MATERIALS AND METHODS
All procedures were approved by the ethical review boards of the University of Cologne (patient recordings) and University of Magdeburg (control participants).

Patient
A 39-year old male patient was investigated who suffered from severe and therapy-refractory OCD (ICD 10 classifi cation: F42.2) characterized by compulsive checking of the hands, washing and repetitive behaviour, as well as excessive rumination of thoughts. Onset of the disorder was age 11. At the time of the operation he was on venlafaxine (225 mg/day), a reuptake inhibitor of serotonin and norepinephrine. Prior therapies (all unsuccessful) had included cognitive behavioural therapy, exposition therapy, supportive psychotherapy and drug treatment with clomipramine, imipramine, fl uvoxamine, citalopram, and ziprasidon. He was single and a third year student of biology and geology at the time of testing. He had been employed as an offi ce worker prior to returning to school in his late thirties. Comprehensive neuropsychological testing revealed no cognitive or memory disturbances (Computerized test battery for the assessment of attention functions, Zimmermann and Fimm, 2002;Verbaler Lern-und Merkfähigkeitstest, Helmstaedter et al., 2001, German adaptation of Rey's Auditory Verbal Learning Test, RAVLT; Lezak, 1995; among other tests).

Control participants
A total of 83 normal participants (mean age 30.7 years) with no history of neurological and psychiatric illness served as controls. Data from subsamples of the control group have been described previously (Riba et al., 2005;Rodriguez-Fornells et al., 2002;Rollnik et al., 2004).

Stimuli and procedure
The Eriksen fl anker task (Eriksen and Eriksen, 1974) was used, which requires to respond to the center letter of a 5-letter array with either a left-hand (for letter H) or right-hand response (letter S). Additional letters fl anking the target letter either favoured the target response (congruent trials, HHHHH or SSSSS) or primed the other response (incongruent trials, HHSHH or SSHSS). To increase the number of errors produced, 60% of the trials were incongruent. Each stimulus array subtended about 2.5° of visual angle in width, and a fi xation cross was presented in the middle of the computer monitor just below the target letter in the array. The duration of the stimuli was 100 ms, and an SOA between 900 and 1500 ms was used. Letter/hand assignments were counterbalanced between subjects and maintained in both sessions. Prior to the fi rst experimental session, subjects were trained with 200 trials to reach a reaction time (RT) baseline level and they were given feedback about their performance. The goal of this procedure was to aim for a reaction time that would yield approximately 10-15% of errors. The experiment proper consisted of 6 blocks of 4 minutes each containing 200 stimuli each. A 30-second rest period was allowed between blocks. Subjects were required to respond to the stimuli as fast as possible.
Of the 83 normal subjects, about half took part in a condition in which error corrections were encouraged (i.e., the instruction was: Whenever you detect an error, you should correct it by giving the appropriate response), while others were tested with the instruction to not correct performance errors (correction forbidden condition). ERN amplitudes have been shown to be virtually identical in a within-subject comparison (Fiehler et al., 2005;Rodriguez-Fornells et al., 2002), which is why we pooled data from both conditions for the normative sample. Nonetheless, because corrective responses might interfere with posterror adaptation processes (i.e., posterror slowing), the normal sample for the behavioural comparisons was restricted to the 41 subjects who performed in a "correction forbidden" condition. Also, the patient was tested with correction forbidden instructions.
Response-locked ERPs were averaged starting 200 ms before the response of the subject until 624 ms after response onset. The baseline used for the response-locked ERN was between −200 and 0 ms. ERPs used for the fi gures and statistical tests were band-passed fi ltered 1-8 Hz (half amplitude down).

Electrophysiological recording/patient
On the day prior to the operation, the patient was tested using the same set-up as described for the control participants. He fi nished a total of 10 experimental blocks of 80 stimuli each.
During the operation, EEG signals from the scalp surface were recorded using needle-electrodes (stainless steel) placed at positions Fz, Cz, F3, F4 and Pz according to the International 10-20 system (Chatrian et al., 1985(Chatrian et al., , 1988. All channels were referenced to the right mastoid. In two subsequent sessions, local fi eld potentials (LFPs) were recorded from inside the shell of the Nacc and 5mm above the border of the Nacc shell (controlled by a micro drive unit equipped with "Ben Gun" assembly) using the macro electrode contact (stainless steel) at the tip of the micromacro electrode (Inomed GmbH, Germany). Signals from the macro electrode were referenced to the electrodes' guide tube. All signals were acquired at a bandwidth of 0.1-100 Hz and a digitization rate of 250 Hz. The baseline for local fi eld potentials and event related potentials was −400 and −200 ms before response.
Response-locked ERPs were obtained in the same way as described for the control group. In addition to standard analysis in the time-domain, we also examined time-frequency behaviour of the electrical activity elicited by the response (see Marco-Pallares et al., 2008, for a more extensive description). To this end, 4 second epochs were generated (2000 ms before and after the stimulus-onset). Epochs exceeding ±50 μV in EOG or EEG were removed from further analysis. Single-trial data was convoluted using a complex Morlet wavelet: with the relation f 0 /σ f [where σ f = 1/(2πσ t )] set to 6.7 (Tallon-Baudry et al., 1997). Changes in time varying energy (square of the convolution between wavelet and signal) in the studied frequencies (from 1 to 40 Hz; linear increase) with respect to baseline (−300 to 0 ms with regard to stimulus onset) were computed for each trial and averaged separately for error and correct trials.

Behavior
Reaction times in correct trials were 447 ms (SD 80) in the healthy control subjects (n = 41) and 434 ms in the patient during presurgical testing.
The reaction time to error trials was 379 ms (SD 70) in the control participants and 436 ms in the patient. Control subjects committed choice errors in 11.78% (SD 6.93), while the OCD patient's errors amounted to 19.8%.
To investigate posterror adaptations of behaviour, the degree of posterror slowing (Rabbitt, 1966(Rabbitt, , 1968(Rabbitt, , 2002 was determined by subtracting the reaction times to correct trials following correct trials from those to correct trials following error trials. Mean posterror slowing was 28 ms (SD 40) in the control sample, while the patient's posterror slowing amounted to 70.5 ms. The estimated percentage (using the program provided by Crawford and Howell, 1998) of the control population falling below the patient's score was 85%. Thus, the patient's posterror slowing was clearly in upper range of the normal sample.
During the operation, the reaction times of the patient were 391 ms for correct and 338 ms for error trials. He committed choice errors in 25.2% of the trials. Thus, the patient was able to perform the task adequately in spite of the challenges of surgery and the busy environment in the operating theatre.

ERPs control participants
Grand average ERPs time-locked to the responses are shown in Figure 1A. A clear phasic negativity emerged for the errors starting with the button press and reaching its maximum at approximately 70 ms. This negativity is followed by a positivity (error positivity) with a maximum at about 250 ms. Figure 1B shows the distribution of the ERN amplitudes quantifi ed as the mean amplitude of the difference wave error -correct (time-window 0-100 ms) at electrode Cz (mean −5.01 μV, SD 3.57).

Patient pretest
Average response-locked ERPs of the patient recorded on the day prior to the operation are shown in Figure 2. A typical ERN was observed for this patient. The topographical distribution is illustrated by the isovoltage map in Figure 2B. A frontocentral maximum was obtained. The ERN was quantifi ed as described for the control participants in the previous section at electrode Cz (−11.82 μV). To determine whether the difference between the patient's ERN amplitude and that of the control participants was signifi cant, the test described by Crawford and Howell (1998) was computed (t = −1.896, p (one-tailed) = 0.03). The estimated percentage of the normal population having a higher amplitude than the patient was 3.07% (95% lower/upper confi dence limits on the percentage 1.17%/6.14%). Thus, it can be concluded that the patient has an extraordinarily large ERN amplitude.

Patient intraoperative recording
During the operation the patient completed 2 experimental blocks of 7.5 minutes each. The electrode was positioned 5 mm above (−5 mm, block 1) and directly within (Nacc, block 2) the target region, i.e., the shell of the Nacc. The fi nal position of the electrode tip can be derived from Figure 3. Figure 4 shows the surface ERPs (Cz referenced to A1) together with those of the respective depth recordings. Clearly, an ERN is observed for the errors but not for correct responses. Also, for the 0 mm depth recording a clear error-related modulation is observed. In order to account for the fact that errors are usually associated with a faster reaction time than correct responses, we also computed an average ERP for a subset of the correct trials that were matched to the error trials in terms of their RT. Again, averaged activity from the target region showed an error-related

Figure 2. Event-related potentials from the OCD-patient. Pretest prior to the operation. (A) Event-related brain responses time-locked to the button press response recorded from the scalp (Fz, Cz sites referenced to average activity at the two mastoid processes). (B) The topographic map shows the distribution of the patient's ERN. A typical fronto-central maximum is observed.
modulation, which was nearly absent in the recordings from a position 5 mm above the target.
In order to determine the temporal relationship between the activity at the surface and the Nacc activity, a cross-correlation analysis based on single trials was performed (Figure 5). This analysis revealed that the cortical error response was delayed relative to Nacc by about 40 ms.
In Figure 6 we show the time-frequency plots. Depicted is the change in power (in percent relative to the baseline −300 to 0 with regard to stimulus onset).

DISCUSSION
In the current single patient study we present surface and intracerebral recordings from the Nucleus accumbens in a patient suffering from severe OCD. During the pretest an extraordinarily large ERN was obtained placing the patient at the 97 th percentile of our large sample of 83 normal participants. This fi ts well with previous investigations from our (Johannes et al., 2001(Johannes et al., , 2002 and other laboratories (Gehring et al., 2000;Hajcak and Simons, 2002;Santesso et al., 2006) which similarly have reported an   (Cz -A1) and from the depth electrode within the Nacc or 5 mm above the target. Button press errors (25% of trials) show a typical error-related negativity with a maximum at about 70 ms at Cz. By contrast correct trials show a positive defl ection. Clearly, activity from the Nacc proper but not from the region 5 mm above the target shows error-related modulations. To rule out the possibility that differences in brain responses are due to differences in reaction times in error and correct trials, correct trials matched to the error trials with regard to RT are also shown.  increased amplitude of the ERN in OCD patients, patients with Tourette's syndrome with comorbid OCD, and normal participants with obsessivecompulsive personality traits. These fi ndings have been interpreted as refl ecting increased action monitoring in OCD subjects, an interpretation which is also supported by a recent fMRI study that reported signifi cantly greater error-related activation of the rostral ACC in OCD compared to healthy subjects (Fitzgerald et al., 2005). Moreover, activity in this region was positively correlated with symptom severity in the patients. It is interesting to speculate that these fi ndings, and in particular the increased ERN amplitude in our patient, refl ect excessive activity within the corticostriatal-pallidal-thalamic loop, which has been documented using different neuroimaging techniques such as positron-emission-tomography (PET; Perani et al., 1995;Rauch et al., 1994), single-photon emission computed tomography (SPECT; Machlin et al., 1991) or fMRI (Adler et al., 2000;Breiter et al., 1996). It has been speculated that enhanced activity of the error monitoring system might refl ect a greater sensitivity to errors (Fitzgerald et al., 2005;Gehring et al., 2000;Johannes et al., 2001). Such enhanced sensitivity could manifest itself behaviourally by an increased tendency to slow responses after errors in order to avoid further slips on subsequent trials (posterror slowing, see Rabbitt, 1966Rabbitt, , 1968 or in an enhanced error correction rate. We can not assess the latter, as error corrections were explicitly forbidden in the current experiment, but posterror slowing was 70.5 ms in our patient which compares with only 28 ms in the control subjects. Interestingly, the degree of posterror slowing has recently been shown to be positively correlated to activity in the medial prefrontal cortex (Klein et al., 2007). Taken together, this supports the notion of greater error sensitivity in our patient.
Importantly, and in accord with our hypothesis, we observed clear error-related modulations in time-domain averages of Nacc activity but not from a position 5 mm above the target region. This suggests that the Nacc is specifi cally involved in error processing and action monitoring and rules out the possibility that the activity is caused by volume conductance from distant sources (e.g., the ACC). The cross-correlation analysis further shows that the Nacc receives information about 40 ms prior to the medial frontal structure (ACC) that give rise to the ERN in surface recordings. In addition, performance-related modulations were also visible in waveletbased time-frequency analyses. In particular, the recordings from the Nacc show that error and correct trials were also differentiated in this analysis by a burst of activity in the beta-range (14-17 Hz) for the correct trials in addition to low frequency activity in the theta range. Using surface recordings of EEG, we have recently been able to show a dichotomy of theta and beta oscillatory responses over the frontal midline, with the former being sensitive to negative and the latter correlated to positive events (Marco-Pallares et al., 2008). It is interesting to speculate, whether a similar dichotomy could be demonstrated for the Nacc, but clearly recordings in a greater sample of patients are necessary to address this issue.
The Nacc receives inputs from the prefrontal cortex on the one hand and limbic structures such as the hippocampus and amygdala on the other (Groenewegen et al., 1999;Ikemoto and Panksepp, 1999). Moreover, its activity is modulated by dopaminergic neurons originating in the ventral tegmental area (Ikemoto and Panksepp, 1999;Voorn et al., 1986) which is a key area in incentive motivation (cf. Berridge, 2004). This circuitry allows for the integration of contextual information arising from hippocampus and emotional information coming from the amygdala with cognitive information supplied by the PFC in the selection of goal-directed behaviours (Grace, 2000), which is why the Nacc has been called a limbic-motor interface (Mogenson et al., 1980). With respect to the dopaminergic input into the Nacc, Goto and Grace (2005) found that tonic and phasic dopamine release has selective effects on hippocampal and cortical inputs via D1 and D2 receptors. They conclude that dopamine release "regulates the balance between limbic and cortical drive through activation and inactivation of DA receptor subtypes in the accumbens and this regulates goal-directed behaviour." What the current data, for the fi rst time in humans, demonstrate is that the Nacc receives error signals that might be used to weigh the information coming from the different inputs in order to use this information for the adjustments of response strategies.
The current data also fi ts well with one of the more prominent theories that have been proposed for the generation of the ERN, i.e., the reinforcement learning theory of Holroyd and Coles (2002). This theory is based on the interaction of the mesencephalic dopamine system and a "generic" error-processing system located within the anterior cingulate cortex, the latter giving rise to the ERN. Specifi cally, the ERN is thought to be generated when a negative reinforcement learning signal is conveyed to the anterior cingulate cortex in the form of a decrease of dopaminergic activity coming from the mesencephalic dopamine system. We suggest that in addition to the anterior cingulate cortex, the Nacc receives a similar negative reinforcement signal that is used for the modifi cation of its information processing.
The present investigation took advantage of the fact that the Nacc has been identifi ed recently as a promising target for the treatment of OCD (Okun et al., 2007;Sturm et al., 2003;Tass et al., 2003;van Kuyck et al., 2007), Tourette syndrome (Kuhn et al., 2007), depression (Schlaepfer et al., 2008), and possibly addictive disorders (Kuhn et al., 2007). While our study was not intended to assess the clinical utility of the Nacc as a target of functional neurosurgery, we would like to briefl y consider some clinical aspects of this therapy. In a recent review, summarizing animal and patient studies, van Kuyck et al. (2007) demonstrated effects of electrical stimulation of the Nacc on a variety of behaviours including reward processing, goal-directed activity, fi ght-or-fl ight response patterns, degree of exploratory behaviour and food intake. The possible value of the Nacc as a target structure has been underscored by recent modelling work linking this structure in the etiology of craving and obsessions (Redish and Johnson, 2007). The infl uence of DBS of a neighbouring region, that is, the ventral anterior internal capsule/ventral striatum, on regional cerebral blood fl ow has recently been investigated using PET in an on/off design in a small group of OCD patients (Rauch et al., 2006). During stimulation a signifi cant activation of orbitofrontal cortex, anterior cingulate cortex, striatum, globus pallidus, and thalamus was found: These are brain regions that have been implicated in OCD on the one hand and, at least with regard to the anterior cingulate cortex, action monitoring on the other hand. Whether or not stimulation of the Nacc or the ventral striatum resets action monitoring circuits to a more normal level in OCD remains a topic for future investigation.

CONCLUSION
The invasive electrophysiological recordings from the Nacc suggest that this structure is involved in action monitoring. Because of its strategic location, Nacc may use the error signal to adjust the weight of limbic and prefrontal inputs on control structures that optimize goal-directed behaviour.