Ventral Midbrain NMDA Receptor Blockade: From Enhanced Reward and Dopamine Inactivation

Hernandez, Giovanni; Cossette, Marie-Pierre; Shizgal, Peter; Rompré, Pierre-Paul

doi:10.3389/fnbeh.2016.00161

ORIGINAL RESEARCH article

Front. Behav. Neurosci., 26 August 2016

Sec. Motivation and Reward

Volume 10 - 2016 | https://doi.org/10.3389/fnbeh.2016.00161

Ventral Midbrain NMDA Receptor Blockade: From Enhanced Reward and Dopamine Inactivation

Giovanni Hernandez¹

Marie-Pierre Cossette²

Peter Shizgal²

Pierre-Paul Rompré^1,2*

¹Département de Neurosciences, Université de Montréal, Montréal, QC, Canada
²FRQ-S Research Group in Behavioral Neurobiology, Department of Psychology, Concordia University, Montréal, QC, Canada

Glutamate stimulates ventral midbrain (VM) N-Methyl-D-Aspartate receptors (NMDAR) to initiate dopamine (DA) burst firing activity, a mode of discharge associated with enhanced DA release and reward. Blockade of VM NMDAR, however, enhances brain stimulation reward (BSR), the results can be explained by a reduction in the inhibitory drive on DA neurons that is also under the control of glutamate. In this study, we used fast-scan cyclic voltammetry (FSCV) in anesthetized animals to determine whether this enhancement is associated with a change in phasic DA release in the nucleus accumbens. Rats were implanted with a stimulation electrode in the dorsal-raphe (DR) and bilateral cannulae above the VM and trained to self-administer trains of electrical stimulation. The curve-shift method was used to evaluate the effect of a single dose (0.825 nmol/0.5 μl/side) of the NMDAR antagonist, (2R,4S)-4-(3-Phosphopropyl)-2-piperidinecarboxylic acid (PPPA), on reward. These animals were then anesthetized and DA release was measured during delivery of electrical stimulation before and after VM microinjection of the vehicle followed by PPPA. As expected, phasic DA release and operant responding depended similarly on the frequency of rewarding electrical stimulation. As anticipated, PPPA produced a significant reward enhancement. Unexpectedly, PPPA produced a decrease in the magnitude of DA transients at all tested frequencies. To test whether this decrease resulted from excessive activation of DA neurons, we injected apomorphine 20 min after PPPA microinjection. At a dose (100 μg s.c.) sufficient to reduce DA firing under control conditions, apomorphine restored electrical stimulation-induced DA transients. These findings show that combined electrical stimulation and VM NMDARs blockade induce DA inactivation, an effect that indirectly demonstrates that VM NMDARs blockade enhances reward by potentiating stimulation-induced excitation in the mesoaccumbens DA pathway.

Introduction

Glutamate, the major excitatory neurotransmitter in the brain, plays a major role in behavioral, cognitive and motivational functions. Within the ventral midbrain (VM), glutamatergic afferents potently modulate reward-relevant circuitry by controlling dopamine (DA) excitability via two opposing mechanisms. Through its action on GABAergic afferents and GABAergic interneurons, glutamate maintains a strong inhibitory drive on DA neurons (Grace et al., 2007) so that a majority of them are unresponsive to excitatory inputs (Grace and Bunney, 1984). In contrast, through its direct action on DA neurons, glutamate switches DA neural activity from a slow, irregular, firing pattern to a phasic burst-firing mode that is associated with enhanced DA release (Grace and Bunney, 1984; Charara et al., 1996; Geisler et al., 2007; Omelchenko et al., 2009). Moreover, this mode of neural activity is associated with the acquisition of appetitive and aversive tasks (Zweifel et al., 2009). It has also been proposed that DA burst firing encodes reward prediction errors (Montague et al., 1996) and conveys motivationally relevant signals to anterior forebrain regions that control executive functions (Overton and Clark, 1997).

Excitation and inhibition of DA neurons may entail multiple glutamate receptor subtypes with different sensitivity to agonists and antagonists and the expression on different inputs to VM DA neurons. Empirical evidence for such an arrangement comes from experiments in which systemic or local injection of either N-methyl-D-aspartate receptor (NMDAR) agonists or antagonists increases DA burst firing (French et al., 1993), terminal DA release (Westerink et al., 1996; Mathé et al., 1998; Kretschmer, 1999), and forward locomotion (Kretschmer, 1999; Cornish et al., 2001). Furthermore, VM microinjections of short inhibitory RNA that reduces the number of NMDAR on VM neurons results in an attenuation of reward induced by dorsal-raphe (DR) electrical stimulation, while VM microinjection of (2R,4S)-4-(3-Phosphopropyl)-2-piperidinecarboxylic acid (PPPA) and 2-carboxypiperazin-4-propyl-1-phosphonic acid ((R)-CPP), NMDAR antagonists that display a high affinity for receptor composed of the GluN2A subunits, produces the opposite effect (Bergeron and Rompré, 2013; Hernandez et al., 2015). The most likely mechanism for the reward-attenuating effect is a reduction in the NMDARs that control DA burst firing, whereas a likely mechanism for the reward-enhancing effect is a blockade of a different subtype of NMDARs that maintain the inhibitory drive on DA neurons. Given the large body of evidence supporting a role of VM mesoaccumbens DA neurons in reward (Wise and Rompre, 1989; Lak et al., 2014; Eshel et al., 2016) and the more recent evidence that activation of VM glutamatergic inputs from the DR induces reward and enhances extracellular DA in the nucleus accumbens (Qi et al., 2014), we used fast-scan cyclic voltammetry (FSCV) to determine whether the enhancement of DR reward by VM NMDAR blockade is associated with a change in phasic DA release in the nucleus accumbens shell (NAcS).

Materials and Methods

Subjects and Surgery

Sixteen (16) male Long-Evans rats (Charles River, St-Constant, QC, Canada) weighing between 350–400 g at the time of the surgery were used. Rats were individually housed in a temperature- and humidity- controlled room with a 12-h light-dark cycle (lights on at 06:00 h) and ad libitum access to food and water. After a minimum 7-day period of acclimation to the housing environment rats were anesthetized with isoflurane and stereotaxically implanted according to Paxinos and Watson (2007) coordinates with 26-gauge guided cannulae (HRS Scientific, Montreal, Canada) aimed bilaterally at the VM (−5.5 mm AP, ±3.2 mm ML at a 18° angle, −6.5 mm DV from the skull surface) and a stainless steel monopolar electrode aimed at the DR (−7.6 mm AP, 0 mm ML, −6.6 mm DV from the skull surface). Detailed surgical procedures can be found in Bergeron and Rompré (2013). Of these 16 rats, nine rats were trained to nose-poke to deliver electrical pulses to the DR; whereas the other seven were used only in the electrochemistry experiment. For the behaviorally trained rats, one failed to self-stimulate and in three rats we were unable to measure DA. For the non-behaviorally trained rats we were unable to measure DA in one rat. For an schematic of the experimental sequence see Figure 1. All procedures were approved by the Animal Care and Use Committee of the Université de Montréal and Concordia University in accordance with the guidelines of the Canadian Council on Animal Care.

FIGURE 1

Figure 1. Experimental sequence for behaviorally trained and naïve subjects.

Drugs

PPPA (Tocris, Ellisville, MI, USA) was dissolved in sterile 0.9% saline and stored frozen in 40–50 μl aliquots. Drug solutions were thawed just before testing and used only once. PPPA was injected into the VM at a dose of 0.825 nmol/0.5 μl/side. Urethane (ethyl carbamate; Sigma, St. Louis, MO, USA) was dissolved in sterile 0.9% saline; it was injected intraperitoneally (i.p.) at a dose of 1.5 g/kg. Apomorphine (Sigma, St. Louis, MO, USA) was dissolved in sterile 0.9% saline; it was injected subcutaneously (s.c.) at a total dosage of 100 μg /kg. Drug doses are expressed as salts.

Self-Stimulation Training

Eight rats were shaped to nose poke, under a fixed ratio 1 (FR1), for a 0.4-s train of cathodal, rectangular, constant-current pulses, 0.1 ms in duration, delivered at a frequency of 98 Hz. Once the rat nose poked consistently at current intensities between 125 and 400 μA, a rate vs. pulse-frequency curve was obtained by varying the stimulation frequency across trials over a range that drove the number of rewards earned from maximal to minimal levels. A detailed shaping and training procedure can be found in Hernandez et al. (2015).

At least three behavioral tests were carried out before the FSCV experiments. A first saline test was performed to habituate the animals to the microinjection procedure. The detailed bilateral injections procedure can be found in Hernandez et al. (2015). Immediately after the microinjection, rats were allowed to self-stimulate for an hour. Results from this test were not included in the analysis. Baseline data were collected 1 week after this test. The rate vs. pulse-frequency data were fitted using the following sigmoid function

y = M i n + \frac{(M a x - M i n)}{1 + 10^{(x 50 - x) * p}}

where Min is the lower asymptote, Max is the upper asymptote, x50 is the position parameter denoting the frequency at which the slope of the curve is maximal, and p determines the steepness of the sigmoid curve. The resulting fit was used to derive an index of reward threshold, defined as the pulse-frequency sustaining a half-maximal rate of responding (M50). Self-stimulation behavior was considered stable when the M50 values varied less than 0.1 log unit for three consecutive days. Once stable performance was obtained, we evaluated, on separate days, the effect of bilateral VM microinjections of a single dose (0.825 nmol/0.5 μl/side) of PPPA and an equal volume of saline (counterbalance order).

FSCV

After at least 4 days after completion of the last behavioral test, rats were anesthetized, and phasic DA release was measured during delivery of DR stimulation, both before and after bilateral VM microinjection of saline and PPPA. Rats were anesthetized with urethane and placed in the stereotaxic apparatus. Holes were drilled for the placement of the carbon-fiber electrode, sintered Ag/AgCl reference electrode (In Vivo Metrics, Healdsburg, CA, USA) and the anode electrode. For naïve rats (not behaviorally tested; n = 7), a DR stimulation electrode and VM cannulae were also implanted using the stereotaxic coordinates mentioned above. The carbon fiber electrode was built by encasing a carbon fiber (Thorne, Amoco Corporation, Greenville, SC, USA) in a single barrel borosilicate glass capillary (ID = 0.40 mm OD = 0.60 mm; A-M System Carlsborg, WA, USA). The seal between the carbon fiber and the glass was produced by heating the glass capillary with a pipette puller (PUL-1, WPI, Sarasota, FL, USA). A wire covered with silver paint (GC Electronics, Rockford, IL, USA) was inserted in the capillary to make contact with the carbon fiber and secured with shrink tubing coated with epoxy. The carbon fiber electrodes had an exposed tip length of 150–200 μm exposed tip length and 7 μm diameter. The carbon-fiber electrode (working electrode) was aimed at the NAcS (+1.7 mm AP, +1 mm ML, −7.0 mm DV). FSCV was computer- controlled as described previously (Heien et al., 2003). In brief, an 8.5 ms triangular input waveform (initial ramp, −0.4 to 1.3 V, 400 V/s; Heien et al., 2003) was applied to the working electrode at 10 Hz. The potential was held at −0.4 V between each scan to promote cation absorption at the surface of the FSCV electrode. A computer using software written in LABVIEW (National Instruments) and a multifunction data-acquisition board (PCI-6052E, National Instruments) controlled the waveform parameters and digitalized the recorded data. A PCI-6711E (National Instruments) board was used to synchronize the waveform acquisition, data collection and stimulation delivery.

Background-subtracted cyclic voltammograms were obtained by digitally subtracting voltammograms collected during stimulation from those collected during baseline recording. Electrical stimulation was triggered after a synchronization signal was sent to the external input of a Master-8 pulse generator (A.M.P.I. Jerusalem, Israel). Voltages generated by the Master-8 were converted to constant currents via a stimulus isolation unit (AM-2200, AM-Systems, Carlsborg, WA, USA). Electrical stimulation was delivered 5 s after the start of each recording, and each was delivered in the 91.5 ms inter-waveform interval so that it did not interfere with the voltammetry scans.

Once the carbon fiber electrode was in place, electrical stimulation (40 cathodal rectangular, constant-current pulses, 0.1 ms in duration) was delivered at a frequency of 98 Hz at the current intensity used during the behavioral tests or at an initial current of 400 μA.) through the DR electrode while DA transients were monitored; if a DA signal was not observed, the working electrode was lowered by 0.1 mm, and the electrical stimulation was repeated. This sequence was reiterated until stimulation-induced DA transients were detected. For naïve rats, the current intensity was adjusted so that the magnitude of DA signal measured across the different stimulation parameters was similar to that of the behaviorally trained rats. The DA signal was recorded in response to a descending set of pulse parameters similar to the ones used during the behavioral test. Each set of stimulation parameters was repeated three times with a 60-s inter-stimulation interval. This inter-stimulation interval produces a stable amplitude of DA transients (Cossette et al., 2016). Following this initial sweep, stimulation was delivered using three sets of parameters, corresponding to those that had produced maximal, half-maximal, and minimal responding in the behavioral test. The first sweep was used as the baseline, and then the vehicle was injected into the VM, followed by PPPA; The DA release was monitored across the three sets of stimulation parameters. In some rats, apomorphine or its vehicle was injected systemically (s.c.) after PPPA, and DA release was monitored again. At the end of the FSCV recordings, electrodes were post-calibrated by placing the electrode into a flow injection system (Upchurch Scientific, Oak Harbor, WA, USA), in which known concentrations of DA 100, 200, and 500 nM dissolved in artificial cerebrospinal fluid (aCSF: 145 mM Na⁺, 2.7 mM K⁺, 1.22 mM Ca²⁺, 1.0 mM Mg²⁺, 150 mMCl⁻, 0.2 mM ascorbate, 2 mM Na₂HPO₄, pH 7.4 ± 0.05) were used to related current values to concentration values. The average concentration of DA transients observed in the NacS after electrical stimulation of the DR at maximum frequency was 151.38 (SEM = 19.58) nM a value in the range of previously reported studies (Cheer et al., 2007; Park et al., 2010; Oleson et al., 2012; Saddoris et al., 2015).

Histology

After the completion of the experiment the location of the cannulae, stimulating and recording electrodes were obtained via deposit of iron ions after 1 mA of anodal current was delivered for 60 s. Animals were deeply anesthetized with urethane (2 g/kg, i.p.) and the current was delivered through the stimulating electrode, through the injection cannulae that were inserted into the guides, or through a stimulating electrode that was lowered to the site at which the voltammetric recordings were obtained. The animals were then perfused intracardially with 0.9% sodium chloride, followed by a formalin-Prussian Blue solution (10% formalin, 3% potassium ferricyanide, 3% potassium ferrocyanide and 0.5% trichloroacetic acid) that forms a blue reaction product with the iron particles. Brains were removed and fixed with 10% formalin solution for at least 7 days. Coronal sections of 40-μm thickness was cut with a cryostat to confirm placements of the FSCV electrodes in the NAcS, the stimulating electrode in the DR and cannulae in the VM.

Data Analysis

Matlab (Natick, MA, USA) was used to fit curves to the nose-poke and DA-release data. The phasic DA release was normalized to the maximum current obtained during baseline recordings. When standardized, nose-poke responses were normalized to the maximum number of responses. Comparisons between effectiveness indices for the behavioral and neurochemical data were made with paired or unpaired Student’s t-tests or one-way repeated-measures ANOVAs, followed by Tukey’s honestly significant difference post hoc tests. If the sphericity assumption was violated, the Greenhouse and Geisser correction was used. The correlation between the average change in M50 and the average change in the DA phasic release following PPPA injection was computed. Initial voltammetric data analysis was performed using LabVIEW written software and was low-pass filtered at 2 KHz. DA was chemically identified by its characteristic background-subtracted cyclic voltammogram; oxidation peak occurs at ~0.65 V and the reduction peak at ~−0.2 V, and principal component regression (PCR) was used as previously described to extract the DA component from the raw voltammetric data (Heien et al., 2005). The DA signal used for analysis was time-locked to the electrical stimulation trains. The quality of the signature was comparable at all pulse frequencies. All analyses were performed and graphics were prepared in Origin v9 (Northampton, MA, USA).

Results

Histology

Histological analysis revealed that the tips of the FSCV recording electrodes were within the shell of the nucleus accumbens (Figures 2A,A′). The injection sites were located within the ventral part of the VM (Figures 2B,B′), a region that contains neurons activated by rewarding electrical stimulation (Wise and Rompre, 1989; Marcangione and Rompré, 2008). Finally, stimulation sites were located within the postero-medial mesencephalon, within the ventral central gray, between the anterior-posterior regions (Figures 2C,C′).

FIGURE 2

Figure 2. Location of the tips of the carbon fibers (fast-scan cyclic voltammetry, FSCV; A,A′), injection sites (B,B′), and stimulating electrodes (C,C′) for each animal included in the study. Left panel shows the animals that were behaviorally trained. Right panel shows naïve animals.

Sweep Dopamine Transients

Figure 3A shows the obtained DA phasic transiens across the different stimulation parameters for those rats that received previous behavioral training (circles) and those that were naïve (squares). The DR electrical stimulation induced DA transients in the NAcS and the magnitude of the transients increased systematically as a function of the pulse frequency. Normalized phasic DA transients overlap, and there is no statistical difference in the steepness of the curves [t₍₁₁₎ = 1.09; p > 0.05], the pulse frequency that produced half-maximal release [t₍₁₁₎ = 1.66; p > 0.05], the lower asymptote [t₍₁₁₎ = 0.57; p > 0.05], or the upper asymptote [t₍₁₁₎ = 0.89; p > 0.05] between trained and naïve subjects. Figure 3B shows, in one subject, how self-stimulation performance and stimulation-induced DA transients vary as a function of pulse frequency. Although the slopes, in all the behaviorally tested subjects, between fitted curves relating stimulation frequency to the behavior and DA release differ [t₍₅₎ = 3.79; p < 0.05]; their M50 values do not [t₍₅₎ = 1.78; p > 0.05].

FIGURE 3

Figure 3. Dopamine (DA) release induced by dorsal raphe (DR) stimulation, as a function of pulse frequency. (A) The DA release profile obtained from behaviorally trained and naïve subjects is very similar. (B) The relation between self-stimulation performance and stimulation-induced DA release in one representative subject. Although the slopes of the two curves differ, their midpoints fall at similar positions along the pulse-frequency axis.

PPPA Enhanced Brain Stimulation Reward, Yet it Decreased Dopamine Transients

Figure 4A shows for a representative subject the behavioral effects of intra-VM injections of PPPA (0.825 nmol/0.5 μl/side). This drug produced a leftward and upward shift of the curve that relates the nose-poke rate as a function of pulse frequency: less stimulation was necessary to obtain a given level of performance and more responses were emitted when contrasted against the vehicle, at some stimulation frequencies. Figures 4B,C show respectively the average changes in M50 values and in maximal response expressed as a percentage of baseline, for all the behaviorally trained subjects in which DA transients were recorded. In contrast to the vehicle, PPPA produced a significant 25.3% (SEM = 6.3) reduction in M50 value [t₍₅₎ = 3.43; p < 0.05] and a significant 39.7% (SEM = 11.7) increase in nose-poke responses [t₍₅₎ = 3.27; p < 0.05]. The observed potentiation of DR reward is very similar to what has been previously observed and described (Bergeron and Rompré, 2013; Ducrot et al., 2013; Hernandez et al., 2015).

FIGURE 4

Figure 4. Effects of intra ventral midbrain (VM) (2R,4S)-4-(3-Phosphopropyl)-2-piperidinecarboxylic acid (PPPA) injection on behavior. (A) In one representative subject intra VM injection of PPPA shifted the response-rate vs. pulse-frequency curve leftward and increased its upper asymptote. (B) The bar graph shows a significant reduction in M50 value in all the behaviorally trained subjects in which DA release was successfully measure. The reduction in M50 value suggests an increase in the effectiveness of the stimulation to elicit nose-poke behavior. (C) The bar graph shows a significant increase in the maximum response rate elicited by intra VM injection of PPPA.

Under urethane anesthesia (Figure 5A) VM vehicle microinjection produced no discernable change in DA transients evoked by DR stimulation, whereas PPPA injection produced a significant decrease in the magnitude of the DA transients at all tested frequencies Maximum [F_(1.002,5.02) = 47.94; p < 0.05]; M50[F_(1.09,5.46) = 12.48; p = <0.05]; Minimum [F_(1.41,7.07) = 15.65; p < 0.05]. It is noteworthy that the decrease in DA is negatively correlated with the magnitude of reward enhancement (r_xy = −0.821); 59% of the variance observed in the magnitude attenuation of DA transients can be explained by the observed magnitude of reward enhancement (r²_adj. = 0.593; p < 0.05, Figure 5B).

FIGURE 5

Figure 5. Effects of intra VM PPPA injection on DA release. (A) Under urethane anesthesia intra VM PPPA injection decreased stimulation-elicited DA release in comparison to measures obtained before and after vehicle injection; this effect was seen at all three frequencies tested. (B) A negative correlation between the drug-induced decrease in stimulation-evoked DA release and the change in M50 was observed. In the behaviorally trained animals, the enhancement in reward effectiveness, produced by VM injection of PPPA covariates with the reduction in DA release; so that the larger the enhancement the greater the DA release reduction.

We injected apomorphine to test whether the negative correlation between the drug-induced changes in M50 and DA transient magnitude resulted from depolarization inactivation (DI) in DA neurons due to the strong excitatory drive produced by the DR stimulation. At low doses, apomorphine is known to reduce DA firing by stimulating DA autoreceptors. This action hyperpolarizes DA neurons and increases input resistance, restoring DA firing and excitability when neurons are in a state of DI (Grace and Bunney, 1985). We injected a dose of apomorphine (100 μg s.c., Figure 6) that when injected alone produced a long-lasting and significant reduction of DA transients at the three frequencies tested Maximum [F_(1.65,4.96) = 9.91; p < 0.05]; M50[F_(1.89,5.69) = 17.87; p < 0.05]; Minimum [F_(1.41,7.07) = 8.04; p = <0.05].

FIGURE 6

Figure 6. Subcutaneous injection of apomorphine (100 μg) produced a significant and long lasting reduction of DA oxidation at the three frequencies tested. Filled symbols represent a significant reduction of DA oxidation when contrasted against the pre-injection values, time 0.

Figure 7 documents, for different selected subjects, the effects of apomorphine on stimulation-induced DA transients. The false-color plots represent background-subtracted redox currents as a function of voltage and time. Above each false-color plot is the time course of the DA-oxidation current at the potential corresponding to peak oxidation (the ordinal value corresponding to the center of the red blob in the leftmost false-color plot). DA transient concentration is directly proportional to this oxidation current. Superimposed on the false-color plots, in the upper right quadrant, is a background-subtracted voltammogram. The form of the voltammograms matches the voltammetric signature of DA; oxidation peak occurs at ~0.65 V and the reduction peak at −0.2 V. The time-course plots and voltammograms are horizontal and vertical sections, respectively, passing through the peak DA-oxidation current in the false-color plot. The dashed line denotes the onset of the electrical stimulation train.

FIGURE 7

Figure 7. Examples of DA release in the nucleus accumbens shell (NAcS) in response to electrical stimulation of the DR at M50 parameters. The false-color plots show redox currents as function of applied voltage and time. Current at the peak oxidation potential of DA is shown above the false-color plot as function of time. The insets show the cyclic voltammogram. Dashed lines denote the onset of the electrical stimulation train. (A) DA release for M50 stimulation at baseline, 10 min after intra-VM saline; 10 and 20 min after s.c. saline. The DA peak is quite stable over time and across conditions. (B) Intra-VM injection of PPPA alone produces a significant decrease in DA release. (C) A similar decrease in DA phasic release is observed after s.c. injection of apomorphine. (D) Administration of apomorphine 20 min after intra-VM injection of PPPA partially restored the magnitude of DA transient.

Figure 7A (top row) shows that systemic or intra-VM injection of saline had a negligible effect on the stimulation-induced DA transient. The rightmost two panels in Figures 7B,C demonstrate that both intra-VM PPPA (Figure 7B) and systemic apomorphine (Figure 7C) decreased the stimulation-induced DA transient. Systemic administration of apomorphine 10 min following intra-VM injection of PPPA partially restored the DA transient (Figure 7D, 4th column).

Figure 8 quantifies the partial restoration of the DA transients by systemic administration of apomorphine (as illustrated by the difference between the peak DA oxidation currents in the bottom right panel of Figure 7 and the panel to its immediate left). The increase observed at each frequency was significant when contrasted against the last transient recorded after PPPA injection (t_Max(5) = 4.34, p = <0.05; t_M50(5) = 7.17, p < 0.05; t_Min(5) = 3.58, p < 0.05).

FIGURE 8

Figure 8. Subcutaneous injection of apomorphine 20 min after intra-VM injection of PPPA partially restores stimulation-induced DA release at the maximal stimulation frequency (A), M50 stimulation frequency (B), and at the lowest stimulation frequency (C).

Discussion

DA plays a critical role in reward and reward-seeking. Direct optical stimulation of DA neurons produces a conditioned place-preference (Tsai et al., 2009) and rodents will perform an operant response to optically activate VM DA neurons (Witten et al., 2011; Kim et al., 2012). Rewarding electrical stimulation of different brain areas increases DA levels in the Nac and elicits DA phasic release (Hernandez et al., 2006; Owesson-White et al., 2008; Hernandez and Shizgal, 2009; Cossette et al., 2016). Also, drugs that boost DA release enhance brain stimulation reward (BSR); whereas the opposite behavioral effect is obtained with drugs that decrease DA availability (Wise and Rompre, 1989). Our results show that electrical stimulation of the DR, at frequencies that elicit self-stimulation behavior, produces DA phasic release in the NacS. The magnitude of DA transients increase as a function of pulse frequency in an orderly manner and eventually levels off, as it has been observed previously (Fiorino et al., 1993; Yavich and Tanila, 2007). In our anesthetized preparation, DA release correlates with the behavioral performance observed during self-stimulation, which suggests that information about the reward intensity is reflected in the DA phasic release, at least over the range of stimulation parameters tested in the present study. To evaluate this hypothesis rigorously, it will be necessary to determine whether equiprefered stimulation parameters would produce similar phasic DA output (Moisan and Rompre, 1998).

In freely moving animals, preferential blockade of VM NMDA GluN2A receptors with the PPPA enhances the reward-seeking produced by electrical stimulation of the DR (Bergeron and Rompré, 2013; Ducrot et al., 2013; Hernandez et al., 2015). This enhancement implicates GluN2A receptors in inhibition of VM DA neurons. Blockade of these NMDA receptors will lead to a disinhibition of DA neurons, enhanced phasic firing and DA release in terminal areas. Unexpectedly, electrically induced DA transients were lost when rewarding stimulation was combined with VM microinjection of PPPA. One possible explanation for this unexpected result is the product of an increase in net excitation that drives the DA neurons into a state of DI (White and Wang, 1983; Grace and Bunney, 1986). To test this hypothesis, we determined whether apomorphine can restore electrically induced DA transients following microinjection of PPPA. An in vitro electrophysiological study has shown that under normal conditions, apomorphine hyperpolarizes DA neurons (Grace and Bunney, 1985). However, in a state of DI, apomorphine-induced hyperpolarization restores the responsiveness of DA neurons to excitatory input (Grace and Bunney, 1986) by allowing the slow sodium gates to reopen. Consistent with this hypothesis, apomorphine partially restored the magnitude of electrically induced DA transients in animals that had received a prior VM microinjection of PPPA. We speculate that DA release was not totally restored because apomorphine produces multiple effects, some of which have opposing influences on the excitability of DA neurons. In addition to hyperpolarization due to stimulation of DA autoreceptors, apomorphine activates nerve-terminal autoreceptors, an effect that reduces extracellular DA release (Grace and Bunney, 1985). Accordingly, we found that apomorphine reduced the magnitude of electrically induced DA transients following VM microinjection of the vehicle.

The occurrence of DI in our preparation is most likely the product of coordinated disinhibition, via blockade of NMDAR, and glutamatergic excitation, via activation of AMPA receptors (Ducrot et al., 2013; Qi et al., 2014). Similar synergistic effects leading to DI had been previously reported with the DA antagonist, pimozide, and the opiate agonist, morphine (Rompre and Wise, 1989; Henry et al., 1992). A VM microinjection of morphine that enhanced BSR in naïve animals produced a complete cessation of responding in animals previously injected with pimozide. In this latter condition, operant responding was reinstated by VM microinjection of a dose of the GABA agonist, muscimol, that inhibited reward under a control condition (no other drug treatment).

The present findings not only support the hypothesis that PPPA and reward synergize to enhance DA excitation, but they also suggest that different NMDAR receptor subtypes are involved in modulation of mesoaccumbens DA impulse flow. NMDARs are heterodimers composed of two GluN1 subunits with GluN2 and/or GluN3 subunits. Previous pharmacological and SiRNA data suggest that PPPA-sensitive NMDARs are most likely located on VM afferents to DA neurons, are composed of GluN2A subunits, and are devoid of GluN2B (Bergeron and Rompré, 2013; Hernandez et al., 2015). Although previous results show that the DR reward signal is transmitted to VM neurons through AMPA receptors, a role for NMDAR cannot be excluded. NMDAR activation is essential for induction of DA burst firing (Zweifel et al., 2009) and a reduction in VM GluN1, the subunit common to all NMDARs, produces a significant attenuation of DR reward (Hernandez et al., 2015); it is thus most likely that the NMDAR involved induction of DA burst firing is composed of different subunits than the NMDAR that controls the inhibitory drive.

The present results show that among its many roles, glutamate mediates a strong inhibitory drive on DA neurons and that DA-related reward signals can be strongly enhanced by reducing this inhibitory drive through blockade of VM NMDARs. This provides additional evidence that glutamate modulates DA neural activity in multiple ways and thus plays a key, albeit complex, role in reward signaling.

Author Contributions

P-PR and GH designed the study. GH and M-PC carried out the experiments, GH analyzed the data. P-PR, GH, M-PC and PS contributed to interpretation of the results and writing of the article.

Funding and Disclosure

This article was supported by Natural Sciences and Engineering Research Council of Canada (NSERC) grants to P-PR (#119057, RGPIN-2015-05018) and to PS (RGPIN-308-11); NSERC postdoctoral Fellowship to GH, a grant from the “Fonds de recherche du Québec—Santé” to the “Groupe de Recherche en Neurobiologie Comportementale”/Center for Studies in Behavioral Neurobiology, and support to PS from the Concordia University Research Chairs program.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors thank Wayne Brake, for hosting part of the behavioral experiments in his laboratory and David Munro and Claude Bouchard for technical assistance.

References

Bergeron, S., and Rompré, P.-P. (2013). Blockade of ventral midbrain NMDA receptors enhances brain stimulation reward: a preferential role for GluN2A subunits. Eur. Neuropsychopharmacol. 23, 1623–1635. doi: 10.1016/j.euroneuro.2012.12.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Charara, A., Smith, Y., and Parent, A. (1996). Glutamatergic inputs from the pedunculopontine nucleus to midbrain dopaminergic neurons in primates: phaseolus vulgaris-leucoagglutinin anterograde labeling combined with postembedding glutamate and GABA immunohistochemistry. J. Comp. Neurol. 364, 254–266. doi: 10.1002/(SICI)1096-9861(19960108)364:2<254::AID-CNE5>3.0.CO;2-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheer, J. F., Aragona, B. J., Heien, M. L., Seipel, A. T., Carelli, R. M., and Wightman, R. M. (2007). Coordinated accumbal dopamine release and neural activity drive goal-directed behavior. Neuron 54, 237–244. doi: 10.1016/j.neuron.2007.03.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Cornish, J. L., Nakamura, M., and Kalivas, P. W. (2001). Dopamine-independent locomotion following blockade of N-methyl-D-aspartate receptors in the ventral tegmental area. J. Pharmacol. Exp. Ther. 298, 226–233.

PubMed Abstract | Google Scholar

Cossette, M.-P., Conover, K., and Shizgal, P. (2016). The neural substrates for the rewarding and dopamine-releasing effects of medial forebrain bundle stimulation have partially discrepant frequency responses. Behav. Brain Res. 297, 345–358. doi: 10.1016/j.bbr.2015.10.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Ducrot, C., Fortier, E., Bouchard, C., and Rompré, P.-P. (2013). Opposite modulation of brain stimulation reward by NMDA and AMPA receptors in the ventral tegmental area. Front. Syst. Neurosci. 7:57. doi: 10.3389/fnsys.2013.00057

PubMed Abstract | CrossRef Full Text | Google Scholar

Eshel, N., Tian, J., Bukwich, M., and Uchida, N. (2016). Dopamine neurons share common response function for reward prediction error. Nat. Neurosci. 19, 479–486. doi: 10.1038/nn.4239

PubMed Abstract | CrossRef Full Text | Google Scholar

Fiorino, D. F., Coury, A., Fibiger, H. C., and Phillips, A. G. (1993). Electrical stimulation of reward sites in the ventral tegmental area increases dopamine transmission in the nucleus accumbens of the rat. Behav. Brain Res. 55, 131–141. doi: 10.1016/0166-4328(93)90109-4

PubMed Abstract | CrossRef Full Text | Google Scholar

French, E. D., Mura, A., and Wang, T. (1993). MK-801, phencyclidine (PCP) and PCP-like drugs increase burst firing in rat A10 dopamine neurons: comparison to competitive NMDA antagonists. Synapse 13, 108–116. doi: 10.1002/syn.890130203

PubMed Abstract | CrossRef Full Text | Google Scholar

Geisler, S., Derst, C., Veh, R. W., and Zahm, D. S. (2007). Glutamatergic afferents of the ventral tegmental area in the rat. J. Neurosci. 27, 5730–5743. doi: 10.1523/JNEUROSCI.0012-07.2007

PubMed Abstract | CrossRef Full Text | Google Scholar

Grace, A. A., and Bunney, B. S. (1984). The control of firing pattern in nigral dopamine neurons: burst firing. J. Neurosci. 4, 2877–2890.

PubMed Abstract | Google Scholar

Grace, A. A., and Bunney, B. S. (1985). Low doses of apomorphine elicit two opposing influences on dopamine cell electrophysiology. Brain Res. 333, 285–298. doi: 10.1016/0006-8993(85)91582-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Grace, A. A., and Bunney, B. S. (1986). Induction of depolarization block in midbrain dopamine neurons by repeated administration of haloperidol: analysis using in vivo intracellular recording. J. Pharmacol. Exp. Ther. 238, 1092–1100.

PubMed Abstract | Google Scholar

Grace, A. A., Floresco, S. B., Goto, Y., and Lodge, D. J. (2007). Regulation of firing of dopaminergic neurons and control of goal-directed behaviors. Trends Neurosci. 30, 220–227. doi: 10.1016/j.tins.2007.03.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Heien, M. L. A. V., Khan, A. S., Ariansen, J. L., Cheer, J. F., Phillips, P. E. M., Wassum, K. M., et al. (2005). Real-time measurement of dopamine fluctuations after cocaine in the brain of behaving rats. Proc. Natl. Acad. Sci. U S A 102, 10023–10028. doi: 10.1073/pnas.0504657102

PubMed Abstract | CrossRef Full Text | Google Scholar

Heien, M. L. A. V., Phillips, P. E. M., Stuber, G. D., Seipel, A. T., and Wightman, R. M. (2003). Overoxidation of carbon-fiber microelectrodes enhances dopamine adsorption and increases sensitivity. Analyst 128, 1413–1419. doi: 10.1039/B307024G

PubMed Abstract | CrossRef Full Text | Google Scholar

Henry, D. J., Wise, R. A., Rompre, P. P., and White, F. J. (1992). Acute depolarization inactivation of A10 dopamine neurons: interactions of morphine with dopamine antagonists. Brain Res. 596, 231–237. doi: 10.1016/0006-8993(92)91552-p

PubMed Abstract | CrossRef Full Text | Google Scholar

Hernandez, G., Hamdani, S., Rajabi, H., Conover, K., Stewart, J., Arvanitogiannis, A., et al. (2006). Prolonged rewarding stimulation of the rat medial forebrain bundle: neurochemical and behavioral consequences. Behav. Neurosci. 120, 888–904. doi: 10.1037/0735-7044.120.4.888

PubMed Abstract | CrossRef Full Text | Google Scholar

Hernandez, G., Khodami-Pour, A., Lévesque, D., and Rompré, P.-P. (2015). Reduction in ventral midbrain NMDA receptors reveals two opposite modulatory roles for glutamate on reward. Neuropsychopharmacology 40, 1682–1691. doi: 10.1038/npp.2015.14

PubMed Abstract | CrossRef Full Text | Google Scholar

Hernandez, G., and Shizgal, P. (2009). Dynamic changes in dopamine tone during self-stimulation of the ventral tegmental area in rats. Behav. Brain Res. 198, 91–97. doi: 10.1016/j.bbr.2008.10.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, K. M., Baratta, M. V., Yang, A., Lee, D., Boyden, E. S., and Fiorillo, C. D. (2012). Optogenetic mimicry of the transient activation of dopamine neurons by natural reward is sufficient for operant reinforcement. PLoS One 7:e33612. doi: 10.1371/journal.pone.0033612

PubMed Abstract | CrossRef Full Text | Google Scholar

Kretschmer, B. D. (1999). Modulation of the mesolimbic dopamine system by glutamate: role of NMDA receptors. J. Neurochem. 73, 839–848. doi: 10.1046/j.1471-4159.1999.0730839.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Lak, A., Stauffer, W. R., and Schultz, W. (2014). Dopamine prediction error responses integrate subjective value from different reward dimensions. Proc. Natl. Acad. Sci. U S A 111, 2343–2348. doi: 10.1073/pnas.1321596111

PubMed Abstract | CrossRef Full Text | Google Scholar

Marcangione, C., and Rompré, P.-P. (2008). Topographical Fos induction within the ventral midbrain and projection sites following self-stimulation of the posterior mesencephalon. Neuroscience 154, 1227–1241. doi: 10.1016/j.neuroscience.2008.05.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Mathé, J. M., Nomikos, G. G., Schilström, B., and Svensson, T. H. (1998). Non-NMDA excitatory amino acid receptors in the ventral tegmental area mediate systemic dizocilpine (MK-801) induced hyperlocomotion and dopamine release in the nucleus accumbens. J. Neurosci. Res. 51, 583–592. doi: 10.1002/(SICI)1097-4547(19980301)51:5<583::AID-JNR5>3.0.CO;2-B

PubMed Abstract | CrossRef Full Text | Google Scholar

Moisan, J., and Rompre, P. P. (1998). Electrophysiological evidence that a subset of midbrain dopamine neurons integrate the reward signal induced by electrical stimulation of the posterior mesencephalon. Brain Res. 786, 143–152. doi: 10.1016/s0006-8993(97)01457-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Montague, P. R., Dayan, P., and Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 16, 1936–1947.

PubMed Abstract | Google Scholar

Oleson, E. B., Gentry, R. N., Chioma, V. C., and Cheer, J. F. (2012). Subsecond dopamine release in the nucleus accumbens predicts conditioned punishment and its successful avoidance. J. Neurosci. 32, 14804–14808. doi: 10.1523/JNEUROSCI.3087-12.2012

PubMed Abstract | CrossRef Full Text | Google Scholar

Omelchenko, N., Bell, R., and Sesack, S. R. (2009). Lateral habenula projections to dopamine and GABA neurons in the rat ventral tegmental area. Eur. J. Neurosci. 30, 1239–1250. doi: 10.1111/j.1460-9568.2009.06924.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Overton, P. G., and Clark, D. (1997). Burst firing in midbrain dopaminergic neurons. Brain Res. Rev. 25, 312–334. doi: 10.1016/S0165-0173(97)00039-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Owesson-White, C. A., Cheer, J. F., Beyene, M., Carelli, R. M., and Wightman, R. M. (2008). Dynamic changes in accumbens dopamine correlate with learning during intracranial self-stimulation. Proc. Natl. Acad. Sci. U S A 105, 11957–11962. doi: 10.1073/pnas.0803896105

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, J., Aragona, B. J., Kile, B. M., Carelli, R. M., and Wightman, R. M. (2010). In vivo voltammetric monitoring of catecholamine release in subterritories of the nucleus accumbens shell. Neuroscience 169, 132–142. doi: 10.1016/j.neuroscience.2010.04.076

PubMed Abstract | CrossRef Full Text | Google Scholar

Paxinos, G., and Watson, C. (2007). The Rat Brain in Stereotaxic Coordinates Sixth Edition. (Vol. 170), San Diego, CA: Elsevier Academic Press, 547–612.

Qi, J., Zhang, S., Wang, H.-L., Wang, H., de Jesus Aceves Buendia, J., Hoffman, A. F., et al. (2014). A glutamatergic reward input from the dorsal raphe to ventral tegmental area dopamine neurons. Nat. Commun. 5:5390. doi: 10.1038/ncomms6390

PubMed Abstract | CrossRef Full Text | Google Scholar

Rompre, P.-P., and Wise, R. A. (1989). Behavioral evidence for midbrain dopamine depolarization inactivation. Brain Res. 477, 152–156. doi: 10.1016/0006-8993(89)91402-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Saddoris, M. P., Cacciapaglia, F., Wightman, R. M., and Carelli, R. M. (2015). Differential dopamine release dynamics in the nucleus accumbens core and shell reveal complementary signals for error prediction and incentive motivation. J. Neurosci. 35, 11572–11582. doi: 10.1523/JNEUROSCI.2344-15.2015

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsai, H.-C., Zhang, F., Adamantidis, A., Stuber, G. D., Bonci, A., de Lecea, L., et al. (2009). Phasic firing in dopaminergic neurons is sufficient for behavioral conditioning. Science 324, 1080–1084. doi: 10.1126/science.1168878

PubMed Abstract | CrossRef Full Text | Google Scholar

Westerink, B. H., Kwint, H. F., and deVries, J. B. (1996). The pharmacology of mesolimbic dopamine neurons: a dual-probe microdialysis study in the ventral tegmental area and nucleus accumbens of the rat brain. J. Neurosci. 16, 2605–2611.

PubMed Abstract | Google Scholar

White, F. J., and Wang, R. Y. (1983). Differential effects of classical and atypical antipsychotic drugs on A9 and A10 dopamine neurons. Science 221, 1054–1057. doi: 10.1126/science.6136093

PubMed Abstract | CrossRef Full Text | Google Scholar

Wise, R. A., and Rompre, P.-P. (1989). Brain dopamine and reward. Annu. Rev. Psychol. 40, 191–225. doi: 10.1146/annurev.psych.40.1.191

PubMed Abstract | CrossRef Full Text | Google Scholar

Witten, I. B., Steinberg, E. E., Lee, S. Y., Davidson, T. J., Zalocusky, K. A., Brodsky, M., et al. (2011). Recombinase-driver rat lines: tools, techniques and optogenetic application to dopamine-mediated reinforcement. Neuron 72, 721–733. doi: 10.1016/j.neuron.2011.10.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Yavich, L., and Tanila, H. (2007). Mechanics of self-stimulation and dopamine release in the nucleus accumbens. Neuroreport 18, 1271–1274. doi: 10.1097/wnr.0b013e328244e576

PubMed Abstract | CrossRef Full Text | Google Scholar

Zweifel, L. S., Parker, J. G., Lobb, C. J., Rainwater, A., Wall, V. Z., Fadok, J. P., et al. (2009). Disruption of NMDAR-dependent burst firing by dopamine neurons provides selective assessment of phasic dopamine-dependent behavior. Proc. Natl. Acad. Sci. U S A 106, 7281–7288. doi: 10.1073/pnas.0813415106

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: dopamine, glutamate, NMDA, reward, in vivo voltammetry

Citation: Hernandez G, Cossette M-P, Shizgal P and Rompré P-P (2016) Ventral Midbrain NMDA Receptor Blockade: From Enhanced Reward and Dopamine Inactivation. Front. Behav. Neurosci. 10:161. doi: 10.3389/fnbeh.2016.00161

Received: 25 May 2016; Accepted: 08 August 2016;
Published: 26 August 2016.

Edited by:

John D. Salamone, University of Connecticut, USA

Reviewed by:

Alicia Izquierdo, University of California, Los Angeles, USA
Akiko Shimamoto, Meharry Medical College, USA

Copyright © 2016 Hernandez, Cossette, Shizgal and Rompré. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution and reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Pierre-Paul Rompré, cGllcnJlLXBhdWwucm9tcHJlQHVtb250cmVhbC5jYQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.