ORIGINAL RESEARCH article
Neural Representation of Motor Output, Context and Behavioral Adaptation in Rat Medial Prefrontal Cortex During Learned Behavior
- Center for Neurogenomics and Cognitive Research, Department of Integrative Neurophysiology, VU Amsterdam, Amsterdam, Netherlands
Selecting behavioral outputs in a dynamic environment is the outcome of integrating multiple information streams and weighing possible action outcomes with their value. Integration depends on the medial prefrontal cortex (mPFC), but how mPFC neurons encode information necessary for appropriate behavioral adaptation is poorly understood. To identify spiking patterns of mPFC during learned behavior, we extracellularly recorded neuronal action potential firing in the mPFC of rats performing a whisker-based “Go”/“No-go” object localization task. First, we identify three functional groups of neurons, which show different degrees of spiking modulation during task performance. One group increased spiking activity during correct “Go” behavior (positively modulated), the second group decreased spiking (negatively modulated) and one group did not change spiking. Second, the relative change in spiking was context-dependent and largest when motor output had contextual value. Third, the negatively modulated population spiked more when rats updated behavior following an error compared to trials without integration of error information. Finally, insufficient spiking in the positively modulated population predicted erroneous behavior under dynamic “No-go” conditions. Thus, mPFC neuronal populations with opposite spike modulation characteristics differentially encode context and behavioral updating and enable flexible integration of error corrections in future actions.
The medial prefrontal cortex (mPFC) integrates and processes a multitude of information streams to drive behavior (Groenewegen and Uylings, 2000; Dalley et al., 2004; Gruber et al., 2010; Euston et al., 2012; Luchicchi et al., 2016). Activity of mPFC neurons correlates with task outcomes with both positive (Gruber et al., 2010; Horst and Laubach, 2013; Orsini et al., 2015; Pinto and Dan, 2015; Amarante et al., 2017) and negative valence (Senn et al., 2014; Halladay and Blair, 2015; Pinto and Dan, 2015; Kim et al., 2017; Rozeske et al., 2018) and distinct populations of mPFC neurons are selectively activated during movement and movement inhibition (Halladay and Blair, 2015). Moreover, incorporation of task rules in the mPFC leads to adaptive strategies to optimize task outcome (Durstewitz et al., 2010; Euston et al., 2012; Horst and Laubach, 2012; Narayanan et al., 2013; Cho et al., 2015; Orsini et al., 2015; Guise and Shapiro, 2017; Malagon-Vina et al., 2018). Additionally, the mPFC is critically involved in bottom-up detection of tactile sensory input (Le Merre et al., 2018), as well as top-down filtering of visual and auditory sensory information (Zhang et al., 2014; Wimmer et al., 2015, 2016; Kim H. et al., 2016; Schmitt et al., 2017). Not surprisingly, neuronal correlates of sensory information are found in the mPFC during auditory “Go”/“No-go” tasks (Pinto and Dan, 2015; Kamigaki and Dan, 2017) and during a visual attention task (Kim H. et al., 2016). Furthermore, short-term task rules are represented transiently by spiking patterns in mPFC populations (Durstewitz et al., 2010; Rodgers and DeWeese, 2014; Malagon-Vina et al., 2018) and combined audiovisual selective attention tasks require correct spiking in mPFC axons to thalamus (Wimmer et al., 2015; Schmitt et al., 2017). The mPFC is thus ideally situated in the circuitry to drive learned behavior under conditions when sensory information guides decision making and behavioral output. As a result, perturbation of normal mPFC function leads to a diversity of behavioral impairments (Narayanan and Laubach, 2006, 2008; Pinto and Dan, 2015; Koike et al., 2016; Lagler et al., 2016; Luchicchi et al., 2016; Bolkan et al., 2017; Guise and Shapiro, 2017).
Motor behavior also shows neurophysiological correlates in the mPFC (Horst and Laubach, 2013; Pinto and Dan, 2015; Amarante et al., 2017) and disturbing the mPFC network can lead to reduced attention performance and inappropriate motor output (Narayanan and Laubach, 2006; Narayanan et al., 2006; Luchicchi et al., 2016; Kamigaki and Dan, 2017). However, it is currently unknown how the mPFC controls and encodes this motor behavior or whether it encodes just a copy of the motor signal (efference copy). Similarly, we know that inhibitory control and top-down increase of stimulus discrimination depend on mPFC function (Zhang et al., 2014; Pinto and Dan, 2015; Wimmer et al., 2015), yet we do not know how these are encoded. The executive function of the mPFC in adaptive behavior occurs on short timescales implying that the mPFC should integrate trial outcomes and adapt behavioral strategies on a trial-by-trial basis (Narayanan and Laubach, 2008; Euston et al., 2012; Horst and Laubach, 2012; Narayanan et al., 2013; Pinto and Dan, 2015), nevertheless we do not know how these are represented by mPFC neuronal spiking.
To identify neurophysiological correlates of sensory-guided and adaptive motor behavior, we recorded activity of mPFC neurons extracellularly (Csicsvari et al., 2003; Rossant et al., 2016) in adult rats performing a whisker-based “Go”/“No-go” object localization task (O’Connor et al., 2010a; Pammer et al., 2013). We present four findings on populations of putative pyramidal mPFC neurons, which show modulation of their spike rates: (1) during correct performance of the task; (2) during motor output with and without the intent to collect reward; (3) when changing behavior after mistakes; and (4) as a function of increasing task difficulty.
Materials and Methods
Animal Welfare Statement
This study was carried out in accordance with European and Dutch law and approved by the animal ethical care committee of the VU Amsterdam and VU University Medical Center, Netherlands (protocol INF-14-08).
Male Wistar rats (250–350 g, 8–12 weeks) were implanted with a headpost. Preoperatively Baytril (5 mg/kg), Temgesic (buprenorphine 0.05 mg/kg) and carprofen (5 mg/kg) were administered subcutaneously. Animals were anesthetized with 2% isoflurane for induction, placed on a heating mat and fixed into a stereotactic frame, after which the isoflurane concentration was reduced to 1.5% for maintenance. The hair on the head was removed with hair clippers and lidocaine was injected for local analgesia (200 μl, 2% s.c.). An incision was made in the skin, the underlying tissue was removed and the skull cleaned. Next, the skull was etched with Gel Etchant (Kerr Dental, Visé, Belgium) and cleaned. Bonding agent (Optibond, Kerr Dental, Visé, Belgium) was used to enhance adhesion between dental cement (Tetric evoflow, Ivoclar Vivadent, Schaan, Liechtenstein) and the bone. Four screw holes were drilled (1× occipital bone, 1× contralateral parietal bone (above V1), 1× frontal bone (above the olfactory bulb) and 1x lateral of the temporal ridge) and stainless steel head screws were inserted to increase stability of the head cap. The craniotomy for electrophysiological mPFC recordings was drilled at 2.5 mm anterior and 0–1.2 mm lateral of bregma and recordings were targeted to the ventral anterior cingulate (AC), prelimbic (PL) and dorsal infralimbic (IL) cortex (at 1,900–3,500 μm from pia on the dorso-ventral axis).
Behavioral Task: Whisker Based Head-Fixed “Go”/“No-Go” Task
To study the neural basis of cognitive behavior in the mPFC, we adapted a whisker-based object localization (“Go”/“No-go”) task for head-fixed rats. The task was adapted for rats from similar tasks in mice (O’Connor et al., 2010a; Pammer et al., 2013) and development during the start-up phase was facilitated by expert input from Dr. Karel Svoboda (HHMI, Janelia farm, Ashburn, VA, USA) and Dr. Cornelius Schwarz (CIN, Tübingen, Germany). To drive motivation for optimal task performance, the rats were maintained on temporary water restriction such that they earned all their water by performing the task (weekly schedule of 5 days training, 2 days ad libitum access). Rats learned to distinguish two locations of a cue-pole and were rewarded when responding with motor output (i.e., licking) to the “Go” location. To promote discrimination and avoid simple detection strategies (e.g., keeping whiskers on one of the two locations), both locations were placed on the same azimuth of the resting position of the C1 whisker, but were spaced 4 mm on the proximal-distal axis (Figure 1A). During behavioral training, rats learn to lick for a reward when the object was in the proximal (“Go”) position and refrain from licking in the distal (“No-go”) position. To monitor the rats’ health and water intake, both daily water volume consumed and the weight of the rat were closely monitored and behavioral abnormalities were registered. When the rat did not drink enough during training and consequently lost weight, a small volume (typically 2–4 ml) of water was offered to the rat.
Figure 1. Experimental approach. (A) Schematic of the head-fixed rat. Licking when the object was in the “Go” position (dark blue) was rewarded with water and called a Hit trial (blue), while refraining from licking was called a Miss trial (purple). When the object was in the “No-go” position (brown-red) the rat should refrain from licking to make a Correct Rejection (CR; orange), while licking in this condition was a False Alarm (FA; red) and punished with a tone and time out (TO). (B) Block diagram showing the sequence of actions during trials. The object moved from a position in the middle between “Go” and “No-go”, while out of reach of the whiskers. Subsequently, the pole moves up, the rat was allowed to lick during the sampling period (1 s), but licking was neither punished nor rewarded. During the 1 s decision window licking triggers an outcome (rewarded or punished for “Go” or “No-go”, respectively). Colors as in (A). (C) The licking behavior of a rat during a single behavioral session. (C1) Licking raster plot of the 180 trials that were performed during this session, sorted by trial outcome and color coded as in (A). Every tick is a lick that was counted as a beam break in the lick port. We intermixed 10% Catch trials without the object (green). (C2) Peri-event time histogram (PETH) of licking around trial outcome (first lick during the decision window) for Hit and FA trials. (D) Average accuracy over all rats. (E) Licking probability over time for all sessions. (F1) Cartoon of the probe placement (left) and in gray four example channels with spikes of three units superimposed in color. (F2) The average waveform of the units that are shown in (F1) on the same four channels. (G1) Regular spiking units (RSUs; black) and fast spiking units (FSUs; gray) were separated based on their average waveforms. Colored dots are the spikes from (F1,F2). (G2) The FSUs show a wider distribution of spike rates. Inset are example FSU and RSU waveforms. (H1) Spiking raster plot of an example unit. Same session, sorting and color code as in (C1,C2). (H2) PETH of spiking of the unit from (H1) around “Go” trial outcomes with baseline subtracted. Horizontal blue bar is the window during which water is available from the lick port. (H3) As (H2) for “No-go” trials. Horizontal red bar is the window in which the tone is broadcast, the horizontal black bar shows the duration of the TO.
Trials started with the pole outside the reach of the whiskers in a neutral position, exactly in the middle of the “Go” and “No-go” positions to reduce detectability of the pole location by cues other than whisker touch (e.g., timing or sound cues). Trial identity varied randomly, but consecutive trials were limited to four of the same “Go”/“No-go” identity. After trials started, the object moved to either the “Go” or “No-go” position. The pole then moved up into the whisker field by valves driven by air pressure, which produced a soft clicking sound upon action. After the pole was fully up, a 1 s grace period started for sensory acquisition during which licking was allowed, but not rewarded or punished. Rats repeatedly touched the object with their whiskers during both “Go” and “No-go” trials (Supplementary Figure S1). Following the grace period, the rat had 1 s to lick if it decided the pole was in the “Go” position. Licking was recorded by beam breaks of an infrared laser beam in the lick port (Sunx, West Des Moines, IA, USA). When rats licked during “Go” trials, the trial was labeled a Hit trial and a 20–40 μl water was given (two drops separated in time by 1 s). When rats licked during “No-go” trials, the trial was labeled a False Alarm (FA) trial and the mistake was signaled with a 1 s 12 kHz sound at 70 dB followed by a time out (TO) period (5–7 s depending on previous session “No-go” outcome). In addition, the probability of another “No-go” trial was increased after FA errors to discourage “always-lick” strategies (fraction “Go” trials, 0.25). When the rat did not lick in “Go” or “No-go” trials, the trial outcome was labeled as Miss or Correct Rejection (CR), respectively. Trials without licking were not rewarded nor punished (Figure 1B) and all trials ended with the pole moving back to the neutral position. We used a variable inter-trial interval period of 1.5, 2 or 3 s to reduce predictability of task-timing. Rats were trained until they reached performance accuracy above 70% for “Go” and “No-go” trials collectively ((Hit + CR)/(“Go” + “No-go” trials) * 100%). Whisker object-touches were comparable between “Go” and “No-go” trials (Supplementary Figure S1). To confirm that the rats use whisker-guided sensory cues for decision making, we intermixed 10% catch trials (no object) in which the valves to control the pole z-position made similar sounds, but the pole did not move within reach of the whiskers. Licks were not rewarded or punished during Catch trials and rats usually did not lick during Catch trials.
After reaching threshold for task performance, the rats daily performed three types of sessions in random order. Apart from the regular session type, we changed the task in a second session type such that the “No-go” location was randomly selected from an Easy, Normal or Hard position (2, 4 and 8 mm from the “Go” position, respectively; Figure 6A). This manipulation allowed us to test mPFC physiological dependence on changes of stimulus contrast. In the third session type we changed the reward or punishment during random trials: in a subset of Hit trials, we doubled the water reward to investigate scaling of reward related spiking with reward magnitude. In a different subset of Hit trials, we delayed the delivery of the reward with 1 s to distinguish between the neurophysiological correlate of simple licking-induced motor activity and actual reward feedback and consumption. In the same session type we used trials in which the punishment signal was delayed with 1 s.
Figure 2. Classifying RSUs based on Hit trial spike rate modulations. (A) A simulated example of the bootstrapping method. In red are the spikes of a unit spiking on average with 0.5 Hz. The horizontal blue bars show the 1 s windows before Hit outcomes, the blue numbers give the number of spikes in each window. In black we show two randomizations of the Hit time stamps and the corresponding Δ spike rate. Under the dashed line we show in black a histogram of Δ spike rates (black) and the recorded Δ spike rate (blue). In this case the recorded Δ spike rate was in the 99th percentile of the randomized distribution and is considered positively modulated during Hit trials. (B) The PETHs of three example units around Hit outcomes. (C) The histograms of the bootstrapped spike rate distribution and the recorded Δ spike rate as a thick black line. Same units as in (B). The magenta unit was positively modulated, the gray unit unmodulated and the cyan unit was negatively modulated. (D1) The distribution of percentiles from the bootstrapping method with respect to the mean spike rate during the whole session. (D2) Histogram over the percentile axis of (D1). (E) Correlations between mean spike rates and Δ spike rate. Units that have a high spike rate will show the largest Δ spike rate, positive (magenta) or negative (cyan). (F) The positively and negatively modulated population have a significantly higher median spike rate compared to the unmodulated units. ***p < 0.001.
Figure 3. Spiking modulation encodes motivation in addition to motor output. (A) Average PETHs of spiking of the positively modulated population (n = 31). (A1) Spiking aligned to “Go” trial outcome. (A2) As in (A1) but for “No-go” trials. (A3) Spiking aligned to lick start for intra-trial licking bouts (solid line) and extra-trial licking bouts (dashed line). (B) (B1–B3) analogous to (A1–A3) but with PETHs of the unmodulated population (n = 112). (C) (C1–C3) analogous to (A1–A3) but with PETHs of the negatively modulated population (n = 48). (D) (D1–D3) analogous to (A1–A3) but with licking PETHs. See Supplementary Figure S1 for a separation of units that are (un)correlated to licking frequency. (E1) Scatterplot of spike rate outside licking bouts vs. spike rate during intra-trial licking bouts. Note the shift from the unity line for the positively (magenta) and negatively (cyan) modulated units. (E2) Scatterplot of spike rate during extra-trial licking bouts vs. intra-trial licking bouts. Note the shift of both the positively and negatively modulated populations above the unity line. (E3) Boxplots of spike rates outside of licking bouts (No), during extra-trial licking bouts (Ex) and during intra-trial licking bouts (In) for the three functional populations. n.s. not significant (p > 0.05), *p < 0.05, ***p < 0.001.
Figure 4. Matched extra-trial and intra-trial licks reveals the contribution of context to spike rate modulation. (A) Schematic representation of the procedure of matching behavioral statistics in which extra-trial licking bouts are matched with intra-trial licking bouts based on comparable number of licks and licking frequency to parse out the contribution of context to spike rate modulation. (B) Scatterplot of spike rates during intra-trial licking bouts vs. spike rates during extra-trial licking bouts for the positively modulated (magenta), unmodulated (gray) and negatively (cyan) modulated populations. (C,D) The negatively modulated population (C) and unmodulated population (D) did not show spiking rate modulations for matched intra- and extra-trial licking bouts, indicating that context is not involved in spike rate modulation for these populations. (E) The spiking rate of the positively modulated population was significantly higher during intra-trial than during extra-trial licking bouts, which is supportive evidence for the hypothesis that context is an important contributor to spike rate modulation for this population of medial prefrontal cortex (mPFC) units. n.s. not significant (p > 0.05), *p < 0.05.
Figure 5. Behavioral change upon feedback. (A) Boxplots of the performance of the rat separated by outcome of the previous trial. (A1) There are significantly more/fewer FA errors in “No-go” trials if the previous trial was a Hit/FA respectively. (A2) There was no significant difference in the performance in “Go” trials based on the previous trial outcome. (B1) Schematic representation of the selection of trials for (B2,B3). Only those Hit trials that are followed by a “No-go” trial are used. RW is water reward. (B2) Left: average PETHs of the positively modulated population around Hit outcomes for trials that were followed by a “No-go” trial. Hit trials that were followed by CR (HitCR; orange) and those that were followed by a FA (HitFA; red) show similar time courses of Δ spike rate (n = 27). Right: there was no significant difference of the median Δ spike rate in HitCR and HitFA trials. (B3) As (B2) for the negatively modulated population (n = 46). (C1) Similar to (B1) for FA trials. TO is time out punishment. (C2) Left: PETHs of the Δ spike rate of the positively modulated population during FA trials that are followed by a CR trial (FACR; orange) and those that are followed by another FA trial (FAFA; red; n = 12). Right: there was no significant difference of the median Δ spike rate in FACR and FAFA trials. (C3) As (C2) for the negatively modulated population (n = 18). The black bar indicates the window for right panel. Right: the median Δ spike rate during 0.2–3 s after the FA trial outcomes was significantly higher during FACR trials than FAFA trials. n.s. not significant (p > 0.05), *p < 0.05, **p < 0.01.
Figure 6. The positively modulated units have to be sufficiently excited for correct performance in easy “No-go” trials. (A) Schematic of the head-fixed rat and corresponding “Go” (blue), and three types of “No-go” trials, Hard (brown), Normal (red) and Easy (coral). (B) Proportion of trials licked during “Go” and the three types of “No-go” trials. The rats are most likely to lick during “Go” trials and the licking probability decreases progressively for Hard, Normal and Easy “No-go” trials. (C1) Average PETHs of spiking during “Go” and Normal “No-go” trials for the negatively modulated population (n = 35). The time course of both trial types was similar to the spiking recorded in the regular session (see Figures 3C1,C2). (C2) Average PETHs of spiking for the three types of FA trials. Black bar is window for (C3). (C3) Boxplot of Δ spike rates of the negatively modulated population during the −2 to 0 s from the outcome for Hard and Easy FA trials. (D1) As in (C1) for the positively modulated population (n = 20). (D2) As in (C2) for the positively modulated population. (D3) As in (C3) for the positively modulated units. The median Δ spike rate was significantly higher for Hard trials than for Easy FA trials. (E) Boxplot for the Δ spike rate of the positively modulated population −2 to 0 from trial outcome (first lick in the decision window) for the four licking trials (Hit and Hard, Normal and Easy FA trials). The Δ spike rate was significantly lower during Easy FA trials than Hit and Hard “No-go” trials.n.s. not significant (p > 0.05), *p < 0.05, **p < 0.01.
We used the Open Ephys data acquisition board with two RHD2132 digital interface chips (Intan Technologies, Los Angeles, CA, USA) and the open ephys GUI (Siegle et al., 2017) to record the electrophysiological activity of the mPFC. We used 4-shank 64-channel silicon probes (Cambridge Neurotech, Cambridge, UK). On these probes, 16 channels were clustered per shank in two parallel columns so that each channel has a distance of 25 μm to its neighbors. The shanks were spaced at 250 μm and were placed on the lateral-medial axis, i.e., throughout the layers of the mPFC. We saved the data as 16-bit integers at 30 kHz. Afterwards, we high-pass filtered the data and combined the 16 channels of each shank for automated clustering using Klustakwik (Rossant et al., 2016). We manually curated the clusters to get stable and well-isolated single units. We used stringent cut-offs; a unit was considered well-isolated if it had an isolation distance (ID) > 40, L-ratio <1, the cluster exceeded 300 spikes and the fraction of interspike intervals below 1.5 ms was <0.1%. This resulted in an average yield of 0.2 unit/recording site. Next, analysis was done on the average waveforms of all well-isolated units to further sub-classify units as regular spiking units (RSUs) vs. fast spiking units (FSUs; Barthó et al., 2004). Since FSUs (AP peak-to-trough time <0.5 ms and AP half-peak time <0.25 ms) represent a vastly heterogeneous population with a broad spectrum of functional and morphological characteristics, units with fast spiking waveforms were excluded from further analyses. Data obtained from three rats passed behavioral and electrophysiological criteria for subsequent data analysis.
Two distinct methods to place the silicon probe were used. In two rats we implanted the silicon probe on a nano-drive (Cambridge Neurotech, Cambridge, UK) with the electrodes in the dorsal mPFC. After the rat was trained we moved the probe down through the dorsal-ventral extent of the mPFC to record at multiple locations while rats were performing the behavioral task. In one rat, we positioned an acute silicon probe (same lay-out as the chronic probes) in the mPFC with a Luigs and Neumann manipulator (Ratingen, Germany).
Perfusion, Slicing and Histology
After data acquisition was completed, rats were deeply anesthetized with 3% isoflurane and urethane (i.p. 10 mL/kg 20%). Next, rats were perfused transcardially with 0.9% NaCl followed by 4% paraformaldehyde (PFA). Brains were extracted and stored overnight in 4% PFA in phosphate buffer (PB) at 4°C after which they were transferred to 0.05 M PB for further processing.
Brains were sliced into 100 or 50 μm coronal sections with a vibratome, rinsed with PB (0.05 BM) and mounted using a Mowiol solution (Clairant GmbH, Frankfurt am Main, Germany; Narayanan et al., 2014). Probe placement was verified visually using an Olympus BX51 microscope with a 4× air objective or 40× oil objective. For chronic recordings, we used the electrode tract to retrieve probe placement. For acute recordings, we dipped the probe in DiI (Thermo Fisher Scientific, Waltham, MA, USA) and used a X-Cite 120 Q light-source (Excelitas Technologies Corp., Waltham, MA, USA), to visualize the electrode tract.
Performance was quantified either as accuracy (number of correct over total number of trials of a type; Figure 1D), or as proportion of trials with licking in the response window for both “Go” and “No-go” trials (Figures 1E, 5A, 6B). Each lick was defined as the first time the infrared beam in the lick port was broken if it had remained unbroken for at least 10 ms. Licking Peri-event time histograms (PETHs) were made by binning licking time-stamps in 200 ms bins and averaging the detected licks per trial type.
To distinguish between spiking during motivated licking for a reward or during randomly emitted licks, spiking profiles for intra-trial vs. extra-trial licking bouts were computed. A licking bout was defined as the time between the first lick and last lick if all inter-lick-intervals in between were less than 1 s. We then defined a licking bout to be intra-trial if it started between 2 s before to 0.2 s after the outcome of a trial and extra-trial if it was outside of these windows. We then computed PETHs with 200 ms bins and from −3 s to +3 s from lick bout start for both the intra-trial and extra-trial licking bouts (spiking in Figures 3A3–C3, licking in Figure 3D3).
Quantification of spike rates per licking condition (i.e., no lick, intra-trial and extra-trial lick bouts) was done per unit as an average during all lick bouts (or all non-lick periods, Figure 3E). The distinction between intra- and extra-trial licking was made to have a measure for motor control (extra-trial licking) vs. a measure for the mixture of sensory, motivational, reward-driven and motor activity (intra-trial licking). To rule out that differences in licking behavior during intra- vs. extra-trial licking bouts would underlie differences in spiking between conditions, we matched individual extra-trial licking bouts based on equal number of licks and a licking frequency (# licks / [time between first and last lick]) within 1 Hz of the extra-trial bout (Figure 4). To test correlations between spike rates and licking behavior of the rat, we took all licking bouts with more than one lick. We quantified the licking frequency between the first and last lick of the bout as well as the spike frequency for each unit. We then performed Pearson’s correlations between the spike rate and licking frequency for each unit. Finally, we split the (positively, negatively and unmodulated) populations in a licking-frequency correlated group and an uncorrelated group and made licking-bout triggered PETHs for all six groups (Supplementary Figure S2).
A bootstrapping method was used to determine whether spiking activity of individual units was modulated by a specific segment of correct behavioral performance. First, we aligned the spiking activity of an individual unit to the Hit time stamps (first lick within decision window of a “Go” trial) and computed the Δ spike rate (mean spike rate in the 1 s windows before Hit outcomes minus mean spike rate during the session) of each unit during the 1 s before these Hit outcomes (mock example in Figure 2A). Subsequently, an equal number of random “Hit” time stamps were generated throughout the recording session and the mean Δ spike rate in the 1 s before these randomized “Hit” time stamps was determined. This procedure was repeated 1,000 times after which a distribution of Δ spike rate to the randomized triggers was constructed and compared to the Δ spike rate upon true Hit time stamps. If the recorded Δ spike rate of a unit was in the 1st percentile of its (randomized) bootstrap distribution it was grouped in the negatively modulated spike rate group (reduced spiking activity in 1 s window before Hit outcomes) and conversely, when the value was in the 99th percentile the unit was placed in the positively modulated spike rate group (increased spiking activity in 1 s window before Hit outcomes; Figures 2C,D). Units that did not fall in either group were considered unmodulated.
We performed Kolmogorov-Smirnov tests to check whether our data was normally distributed. To determine correlations between spike rates and Δ spike rate and between licking probability and trial number (Figures 1E, 2E), Pearson’s correlation (α < 0.05) was used. To compare medians of two populations, Wilcoxon signed-rank tests were used (α < 0.05). Finally, to compare three or more groups, Kruskal-Wallis tests were used followed by post hoc multiple comparison tests. In case of multiple testing, a Bonferroni correction was used.
To answer how mPFC neurons encode tactile decision making, motivated behavior and learning from mistakes, we trained rats in a head-fixed whisker-based object localization task. Data of three rats reached selection criteria for quality of electrophysiological recordings and behavioral performance. In this task, rats learned to report proximal (“Go”) location of an object by licking for a water reward (20–40 μl) while refraining from licking when the object was placed distally (“No-go”) to avoid a TO punishment (Figures 1A,B). “Go” trials were called Hit if the rat licks and Miss when the rat does not, conversely, licking during a “No-go” trial resulted in a FA, while refraining from licking resulted in a CR. Rats completed on average 147 trials per session (range 77–253 trials). After the object moved into the whisker field, rats started whisking extensively and many touches followed, both during “Go” and “No-go” trials (Supplementary Figure S1). Rats readily distinguished between “Go” and “No-go” positions and licked preferentially during “Go” trials over “No-go” trials (Proportion correct: “Go” 0.67 ± 0.13; “No-go” 0.82 ± 0.10; All 0.74 ± 0.07, mean ± standard deviation; Figures 1C–E, Table 1). Catch trials were intermixed to test whether rats relied on whisker information to determine task outcome. Rats typically did not lick during Catch trials (Proportion not licked: 0.98 ± 0.04, mean ± standard deviation; Figures 1C–E). Overall performance was stable between sessions, although rats tended to show a higher probability to lick during early trials compared to late trials (“Go”: R = −0.86, P = 1.8*10−5; “No-go”: R = −0.80, P = 1.8*10−4; Figure 1E, n = 16 sessions in N = 3 rats), which could reflect satiety from earned rewards.
To monitor mPFC neuron spiking activity, 64-channel silicon probes were implanted. Spikes were analyzed and clustered off-line (Figures 1F1,F2), which resulted in 204 well-isolated units distributed over the dorsal-ventral axis and throughout mPFC layers. FSUs (peak-to-trough time <0.5 ms and half-peak time <0.25 ms, gray), were excluded from subsequent analyses (Figures 1G1,G2). RSUs represent putative excitatory pyramidal neurons for which we found strong correlations between specific trial parameters and spike rates in a subset of units (example in Figures 1H1–H3). Thus, spiking in a subpopulation of mPFC RSU units strongly correlated with behavioral performance.
To identify which units were significantly modulated during correct performance of the task, we performed a bootstrap analysis on the spike train of each unit (Figures 2A–C). We found positively and negatively modulated units (Figures 2B–D, magenta, n = 31 and cyan, n = 48 respectively) as well as unmodulated units (gray, n = 112). The modulated populations both had higher median spike rates compared to unmodulated units (Spike rate values: median, 1st/3rd quartile: positively modulated 2.06, 0.99/4.5 Hz; unmodulated 0.77, 0.11/1.98 Hz; negatively modulated 2.48, 1.36/4.94 Hz, p = 2*10−8, Kruskal-Wallis test; Figure 2F) and there was a strong correlation of larger Δ spike rate for higher mean spike rates (Figure 2E, positively modulated R = 0.62, p = 2*10−4; negatively modulated R = −0.77, p = 1.3*10−10, Pearson’s correlation).
mPFC Units Encode Motor Output and Context
The bootstrap analysis identified units with spike rates that were positively modulated (Figure 3A), unmodulated (Figure 3B) or negatively modulated (Figure 3C) during correct “Go” trials. Units with positively modulated spiking activity (n = 31) were clearly excited before Hit outcomes and remained excited during reward delivery and consumption (blue, Figure 3A1). Similarly, during FA trials the average spike rate of positively modulated units increased before FA outcomes. As soon as rats received feedback on trial outcome however (i.e., learned that it made a mistake), spike rates returned to baseline (red, Figure 3A2). During Miss and CR trials, spike rates of the positively modulated population remained stable (purple and orange respectively, Figures 3A1,2). Unmodulated units (n = 112) did not show spike rate changes for any trial type (Figure 3B). The negatively modulated population (n = 48) showed a clear reduction in spike rate before, during and after Hit (Figure 3C1) and FA outcomes (Figure 3C2). Similar to the positively modulated units, spike rates of the negatively modulated population returned to baseline quickly after outcome of FA trials. Thus, neurophysiological correlates during “No-go” trials closely resembled those during “Go” trials, up to the moment that sensory feedback signaled whether behavioral output was appropriate.
Both the positively and negatively modulated populations showed obvious spike rate changes during trials in which rats licked, independent of reward delivery (Hit and FA) and spike rate changes were absent from non-lick trials (Miss and CR). This may suggest that spike rate modulations during trials in which rats were licking represented motor output. To further quantify correlations between modulation of mPFC spiking activity and licking behavior, we analyzed licking behavior for each trial type (Figure 3D) and found that the time course of licking matched changes in spike rate. Most licks occurred during Hit and FA trials, but rats occasionally licked outside trials. These extra-trial licks presumably represent licking behavior without the expectation to earn water rewards (see e.g., the ticks at 0.5–1.5 s for CR and Miss trials in Figure 1C1). It is thus tempting to assume that these licks were not as strongly driven by expectance to receive rewards compared to intra-trial licks and we therefore used this distinction as a control for motor behavior. Thus, to test whether Δ spike rates were related to motor output, we analyzed PETHs of spike rates centered around the start of lick bouts for both intra-trial bouts and extra-trial bouts (Figures 3A3–C3). Both positively and negatively modulated populations had stronger spike rate modulation around lick start for intra-trial licks compared to extra-trial licks (compare amplitude of solid and dashed lines for intra- and extra-trial licking respectively, Figures 3A3–C3). We additionally quantified the contribution of motor output (i.e., licking) to spike rate modulation by comparing spike rates during no licks vs. extra-trial licks. We found that spike rates of the unmodulated and negatively modulated population during extra-trial licks significantly decreased relative to no-lick episodes (neg. modulated p = 5*10−7; unmodulated p = 1*10−4; Wilcoxon signed-rank test with Bonferroni correction; Figure 3E3). Spike rate modulation was not observed in the positively modulated population upon comparison of no-lick and extra-trial licking episodes, indicating that spiking in this population is insensitive to motor output (pos. modulated p > 0.05).
Since intra-trial licking is associated with reward expectation, intra-trial licks probably represent a combination of motor output and contextual value (i.e., reward delivery). We thus quantified spike rates between intra- and extra-trial licking to determine whether context (i.e., reward delivery) impacted spike rate beyond motor output only (extra-trial licks). All three populations showed spike rate modulation with respect to the cognitive context and have a higher median spike rate during intra-trial compared to extra-trial licking (pos. modulated p = 0.01; unmodulated p = 0.03; neg. modulated p = 0.02; Wilcoxon signed-rank test with Bonferroni correction; Figure 3E3). This indicates that context increased spike rates consistently in all three populations.
Since we cannot rule out that fine-scale differences in behavior during intra- and extra-trial licks may underlie differences in spike rate modulation, we quantified licking behavior after intra- and extra-trial licking bout starts (Figure 3D3). Indeed, intra-trial licking frequencies were higher compared to extra-trial licking frequencies. When different licking frequencies underlie spike rate differences between intra- and extra-trial lick bouts, spike rates should correlate with licking frequency. For a subset of units, spike rate indeed significantly correlated to licking rate (Supplementary Figures S2A1,A2). Moreover, differences in spike rates between intra- and extra-trial lick bouts persisted when licking frequency-correlated units were excluded from the analysis (Supplementary Figures S2B2,D2).
Positively Modulated Units Represent Contextual Value
Finally, we matched individual extra-trial lick bouts with intra-trial lick bouts based on similar behavioral statistics (i.e., lick frequency, number of licks and bout duration) to rule out behavioral differences and isolate contextual value. We found that spike rate modulation during licking was comparable between intra-trial and extra-trial licking bouts when behavioral statistics were matched for the negatively and unmodulated populations (p > 0.05, Wilcoxon’s signed-rank test with Bonferroni correction, Figure 4), which indicates that spike rate modulation during licking is due to motor output and not contextual value. For positively modulated units, spike rates during intra-trial licking bouts continued to be significantly higher compared to extra-trial licking bouts after matching behavioral characteristics (Figure 4E), indicating that context but not motor output impacted spike rates in this population (pos. modulated p = 0.01, Wilcoxon’s signed-rank test with Bonferroni correction).
We also changed the cognitive context by doubling the water reward or by delaying reward or punishment but these experimental manipulations did not consistently change the behavior or spike rates and these manipulations were therefore not used for further in-depth analyses (Supplementary Figure S3). Collectively, our data suggest that motor behavior and cognitive context contribute to spike rate modulation during task execution, but modulation depth depends on the neuronal population involved.
A subset of “Go” trials is associated by the absence of licking, leading to “Miss” categorization. As a consequence, CR trials may be the outcome of a “No-go” miss instead of an active decision to identify the trial as “No-go.” To test whether withholding licks during “No-go” trials is a passive process or is actively encoded by the mPFC, the bootstrap analysis was applied again (Figures 2A–C) to classify units based on Δ spike rate during correct “No-go” performance. We found positively (n = 9), negatively (n = 12) and unmodulated units (n = 170; 12 out of 21 units were also modulated during Hit trials; Supplementary Figure S4). We quantified the median spike rate of the “No-go” CR-negatively modulated population during the 1 s before “Go”-Hit outcomes and found that it was not negatively modulated (data not shown). In contrast, the “No-go” CR-positively modulated population showed a reduction in spike rate in the 1 s window before Hit outcomes (p = 0.006 sign test with Bonferroni correction; Supplementary Figure S4A3). Thus, the presence of CR-modulated units supports the hypothesis that “No-go” trials are actively encoded in mPFC and rats actively determine the outcome of both “Go” and “No-go” trials.
Negatively Modulated Units Signal Updating of Task Rules on a Trial-to-Trial Basis
The mPFC is involved in monitoring and updating behavior in highly dynamic environments. During the behavioral task, rats constantly received feedback on performance through water rewards for Hit trials and sound and TOs for FA trials. The general assumption is that these cues are integrated with existing internal rule representations and change behavioral strategies to drive rewarded behavior (Dalley et al., 2004; Rich and Shapiro, 2009; Durstewitz et al., 2010; Horst and Laubach, 2012; Narayanan et al., 2013). In accordance, rats were more likely to lick in “No-go” trials following a Hit trial compared to “No-go” trials following a FA trial (p = 0.012, Wilcoxon signed-rank test with multiple comparisons, significant difference between “No-go” trials after Hit or FA outcome; Proportion licked: Hit 0.20, 0.11/0.29; FA 0, 0.0/0.23, values represent median, 1st/3rd quartile; Figure 5A1). This effect was not observed for “Go” trials (Figure 5A2), since performance during ongoing trials did not depend on the outcome of the previous trial. Thus, rats adapt behavior on a trial-to-trial basis to avoid consecutive FA trials and associated TO punishments.
To address the question whether mPFC spiking correlates with the trial-to-trial behavioral shifting, we tested correlations between mPFC spiking activity during trial outcome (feedback) of Hit and FA trials and subsequent trial performance. Thus, we took the PETHs of the modulated units for Hit trials followed by a “No-go” trial (HitCR and HitFA) and similarly for FA trials followed by a “No-go” trial (FACR and FAFA; Figures 5B,C). We selected units that were recorded in sessions that contained both HitCR and HitFA or both FACR and FAFA trials for the following analyses. We computed the average spike frequency of units in the time window where feedback was received and integrated (i.e., 0.2–3 s after outcome). We hypothesized differential spiking activity when the behavior on the next trial corresponded with feedback signals in the previous trial (i.e., keep licking after reward: HitFA; or stop licking after punishment: FACR) compared to no correspondence (HitCR and FAFA). Neither positively modulated units, nor negatively modulated units showed different spike rates between HitCR and HitFA (Δ spike rate: pos. modulated HitCR 1.06, 0.02/5.10 Hz; HitFA 0.25, −0.51/1.59 Hz; neg. modulated HitCR −0.65, −1.61/0.05 Hz; HitFA −0.64, −1.47/0.01, values are median, 1st/3rd quartile; p > 0.05, Wilcoxon signed-rank test with Bonferroni correction; Figures 5B2,3). Thus, spike rates during correct “Go” trials did not predict performance during subsequent “No-go” trials. Similarly, positively modulated units showed comparable spike rates during feedback of FACR and FAFA trials (FACR 0.07, −0.16/1.39 Hz; FAFA 0.06, −0.60/2.12 Hz, values are median, 1st/3rd quartile; p > 0.05, Wilcoxon signed-rank test with Bonferroni correction; Figure 5C2). Surprisingly, negatively modulated units had lower spike rates during FAFA trial feedback (when the behavior was not changed) compared to FACR trials, when rats updated their behavior and correctly performed during the next “No-go” trial (FACR −0.08, −0.59/0.36 Hz; FAFA −0.60, −1.74/−0.20 Hz, values are median, 1st/3rd quartile; p = 0.012, Wilcoxon signed-rank test with Bonferroni correction; Figure 5C3). To test whether this effect was robust, the two most active neurons were removed from the population, but this did not affect statistical outcome (p = 0.043, Wilcoxon signed-rank test with Bonferroni correction).
Additional evidence for mPFC encoding of behavioral shifting can be found in spike rates of the CR-modulated units. The positive CR-modulated units did not encode behavioral updating after error feedback (n = 5; Supplementary Figures S5A,B), but the negative CR-modulated population spiked more during feedback integration of FAFA trials compared to FACR trials (p = 0.03, Wilcoxon signed-rank test with Bonferroni correction, n = 8; Supplementary Figure S5C). We therefore propose that both negative Hit-modulated and negative CR-modulated populations may carry the neurophysiological representation of adaptive behavior in response to unsuccessful trial outcome and in turn may drive trial-to-trial learning.
Spike Rate Increase in Positively Modulated Units Could Predict Inhibitory Control
It has been suggested that mPFC recruitment depends on the level of task difficulty (Euston et al., 2012; Zhang et al., 2014; Kamigaki and Dan, 2017) and that increasingly difficult tasks involve mPFC more strongly (Chudasama and Muir, 2001; White et al., 2018; Wu et al., 2018), while others suggest that mPFC encodes certainty (Hanks et al., 2015). Additionally, correct mPFC spiking activity may be essential for impulse control (Narayanan and Laubach, 2006, 2009; Chudasama et al., 2012; Gourley and Taylor, 2016). Therefore, in one session each day, we randomly intermixed two extra “No-go” locations, in addition to the Normal “No-go” location. These locations were chosen such that the Easy location was twice as far from the “Go” location (8 mm) as the Normal location (4 mm) and the Hard location was half as far (2 mm; Figure 6A). The rats performed as expected; they licked mostly for “Go” trials, made a relatively high number of (FA) mistakes for the Hard “No-go” location and progressively fewer mistakes for the Normal and Easy “No-go” locations (Proportion licked “Go”: 62.1 ± 3.8%; Hard “No-go”: 31.4 ± 3.8%; Normal “No-go”: 17.8 ± 3.4%; Easy “No-go”: 6.8 ± 1.7%; mean ± SEM; Figure 6B). During these sessions both the positively and negatively modulated populations (n = 20 and n = 35 respectively) responded similarly relative to regular sessions (compare Figure 3C with Figures 6C1,D1) with prolonged increase/decrease of spike rates during licking and reward consumption in Hit trials and rapid return to baseline spiking after Normal FAs (Figure 6C1). “No-go” trial type did not determine the spike rate of the negatively modulated units during FA and CR trials (Figure 6C, CR not shown). Spiking of the positively modulated units was also not dependent on the type of CR trials (data not shown). However, spike rates of positively modulated units differed between distinct types of lick trials (Δ spike rate: Hit 1.61, 0.49/2.36 Hz; Hard “No-go” 1.45, 0.32/2.72 Hz; Normal “No-go” 0.83, 0.16/2.18 Hz; Easy “No-go” −0.09, −0.78/1.12 Hz, values are median, 1st/3rd quartile; p = 0.01, Kruskal-Wallis test; Figure 6E). More specifically, positively modulated units exhibited reduced spiking in the 0–2 s prior to outcomes of Easy FA compared to the same time-window during Hard FA trials (p = 0.004, Wilcoxon signed-rank test with Bonferroni correction; Figure 6D3) and during Normal FA trials (p = 0.035, Wilcoxon signed-rank test with Bonferroni correction). Thus, mPFC spiking activity scales with task difficulty during error trials. Since rats may deploy different cognitive or behavioral strategies to solve Easy vs. Hard “No-go” trials (# of licks p = 0.055 and licking peak latency p = 0.063; Wilcoxon signed-rank test; Supplementary Figure S6), future work should elucidate which factors underlie the observed correlation between task difficulty and mPFC spiking activity.
The mPFC orchestrates a variety of cognitive functions, such as strategy optimization, reward seeking and inhibitory control (Narayanan and Laubach, 2006; Euston et al., 2012; Narayanan et al., 2013; Orsini et al., 2015; Pinto and Dan, 2015; Kim H. et al., 2016; Luchicchi et al., 2016; Guise and Shapiro, 2017; Kamigaki and Dan, 2017; Malagon-Vina et al., 2018). Pharmacological disturbance of mPFC leads to deficits in behavioral switching (Guise and Shapiro, 2017), stimulus discrimination (Pinto and Dan, 2015) and memory-guided decision making (Lagler et al., 2016), while optogenetic manipulation of mPFC neurons disturbs correct performance of attentional (Wimmer et al., 2015; Kim D. et al., 2016; Kim H. et al., 2016; Luchicchi et al., 2016; Schmitt et al., 2017), working memory (Horst and Laubach, 2012; Kim D. et al., 2016; Bolkan et al., 2017) and stimulus discrimination tasks (Kamigaki and Dan, 2017). Conversely, selectively increasing the activity of mPFC parvalbumin (PV) or vasoactive intestinal peptide (VIP) expressing neurons can increase performance in an attention task and a stimulus discrimination task respectively (Kim H. et al., 2016; Kamigaki and Dan, 2017). Neurophysiological correlates of both somatosensory stimulation and decision making were uncovered in the mPFC and the mPFC is necessary for passive tactile decision making (Le Merre et al., 2018). However, mPFC involvement in active tactile decision making and short-timescale behavioral updating has remained understudied. Here, we report mPFC neurophysiological representations of decision making during a “Go”/“No-go” whisker-based, object localization task (O’Connor et al., 2010a; Pammer et al., 2013).
We performed extracellular electrophysiological recordings in the mPFC of rats in combination with rigorous cluster analysis to reach single-unit resolution (Csicsvari et al., 2003; Barthó et al., 2004; Schmitzer-Torbert et al., 2005; Rossant et al., 2016) and aligned spiking responses to specific epochs of task performance. We found units showing positive spike rate modulation (increased spiking), negative modulation (decreased spiking) or no modulation (no change) when aligned to correct “Go” performance, which is comparable to sensory, motor, associative and prefrontal cortices (Narayanan and Laubach, 2009; Totah et al., 2009; O’Connor et al., 2010b, 2013; Hanks et al., 2015; Zagha et al., 2015; Kim H. et al., 2016; Ebbesen et al., 2017; Le Merre et al., 2018). We assumed that populations with specific modulation characteristics represent segregated (and potentially competing) microcircuits (Halladay and Blair, 2015; Kim H. et al., 2016) and thus analyzed these populations separately in subsequent in-depth analyses. We found that: (1) modulation of spike rates coincides with decision making and motor output and modulation depth correlates to cognitive context; (2) low excitation of positively modulated units during Easy “No-go” trials predicts FA mistakes; and (3) units negatively modulated during correct “Go” behavior encode behavioral state updating by increased spiking after an error was made.
mPFC as a Single Functional Area
Even though there is evidence showing differential functional roles and innervation patterns of distinct mPFC areas (Heidbreder and Groenewegen, 2003; Euston et al., 2012; Gourley and Taylor, 2016), we chose not to subdivide the mPFC. We found significantly modulated units over the entire dorso-ventral extent of the mPFC without different proportions of significantly modulated units along the dorso-ventral axis (Supplementary Figure S7). Classical cytoarchitectonic divisions of mPFC (i.e., AC, PL and IL cortex) or divisions into the dmPFC and the vmPFC (Gabbott et al., 2005; Luchicchi et al., 2016) lead to ambiguous and potentially artificial subdivisions of the mPFC (Heidbreder and Groenewegen, 2003; Euston et al., 2012; Hardung et al., 2017). The absence of a dorso-ventral distinction of unit task-selectivity in our dataset strengthens our confidence that for our task mPFC subdivision is unnecessary and potentially counterproductive.
mPFC Processing of Tactile Behavior
Primary and/or secondary sensory cortices govern stimulus detection after low intensity sensory stimuli (Yang et al., 2016; Le Merre et al., 2018), whereas CA1 and mPFC are necessary for correct performance in detection tasks (Pinto and Dan, 2015; Le Merre et al., 2018). However, it remains unknown how sensory information is integrated into the mPFC during stimulus discrimination and how the mPFC influences sensory processing during tasks that involve sensation. Activity of mPFC axons increases stimulus detection and discrimination by top-down center-surround contrast enhancement in V1 (Zhang et al., 2014) and are necessary to respond to low-contrast stimuli during conditioning (Wu et al., 2018). The mPFC also increases stimulus detection by driving modality-dependent attention through the thalamus (Wimmer et al., 2015). The mPFC does not receive strong direct inputs from S1 (Hoover and Vertes, 2007; DeNardo et al., 2015) or somatosensory thalamic nuclei VPM and PoM (Hoover and Vertes, 2007), although somatosensory field potentials are observed in the mPFC after whisker stimulation in both anesthetized rats (Martin-Cortecero and Nuñez, 2016), awake untrained mice and trained mice (Le Merre et al., 2018). We did not observe robust changes in spiking upon touch in the mPFC (data not shown). Nonetheless, tactile information probably should be present in the mPFC to correctly drive tactile decision making. We did see a fast increase in local field potential amplitude (latency of approximately 50 ms, data not shown) after the first touch for each trial, but we cannot exclude the possibility that this local field potential carries multimodal information. When the mPFC does not encode sensory information directly, the alternative explanation could be that sensory information from primary sensory cortices is summarized by feature extraction and/or categorization (Hanks et al., 2015) in an upstream brain region, e.g., the more dorsal parts of frontal cortex (Sreenivasan et al., 2017) or the orbitofrontal cortex as has been suggested by Euston et al. (2012), since there is strong innervation from both these regions to mPFC (Hoover and Vertes, 2007). During our task we do not identify specific sensory spiking patterns. However, it remains enigmatic whether this is due to absence of sensory signals altogether or because the sensory information is packaged in a way that we cannot yet recognize and decode in separation from the motor and cognitive components of the task.
Recorded Populations and Their Characteristics
In our dataset, we aimed to address coding properties of pyramidal projecting neurons and therefore excluded putative GABAergic interneurons based on their fast-spiking waveform (Barthó et al., 2004). A small population of interneurons with a regular-spiking waveform may nevertheless remain in our dataset (Kim D. et al., 2016). In addition, histological recovery of recording location indicated that the predominant fraction of units was recorded in the deep layers (layer 5 and 6). Interpretations of the output of the mPFC circuitry may thus be influenced by overrepresentation of units from these layers. This is important to consider since encoding of task-relevant information will almost certainly be layer- and cell-type specific and thus correlate to projection target (Gabbott et al., 2005; Dembrow et al., 2010; Pinto and Dan, 2015; Kamigaki and Dan, 2017). For example, layer 5 units can project to the lateral hypothalamus and carry appetite-related signals or send motor signals to striatum, ventral tegmental area and spinal cord, while units from layer 6 can project to the mediodorsal (MD) thalamus (Vertes, 2002, 2004; Gabbott et al., 2005; Morishima et al., 2011). Additionally, the spiking activity of layer 2 and layer 3 output projections, most notably to basolateral amygdala (Gabbott et al., 2005), need to be determined to generate a comprehensive view on cell-type and layer-specific activity in mPFC during the whisker-guided objection localization task. We have not been able to link our three populations of modulated/unmodulated units to projection target or specific morphological types (Dembrow et al., 2010; Morishima et al., 2011; van Aerde and Feldmeyer, 2015; Kawaguchi, 2017). However, this may be essential for future experiments to increase the interpretability of links between the morphological structure, electrophysiological function, and influence on the behavior of the recorded populations.
Activity in mPFC Pyramidal Neurons Is More Than Motor Representation
In our study, we used the first lick during the decision window as a discrete trigger to signal decision making, but this means that it is not straightforward to disentangle spiking involved in decision making (e.g., motivation, cognitive context and sensory inputs) from spiking involved in motor output (Horst and Laubach, 2013; Amarante et al., 2017). This could be solved by temporally separating the response moment from motor output (delayed reward), but this goes at the expense of identification of the exact decision moment. Instead, we compared licks during and outside trials to quantify the impact of context on spike rate and “no licks” vs. “extra-trial licks” to quantify the influence of motor output. We show that motor output without contextual value (i.e., lick outside trial) leads to a decrease in spike rates, restricted to the populations that showed either no or negative modulation of spike rate during correct “Go” performance. In contrast, the population that showed positive modulation of spike rate during correct “Go” behavior did not show spike rate modulation by motor output. However, the positively modulated population showed a significant difference in spike rate during licks with and without trial-context (Figures 3E3, 4E), indicating a potential influence of motivation or attention processes on spiking. Collectively, cognitive context and motor output influence spiking in mPFC. Spike rate modulation by motor output and context could ultimately be the outcome of excitability changes by neuromodulatory inputs from e.g., the dopaminergic or cholinergic system (reviewed in Thiele and Bellgrove, 2018) or due to strong interactions with regions involved in appetitive behaviors, such as e.g., the supra-mammillary nucleus and lateral hypothalamus (Ikemoto et al., 2004; Hoover and Vertes, 2007; Reppucci and Petrovich, 2016).
Correlate of Behavioral Updating
It was proposed previously that mPFC may be critical for task learning, but perhaps not as much during task execution (Liu et al., 2014). This hypothesis is at odds with our finding of trial-to-trial changes in spike rates in the mPFC correlating to task performance. In addition, manipulation of the mPFC during a task shows that mPFC is needed to perform many behaviors, from bottom-up stimulus detection (Le Merre et al., 2018) to attention (Kim H. et al., 2016; Luchicchi et al., 2016) and top-down attentional selection (Wimmer et al., 2015; Schmitt et al., 2017). The activity of the mPFC contains short-term memories of actions and their consequences (Narayanan and Laubach, 2008; Horst and Laubach, 2012). Additionally, the mPFC contains a physiological representation of current task rules or strategies to solve the task (Durstewitz et al., 2010; Cho et al., 2015; Malagon-Vina et al., 2018). The mPFC, thus, has access to all information needed to compute strategies to optimize action outcome. Behavioral updating in mPFC is signaled by theta frequency oscillations (Narayanan et al., 2013) and it could be that spiking of the negatively modulated population is phase locked to theta (Horst and Laubach, 2013; Amarante et al., 2017) to broadcast relevant updates to the behavioral state to downstream targets (Fries, 2005; Womelsdorf et al., 2007). Furthermore, the mPFC has indirect control over actions, i.e., optogenetically stimulating efferent projections from the mPFC to the nucleus accumbens core (NAc) and the periventricular nucleus of thalamus (PVT) leads to enhancement or suppression of reward seeking behaviors after negative outcomes (Kim et al., 2017; Otis et al., 2017). Behavioral updating also depends on mPFC to MD thalamus output (Marton et al., 2018) and risky decision making after negative outcomes involves VTA to mPFC projection (Verharen et al., 2018). In the current work, we cannot show causality of spiking rates to changes in behavior. It could be that mPFC spike rate changes during behavioral updating are simply caused by the same upstream neuronal processes, or that it is not the spiking rate change, but rather phase locking of spiking to oscillations that result in behavioral updating (Fries, 2005; Narayanan et al., 2013; Amarante et al., 2017). Furthermore, it remains to be determined whether the functional populations of (positively modulated, unmodulated and negatively modulated) units represent anatomically distinct groups or heterogeneous populations.
mPFC Excitation Needed to Prevent Avoidable Mistakes
We show that insufficient spiking in positively modulated units predicts FA mistakes when task conditions are easy. We hypothesize that this may be due to either a lack of top-down attentional filtering (Dalley et al., 2004; Zhang et al., 2014; Wimmer et al., 2015; Kim H. et al., 2016; Luchicchi et al., 2016) or due to insufficient inhibitory control (Narayanan and Laubach, 2006, 2009; Chudasama et al., 2012; Luchicchi et al., 2016; Kamigaki and Dan, 2017). The mPFC is an important area for attentional processing and activation of mPFC projections have been shown to increase stimulus detection and contrast in primary visual cortex (V1; Zhang et al., 2014). Similarly, mPFC projections to the pons are selectively necessary to detect low-contrast stimuli during conditioning (Wu et al., 2018). Additionally, mPFC to thalamus projections are needed for proper selection of sensory modality (Wimmer et al., 2015) and top-down input from mPFC to claustrum is needed for correct attention behavior (White et al., 2018). Thus, the mPFC is driving top-down stimulus-response associations. In our experiment, however, we assume that the Easy “No-go” position is easy to distinguish from the “Go” position and that bottom-up input should therefore be sufficient to select the proper action. Thus, we suggest that insufficient excitation of mPFC neurons through malfunctioning of the mPFC-driven inhibitory control mechanisms leads to FA errors. Under Normal and Hard conditions, the Δ spike rate during the low number of errors of impulsivity could get lost in the larger number of errors of discrimination. The link between impulsivity and reduced mPFC spiking is further supported by findings that reduction of mPFC activity leads to more impulsive prematurely expressed responses (Narayanan and Laubach, 2006, 2009; Kim H. et al., 2016; Luchicchi et al., 2016; Hardung et al., 2017) and FAs (Kamigaki and Dan, 2017).
In summary, we quantified neurophysiological representations of motivational and cognitive context in the mPFC of rats performing a whisker-based object localization task (Figure 3). We show that a specific fraction of negatively modulated units could underlie behavioral updating on a trial-to-trial basis (Figure 5) and that a subpopulation of mPFC units needs to be sufficiently activated to maintain task performance under dynamic “No-go” task conditions (Figure 6). These results could provide a framework for future studies investigating causality of mPFC neuronal spiking activity to behavioral updating and stimulus detection.
RH, HM and CK designed the study. RH performed the experiments. AP, VN and RH designed and programmed the behavioral hardware and software. JL, SB, RH and CK analyzed the data. RH and CK wrote the manuscript with input from all authors.
This work was supported by the Max Planck Society, Center for Neurogenomics and Cognitive Research (CNCR), Neuroscience Campus Amsterdam (NCA), and the VU Amsterdam and funding from the Netherlands Organization for Scientific Research to CK (Nederlandse Organisatie voor Wetenschappelijk Onderzoek; Grant No. NWO ALW #822.02.013).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We would like to thank Prof. Dr. B. Sakmann for advice and support. In addition, we would like to thank Prof. Dr. K. Svoboda and Prof. Dr. C. Schwarz for advice on setting up a head-fixed whisker-based task for rats. Furthermore, we want to thank R. Elsinga of Fine Mechanics of BetaVU and T.J.F. Brouwer and P.J.M. Caspers from Electronic Engineering of BetaVU for building and helping design our custom behavioral hardware. In addition, we thank Dr. T. Pattij and H. Terra for comments on a previous version of the manuscript.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fncir.2018.00075/full#supplementary-material
Barthó, P., Hirase, H., Monconduit, L., Zugaro, M., Harris, K. D., and Buzsáki, G. (2004). Characterization of neocortical principal cells and interneurons by network interactions and extracellular features. J. Neurophysiol. 92, 600–608. doi: 10.1152/jn.01170.2003
Bolkan, S. S., Stujenske, J. M., Parnaudeau, S., Spellman, T. J., Rauffenbart, C., Abbas, A. I., et al. (2017). Thalamic projections sustain prefrontal activity during working memory maintenance. Nat. Neurosci. 20, 987–996. doi: 10.1038/nn.4568
Cho, K. K., Hoch, R., Lee, A. T., Patel, T., Rubenstein, J. L., and Sohal, V. S. (2015). γ rhythms link prefrontal interneuron dysfunction with cognitive inflexibility in Dlx5/6+/– mice. Neuron 85, 1332–1343. doi: 10.1016/j.neuron.2015.02.019
Chudasama, Y., Doobay, V. M., and Liu, Y. (2012). Hippocampal-prefrontal cortical circuit mediates inhibitory response control in the rat. J. Neurosci. 32, 10915–10924. doi: 10.1523/JNEUROSCI.1463-12.2012
Csicsvari, J., Henze, D. A., Jamieson, B., Harris, K. D., Sirota, A., Barthó, P., et al. (2003). Massively parallel recording of unit and local field potentials with silicon-based electrodes. J. Neurophysiol. 90, 1314–1323. doi: 10.1152/jn.00116.2003
Dalley, J. W., Cardinal, R. N., and Robbins, T. W. (2004). Prefrontal executive and cognitive functions in rodents: neural and neurochemical substrates. Neurosci. Biobehav. Rev. 28, 771–784. doi: 10.1016/j.neubiorev.2004.09.006
DeNardo, L. A., Berns, D. S., DeLoach, K., and Luo, L. (2015). Connectivity of mouse somatosensory and prefrontal cortex examined with trans-synaptic tracing. Nat. Neurosci. 18, 1687–1697. doi: 10.1038/nn.4131
Durstewitz, D., Vittoz, N. M., Floresco, S. B., and Seamans, J. K. (2010). Abrupt transitions between prefrontal neural ensemble states accompany behavioral transitions during rule learning. Neuron 66, 438–448. doi: 10.1016/j.neuron.2010.03.029
Gabbott, P. L., Warner, T. A., Jays, P. R., Salway, P., and Busby, S. J. (2005). Prefrontal cortex in the rat: projections to subcortical autonomic, motor, and limbic centers. J. Comp. Neurol. 492, 145–177. doi: 10.1002/cne.20738
Gruber, A. J., Calhoon, G. G., Shusterman, I., Schoenbaum, G., Roesch, M. R., and O’Donnell, P. (2010). More is less: a disinhibited prefrontal cortex impairs cognitive flexibility. J. Neurosci. 30, 17102–17110. doi: 10.1523/jneurosci.4623-10.2010
Halladay, L. R., and Blair, H. T. (2015). Distinct ensembles of medial prefrontal cortex neurons are activated by threatening stimuli that elicit excitation vs. J. Neurophysiol. 114, 793–807. doi: 10.1152/jn.00656.2014
Hanks, T. D., Kopec, C. D., Brunton, B. W., Duan, C. A., Erlich, J. C., and Brody, C. D. (2015). Distinct relationships of parietal and prefrontal cortices to evidence accumulation. Nature 520, 220–223. doi: 10.1038/nature14066
Hardung, S., Epple, R., Jäckel, Z., Eriksson, D., Uran, C., Senn, V., et al. (2017). A functional gradient in the rodent prefrontal cortex supports behavioral inhibition. Curr. Biol. 27, 549–555. doi: 10.1016/j.cub.2016.12.052
Heidbreder, C. A., and Groenewegen, H. J. (2003). The medial prefrontal cortex in the rat: evidence for a dorso-ventral distinction based upon functional and anatomical characteristics. Neurosci. Biobehav. Rev. 27, 555–579. doi: 10.1016/j.neubiorev.2003.09.003
Horst, N. K., and Laubach, M. (2012). Working with memory: evidence for a role for the medial prefrontal cortex in performance monitoring during spatial delayed alternation. J. Neurophysiol. 108, 3276–3288. doi: 10.1152/jn.01192.2011
Ikemoto, S., Witkin, B. M., Zangen, A., and Wise, R. A. (2004). Rewarding effects of AMPA administration into the supramammillary or posterior hypothalamic nuclei but not the ventral tegmental area. J. Neurosci. 24, 5758–5765. doi: 10.1523/jneurosci.5367-04.2004
Kim, D., Jeong, H., Lee, J., Ghim, J. W., Her, E. S., Lee, S. H., et al. (2016). Distinct roles of parvalbumin- and somatostatin-expressing interneurons in working memory. Neuron 92, 902–915. doi: 10.1016/j.neuron.2016.09.023
Kim, C. K., Ye, L., Jennings, J. H., Pichamoorthy, N., Tang, D. D., Yoo, A. W., et al. (2017). Molecular and circuit-dynamical identification of top-down neural mechanisms for restraint of reward seeking. Cell 170, 1013.e14–1027.e14. doi: 10.1016/j.cell.2017.07.020
Koike, H., Demars, M. P., Short, J. A., Nabel, E. M., Akbarian, S., Baxter, M. G., et al. (2016). Chemogenetic inactivation of dorsal anterior cingulate cortex neurons disrupts attentional behavior in mouse. Neuropsychopharmacology 41, 1014–1023. doi: 10.1038/npp.2015.229
Lagler, M., Ozdemir, A. T., Lagoun, S., Malagon-Vina, H., Borhegyi, Z., Hauer, R., et al. (2016). Divisions of identified parvalbumin-expressing basket cells during working memory-guided decision making. Neuron 91, 1390–1401. doi: 10.1016/j.neuron.2016.08.010
Le Merre, P., Esmaeili, V., Charriere, E., Galan, K., Salin, P. A., Petersen, C. C. H., et al. (2018). Reward-based learning drives rapid sensory signals in medial prefrontal cortex and dorsal hippocampus necessary for goal-directed behavior. Neuron 97, 83.e5–91.e5. doi: 10.1016/j.neuron.2017.11.031
Liu, D., Gu, X., Zhu, J., Zhang, X., Han, Z., Yan, W., et al. (2014). Medial prefrontal activity during delay period contributes to learning of a working memory task. Science 346, 458–463. doi: 10.1126/science.1256573
Luchicchi, A., Mnie-Filali, O., Terra, H., Bruinsma, B., de Kloet, S. F., Obermayer, J., et al. (2016). Sustained attentional states require distinct temporal involvement of the dorsal and ventral medial prefrontal cortex. Front. Neural Circuits 10:70. doi: 10.3389/fncir.2016.00070
Malagon-Vina, H., Ciocchi, S., Passecker, J., Dorffner, G., and Klausberger, T. (2018). Fluid network dynamics in the prefrontal cortex during multiple strategy switching. Nat. Commun. 9:309. doi: 10.1038/s41467-017-02764-x
Martin-Cortecero, J., and Nuñez, A. (2016). Sensory responses in the medial prefrontal cortex of anesthetized rats. Implications for sensory processing. Neuroscience 339, 109–123. doi: 10.1016/j.neuroscience.2016.09.045
Marton, T., Seifikar, H., Luongo, F. J., Lee, A. T., and Sohal, V. S. (2018). Roles of prefrontal cortex and mediodorsal thalamus in task engagement and behavioral flexibility. J. Neurosci. 38, 2569–2578. doi: 10.1523/jneurosci.1728-17.2018
Narayanan, N. S., Horst, N. K., and Laubach, M. (2006). Reversible inactivations of rat medial prefrontal cortex impair the ability to wait for a stimulus. Neuroscience 139, 865–876. doi: 10.1016/j.neuroscience.2005.11.072
Narayanan, R. T., Mohan, H., Broersen, R., de Haan, R., Pieneman, A. W., and de Kock, C. P. (2014). Juxtasomal biocytin labeling to study the structure-function relationship of individual cortical neurons. J. Vis. Exp. 84:e51359. doi: 10.3791/51359
O’Connor, D. H., Clack, N. G., Huber, D., Komiyama, T., Myers, E. W., and Svoboda, K. (2010a). Vibrissa-based object localization in head-fixed mice. J. Neurosci. 30, 1947–1967. doi: 10.1523/jneurosci.3762-09.2010
O’Connor, D. H., Peron, S. P., Huber, D., and Svoboda, K. (2010b). Neural activity in barrel cortex underlying vibrissa-based object localization in mice. Neuron 67, 1048–1061. doi: 10.1016/j.neuron.2010.08.026
O’Connor, D. H., Hires, S. A., Guo, Z. V., Li, N., Yu, J., Sun, Q. Q., et al. (2013). Neural coding during active somatosensation revealed using illusory touch. Nat. Neurosci. 16, 958–965. doi: 10.1038/nn.3419
Orsini, C. A., Moorman, D. E., Young, J. W., Setlow, B., and Floresco, S. B. (2015). Neural mechanisms regulating different forms of risk-related decision-making: insights from animal models. Neurosci. Biobehav. Rev. 58, 147–167. doi: 10.1016/j.neubiorev.2015.04.009
Otis, J. M., Namboodiri, V. M., Matan, A. M., Voets, E. S., Mohorn, E. P., Kosyk, O., et al. (2017). Prefrontal cortex output circuits guide reward seeking through divergent cue encoding. Nature 543, 103–107. doi: 10.1038/nature21376
Pammer, L., O’Connor, D. H., Hires, S. A., Clack, N. G., Huber, D., Myers, E. W., et al. (2013). The mechanical variables underlying object localization along the axis of the whisker. J. Neurosci. 33, 6726–6741. doi: 10.1523/jneurosci.4316-12.2013
Reppucci, C. J., and Petrovich, G. D. (2016). Organization of connections between the amygdala, medial prefrontal cortex, and lateral hypothalamus: a single and double retrograde tracing study in rats. Brain Struct. Funct. 221, 2937–2962. doi: 10.1007/s00429-015-1081-0
Rodgers, C. C., and DeWeese, M. R. (2014). Neural correlates of task switching in prefrontal cortex and primary auditory cortex in a novel stimulus selection task for rodents. Neuron 82, 1157–1170. doi: 10.1016/j.neuron.2014.04.031
Rossant, C., Kadir, S. N., Goodman, D. F. M., Schulman, J., Hunter, M. L. D., Saleem, A. B., et al. (2016). Spike sorting for large, dense electrode arrays. Nat. Neurosci. 19, 634–641. doi: 10.1038/nn.4268
Rozeske, R. R., Jercog, D., Karalis, N., Chaudun, F., Khoder, S., Girard, D., et al. (2018). Prefrontal-periaqueductal gray-projecting neurons mediate context fear discrimination. Neuron 97, 898.e6–910.e6. doi: 10.1016/j.neuron.2017.12.044
Schmitt, L. I., Wimmer, R. D., Nakajima, M., Happ, M., Mofakham, S., and Halassa, M. M. (2017). Thalamic amplification of cortical connectivity sustains attentional control. Nature 545, 219–223. doi: 10.1038/nature22073
Schmitzer-Torbert, N., Jackson, J., Henze, D., Harris, K., and Redish, A. D. (2005). Quantitative measures of cluster quality for use in extracellular recordings. Neuroscience 131, 1–11. doi: 10.1016/j.neuroscience.2004.09.066
Senn, V., Wolff, S. B., Herry, C., Grenier, F., Ehrlich, I., Gründemann, J., et al. (2014). Long-range connectivity defines behavioral specificity of amygdala neurons. Neuron 81, 428–437. doi: 10.1016/j.neuron.2013.11.006
Siegle, J. H., López, A. C., Patel, Y. A., Abramov, K., Ohayon, S., and Voigts, J. (2017). Open Ephys: an open-source, plugin-based platform for multichannel electrophysiology. J. Neural Eng. 14:045003. doi: 10.1088/1741-2552/aa5eea
Sreenivasan, V., Kyriakatos, A., Mateo, C., Jaeger, D., and Petersen, C. C. (2017). Parallel pathways from whisker and visual sensory cortices to distinct frontal regions of mouse neocortex. Neurophotonics 4:031203. doi: 10.1117/1.nph.4.3.031203
Totah, N. K., Kim, Y. B., Homayoun, H., and Moghaddam, B. (2009). Anterior cingulate neurons represent errors and preparatory attention within the same behavioral sequence. J. Neurosci. 29, 6418–6426. doi: 10.1523/jneurosci.1142-09.2009
van Aerde, K. I., and Feldmeyer, D. (2015). Morphological and physiological characterization of pyramidal neuron subtypes in rat medial prefrontal cortex. Cereb. Cortex 25, 788–805. doi: 10.1093/cercor/bht278
Verharen, J. P. H., de Jong, J. W., Roelofs, T. J. M., Huffels, C. F. M., van Zessen, R., Luijendijk, M. C. M., et al. (2018). A neuronal mechanism underlying decision-making deficits during hyperdopaminergic states. Nat. Commun. 9:731. doi: 10.1038/s41467-018-03087-1
White, M. G., Panicker, M., Mu, C., Carter, A. M., Roberts, B. M., Dharmasri, P. A., et al. (2018). Anterior cingulate cortex input to the claustrum is required for top-down action Control. Cell Rep. 22, 84–95. doi: 10.1016/j.celrep.2017.12.023
Wimmer, R. D., Schmitt, L. I., Davidson, T. J., Nakajima, M., Deisseroth, K., and Halassa, M. M. (2015). Thalamic control of sensory selection in divided attention. Nature 526, 705–709. doi: 10.1038/nature15398
Wimmer, K., Spinelli, P., and Pasternak, T. (2016). Prefrontal neurons represent motion signals from across the visual field but for memory-guided comparisons depend on neurons providing these signals. J. Neurosci. 36, 9351–9364. doi: 10.1523/jneurosci.0843-16.2016
Womelsdorf, T., Schoffelen, J. M., Oostenveld, R., Singer, W., Desimone, R., Engel, A. K., et al. (2007). Modulation of neuronal interactions through neuronal synchronization. Science 316, 1609–1612. doi: 10.1126/science.1139597
Wu, G. Y., Liu, S. L., Yao, J., Sun, L., Wu, B., Yang, Y., et al. (2018). Medial prefrontal cortex-pontine nuclei projections modulate suboptimal cue-induced associative motor learning. Cereb. Cortex 28, 880–893. doi: 10.1093/cercor/bhw410
Keywords: medial prefrontal cortex, tactile decision making, behavioral adaptation, mPFC, electrophysiology, spiking modulation
Citation: de Haan R, Lim J, van der Burg SA, Pieneman AW, Nigade V, Mansvelder HD and de Kock CPJ (2018) Neural Representation of Motor Output, Context and Behavioral Adaptation in Rat Medial Prefrontal Cortex During Learned Behavior. Front. Neural Circuits 12:75. doi: 10.3389/fncir.2018.00075
Received: 07 June 2018; Accepted: 04 September 2018;
Published: 01 October 2018.
Edited by:David J. Margolis, Rutgers University, The State University of New Jersey, United States
Reviewed by:Daniel H. O’Connor, Johns Hopkins University, United States
Mark Laubach, American University, United States
Copyright © de Haan, Lim, van der Burg, Pieneman, Nigade, Mansvelder and de Kock. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Christiaan P. J. de Kock, email@example.com
† These authors have contributed equally to this work