Original Research ARTICLE
Does area V3A predict positions of moving objects?
- 1 Department of Psychology, University of California at Berkeley, Berkeley, CA, USA
- 2 Center for Mind and Brain, University of California Davis, Davis, CA, USA
- 3 School of Psychology, University of Sussex, Brighton, UK
- 4 Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- 5 Brain Imaging Center Frankfurt and Department of Neurophysiology, Max Planck Institute for Brain Research, Frankfurt am Main, Germany
- 6 Centre for Cognitive Neuroimaging, Institute of Neuroscience and Psychology, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
A gradually fading moving object is perceived to disappear at positions beyond its luminance detection threshold, whereas abrupt offsets are usually localized accurately. What role does retinotopic activity in visual cortex play in this motion-induced mislocalization of the endpoint of fading objects? Using functional magnetic resonance imaging (fMRI), we localized regions of interest (ROIs) in retinotopic maps abutting the trajectory endpoint of a bar moving either toward or away from this position while gradually decreasing or increasing in luminance. Area V3A showed predictive activity, with stronger fMRI responses for motion toward versus away from the ROI. This effect was independent of the change in luminance. In Area V1 we found higher activity for high-contrast onsets and offsets near the ROI, but no significant differences between motion directions. We suggest that perceived final positions of moving objects are based on an interplay of predictive position representations in higher motion-sensitive retinotopic areas and offset transients in primary visual cortex.
Processing of visual information in the nervous system takes time. For an acting organism in a dynamic environment this can cause severe problems. Processing delays mean that perceptual systems in the brain do not have access to the true position of a moving object at any time, but will always have delayed information. For interaction with a moving object accurate localization by the perceptual system is advantageous, as it reduces computational cost in the motor system, which additionally must deal with delays in efferent pathways and muscles (Wolpert et al., 1995). It has been proposed that perceptual systems comprise a prediction mechanism for moving objects that helps to overcome the consequences of neural delays. This mechanism becomes apparent in a variety of visual illusions, where motion causes objects to be perceptually mislocalized, usually shifted forward in the direction of motion (Whitney, 2002). One prominent example is the flash-lag effect: a moving object is seen ahead of a flashed object, although both are physically aligned at the time of the flash (Nijhawan, 1994, 2008).
The absence of a perceived overshoot when moving objects disappear abruptly has been used to argue that the perceived position of moving objects is not generally based on predictive localization in the visual system (Eagleman and Sejnowski, 2000). However, some studies showed that predictive overshoots do exist in certain conditions, for example for low-contrast (Kanai et al., 2004), blurred (Fu et al., 2001), gradually fading moving objects (Maus and Nijhawan, 2006), biological motion (Kessler et al., 2010), or objects moving into the retinal blind spot (Maus and Nijhawan, 2008). Maus and Nijhawan (2006) employed a gradually fading moving object and showed that it is perceived to disappear at luminance contrasts below detection threshold. The final part of the perceived trajectory is not based on retinal input, but on cortically extrapolated positions. Thus, when moving objects disappear without providing a transient signal, the object is seen disappearing in a predicted position. Abrupt offsets provide accurately localized transient signals that can mask extrapolated representations and thus facilitate the accurate perception of the final position.
What cortical areas are involved in this particular kind of predictive position perception? Possible mechanisms include lateral spread of activity within retinotopic maps (Jancke et al., 2004) and feedback from motion-sensitive higher areas to V1 (Pascual-Leone and Walsh, 2001; McGraw et al., 2004; Wibral et al., 2009). Using functional magnetic resonance imaging (fMRI) it has been shown that in long-range apparent motion retinotopic activity in unstimulated regions of V1 reflects the illusory motion percept (Muckli et al., 2005; Larsen et al., 2006; Sterzer et al., 2006). Cortical feedback to V1 is essential for predictive coding of objects along the path of apparent motion (Wibral et al., 2009; Alink et al., 2010; Muckli, 2010).
Here we measure activity in retinotopic areas corresponding to the illusory part of the trajectory of fading moving objects (Maus and Nijhawan, 2006). We focused on early visual areas: primary visual cortex (V1), area V2/V3, and area V3A, a mid-level area implicated in the perception of motion. A moving visual stimulus either gradually increased or decreased in luminance contrast, terminating either in a high- or low-contrast offset in its final position. We identified regions of interest (ROIs) in retinotopic maps abutting the start or end point of the motion trajectory. By comparing activity in the ROI for stimuli moving toward or away from the ROI we were able to analyze which retinotopic areas show predictive shifts of the represented object position.
Materials and Methods
Ten participants (five female, mean age 25.9, standard deviation 4.4 years) took part in the fMRI study. All had normal or corrected to normal visual acuity and gave written consent after being informed about the study and the procedure. All procedures were approved by local ethics boards.
Stimuli were projected with an LCD projector (Sanyo PLC-XP41 with a custom-build zoom lens) onto a frosted screen mounted in the scanner bore. Participants viewed the screen via a mirror from approximately 30 cm viewing distance.
We were interested in the cortical representations of the start and end positions of a moving object’s trajectory. The moving object consisted of a white radial bar (0.17 dva × 1.74 dva)1 on a black background, moving smoothly for one-quarter of a circle in the upper right quadrant of the visual field (see Figure 1A). The inner end of the bar was at 6.4 dva eccentricity from the central fixation cross. The area of interest was the start/end position of the bar’s trajectory to the right of the fixation spot (at the 3 o’clock position). In two offset conditions the bar moved clockwise toward this area, while either fading constantly until it disappeared at the 3 o’clock position in a low-contrast offset (lc-off), or it gradually increased contrast and disappeared abruptly in a high-contrast offset (hc-off). In two more conditions the bar moved counter-clockwise, either starting in a high-contrast onset (hc-on) or a low-contrast onset (lc-on, Figures 1A,B). Please note that the lc-off and lc-on conditions (as well as hc-off and hc-on) consisted of identical stimulation and differed only in the direction of motion of the bar. Critically however, onset is unpredictable and incorporates therefore no predictable signal whereas offset is anticipated by the movement toward the endpoint and incorporates predictive signals and the illusion of prolongated fading in the case of low-contrast offset. The rotation velocity of the bar in all conditions was 108°s−1. The motion sequence lasted 833 ms.
Figure 1. (A) Illustration of the moving stimuli. All motion was smooth, only “snapshots” at different time points of the trajectory are shown here. In the low-contrast offset condition (lc-off) a radial bar appeared at the 12 o’clock position and moved smoothly in a clockwise direction around the fixation point, while gradually fading. At the 3 o’clock position, on the horizontal meridian, it vanished. Oppositely, the low-contrast onset stimulus (lc-on) started moving from the 3 o’clock position and gradually increased in contrast until the 12 o’clock position, where it disappeared abruptly. These two conditions had weak transients near the region of interest just below the horizontal meridian. (B) In the other two conditions (hc-off and hc-on) the moving bar had high-contrast onsets/offsets at the horizontal meridian. (C) The checkerboard conditions were used as localizers to identify ROIs just below and just above the horizontal meridian. All three conditions are shown here (cb-up, cb-mid, cb-low), although only one checkerboard was presented in any given trial.
For the retinotopic localization of regions of interest (ROIs) in visual cortex we presented rectangular flickering checkerboards (0.35 dva × 1.93 dva), in the area where the moving bars had their on- and offsets, to the right of the fixation cross on the horizontal meridian (cb-mid), as well as rotated 4° upward (counter-clockwise around the fixation cross; upper checkerboard, cb-up) and 4° downward (clockwise; lower checkerboard, cb-low, see Figure 1C). The resulting distance between upper and lower checkerboard was only 0.89 dva at the inner edge and 1.16 dva at the outer edge, which is at the limit of what can be separated using fMRI (Kraft et al., 2005; Yacoub et al., 2008). The checkerboards consisted of a rectangle comprised of 2 × 11 alternating black and white squares that flickered at 6.7 Hz for 833 ms. The upper checkerboard was still on the motion trajectory whereas the lower checkerboard was on the illusory path that is perceived in low-contrast offset conditions only. The motivation for the mapping procedure was twofold: (1) to map the endpoint region in which the movement was either moving toward and ending (lc-off, hc-off) or starting (lc-on, hc-on), and (2) on a finer scale to map the illusion related prolongation (lc-off).
Outside the MRI scanner we checked if the experimental stimuli replicated the effect described by Maus and Nijhawan (2006). For this we presented stimuli on a CRT screen, and asked five participants (one also took part in the fMRI study) to judge whether the endpoint (in the offset conditions with clockwise motion) or the start point (in the onset conditions with counter-clockwise motion) of the trajectory was above or below the horizontal meridian. The offset and onset conditions were presented in separate blocks of 280 trials each, with low- and high-contrast conditions randomly intermixed. The physical endpoint of the trajectory was varied to be in one of seven positions between 6° above and below the horizontal meridian in the offset conditions; the physical startpoint was between 2° above and 10° below the meridian in the onset conditions. Psychometric functions were fitted to each participant’s responses to estimate the point of subjective alignment (PSA) with the horizontal meridian for each condition.
We used a rapid event-related fMRI design with a total of eight conditions: the four motion conditions (lc-off, lc-on, hc-off, hc-on), the three checkerboard conditions (cb-up, cb-mid, cb-low), and a fixation condition, where only the central fixation cross was visible. Rapid event-related designs allow for a high number of stimulus presentations with close spacing of trials. A trial of each condition started with the presentation of the visual stimulus for 833 ms and lasted 3 s in total. Participants were instructed to fixate the continuously presented fixation cross at all times and view the stimuli passively.
The trials (130 in each condition, 1040 in total) were presented in a pseudo-random sequence that was counterbalanced for a trial history of two trials (Buracas and Boynton, 2002). This sequence was split into six runs of 175 trials. Each run, except for the first one, began with a repetition of the last two trials of the previous run to keep the balanced trial history. Further, each run started with 8 s and ended with 10 s of the fixation condition. In total, each run consisted of 540 recorded volumes, lasting 9 min.
Magnetic resonance imaging data were obtained with a 3T Siemens Trio MR Imaging device (Siemens Medical Solutions, Erlangen, Germany) at the Brain Imaging Center Frankfurt using a standard 1-channel birdcage head coil. Each scanning session consisted of six functional runs and one anatomical scan, usually after the third functional run. For alignment with previous scans a low-resolution anatomical scans of the whole brain were recorded using a T1-weighted magnetization prepared rapid gradient echo sequence (TR = 2250 ms, TE = 4 ms, flip angle = 8°, 2 mm × 1 mm × 1 mm voxel resolution). For some subjects we had access to previously recorded high resolution data sets or recorded such a scan (MPRAGE TR = 2000 ms, TE = 4 ms, FA = 15º, 1 mm × 1 mm × 1 mm). For functional data acquisition we used an echo planar imaging sequence with high temporal resolution (TR = 1000 ms, TE = 30 ms, flip angle = 60°, 16 slices, voxel size = 3.3 mm × 3.3 mm × 4.0 mm) with slices oriented approximately parallel to the Calcarine sulcus. A short TR of 1 s was used to be able to accurately reconstruct BOLD time courses using a deconvolution approach (see below).
fMRI Data Analysis
All data analysis was done with the BrainVoyager QX software package (Brain Innovation, Maastricht, The Netherlands) and MatLab (The Mathworks Inc., Natick, MA, USA). The functional data was preprocessed with 3D-motion correction, intra-session alignment, slice scan time correction, linear trend removal, and a temporal high-pass filter at 0.01 Hz. The functional data was then sub-sampled to 1 mm × 1 mm × 1 mm resolution, coaligned with the anatomical scan from the same session and transformed into Talairach space (Talairach and Tournoux, 1988).
The rapid event-related design used closely spaced trials with one trial every 3 s, one of which is a fixation baseline condition (equally often presented as the other conditions). Because this leads to considerable overlap of the hemodynamic response to each trial, a general linear model (GLM) was fitted to the functional time course of each voxel to estimate the underlying hemodynamic responses to each stimulation condition by deconvolution (Serences, 2004). For each condition there were 20 box-car predictors in the GLM, one each for the 20 acquisition volumes recorded after each condition’s stimulus onset, covering the typical temporal range of a BOLD response.
We identified ROIs in left occipital cortex of each participant with the following method. First, we were interested in the region representing the space close to the trajectory endpoint in the right visual field. In this region we expect to see differences between predictably approaching moving objects and suddenly appearing moving objects. Second we aimed to contrast the BOLD responses to the different motion conditions in retinotopic regions just below the horizontal meridian, where the stimulus in the motion conditions was not physically presented. Motion toward the endpoint ending in a low-contrast offset does, however, lead to a perceived overshoot. There is only a gradual difference between the physical endpoint and the illusory overshoot; the mapping strategy was nevertheless optimized to gradual differences between conditions. We used flickering checkerboards just below and just above the horizontal meridian as localizer stimuli to identify ROIs (separated by 0.9–1.1 dva). Because the activations in response to the localizer checkerboards overlapped considerably, we contrasted each voxel’s activation in response to the upper and lower checkerboard (in a 5–7 s peristimulus time window, when the BOLD responses peaked) against the fixation condition. We displayed the relative contribution (RC) of the upper and the lower checkerboard condition to the activation on a statistical map. A RC value of 1 means that the activation of the voxel is completely explained by the GLM predictors from the lower checkerboard condition, and a RC value of −1 means that the upper checkerboard explains the activation completely. We defined ROIs in 3D volume space above and below the meridian by selecting voxels with only negative or only positive RC values. Thresholds of RC maps were defined by a combined correlation value (and independent of the RC-value). Thresholds were individually adjusted to achieve roughly equal sizes of ROIs in each area across participants. In this way we identified ROIs within the Calcarine sulcus (V1), along the V2/V3 borders in ventral and dorsal visual cortex, and in area V3A (see Table 1; Figure 3) – even though the spatial separation was only small (around 1 dva). ROI locations were visually checked on a cortical surface reconstruction. We focused on these early areas because of their relatively fine-grained representation of retinotopic space. Higher areas, especially MT+/V5, would potentially be interesting in this context, however, given the relatively small psychophysical effect and the close spacing of our localizer stimuli ROIs could not be selected using the same method, as receptive field sizes (or population receptive fields) and thus the overlap of activations to the different conditions were too big. The same strategy was successfully followed in earlier studies of our lab were we investigated long range apparent motion with stimuli separated by more than 8° visual angle (Muckli et al., 2005).
For these ROIs we then recomputed fixed-effects deconvolution GLMs by averaging voxels within each ROI and concatenating time courses for each ROI from all participants. To estimate signal response amplitude we used two different measures. First, we quantified BOLD response amplitude to each motion condition by summing beta values from the GLM for the predictors of time points between 3 and 8 s after stimulus onset. This time window covers the typical BOLD response to our stimuli. Summing across this large time window gives a robust estimate of overall strength of BOLD response despite differences in latency to peak and dispersion of the BOLD responses between conditions. In a second analysis we fitted gamma functions to the beta values of the GLM using MatLab’s nlinfit algorithm. Gamma functions were of the form
where k is a scaling factor, tp is the time-to-peak, w is the width at half maximum, and c is a constant offset. All parameters were fitted freely. We quantified the BOLD response by the product p*w (where p = f (tp) is the peak amplitude), which represents the triangular area spanned by the peak point and the two half-peak points under the curve of the gamma functions.
In the psychophysical task outside the scanner participants judged the perceived start (end) point of a moving bar that started (stopped) moving on the horizontal meridian. Gradually fading moving bars were perceived to overshoot the physical end point. In the condition with a low-contrast offset (lc-off) the bar needed to physically disappear before the horizontal position to be perceived as disappearing there. The mean point of subjective alignment (PSA) as calculated from psychometric functions fitted to the responses of five participants in the lc-off condition was −2.58° (SEM = 1.19°; see Figure 2). High-contrast offsets near the horizontal meridian were perceived accurately (mean PSA 0.13°, SEM = 0.63°). The two conditions showed a statistical trend to differ reliably from each other (Wilcoxon signed-rank test, N = 5, Z = 1.75, p = 0.080).
Figure 2. The mean perceived offset and onset positions from the psychophysics experiment outside of the MRI scanner (n = 5). A low-contrast offset (lc-off) leads to a perceived overshoot, high-contrast offsets (hc-off) are perceived accurately. Onsets are mislocalized forward in the direction of motion, the well-documented Fröhlich effect. Error bars are standard errors of the mean.
Onset positions for the moving bar with both low and high contrast onsets are misperceived as shifted forward, the well-documented Fröhlich effect (Fröhlich, 1923; Kirschfeld and Kammer, 1999). This forward misperception is larger for the low-contrast onset (lc-on) because it takes longer to register a low-contrast stimulus. The mean PSAs for low and high-contrast onsets are 7.71° (SEM = 1.37°) and 2.11° (SEM = 0.70°), respectively, Z = 2.02, p = 0.021.
Figure 3 shows the selected ROIs for two participants, projected onto an inflated reconstruction of visual cortex. This was performed for visualization purposes only and to confirm that the locations of ROIs in V1–V3A follow the pattern expected for visual positions around the horizontal meridian. The main analysis was performed in voxel space. Blue-turquoise areas represent retinotopic regions below the horizontal meridian with positive RC values, where the lower checkerboard contributed more to the voxels’ activation than the upper checkerboard. Yellow–orange areas are negative RC values and represent regions just above the horizontal meridian. Using these RC values we defined ROIs in the functional volume data for all participants.
Figure 3. The ROIs in the left hemisphere of two participants. The view is on the occipital pole of “inflated” cortices. Blue-turquoise areas show positive, yellow-orange negative RC values (positive RC values are retinotopic positions below the horizontal meridian). The circled areas show the selected ROIs in V1 (in the Calcarine sulcus), along the V2/V3 border in dorsal and ventral cortex, and in V3A.
Figure 4A shows the BOLD time courses within the ROIs in response to the four motion conditions. Clockwise conditions with offsets near the ROIs are shown in blue, counter-clockwise (onset) conditions in red. Low-contrast conditions are shown in light colors, high-contrast conditions in fully saturated colors. Figure 4B shows the sum of beta values from the deconvolution GLM in a time window 3–8 s after stimulus onset. This time window covers the temporal extent of the BOLD response and the sum of betas indicates the overall strength of BOLD response. Figure 4C shows an additional way to quantify the BOLD response to the motion conditions. Gamma functions were fitted to the betas from the deconvolution GLM, and the area under the curve is taken as an index of BOLD response magnitude (see Materials and Methods).
Figure 4. (A) BOLD time courses (as reconstructed by a fixed-effects deconvolution GLM) in response to the four motion conditions and fitted gamma-functions. (B) Sum of beta values from the deconvolution GLM in a time window 3–8 s after stimulus onset. (C) Area under curve as an index of BOLD response magnitude.
The first thing to note is that in V1 and V2/V3 high-contrast on- and offsets lead to higher BOLD responses than low-contrast on-/offsets. In V3A this difference is less pronounced. Second, in all ROIs offset conditions with motion toward the ROI lead to higher BOLD responses than onset conditions with motion away from the ROI, albeit to different degrees. The stimuli in the offset conditions differ from onset conditions only in their motion direction; the summed contrast is identical.
We compared the offset conditions to the onset conditions by computing t-contrasts for the time points between 3 and 8 s after stimulus onset. Mean beta values, differences between offset and onset conditions, and results of the statistical comparisons are shown in Table 2. In area V1 differences between onsets and offsets were small and not significant. For the ROIs in V2/V3 the differences are equally small and non-significant for low-contrast conditions, but there are reliable differences between offsets and onsets for the high contrasts conditions. In area V3A, we need to distinguish between the ROIs representing areas above and below the meridian. The pattern in the ROI above the meridian resembles that found in the V2/V3 ROIs. In the ROI below the meridian the difference between low-contrast offset and onset is marginally significant (uncorrected, see Table 2).
Table 2. Sum of beta weights for predictors 3–8 s after stimulus onset in each condition for all ROIs (the same data is plotted in Figure 4B).
Although not significant in all ROIs examined, there is a general trend for clockwise motion toward the ROI at the 3 o’clock position to result in higher BOLD amplitudes than counter-clockwise motion away from the ROI. We analyzed one additional ROI roughly at the boundary of area V1v and V2v, retinotopically at the 12 o’clock position (see Supplementary Material). As expected, the trend in this ROI was reversed, so that counter-clockwise motion, moving toward this ROI, lead to marginally higher amplitudes than clockwise motion, receding from this ROI.
The onset conditions with counter-clockwise motion (red curves in Figure 4A) led to earlier BOLD peaks than the offset conditions with clockwise motion (blue curves). This earlier peak is predicted from our stimulus design, because motion in the onset conditions started near the ROI at the beginning of each trial, whereas in the offset conditions the stimulus moved toward the ROI and terminated there 833 ms after the start of each trial. The time-to-peak parameter of the gamma-function fits can be used to quantify this. The differences between onset and offset conditions’ time-to-peak are shown in the right-most column of Table 2. In general, the differences are smaller than the expected value of 833 ms. This indicates that the object moving toward the ROIs is extrapolated and therefore the peak of activity is reached earlier as it would be, if it would follow the stimulus with a constant lag. Consistent with this interpretation, the difference between time-to-peak of offset and onset conditions is generally shorter in ventral ROIs (representing space above the meridian), since the clockwise moving object arrives there even earlier. Also, differences are smaller in higher areas, possibly due to larger receptive fields. However, these values have to be interpreted with caution. fMRI does not allow making judgments about the exact timing of neural processes, especially when comparing time courses between different areas, since hemodynamic properties might differ between areas and are summed over population receptive fields of different sizes.
We were looking for predictive activity in response to moving stimuli. By comparing activity in a region next to the start-/endpoint of a moving object’s trajectory, we were able to assess whether the cortical representation of a moving object in the respective retinotopic map was shifted in the direction of motion. Higher activity for motion toward the ROI, especially in the unstimulated region below the meridian, is indicative of a forward shift of the object’s representation.
Regarding the representation of the moving object’s start and end position, the following pattern emerges. In V1, positions are represented accurately with a small but not statistically reliable bias for higher activity in response to objects moving toward the ROI. The far bigger determinant of activity in V1 is contrast, with higher-contrast stimuli leading to higher activity.
In V2/V3 contrast is still the main determinant of strength of response, but the difference between high- and low-contrast conditions is smaller than in V1. Interestingly, there is a large difference between motion directions for high-contrast objects: motion toward the ROI leads to reliably higher activation than objects moving away from it. This might reflect the Fröhlich effect, indicating that the receding object only becomes visible in a later position of the trajectory, or extrapolation of the moving bar toward the endpoint of the trajectory. Psychophysically both effects are larger for low-contrast objects, but the difference in BOLD is observed reliably only for higher contrast objects. Note that the low-contrast bar was barely visible near the ROI, and thus observing reliable differences in BOLD would also be difficult. The effects of both extrapolation at the trajectory endpoint and the Fröhlich effect are additive, i.e., both would lead to larger differences between motion toward and away from the ROI. To distinguish between the two effects the pattern of responses in ROIs just above and just below the meridian needs to be analyzed more closely. Although both ROIs are very close in retinotopic space, and spread of the BOLD signal might lead to overlap, the Fröhlich effect should influence the ROI above the meridian more, and vice versa motion extrapolation should influence the ROI below the meridian more. Interestingly, the effect in dorsal V2/V3 is slightly larger, indicating that extrapolation beyond the trajectories end point does indeed cause some of the additive effects.
In the ROIs in area V3A the pattern is similar to V2/V3: high-contrast offsets caused higher activity than onsets. Additionally, low-contrast offsets caused larger activation than onsets specifically in the ROI below the meridian. Again the differences between the motion directions are generally larger in the ROI below the meridian. Because this ROI is located below the horizontal meridian in retinotopic space, stronger activation for motion toward versus away from the ROI means that the neural representation of the moving stimulus in area V3A is shifted forward in the direction of motion beyond the final position of the trajectory for both gradual and abrupt motion offsets.
In summary, area V1 did not show significant forward shifts, although all trends were in this direction. Area V3A showed predictive shifts, both for objects disappearing in a low-contrast offset, for which a perceptual shift illusion occurs, and for high-contrast offsets, which are usually perceived accurately. Note that the finer grained retinotopic maps in V1 would make it more likely to observe a retinotopic forward shift. The forward shift was however most prominent in V3A even though population receptive fields are bigger and a wash-out of effects therefore more likely. The differences in time-to-peak of the BOLD response confirm this finding, although these estimates should be interpreted with caution, since they are beyond the temporal limits of fMRI.
Visual attention to the moving object could contribute to the difference in BOLD response described here. Participants might attentively track the moving object along its trajectory, which is why attention is employed when a moving object moves toward the ROI, but not in conditions when it abruptly starts moving there. To assess this possibility we analyzed activity in ROIs both above and below the horizontal meridian. Both regions should similarly be affected by attention. The difference between offset and onset conditions in V2/V3 and V3A was slightly reduced in the region above the meridian, indicating that some part of the effect is specific to the region beyond the trajectory end point and arguing against an explanation of the differential activation solely in terms of visual attention. Regardless, an attentional contribution to the results presented here is not to be construed as a contradiction to a retinotopic forward shift. Cue-induced focal attention might be contributing to psychophysical forward displacements, as previously suggested by Kirschfeld and Kammer (1999).
Area V3A showed the most predictive activity overshooting the trajectory end point, regardless of whether motion was terminated in a strong or a weak transient. In other words, the measured BOLD effect was independent of the perceptual effects, which show perceived forward-shifts for fading objects only, but accurate localization for abrupt offsets (Maus and Nijhawan, 2006). Other studies also reported motion-dependent shifts of neural (Sundberg et al., 2006) or hemodynamic (Whitney et al., 2003) activity, regardless of the presence of a perceptual shift illusion. These findings imply that separate positions for a moving object are represented at different stages of cortical processing. In our study, the representation in motion sensitive area V3A utilizes information from the motion trajectory to create an anticipation of object position. This extrapolated representation is possibly fed back to the fine-grained retinotopic map in V1, where it can bias object position toward future positions (cp. Erlhagen, 2003; McGraw et al., 2004; Sterzer et al., 2006; Wibral et al., 2009; Alink et al., 2010; Muckli, 2010). However, V1 activity is more strongly influenced by retinal inputs, which can override the motion predictions in the event of sudden stimulus events like an abrupt high-contrast offset. The transient signal elicited by an abrupt offset on the retina inhibits the perception of extrapolated positions (Maus and Nijhawan, 2006, 2008, 2009). The final perceived position is based on an interplay of the extrapolated representation in higher motion-sensitive areas and bottom-up information expressed in area V1 about the nature of the disappearance. In the presence of a strong transient signal there, the extrapolated position is corrected. In the absence of strong transients however, the extrapolated position is perceived.
The high-contrast onset condition (hc-on) is the least predictable event from our four conditions. In some studies BOLD response is seen to be higher in response to prediction errors (Friston, 2005; Alink et al., 2010; Den Ouden et al., 2010). Coding of prediction error would act against the effect that we have observed in this study and that we relate to the prolongation of motion trajectory in response to motion path prediction. It is difficult to parcel out these antagonistic effects, what remains however is an increased activity for the conditions of movement toward the end point.
Our focus on V3A was due to the clear retinotopic separation of ROIs in this area. Another strong candidate to look for extrapolated position representations for moving objects is the human motion complex MT+/V5. In the present study area MT+/V5 was activated by all motion conditions and the checkerboard localizers (data not shown). It was however not possible to define ROIs in the same ways as in V1 thru V3A, because the overlap of activity in response to the different localizer stimuli was too big. Receptive fields of neurons in MT+/V5 are larger than in V1 and V3A, and retinotopic organization is known to be less prevalent. It is, however, likely that MT+/V5 plays a strong role in motion-induced position shifts. Transcranial magnetic stimulation (TMS) studies of MT+/V5 revealed the functional necessity of the area in the perception of motion (Pascual-Leone and Walsh, 2001; Silvanto et al., 2005; Sack et al., 2006), and there is evidence for a role of area MT+/V5 in motion-induced perceptual shifts (McGraw et al., 2004; Whitney et al., 2007).
Other studies investigating motion-induced mislocalizations have found position shifts in several retinotopic maps. Whitney et al. (2003) reported a shift of hemodynamic activity in the opposite direction of motion, when investigating the cortical retinotopic representation of stationary drifting Gabor gratings in V1. This shift is thought to be caused by inhibitory processes at the trailing part of the motion stimulus (but see Liu et al., 2006). Our finding, however, is consistent with neurophysiological findings showing neural anticipation of a moving stimulus ahead of actually stimulated retinotopic positions. Berry et al. (1999) reported this sort of predictive shift in the retina; other studies have shown predictive shifts of activity in cat V1 (Fu et al., 2004; Jancke et al., 2004) and monkey V4 (Sundberg et al., 2006). Our study is the first to show a predictive retinotopic forward shift in human visual cortex, specifically in area V3A.
Area V3A is highly sensitive to visual motion (Tootell et al., 1997; Smith et al., 1998). Recent studies using TMS attest to V3A’s role in the perception of speed and chromatic motion (McKeefry et al., 2008, 2010). In the present study V3A does not respond differentially to high and low stimulus contrasts, consistent with earlier reports (Tootell et al., 1997). The sole determinant of V3A activity with the present stimuli seems to be the direction of motion, regardless of contrast. Direction-selectivity in V3A has previously been shown with direction-selective fMRI adaptation (Nishida et al., 2003; Ashida et al., 2007). Furthermore, area V3A and nearby areas along the transverse and lateral occipital sulci have been implicated in the perception of apparent motion of complex three-dimensional shapes (Weigelt et al., 2007). Another study showed that anticipatory activity in area V3A predicts the perceptual accuracy of observers in a spatial attention cuing paradigm (Sylvester et al., 2007). Despite the diversity of proposed roles for area V3A, prediction of object location might be common to the above-mentioned studies. It is suitably situated in the visual hierarchy to influence both fine-grained spatial representations in V1 via feedback connections and goal-directed limb actions via feed-forward connections up the dorsal stream.
To conclude, the present study presents evidence for a forward shift of the neural representation of a moving object in motion-sensitive area V3A. This predictive activity may contribute to perceptual forward displacements such as the flash-lag effect (Nijhawan, 1994) and the forward shift of the final positions of fading objects (Maus and Nijhawan, 2006). Strong representations of transient signals in area V1 can interfere with these higher-level representations and thus facilitate the accurate localization of final positions of abruptly disappearing objects.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors wish to acknowledge the generous support of this study by Wolf Singer, Rainer Goebel, David Whitney, and BBSRC grant BB/G005044/1 on “Visual Prediction.”
Kessler, K., Gordon, L., Cessford, K., and Lages, M. (2010). Characteristics of motor resonance predict the pattern of flash-lag effects for biological motion. PLoS One 5, e8258. doi: 10.1371/journal.pone.0008258.
Kraft, A., Schira, M. M., Hagendorf, H., Schmidt, S., Olma, M., and Brandt, S. A. (2005). fMRI localizer technique: efficient acquisition and functional properties of single retinotopic positions in the human visual cortex. Neuroimage 28, 453–463.
McKeefry, D. J., Burton, M. P., and Morland, A. B. (2010). The contribution of human cortical area V3A to the perception of chromatic motion: a transcranial magnetic stimulation study. Eur. J. Neurosci. 31, 575–584.
McKeefry, D. J., Burton, M. P., Vakrou, C., Barrett, B. T., and Morland, A. B. (2008). Induced deficits in speed perception by transcranial magnetic stimulation of human cortical areas V5/MT+ and V3A. J. Neurosci. 28, 6848–6857.
Muckli, L., Kohler, A., Kriegeskorte, N., and Singer, W. (2005). Primary visual cortex activity along the apparent-motion trace reflects illusory perception. PLoS Biol. 3, e265. doi: 10.1371/journal.pbio.0030265.
Sack, A. T., Kohler, A., Linden, D. E., Goebel, R., and Muckli, L. (2006). The temporal characteristics of motion processing in hMT/V5+: combining fMRI and neuronavigated TMS. Neuroimage 29, 1326–1335.
Smith, A. T., Greenlee, M. W., Singh, K. D., Kraemer, F. M., and Hennig, J. (1998). The processing of first- and second-order motion in human visual cortex assessed by functional magnetic resonance imaging (fMRI). J. Neurosci. 18, 3816–3830.
Sylvester, C. M., Shulman, G. L., Jack, A. I., and Corbetta, M. (2007). Asymmetry of anticipatory activity in visual cortex predicts the locus of attention and perception. J. Neurosci. 27, 14424–14433.
Tootell, R. B. H., Mendola, J. D., Hadjikhani, N. K., Ledden, P. J., Liu, A. K., Reppas, J. B., Sereno, M. I., and Dale, A. M. (1997). Functional analysis of V3A and related areas in human visual cortex. J. Neurosci. 17, 7060–7078.
- ^For the remainder of the document, dva denotes “degree visual angle.” The degree sign “°” is used for degrees of rotation along the circular trajectory of the moving stimuli.
Keywords: visual motion, prediction, localization, fMRI, V1, V3A
Citation: Maus GW, Weigelt S, Nijhawan R and Muckli L (2010) Does area V3A predict positions of moving objects?. Front. Psychology 1:186. doi: 10.3389/fpsyg.2010.00186
Received: 27 February 2010;
Accepted: 11 October 2010;
Published online: 12 November 2010.
Edited by:Frans Verstraten, Universiteit Utrecht, Netherlands
Reviewed by:Ryota Kanai, University College London, UK;
Richard J. A. Van Wezel, Utrecht University, Netherlands;
Mark M. Schira, University of New South Wales, Australia
Copyright: © 2010 Maus, Weigelt, Nijhawan and Muckli. This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited.
*Correspondence: Gerrit W. Maus, Department of Psychology, University of California at Berkeley, 3210 Tolman Hall, Berkeley, CA 94720-1650, USA. e-mail: firstname.lastname@example.org