Contribution of Visuospatial and Motion-Tracking to Invisible Motion

Battaglini, Luca; Casco, Clara

doi:10.3389/fpsyg.2016.01369

ORIGINAL RESEARCH article

Front. Psychol., 14 September 2016

Sec. Perception Science

Volume 7 - 2016 | https://doi.org/10.3389/fpsyg.2016.01369


Department of General Psychology, Perception, and Psychophysics, University of Padova, Padova, Italy

Abstract

People experience an object's motion even when it is occluded. We investigated the processing of invisible motion in three experiments. Observers saw a moving circle pass behind an invisible, irregular hendecagonal polygon and had to respond as quickly as possible when the target had “just reappeared” from behind the occluder. Without explicit cues allowing the end of each of the eight hidden trajectories to be predicted (length ranging between 4.7 and 5 deg), we found anticipation errors, as expected if visuospatial attention was involved, provided that information on pre-occluder motion was available. This indicates that the observers, rather than simply responding when they saw the target, tended to anticipate its reappearance (Experiment 1). The new finding is that a linear relationship between the physical and judged occlusion duration is found with a fixation mark indicating the center of the invisible trajectory, but not without it (Experiment 2) or with a fixation mark varying in position from trial to trial (Experiment 3). We interpret the role of central fixation in distinguishing trajectories that differ by less than 0.3 deg by suggesting that it reflects spatiotemporal computation and motion-tracking. These two mechanisms allow a visual image to be formed of the point symmetrical to the point of disappearance with respect to fixation, and then allow the occluded moving target to be tracked up to this point.

Introduction

The visual experience of motion elicited by an object moving behind a stationary occluder has often attracted the attention of psychologists because of the paradoxical fact that the object persists in being “seen” as continuously moving behind the occluder through time, even though it is no longer projected onto the retina. One of the first demonstrations of occluded (“invisible”) motion is given by Michotte (Michotte et al., 1964, 1991). Within this acceptation, invisible motion is another example of a motion phenomenon that involves the subjective impression of an object following a path even in the absence of any physical stimulus, such as during apparent motion (Wertheimer, 1912). Within this framework are the studies that conceive invisible motion as equivalent to an amodal filling-in and as involving neural activation to visible motion (Michotte et al., 1964, 1991; Pessoa and Neumann, 1998; Horowitz et al., 2006; Komatsu, 2006). Empirical evidence comes from the finding that distractors moving over the occluder interfere with invisible motion (Lyon and Waag, 1995). At the neurophysiological level, Barborica and Ferrera (2003) have provided direct evidence of the existence of velocity sensitive neurons in the frontal eye fields that fire during periods of occlusion.

A different and widely accepted model of occluded-motion processing, investigated by DeLucia and Liddell (1998) and expanded upon by Makin and Poliakoff (2011), is the tracking hypothesis. They claim that the position of a hidden moving object is “extrapolated” by tracking the position of the target through the shift of the spotlight of visuospatial attention, which is guided by the motion pursuit system. Furthermore, they posit that, when the target disappears, visible velocity information stored in short-term velocity memory guides pursuit eye movements across the temporal intervals during which the target is occluded (Bennett and Barnes, 2006; Makin and Chauhan, 2014). Indeed, invisible motion is affected by factors that affect perceived visible speed before occlusion such as, for example, changes in the target's contrast and size (Battaglini et al., 2013), prior adaptation (Gilden et al., 1995; Battaglini et al., 2015) and previously viewed velocity (Makin et al., 2008). In Makin and Poliakoff's model, it is irrelevant whether the eyes follow the hidden moving object or not, thus absorbing into the model the evidence that premotor pursuit commands do not need pursuit execution to be active (Rizzolatti et al., 1994; Barnes et al., 1997; Eimer et al., 2007). In its complete account, the model posits that “velocity store and premotor modules guide tracking of occluded targets during motion extrapolation, even if fixation is maintained” (Makin and Poliakoff, 2011).

From this account, visuospatial attention seems to rely exclusively on the memory of visible motion. However, in particular, the results of Lyon and Waag (1995) and Barborica and Ferrera (2003) suggest that motion information that is also acquired during the occluded trajectory may be used to judge target reappearance. If this were the case, then the imagery of an occluded target in motion could guide pursuit eye movements across the temporal intervals during which the target is occluded (Lu and Sperling, 1990; Sears and Pylyshyn, 2000; Shioiri et al., 2000; Huber and Krist, 2004; de'Sperati and Deubel, 2006; Jonikaitis et al., 2009). The internal model of the moving target can be tracked smoothly, even though the target is not physically present, allowing the target position to be updated very precisely at every (very close) local image point along the occluded trajectory (Shioiri et al., 2000). Shioiri et al. (2000) indeed showed that observers judge the apparent location of a target in invisible motion relative to an imaginary cue with high precision, suggesting that the target motion behind the occluder can be tracked and that any position of the target along the occluded trajectory can be precisely judged, providing that this point is made salient by visual imagery.

Spatiotemporal computation is needed to form an internal representation of a moving object. Thus, rather than using remembered speed to track one speed dimension (location) to judge the other (time), motion-tracking uses remembered speed to track the two dimensions combined (motion) and to infer time (Cavanagh, 1992; Verstraten et al., 2000; Shioiri et al., 2002). Rather than exploiting information achieved by spatial filtering, motion tracking exploits information provided by spatiotemporal filters, i.e., filters devoted to spatiotemporal computation underlying the coding of speed by the motion system (see Burr and Thompson, 2011; Mather et al., 2012, for a review). Doherty et al. (2005) showed that when pre-occluder motion generated expectations concerning the where and when of reappearance, reaction times to reappearances are shortened, especially when spatial and temporal expectations combine. These differences may reflect a difference with respect to the way covert-attention is deployed during occlusion: attention directed to space and time combined (motion) may be more efficient than visuospatial attention directed to space alone.

To assess the role of motion tracking we need to demonstrate that the time of arrival is judged on the basis of space and time combined, rather than on the computation of a separate motion dimension—either space or time. To this end, we made the occluder invisible and its shape unpredictable (as Figure 1 shows, it was an irregular hendecagonal polygon with bilateral symmetry in all directions), and abolished the reappearance cue that is typically used in experiments on motion extrapolation. In these conditions, spatiotemporal computation was precluded and observers were forced to respond either when they actually saw the target reappear or when they predicted its reappearance by “learning” the average trajectory length (spatial cue) or the average duration of occlusion (time cue). However, by placing a spatial cue centered on the invisible occluder we created the conditions for spatiotemporal computation. Indeed, occlusion duration can be combined with trajectory length (from disappearance to the cue centered on the occluder) to judge precisely when the target reaches the central cue. Assuming the lengths of the trajectory before and after the central cue are equal, reappearance can be “visualized” by imagery to allow spatiotemporal computation and motion pursuit from the central cue to reappearance. If the fixation mark is not central, motion tracking would never allow reappearance to be judged precisely. The same outcome is expected if the fixation mark is absent.

Figure 1

To establish the role of the spatiotemporal computation underlying motion tracking and evaluate its precision, we need evidence that anticipation errors also occur, to guarantee that reappearance is anticipated. Most importantly, we need evidence of a linear relationship between the estimated time to reappearance (TTR_estimated, calculated from the moment at which the target is in the center of the invisible occluder to the button press) and the actual duration of the half-trajectory (TTR_physical).

To sum up, predictions depend on whether stimulus conditions allow motion tracking or not:

(a) If the visible speed, occluder shape (irregular and invisible), and reappearance point are unknown, then observers cannot predict (anticipate) the target's reappearance from behind the occluder and are forced to respond when they actually see the target. We predict a linear relationship between TTR_physical and TTR_estimated, with no anticipation errors.
(b) If the visible speed is known but the occluder shape is not (irregular and invisible), there is no reappearance cue, and the central cue is either absent or not central, then the exact reappearance point is unknowable. However, reappearance may still be predicted from an imprecise, inferred occluder shape, using the average duration of the trajectories as a cue. In this case, anticipation errors may occur, but TTR_physical and TTR_estimated are not positively related because the average trajectory length differs from the individual trajectory lengths. Note that if an observer uses an average strategy to judge the duration of occlusion, we should obtain a flat slope when plotting 2 × TTR_estimated against 2 × TTR_physical. However, since on the y axis we considered the duration estimated from the center of occlusion (see Analysis section), we also removed half of the entire physical duration on the x axis, which (obviously) differs with trajectory length: it is smaller for a short trajectory and larger for a long one. This way, when plotting TTR_estimated against TTR_physical we should obtain a negative slope when people predict target reappearance using an average value of the occlusion lengths. Moreover, to confirm that observers estimate an average duration of occlusion from the different trajectory lengths, a linear relationship with a negative slope between the RT (TTR_estimated − TTR_physical) and TTR_physical is also expected.
(c) If the visible speed is known but the occluder shape is not (irregular and invisible) and there is no reappearance cue, but there is a visible cue centered on the occluder, then this may allow a spatiotemporal computation and the formation of an internal representation of the occluded moving target, so that it can be “tracked” during its trajectory from disappearance to the central cue and from there to reappearance, “visualized” as symmetrical to the disappearance with respect to fixation. In addition to anticipation errors, a linear relationship between TTR_physical and TTR_estimated is expected. Thus, the crucial finding for inferring that motion tracking based on spatiotemporal computation has occurred is the linear relationship between TTR_estimated and physical duration.
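The negative slopes predicted for the average-duration strategy can be illustrated with a small simulation. This is only a sketch under simplifying assumptions that are not part of the original study: a noise-free hypothetical observer who always responds after the mean occlusion duration, a single target speed, and four trajectory lengths chosen within the reported 4.7 to 5 deg range. All variable names are illustrative.

```python
# Sketch of a hypothetical, noise-free "average duration" observer.
# Assumption: the observer presses the key a fixed time after disappearance,
# equal to the mean occlusion duration across trajectories.

SPEED_DEG_PER_S = 3.0                 # slower of the two target speeds
LENGTHS_DEG = [4.7, 4.8, 4.9, 5.0]    # illustrative hidden trajectory lengths


def slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Time from the occluder center to reappearance, in seconds.
ttr_physical = [(length / 2) / SPEED_DEG_PER_S for length in LENGTHS_DEG]
full_physical = [2 * t for t in ttr_physical]
mean_occlusion = sum(full_physical) / len(full_physical)

# The key press always comes mean_occlusion after disappearance, so the
# estimated full duration is constant across trajectories (flat slope).
full_estimated = [mean_occlusion for _ in full_physical]

# Measured from the center, the first half of the physical duration is removed.
ttr_estimated = [mean_occlusion - t for t in ttr_physical]
rt = [est - phys for est, phys in zip(ttr_estimated, ttr_physical)]

slope_full = slope(full_physical, full_estimated)    # flat
slope_estimated = slope(ttr_physical, ttr_estimated)  # negative
slope_rt = slope(ttr_physical, rt)                    # more negative

print(slope_full, slope_estimated, slope_rt)
```

Under these assumptions the full estimated duration does not vary with the physical duration, whereas the slope of TTR_estimated against TTR_physical is −1 and that of RT against TTR_physical is −2. Real observers would add response noise, so only the sign of the slopes should be read as the prediction.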

Experiment 1

Experiment 1 aims to disentangle outcome (a) from outcomes (b) and (c). Whereas pre-occluder motion allows participants to anticipate the target reappearance, this is impossible without pre-occluder motion, and observers can only respond when they see the target. That is, in this second (baseline) condition, without pre-occluder motion, we do not predict anticipation errors, whereas TTR_estimated should still depend on trajectory length.

Methods

Participants

Seven students from the University of Padova (4 female, 3 male; age 19–22 years) participated voluntarily in Experiment 1. The participants remained unaware of the true aims of the experiment until they completed the task. All of the participants gave written informed consent in accordance with the Declaration of Helsinki.

Stimuli, apparatus, and procedure

The participants were placed in a dark room, seated 57 cm away from the display screen. The viewing was monocular, and both eyes were tested. Stimuli were generated with Matlab Psychtoolbox (Brainard, 1997; Pelli, 1997) and displayed on a 19-inch Asus monitor with a refresh rate of 60 Hz. The screen resolution was 1920 × 1080 pixels. Each pixel subtended ~1.5 arcmin. The luminance of the background was 0.7 cd/m². The target was a small circle, 0.5 degrees of visual angle (deg) in diameter, whose motion became invisible when it passed behind an invisible irregular hendecagonal polygon. A fixation cross 0.3 deg long and 0.1 deg wide (60 cd/m²) was placed in the center of the occluder. Both had a luminance (as measured by a Minolta LS-100 photometer) of 90 cd/m². In one block, the target initiated a linear trajectory after a randomly chosen interval of 0–2000 ms from an acoustic cue, starting either 7.5 or 10 deg from the center of the screen, and terminated 4 deg after reappearance. In the other block, the visible pre-occluder trajectory was removed and the target motion started from the center of the occluder (the target was invisible behind the occluder). In this block, the observers knew where but not when the hidden trajectory started. The target speed (either 3 or 6 deg/s) was randomly selected within each block. The direction was randomly chosen within each block. In the condition with pre-occluder motion available, the trajectory could begin from either side of the screen, from one of eight specified directions, separated by a 45 deg sector of a virtual circumference: 0–180 (horizontal), 45–225 (diagonal, from upper-right to lower-left and vice versa), 90–270 (vertical), and 135–315 deg (diagonal, from upper-left to lower-right and vice versa). Because the polygon is irregular, the hidden trajectory had a different length for each direction (Figure 1): 0–180, 5 deg; 90–270, 4.9 deg; 45–225, 4.75 deg; 135–315, 4.7 deg.
Each block consisted of 64 trials: 2 repetitions of each direction, speed and starting position (7.5 or 10 deg). In all of the blocks, the participants were required to fixate on the central cross. A chin-rest was used to limit head movement.
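The factorial structure of a block with pre-occluder motion can be sketched as follows. The experiment itself was programmed in Matlab Psychtoolbox; this Python fragment is only an illustration of how 8 directions × 2 speeds × 2 starting positions × 2 repetitions yield 64 trials, and the function name, dictionary keys, and seeding are assumptions rather than the authors' code.

```python
import itertools
import random

DIRECTIONS_DEG = [0, 45, 90, 135, 180, 225, 270, 315]  # eight motion directions
SPEEDS_DEG_PER_S = [3, 6]                              # two target speeds
START_DISTANCES_DEG = [7.5, 10]                        # start distance from screen center
REPETITIONS = 2


def build_block(seed=None):
    """Return a shuffled list of trials for one block (hypothetical helper)."""
    rng = random.Random(seed)
    trials = []
    for direction, speed, start in itertools.product(
        DIRECTIONS_DEG, SPEEDS_DEG_PER_S, START_DISTANCES_DEG
    ):
        for _ in range(REPETITIONS):
            trials.append(
                {
                    "direction_deg": direction,
                    "speed_deg_s": speed,
                    "start_deg": start,
                    # Motion onset follows the acoustic cue by 0-2000 ms.
                    "onset_delay_ms": rng.uniform(0, 2000),
                }
            )
    rng.shuffle(trials)  # randomize trial order within the block
    return trials


block = build_block(seed=1)
print(len(block))
```

Drawing the onset delay per trial and shuffling the full list keeps every combination equally frequent while leaving the trial order unpredictable.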

The participants' task was to respond as quickly as possible when the target “just reappeared.”

Analysis

The physical time to reappearance (TTR_physical) for each of the four trajectory lengths of 4.7, 4.8, 4.9, and 5 deg corresponded to 783, 800, 816, and 833 ms with the low-speed target and 391, 400, 408, and 417 ms with the high-speed target, respectively [TTR_physical = (invisible trajectory length/2)/target speed; TTR_physical was calculated from the center of the occluder because in one block of Experiment 1 the target started from the center]. We considered three dependent variables: (a) the estimated TTR (TTR_estimated), which corresponded to the response time measured from the center of the occluder to the key press: TTR_estimated = TTR_physical + RT; (b) the RT, which is equal to the estimate of the entire duration of occlusion minus the entire physical duration of occlusion: (TTR_estimated + TTR_physical) − 2 × TTR_physical, i.e., the estimated duration (the physical duration of the first half of the occlusion plus TTR_estimated) minus the entire physical duration of occlusion, which reduces to TTR_estimated − TTR_physical; and (c) anticipation errors (negative RTs). Individual regression lines were fitted to evaluate the relationship between TTR_physical and TTR_estimated, and between the RT and TTR_physical. We used either t-tests or ANOVA to compare the individual slopes obtained in the condition with a fixed central cue with those obtained in the control condition of each experiment.
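The definitions above can be checked with a short calculation. The sketch below recomputes TTR_physical from the stated formula and derives RT and the anticipation flag from a key-press time measured from the occluder center; the function names and the example key-press time are hypothetical and serve only as illustration.

```python
LENGTHS_DEG = [4.7, 4.8, 4.9, 5.0]              # hidden trajectory lengths
SPEEDS_DEG_PER_S = {"low": 3.0, "high": 6.0}    # target speeds


def ttr_physical_ms(length_deg, speed_deg_per_s):
    """Time from occluder center to reappearance: (length / 2) / speed, in ms."""
    return (length_deg / 2.0) / speed_deg_per_s * 1000.0


ttr_table = {
    name: [ttr_physical_ms(length, speed) for length in LENGTHS_DEG]
    for name, speed in SPEEDS_DEG_PER_S.items()
}


def dependent_variables(ttr_estimated_ms, ttr_phys_ms):
    """Derive RT and the anticipation flag from a key press timed from the center."""
    rt_ms = ttr_estimated_ms - ttr_phys_ms      # RT = TTR_estimated - TTR_physical
    return {
        "ttr_estimated_ms": ttr_estimated_ms,
        "rt_ms": rt_ms,
        "anticipation": rt_ms < 0,              # negative RT = anticipation error
    }


# Hypothetical trial: 4.8 deg trajectory, low speed, key press 760 ms after
# the target passed the occluder center.
example = dependent_variables(760.0, ttr_table["low"][1])

for name, values in ttr_table.items():
    print(name, [round(v, 1) for v in values])
print(example)
```

The computed values agree with the reported ones to within 1 ms; the small differences come from rounding. In the hypothetical trial the key press precedes the physical reappearance, so the RT is negative and the trial counts as an anticipation error.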

Results

The results are shown in Figures 2, 3. In the pre-occluder motion condition, there were more individual anticipation errors, which were inversely and linearly related to individual mean RTs (Figure 2). Figure 2 also shows that the individual mean RTs are shorter with than without pre-occluder motion, indicating that a short RT can be another measure of the participants' tendency to anticipate target reappearance. Most importantly, in both conditions, TTR_estimated was directly related to TTR_physical, indicating an isomorphic relationship between these two variables, a result implying that trajectory length/duration was judged with high precision (Figure 3).

Figure 2

Figure 3

One-sample t-tests revealed that the anticipation errors (negative RTs) differed from 0 (no errors) in the condition in which the pre-occluder trajectory was present [t₍₆₎ = 3.151; p < 0.02] but not when it was absent [t₍₆₎ = 1.14; p < 0.2]. Moreover, the regression lines fitted to the anticipation errors as a function of RTs revealed a significant negative slope in the pre-occluder motion condition (slope = −0.75, R² = 0.4) but not when pre-occluder motion was absent (slope = 0.05, R² = 0.03).

The average TTR data showed a linear relationship between TTR_estimated and TTR_physical, both in the condition with pre-occluder motion (slope = 0.99, R² = 0.71) and in the baseline condition without pre-occluder motion (slope = 1.19, R² = 0.90). A t-test comparing the individual slopes obtained with pre-occluder motion present and absent was not significant [t₍₆₎ = 1.5; p = 0.17]. The results demonstrate that without pre-occluder motion, the observers responded when they saw the target. With pre-occluder motion present, the observers anticipated the target reappearance, and the evidence that TTR_estimated was isomorphic to TTR_physical indicates that they did so by a very precise spatiotemporal computation.
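The slope comparison reported here follows a two-step logic: fit one regression line per participant and condition, then compare the individual slopes across conditions. A minimal sketch of that logic is given below. It is not the authors' analysis code, the toy values are hypothetical rather than participant data, and it returns only the paired t statistic and degrees of freedom (a p-value would additionally require the t distribution).

```python
import math
import statistics


def fit_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def paired_t(slopes_a, slopes_b):
    """Paired t statistic and degrees of freedom for two lists of slopes."""
    diffs = [a - b for a, b in zip(slopes_a, slopes_b)]
    n = len(diffs)
    t_value = statistics.mean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))
    return t_value, n - 1


# Toy illustration (hypothetical values): an observer whose TTR_estimated
# grows one-for-one with TTR_physical plus a constant 150 ms.
ttr_physical_ms = [783, 800, 816, 833]
toy_estimated_ms = [t + 150 for t in ttr_physical_ms]
toy_slope = fit_slope(ttr_physical_ms, toy_estimated_ms)

# Toy paired comparison of three hypothetical slope pairs.
toy_t, toy_df = paired_t([2.0, 4.0, 6.0], [1.0, 2.0, 3.0])
print(toy_slope, toy_t, toy_df)
```

With seven participants, as in Experiment 1, the paired comparison has 6 degrees of freedom, which matches the subscripts of the reported t values.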

Experiment 2

We ran a second experiment to confirm the hypothesis that, whereas anticipation errors may result from a computation of average trajectory length, the linear relationship between physical and judged trajectory duration does not. Shioiri et al. (2000) have shown that participants can precisely judge the apparent location of a target in invisible motion relative to an imaginary cue. We asked whether the participants could exploit this ability to judge target reappearance. They could “track” the target's motion from disappearance to when it reached the position behind the occluder marked by a visible cue (the central fixation) and then, by symmetry, from there to when it reached an imaginary cue signaling the point of reappearance, positioned symmetrically to the point of disappearance with respect to the central fixation (Figure 1). To test this possibility in Experiment 2, we compared the condition in which the cue indicating the center of the trajectory was available, thus allowing spatiotemporal computation, with the condition in which it was absent. In the first case, participants could “follow” the moving target behind the occluder for the first part of its trajectory up to when it reached fixation; for the second part, its length was isomorphic to the first, so visual imagery of the reappearance point was then available by motion-tracking. Conversely, when there was no cue and the trajectory length was not constant, the participants were either obliged to respond when they saw the target reappearing or to learn an average trajectory length or occlusion duration. Two groups were tested: the first was instructed to maintain fixation at the central cue, while the second could follow the moving target with their eyes.