Independence of Size and Distance in Binocular Vision

Kim, Nam-Gyoon

doi:10.3389/fpsyg.2018.00988

ORIGINAL RESEARCH article

Front. Psychol., 25 June 2018

Sec. Perception Science

Volume 9 - 2018 | https://doi.org/10.3389/fpsyg.2018.00988

Independence of Size and Distance in Binocular Vision

Nam-Gyoon Kim^*

Department of Psychology, Keimyung University, Daegu, South Korea

For too long, the size distance invariance hypothesis (SDIH) has been the prevalent explanation for size perception. Despite inconclusive evidence, the SDIH has endured, primarily due to lack of suitable information sources for size perception. Because it was derived using the geometry of monocular viewing, another issue is whether the SDIH can encompass binocular vision. A possible alternative to SDIH now exists. The binocular source of size information proposed by Kim (2017) provides metric information about an object’s size. Comprised of four angular measures and the interpupillary distance (IPD), with the explicit exclusion of egocentric distance information, Kim’s binocular variable demands independence of perceived size and perceived distance, whereas the SDIH assumes interdependence of the two percepts. The validity of Kim’s proposed information source was tested in three experiments in which participants viewed a virtual object stereoscopically then judged its size and distance. In Experiments 1 and 2, participants’ size judgments were more accurate and less biased than their distance judgments, a finding further reinforced by the results of partial correlation analyses, demonstrating that perceived (stereoscopic) size and distance are independent, rather than interdependent as the SDIH assumes. Experiment 3 manipulated participants’ IPDs, one component of Kim’s proposed variable. Size and distance judgments were overestimated under a diminished IPD, but underestimated under an enlarged IPD, a result consistent with predictions based on participants’ utilization of the proposed information source. Results provide unequivocal evidence against the SDIH as an account of size perception and corroborate the utility of Kim’s proposed variable as a viable alternative for the binocular visual system.

Introduction

The sense of solidity experienced when viewing a pair of two-dimensional (2-D) stereo images is compelling. The added depth that is unavailable in each 2-D image may contribute to the vivid impression. This may be why Pinker (1997, p. 241) declares binocular vision as “one of the glories of nature.” Indeed, it is well documented that binocular vision facilitates our daily interactions with the surrounding environment (Jackson et al., 1997; Watt and Bradshaw, 2000, 2003; Melmoth and Grant, 2006).

The advantages of binocular vision have been well recognized, but its contribution to space perception has been unimpressive. Of the many sources of spatial information identified to date, only two distance cues (convergence and binocular disparity) are binocular. Further, the efficacy of these two sources is rather limited, effective, at best, up to 2 m but no more than 6 m from the observer (Ono and Comerford, 1977, for review; but see Allison et al., 2009; Palmisano et al., 2010). Nevertheless, these two information sources carry extra significance. Convergence (i.e., inward or outward turning of the eyes to fixate objects at different distances) may well be the only cue that provides absolute metric information (Kaufman, 1974); whereas binocular disparity (i.e., the difference in the images of the two eyes due to their different viewpoints) provides the sense of solidity (i.e., three-dimensionality) of objects. Presumably, the limited ranges of the two binocular sources of distance information are supplemented by other monocular sources of distance information to yield accurate awareness of the surrounding layout in depth.

Distance perception abounds with various sources of information; but its counterpart, size perception, does not. In fact, to date only a few sources of information have been identified to account for size perception (e.g., familiar size, relative size, and horizon ratio). The disparity between the number of candidate information sources for size and distance is puzzling, given the long history of this problem (Ross and Plug, 1998; Hatfield, 2002). Even for those few sources of information that have been postulated to support accurate size perception, their efficacy is limited. For example, Haber and Levin (2001) demonstrated that familiar size can be an effective cue for size judgments. Interestingly, their participants were able to judge the sizes of unfamiliar objects with comparable precision. Unable to provide an adequate explanation for this finding, they lamented that “All we can say is that they did not do it in the same way as they did for the distance estimations. This ignorance reflects a general ignorance about the perceptual variables underlying size perception” (p. 1150).

The horizon ratio, first introduced by Sedgwick (1980), utilizes the fact that one’s eye height coincides with the horizon line. The absolute height of an object, therefore, can be determined in proportion to one’s eye height. Wraga (1999; see also Dixon et al., 2000) confirmed the utility of this information source by explicitly manipulating one’s perceived eye height by surreptitiously varying the floor height. Results showed that the perceived heights of objects varied in accordance with perceived eye height, positive evidence for the utility of the horizon ratio. Interestingly, eye height’s impact on judging object width was minimal. Based on these findings, Wraga concluded that eye height can be utilized as a natural metric for object height, but not for object width. Thus, it is safe to conclude that there has yet to be a comprehensive account of perceptual capacity for size judgments, in particular, the horizontal extents of objects, apart from the size distance invariance hypothesis (SDIH).

Indeed, the prevalent explanation for size perception has been the SDIH. As illustrated in Figure 1A, the two sides of the triangle, S and D, are inversely related to the angle, 𝜃, through a trigonometric relation, tan 𝜃 = S/D. Extending this geometric relation to perception, the hypothesis states that the visual angle 𝜃 subtended by an object determines a unique ratio of the perceived size of the object S′ to its perceived distance D′, that is, tan 𝜃 = S′/D′ (Kilpatrick and Ittelson, 1953; Epstein et al., 1961).

FIGURE 1

FIGURE 1. (A) Monocular geometry depicting the size distance invariance hypothesis (SDIH). An object of size S is at a distance D from an observer O, thus subtends a visual angle 𝜃. (B) Binocular geometry for viewing a line segment AB. L and R refer to the left and right eye, respectively, and ρ the IPD. A and B are the two end points of the line segment. α and β are visual angles subtended by AB with respect to each eye, whereas γ and δ are binocular parallaxes of each end point of the segment with respect to the two eyes.

An infinite number of size and distance combinations exists for any given angle. Yet, it is primarily perceived size, not perceived distance, for which the SDIH is utilized. Lack of identified information sources for size perception may have contributed to this biased application of the SDIH. Hence, the perceived size of an object is thought to be determined by both the visual angle the object subtends and its perceived distance, that is, S′ = D′ tan 𝜃. The conjecture that perceived size is derived from visual angle by taking perceived distance into account has been referred to as the “taking-into-account” model (Epstein, 1973, 1977; Higashiyama and Shimono, 2004; Higashiyama and Adachi, 2006).

For the last several decades, extensive efforts have been made to validate the SDIH empirically. The results have been largely inconclusive, primarily due to anomalous effects collectively known as the size-distance paradox (see Ross, 2003, for review). Gruber (1954) set out to determine whether perceived size is proportional to perceived distance when image-size is held constant. He observed, instead, that “an object which is consistently underestimated in relative size was consistently overestimated in relative distance” (p. 426), a pattern opposite to that predicted by the SDIH. This effect has been replicated repeatedly (Heinemann et al., 1959; Baird and Biersdorf, 1967; Epstein and Landauer, 1969; Ono et al., 1974; Foley, 1980; Collewijn and Erkelens, 1990; Brenner and van Damme, 1998; see Ross, 2003, for a review).

Apart from the issue of being an effective account of size perception, the SDIH raises another issue, that is, whether it can be utilized as an account of size perception for binocular vision. As depicted in Figure 1A, the SDIH is derived based on the geometry of monocular viewing. However, there is ample evidence of the benefits of binocular vision in our daily interactions with the surrounding environment (Jackson et al., 1997; Watt and Bradshaw, 2000, 2003; Melmoth and Grant, 2006). As an illustration, to reach and grasp an object in space, the hand must be transported to the object of interest while the grip aperture must match the dimensions of the object. The transport component relies on extrinsic properties of the object (e.g., object’s distance); but the grasp component relies on intrinsic properties (e.g., size and shape). Watt and Bradshaw (2000) reported that removal of binocular information affected the formation of the grip aperture, but had negligible impact on the transport component.

Despite convincing demonstration that binocular vision facilitates the control of grasp, these researchers failed to identify the source of binocular information that facilitated the size judgments needed for the control of grasp. Thus, the question can be raised as to the exact source of binocular information that their participants utilized to control their grasp. Did their participants perceive an object’s distance first, then use that distance to recalibrate the retinal image to determine object size, in accordance with the SDIH; or did they utilize an as yet unknown binocular information source to judge object size?

Recently, Kim (2017) proposed an alternative source of information that the binocular visual system could utilize to detect an object’s size—the horizontal extent of an object. Drawing on the binocular geometry of viewing a fronto-parallel line segment AB (Figure 1B), the proposed variable is expressed as follows:

\begin{matrix} AB = ρ \sqrt{\frac{sin α}{sin δ} \frac{sin β}{sin γ}} \end{matrix}

Provided that the visual system can access its interpupillary distance (IPD), the contention is that any frontal size can, in principle, be perceived binocularly based on the proposed binocular variable. As is the case with convergence angle, the IPD provides a metric basis, enabling the variable to convey absolute metric information about object size. More importantly, the information for an object’s size, according to this binocular variable, is directly available in optical stimulation, even in the absence of egocentric distance information. Hence, its utility necessarily demands the independence of the perceptions of size and distance.

As da Vinci noted five centuries ago, the binocular mode of visual perception may be fundamentally different from the monocular mode of visual perception, particularly at short distances (see Wade et al., 2001, for further details; see also Ono et al., 2002). Yet, research on binocular vision has relied exclusively on a description developed based on the viewing geometry of monocular vision, in particular, the SDIH. This is problematic. The present study set out to determine whether the binocular visual system utilizes the SDIH or an alternative source information, such as that proposed by Kim (2017), to perceive an object’s size. To this end, three experiments were conducted in which participants viewed a virtual object stereoscopically then judged its size and distance. The first two experiments ascertained whether perceived (stereoscopic) size and perceived (stereoscopic) distance are interdependent, as predicted by the SDIH, or independent, as entailed by Kim’s (2017) proposed binocular information source. The results showed little evidence that the two perceptual qualities are related, thus contradicting the SDIH. Because these results can only be construed as indirect evidence for the utility of the proposed information source, the third experiment sought direct evidence for its utility by manipulating the IPD, one component of the proposed variable.

Experiment 1: Perceptual Independence of Size and Distance in Stereoscopic Vision

In research on size perception, the doctrine of size distance invariance has rarely been challenged. Any deviant results have been attributed to the degraded qualities of distance information. If, however, the perceptions of size and of distance are independent, as hypothesized here, a different method will be needed to evaluate this hypothesis. Garner and Morton (1969; see also Ashby and Townsend, 1986; Amazeen, 1999) caution that a proper experimental paradigm to test the perceptual independence of two percepts must be one in which the two stimulus variables are controlled independently and the corresponding perceptions are assessed separately. Only then can a lack of independence in performance be attributed to limitations in the perceiver rather than to limitations in the experimental arrangement.

In the current experiment, participants watched virtual images of a cube of varying size under stereoscopic viewing conditions. The images were rendered in cross disparity so that they appeared to be floating in front of the computer monitor at varying distances from the observer. Participants reported the perceived location of the image, as well as its horizontal extent, by manipulating the reporting apparatus with their right hands, which were hidden from their view (see below for more details).

Participants

Nineteen undergraduates (1 male and 18 female) from Keimyung University volunteered for the experiment and received course credit for their participation. All participants had normal or corrected-to-normal vision. With the exception of one participant, all had normal stereoacuity of at least 100 s of arc, as measured by the Multi-Target Red/Green Anaglyph Stereo Test (Random Dot Butterfly, Letter “E,” and Figures; Synthetic Optics Inc., Franklin Lakes, NJ, United States). Her data were excluded from analysis.

Ethics Statement

The study was approved by the Keimyung University’s Institutional Review Board. After providing a complete description of the study to the participants, written informed consent was obtained in accordance with the Declaration of Helsinki.

Apparatus

The visual stimuli were generated on a Dell Precision 380 workstation equipped with an NVIDIA Quadro FX3450 graphics card (NVIDIA, Santa Clara, CA, United States) and presented on a 21-in Samsung SyncMaster Magic CD210JP CRT monitor refreshed at 120 Hz. Participants viewed the displays in a dimly lit room through CrystalEyes (StereoGraphics, San Rafael, CA, United States) liquid crystal (LC) shutter glasses that were synchronized with the monitor’s refresh rate, which alternated at 60 Hz. The display had a resolution of 1080 H × 768 V pixels and subtended a field of view of 37.0° H × 28.0° V when viewed from a distance of 60 cm. A head and chin rest was used to restrict head movements.

A 56-cm H × 75-cm V × 100-cm deep (D) matte black viewing box was placed between participants and the monitor. The viewing box had a window for the monitor on one end and a window for a chin rest at the other. In addition to enhancing stereoscopic viewing, the viewing box blocked participants’ views of the hand with which they reported their perceptions of distance and size for a virtual object. To record their responses, participants used a special reporting apparatus (Figure 2). A wooden track onto which a 120-cm ruler was placed was positioned parallel to participants’ lines of sight and to the right of the viewing box. A wooden block could be moved along the track. On the block there was another 20.5-cm ruler lying parallel to the observer’s frontal plane. The ruler on the track was used to report the perceived distance of the virtual object, whereas the short ruler on the block was used to report the perceived size of the object (see below for details).

FIGURE 2

FIGURE 2. The reporting apparatus employed in the present study.

Stimuli

The stimulus was a cube in which each of its six sides was rendered with a different texture. The cube was displayed against a white background (Figure 3). The six texture images were randomized in each trial to produce different images of the cube across trials to eliminate effects of familiarity and texture information on size judgments. The stereo images were calibrated in accordance with each participant’s IPD.

FIGURE 3

FIGURE 3. Stereograms used in the volumetric condition. All the stereo pairs rendered in cross disparity. Hence, the left pair is the image for the right eye, whereas the right pair is the image for the left eye.

The stimulus object appeared either in its entirety as a three-dimensional (3-D) cube (the volumetric condition) or presenting only the frontal face as a 2-D rectangle (the frontal face condition). As part of control, each object appeared slightly to the left or to the right from the center of the screen. An additional effect of this manipulation was that each object, particularly in the volumetric condition, was depicted such that only its frontal face was projected to one eye, whereas the frontal face and one of the side panels were projected to the opposing eye, depending on the geometry of viewing (Figure 3). Note that in the volumetric condition, the object was rendered under perspective projection. Thus, despite the drastic difference in the two stereo images in the volumetric condition, when fused properly, the three-dimensionality of the perceived object in conjunction with its side panel rendered in perspective will further enhance depth impression.¹ If the SDIH is utilized for size perception, performance should be facilitated more so by enhanced depth impression under the volumetric condition than under the frontal face condition.²

Design

Four variables were manipulated. The simulated size of the cube varied among 3, 5, 7, 9, and 11 cm; and its simulated location varied among 34, 37, 40, 43, and 46 cm from the observation point. In the volumetric condition, the entire cube was shown; in the frontal face condition, only the frontal face of the cube was visible. Each cube was centered 3–4 cm to the left or to the right of the center of the screen. These manipulations yielded a 5 (Size) × 5 (Distance) × 2 (Shape) × 2 (Side) repeated measures design with 100 completely randomized trials.

Procedure

The experiment employed a double-blind procedure in which the laboratory assistant who ran the experiment was naïve as to the purpose of the experiment, as were the participants. Upon presentation of each stereo image pair, participants were instructed to move the wooden block along the track and place the surface of the block facing them coincident with the front face of the cube or the front face of the flat surface, depending on the condition. Participants were encouraged to adjust the block until satisfied with their judgments. They were then instructed to indicate the frontal size of the object using their thumb and the index finger, with the tip of their thumbnails placed at the left end of the short ruler on the block. The tick mark on the ruler indicated by the tip of the index fingernail was used to determine the size of the object. No feedback was provided. While reporting their responses, participants were instructed to keep their heads inside the viewing box and maintain their gaze on the monitor. Thus, the scales read by the experimenter were not visible to participants.

Data Analysis

First, performance was assessed in terms of constant error³ employing an analysis of variance (ANOVA) on each perceived quality, i.e., S′ (perceived size) and D′ (perceived distance) with size, distance, shape, and side as independent variables. Then, following Oyama (1974, 1977; see also Higashiyama and Shimono, 2004; Higashiyama and Adachi, 2006), partial correlation analyses were performed to assess the relationships among manipulated variables and perceptual variables. Partial correlation measures the relationship between two variables while holding the effect of other variables constant. Oyama contends that the patterns of partial correlations can be used to infer causal relations among variables. For example, given three variables, X, Y, and Z with the assumption that X is always an independent variable and every relation between two variables is linear, if X determines Y and Y determines Z, that is, Y is mediating X to determine Z, a high bivariate correlation between X and Z becomes almost zero when the effect of Y is removed. If, as the SDIH predicts, S′ is derived from visual angle 𝜃 by taking D′ into account, the partial correlation between 𝜃 and S′ should be zero when the effect of D′ is controlled. If, on the other hand, as contended here, S′ and D′ are independent, or more specifically, S′ is directly perceived by the information conveyed by Kim’s (2017) binocular variable, a high bivariate correlation between S′ and object size S should remain unaffected even when the influences of D′ or any other candidate intervening variables, such as 𝜃 or convergence angle ϕ, are held constant. For correlation analyses, 𝜃, ϕ, S, and D were entered as stimulus (or manipulated) variables⁴ and S′ and D′ as perceptual (or responding) variables. S is assumed to be specified by the proposed information source and D either by ϕ or a yet to be discovered higher-order optical variable specifying distance information binocularly comparable to Kim’s (2017) binocular size variable. Except for binocular disparity and convergence angle, all other spatial cues were unavailable under the experimental setup, that is, viewing stereoscopically produced virtual images of variously sized cubes rendered in novel texture images. Because these two distance cues are perfectly correlated with each other, only convergence angle was entered in the correlation analyses as distance information.

Results and Discussion

The five object sizes employed in the experiment were 3, 5, 7, 9, and 11 cm; and the means of the corresponding perceived sizes (SD) were 5.23 (0.84), 7.43 (1.00), 9.38 (1.25), 11.43 (1.54), and 13.32 (1.66) cm, respectively. The five target locations employed were 34, 37, 40, 43, and 46 cm; and the means of the corresponding perceived distances (SD) were 23.37 (8.07), 25.76 (7.98), 27.53 (8.40), 29.85 (8.09), and 32.22 (7.79) cm, respectively.

Overall, participants performed poorly, overestimating size and underestimating distance. The overall mean constant errors in size and distance judgments were 2.36 cm (SD = 1.18) and -12.26 cm (SD = 7.93), respectively. Nevertheless, an ANOVA on perceived size with size as a within-subject factor confirmed a significant effect of size, F(4,68) = 502.88, p < 0.0001, $η_{p}^{2}$ = 0.97. A Tukey post hoc test confirmed that all five sizes were discriminated from one another at the 0.01 significance level. The result was the same for perceived distance, F(4,68) = 76.55, p < 0.0001, $η_{p}^{2}$ = 0.82; means at the tested distance values were significantly different from each other at the 0.01 level. It appears that, despite over- and underestimation of size and distance, participants responded systematically to the variables manipulated in the experiment.

Constant Error Analysis

Judgment accuracy was assessed using constant error. Mean constant error in perceived size and in perceived distance are presented, respectively, as a function of object size (cm) for each condition of object distance (cm) and as a function of object distance (cm) for each condition of object size (cm) in the top and bottom panels of Figure 4. An ANOVA on perceived size revealed a main effect of distance, F(4,68) = 15.58, p < 0.0001, $η_{p}^{2}$ = 0.48, a significant Size × Distance interaction, F(16,272) = 1.80, p < 0.05, $η_{p}^{2}$ = 0.10 (top panel of Figure 4), a significant Size × Shape interaction, F(4,68) = 3.68, p < 0.01, $η_{p}^{2}$ = 0.18, and a significant four-way interaction among size, distance, shape, and side, F(16,272) = 1.92, p < 0.05, $η_{p}^{2}$ = 0.10.

FIGURE 4

FIGURE 4. Mean constant error in perceived size as a function of object size (cm) for each condition of object distance (cm) (top); and mean constant error in perceived distance as a function of object distance (cm) for each condition of object size (cm) (bottom) in Experiment 1. Error bars represent ±1 standard error (SE) of the mean.

With respect to the main effect of distance, a Tukey post hoc test confirmed performance differences between the 34 cm condition and the 40, 43, and 46 cm conditions and between the 37 cm condition and the 43 and 46 cm conditions. Size overestimates tended to be greater at near distances (the 34 and 37 cm conditions) than at far distances (the 43 and 46 cm conditions). With respect to the Size × Distance interaction, a simple effects analysis confirmed that the effect of distance was significant in the 9 cm condition, F(4,14) = 6.31, p < 0.01, and in the 11 cm condition, F(4,14) = 3.85, p < 0.05. Taken together, overestimation of object size at near distances was particularly pronounced in the two largest (9 and 11 cm) size conditions (top panel of Figure 4). With respect to the Size × Shape interaction, a simple effects analysis revealed a significant effect of shape in the 9 cm size condition, F(1,17) = 6.00, p < 0.05.

The ANOVA on perceived distance confirmed main effects of size, F(4,68) = 17.03, p < 0.0001, $η_{p}^{2}$ = 0.50, distance, F(4,68) = 11.04, p < 0.0001, $η_{p}^{2}$ = 0.39, shape, F(1,17) = 5.94, p < 0.05, $η_{p}^{2}$ = 0.26, and side, F(1,17) = 5.18, p < 0.05, $η_{p}^{2}$ = 0.23. The ANOVA also confirmed a significant Distance × Shape interaction, F(4,68) = 5.42, p < 0.01, $η_{p}^{2}$ = 0.24, and a significant Distance × Size × Side interaction, F(16,272) = 2.12, p < 0.01, $η_{p}^{2}$ = 0.11.

All in all, distances were severely underestimated (that is, objects were perceived closer than they were), with the degree of underestimation magnified with increases in distance and in size. A Tukey test for the size effect confirmed performance differences between the 3 cm condition and the other four larger size conditions and between the 11 cm condition and the 5 and 7 cm conditions; whereas a Tukey test for the distance effect confirmed performance differences between the 34 cm condition and the 40, 43, and 46 cm conditions and between the 37 cm condition and the 43 and 46 cm conditions.

The degree of distance underestimation also increased when the virtual object was displayed slightly to the right of the center of the screen. Recall that participants reported their responses using the reporting apparatus that was located on their right-hand side. This asymmetry may have contributed to this effect, although it is not clear why objects appearing on the right side were more underestimated (i.e., judged closer) than those on the left.⁵

The same pattern of underestimation occurred for the shape of the object with more pronounced underestimation for 3-D objects (M = -0.56) than for 2-D objects (M = 0.44). Shape further interacted with distance. A simple effects analysis revealed that the effect of distance was significant in the frontal size condition, F(4,14) = 3.85, p < 0.05, and in the volumetric condition, F(4,14) = 7.21, p < 0.01; whereas the effect of shape was significant only at 40 cm, F(1,17) = 13.35, p < 0.01. It appears that perceived distances of virtual objects located 40 cm away from the observation point were underestimated less than those of objects located at other distances, especially when the objects appeared in 2-D (the frontal face condition) rather than in 3-D (the volumetric condition). The reason for this is unknown. Taken together, the effect of the shape of the virtual image on perception appears to be minimal on size judgments. Thus, the present result in which enhanced depth impression failed to facilitate size judgments contradicts, or at least is inconsistent with, the SDIH.

Correlational Analysis

Bivariate and partial correlation analyses were performed by pairing one of the four stimulus variables (i.e., 𝜃, ϕ, S, and D) with one of two perceptual variables (i.e., S′, D′) to explore causal relationships between these variables. The analyses were performed for each participant, and the mean coefficients are presented in Table 1. At first blush, strong correlations between 𝜃 and S′, S and S′, D and D′, and ϕ and D′ are easily noticeable. Interestingly, except for the S and S′ pair, other pairs’ correlations became zero when the effect of control variables was factored out. First, with respect to the 𝜃 and S′ pair, as Oyama (1974, 1977) suggests, the near zero partial correlation indicates a causal relationship between these two variables, a result corroborating the SDIH. Note that four variables (i.e., ϕ, S, D, and D′) were entered in the partial correlation computation. Thus, it is important to identify the exact source of the confounding effect to verify this possibility. A separate partial correlation was performed while controlling one variable at a time. The additional analysis revealed coefficients of 0.78 with D, 0.81 with D′, 0.81 with ϕ, and 0.09 with S, as the control variable, respectively. Clearly, it was S that confounded the relationship between 𝜃 and S′, not a distance related variable, i.e., ϕ, D, or D′. The apparent relation between 𝜃 and S′ was spurious, arising largely because both were highly correlated with S, not causal, as it would be if it were to corroborate the SDIH.

TABLE 1

TABLE 1. Mean bivariate and partial correlation coefficients among stimulus and perceptual variables in Experiments 1 and 2.

With respect to the two pairs with near zero partial correlations, that is, the D and D′, and ϕ and D′ pairs, it was ϕ for the D and D′ pair and D for the ϕ and D′ pair, respectively, that played mediating roles. Interestingly, the magnitudes of the two correlation coefficients were identical (0.56 for the D and D′ pair but -0.56 for the ϕ and D′ pair), reflecting the inverse relation between convergence angle and distance. The results of the correlation analyses indicate a strong relationship between S and S′, but relatively weaker relationships between the two distance pairs, i.e., D—D′ and ϕ—D′ pairs. In fact, all 18 participants reached the statistically significant level in the S and S′ pair with coefficients no lower than 0.81, whereas two participants failed to reach statistical significance levels in the D—D′ and ϕ—D′ pairs. The efficacy of convergence as a reliable source of distance information has been controversial with conflicting evidence (Heinemann et al., 1959; Foley, 1980; Collewijn and Erkelens, 1990; Brenner and van Damme, 1998). The current consensus is that convergence may, at best, be a rough indicator of distance at close range. Given that convergence angle was the only source of distance information available in the present setting, relatively poorer distance judgments may reflect this consensus view. It appears, nevertheless, that, except for a few individuals, most participants relied on the convergence angle to judge target distance in the present experiment.

Finally, with respect to the S and S′ pair, the strength of the relation remained high even with the effect of the candidate intervening variables factored out, clear evidence that S′ is determined exclusively by S alone. Taken together, these results are consistent with the thesis that the perceptions of size and of distance are two independent perceptual processes, and, by extension, that the binocular visual system perceives an object’s size directly by detecting information specified in a source of information, such as that proposed by Kim (2017).

Before pursuing the present issue further, the large errors observed in these two judgments must be investigated to determine whether they have any bearing on the main issue. The following facts may be relevant: first, errors were primarily constant and bias-induced. Second, participants had no practice trials prior to the experiment and received no feedback during the experiment. Finally, participants had to respond to virtual objects with which they had no prior experience. Thus it may be that the biased responses were the consequence of improper attunement or failure in calibration—or both—by the perception-action system (Bingham and Pagano, 1998; Withagen and Michaels, 2005; Jacobs and Michaels, 2006). Attunement refers to the detection of a specifying information source; whereas calibration refers to the scaling of perception to the information source detected (see Withagen and Michaels, for further details). Unfamiliarity with the task in conjunction with the lack of feedback during the experiment may have allowed the perception-action system to drift without a proper anchor. Still, it is worth recalling that the patterns of bias differed (overestimation of size but underestimation of distance). The following investigation was conducted to clarify whether the current results were due to lack of attunement or calibration on the part of the perception-action system. To this end, Experiment 2 replicated Experiment 1, except that a short practice session was provided prior to the experiment. Feedback was provided after each practice trial, but no feedback was provided during the experiment.