Do Artists See Their Retinas?

Perdreau, Florian; Cavanagh, Patrick

doi:10.3389/fnhum.2011.00171

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 30 December 2011

Sec. Sensory Neuroscience

volume 5 - 2011 | https://doi.org/10.3389/fnhum.2011.00171

This article is part of the Research TopicBrain and ArtView all 38 articles

Do artists see their retinas?

Part of this article's content has been mentioned in:

Is artists' perception more veridical?
1. Read focused review

Florian Perdreau*

Patrick Cavanagh

Laboratoire Psychologie de la Perception, Centre Attention Vision, CNRS UMR 8158, Université Paris Descartes, Paris, France

Our perception starts with the image that falls on our retina and on this retinal image, distant objects are small and shadowed surfaces are dark. But this is not what we see. Visual constancies correct for distance so that, for example, a person approaching us does not appear to become a larger person. Interestingly, an artist, when rendering a scene realistically, must undo all these corrections, making distant objects again small. To determine whether years of art training and practice have conferred any specialized visual expertise, we compared the perceptual abilities of artists to those of non-artists in three tasks. We first asked them to adjust either the size or the brightness of a target to match it to a standard that was presented on a perspective grid or within a cast shadow. We instructed them to ignore the context, judging size, for example, by imagining the separation between their fingers if they were to pick up the test object from the display screen. In the third task, we tested the speed with which artists access visual representations. Subjects searched for an L-shape in contact with a circle; the target was an L-shape, but because of visual completion, it appeared to be a square occluded behind a circle, camouflaging the L-shape that is explicit on the retinal image. Surprisingly, artists were as affected by context as non-artists in all three tests. Moreover, artists took, on average, significantly more time to make their judgments, implying that they were doing their best to demonstrate the special skills that we, and they, believed they had acquired. Our data therefore support the proposal from Gombrich that artists do not have special perceptual expertise to undo the effects of constancies. Instead, once the context is present in their drawing, they need only compare the drawing to the scene to match the effect of constancies in both.

Introduction

Visual perception is our main access to the outside, “distal”, world which we experience consciously at the end of a long chain of processes. The image projected on our retina is the proximal stimulus, the original data on which these processes operate. If we should see the world as it is represented on the retina, objects would change size as they moved toward or away from us, change color as they moved into different lights, be cut into pieces as they moved behind other objects, and jump to and fro every time we moved our eyes. But instead of perceiving this ever-changing world, we have a coherent, invariant visual representation of objects: we experience visual constancy, that is, our conscious percept is to a large extent in accordance with the distal object’s properties whatever the proximal stimulus projected on our retina.

However, visual artists when rendering an object or a scene on a canvas return to a representation that is closer to the proximal image, depicting distant objects as smaller and nearby objects as larger. Clearly, compared to non-artists, artists are able to depict scenes and objects much more accurately. What is the basis of their expertise? One aspect is of course motor skill but the other of interest to us is the ability to see the proximal pattern of light and dark – to ignore the corrections, the visual constancies, underlying our everyday perception. The artist can pick the right dark pigment for depicting an object in a shadow, a pigment much darker than our subjective impression of the object; can make the distant object the correct size even though it is experienced as not very small. A number of studies have addressed these issues (Cohen and Bennett, 1997; Kozbelt, 2001; Cohen, 2005; Mitchell et al., 2005; Kozbelt and Seeley, 2007; Cohen and Jones, 2008; Matthews and Adams, 2008) showing indeed that drawing accuracy is correlated to perceptual performances: subjects who made more accurate drawings also showed less effect of context and visual constancies. According to Kozbelt (2001), artists are “experts in visual cognition.” The present study addresses whether the expertise of visual artists lies in their ability to access their proximal representation better than non-artists. Have years of experience changed their visual processing and their ability to access early levels of representation? Such plasticity in visual processing as a result of visual experience is seen in many contexts (Hubel and Wiesel, 1970; Goldstone, 1998; Ostrovsky et al., 2006; Green and Bavelier, 2008).

The idea that artists have direct access to early representations has been strongly criticized by the art historian Gombrich (1987). Gombrich agreed with Ruskin (1912) that artists do use special techniques to depict the proximal stimulus but he felt that their training could not lead them to get an “innocent eye”: the “innocent eye is a myth” (Gombrich, 1987, p. 251). Instead, “making comes before matching” (Gombrich, 1987, p. 99), and artists have to deal with their biased perception by drawing sketches according to it, and then make corrections in order to match it with the objective model they wish to represent. In this view, image-making is a hypothesis-testing process, a continuous back, and forth between production and correction. This “copyist” approach is an alternative explanation for the representational skills of artists. That is, artists may experience the same visual constancies as non-specialists but learn to make corrections in the context of the drawing itself as it progresses. Specifically, once sufficient context is present in the drawing, they only have to match the sizes and colors they see in their artwork to the perceived sizes and colors they see in the scene being depicted; the similarity of context in both will impose the same constancies.

To examine whether artists have developed visual expertise or copyist expertise, we tested three different constancies: size, lightness, and shape, all of which must be undone or bypassed for figurative artists to create an accurate copy of a scene. Two of the experiments use matching-to-standard tasks while the third is a visual search task. In all of these tasks, we will use context, perspective grids, shadows, and occlusion to trigger the application of visual constancies (Day, 1972; Todorovic, 2002, 2010), and see whether the artists are less influenced by the context than non-artists. If artists are indeed able to access, or recover their initial retinal image (closer to the proximal stimulus), they would be less affected by context than non-artists. However, this finding would not tell us whether the greater accuracy was due to perception that was uncorrected by visual constancies (Ruskin, 1912) or to skill in undoing the corrections (Gombrich, 1987). The critical factor to distinguish these two possibilities is speed: access to the uncorrected proximal image ought to allow for rapid response whereas the reversal of the corrections should require extra time. To test the speed of access, we use a visual search task for partial shapes in occluded or unoccluded presentation (He and Nakayama, 1992; Rensink and Enns, 1998). If artists are able to access the initial uncorrected image then their processing rates for the occluded versions will be more rapid than those of non-artists.

In these experiments, context is introduced in order to trigger the corrections of visual constancies and we assume that, without any instruction, both artists and non-artists would probably experience these context effects to the same degree. However, the subjects were not asked to judge the perceived size, or lightness, or shape, they were asked to ignore the context, to bypass constancy, and report the “real” size or luminance, or shape of the test. This is a critical point in the procedure: subjects are asked explicitly to report what corresponds to their retinal image. Can artists do this better than non-artists?

Experiment: Size Constancy

Size constancy refers to the accurate perception of an object’s size despite the fact that a distant object will have a smaller size on our retina than a near object. In order to provide such a “veridical perception” (Todorovic, 2002), the visual system needs to infer the object’s size by correcting its size on the retina (in visual angle) for the perceived distance (Figure 1). Because size constancy is related to distance perception, it must be directly dependent on the various cues to depth (Leibowitz and Harvey, 1967; Day, 1972). For example, the influence of monocular cues (perspective grids) on size constancy has been shown in several experiments (Stuart et al., 1993; Aks and Enns, 1996; Bennett and Warren, 2002). Nevertheless, our perception is not limited strictly to corrected distal image; for example, Rock (1983) suggested that we are aware of both retinal size and actual size of the object, even if we generally do not pay attention to retinal size. However, even when asked to judge an object’s retinal size (say, compare a distant building to our thumb held out beside it), there are residual effects of the actual size in the world (Carlson, 1960, 1962). This suggests that artists may be able to access the uncorrected retinal size of objects, ignoring to some extent the real world sizes of the objects; perhaps, they may do this more effectively than non-artists.

FIGURE 1

Figure 1. In the left panel, the man in the background appears to be about the same height as the woman in the foreground. This perception corresponds to visual constancy. However, in the right panel, the man’s image is moved so that he appears to be adjacent to the woman, and the now appears much smaller than he does on the left. This is the correction for distance that underlies size constancy and we, non-artists, are unable to ignore it even though we know that the two images of the man have identical size on the picture plane (measure them to check). Can artists register that the two images of the man have identical size in the picture plane?

In this first experiment, perceived depth was induced by linear perspective cues of a receding hallway in the context condition. Here, size constancy should make the test stimulus look larger in the hallway than when it is seen against the flat grid (Figure 2), and we assume that, without any instruction, both artists and non-artists would probably experience this effect to the same degree. However, the subjects were asked to adjust the size to match the physical size of a standard (presented on a blank field below) as if they were using their fingers to measure the size directly on the screen. In other words, subjects were encouraged to ignore the context and report the “real” size of the test.

FIGURE 2

Figure 2. Size task conditions. Subjects were asked to adjust the size of the test cylinder so that it matched the actual size of the standard cylinder, imagining that they were using their fingers to judge the size of both cylinders on the screen. There were two randomly counterbalanced conditions: “normal” condition where the cylinder was displayed on a simple 16 × 16 grid, and “context” condition where the cylinder appears in linear perspective represented by a hallway.

Materials and Methods

Subjects

For the three experiments, the subjects were subdivided in three groups: art students, professional artists, and non-artists. The first were recruited from high-ranked Major Art School [n = 9, six females and three males, age = 22 ± 1.7]. Professional artists were recruited from galleries, workshops, and international artists associations [n = 14, nine females and female males, age = 39 ± 12.9]. Non-artists subjects were recruited from the internal network of Cognitive Science (RISC), a database of voluntary subjects, except for two subjects from our laboratory [n = 14, nine females and five males, age = 23 ± 2.8]. The non-artists reported having no particular drawing skills or specific training in visual arts. All subjects had normal or corrected-to-normal vision and those from outside our laboratory were paid 10€ for their participation. They were informed about the purpose of the experiment and were naïve about our hypotheses. They all gave their informed consent before passing the experiment.

Materials

All the experiments took place in a dark room and used the same materials. Also, the subject’s head was always held by a chinrest so that his or her eyes were approximately 52 cm from the center of the screen. The stimuli were projected on a 22″ CRT screen (LaCie, Electron 22 blue IV), with a resolution of 1024 × 768 pixels and with a refresh frequency of 100 Hz. The monitor’s luminance was linearized with a gamma correction. The experiments were programmed with MATLAB Psychtoolbox (version 3.0.8), and were run on an Apple computer.

The screen was divided in two equal vertical halves (21° × 16°). In the top half (“standard”), two possible texture gradients could be displayed: a simple 16 × 16 black line-drawn grid simulating a vertical wall, or a black line-drawn perspective grid representing a hallway with a central perspective (with a unique vanishing point in the center). The targets were two green cylinders, one in each half (see Figure 2). The cylinders were drawn with Adobe Photoshop CS4, and their color saturation was set at 10% in order to avoid any distracting salience. All the visual elements (texture gradients and cylinders) were presented against a white background.

Procedure

Participants were told, “Adjust the size of the cylinder, at the bottom of the screen, so that it matches the size of the standard cylinder at the top. Make your adjustment as if you were using your fingers to measure the size directly on the screen.” They pressed the right arrow on the keyboard to increase the lower cylinder’s size, or the left arrow to decrease it, and then pressed the space button to register the setting. There was no time pressure but the time they took to make their setting was recorded.

The standard cylinder displayed in the top half of the screen could be presented either on a simple grid or on a texture gradient representing a hallway. The former corresponded to the normal condition, while the latter corresponded to the context condition. The two conditions were presented equally often with the order randomized across trials. The standard cylinder could have six possible heights (1.5°, 1.6°, 1.7°, 1.8°, 1.9°, and 2° of visual angle), which were randomized across trials, and the test cylinder could begin randomly either 50% smaller or bigger than the standard.

Each participant started the experiment with a block of 10 practice trials. The conditions in the test block were the texture gradient (normal/context) and the possible heights of the referential cylinder. There were 5 trials per condition for a total amount of 60 trials for the test bloc (5 × 6 × 2).

Results

Subjects settings increased proportionally with the standard size and we summarized each subject’s settings by their means across the six standard sizes. We then computed a ratio between the context mean response and the normal mean response for each subject (group mean ratios are plotted in Figure 3). These ratios are a measure of the context effect on the subject’s judgment. Ratios close to 1 mean that there was no effect of the context, while ratios significantly greater than 1 would suggest such an effect, that is, that subjects have overestimated the standard size when presented in the hallway context.

FIGURE 3

Figure 3. Group mean ratios. Ratios were computed by dividing the subject’s mean response in the context condition by that obtained in the normal condition. The art students showed a numerically smaller ratio, but this difference was not significant. Nevertheless, all ratios were significantly greater than 1, demonstrating the presence of significant constancy effects in all subjects.

We ran a one-way ANOVA on those ratios with Groups (non-artist, art students, professional artists) as factor. This test showed no significant difference in the effect of context vs. normal conditions across groups [F(2,34) = 0.37, p = 0.69]. Nevertheless, all ratios were significantly greater than 1 [t(36) = 6.36, p < 0.000]. The average ratio was 1.08, where a ratio of 1 would indicate no effect of context. There was therefore no evidence in our results suggesting that artists are better than non-artists at ignoring context in accessing stimulus size. One of our other questions was whether artists’ performances would vary with experience. To address that point, we analyzed the correlation between the context effect expressed as the ratio described above and subjects’ years of art experience. We fixed non-artists’ experience to 0, since they were not supposed to have followed an art training, and used the self-reported years of art training as the other variable in the correlation. The correlation was not significant (Pearson’s r = 0.08, ns).

Finally, we analyzed the response time for each subject to evaluate the effort the subjects put into making their settings in each settings in each condition. A longer time would suggest more effort. We found a significant main effect of Groups [F(2,219) = 22.59, p < 0.001, η² = 0.17], as well as a main effect of the condition [F(1,219) = 5.89, p < 0.016, η² = 0.03], but no interaction between Condition and Groups [F(2,219) = 0.357]. A Post hoc analysis showed that surprisingly, art students, like professional artists, spent more time on each trial, 15.37 and 15.95 s, respectively, almost twice as much as non-artists 8.60 s (both p < 0.001). There was no difference between art students and professional artists. This result is the opposite of our expectation that artists would find this task easier.

In summary, size perception was influenced by visual context for all subjects, showing an increase in the estimated size of the standard by in an average of 8% in the context condition compared to the normal condition. We also found no correlation between the degree of context’s influence and the subject’s experience, suggesting that experience and training do not play a crucial role in artists’ performance. In sum, we find no evidence of an advantage for artists in ignoring context when judging object size.

The instructions were of critical importance in this task: if we had asked subjects to match the apparent size, we would expect that size constancy would apply equally to all, independently of their art training. But instead, we were encouraging subjects to ignore the context and evaluate the size of the standard and comparison as if they were measuring them on the screen with their fingers. Our adjustment procedure also allowed subjects time to engage various strategies; this is of particular interest to us as it should bring into play explicit strategies that artists have learned in drawing class as well as the implicit ones acquired through long practice.

Despite these aspects of the experiment that should have favored the artists if they did have special perceptual expertise, we found that the artists were as bound to the context effects as non-artists. Moreover, response time analysis showed that both art students and professional artists spent much more time on each trial than non-artists. We had expected artists to take less time, given their expertise. This opposite result suggests that the artists felt some pressure, as experts in visual perception, to perform well on these tasks, to engage the strategies that they had been taught to correct size perception and to overcome context effect. But despite the instructions to ignore context and despite the longer duration the artists spent on the task, they showed the same extent of constancy as non-artists.

Experiment: Lightness Constancy

We perceive objects via the light they reflect back to our retina. The received light is determined by two components: the object’s surface reflectance and the illumination falling on it. The reflectance corresponds to the proportion of the incident light that is reflected at different wavelengths of the spectrum and fully depends on the surface material. It is a property of the object and remains constant whatever the intensity or wavelength distribution of the illumination falling on the object. The amount of light arriving at the retina (the proximal property) is the product of the object’s reflectance (its “color,” the distal property) and the illumination. Here we will focus on achromatic property of the object’s surface – whether it is light or dark, and in the case of the achromatic test patches we use, white, gray, or black. We will use “lightness” as the perceived reflectance (white vs. black surface) and “brightness” or luminance as the perceived luminance (the product of illumination and reflectance). According to those definitions, lightness constancy designates the invariance of the surface’s perceived reflectance despites changes in illumination (Gilchrist, 1988; Moore and Brown, 2001).

To recover the surface reflectance of an object, most authors assume a process that can discount the illumination falling on it. To do so, the visual system must estimate the illumination. A number of proposals have been made for this process (Gilchrist, 1988, 2006; Adelson, 1993, 2000; Arend and Spehar, 1993a,b; Agostini and Galmonte, 2002). Although lightness constancy has often been explained in terms of low-level mechanisms (simultaneous contrast effect caused by lateral inhibition in retina’s ganglion cells), it now appears that in some cases, a high-level computation of spatial relationships of surfaces and light is required. For example, a cast shadow on a surface can be recognized by the visual system because it is darker, its borders are unrelated to object borders, the surrounding texture continues into the shadow area with a reduction of luminance but not contrast, and it appears to have no volume of its own (Cavanagh and Leclerc, 1989). Thus the visual system would attribute change of luminance within the shadow limits to a change in illumination, not reflectance (Gilchrist, 1988).

However, a painter can only vary the reflectance of the paint used to depict the object and so this one pigment must correspond to the luminance coming from the real object where the luminance is the product of the object’s reflectance and the illumination falling on it. Can normal observers make these luminance judgments with any accuracy (brightness) – how well could they pick a paint to match it? For instance, when a cast shadow falls on a test surface it leads the observer to perceive the object’s surface as lighter (Figure 4). Can artists ignore the perceived reflectance and “see” the actual luminance any better than normal observers?

FIGURE 4

Figure 4. Lightness constancy and shadows (Adelson, 1993). Squares A and B have identical luminance as shown by the vertical gray stripes that contact both in the right hand panel. However, B appears to lie in a shadowed region indicating a reduction in illumination, Once the visual system compensates for the illumination difference, B appears to be a lighter (whiter) surface than A.

To examine this we introduce a cast shadow into a simple scene (Figure 5) where lightness constancy should make the test stimulus look lighter, more white, when the shadow falls on it even though its luminance remains the same. We assume that, without any instruction, both artists and non-artists would probably experience this effect to the same degree. However, the subjects were not asked to judge the perceived surface lightness (light or dark) but to judge the amount of light as if the shadow were not present or they could look at the gray patch through a tube. In other words, subjects were encouraged to ignore the context, to bypass lightness constancy and report the “real” luminance of the test.

FIGURE 5

Figure 5. Brightness task conditions. The task was to adjust the brightness (luminance) of the test ellipse (B and D) so that it corresponded to the actual brightness of the standard ellipse (A and C). Two conditions were randomly presented to the subject: the “normal” condition where the standard was outside the shadow, and the “context” condition where the standard was within the shadow.