Aesthetics by Numbers: Links between Perceived Texture Qualities and Computed Visual Texture Properties

Jacobs, Richard H. A. H.; Haak, Koen V.; Thumfart, Stefan; Renken, Remco; Henson, Brian; Cornelissen, Frans W.

doi:10.3389/fnhum.2016.00343

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 21 July 2016

Sec. Cognitive Neuroscience

Volume 10 - 2016 | https://doi.org/10.3389/fnhum.2016.00343

This article is part of the Research TopicHow Variable, Stable, or Universal are Aesthetic Preferences?View all 22 articles

Aesthetics by Numbers: Links between Perceived Texture Qualities and Computed Visual Texture Properties

Richard H. A. H. Jacobs^1,2*

Koen V. Haak^1,3

Stefan Thumfart^4,5

Remco Renken^1,6

Brian Henson⁷

Frans W. Cornelissen¹

¹Laboratory for Experimental Ophthalmology, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
²Donders Institute for Brain, Cognition and Behavior, Donders Center for Cognition, Radboud University, Nijmegen, Netherlands
³Donders Institute for Brain, Cognition and Behavior, Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, Netherlands
⁴Profactor GmbH, Steyr-Gleink, Austria
⁵Research Unit for Medical-Informatics, RISC Software GmbH, Johannes Kepler University Linz, Linz, Austria
⁶BCN NeuroImaging Center, School for Behavioral and Cognitive Neurosciences, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
⁷School of Mechanical Engineering, University of Leeds, Leeds, UK

Our world is filled with texture. For the human visual system, this is an important source of information for assessing environmental and material properties. Indeed—and presumably for this reason—the human visual system has regions dedicated to processing textures. Despite their abundance and apparent relevance, only recently the relationships between texture features and high-level judgments have captured the interest of mainstream science, despite long-standing indications for such relationships. In this study, we explore such relationships, as these might be used to predict perceived texture qualities. This is relevant, not only from a psychological/neuroscience perspective, but also for more applied fields such as design, architecture, and the visual arts. In two separate experiments, observers judged various qualities of visual textures such as beauty, roughness, naturalness, elegance, and complexity. Based on factor analysis, we find that in both experiments, ~75% of the variability in the judgments could be explained by a two-dimensional space, with axes that are closely aligned to the beauty and roughness judgments. That a two-dimensional judgment space suffices to capture most of the variability in the perceived texture qualities suggests that observers use a relatively limited set of internal scales on which to base various judgments, including aesthetic ones. Finally, for both of these judgments, we determined the relationship with a large number of texture features computed for each of the texture stimuli. We find that the presence of lower spatial frequencies, oblique orientations, higher intensity variation, higher saturation, and redness correlates with higher beauty ratings. Features that captured image intensity and uniformity correlated with roughness ratings. Therefore, a number of computational texture features are predictive of these judgments. This suggests that perceived texture qualities—including the aesthetic appreciation—are sufficiently universal to be predicted—with reasonable accuracy—based on the computed feature content of the textures.

Introduction

Can aesthetic appreciation of textures be predicted based on computed visual features? That is the question addressed in the present work that arose from the Syntex project (http://visualneuroscience.nl/syntex). Aesthetics often refers to beauty and related judgments, such as preferences. In a broader sense, it often refers to other impressions, such as the judgment of naturalness. Both interpretations apply to this article; its focus is on beauty, but we also consider other judgments about visual textures.

We use the following working definition of texture: any pattern in which no single object outline can be discerned. We used “single” because an outline of one stone would count as an object, but a field of stones would count as a texture. Textures typically contain repetitive information. For the present study, color was defined as an integral part of textures or surface properties.

Visual and tactile textures are widely used in industrial design, art, and architecture to convey information (e.g., about the atmosphere or safety of buildings, or the strength, quality, or intended use of objects) and to influence aesthetic experience. Despite this widespread use, until recently there have been relatively few systematic attempts to reveal systematic relationships between such perceived aesthetic qualities and the texture’s computed visual features. The Syntex project and its derivatives also addressed the impact of visual textures on aesthetic experiences in a number of previous publications (Thumfart et al., 2008, 2011; Liu et al., 2015).

Using Textures to Examine Aesthetic Responses

The study of texture processing is interesting in itself because evidence is accumulating that textures are processed in dedicated visual processing regions, which are located mainly along the medial visual cortex (Puce et al., 1996; Peuskens et al., 2004; Cant and Goodale, 2007; Hiramatsu et al., 2011; Jacobs et al., 2014). We consider textures or surfaces as the complement of shapes or outlines. Texture information can be quantified as the degree to which a feature is present. For outline stimuli, in which texture information is dropped, only things such as the length of outlines, the position of certain elements, or the number of elements can be quantified, along with features such as contrast which can also be computed for textures. When using natural stimuli such as faces, texture information can be quantified for the entire picture, but this would disregard differences in various parts of the picture; e.g., the frequency content of a face would differ from the frequency content of the hair or of the background of the face, resulting in average values which reflect neither. We consider these to be issues that can potentially affect any human output (judgments or physiological responses). Studying texture perception may therefore lead to insights into human perception that may not be found when using other stimulus types. In addition, textures provide important clues about material properties. Understanding texture perception will therefore contribute to our understanding of the perception of material properties.

Texture processing is not only inherently interesting. If we improve our understanding of texture processing mechanisms, this may shed light on the processing of other stimuli that are more typically investigated in aesthetics research, such as photographs of faces or objects or various categories of painting (which also contain texture). Moreover, the results could possibly point out confounds in other studies. Compared to such relatively complex stimuli, the use of textures has advantages, such as minimizing semantic associations that are hard to control for. Semantic information has been shown to be an important factor for determining preferences (Berlyne, 1970; Martindale et al., 1990). With textures, semantic influences are attenuated, although some textures may still elicit associations through the recognition of the materials of which they are composed (e.g., stone, wood, silk or fur). A final advantage of using textures over more complex stimuli is the availability of a large number of algorithms to compute image features, allowing quantification of their relationship to perceived texture qualities. For this reason, we refer to our approach as: aesthetics by numbers.

Previous Research into Texture Perception

In the visual domain, studies examining texture perception have primarily focused on lower-level texture processing such as texture segmentation and discrimination (Julesz, 1981; Bergen and Adelson, 1988; Knill et al., 1990; Landy and Bergen, 1991; Williams and Julesz, 1992; Victor and Conte, 1996; Merigan, 2000; Sireteanu et al., 2005; Victor et al., 2005; Ben-Shahar, 2006; Abbey and Eckstein, 2007; Yeshurun et al., 2008; Hollingworth and Franconeri, 2009). Studies of higher-level processing of visual textures have focused on judgments of appearance and material properties related to glossiness (Pont and te Pas, 2006; Motoyoshi et al., 2007a), illumination (Pont and te Pas, 2006), metallic appearance (Motoyoshi et al., 2007b), transparency (Watanabe and Cavanagh, 1993; Fleming and Bülthoff, 2005), estimated weight (Buckingham et al., 2009), roughness (Ho et al., 2006), slipperiness (Lesch et al., 2008), complexity and self-similarity and liking (Bies et al., 2016; Güçlütürk et al., 2016), and the relationship between perceived material properties and material categories (Fleming et al., 2013).

The number of studies investigating preferences for textures or features that can be considered texture features (e.g., Soen et al., 1987; Aks and Sprott, 1996; Schira, 2003; Fleming et al., 2013) is greatly exceeded by the vast number of studies devoted to understanding the affective responses to objects. Aesthetics research has often focused on stimuli such as paintings or faces for which feature information is hard to control—let alone that this has even been attempted. Several studies have found relationships between preference and color features (Ball, 1965; Valdez and Mehrabian, 1994). Some studies have examined the frequency content and self-similarity of paintings, with or without relating this aspect to actual beauty judgments (Redies et al., 2007; Graham and Redies, 2010; Mallon et al., 2014). In the Syntex-project and the current article we specifically sought to address the relationship between texture features and aesthetics.

Evidence for a Textural Influence on Preferences

The recency of the interest in the relationships between textural image features and beauty ratings is somewhat surprising, given the many long-standing indications in the literature that texture may have an impact on preference. Such indications come from studies investigating the relationship between preference and fractal dimension (Aks and Sprott, 1996; Spehar et al., 2003; Juricevic et al., 2010; Spehar and Taylor, 2013), entropy (Stamps, 2002), spatial frequency content (Soen et al., 1987; Kawamoto and Soen, 1993; Schira, 2003) or certain colors (Valdez and Mehrabian, 1994; Jacobs et al., 2010) of stimuli (usually not textures—but such features are also present in textures). Also work showing that paintings contain certain spatial frequency characteristics (Redies et al., 2007; Graham and Redies, 2010) is suggestive of such preference. Moreover, texture strongly influences facial attractiveness (Jones et al., 2004). In line with the reported relationship between spatial frequencies and beauty ratings, the brain responses to affective stimuli—such as expressive faces—depend on the frequency bands present in the stimulus (Vuilleumier et al., 2003; Holmes et al., 2005; Alorda et al., 2007; Delplanque et al., 2007). Moreover, brain centers regarded as emotion processors (such as the amygdala) respond to features such as angularity (Bar and Neta, 2007), which are both object and texture features.

To summarize the above, there are indications that texture features influence beauty ratings. Until recently, these influences have not yet been systematically investigated. To do so, we decided to perform two exploratory experiments and a computational analysis to establish the degree to which computed visual texture features influence beauty and other high-level judgments.

Our present study differs in a number of ways from some of the earlier work from the Syntex-consortium (Thumfart et al., 2008, 2011; Liu et al., 2015). First, we opted for a semantic differential approach in which—based on a factor analysis of a larger number of judgments—we select a small number of judgments that best represent the observers’ judgment space, rather than a priori assigning judgments to different “cognitive layers” (Thumfart et al., 2008, 2011; Liu et al., 2015). Second, we also used factor analysis for selecting the relevant computational features (rather than the Laplacian Score employed by Liu et al., 2015). Third, we emphasize the relevance of single features to the selected judgments, rather than the overall performance of a model, as Liu et al. did.

To give an overview of the present study, we first conducted an experiment for selecting the appropriate textures to use, and one to select appropriate adjectives for use in the judgments. Next, we conducted two separate semantic differential experiments, in which we focus on revealing the relationships between various judgments on textures and selecting the most representative ones (based on factor analysis). Finally, in a computational analysis, we address the relationships between computed texture features and the selected judgments.

Experiment 1: Texture Selection

The aim of this experiment was to select the textures for use in semantic differential experiment 1 (reported below).

Methods

Participants

Twenty four participants (12 males, age range 18–29 years) participated. All participants had normal or corrected to normal vision. Our entire study conformed to the tenets of the Declaration of Helsinki. The experiments were carried out as part of a psychology bachelor’s course. The ethical review board of the Department of Psychology of the University of Groningen approved the study. Participants gave their written informed consent prior to participation. All participants were students in higher education and received course credits for their participation.

Equipment and Software

Experiments were run on a MacBook pro under Mac OS X (Apple, Cupertino, CA, USA), using Matlab (Mathworks, Natick, MA, USA) with the Psychophysics toolbox extensions (Brainard, 1997). Stimuli were presented on a 30” Apple Cinema HD Display monitor.

Stimuli

From a large database (available on request; see Figure 1 for examples), taken from various sources, a pre-selection of 300 textures was made, based on criteria such as the absence of object outlines, and the elimination of very similar textures. The visual angle of the textures ranged from 3.3 to 32° in height, and from 5.7 to 37° in width.

FIGURE 1

Figure 1. Example textures. Thumbnails of textures used in the experiments. The enlargement shows a texture sample, as displayed on screen, with a green slider bar at the bottom.

Procedure

Textures were presented on the screen one by one. Participants indicated their preference by adjusting the position of a slider at the bottom of the screen (Figure 1) by moving a mouse along a bar corresponding to the judged dimension. They indicated their judgment by clicking on the desired location (right = beautiful; left = ugly/not beautiful).

Satisfaction with the judgment was indicated by pressing a mouse button, after which the screen went blank. The next trial started 1000 ms later. Instructions were given orally, as well as written on the screen, prior to the start of each run. All textures were presented once in a block, in random order. Participants were asked to use the entire range of the slider, and to not necessarily regard the central point as being “neutral”. In order to give a sense of the range of stimuli they would see, and to practice the procedure, a few test trials were performed by the participant before the actual start of the experiment. Participants were asked to respond based on their first impression. Participants performed the test individually. The experiment was performed in a room that was dark, except for the illumination provided by the screen.

Analysis and Selection

Based on the average rank order over participants, the 20 most and 20 least liked textures were selected, as well as 20 from the middle of the range. Note: we determined that such a selection of the most extreme textures is necessary to obtain reliable beauty ratings (see Supplementary Material, part I). We consider that this is likely to also enhance our ability to discover relationships between specific features and judgments.

Results

The texture selection experiment yielded a rank ordering of the 20 most and the 20 least liked textures, as well as 20 from the middle of the range. Four of each category are displayed in Figure 2.

FIGURE 2

Figure 2. Examples of textures rated high, low, and average on beauty. Twenty of each were selected for use in the first semantic differential experiment.