Evolutionary and Modern Image Content Differentially Influence the Processing of Emotional Pictures

Dhum, Matthias; Herwig, Uwe; Opialla, Sarah; Siegrist, Michael; Brühl, Annette B.

doi:10.3389/fnhum.2017.00415

ORIGINAL RESEARCH article

Front. Hum. Neurosci., 23 August 2017

Sec. Cognitive Neuroscience

Volume 11 - 2017 | https://doi.org/10.3389/fnhum.2017.00415

Matthias Dhum¹†

Uwe Herwig¹,²,³*†

Sarah Opialla²

Michael Siegrist¹

Annette B. Brühl²

¹Department of Consumer Behavior, Institute of Environmental Decisions, ETH, Zurich, Switzerland
²Department of Psychiatry, Psychotherapy and Psychosomatics, University Hospital of Psychiatry, University of Zurich, Zurich, Switzerland
³Department of Psychiatry and Psychotherapy III, University of Ulm, Ulm, Germany

From an evolutionary perspective, environmental threats relevant for survival constantly challenged human beings. Current research suggests the evolution of a fear processing module in the brain to cope with these threats. Recently, humans increasingly encountered modern threats (e.g., guns or car accidents) in addition to evolutionary threats (e.g., snakes or predators) which presumably required an adaptation of perception and behavior. However, the neural processes underlying the perception of these different threats remain to be elucidated. We investigated the effect of image content (i.e., evolutionary vs. modern threats) on the activation of neural networks of emotion processing. During functional magnetic resonance imaging (fMRI) 41 participants watched affective pictures displaying evolutionary-threatening, modern-threatening, evolutionary-neutral and modern-neutral content. Evolutionary-threatening stimuli evoked stronger activations than modern-threatening stimuli in left inferior frontal gyrus and thalamus, right middle frontal gyrus and parietal regions as well as bilaterally in parietal regions, fusiform gyrus and bilateral amygdala. We observed the opposite effect, i.e., higher activity for modern-threatening than for evolutionary-threatening stimuli, bilaterally in the posterior cingulate and the parahippocampal gyrus. We found no differences in subjective arousal ratings between the two threatening conditions. On the valence scale though, subjects rated modern-threatening pictures significantly more negative than evolutionary-threatening pictures, indicating a higher level of perceived threat. The majority of previous studies show a positive relationship between arousal rating and amygdala activity. However, comparing fMRI results with behavioral findings we provide evidence that neural activity in fear processing areas is not only driven by arousal or valence, but presumably also by the evolutionary content of the stimulus. 
This also has fundamental methodological implications: a more elaborate classification of stimulus content could improve the validity of experimental designs.

Introduction

Identifying threatening situations is a vital feature of human perception that has evolved over the history of animals. A failure in this aptitude could have had fatal consequences for our ancestors. The vital relevance of a working adaptive behavior is assumed to have led to the evolution of a dedicated fear module that, in turn, governs this behavior (Öhman and Mineka, 2001; Sander et al., 2003). Öhman and Mineka (2001) further argue that this module has evolved to recognize all natural threats facing our ancestors, such as predators and poisonous animals. The concept of the fear module is based on the theory of preparedness, which posits that the successful perception and identification of environmental threats lead to a reproductive advantage for the individual (Seligman, 1971).

In modern times, however, the environmental threats prevalent until a few centuries ago are no longer the main threats to most people, particularly in modern urban societies. Instead of the natural and evolutionarily established threats, we now increasingly face threats that are qualitatively different, more technical, and in some cases less tangible. This might require an adaptation of our perception, evaluation and reaction to these threats: We no longer have to fear, for instance, snakes, spiders and predators, but should rather be cautious in motor traffic, when facing guns as well as when handling tools like knives. These stimuli are in this study referred to as modern threats. The question is how the neural processes have adapted in response to these “newer” threats. In our study we therefore investigated the neural differences when comparing the perception of evolutionary threats to modern threats. In this line of investigation, we define the term “threat” as the anticipation of a spatially and temporally proximate source of potential harm for the individual (Baldwin, 1971; Davis et al., 2009). The concept of threat involves the identification of emotional significance, the generation of an affective state, and a subsequent behavior; these processes engage overlapping neural structures and functions (Phillips et al., 2003; Mohr et al., 2010; Herwig et al., 2011). Earlier studies addressed the question of differences in the central nervous processing of evolutionary vs. modern threats (Blanchette, 2006; Fox et al., 2007; Brown et al., 2010; Sakaki et al., 2012). Regarding the threat-superiority effect, modern threats were reported to be detected in some instances better than evolutionary ones (Blanchette, 2006), whereas such a difference was not observed in an event-related potential study (Brown et al., 2010) or regarding reaction time (RT; Fox et al., 2007). Sakaki et al. (2012) reported differences regarding involved brain areas when comparing evolutionary and social stimuli, with more activation in dorsomedial prefrontal areas in the social context.

The neural underpinnings of the perception of threats in general and associated negative emotions have been studied extensively with a range of methods and stimuli (LeDoux, 2000; Phan et al., 2002; Wager et al., 2003; Pessoa, 2008; Pessoa and Adolphs, 2010). The current model posits that a network of cortical and subcortical regions, including the amygdala, orbitofrontal cortex, anterior insula, anterior cingulate cortex, and inferotemporal visual cortex, plays a central role in the perception and identification of threatening stimuli (Sabatinelli et al., 2005; Pessoa, 2008). While the amygdala was previously thought to be involved primarily in the perception of threatening (or more generally, emotionally negative) stimuli, the concept of this subcortical region has progressed to a more general function of significance detection and processing (Sander et al., 2003; Williams, 2006; Pessoa and Adolphs, 2010). According to this concept, the amygdala should be activated when encountering any stimuli that convey a biological significance for the individual, which can be of either positive or negative valence (Sergerie et al., 2008). Neuroimaging studies support this assumption by showing that amygdala activity varies according to the level of arousal evoked by a stimulus (Kensinger and Corkin, 2004; Sabatinelli et al., 2005; Kensinger and Schacter, 2006; Kryklywy et al., 2013). However, valence, which is the other main dimension in the Circumplex model of affect (Russell, 1980), seems to have a smaller effect on amygdala activity (Phan et al., 2002; Wager et al., 2003; Sergerie et al., 2008).

To investigate the influence of content on the neural circuits involved in processing threatening stimuli, we chose pictures showing a different phylogenetic origin by selecting those with a strong evolutionary history vs. modern pictures. As a reference, we included two neutral categories, again comprising evolutionary prepared vs. modern pictures. Thus, our study included four experimental conditions: evolutionary-threatening, modern-threatening, evolutionary-neutral and modern-neutral. The pictures included in our study displayed threatening stimuli related to the basic emotion of fear. In contrast, pictures showing disgust and sadness were not covered in our study.

On a neurophysiological level, we propose that the affective pictures will engage a network of brain regions comprising amygdala, orbitofrontal cortex, anterior insula, anterior cingulate cortex, inferotemporal visual cortex as well as medial thalamus and midbrain (Sabatinelli et al., 2005). We consider two complementary lines of reasoning which serve as a theoretical frame in our study. First, the literature (Sander et al., 2003; Wager et al., 2003; Sergerie et al., 2008) suggests that the neural activity in emotion processing circuits reflects the affective rating of the International Affective Picture System (IAPS) pictures, especially the arousal dimension and, to a lesser extent, the valence dimension. Second, the theory of the evolved fear module (Öhman and Mineka, 2001) suggests differences between evolutionary and modern stimuli in the activation of emotion processing circuits. The evolution of the fear module in response to threats such as snakes and spiders implies that evolutionary-threatening stimuli might be associated with a stronger activation, particularly in evolutionarily older regions such as the amygdala, thalamus and midbrain, than the modern stimuli, which are supposed to evoke stronger activation in cortical stimulus processing areas such as the inferotemporal cortex.

Materials and Methods

Subjects

We recruited healthy subjects through a mailing list and pin board postings. Exclusion criteria were any history of major medical conditions, head trauma, neurological or psychiatric disorders (both individually and in the family), current substance abuse and medication, as well as contraindications against MRI such as claustrophobia, pregnancy, pacemaker or ferromagnetic implants. These criteria were assessed in a semi-structured clinical interview. Subjects received CHF 50 compensation.

In total, 44 subjects (22 females) were scanned for the study. Three subjects were excluded from the final analysis (two subjects due to performance in the behavioral task suggesting a lack of attention or cooperation or otherwise misunderstanding of the instructions (RT in 35% of the trials >1.5 s or button presses outside the required time frame) and one subject due to potential clinical conditions which the subject revealed only after inclusion). Thus, the final sample comprised 41 subjects (21 females) with an average age of 25.0 years (SD = 5.3 years).

All subjects had normal or corrected-to-normal vision and were right-handed according to the Annett handedness questionnaire (Annett, 1967). All subjects were within the normal range of anxiety according to the State-Trait Anxiety Inventory X1 and X2 (Spielberger et al., 1970; Laux et al., 1981). No subject reported phobic symptoms related to the stimulus material (e.g., arachnophobia).

The study was approved by the Ethics Committee of the Canton of Zurich (Kantonale Ethikkommission Zürich). All subjects gave their written informed consent. The study was conducted in accordance with the Declaration of Helsinki (World Medical Association, 2008).

Stimulus Material

For each of the four experimental conditions, we selected 16 representative pictures from the IAPS database (Lang et al., 2008). First, the pictures were assigned to the respective condition based on a content analysis. This was done independently by two of the authors (MD, ABB) and discussed with the co-authors in case of divergent assignments. Second, the assignment was based on the ratings provided by the IAPS technical report (Lang et al., 2008). Each picture condition was constructed with the aim to not contain outliers in the valence and arousal ratings. Thus, pictures were selected for a condition if their rating was homogeneous within the condition and distinct to the other conditions. This manual selection process was validated in a pre-test with an independent larger sample (N = 201) by running a confirmatory factor analysis across all pictures (unpublished data). The evolutionary-threatening condition included pictures of predatory animals (e.g., snakes, spiders, dogs, bears, sharks) whereas the modern-threatening condition displayed pictures of guns, knives, and accidents involving cars, ships and airplanes. Evolutionary-neutral pictures comprised landscapes, forests and flowers, while the modern-neutral pictures showed inanimate objects such as cars, trains, ships, bridges, suitcases, and drawers. Picture numbers are provided in the Supporting Information, Supplementary Table S1.

Experimental Procedure

In the scanner, pictures were displayed covering the full screen of digital video goggles (Resonance Technologies, Northridge, CA, USA) using Presentation software (version 15.1). We presented blocks of eight consecutive pictures from the same experimental condition (Figure 1). Each picture was shown for 1980 ms. Thus, each block lasted 15,840 ms in total. Before the first block and between the blocks, a black screen with a white fixation cross was shown for 15,840 ms to allow the Blood-Oxygen Level-Dependent (BOLD) signal to return to a baseline (Ogawa et al., 1990).
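The block timing can be checked with a few lines of arithmetic. The sketch below only restates numbers given in the text (picture duration, pictures per block, and the repetition time reported under Image Acquisition); the variable names are illustrative.

```python
# Timing values taken from the text; names are illustrative only.
PICTURE_MS = 1980           # presentation time per picture
PICTURES_PER_BLOCK = 8      # pictures shown in one block
TR_MS = 1980                # repetition time reported under Image Acquisition

block_ms = PICTURE_MS * PICTURES_PER_BLOCK   # duration of one picture block
volumes_per_block = block_ms // TR_MS        # volumes acquired while pictures are shown
cycle_ms = 2 * block_ms                      # picture block plus equally long fixation period

print(block_ms, volumes_per_block, cycle_ms)
```

Because the picture duration equals the repetition time, each picture onset coincides with the start of a new volume.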

Figure 1. Experimental task. For representational reasons, only four pictures for each category are shown. In the experiment, each block consisted of eight pictures. To make the pictures less identifiable, in keeping with the requirements of the International Affective Picture System (IAPS) providers, black boxes are pasted over the front picture in the figure; this was not the case in the experiment.

The pictures for each block were randomly taken from the 16 pictures selected for the respective experimental condition. The block order was pseudo-randomized across an experimental run to control for serial position effects. One experimental run included four blocks of each experimental condition. Thus, each of the 16 pictures of every experimental condition was shown twice in an experimental run. The whole experiment consisted of three experimental runs, each lasting approximately 11 min.
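One way to obtain such a run can be sketched as follows. This is a minimal illustration, not the authors' Presentation script: it assumes that each condition's 16 pictures are shuffled in two passes and cut into four blocks of eight, and it uses a plain shuffle as a stand-in for the pseudo-randomized block order, whose exact constraints are not stated. The function name `build_run` and the picture indices 0 to 15 are hypothetical.

```python
import random

CONDITIONS = ["evolutionary-threatening", "modern-threatening",
              "evolutionary-neutral", "modern-neutral"]

def build_run(seed=None):
    """Return 16 (condition, picture_indices) blocks for one experimental run."""
    rng = random.Random(seed)
    blocks = []
    for condition in CONDITIONS:
        pictures = []
        for _ in range(2):                    # two passes: every picture appears twice
            one_pass = list(range(16))        # hypothetical indices of the 16 pictures
            rng.shuffle(one_pass)
            pictures.extend(one_pass)
        for start in range(0, 32, 8):         # cut the 32 presentations into 4 blocks of 8
            blocks.append((condition, pictures[start:start + 8]))
    rng.shuffle(blocks)                       # assumption: unconstrained shuffle of block order
    return blocks

run = build_run(seed=1)
print(len(run), run[0][0], run[0][1])
```

Cutting each shuffled pass into two blocks guarantees that no picture repeats within a block while every picture is shown exactly twice per run.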

The subjects were instructed to press the button of a response box with their right index finger at the onset of the first picture of a new block. The recorded RT served as a control for general attention and wakefulness of the subjects. Further, fast RTs are generally associated with higher fear relevance of the stimulus (Fox et al., 2007). After the scanning session, subjects rated the pictures on the valence (scaled from 1 = negative to 9 = positive) and arousal (scaled from 1 = low to 9 = high) scales using a digital version of the original IAPS self-assessment manikin (Mogg et al., 1994).

Similar to previous studies (Anders et al., 2008), we deliberately decided against an online rating during the functional magnetic resonance imaging (fMRI) task since it has been shown that emotional rating instructions may influence neural activity already during the perception of a stimulus (Taylor et al., 2003). Moreover, the post-scan evaluation of stimuli has been demonstrated to correspond well with the emotional experience during the scan (Hariri et al., 2000; Phan et al., 2004).

Behavioral Data

We removed outliers (RT <100 ms or >1500 ms) from the data gathered during the scan. A repeated-measures ANOVA was performed to check for differences in RT to the different experimental conditions. In case of significant Mauchly’s tests of sphericity, Greenhouse-Geisser correction was applied. Bonferroni-corrected post hoc tests were performed to reveal differences between single conditions. Similarly, repeated-measures ANOVAs and subsequent post hoc tests were performed to test for differences in valence and arousal ratings between experimental conditions. Statistical analysis was performed with SPSS (Version 19.0.0.1, SPSS Inc., Chicago, IL, USA) and Matlab (Version R2014a; The MathWorks Inc., Natick, MA, USA).
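The outlier criterion amounts to a simple range filter. The following sketch uses made-up reaction times; the function name and the example values are hypothetical, and only the 100 ms and 1500 ms bounds come from the text.

```python
def remove_rt_outliers(rts_ms, lower_ms=100, upper_ms=1500):
    """Drop reaction times below lower_ms or above upper_ms (bounds themselves are kept)."""
    return [rt for rt in rts_ms if lower_ms <= rt <= upper_ms]

# Hypothetical reaction times in milliseconds, for illustration only.
example_rts = [85, 430, 560, 1500, 1620, 610]
print(remove_rt_outliers(example_rts))   # [430, 560, 1500, 610]
```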

Image Acquisition

Imaging was performed using a 3.0 T GE Signa HD Scanner (GE Medical Systems, Milwaukee, WI, USA; 8-channel head coil). fMRI was conducted using echo-planar imaging (EPI) with the following configuration: 28 interleaved axial slices, 3.5 mm slice thickness, 0.5 mm gap, matrix 64 × 64, 240 mm field of view, resulting voxel size 3.75 × 3.75 × 4.0 mm, repetition time (TR) = 1980 ms, echo time (TE) = 32 ms, flip angle = 70°. The slice angle was optimized to reduce susceptibility artifacts in the amygdala and frontal regions. Per run a total of 328 volumes were acquired, 16 for each of the 20 experimental blocks. The first four volumes of each run were discarded to allow for T1 equilibration. In addition, 3-D T1-weighted anatomical volumes (172 axial slices, TR = 9.9 ms, TE = 2.9 ms, matrix size 256 × 256, voxel size 1 × 1 × 1 mm) were acquired for co-registration with the functional data. Furthermore, T2-weighted images in parallel to the EPI sequence were acquired to exclude possible T2-sensitive brain abnormalities.
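The reported voxel size and the approximate run length follow directly from the acquisition parameters. The short check below uses only values stated above; the variable names are illustrative.

```python
# Acquisition parameters as reported in the text.
FOV_MM = 240
MATRIX = 64
SLICE_THICKNESS_MM = 3.5
SLICE_GAP_MM = 0.5
TR_S = 1.98
VOLUMES_PER_RUN = 328

in_plane_mm = FOV_MM / MATRIX                          # in-plane voxel edge length
through_plane_mm = SLICE_THICKNESS_MM + SLICE_GAP_MM   # slice thickness plus gap
run_minutes = VOLUMES_PER_RUN * TR_S / 60              # duration of one run

print(in_plane_mm, through_plane_mm, round(run_minutes, 1))
```

The result of roughly 10.8 min agrees with the run duration of approximately 11 min given under Experimental Procedure.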

Image Analysis and Statistics

Imaging data was analyzed using BrainVoyager QX 2.8.4 (Brain Innovation, Maastricht, Netherlands; Goebel et al., 2006). Pre-processing of the functional data included slice scan time correction, 3-D motion correction with intra-session alignment, and temporal high-pass filtering with removal of linear trends. Functional data was co-registered to the individual anatomical 3-D datasets. Anatomical datasets were corrected for intensity inhomogeneity and transformed into Talairach coordinate space (Talairach and Tournoux, 1988). Volume time courses with a 3 × 3 × 3 mm³ voxel size were created from the functional datasets. For the subsequent group analysis, the volume time courses were spatially smoothed with a 6.0 mm full-width at half-maximum Gaussian kernel.

The experimental conditions were used as HRF-convolved box-car function predictors in the General Linear Model (GLM) design matrix. In addition, the individual 3-D motion correction parameters were z-transformed, high-pass filtered (10 cycles) and linearly detrended using the BVA Predictor Tool (Version 1.52, J.M. Born, Maastricht, Netherlands), and added as predictors of no interest to the design matrix to account for BOLD artifacts caused by task-correlated motion (Morgan et al., 2007). From the individual GLM matrices, we calculated a Random Effects GLM as a first step in the group analysis. Voxel time courses from the single runs were percent-transformed. Serial correlations were detected and removed using the AR(2) model approach. We automated most pre-processing steps using BrainVoyager scripts or WinAutomation software (Version 4.02, Softomotive Ltd., Athens, Greece).
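To illustrate what an HRF-convolved box-car predictor looks like, the sketch below builds one block predictor with NumPy. It is not BrainVoyager's implementation: the two-gamma shape parameters (response peak near 5 s, undershoot peak near 15 s, ratio 6) are assumed values, and the function name `two_gamma_hrf` is hypothetical. Only the TR and the eight volumes per block come from the text.

```python
import math
import numpy as np

TR_S = 1.98
VOLUMES_PER_BLOCK = 8

def two_gamma_hrf(t, peak_shape=6.0, undershoot_shape=16.0, undershoot_ratio=1.0 / 6.0):
    """Two-gamma HRF sampled at times t (seconds); shape parameters are assumptions."""
    def gamma_pdf(x, shape):
        return x ** (shape - 1) * np.exp(-x) / math.gamma(shape)
    hrf = gamma_pdf(t, peak_shape) - undershoot_ratio * gamma_pdf(t, undershoot_shape)
    return hrf / hrf.sum()                     # normalize to unit sum

# Box-car for fixation, one picture block, then fixation (1 = pictures on screen).
boxcar = np.concatenate([np.zeros(VOLUMES_PER_BLOCK),
                         np.ones(VOLUMES_PER_BLOCK),
                         np.zeros(2 * VOLUMES_PER_BLOCK)])

hrf = two_gamma_hrf(np.arange(0, 32, TR_S))    # HRF sampled once per volume
predictor = np.convolve(boxcar, hrf)[:len(boxcar)]

print(np.round(predictor, 3))
```

The convolved predictor rises with a delay after block onset and decays after block offset, which is why the fixation periods between blocks are needed for the signal to return to baseline.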

Our aim was to analyze the differential activation of those brain regions centrally involved in the processing of negative emotional stimuli. In a first step, we identified brain regions activated by all threatening stimuli compared to neutral stimuli. Therefore, we calculated a repeated measures 2 × 2 ANOVA with the factors threat (levels: threatening, neutral) and origin (levels: evolutionary, modern). The voxel-wise threshold for statistical maps corresponds to p < 0.001 uncorrected. To correct for multiple comparisons, a Monte Carlo simulation with 1000 iterations was used for estimating cluster-level false-positive rates on these maps (statistics implemented in BrainVoyager). This resulted in a minimum cluster size of 34 voxels at 3 × 3 × 3 mm (904 mm³), corresponding to p < 0.05 corrected cluster-wise.

In a second step, we analyzed the differential effect of the factor origin in the threatening stimuli. Therefore, we created individual maps for the two contrasts (evolutionary-threatening > evolutionary-neutral) and (modern-threatening > modern-neutral). These individual maps were subsequently used as input for a paired t-test in which we contrasted the maps “evolutionary-threatening > evolutionary-neutral” and “modern-threatening > modern-neutral” against each other. The voxel-level threshold for statistical maps corresponds to p < 0.001. To correct for multiple comparisons, a Monte Carlo simulation with 1000 iterations was used for estimating cluster-level false-positive rates on these maps. This led to a minimum cluster size of 38 voxels at 3 × 3 × 3 mm (1007 mm³), corresponding to p < 0.05 corrected cluster-wise. Further, we extracted t values from the resulting clusters to quantify the effect of origin. In selected regions, we additionally computed for each condition the mean time course by averaging all peri-stimulus BOLD time course segments belonging to the same condition using the respective tool in BrainVoyager. Anatomical regions were identified using the Talairach Client (Lancaster et al., 2000).
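The paired comparison of the two difference maps is equivalent to testing the interaction contrast (+1, −1, −1, +1) across the four conditions. The sketch below shows this for a single voxel using randomly generated, purely hypothetical beta estimates; it does not reproduce any value from the study, and the helper `one_sample_t` is an illustrative name.

```python
import numpy as np

rng = np.random.default_rng(0)
n_subjects = 41

# Hypothetical per-subject betas for one voxel.
# Columns: evolutionary-threatening, evolutionary-neutral, modern-threatening, modern-neutral
betas = rng.normal(size=(n_subjects, 4))

evo_diff = betas[:, 0] - betas[:, 1]     # evolutionary-threatening > evolutionary-neutral
mod_diff = betas[:, 2] - betas[:, 3]     # modern-threatening > modern-neutral

def one_sample_t(x):
    """t statistic of the mean of x against zero."""
    return x.mean() / (x.std(ddof=1) / np.sqrt(len(x)))

# A paired t-test is a one-sample t-test on the pairwise differences.
t_paired = one_sample_t(evo_diff - mod_diff)

# The same statistic written as an interaction contrast over the four conditions.
t_interaction = one_sample_t(betas @ np.array([1, -1, -1, 1]))

print(t_paired, t_interaction)
```

Positive t values indicate a larger threat effect for evolutionary than for modern pictures; negative values indicate the opposite direction.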

Results

Behavioral Results

We performed a repeated-measures ANOVA to test for differences in RT to the first picture of a block. Mauchly’s test indicated that the assumption of sphericity had been met ($\chi^{2}_{(5)}$ = 7.00, p > 0.05). The results showed no significant effect of experimental condition on RT ($F_{(3,120)}$ = 1.58, p > 0.05). Mean RT ranged from 542.49 ms to 561.56 ms (Table 1).

Table 1. Reaction times to the first picture of a block during the scan session, and means and standard deviations of the normative ratings of the International Affective Picture System (IAPS) pictures.

The general pattern of the post-scan rating of valence and arousal of the IAPS pictures did not deviate from the original ratings provided in the IAPS technical report (Lang et al., 2008) and from our own data in an independent sample (unpublished data). A confirmatory factor analysis in this independent sample on the valence and arousal ratings supported the assignment of the pictures to the four conditions, thus adding to the validity of the experimental design.

To test for differences in the valence rating between experimental conditions, we conducted a repeated-measures ANOVA. Mauchly’s test was significant ($\chi^{2}_{(5)}$ = 28.75, p < 0.05), indicating a violation of the sphericity assumption. Greenhouse-Geisser corrected values showed significant differences between experimental conditions ($F_{(2.11,84.40)}$ = 222.76, p < 0.05). Bonferroni-corrected post hoc tests revealed significant differences for all pairwise comparisons between all conditions at p < 0.05 (Table 1).

Differences in the arousal rating between experimental conditions were assessed with a repeated-measures ANOVA. Mauchly’s test was significant ($\chi^{2}_{(5)}$ = 26.14, p < 0.05), indicating a violation of the sphericity assumption. Greenhouse-Geisser corrected values showed significant differences between experimental conditions ($F_{(2.02,80.79)}$ = 119.36, p < 0.05). Bonferroni-corrected post hoc tests indicated that each threat condition was rated significantly different to both neutral conditions at p < 0.05 (Table 1).
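The degrees of freedom reported for these tests follow from the design (four conditions, 41 subjects). The sketch below recomputes them and derives the Greenhouse-Geisser epsilon implied by the corrected values; because the reported degrees of freedom are rounded, the epsilon values are approximate, and the function name is illustrative.

```python
n_subjects = 41
n_conditions = 4

df_effect = n_conditions - 1                            # uncorrected numerator df
df_error = (n_conditions - 1) * (n_subjects - 1)        # uncorrected denominator df
df_mauchly = n_conditions * (n_conditions - 1) // 2 - 1 # df of Mauchly's chi-square

def implied_epsilon(corrected_df_effect):
    """Greenhouse-Geisser epsilon implied by a reported corrected numerator df."""
    return corrected_df_effect / df_effect

eps_valence = implied_epsilon(2.11)   # from the valence ANOVA
eps_arousal = implied_epsilon(2.02)   # from the arousal ANOVA

print(df_effect, df_error, df_mauchly)
print(round(eps_valence, 2), round(eps_valence * df_error, 1))
print(round(eps_arousal, 2), round(eps_arousal * df_error, 1))
```

Multiplying the uncorrected error df by the implied epsilon returns the reported corrected error df up to rounding, which confirms that both degrees of freedom were scaled by the same factor.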

To summarize, evolutionary-neutral pictures were rated significantly more positive in valence compared to all other conditions and within the positive spectrum of the IAPS set. Subjects rated both threat conditions significantly higher in arousal and more negative in valence than the two neutral conditions. While the two threatening conditions did not differ in arousal (p > 0.5), modern-threatening pictures were rated significantly more negative in valence than evolutionary-threatening pictures. Across all presented pictures, the subjects’ ratings varied significantly more on the arousal scale than on the valence scale (N = 64 pictures; average SD across pictures: valence = 1.39, arousal = 1.77, Wilcoxon Z = −5.5, p < 0.001).

fMRI Results

The main effect of threat in the 2 × 2 repeated measures ANOVA (factors threat and origin) revealed a network of cortical and subcortical regions including the left middle frontal gyrus, right inferior frontal gyrus, right posterior cingulate gyrus, right cuneus, large portions of the bilateral occipital lobe including extrastriate and inferotemporal regions, and bilateral amygdala (see Table 2, Supplementary Table S2 and Figure 2A).

Table 2. Anatomical regions activating stronger for threatening stimuli than for neutral stimuli.

Figure 2. (A) Brain areas activating stronger for threatening stimuli than for neutral stimuli. The map shows the main effect of threat, derived from a repeated measures 2 × 2 ANOVA with factors threat (levels: threatening, neutral) and origin (levels: evolutionary, modern). The thresholds in the figures are chosen for representational purposes, q(FDR) < 0.01. Talairach coordinates of slices x: 18, y: −56, z: −17. (B) Brain areas showing the differential effect of origin in threatening pictures. Contrast: (Evolutionary-threatening > Evolutionary-neutral) > (Modern-threatening > Modern-neutral), q(FDR) < 0.01. Talairach coordinates of slices x: −18, y: −66, z: −12.

To identify regions showing a differential activation to the evolutionary vs. modern origin within the threatening stimuli, we applied the combined contrast “evolutionary-threatening > evolutionary-neutral” > “modern-threatening > modern-neutral”. This analysis revealed a network of regions including the left inferior frontal gyrus, right middle frontal gyrus, right parietal lobe (sub-gyral), right precuneus, left thalamus, bilateral fusiform gyrus, bilateral superior parietal lobule, and bilateral amygdala (see Table 3, Supplementary Table S3 and Figure 2B).

Table 3. Anatomical regions showing the differential effect of origin in threatening pictures.

The opposite contrast revealed that only in the bilateral posterior cingulate and the bilateral parahippocampal gyrus the activity was higher for modern-threatening stimuli than for evolutionary-threatening pictures (see Table 3). For the amygdala, the fusiform gyrus and the parahippocampal gyrus, we created event-related averages of all conditions to characterize the BOLD response of each experimental condition (see Figure 3).

Figure 3. Mean time course for selected regions. (A) Left and right amygdala, (B) Left and right fusiform gyrus (BA 19), (C) Left and right parahippocampal gyrus (PHG; BA 36 left, BA 35 right). Contrast: (Evolutionary-threatening > Evolutionary-neutral) > (Modern-threatening > Modern-neutral).

Discussion

Functional Implications

We systematically investigated the effect of image content in threatening stimuli on the activation of neural networks involved in emotion processing. By contrasting threatening with neutral pictures, we revealed a network of regions typically found in emotion processing (Pessoa and Adolphs, 2010), thus supporting the validity of our threatening stimuli. Evolutionary-threatening pictures evoked significantly stronger activations than modern-threatening pictures in most regions of the network for processing threatening stimuli. Surprisingly, however, this finding is in contrast to the behavioral part of the experiment, the post-scan rating of the IAPS pictures. Subjects rated modern-threatening stimuli as significantly more negative in valence than evolutionary-threatening pictures, indicating a higher level of perceived threat or fear for stimuli such as guns, knives and car accidents. At the same time, the two threatening conditions did not differ in the arousal rating, thus implying no relevant association of subjective arousal with the difference of neural activity between the threatening conditions. According to the prevalent opinion in the literature, our behavioral findings would have suggested that modern-threatening pictures evoke a stronger BOLD response than the evolutionary-threatening pictures in regions involved in emotion processing, or the fear module, respectively (see Sabatinelli et al., 2005; Kensinger and Schacter, 2006). However, since the opposite was the case in our study, we argue that the evolutionary preparedness of the evolutionary-threatening stimuli is the actual driver of the neural activity.

The network of brain regions that activated more strongly for evolutionary-threatening stimuli than for modern-threatening stimuli comprised bilateral amygdala, the left inferior frontal gyrus, right middle frontal gyrus, right parietal lobe (sub-gyral), right precuneus, left thalamus, bilateral fusiform gyrus and bilateral superior parietal lobule (Sabatinelli et al., 2005). The finding in the amygdala, a central emotion processing region, supports the close relationship to emotion processing (Figure 3A), but early regions in the visual stream, such as the fusiform gyrus, known for face processing (Sabatinelli et al., 2005), are also involved (Figure 3B). We found the opposite effect (higher activity for modern-threatening than for evolutionary-threatening stimuli) in the bilateral posterior cingulate and the bilateral parahippocampal gyrus (Figure 3C). A possible explanation of this reversal could be the connection of the posterior cingulate gyrus with the hippocampus (FeldmanHall et al., 2016). Modern stimuli might engage more processes of memory, self-reflection and appraisal, which could be mediated by the posterior cingulate cortex. The parahippocampal gyrus has been found to encode complex visual scenes and the local environment (Epstein and Kanwisher, 1998). This could explain the low activation in this area for evolutionary-threatening stimuli, where the focus is on the animal itself and not so much on its surroundings (see Figure 3). In the other experimental conditions, however, about half of the pictures show wide-angled shots of natural landscapes or built environments, thus possibly activating the parahippocampal place area (Aguirre et al., 1996; Epstein and Kanwisher, 1998; Ishai et al., 2004).

The results of our study support the hypothesis of the amygdala and its connected regions as an evolved module for the detection of threat. This detection takes place automatically, without the need for cognitive processing of the stimuli (Lundqvist and Öhman, 2005). Furthermore, our results stress the perceived biological significance of evolutionary prepared stimuli, even if they no longer pose such an actual threat to the individual. A recent study found neurobiological evidence for a rapid snake detection mechanism in the pulvinar, which could represent a part of the evolved module (Stuber et al., 2011).

Our findings suggest that the assumption of amygdala activity explained by arousal ratings may not be fully comprehensive. Also, at least in our study, the valence ratings do not seem to reflect the activation of the emotion network. The study by Anders et al. (2008) showed effects of valence in line with the majority of the literature (i.e., less neutral valence ratings correlating with higher amygdala activity). However, in our study the most negatively rated pictures (modern-threatening) did not show the highest amygdala activity. When investigating amygdala activity to threatening stimuli, the explicit arousal and valence ratings might not be the strongest indicators to predict the neural activity. Even when made quickly and intuitively, these ratings might comprise elaborate cognitive evaluations and might thus not be strongly indicative of the amygdala’s role of automatic significance detection.

Alternative explanations of the difference in activation between the evolutionary-threatening and the modern-threatening condition should also be taken into account. First, a complementary explanation for the diminished BOLD response towards the modern-threatening stimuli could lie in a cognitively more demanding evaluation after the perception. Since an evolved module for these modern pictures can hardly exist, the evaluation of threat might require higher-level cortical processing, which in turn reduces amygdala activity (Hariri et al., 2000, 2003). This demanding evaluative process might set in involuntarily even in the absence of an additional experimental task.

Second, the conditions could potentially differ in perceived threat, when assuming that threat could not be defined by valence and arousal ratings. Some of the pictures display an immediate threat (e.g., snarling snakes and pointing guns), whereas other pictures only show a distal or already occurred threat (e.g., resting spiders or car accidents). It might be possible to quantify the threat potential of either experimental category in terms of probabilities, for instance by pooling lethality rates of each stimulus displayed. However, we assume that the individual rating of valence and arousal represents an appropriate and valid proxy to the subjective feeling of threat. Moreover, by averaging across a broad range of image content, we reduce the influence of outliers in terms of perceived threat. Further, we argue that this averaging, together with the randomized presentation of the stimuli, reduced possible effects of image features (e.g., eyes, color and spatial frequency) that differed between the two threatening conditions.

Methodological Implications

As we have demonstrated in this study, the content of the pictures shown to the subjects has a pronounced effect on the neural response, even if the pictures are believed to belong to the same emotion category (i.e., threatening pictures). As a consequence, we suggest that greater care should be taken when selecting stimuli for studies on emotion processing. In addition to selection based on normative ratings, we recommend characterizing images on qualitative dimensions as well, with the evolutionary-to-modern dimension being only one of several. For instance, Kensinger and Schacter (2006) had the subjects rate picture or word stimuli on the dimensions animacy (animate vs. inanimate) and commonality (common vs. uncommon). However, the authors reported that emotion processing did not differ depending on the task, whereupon they collapsed the data of both tasks. For a study containing a matching task and a labeling task, Hariri et al. (2003) used sets of threatening IAPS pictures which were virtually identical to our selection. Instead of evolutionary vs. modern, they described the stimuli as being of natural vs. artificial origin. In the subsequent analysis, however, the authors collapsed the fMRI data across these two categories, since the focus of the study was the difference in task and not the differentiation of the two origins. In this case, pooling the data might cause undesired variance, assuming that the two categories engage the network of emotion processing differently.
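The variance argument can be illustrated with a small simulation. The following is a minimal sketch in Python, assuming two hypothetical stimulus categories whose simulated response amplitudes differ only in their mean. All names and numbers are illustrative assumptions and are not taken from the data of the present study or from Hariri et al. (2003).

```python
import random
import statistics


def simulate_condition(mean, sd, n, rng):
    """Draw n hypothetical response amplitudes for one stimulus category."""
    return [rng.gauss(mean, sd) for _ in range(n)]


rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Hypothetical means and spread (arbitrary units), chosen for illustration only.
evolutionary = simulate_condition(mean=1.0, sd=0.5, n=5000, rng=rng)
modern = simulate_condition(mean=0.4, sd=0.5, n=5000, rng=rng)

# Collapsing across categories, as when both origins are analyzed together.
pooled = evolutionary + modern

# Average variance within the two categories.
var_within = (statistics.pvariance(evolutionary)
              + statistics.pvariance(modern)) / 2

# Variance after pooling both categories into one condition.
var_pooled = statistics.pvariance(pooled)

# For two equally sized groups, the pooled variance equals the mean
# within-group variance plus the squared half-distance between group means.
mean_gap = statistics.fmean(evolutionary) - statistics.fmean(modern)
var_expected = var_within + (mean_gap / 2) ** 2

print(f"within-category variance: {var_within:.3f}")
print(f"pooled variance:          {var_pooled:.3f}")
print(f"within + (gap / 2)^2:     {var_expected:.3f}")
```

Under these assumptions, any difference in mean response between the two categories adds directly to the variance of the pooled condition, which is the kind of undesired variance referred to above.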

From a more technical perspective, even measurable image parameters such as color, contrast, or spatial frequency do not directly account for the aspect of image content. Interestingly, the RT also did not differ between experimental conditions, giving no indication of a prioritized perception of evolutionary or modern threats (and thus not reflecting our fMRI or behavioral results). Previous empirical work suggests faster perception of threatening vs. neutral stimuli (Öhman et al., 2001) and no differences between evolutionary and modern threats (Fox et al., 2007; Pool et al., 2016).

In conclusion, a more elaborate process of stimulus classification and selection should lead to better experimental designs and thus more valid results. Effects that might have been confounded by the selection of overly heterogeneous stimuli could thus be revealed. We point out that researchers should be more aware of the possible effect of image content when selecting pictures as well as when reporting results of studies using IAPS and comparable databases. We suggest that future studies utilizing affective picture stimuli should first replicate our findings of marked differences between evolutionary and modern stimuli, and second characterize the image content on more dimensions than only valence and arousal. The IAPS database was originally conceived on a theoretical foundation representing the basic emotion dimensions valence and arousal (Russell, 1980) and the dominance dimension. However, when applied in studies investigating neural processes, these dimensions might fall short of adequately representing the complexity of the brain mechanisms. Thus, adding further dimensions that are relevant and tailored to neurophysiological research might greatly improve the IAPS database and future studies. In addition, this study only investigated still pictures. However, real-life visual perception is much more adapted to the perception of moving and animated scenes. Future studies might therefore also use short video clips to investigate effects of content on the processing of scenes.

Limitations

The content selected for our experimental conditions could be criticized in several respects. While both modern stimuli conditions show similar scenes from a wide-angle perspective, the modern-threatening condition also comprises close-up views of weapons. Also, the evolutionary-threatening pictures present menacing animals in their natural surroundings, whereas the evolutionary-neutral pictures do not contain any non-threatening animals. We are aware of this difference but argue that the inclusion of animals in both categories would have shifted the focus to the comparison of animate vs. inanimate stimuli.

Similar to previous studies (Anders et al., 2008) and the original IAPS sample (Lang et al., 2008), we found that ratings varied significantly more on the arousal scale than on the valence scale. Anders et al. (2008) concluded that arousal ratings were thus less directly related to the actual stimulus than valence ratings. Moreover, the validity of the valence and arousal ratings should be cross-checked by other measures (e.g., verbal descriptions, thinking-aloud, etc.). A further limitation is that we did not match the pictures for physical properties such as luminance, color or complexity, as this would have left too few pictures with identical properties to form statistically sufficient stimulus samples.

Furthermore, this study relies on subjective ratings of arousal combined with fMRI measures of brain activity. Psychophysiological measures such as heart rate, heart rate variability and skin conductance might increase the specificity of the findings.

Finally, we acknowledge that our findings and implications are not readily transferable to other basic emotions. While the effect of the evolutionary origin might be valid for threatening stimuli evoking fear or anxiety, it might not hold true for emotions such as happiness, disgust, sadness, or surprise.

Conclusion

We provide evidence that neural activity in the fear module is not only driven by arousal or valence, but presumably also by the evolutionary content of the stimulus. Methodologically, we thus suggest that a more elaborate classification of stimulus content will improve the validity of experimental designs.

Author Contributions

UH, ABB, MS, MD: substantial contributions to the conception or design of the work. UH, ABB, MS, SO, MD: acquisition, analysis, or interpretation of data for the work; final approval of the version to be published and agreement to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. UH, ABB, MS, SO: drafting the work (MD, UH) or revising it critically for important intellectual content. MD: prepared and approved an earlier version of the manuscript that was close to the current version and carried an equivalent message. However, he then left the working group and did not contribute to the final revisions provided here. He was formally informed about the submission and did not object, which is taken as approval. In view of his initial contribution, MD is still considered first author.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank Sigrid Scherpiet, Anna Hittmeyer and Lilian Roth for help with data collection, and Michael Stauffacher and Pius Krütli for help with experimental design and feedback on the manuscript.

Supplementary Material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fnhum.2017.00415/full#supplementary-material

References

Aguirre, G. K., Detre, J. A., Alsop, D. C., and D’Esposito, M. (1996). The parahippocampus subserves topographical learning in man. Cereb. Cortex 6, 823–829. doi: 10.1093/cercor/6.6.823

Anders, S., Eippert, F., Weiskopf, N., and Veit, R. (2008). The human amygdala is sensitive to the valence of pictures and sounds irrespective of arousal: an fMRI study. Soc. Cogn. Affect. Neurosci. 3, 233–243. doi: 10.1093/scan/nsn017

Annett, M. (1967). The binomial distribution of right, mixed and left handedness. Q. J. Exp. Psychol. 19, 327–333. doi: 10.1080/14640746708400109

Baldwin, D. A. (1971). Thinking about threats. J. Conflict Resolut. 15, 71–78. doi: 10.1177/002200277101500106

Blanchette, I. (2006). Snakes, spiders, guns, and syringes: how specific are evolutionary constraints on the detection of threatening stimuli? Q. J. Exp. Psychol. 59, 1484–1504. doi: 10.1080/02724980543000204

Brown, C., El-Deredy, W., and Blanchette, I. (2010). Attentional modulation of visual-evoked potentials by threat: investigating the effect of evolutionary relevance. Brain Cogn. 74, 281–287. doi: 10.1016/j.bandc.2010.08.008

Davis, M., Walker, D. L., Miles, L., and Grillon, C. (2009). Phasic vs. sustained fear in rats and humans: role of the extended amygdala in fear vs. anxiety. Neuropsychopharmacology 35, 105–135. doi: 10.1038/npp.2009.109

Epstein, R., and Kanwisher, N. (1998). A cortical representation of the local visual environment. Nature 392, 598–601. doi: 10.1038/33402

FeldmanHall, O., Raio, C. M., Kubota, J. T., Seiler, M. G., and Phelps, E. A. (2016). The effects of social context and acute stress on decision making under uncertainty. Psychol. Sci. 26, 1918–1926. doi: 10.1177/0956797615605807

Fox, E., Griggs, L., and Mouchlianitis, E. (2007). The detection of fear-relevant stimuli: are guns noticed as quickly as snakes? Emotion 7, 691–696. doi: 10.1037/1528-3542.7.4.691

Goebel, R., Esposito, F., and Formisano, E. (2006). Analysis of functional image analysis contest (FIAC) data with brainvoyager QX: from single subject to cortically aligned group general linear model analysis and self-organizing group independent component analysis. Hum. Brain Mapp. 27, 392–401. doi: 10.1002/hbm.20249

Hariri, A. R., Bookheimer, S. Y., and Mazziotta, J. C. (2000). Modulating emotional responses: effects of a neocortical network on the limbic system. Neuroreport 11, 43–48. doi: 10.1097/00001756-200001170-00009

Hariri, A. R., Mattay, V. S., Tessitore, A., Fera, F., and Weinberger, D. R. (2003). Neocortical modulation of the amygdala response to fearful stimuli. Biol. Psychiatry 53, 494–501. doi: 10.1016/s0006-3223(02)01786-9

Herwig, U., Brühl, A. B., Viebke, M.-C., Scholz, R. W., Knoch, D., and Siegrist, M. (2011). Neural correlates of evaluating hazards of high risk. Brain Res. 1400, 78–86. doi: 10.1016/j.brainres.2011.05.023

Ishai, A., Pessoa, L., Bikle, P. C., and Ungerleider, L. G. (2004). Repetition suppression of faces is modulated by emotion. Proc. Natl. Acad. Sci. U S A 101, 9827–9832. doi: 10.1073/pnas.0403559101

Kensinger, E. A., and Corkin, S. (2004). Two routes to emotional memory: distinct neural processes for valence and arousal. Proc. Natl. Acad. Sci. U S A 101, 3310–3315. doi: 10.1073/pnas.0306408101

Kensinger, E. A., and Schacter, D. L. (2006). Processing emotional pictures and words: effects of valence and arousal. Cogn. Affect. Behav. Neurosci. 6, 110–126. doi: 10.3758/cabn.6.2.110

Kryklywy, J. H., Nantes, S. G., and Mitchell, D. G. V. (2013). The amygdala encodes level of perceived fear but not emotional ambiguity in visual scenes. Behav. Brain Res. 252, 396–404. doi: 10.1016/j.bbr.2013.06.010

Lancaster, J. L., Woldorff, M. G., Parsons, L. M., Liotti, M., Freitas, C. S., Rainey, L., et al. (2000). Automated Talairach atlas labels for functional brain mapping. Hum. Brain Mapp. 10, 120–131. doi: 10.1002/1097-0193(200007)10:3<120::AID-HBM30>3.0.CO;2-8

Lang, P. J., Bradley, M. M., and Cuthbert, B. N. (2008). International Affective Picture System (IAPS): Affective Ratings of Pictures and Instruction Manual. Technical Report A-8. Gainesville, FL: University of Florida.

Laux, L., Glanzmann, P., Schaffner, P., and Spielberger, C. D. (1981). Das State-Trait-Angstinventar. Weinheim: Beltz.

LeDoux, J. E. (2000). Emotion circuits in the brain. Annu. Rev. Neurosci. 23, 155–184. doi: 10.1146/annurev.neuro.23.1.155

Lundqvist, D., and Öhman, A. (2005). Emotion regulates attention: the relation between facial configurations, facial emotion, and visual attention. Vis. Cogn. 12, 51–84. doi: 10.1080/13506280444000085

Mogg, K., Bradley, B. P., and Hallowell, N. (1994). Attentional bias to threat: roles of trait anxiety, stressful events, and awareness. Q. J. Exp. Psychol. A 47, 841–864. doi: 10.1080/14640749408401099

Mohr, P. N. C., Biele, G., and Heekeren, H. R. (2010). Neural processing of risk. J. Neurosci. 30, 6613–6619. doi: 10.1523/JNEUROSCI.0003-10.2010

Morgan, V. L., Dawant, B. M., Li, Y., and Pickens, D. R. (2007). Comparison of fMRI statistical software packages and strategies for analysis of images containing random and stimulus-correlated motion. Comput. Med. Imaging Graph. 31, 436–446. doi: 10.1016/j.compmedimag.2007.04.002

Ogawa, S., Lee, T. M., Kay, A. R., and Tank, D. W. (1990). Brain magnetic resonance imaging with contrast dependent on blood oxygenation. Proc. Natl. Acad. Sci. U S A 87, 9868–9872. doi: 10.1073/pnas.87.24.9868

Öhman, A., Flykt, A., and Esteves, F. (2001). Emotion drives attention: detecting the snake in the grass. J. Exp. Psychol. Gen. 130, 466–478. doi: 10.1037/0096-3445.130.3.466

Öhman, A., and Mineka, S. (2001). Fears, phobias, and preparedness: toward an evolved module of fear and fear learning. Psychol. Rev. 108, 483–522. doi: 10.1037/0033-295x.108.3.483

Pessoa, L. (2008). On the relationship between emotion and cognition. Nat. Rev. Neurosci. 9, 148–158. doi: 10.1038/nrn2317

Pessoa, L., and Adolphs, R. (2010). Emotion processing and the amygdala: from a ‘low road’ to ‘many roads’ of evaluating biological significance. Nat. Rev. Neurosci. 11, 773–783. doi: 10.1038/nrn2920

Phan, K. L., Taylor, S. F., Welsh, R. C., Ho, S.-H., Britton, J. C., and Liberzon, I. (2004). Neural correlates of individual ratings of emotional salience: a trial-related fMRI study. Neuroimage 21, 768–780. doi: 10.1016/j.neuroimage.2003.09.072

Phan, K. L., Wager, T., Taylor, S. F., and Liberzon, I. (2002). Functional neuroanatomy of emotion: a meta-analysis of emotion activation studies in PET and fMRI. Neuroimage 16, 331–348. doi: 10.1006/nimg.2002.1087

Phillips, M. L., Drevets, W. C., Rauch, S. L., and Lane, R. (2003). Neurobiology of emotion perception I: the neural basis of normal emotion perception. Biol. Psychiatry 54, 504–514. doi: 10.1016/s0006-3223(03)00168-9

Pool, E., Sennwald, V., Delplanque, S., Brosch, T., and Sander, D. (2016). Measuring wanting and liking from animals to humans: a systematic review. Neurosci. Biobehav. Rev. 63, 124–142. doi: 10.1016/j.neubiorev.2016.01.006

Russell, J. A. (1980). A circumplex model of affect. J. Pers. Soc. Psychol. 39, 1161–1178. doi: 10.1037/h0077714

Sabatinelli, D., Bradley, M. M., Fitzsimmons, J. R., and Lang, P. J. (2005). Parallel amygdala and inferotemporal activation reflect emotional intensity and fear relevance. Neuroimage 24, 1265–1270. doi: 10.1016/j.neuroimage.2004.12.015

Sakaki, M., Niki, K., and Mather, M. (2012). Beyond arousal and valence: the importance of the biological versus social relevance of emotional stimuli. Cogn. Affect. Behav. Neurosci. 12, 115–139. doi: 10.3758/s13415-011-0062-x

Sander, D., Grafman, J., and Zalla, T. (2003). The human amygdala: an evolved system for relevance detection. Rev. Neurosci. 14, 303–316. doi: 10.1515/revneuro.2003.14.4.303

Seligman, M. E. P. (1971). Phobias and preparedness. Behav. Ther. 2, 307–320. doi: 10.1016/s0005-7894(71)80064-3

Sergerie, K., Chochol, C., and Armony, J. L. (2008). The role of the amygdala in emotional processing: a quantitative meta-analysis of functional neuroimaging studies. Neurosci. Biobehav. Rev. 32, 811–830. doi: 10.1016/j.neubiorev.2007.12.002

Spielberger, C. D., Gorsuch, R. L., and Lushene, R. E. (1970). State-Trait Anxiety Inventory, Manual for the State-Trait-Anxiety Inventory. Palo Alto, CA: Consulting Psychologist Press.

Stuber, G. D., Sparta, D. R., Stamatakis, A. M., van Leeuwen, W. A., Hardjoprajitno, J. E., Cho, S., et al. (2011). Excitatory transmission from the amygdala to nucleus accumbens facilitates reward seeking. Nature 475, 377–380. doi: 10.1038/nature10194

Talairach, J., and Tournoux, P. (1988). Co-Planar Stereotaxic Atlas of the Human Brain. New York, NY: Thieme.

Taylor, S. F., Phan, K. L., Decker, L. R., and Liberzon, I. (2003). Subjective rating of emotionally salient stimuli modulates neural activity. Neuroimage 18, 650–659. doi: 10.1016/s1053-8119(02)00051-4

Wager, T. D., Phan, K. L., Liberzon, I., and Taylor, S. F. (2003). Valence, gender, and lateralization of functional brain anatomy in emotion: a meta-analysis of findings from neuroimaging. Neuroimage 19, 513–531. doi: 10.1016/s1053-8119(03)00078-8

Williams, L. M. (2006). An integrative neuroscience model of “significance” processing. J. Integr. Neurosci. 5, 1–47. doi: 10.1142/s0219635206001082

World Medical Association. (2008). World Medical Association Declaration of Helsinki: Ethical Principles for Medical Research Involving Human Subjects. Ferney-Voltaire: World Medical Association.

Keywords: emotion processing, fMRI, evolutionary content, fear module, amygdala

Citation: Dhum M, Herwig U, Opialla S, Siegrist M and Brühl AB (2017) Evolutionary and Modern Image Content Differentially Influence the Processing of Emotional Pictures. Front. Hum. Neurosci. 11:415. doi: 10.3389/fnhum.2017.00415

Received: 25 February 2017; Accepted: 02 August 2017;
Published: 23 August 2017.

Edited by:

Srikantan S. Nagarajan, University of California, San Francisco, United States

Reviewed by:

Arun Bokde, Trinity College, Dublin, Ireland
Xin Di, New Jersey Institute of Technology, United States

Copyright © 2017 Dhum, Herwig, Opialla, Siegrist and Brühl. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Uwe Herwig, uwe.herwig@puk.zh.ch

†These authors have contributed equally to this work.
