Use of Crowdsourced Online Surveys to Study the Impact of Architectural and Design Choices on Wellbeing

Altaf, Basma; Bianchi, Eva; Douglas, Isabella P.; Douglas, Kyle; Byers, Brandon; Paredes, Pablo E.; Ardoin, Nicole M.; Markus, Hazel R.; Murnane, Elizabeth L.; Bencharit, Lucy Z.; Landay, James A.; Billington, Sarah L.

doi:10.3389/frsc.2022.780376

ORIGINAL RESEARCH article

Front. Sustain. Cities, 15 February 2022

Sec. Health and Cities

Volume 4 - 2022 | https://doi.org/10.3389/frsc.2022.780376

This article is part of the Research TopicInsights in Health and Cities: 2021View all 5 articles

Use of Crowdsourced Online Surveys to Study the Impact of Architectural and Design Choices on Wellbeing

Updated

A correction has been applied to this article in:

Corrigendum: Use of crowdsourced online surveys to study the impact of architectural and design choices on wellbeing
1. Read correction

Basma Altaf¹^*^†

Eva Bianchi¹^*^†

Isabella P. Douglas¹

Kyle Douglas¹

Brandon Byers¹

Pablo E. Paredes²

Nicole M. Ardoin³

Hazel R. Markus⁴

Elizabeth L. Murnane⁵

Lucy Z. Bencharit⁶

James A. Landay⁷

Sarah L. Billington¹

¹Department of Civil and Environmental Engineering, Stanford University, Stanford, CA, United States
²Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, United States
³Graduate School of Education and the Woods Institute for the Environment, Stanford University, Stanford, CA, United States
⁴Department of Psychology, Stanford University, Stanford, CA, United States
⁵School of Engineering, Dartmouth College, Hanover, NH, United States
⁶Department of Psychology and Child Development, California Polytechnic State University, San Luis Obispo, CA, United States
⁷Department of Computer Science, Stanford University, Stanford, CA, United States

There has been growing interest among scholars regarding the role of the built environment on occupant wellbeing. Across five studies conducted online from January 2018 to July 2021, we investigate the impact of design interventions (materials, light, and decor representing diverse identities) on several constructs indicative of wellbeing (sense of belonging, self-efficacy, and environmental efficacy), using self-reported metrics. We hypothesize that natural materials, natural light and diverse representations lead to higher self-reported scores compared to artificial materials, no natural light and non-diverse representations. We find that, while our results vary across individual experiments, the synthesized effects of materials and light on all three dependent measures hold consistent across studies, supporting our hypothesized outcomes. We also examine the influence of seasonality, survey platform and design, and independent variables' dosage on survey results. We conclude with a discussion on the challenges associated with researching the psychological as well as behavioral impacts of design interventions in indoor spaces.

Introduction

Urbanization and modern day lifestyles have caused the majority of Americans to spend 87% of their day indoors (Klepeis et al., 2001). As a result, buildings have acquired a strong potential to promote human wellbeing through physical design interventions. Recent years have seen an increase in the number of research studies exploring the impact of indoor built spaces on occupants. For example, studies show that built features in work environments can influence employees' physical health (Bornehag et al., 2001), productivity (O'Neill, 2010) and psychological wellness (Thatcher and Milner, 2014). Natural light and views to nature in the workplaces has been proven to boost creativity (Dul et al., 2011) as well as cognitive performance (Heschong, 2003). Presence of natural light and windows in hospital workspaces has also been shown to improve mood and communication (Zadeh et al., 2014). Presence of biophilic design elements in the workplace has been linked to lower stress levels (Yin et al., 2019). There is also limited research demonstrating the link between presence of natural material (wood) and preference for a space in a hospital setting (Nyrud et al., 2014). However, a deeper understanding of how specific indoor environment design interventions affect wellbeing is desirable.

Studying human subjects in realistic physical spaces may be time-intensive and costly. Online, survey-based studies offer a quicker, lower-cost method of investigating the potential impact of physical spaces on wellbeing, and they provide preliminary results that can be used to design human-subjects experiments with targeted design interventions in laboratory and field settings. The use of online crowdsourcing platforms in particular has many benefits, such as data reliability and scalability, as well as access to a more demographically diverse participant sample compared to student subjects, who can sometimes be overrepresented in some lab-based human subject experiments (Sears, 1986; Behrend et al., 2011; Wazny, 2018). Increased participants' diversity means that difficult-to-reach populations or populations with more work experiences compared to a typical university sample, for e.g., are more likely to be included in the data collection process.

In this paper, we compare results from five online studies focused on evaluating the impact of various physical interventions on wellbeing outcomes. Our work focuses on exploring two key dimensions of human wellness: social and psychological wellness (Hettler, 1976). In the studies discussed in this publication, we focus on investigating the effects of materials (natural vs. artificial), light (natural vs. no natural), and representation (diverse vs. non-diverse) on three specific constructs of wellbeing: sense of belonging and self-efficacy, which explore social and psychological dimensions of wellbeing (Hettler, 1976), as well as environmental efficacy, which is also correlated with human wellbeing (Kaida and Kaida, 2016). Sense of belonging refers to seeing oneself as socially connected, and it has been shown to promote stress reduction (Bolger et al., 2000). Self-efficacy relates to an individual's capability to exercise control over his/her behavior and actions (Bandura, 1997). Environmental efficacy is an individual's belief of how able they perceive themselves of reducing their negative impact on the environment by engaging in pro-environmental behavior (Sellers et al., 2014). Materials (Nyrud and Bringslimark, 2010) and light (Edwards and Torcellini, 2002) have been studied in the context of indoor spaces and wellbeing before, but, to our knowledge, not in relation to their impact on sense of belonging and environmental efficacy. Additionally, the selection of representation as an independent variable was motivated by prior work on sense of belonging and symbols in the physical environment (Cheryan et al., 2009). With these variables, we hypothesize that natural materials (vs. artificial materials), natural light (vs. no natural light), and diverse representations (vs. non-diverse representations) will lead to higher self-reported scores on the wellbeing metrics of belonging, self-efficacy, and environmental efficacy.

An additional objective of our work is to assess the advantages as well as challenges of conducting research using online platforms to provide guidelines and recommendations for effectively executing and evaluating future work using similar methodological approaches. We selected two online crowdsourcing recruitment platforms for our research, Amazon Mechanical Turk (MTurk) and Prolific. In recent years, MTurk has emerged as a prominent crowdsourcing platform for research (Paolacci and Chandler, 2014), and several studies report MTurk as a source of representative data and reliable results (Buhrmester et al., 2016). However, there are growing concerns of participant non-naivety when using MTurk to conduct scholarly research (Chandler et al., 2015) as well as concerns about poor data quality as many MTurk workers have become “professional survey takers” and may no longer pay as much attention to individual surveys. Prolific is a relatively newer platform that provides similar capabilities to MTurk and offers access to more naive populations (Peer et al., 2017). In addition to exploring different online study platforms, we also administered the study at different points of time over a longitudinal period and examined our research questions using different experimental designs.

Materials and Methods

As mentioned, a key goal in undertaking these studies was to employ a variety of methodological setups (e.g., recruitment platform, timeframe, experimental design) in order to explore how these different choices may impact outcomes. This section reviews such differences, commonalities, and other details of the five studies we performed.

All five studies were developed on the cloud-based software platform Qualtrics, and were completed by respondents located in the USA and satisfying a 95% approval rate by other researchers. Table 1 provides an overview of the studies, with more details about characteristics of the studies given in the subsequent sections. All studies were covered by the same Institutional Review Board (IRB) protocol (#48481), and participants were paid $1.80 for Studies 1–4 (10–15 min), and $1 for Study 5 (2–3 min). Table 2 shows the demographic distributions of the respondents for each survey, after exclusions (exclusions are described in Section Studies 1, 2, and 3: independent conditions design).

TABLE 1

Table 1. Overview of study designs.

TABLE 2

Table 2. Survey demographics after exclusions.

Studies 1, 2, and 3: Independent Conditions Design

Studies 1, 2, and 3 had identical experimental designs with independent conditions and were conducted on MTurk at different times of the year, thus enabling us to test our hypotheses across different seasons. Study 1 was conducted before the COVID-19 outbreak while Studies 2 and 3 were conducted during the pandemic.

The survey for the studies was framed as a research study on the modern workplace. These surveys used a within-subject design approach, meaning that each participant was shown all six study conditions in a randomized order: artificial materials, natural materials, no natural light, natural light, diverse representations, and non-diverse representations. For each condition, five pictures were shown. The pictures were photographs taken with a mobile device on the lead authors' university campus, in Northern California (Figure 1). The pictures were taken in various types of spaces found across a university such as co-working spaces, communal areas, conference rooms, and corridors. Pictures selected for the materials and light conditions were chosen so that all sets contained a similar balance of different types of spaces. Additionally, for the natural light set, the size and resolution of the pictures did not allow our participants to clearly distinguish the content of the views through the windows. Picture selection involved a careful review of the interactions between our different variables of interest. For instance, between the artificial and natural materials sets, we sought to have a similar balance of photographs with and without natural light in both sets.

FIGURE 1

Figure 1. Pictures taken around the lead authors' university campus used in Studies 1, 2 and 3.

For each condition, participants were given the following prompt: “These images depict spaces and decorations in your new workplace. Please imagine yourself in this workplace and answer the questions that follow.” They then answered a fixed set of eight questions: three for sense of belonging in the space, two for self-efficacy, and three for environmental efficacy. All questions were in the form of 7-point Likert scales, ranging from 1 = “Strongly Disagree” to 7 = “Strongly Agree.” The eight questions were:

“I feel like I belong in this space.”

“I am similar to the kind of people who succeed in this space.”

“I feel like other people in this space have the same values as me.”

“If someone opposed me in this space, I could find means and ways to get what I want.”

“It would be easy for me to stick to my aims and accomplish my goals in this space.”

“I would recycle newspaper in this space”

“I would recycle cans or bottles in this space”

“I would pick up litter that is not my own in this space”

These questions were adapted from previously established instruments; i.e., the Sense of Social Fit Scale (Walton and Cohen, 2007) for the sense of belonging construct; the General Self-Efficacy Scale (Schwarzer and Jerusalem, 1995) for the self-efficacy construct, and the Environmental Attitudes Inventory (Milfont and Duckitt, 2010) for the environmental efficacy construct. The Cronbach's alpha values for all studies are reported in Table 3.

TABLE 3

Table 3. Cronbach's alpha value ranges^a across all studies.

Data exclusions for Study 1 and Study 2 were based on data integrity assessed by looking at unnatural answer patterns and self-reported attention issues. Unnatural answer patterns included the same score repeated across all questions, alternating values, and increasing or decreasing scores. Additionally, three concluding questions were added to the survey to gauge attention and ask if the participant had given the survey their full attention, if they read each question carefully, and if they believed we should use their data and responses. Participants who answered “no” to any one of the three final questions were excluded.

This exclusion approach was introduced to reduce the likelihood of using bots' data in our analyses (Chmielewski and Kucker, 2020). The data exclusion approach was updated in Study 3 to consist of three attention check questions systematically included at different points in the survey, and participants who failed two or more attention checks were discarded from the analysis. Attention checks were questions with evident answers such as “Please select “Slightly Disagree,” “Please select the color blue,” or “What was this survey about?” Participants were also checked for patterned answers and excluded using a similar criteria as Studies 1 and 2.

Studies 4 and 5: Factorial Design

For Studies 4 and 5, we produced a new set of images that feature a single, realistic office environment. The new pictures were photographs taken from a human-subjects laboratory experiment conducted on the lead authors' university campus (Figure 2). This experiment involved eight different room layouts, each containing a different combination of material, light, and representation type. Variations in material types were seen with different furniture and frame materials (natural wood for natural materials, and white or black laminate for artificial materials). Variations in light were represented by either the absence or presence of a large window in the room. We note that the natural light conditions included a window with a view of nature, as shown in Figure 2. For parsimony, we refer to this condition as natural light, but we are aware that the presence of a window could be a confounding factor impacting wellbeing outcomes. Variations in representations were shown with framed photographs, displaying either only white men (non-diverse), or more mixed gender and racial identifications (diverse). To enhance visibility of the representations in the wall photographs, these images were shown both in the room photograph and separately, up close. Figure 3 shows two examples of a participant's view during the online experiment and Figure 4 shows the non-diverse and diverse photographs used. The same eight questions for our measures from Studies 1–3 were used for Studies 4 and 5 with an additional qualitative question added at the end asking respondents what stood out between the different conditions they saw, in order to evaluate the efficacy of our manipulations in these new photographs.

FIGURE 2

Figure 2. Pictures of the two laboratory rooms located on the lead authors' university campus that were used in studies 4 and 5.

FIGURE 3

Figure 3. Comparison of participants' view for two of the eight laboratory room conditions: (A) artificial materials, artificial light and non-diverse representation, and (B) natural materials, natural light and diverse representation.

FIGURE 4

Figure 4. Original diversity pictures framed inside the laboratory rooms shown in Figures 2, 3.

Studies 4 and 5 were conducted using the online platform Prolific in an effort to reach a more attentive, and naive pool of participants (Peer et al., 2021). Exclusions were based on the same criteria as for Survey 3, excluding participants who failed more than one out of three attention checks for Study 4, or the one attention check for Study 5 (because of between-subjects design), or who provided unnatural, patterned answers (refer to Table 1 for exclusion data).

Within-Subjects Design (Study 4a)

Study 4a was a within-subject design wherein each participant was shown all eight study conditions in a randomized order and asked to answer our eight questions after each set of photographs. Cronbach's alpha ranges for this survey are reported in Table 3.

Between-Subjects Design (Study 4b and Study 5)

A concern regarding survey fatigue arose when designing Study 4a, as we speculated that the repetition of our measurements eight times (as opposed to six times in Studies 1–3), in conditions with similar room pictures, could lead to inattentive participants, increasing the chances of survey satisficing. To assess whether this issue impacted our results, we began by analyzing data from Study 4a using a between-subject approach, which we refer to as Study 4b. A between-subjects design refers to studies where participants only see one condition, randomly assigned. For Study 4b, we therefore extracted survey data for only the first room that each participant saw in Study 4a (maintaining the same exclusions from Study 4a), and repeated the analyses. The number of participants who had seen each condition as their first room ranged from 45 respondents in the condition least often seen first (artificial materials, natural light, non-diverse representation), to 69 participants in the most common condition seen first (artificial materials, no natural light, diverse representation). Cronbach's alpha values for Study 4b are also reported in Table 3.

Our final study, Study 5 was an intentional between-subjects experiment with the same photograph sets from Study 4 where only one condition was shown to each participant. As this study was designed with the purpose of being a between-subjects survey (unlike Study 4b), the number of participants per condition after exclusions was more balanced than for Study 4b. The two least seen rooms (having natural materials, no natural light, and diverse representation and having natural materials, natural light and non-diverse representation) had 56 participants each, and the two most viewed rooms (having artificial materials, no natural light, diverse representation and having natural materials, no natural light, non-diverse representation) had 60 participants each.

For this between-subject experiment, Cronbach's alpha values ranges are given in Table 3. We confirmed with our statistical analysis that the low Cronbach's alpha for self-efficacy scores in studies 4b and 5 did not affect the validity of our results as noted in Table 3.

Results

Analysis

Individual Studies

Data was analyzed using analysis of variance (ANOVA) to compute the main effects of our different intervention treatments. For within-subject designs (Studies 1, 2, 3, and 4a), we used repeated measures ANOVA to account for participants seeing all the conditions. For the factorial design experiments (Studies 4a, 4b, and 5), we conducted three-way ANOVA to integrate the interaction effects between our independent variables. In order to control for the influence of covariates on our results, we also conducted analysis of covariance (ANCOVA) on all study datasets. The covariates were gender identification (man or woman), racial identification (white or non-white), and highest level of education achieved (above or below a bachelor's degree or equivalent) based on findings from prior research (Steiner et al., 2010). We then used these covariates to conduct mixed-model ANOVA analyses for all the studies to identify significant interactions between the covariates and each one of our independent variables. As per standard practice, respondents with non-binary gender identifications, or those who had answered “Prefer Not to Say,” were excluded from the ANCOVA and mixed ANOVA analyses. All of our statistical models were carried out using the R-4.0.2 computing environment. To compute the sense of belonging, self-efficacy, and environmental efficacy scores, we generated the mean score for the three items for belonging, the two for self-efficacy, and the three for environmental efficacy, respectively.

Meta-Analysis

We ran a meta-analysis on the five studies (excluding Study 4b) to synthesize the effect of each one of the three independent variables (materials, light, and representation) on our three dependent variables (belonging, self-efficacy, and environmental-efficacy). Study 4b was not included in the meta-analysis as it was not an independent study and the data for that study was taken from Study 4a. We combined all within-subjects studies (Studies 1–3 and 4a) and the between-subject study (Study 5) for the meta-analysis. The decision to include all studies in the same meta-analysis was based on the argument presented by Morris and DeShon (2002) that it is acceptable to combine within-subjects and between-subjects study designs when (a) a common metric is used for the effect sizes across within-subjects and between-subjects study designs, (b) the same treatment effect is estimated by effect sizes in different study designs¹, and (c) sampling variance estimates used for the meta analysis are design-specific.

The common effect size metric used for all studies was change-score metric, as four of the studies had repeated measures design. The raw-score effect size for between-subjects study (Study 5) was therefore converted to change-score metric. The effect size estimates were computed using t-statistics. The sampling variances were calculated accounting for the effect size metric as well as the study design used. The meta-analysis was carried out using R version 4.0.2, with a random-effects model and restricted maximum likelihood for the estimation of variance-weighted mean effect sizes. A moderation analysis was also performed to test if the study design (within-subjects vs. between-subjects) was acting as a moderator of the effect size to ensure that the effect sizes did not differ significantly across the different study designs used, after they were transformed into a single effect size metric (i.e., change-score metric).

Results Per Independent Variable

Our results are organized for each independent variable with subsections for each dependent variable. The ANOVA results are reported using p-value (p), F ratio (F), degrees of freedom (df), and effect size ( $η_{g}^{2}$ ). Significant thresholds for p-value are < 0.05 (*), < 0.01 (**) and < 0.001 (***). Unless otherwise specified in the text, our results held after controlling for covariates with the ANCOVA model. Table 1 in the Supplementary Materials submission provides the detailed results from analysis of covariance. Table 2 in the Supplementary Materials submission provides additional statistical details from analysis of variance with interaction effects.

Materials

Table 4 presents the main effects obtained with ANOVA, as well as significant interactions for materials. The interactions reported in the tables are those produced by the mixed ANOVA, whereas the text sections discuss the post-hoc analysis results detailing the directions of the significant interactions. Table 5 shows the mean scores, standard deviations, and 95% confidence intervals for natural and artificial materials. Possible scores ranged from 1 (“Strongly Disagree”) to 7 (“Strongly Agree”).

TABLE 4

Table 4. ANOVA and mixed ANOVA results for materials for all three dependent variables.

TABLE 5

Table 5. Mean, standard deviation, and 95% CI scores obtained for materials for all the studies.

Sense of Belonging

Participants' self-reported sense of belonging scores were higher for natural materials across Study 1, Study 2, and Study 3, with both our ANOVA models showing significant results, as shown in Table 4. In Study 1, participants who identified as white women reported particularly higher belonging scores in the natural materials condition (M = 5.462, SD = 1.227) over its artificial counterpart (M = 4.059, SD = 1.625, p < 0.001). An interaction with participants' race was also observed, leading white people to express a strong preference for natural materials (M = 5.258, SD = 1.367) over artificial materials (M = 4.291, SD = 1.580, p < 0.001).

Data from Study 2 revealed that white women again reported stronger feelings of belonging with natural materials (M = 5.529, SD = 0.988) compared to artificial materials (M = 4.954, SD = 1.437, p < 0.001). Participants who reported having achieved less than a bachelor's degree also scored higher with natural materials (M = 5.140, SD = 1.306) over artificial materials (M = 4.473, SD = 1.432, p = 0.008). Additionally, Study 2 showed that white women who reported having achieved less than a bachelor's degree scored higher with natural materials (M = 5.381, SD = 1.02) over artificial materials (M = 3.714, SD = 1.709, p = 0.006). There was a similar result for white identifying women with bachelors or more reporting higher belonging scores in the natural materials condition (M = 5.561, SD = 0.986) compared to the artificial materials condition (M = 5.217, SD = 1.234, p = 0.028).

In Study 3, participants who had indicated having achieved less than a bachelor's degree were driving the main effect by showing a strong preference (p < 0.001) for natural materials (M = 5.08, SD = 1.33) over artificial materials (M = 4.36, SD = 1.59).

Analyses for Study 4a, Study 4b, and Study 5 revealed that, although participants' self-reported belonging scores were higher for natural materials than artificial materials, this difference was not statistically significant. A single significant interaction was found in Study 4a between race and materials, with white people self-reporting higher belonging scores with natural materials (M = 4.434, SD = 1.540) than artificial materials (M = 4.29, SD = 1.61, p < 0.001), while people of color preferred the artificial materials condition (M = 4.375, SD = 1.582) over natural materials (M = 4.26, SD = 1.58, p = 0.031).

Self-Efficacy

Participants' self-reported self-efficacy scores were statistically higher for the natural materials scenario compared to artificial materials in Study 1, with p-values below 0.001. However, in Study 2, although our ANOVA model produced significant results, this significance was not maintained when controlling for covariates, and the gender of participants was identified as the moderating covariate leading to null ANCOVA results. Study 3 returned significant results even after controlling for covariates with, once more, a preference for natural materials.

In Study 1, white women were driving the main effect by scoring significantly higher for natural materials (M = 5.441, SD = 1.066) than artificial ones (M = 4.401, SD = 1.321, p < 0.001).

In Study 2, white women once more expressed more self-efficacy with natural materials (M = 5.538, SD = 1.158) than they did with artificial materials (M = 4.963, SD = 1.321, p < 0.001). Also, in Study 2, women expressed higher self-efficacy scores in natural materials condition (M = 5.583, SD = 1.105) than they did with artificial materials (M = 5.117, SD = 1.282, p < 0.001). Further, Study 2 showed that people who associated as white expressed higher self-efficacy scores in the natural materials condition (M = 5.217, SD = 1.277) than they did with artificial materials (M = 5.079, SD = 1.172, p = 0.013). There was also a significant race by education by materials interaction where people who associated as whites and had less than a bachelor's degree expressed higher self-efficacy scores in the natural materials condition (M = 5.375, SD = 1.091) than they did with artificial materials (M = 4.528, SD = 1.270, p = 0.005). A gender by race by education by materials significant interaction was found where white women with less than (p = 0.004) as well as more than (p = 0.010) a bachelor's degrees reported higher levels of self-efficacy in the natural materials condition (M = 5.786, SD = 0.893) and (M = 5.484, SD = 1.206) compared to the artificial materials condition (M = 4.036, SD = 1.562) and (M = 5.159, SD = 1.187).

Data from Study 3 revealed that participants with an education level below the bachelor's degree were driving the main effect by showing a strong preference (p < 0.001) for natural materials (M = 5.14, SD = 1.31) over artificial materials (M = 4.54, SD = 1.45).

The results from Study 4a, Study 4b, and Study 5 showed no significant main effect for materials and self-efficacy (Table 4), despite natural materials leading to higher self-efficacy scores than artificial ones (Table 5). One statistically significant interaction was identified during Study 4a between race and materials, with white-identifying participants reporting higher self-efficacy in the natural materials setting (M = 4.58, SD = 1.33) over the artificial materials condition (M = 4.51, SD = 1.38, p = 0.025), and people of color in the artificial materials condition (M = 4.46, SD = 1.35) over natural materials (M = 4.36, SD = 1.37, p = 0.033).

Environmental Efficacy

Environmental efficacy results for Study 1, 2, and 3 showed a statistically significant difference between the effect of natural and artificial materials, with natural materials leading to higher self-reported scores.

In terms of significant interactions, participants from Study 1 who identified as white women reported higher levels of environmental efficacy when shown pictures of natural materials (M = 5.937, SD = 1.207) over artificial materials (M = 5.182, SD = 1.478, p < 0.001). White men also expressed higher environmental efficacy in the natural material condition (M = 5.633, SD = 1.428) over the artificial one (M = 5.408, SD = 1.369, p = 0.043). Similarly, men of color reported higher environmental efficacy in the natural material condition (M = 5.657, SD = 1.560) over the artificial one (M = 4.899, SD = 1.452, p = 0.007).

Similarly, for Study 2, higher levels of environmental efficacy in the natural materials condition were reported by both women (M = 5.807, SD = 1.099) and men (M = 5.589, SD = 1.199) over the artificial materials condition (M = 5.444, SD = 1.225) and (M = 5.435,SD = 1.224, p < 0.001) and (p = 0.024). Additionally, for Study 2, people of color with a bachelor's degree or more reported higher levels of environmental efficacy in the natural materials condition (M = 5.861, SD = 1.129) over the artificial materials condition (M = 5.592, SD = 1.257, p = 0.006). Finally, people who self-identified as white and had less than a bachelor's degree also reported higher levels of environmental efficacy in natural materials condition (M = 5.778, SD = 1.247) over the artificial materials condition (M = 5.111, SD = 1.168, p < 0.001).

An interaction between gender and race was observed in Study 3 with both men of color (p = 0.006) and white women (p = 0.005) preferring natural materials (Men of color: M = 5.75, SD = 0.98; White women: M = 5.65, SD = 1.13) over artificial materials (Men of color: M = 5.42, SD = 1.22; White women: M = 5.37, SD = 1.36).

The data from Study 4a, Study 4b, and Study 5 showed no significant difference in self-reported environmental efficacy between natural and artificial materials. While the ANOVA models reported null results, the mixed ANOVA analyses showed a significant interaction between gender and race for Study 4b only, with men of color reporting higher environmental efficacy scores with natural materials (M = 5.80, SD = 0.97) over artificial materials (M = 5.11, SD = 1.29) [p = 0.01, t₍₇₁₎ = 2.65, CI = (0.17, 1.22)].

Light

Table 6 shows the main effects and significant interactions obtained for light type. The interactions in the tables are those revealed by the mixed ANOVA, whereas the text discusses the post-hoc results and details the directions of the interactions. Table 7 shows the mean scores, standard deviations, and 95% confidence intervals for natural and artificial light. Possible scores ranged from 1 (“Strongly Disagree”) to 7 (“Strongly Agree”).

TABLE 6

Table 6. ANOVA and mixed ANOVA results for light for all three dependent variables.

TABLE 7

Table 7. Mean, standard deviation, and 95% CI scores obtained for light for all the studies.