Skip to main content

ORIGINAL RESEARCH article

Front. Virtual Real., 11 January 2023
Sec. Technologies for VR
Volume 3 - 2022 | https://doi.org/10.3389/frvir.2022.945791

Seeing the faces is so important—Experiences from online team meetings on commercial virtual reality platforms

  • Digital Media Lab, University of Bremen, Bremen, Germany

During the COVID-19 pandemic, online meetings became common for daily teamwork in the home office. To understand the opportunities and challenges of meeting in virtual reality (VR) compared to videoconferences, we conducted the weekly team meetings of our human-computer interaction research lab on five off-the-shelf online meeting platforms over 4 months. After each of the 12 meetings, we asked the participants (N = 32) to share their experiences, resulting in 200 completed online questionnaires. We evaluated the ratings of the overall meeting experience and conducted an exploratory factor analysis of the quantitative data to compare VR meetings and video calls in terms of meeting involvement and co-presence. In addition, a thematic analysis of the qualitative data revealed genuine insights covering five themes: spatial aspects, meeting atmosphere, expression of emotions, meeting productivity, and user needs. We reflect on our findings gained under authentic working conditions, derive lessons learned for running successful team meetings in VR supporting different kinds of meeting formats, and discuss the team’s long-term platform choice.

1 Introduction

The COVID-19 pandemic required many people to work from the home office. Online meetings with video conferencing software became omnipresent after this trend had been promised for decades (Nilles, 1975). Although video calls provide many advantages, such as seeing meeting participants or allowing for screen sharing, they still yield limitations, such as restricted social interaction between participants. Social virtual reality (VR) platforms might provide beneficial alternatives for online team meetings by gathering everyone in one virtual room. Previous studies showed positive psychological effects of social VR platforms (Barreda-Ángeles and Hartmann, 2022) and that group behaviors and emotional responses to it are largely similar to face-to-face encounters (Moustafa and Steed, 2018). However, it seems that the generation of ideas as a creative process is hindered (Brucks and Levav, 2022). Beyond the faithful reproduction of in-person meetings, research further explores the possibilities of VR for enhancing social encounters and collaboration outside the restrictions of reality (Slater and Sanchez-Vives, 2016; McVeigh-Schultz and Isbister, 2021a).

With an increasing number of companies investing in immersive technologies and the future of the metaverse, social encounters in virtual environments (VE) will become commonplace in the foreseeable future. We can already observe this trend on platforms for personal use where users meet in VR to socialize, play, and experience cultural events together (Sykownik et al., 2021)—even more pronounced during the pandemic (Rzeszewski and Evans, 2020). This development also extends to the business context. In 2022, 52% of employees were open to having meetings or team activities in the metaverse (Microsoft, 2022). The real estate company eXp Realty is operated entirely in virtual offices with more than 75,000 employees1. Numerous small companies and start-ups have launched collaborative platforms for meetings in virtual space. Also, large companies are preparing for a future in which professionals get together in virtual or blended realities, such as with Microsoft Mesh2 or Horizon Workrooms by Meta3. Enabled by rapid technological development, immersive teamwork is more topical than ever. Still, many advertised features are visions of the future and blur the public perception of present possibilities.

To explore current opportunities and restrictions of commercial VR meeting software and to gain a deeper first-hand understanding of the advantages and drawbacks of authentic meetings in VR, we conducted the regular team meetings of our university’s human-computer interaction (HCI) research lab in VR to compare the attendees’ experiences with meetings on video conferencing platforms. This experimental shift from video-only platforms was intrinsically motivated, not by conducting this study. Therefore, the authors had the opportunity to independently evaluate the experiment, resulting in genuine insights from experiences in the wild. Over 4 months in 2020, we conducted twelve meetings on five different platforms–seven in VR and five on video conferencing or hybrid platforms for comparison. The different platforms are shown in Figure 1. We accompanied the team members from the expectations before the first VR meeting to the final conclusions on the long-term software choice by collecting feedback right after each meeting through online surveys. As part of the lab team, the authors also attended the meetings, which enabled an additional autoethnographic perspective to reflect on the meetings with different platforms.

FIGURE 1
www.frontiersin.org

FIGURE 1. The off-the-shelf platforms compared in the wild for our weekly team meetings: (1) Our status quo, Zoom, compared to the two virtual reality platforms (2) AltspaceVR and (3) Engage, as well as the hybrid (4) Gather Town combining video feeds with a spatial environment. StarLeaf was also used in the case study but is not shown here as it has a similar interface to Zoom.

With this, our case study adds to the growing body of research that investigates the gathering of people on various professional occasions on VR platforms. Previous studies explored attendees’ experiences at professional social events in VR, e.g., at academic conferences and workshops (Erickson et al., 2011; Kirchner and Nordin Forsberg, 2021; Lahlou et al., 2021; Williamson et al., 2021), and group dynamics in social VR outside professional context (Moustafa and Steed, 2018; Scavarelli et al., 2021). However, in contrast to the existing body of work, this case study focuses on the participants’ personal experiences during regular online team meetings in different mediums over an extended period. Thereby, it covers both VR and video conferencing solutions. Our study aims to explore whether currently available off-the-shelf social VR platforms meet the needs and preferences of the team members for attending weekly lab meetings as a competitive alternative to conventional video conferencing solutions. Based on 200 completed survey responses, we analyze ratings and statements of the meeting experiences and the different aspects that influenced these experiences. Our analysis identifies central themes that matter for VR meetings and provides unique insights into the experiences of meeting on various platforms. While most of the individual themes have previously been discussed in the literature as isolated elements, encountering them integrated into authentic experiences of participants who compare it to videoconferencing in an in-the-wild study allowed us to assess and reconsider various factors, identify priorities, and add new aspects.

2 Related work

Research on computer-mediated collaboration has a long history. A broad landscape of literature provides an understanding of how people can work together remotely comprising various modalities, cultures, technologies, and goals of remote work (Galegher et al., 2013; Raghuram et al., 2019). Video-based meeting solutions received much attention in the late 1990s (Finn et al., 1997; Hinds, 1999) and provided strong indications that video-mediated meetings can have an equally good quality as face-to-face meetings (Olson et al., 1997). Still, for decades, videoconferencing was no competitive substitute for in-person meetings for large parts of the general public–or not explored as one–until an enforced shift due to the COVID-19 pandemic (OECD, 2021).

Along with the rise of videoconferencing platforms, the term “Zoom fatigue” was prominent in public and scientific discourse (Shockley et al., 2021). It describes the exhaustion after holding many or long meetings as videoconferences. Possible causes are suspected to be the cognitive load, always seeing oneself, and reduced mobility during video calls (Bailenson, 2021), but also spatial reduction of the conversation partners and their background to a 2D projection (Nadler, 2020) calling for mitigation or alternatives. According to Shockley et al. (2021), group belongingness was found to be the most consistent factor protecting from videoconference fatigue. Correspondingly, early research suggested that in remote collaboration, the employed technologies impact the communication outcome dependent on the interpersonal interactivity (Burgoon et al., 1999). While collaborating via videoconferences has been compared to other modalities, such as audio-only or in-person meetings (Ochsman and Chapanis, 1974; Daft and Lengel, 1986; Burgoon et al., 1999; Hinds, 1999; Bailenson, 2021), this work adds to the growing body of research with direct comparison to collaboration in VR. The following two sections will describe previous work on characteristics of social VR that might foster interpersonal interactivity and the feeling of group belongingness or mitigate limitations of videoconferencing in other ways. We further discuss how VR technology has previously been used for social encounters in a professional context.

2.1 Social interactions in VR

People use social VR platforms for a broad range of reasons. The strongest motives are meeting people, staying in contact, and experiencing social presence (Sykownik et al., 2021). Interacting in VEs offers considerable potential for high social presence (Short et al., 1976) as the spatial nature of the medium allows users to encounter other people in a shared space and affords complex social interactions and group dynamics. Social presence as “the connection of people via telecommunication systems” has been a central concept for comparing various forms of computer-mediated collaboration for decades (Nowak, 2001). Multiple factors affect how people interact with each other in VEs, including their visual and behavioral (self-)representation, the perceived agency of others’ avatars, and potentially immersion (Kyrlitsias and Michael-Grigoriou, 2022). Different studies yield inconclusive results on the effect of visual realism on social presence (Nowak and Biocca, 2003; Bente et al., 2008; Kang and Watt, 2013; Zibrek and McDonnell, 2019) and imply dependence on other aspects of the simulation such as behavioral realism (Bailenson et al., 2005), which is generally a powerful predictor of social presence (Oh et al., 2018), such as turn-taking in a conversation (Bailenson et al., 2004). The social VR platforms tested in this case study differ considerably in the realism of virtual human representations introducing an interesting testbed with variance.

Further, sound is an important consideration for social interactions in VR. Spatial audio has been shown to impact the user’s sense of presence (Poeschl et al., 2013; Kern and Ellermeier, 2020). Although it allows the user to identify the direction of different audio sources, concurrent talking of several people poses challenges for listeners in immersive environments. It can be effectively mitigated by helpful visual cues (Gonzalez-Franco et al., 2017). Besides the representation of virtual humans and verbal conversations, social VR interactions are influenced using non-verbal communication. On various social VR platforms, previous studies investigated the role of non-verbal cues and limitations in essential aspects of it and discussed similarities or discrepancies to non-verbal behavior patterns offline (Yee et al., 2007; Wigham and Chanier, 2013; Maloney et al., 2020; Tanenbaum et al., 2020). For instance, group arrangements, proxemics, and the preservation of personal space have been found to resemble offline behavior to a large extent (Bailenson et al., 2001; Yee et al., 2007; Hecht et al., 2019; Williamson et al., 2021) with adequate emotional responses to it (Wilcox et al., 2006). Also, previous research has shown that people can demonstrate similar social responses in virtual simulations as in reality. For example, in a study investigating compliance as done in the experiment by Milgram and Gudehus. (1978), the participants were equally or more compliant in VR compared to a control condition in physical reality (Dzardanova et al., 2021). Unfortunately, a problem that transfers from offline social interactions to virtuality, as well, is harassment (Freeman et al., 2022). As the communities are still establishing social norms for VR interactions, their governance remains a challenge (Blackwell et al., 2019a; Blackwell et al., 2019b; McVeigh-Schultz et al., 2019).

Altogether, many factors determine how a person in social VR perceives and interacts with another virtual human. For the collaboration of whole teams, however, not only the interaction between two people must be considered. Related work explored group dynamics and team communication in social environments that can resemble but also go beyond the possibilities of video conferencing. Torro et al. (2021) describe the impact of non-verbal communication and spatial information of social VR and why this makes it a game-changing medium for organizations. Due to the possibility of simulating any imaginable communication process, the authors argue that social VR has the potential to exceed the communication effectiveness of video conferencing and real-world settings. As one reason, they state how new forms of group dynamics can be facilitated and how teams benefit from formal and informal encounters in VR (Torro et al., 2021).

2.2 Meeting in VR

Researchers have explored the opportunities of remote meetings and social gatherings for a long time. Although the technical capabilities looked entirely different in 1999, the outcome of a study by Ståhl still shows similarities to current work. The most significant needs concerned the feedback on the users’ connection in terms of visual attention, audibility, and network connectivity. In recent years, due to technological advancements and being incentivized by COVID-19, the body of literature on meetings and events in social VR is growing rapidly. Typically, the study designs in this research field are not lab-based and less controlled but of observatory nature and in the wild.

In this manner, Moustafa and Steed. (2018) conducted a longitudinal study on social interactions in small groups already known to each other moving their contact to social VR. The authors found that the participants experienced similar emotional states as in real-life socializing, which suggests high co-presence and transferability of existing group dynamics. Concerning educational purposes, a literature review explored interactions relevant in social xReality (XR) learning spaces and provided an overview (Scavarelli et al., 2021). For example, a study by Yoshimura and Borst. (2021) reports on class meetings experienced in VR comparing access with a head-mounted display (HMD) and in desktop mode. The students attended lectures and presented in VR. The experiences highly depended on how comfortable the attendees were with the HMD and if it made them sick. In another experiment, Ginkel et al. (2019) found a close resemblance between learning outcomes from training presentations in VR compared to face-to-face training.

Previous studies also explored the use of social VR environments in the academic community, such as social events at scientific conferences. While paper presentations can be effectively realized in videoconferences, it is much more challenging to enable virtual conference attendees to meet and connect with other participants. Yet, the informal exchange during coffee breaks and at receptions is essential to the success of academic collaborations. Therefore, organizers searched for virtual compensation, which was sometimes accompanied by scientific evaluation. For instance, Kirchner and Nordin Forsberg. (2021) organized a virtual conference where they held a reception on the VR platform Engage and performed a qualitative evaluation. The participants reported lively experiences but struggled with the spatial audio and discomfort from the HMDs. Further, the participants felt restricted in getting to know people they had never met before due to missing facial expressions. For fostering dynamic group conversations on similar occasions, Rogers et al. (2018) proposed displaying word clouds around groups hinting at the discussed topics to help strolling participants find a suitable group to join. Research by Williamson et al. (2021) analyzed the user proxemics during an academic workshop in VR. The results showed proxemic interactions between attendees that are congruent to physical settings and afford dynamic group formations dependent on properties of the VE. Beyond characteristics of personal space in social interactions known from reality, the possibility to enable participants to fly with their avatar added a dimension of user proxemics that was most notable during presentation situations.

A few studies investigated the social interactions at entire conferences in VR. The first evaluation of a virtual avatar-based conference by Erickson et al. (2011) comprised 500 attendees in Second Life4 and was considered a “reasonably successful” event. According to the authors, the system must afford the formation of small groups for having focused interactions while breaking up and remixing into other groups over time. The analysis showed that the loud spatial audio disrupted conversation privacy and led to increased distances between groups which inhibited remixing. Structured social events worked better for the participants than more informal settings. In the end, none of the interviewed attendees experienced the virtual substitute and the face-to-face conference. A decade later, Lahlou et al. (2021) studied a conference in VR accompanied by video calls. The authors state two goals that a VR installation must meet for successful conferencing: (1) the development of knowledge and (2) informal social interaction. They find that current solutions still lack opportunities for natural social encounters and relational space. The researchers recommend careful preparation for organizers and suggest special consideration of onboarding processes and catering for socializing. In the same year, the conference IEEE VR 2020 was held entirely virtually for the first time. The accompanying evaluation by Ahn et al. (2021) provides a detailed comparison between the usage of different media platforms and their appropriateness for typical conference tasks. Again, the results point out the social constraints of the VE but also show advantages for connecting compared to other text-based platforms. Here, the social VR platform was rated as most appropriate for socializing and networking and providing the highest social presence. Most attendees who joined the VE decided to access it via desktop computer, although many owned an HMD. While the authors advocate for making use of the unique possibilities of virtual conferencing, they echo related research by advising caution not to transfer the substituted offline event directly to virtual space but to adapt the format and purpose of the event. In this spirit, McVeigh-Schultz and Isbister. (2021b) argue for conceiving solutions of VR collaboration that deliberately deviate from direct replication of familiar social encounters. Instead of imitating a face-to-face work environment, the authors promote using the full potential of immersive technologies to create an enriched experience of remote collaboration. The approach taken in this case study provides a flexible setting for this as the attendees could test how different platforms can serve the purposes of the meetings and reinvent the format along the way. Without predetermined system choices and autoethnographic insights, the team could adjust the conduct of the meeting every week over several months and compare the experiences.

During the COVID-19 pandemic, researchers were challenged with finding appropriate and practical scientific methods to continue conducting studies (Nind et al., 2021). Research on computer-mediated communication has established various methods and tools for evaluating specific aspects of the behaviors and opinions of interlocutors, e.g., turn-taking and behavioral analyses to examine meeting dynamics (Samrose et al., 2021; Samrose et al., 2018) and simulated conversations (Abdullah et al., 2021), audio-visual capturing to understand communication characteristics (Byun et al., 2011; Koseki et al., 2020), linguistic analyses (Kramer et al., 2006; Fägersten, 2010), self-reports in interviews (Bleakley et al., 2022) and standardized questionnaires (Nowak and Biocca, 2003), autoethnographic methods (Mack et al., 2021), or a self-hosted and modified version of a social VR platform to understand the proxemics (Williamson et al., 2021). For this research, we rely on the conjunction of quantitative and qualitative methods from self-reports in a survey that includes standardized questionnaire items and open questions. Our approach allows longitudinal analysis through unique identifiers while preserving anonymity. Previous literature investigated the use of VR technology for conducting conferences or socializing events, focusing on the comparison to the face-to-face equivalent. On the other hand, this work explores the potential of holding team meetings in VR and compares it to video conferencing while investigating, in both cases, the similarity to face-to-face meetings. Instead of determining usability issues of individual social VR platforms, as previous research has done systematically (Liu and Steed, 2021), the personal experiences and needs of the meeting participants are of central interest in this case study.

3 Methods

Over 4 months, from August until December 2020, we evaluated twelve weekly meetings on different platforms (see Table 1 for an overview) by inviting the participants to fill out a questionnaire after each session. We used a mixed-method approach and collected quantitative and qualitative data to investigate the participants’ ratings of each meeting experience and their personal impressions.

TABLE 1
www.frontiersin.org

TABLE 1. Overview over the 12 evaluated meetings.

3.1 Participants

Generally, the weekly meeting is attended by all team members, including technical and administrative staff, undergraduate and Ph.D. students, postdocs, professors, and guests. This leads to a heterogeneous sample concerning technical literacy, previous experience with VR, and expectations towards the meetings. Participation was voluntary and strictly anonymous, so we did not collect any demographic data. Nevertheless, the participants rated their prior experience using three items: How often they use VR in general, how often they use multi-user VR applications, and whether they use VR as part of their work. From these three items, we calculated an overall prior experience score (min 0 to 10 max). On average, the 18 participants of meeting M1 had a prior experience score of 3.167 (SD = 2.41). All group members had access to VR hardware, i.e., HTC Vive (Pro), Valve Index, or Oculus Quest 1. In total, we had 239 meeting participations in the 12 meetings with 32 distinct attendees completing at least one questionnaire. Four participants attended and filled out the questionnaires for all 12 meetings. 12 participants responded after at least 10 meetings. In each meeting were between 15 and 24 attendees. We did not collect demographic data such as age or gender because this would disclose the participants’ identities due to the small group size. For the same reason, we did not ask about the HMD model used.

3.2 Evaluated meetings

The main purpose of the weekly team meetings was to report on and discuss current matters concerning the lab or its associates, which the group manager and administrative staff mostly did. Additionally, there were presentations by undergraduates and group members (in 50% of the reported meetings, at least once on each platform). At the end of the meetings, individuals and small groups would discuss matters with the group manager irrelevant to the whole team, while everybody else would already leave. On average, the meetings investigated in this study lasted 48.42 (SD = 22.02) minutes and had 19.91 (SD = 2.47) attendees with details shown in Table 1. Before the COVID-19 pandemic, the lab meetings were held in person. Between March 2020 and the beginning of this case study, they were on Zoom or StarLeaf.

3.3 Procedure and questionnaire

The meeting was joined by the attendees’ device of choice–an HMD (if applicable) or desktop client–and followed its regular structure. During the weekly meetings, the group discussed which platform to try next and when to change it. After each meeting, the researchers emailed all group members and guests linking to a Google Forms questionnaire. Each recipient, including the authors, could individually decide to take part in the survey or not, which resulted in an average response rate of 83.7% (SD = 6%). We made every effort to ensure there was no social pressure to participate and no fear of negative consequences for answering honestly. The questionnaire started with a consent form and a unique identifier (ID) chosen by each participant to identify repeated participation and allow longitudinal analysis anonymously. We asked how the meeting was accessed and found that HMDs were used in 92 out of 114 VR platform cases (80.7%). For the first session, we also assessed the prior experience with VR and expectations for the VR sessions. The questionnaire incorporated questions from the Social Presence Questionnaire by Nowak and Biocca. (2003), the User Burden Scale (Suh et al., 2016) and self-designed questions to answer on a 7-point Likert scale. Additionally, we asked open questions about the meeting experience, e.g., group interactions, comparisons to the other platforms, and memorable experiences. The questionnaire after the first and last session had a few additional items. After the first meeting, we asked about the participants’ general usage of VR, their usage of multi-user VR applications, whether they work with VR, and their expectations and worries for VR meetings. Additionally, the final survey after the last session incorporated questions about the preferred meeting platform and the reasons for the preference. In the supplemental materials, we provide an overview of all questionnaire items and, if applicable, their respective origins from standardized instruments.

3.4 Platforms

To gain insights into the suitability of openly available social VR services for team meetings in the wild, we solely used established and sophisticated platforms that allow integrating presentations and giving talks. The most critical factor for the software selection was the accessibility from all devices used by the attendees, as they were free to choose which device they preferred to access each meeting. Throughout the case study, we tested the VR platforms AltspaceVR5 (see Figure 2, meetings 2–6) and Engage6 (see Figure 3, meetings 7 and 8), the videoconference platforms Zoom7 (see Figure 1 (1), meetings 9, 10 and 12) and StarLeaf8 (meeting 1), as well as Gather Town9 (see Figure 4, meeting 11), a hybrid of a virtual 2D environment and video calls. We provide additional screenshots from the meetings and a video figure in the supplemental materials.

FIGURE 2
www.frontiersin.org

FIGURE 2. This shows the setup of the AltspaceVR platform. It was an indoor room with large windows and a presentation screen.

FIGURE 3
www.frontiersin.org

FIGURE 3. This shows the setup of the Engage platform. It was an outdoor scenario similar to an amphitheater with a presentation screen.

FIGURE 4
www.frontiersin.org

FIGURE 4. This shows the setup of the Gather Town platform. The scenery was an outdoor place in a park without a dedicated presentation screen. The lighter grey space identifies one coherent meeting area.

The virtual setup of AltspaceVR was based on a template provided by AltspaceVR. We used a template from the category “Talk Show” because it had a big screen for presentations and consisted of a room with large windows. This layout was inspired by the real setting the meetings were held in pre-Covid. We modified the virtual space with the group’s logo as visible in Figure 2. The table also resembled the U-shaped setup of our physical meeting room and gave the virtual room some structure. Participants could customize their avatar representation, mute themselves, react with emoticons, and move around the space. There was a room admin who could enable the megaphone feature for single persons to make them audible to the group in the whole room.

Similar to the setup of AltspaceVR, the meeting space of Engage was based on a template provided by the platform. The room design resembled an ancient outdoor forum with a U-shaped area to sit down to watch a stage. The stage featured a large presentation screen in the background. The meeting area also featured some educational exhibits and was placed on an island, viewing the Golden Gate Bridge in the distance. In contrast to the AltspaceVR settings, the people could actually sit down and were evenly spaced in the VE, creating the impression of a seated audience for the person on stage.

We relied on a pre-existing template for the Gather Town meeting space again. We chose a park setting (see Figure 4) featuring areas for socializing with games. We modified the template so that the area around the fountain was one meeting space–which is colored in light grey in Figure 4, where everyone inside could see the video of everyone else in this area (cf. Figure 1 (4)). Additionally, we added the two pumpkins between the upper benches to mark two tiles with a broadcast feature. The participants standing on these tiles were audible in the whole environment, similar to the megaphone feature of AltspaceVR. Outside of dedicated meeting areas, the visibility and audibility depended on the proximity of the participants, allowing the creation of small spontaneous groups.

3.5 Data analysis

The questionnaires collected quantitative data (7-point Likert scales) and qualitative (answers to open questions). The quantitative data were analyzed using factor analysis. As the questionnaires included various attributes that are not independent of each other, an exploratory factor analysis (EFA) with Varimax rotation on all questionnaire items was used to reveal hidden factors that group multiple related questionnaire items. This data-driven process allowed us to identify groups of questions that influenced the users’ ratings of their meeting experiences. The qualitative data of the questionnaires were analyzed using thematic analysis by identifying similarities and inductively creating themes. A team of four researchers individually coded three responses, and after that, they discussed and iterated the resulting codes until they agreed on a common code system. This system was used to code all qualitative responses once. We received 200 responses. 155 of them contained qualitative data in addition to the questionnaire items. After the additions to the code system from the first cycle were discussed, a different researcher coded all responses a second time. This process resulted in 1,228 codings using 107 codes.

4 Results and lessons learned

In the following, we provide our analysis of the quantitative data collected in subsection 4.1 as well as insights on the five themes derived from the qualitative data in subsections 4.2 to 4.6. The questionnaire responses to the Likert scale questions are evaluated with a focus on the ratings of the meeting experience. To provide authentic impressions and derive and discuss dominating themes that explain and contextualize the quantitative results, we present the qualitative insights in greater depth in the following. We report on the experiences and statements of the participants on the five main themes that the thematic analysis revealed: Spatial Aspects, Meeting Atmosphere, Expression of Emotions, Meeting Productivity, and User Needs. Each subsection concludes with a Lessons Learned paragraph discussing central insights and linking them to related work.

4.1 Ratings of the meeting experience

As part of our post-meeting questionnaire, we obtained feedback on 19 Likert-scaled question items, shown in Table 2. In the following, we present the results of our quantitative data analysis of the users’ ratings on those items.

TABLE 2
www.frontiersin.org

TABLE 2. Overview of the most important questionnaire items. Questions from standardized instruments are denoted with SPQ for the Social Presence Questionnaire by Nowak and Biocca. (2003) or UBS for the User Burden Scale by Suh et al. (2016). The constellations of all five questionnaires is detailed in the supplemental material.

Starting from meeting M2, we asked the attendees about their overall meeting experiences on a 7-point Likert scale (poor 1 to 7 great). Figure 5 shows a barplot with the results of the total meeting experience. With a one-way ANOVA, we found a significant effect of the meeting modality on the total meeting experience (F(3,178)=11.041,p<0.01,ηp2=0.157). Bonferroni-corrected post hoc tests confirmed that the total meeting experience in the video call meetings was significantly higher than in the VR meetings for both VR options, VR via desktop PC (t (30.454) = −4.876, p < .01, d = −1.429) and VR using an HMD t (137.635) = −5.340, p < .01, d = −.821). For Gather Town, we also observed an improved meeting experience for PC VR (t (35.009) = −3.509, p < .01, d = −1.062), and HMD VR (t (45.499) = −3.124, p = .019, d = −.537), and no difference for the video call platforms (t (35.186) = −1.461, p = .917, d = −.372). These results indicate an advantage of video calls and the hybrid platform Gather Town compared to the VR conditions. Nevertheless, we did not observe any difference between the VR options t (35.156) = −1.321, p = 1.00, d = −.290) and, therefore, we did not differentiate between these two VR variants in the following quantitative analysis.

FIGURE 5
www.frontiersin.org

FIGURE 5. Barplot of the total meeting experience assessed in meetings M2–M12 together with the results of Bonferroni-corrected paired t-tests.

Since the attendees had a very diverse pre-experience with VR technology, we investigated whether the pre-experience, as assessed in meeting M1, correlates with the meeting experience in the first VR meeting M2. Using a Spearman rank correlation, we found that the two variables were moderately correlated (Spearman’s r (13) = .514, p = .072).

However, the total meeting experience rating did not provide a conclusive impression of the users’ perception of the meetings. Only four participants attended all twelve meetings, which prevented a more detailed within-subject analysis in the following. Still, we wanted to investigate exploratively which attributes contribute to the user ratings to understand the observed differences.

4.1.1 Exploratory factor analysis

To explore the structure of user ratings and identify possible factors in the multiple questionnaire items of Table 2, we conducted an exploratory factor analysis on the additional feedback obtained through the Likert scale questionnaire items. Figure 6 presents a scree plot of the variance associated with each factor, representing the explained variance for each resulting factor. This helps us to obtain the number of factors. Factors with high levels of explained variance are important, interpretable contributors to the overall model, whereas later factors explain less variance. The scree analysis revealed three factors with an Eigenvalue above 1. Continuing with these three factors, we calculated the factor loadings. Figure 7 shows the loadings of all question items for the three factors. The loading of an attribute for a factor can be interpreted as the correlation between the question item and the factor. Therefore, the higher the absolute loading value, the stronger that attribute is tied to that factor and predicted by it. On the other hand, a loading of close to 0 indicates the absence of a (linear) relationship between the attribute and the factor.

FIGURE 6
www.frontiersin.org

FIGURE 6. Scree plot of variance explained by each factor, resulting from the factor analysis on questionnaire items.

FIGURE 7
www.frontiersin.org

FIGURE 7. Loadings of all attributes (larger absolute values represented in darker colors) for the three factors (F1–F3) in the exploratory factor analysis.

This allowed us to find interpretations of these factors. Factor F1 loads highest on the questions related to Involvement, F2 corresponds to Co-Presence items, and F3 to items around Privacy. Subsequently, we compare VR meetings, video calls, and Gather Town with each other according to the three factors. With a one-way ANOVA, we found a significant effect of the meeting modality on factor F1 Involvement (F(2,156)=4.263,p=0.016,ηp2=0.052). Bonferroni-corrected post hoc tests confirmed a difference between VR and video calls with t (126.17) = −3.195, p < .01, d = −.515 with higher ratings for video calls. Another one-way ANOVA revealed a significant effect of the meeting modality on factor F2 Co-Presence as well (F(2,156)=26.934,p<0.01,ηp2=0.257). Post-hoc tests confirmed a difference between VR and video calls with t (137.940) = −8.228, p < .01, d = −1.269, and between Gather Town and video calls with t (16.149) = −3.775, p < .01, d = −1.446 each with better ratings for video calls. We could not find an effect for F3 Privacy (F(2,156)=0.309,p=0.735,ηp2=0.004), indicating conclusive experiences on that factor independent of the meeting modality. These results demonstrate the advantages of video calls in contrast to the other platforms.

4.1.2 Lessons learned

Concerning quantitative data, we could observe that video meetings outperformed VR meetings in terms of the total meeting experience. Furthermore, for two of the three factors revealed with the exploratory factor analysis, i.e., F1 Involvement and F2 Co-presence, we also found that video conferencing was ranked higher than VR meetings. Users tended to feel more involved and have higher co-presence with their colleagues in video calls than in VR meetings. With the immersive nature of VR and its capability to substitute the user’s physical environment, it is surprising that the attendees had a stronger impression of others being present on video calls than in VR. To obtain a deeper understanding of the reasons for this assessment, we provide detailed insights on the experiences along the five themes that evolved from our qualitative analysis in the following subsections.

4.2 Spatial aspects

An essential difference between video calls and virtual 3D environments is the spatial component that allows movement in space and varying proximity to others. It influences the group dynamics and the audio considerations.

4.2.1 Movement

Before the study, P08 expected “more fun and more movement during the meetings” in VR. Indeed, the possibility of moving around in the VE led to more vivid and “a lot more dynamic” (P18) meetings compared to both video calls and physical meetings. Some attendees preferred that everyone be seated for the meeting to be more “orderly” (P28). Other participants appreciated “being able to move around and arrange in the room” (P12) and to “walk in front of the screen and use [their] body” (P18) during presentations. This was sometimes perceived as troublesome during presentations because avatars of other attendees were blocking the view onto the slides or the speaker: “All the time there was someone in my way” (P09). However, for presenting, the participants described advantages: “I also liked the more true-to-life presentation environment with the projector screen and with the ability to see the audience spread out in a semicircle instead of a gallery view like on Zoom” (P07). Many attendees highlighted the value of meaningful movements such as waving, gaze direction, and pointing as indicators, e.g., to initiate conversations or for turn-taking, making it “more personal” (P16). That “you could walk up to someone to talk to them” (P17) was perceived as more effortless and more natural (see Figure 8). Another use of the spatial component was to form queues, especially after the meeting when several attendees wanted to speak to the same person.

FIGURE 8
www.frontiersin.org

FIGURE 8. A situation in Engage before the meeting where the participants grouped (left side) and one person walked up to this group (right side, black shirt).

4.2.2 Proxemics

Similarly to movements, proxemics was used with purpose or meaning. “The physical arrangement also plays a role–who sits next to whom”, mattered for P12, or when everyone in a circle is supposed to say something in turn. At the same time, the spatial relations did not always translate well to the VE: “In physical meetings, it is nice that you actually sit next to others. In VR, it still feels a bit odd (am I too close? Too far away?)—it is also a bit more difficult to exactly place yourself the way you would like” (P12). Other attendees also reported that they “felt awkward” (P05) or “felt bothered because the others came too close” (P16).

4.2.3 Group dynamics

The added spatial component of VEs allowed to form and vary group constellations and sizes, which was a crucial benefit to a majority of the participants: “The VR environment makes it much easier to converse in small groups and easily switch between talking to different people”, explained P07. P06 pointed out that especially “before and after the meeting [interactions] were a lot easier and closer to in-person interactions than they would be in a Zoom meeting.” The participants described a large variety of interaction types, including 1-to-all as in presentations, 1-on-1, small groups, large groups, the whole group, and dynamic transitions between the constellations. Other than in Zoom, small groups could split up and follow up on meeting topics in parallel, which is “barely possible […] next to the official round” (P12) without blocking the meeting room for all others in a video call. Therefore, VR meetings were described as more dynamic, allowing more attendees to say something and contribute. On the other hand, Zoom meetings were characterized as “static” (P14).

VEs also allow the users to position themselves in a third dimension: elevation. This affords extended group arrangements and was primarily used for presentations and other 1-to-all communication. One participant experienced the group manager hovering mid-air above the rest of the team as a status imbalance: “It was beneficial that we could all see him. But it also gave the impression that he was ‘superior’ and talking down on us” (P18).

4.2.4 Spatial audio

The systems based on a VE are designed so that a conversation is only audible to users standing close by. The further away a listener is, the quieter the sound becomes. While this enabled several small groups to have private conversations simultaneously in the same room, it was perceived as cumbersome for the plenum when everybody should hear everybody else by default: “In case of a presentation or group discussion, sometimes you do not understand others and we used the ‘megaphone’ [to amplify voices in AltspaceVR] all the time” (P05). Furthermore, attendees reported that, also in VR, “the main questions you hear are: ‘am I muted?’ or ‘can you hear me?‘. The spatial audio is completely useless.” (P05). Manually equipping speakers with the ‘megaphone’ as an amplifier was referred to as “a burden” (P01), not only by the participants who facilitated the meeting but also by observers. The thresholds for the spatial audio were often experienced as inadequate resulting in unnatural behavior: “Especially because of the spatial audio, people gathered closely in clusters so they can hear each other better. In reality, it would not be that condensed, and also not so hard to hear each other” (P18). Instead, P05 suggested “to have regular audio [for the plenum] and for private conversations sound bubbles.”

4.2.5 Lessons learned

The VEs allowed the attendees to gather in a virtual workspace. Videoconference platforms, such as Zoom or Skype, recently introduced features to create a similar impression by visually cropping attendees and stitching them together in a shared room10. However, this lacks intradiegetic possibilities to utilize this added spatial component, for example, the eye gaze in turn-taking. When designing a VR meeting platform, we consider it essential to foster dynamic interpersonal interactions by supporting meaningful movements, such as approaching others, forming groups and queues, waving, pointing, or nodding. Non-verbal cues strongly impact social interactions in VR but have not yet tapped the rich potential familiar from offline behavior (Maloney et al., 2020; Tanenbaum et al., 2020). In line with related research, our participants reported spatial group arrangements and proxemics similar to in-person meetings. As Williamson et al. (2021) point out, elevation provides additional opportunities for an audience in VEs to arrange for better visibility. While this can be beneficial, we advise caution when activating flying features, as they might diminish professionalism or affect the perception of social status differences.

For flexible and realistic group dynamics, purposeful and configurable spatial audio with carefully chosen volume reduction parameters is required, as well as an option to hear everyone in a plenum. We observed similar behavior of our participants as in a study by Williamson et al. (2022), where users seemed to move close together to hear each other best despite loud background conversations. As our participants proposed, allowing users to create a private sound bubble with visible boundaries to shield their group might be helpful. Already in 1999, (Ståhl, 1999), reported in a similar case study that the uncertainty of being audible was one of two critical challenges for VR meetings. Even today, we still observe the need for more explicit indicators of one’s comprehensibility within the room.

4.3 Meeting atmosphere

We further observed that the spatial component strongly influenced the meeting atmosphere. In the questionnaire before the first meeting in VR, P18 shared that the “feeling of working in the same place is missing” during the long home office directive. The potential of immersively feeling “like being somewhere else” (P12) for the meetings promised relief to some participants.

4.3.1 Co-presence and interpersonal interactions

Numerous attendees expressed excitement about “being in a room together” (P06) and “chatting ‘face-to-face”’ (P12). For P18, in the VE, it felt more like “coming together or gathering with the whole team” in a shared virtual office space than in video calls. P23 and P09 were reminded of physical meetings, which included that “interactions felt more like a real meeting” (P09). The effect was perceived to be stronger when accessing the VE with an HMD: “I felt more immersed and part of the group” (P09). P25 appreciated how “one has a better awareness for the group, such as closeness or distance, attention or distraction, etc., which is missing on Zoom.” While in VR, participants appreciated seeing everyone around them at a glance, the videoconference tools were repeatedly criticized for not displaying all attendees in large meetings: “I frequently do not register that they are attending the meeting” (P12). This is especially problematic when presenting as it “gives a wrong impression about the number of participants” (P11).

P18 appreciated being “surrounded by my colleagues, but without faces, it was not so personal. Like a coat of secrecy around everyone.” With the cartoony avatars on AltspaceVR, to P23, it “felt like speaking with very intelligent and responsive game characters”. This dilemma was especially noticeable for guests. Several team members expressed the wish to see a guest’s face “to get an idea of how [they are] as a real person” (P28). On the other hand, the avatar representation allowed guests to “visit” the meeting as equal to the other members.

4.3.2 Professional or playful

The initial atmosphere of the meeting was established mainly by the visual appearance of the interface, the avatars, and the environment. The setting of AltspaceVR was perceived as “sometimes too cartoonish to be taken seriously […] more like in-game conversations” (P23). On the other hand, Engage was perceived as more formal and “much more orderly than in Altspace because we were seated” (P28). Both the professional and the playful styles have received positive and negative feedback from the participants. Some participants liked the professional atmosphere that Zoom or Engage created because it is similar to physical meetings and makes it easier to talk about serious topics. But it was also perceived as more boring and formal, and therefore some rejected the approach of Engage: “I really hated Engage. The uncanny avatars, technical difficulties, and overall ‘seriousness’ of the application did not work for me” (P09). Due to the less formal setting in AltspaceVR, “even the student was completely chilled–usually on Starleaf they’re more nervous and quiet” (P18). As Engage allows for adding elements to the scene, it was soon “filled with animals, objects, special effects, and sounds” (P18). This was described as “absurdly funny” (P18) and “goofy” (P02) but during the meeting also as “distracting” (P17), and “less effective” (P01). Although the fun, interesting, and creative atmosphere in AltspaceVR, Engage and Gather Town can be appealing and support hedonic qualities of the system, it could be distracting from the subject of the meeting: “fun to use but not very helpful for serious work” (P11).

4.3.3 Social norms

Social interactions in VR depend on shared norms in a group as in reality. However, VEs enable behavior that is impossible in real life, such as walking through people, due to different physical or perceptual restrictions. The attendees shared experiences of how some social conventions translate well to virtual encounters: “I instinctively wanted to hug their avatar. When approaching, 1 m away, I noticed that this feels wrong. In reality, we would never hug each other […], so I expressed my compassion verbally only” (P18). More often, participants reported novel social situations that might require new conventions. We identified two types of violating social norms: unintended impropriety and intended provocations.

Among the unintentional norm violations are situations in which, for example, attendees “felt bothered because the others came to close” (P16), or had unnatural postures due to the controllers sitting on the desk, causing awkwardness. One attendee described another uncomfortable situation due to technical challenges, in which he was concerned about having offended somebody with a joke as they did not react. It turned out that the other person was only staring at him quietly because they could not unmute to share the laughter. Other disruptions, however, were intended by users, such as an “eccentric avatar” (P18) that was seen as surprising at the professional occasion, somebody attending in a space suit, or users adding 3D models and effects to the scene. The most prominent example for this was “the monster which appeared in the middle of a student’s presentation” (P17). The whipping daemon shown in Figure 9 caught the users’ attention, distracted them from the presentation, and crashed the seriousness of the setting.

FIGURE 9
www.frontiersin.org

FIGURE 9. A user inserted a daemon to the scene whipping a colleague’s avatar in the middle of a student’s presentation.

4.3.4 Lessons learned

The users appreciated the impression that their colleagues were in one room with them, especially when using an HMD. Barreda-Ángeles and Hartmann. (2022) found social presence to be a good predictor of relatedness and enjoyment, which is in line with our participants’ reports. The high loading values of the overall meeting experience for the factors F1 Involvement and F2 Co-Presence indicate a strong link between the predictors. Analysis of qualitative data corroborates this as especially those attendees, who appreciated the high presence of others and feeling noticed by others in VR, were most fond of those meetings. Still, attendees rated their Zoom experiences regarding F2 Co-Presence higher than in VR. This finding is surprising considering the high immersion of VR and that many participants described the impression of being together in one room. However, the attendees of the investigated meetings are not strangers but colleagues, often for many years. Nevertheless, the avatars surrounding the user in VR seem unfamiliar and need a name tag to identify who it embodies. Without it, they could have been mistaken for intelligent game characters. In contrast, the identity of the conversation partners was unmistakable on video calls because of the visibility of familiar faces, which we discuss in the following subsection. Further, the ratings of the F1 Involvement factor were higher for video calls. From an autoethnographic perspective, we suspect that the meeting format has been adapted too tentatively to the medium. To exploit the advantages of interpersonal interactions in VR, we emphasize that the meeting purpose must benefit from high social presence, e.g., with team building or socializing goals. Not every format should be directly replicated virtually, as previous research supports (McVeigh-Schultz and Isbister, 2021b).

The discomfort of the attendees with the uncanny degree of avatar realism appeared to be more pronounced than the increase in social presence found by Zibrek and McDonnell. (2019) as a consequence of photorealism in social VR. In line with related work, the effects of visual fidelity of our test platforms’ avatars seem more complex and depend on other factors, such as coherence, setting, and avatar behavior. We suggest focusing more on expressive and versatile avatar designs than sheer photorealism.

We can assume that some of the observed social inadequacies originated from technical overload or accidents. For example, the sense of personal space is not transferred directly from in-person interactions to VEs and depends on various factors, such as using an HMD or a desktop PC (Williamson et al., 2022). A prior training and familiarization phase with the system McVeigh-Schultz et al. (2018) is advisable. This may also reduce disruptions from users exploring features that the platform affords. We urge interaction designers to create systems that prevent users from awkward appearance or behavior by accident, e.g. while taking off equipment or sitting down and standing up. Further, until a common “VRtiquette” (Lahlou et al., 2021) is established over the years, the attendees should agree on shared rules and conventions that might be supported by regulations from the organizations–in the same way as video conferencing and hybrid meetings benefit from this (Saatçi et al., 2020).

4.4 Expression of emotions

The participants commented particularly much on the emotional aspects of interpersonal interactions and how replacing a video with an avatar changed their meeting experience. We must note that in the case of the team meetings investigated, most attendees typically turn on their cameras for the video call.

4.4.1 Facial Expressions and body Language

Seeing no facial expressions from the other attendees is one of the issues of VR meetings raised the most by our participants: “It is so odd to look at people and not see any reactions in their faces” (P12). Especially users who accessed the VE without an HMD and therefore lacked gesture and head tracking “looked a bit odd, less lively” to P12. The lack of faces was perceived as “disconcerting” (P07), “awkward” (P06), and “less direct, obtuse” (P10) compared to video calls. Therefore, interactions on video calls were described as “more expressive” (P14)—to “see people’s faces and emotions” (P30) was important to connect for many. Several attendees have emphasized this concern throughout the experiment: “This is essential!” (P12). Above all, P02 missed the smiles and cats usually visible in the team’s Zoom meetings. On the contrary, some attendees did not “need to see people’s awkward or negative faces” (P13) and were relieved to be able to hide behind their avatars. They highlighted the advantages of missing cameras: “It looked more like a physical meeting because I can see everyone at once and not just the face. I could see people moving their heads and hands in VR” (P13). Several users addressed the expressiveness through avatar style and gestures: “I like the body language and the way people dress” (P08). One participant reflected on the impact that the different forms of encountering guests or strangers might have: “People present themselves differently. I even think you might get to know them differently in VR” (P12).

4.4.2 Emojis

To compensate for the lack of facial expressions, AltspaceVR allows spawning emojis above the user’s avatar to react visually (see Figure 10). One attendee “expressed frustration with the –.– emoticons over her head” (P18) while having audio issues and no other means to communicate. A similar feature is also available on Zoom, despite other, more immediate ways to react: “When [the manager] asked the group to give quick reactions, most of the participants did not speak or gesture as a response but used the UI feature showing a thumbs-up or smiley” (P18). The participants explained that they missed this feature on Engage and StarLeaf. On Engage, they found the meetings “less ‘affective’ as I cannot show any emotion with the icons” (P13) as on AltspaceVR.

FIGURE 10
www.frontiersin.org

FIGURE 10. This figure shows the use of emojis to express emotions in the AltspaceVR platform.

4.4.3 Lessons learned

Regardless of the primary purpose of meetings being of productive or social nature, the affective states of the attendees naturally have a critical impact on the collaboration. Whether for working or socializing with colleagues, it is essential to enable users to express their emotions and recognize those of others. This is especially pronounced with previously unknown people (Kirchner and Nordin Forsberg, 2021). Moustafa and Steed. (2018) found similar emotional states of users in social VR as in face-to-face settings that need to be considered. Still, our participants complained about the lack of facial expressions and other non-verbal communication, in line with the analysis by Tanenbaum et al. (2020). We conjecture that the missing faces are a significant reason for the lower co-presence ratings of VR compared to video calls in our exploratory factor analysis. Considering the study by Abdullah et al. (2021), this could also explain the participants’ decreased activity to maintain social connectedness and less person-directed gaze in VR conversations compared to videoconferencing. With rapid progress in research on the recognition of facial expressions (Hamedi et al., 2018; Lou et al., 2020; Cha and Im, 2022) and eye movement (Schwartz et al., 2020), as well as their mapping to the avatar, more expressive face and gaze representation will be available in the near future. We expect this to be a crucial facilitator for authentic social exchange in VR. Until then, a rudimentary way of sharing one’s emotions or reactions is needed for successful interaction in VR, such as emojis.

4.5 Meeting productivity

As information exchange was a major aim of the meetings, productivity was a central theme in the data. When participants compared the VR meetings with in-person meetings or video calls, they often referred to them as “less productive” (P05) and “inefficient, cumbersome” (P10). The reasons range from the inability to take notes and technical issues to the restrictions in seeing everyone simultaneously. Participants concluded that “the static meeting format” (P18) of “regular reports and presentations do not benefit from VR” (P05). However, there were also some positive comments on the productivity of meetings in VR, as people listened “with less distractions and multitasking” (P17) after a few AltspaceVR meetings.

4.5.1 Attention and distraction

Being in VR reduced distractions from other applications like mail, web browsers, or instant messaging for our participants. At the same time, it introduced new distractions to the meeting as people explored the features: “it can be distracting with all the non-meeting stuff you can do in VR” (P14). Overall there is no consensus on which medium is more beneficial regarding (in)attention, as some people expressed the need for a little diversion to focus on the meeting. Another aspect is the noticeable (in)attention of the meeting participants. Especially for the speaker, it is valuable feedback whether the audience actively listens. Some participants found that “in VR, one has a better awareness of the group, such as the closeness or distance, attention or distraction, etc., which is missing in Zoom.” (P25). In contrast, others thought “it was hard to tell [in VR] how distracted people were during the presentations […] In the physical meetings, it is more obvious who’s listening and who’s busy with something else” (P18). P11 had the impression that on Zoom “the participants were more focused and could follow the conversation better.”

4.5.2 Presentations

Giving and listening to presentations were reoccurring aspects. It felt for some participants more like an in-person setting. For the presenters, it was a more immersive and realistic experience than in video-based presentations: “I could walk in front of the screen and use my body. Felt like really presenting!” (P18). Similarly, P07 appreciated the “more true-to-life presentation environment with the projector screen and with the ability to see the audience spread out in a semicircle”. Nevertheless, for the audience, problems known from in-person presentations, such as view-blocking or display resolution, reduced satisfaction. Attendees reported a better view of the slides in videoconferences.

4.5.3 Secondary tasks and tools

The access to notes had a substantial impact on the perceived productivity of the meeting, and comments like “I missed note taking” (P28) for VR were very common. Similarly, access to tools like the calendar, search engines, and instant messengers for discreet parallel exchange within the team was missed. The lack of simultaneous messaging was mentioned frequently as it led to “less interactive” (P17) meetings. In videoconferences, “most interactions occurred ‘outside the meeting’” (P28) simultaneously via instant messaging. Several attendees mentioned that they could not do other productive tasks while being “trapped in VR” (P28), such as answering emails during personally irrelevant parts of the meeting. Also, the inability to access physical objects as beverages during VR sessions was criticized: “I miss my tea” (P13) and “my coffee got cold” (P18).

4.5.4 Lessons learned

For our work-focused meetings, the participants greatly appreciated the possibility of taking notes and accessing additional information. These aspects are not yet adequately provided by the VR tools tested in the scope of this study or require an unreasonable amount of individual effort. The accustomed workflow of a user should be supported in virtual meeting environments without allowing too many distracting elements into the VE. Recent developments promise a smoother integration of physical keyboards and applications from the desktop environment (Otte et al., 2019; Oculus, 2021), which could help to bridge VEs and the usual workspace of a user.

Although Sarkar et al. (2021) found parallel chatting in video calls distracting and overwhelming for some users, they also identified valuable benefits from it that resonate with our participants’ responses. More research needs to be done on integrating discreet side communication in VR. On the other hand, compared to video conferencing tools, participants valued the high-fidelity and engaging way of presenting in VR. This close resemblance matches the findings by Ginkel et al. (2019) who measured similar skill improvement in an oral presentation training in VR as in a physical environment. But as similarly shown by Yoshimura and Borst. (2021), the satisfaction with VR presentations also depended for our participants on the (dis)comfort with the equipment. Short and interactive talks can benefit more from VR than long or many presentations.

4.6 User needs

The primary objective of the attendees in our meetings was to get together once a week and exchange information. Their individual needs for functionality, comfort, and appropriate self-representation must be met.

4.6.1 Technical Literacy of users

The questionnaire answers revealed some aspects rooted in the group’s diversity. Although most of the participants had considerable knowledge of technology in general, some had serious technical challenges that sometimes even prevented them from participating at all, “because I was not able to set up the app” (P19), or required considerable effort: “I needed nearly an hour to join the meeting (and then just via [desktop PC]). And on top of that, I later needed help to get the VR setup running.” (P25). Even for experienced users, it took time to get familiar with the system as they encountered initial issues with the setup, updates, or controls. This clearly shows that even when using standard devices and commercially available applications, VR meetings still pose challenges to people who do not use the technology regularly. P09 was concerned about guests because “it is more of a hassle as they have to get accustomed to the software as well.” In particular, P23 worried about students presenting their thesis progress as it “could add to the pressure that they are already under and make them even more uncomfortable and stressed”.

4.6.2 HMD discomfort

One frequently mentioned theme was the discomfort related to the VR headset. The participants reported motion sickness, headaches, that the “headset felt heavy and uncomfortable after some time in the meeting” (P15), and that it was “a bit stressful for my eyes” (P12). Participants used different HMD models, and some were unsatisfied with the visual quality. For example, P12 said about AltspaceVR: “the slides were difficult to read, the letters flickering; this was uncomfortable”. Furthermore, P17 mentioned that the Oculus Quest “got quite laggy with so many people. I had to reset the graphic settings. It gives me a headache and I’m more exhausted afterward.” But using the desktop client was also not a good alternative for some participants as they reported that “using 2D VR is a bit inconvenient for the control. It is somehow like [a first-person shooter], but I am not very used to that.” (P13)

4.6.3 Privacy

Some people described that they “felt observed” (P12) on video calls because everybody could “see me and my living room” (P18). Regarding personal privacy and discretion, the VR sessions received positive feedback, as they allowed users to be invisible behind the avatar. Therefore, they did not need to worry about a presentable appearance or “need to show [their] facial expressions” (P13). Some participants also felt more confident as it “kinda helps with anxiety” (P14). However, users also reported that the spatial audio settings in AltspaceVR were insufficient for private conversations. P09 felt like “intruding” in private conversations: “After the meeting, it was weird as people talked in smaller groups, but one could hear everything”.

4.6.4 Lessons learned

While some appreciated the concealment of personal appearance, emotional reactions, and the home office environment, private conversations could be heard from far away. After the effects of Zoom fatigue (Shockley et al., 2021) during the pandemic, our participants felt relieved from the stress of being observed. Although the quantitative results did not show any differences between the platforms in terms of factor F3 Privacy, privacy was of concern in the qualitative data. The reason for this discrepancy is the different interpretations of the term. While standardized questionnaires and legal frameworks often consider privacy as the protection of information from the service provider, our participants were more concerned about the other attendees invading their privacy or being visually exposed to others. Quantitative methods should consider aspects of the personal privacy of meeting attendees beyond data protection regulations.

The fact that attendees from an HCI research group struggled with technical barriers clearly shows that meeting platforms must cater to users’ diverse needs and abilities. Usability issues in many established social VR platforms, including AltspaceVR, were also identified by Liu and Steed. (2021). Using current off-the-shelf VR hardware and software resulted in a stressful experience with technical issues and HMD discomfort for some participants. Until the onboarding experience is smooth and quick for everyone, organizers must assume responsibility for providing support before the first meeting, as recommended similarly by Lahlou et al. (2021). It seems that VR meetings are currently not as inclusive and accessible as video-based platforms due to a higher technical entry barrier, even if expensive equipment is provided.

4.7 Platform choice for future meetings

In the final survey after the last meeting, we asked participants to draw a conclusion comparing the different platforms and to choose a long-term preference. Many participants enjoyed exploring the possibility of meeting the team in VR: “it sure was fun and showed the potential for the future” (P09). The users appreciated the possibilities of “free spatial arrangement” (P12) with benefits especially for “engaging conversations between multiple actors” (P05) as it allows “spontaneously splitting into groups and chatting in clusters, without losing the overview of who else is present in the global room” (P21). Nevertheless, only two survey participants selected a VR platform as their preferred meeting medium. P14 appreciated that it “is more fun and kinda helps with anxiety”. P18 supported using VR for future meetings because “on Zoom, it always feels like people cannot wait to get out again. Most participants try to be as quiet as possible not to delay the meeting unnecessarily. In VR, there was much more socializing, arriving early and staying longer, personal and informal exchange, having fun, or enjoying meeting with others.”

However, the vast majority argued for returning to desktop-based video conferencing tools. Of the 15 participants who completed the survey after the last session, 13 preferred to go back to Zoom or StarLeaf. For most attendees, the “regular reports and presentations do not benefit from VR” (P05) nor from the spatial component of Gather Town. Although fun, VEs were perceived as “not very helpful for serious work” (P11). P05 criticized the attempt “to replicate the reality without adding specific value that comes from the medium.” On the contrary, for VR, the preparations and technical issues were disproportionately time-consuming, caused motion sickness for some attendees, secondary tasks were unavailable, and it was repeatedly described as inconvenient. For P19, the experience “was like speaking on an old phone with multiple participants at the same time, wearing a rock tied to my head, and watching some meaningless cartoon simultaneously”. On the other hand, in video calls, it is possible to see the faces of “people, and it works best. No preparations. No nausea or headache” (P02). It was described as “most convenient” (P09) and “more productive” (P13) with easy “access for external visitors” (P11). As many others, P12 “learned that seeing the faces of others is so important.”

5 Limitations and future work

We conducted this study in the wild during the regular team meetings of the research lab. Therefore, it has some limitations. Following an autoethnographic approach, the authors are part of the team and participated in the study. However, we ensured that the anonymization remained intact during coding and analysis. The study sample represents a very heterogeneous group as some team members have high technical literacy, and administrative staff was involved. This is most likely not representative of the general population, but for the domains that will be early adopters, this should be close to a representative sample. We also point out that most meeting attendees had no previous experience holding meetings in VR. Hence, the findings apply to teams of novice users and should only be extended to experts with caution. In addition, there was a trade-off between the length of the questionnaire and the aspects to cover, which influenced the direction of the results.

To keep the atmosphere and behavior of the participants as natural as possible, we did not record the sessions. Consequently, no systematic observations of the attendees’ behavior were possible, and we can only report on aspects described in the questionnaires. The selection, order, and frequency of the used platforms were team decisions independent of the study objectives, and therefore, the platforms were not explored evenly. Additionally, there might be possible biasing effects for the VR platform used second (Engage), as most users now had previous experiences with social VR. Furthermore, the content, length, and need for follow-up conversations varied every week due to circumstances, making it challenging to compare the meetings directly. During the meetings, various system configurations were used, which created different experiences for the individual users. We had a variety of HMDs (HTC Vive (Pro), Valve Index, and Oculus Quest), operating systems (Mac, Windows, Linux), and internet browsers (relevant for Gather Town). While this limits comparability between users, this case study was done in an authentic context with meetings that would have taken place in this manner regardless. All this ensures an authentic setting, leading to high external validity.

Based on the outcome of this study, further research is needed with a clear separation of researchers and meeting attendees. At the cost of the insights from an autoethnographic approach, the generalizability can be higher with controlled, balanced, and diverse samples across domains and prior experience. Moreover, as a result of the independence of the meeting participants from the researchers, no negative consequences of honest answers are to be expected, which allows to lift the anonymity towards the data analysts and thus extends methodological possibilities. With a team that accepts the given parameters of the experiment, such as the choice of platforms and hardware or video recording of the meetings, future work could rely on more systematic and controlled circumstances enabling behavioral and linguistic analyses with high comparability. Still, a careful balance between external provisions and internal authenticity must be achieved.

6 Conclusion

This in-the-wild case study on using social VR platforms and video conferencing software for weekly team meetings over the course of 4 months provides authentic insights into the experiences of 32 participants. Through questionnaires, we collected responses to open questions and Likert scale items after every meeting on various topics related to social interaction, meeting productivity, and individual experiences. In a thematic analysis of the qualitative data, we found five prominent themes that cover spatial aspects, meeting atmosphere, expression of emotions, meeting productivity, and user needs. The results show that regarding realistic group dynamics in gatherings before and after the official meeting, or for the impression of being together in the same place, VR meetings resemble in-person meetings more than videoconferences. Nevertheless, at the same time, videoconferences provided a closer-to-reality experience regarding seeing others’ real faces and emotions, discreet side communication, and support of secondary tasks than VR meetings. Furthermore, we performed a factor analysis on the quantitative data, revealing the advantages of videoconferences over VR platforms regarding ratings on involvement and co-presence. This finding is in line with the assessments in the final survey, in which participants were asked to draw a conclusion comparing the different platforms. Here, only two out of 15 survey participants selected a VR platform as their preferred meeting medium for weekly meetings in the future. Not being able to see the real faces of their colleagues was one of the most decisive factors for many participants. Therefore, at the end of the case study, the team decided to return to videoconferences for the long term. Some aspects found in this study have been discussed in related literature before. Still, this authentic in-the-wild setting of intrinsically motivated exploration of how suitable current commercial platforms are under natural working conditions enabled us to bring practical issues into context and highlight critical research gaps.

Our results indicate that the currently available off-the-shelf social VR platforms tested in this study do not yet sufficiently meet the needs and preferences of users to attend weekly team meetings like ours. Although they provide advantages in certain aspects, limitations such as time-consuming preparations, technical issues, HMD discomfort, motion sickness, and unavailable secondary tasks were among the main reasons for returning to video conferencing. Moreover, the findings indicate that the meeting format was not ideal for fully using the benefits of social VR. Above all, our results reveal how important it is for meeting attendees to see their colleagues’ facial expressions and emotions and the current limitations of VR platforms in this regard. We suspect this to be the strongest contributor to the higher co-presence ratings of video calls compared to VR that we unexpectedly measured.

Some of the problems we encountered on our journey might be resolved within this decade thanks to technical advancements. As outlined in the Lessons Learned subsections above, new technologies using electromyogram and neuromuscular signals, as well as eye gaze and facial reconstruction, will provide avatars with rich emotional expressivity. Also, body language will be more realistic with advanced sensors and inverse kinematics. Peripherals such as physical keyboards and other components or applications of existing workflows will be seamlessly integrated into the virtual meeting environment. This considerably improves text input capabilities and the support of secondary tasks. In the meantime, usability issues on the software side will be resolved, and the devices will become lighter and more convenient. Prospectively, if this case study had been conducted in a few years, the outcome concerning these challenges might be completely different.

However, other challenges we encountered in our study might not be possible to overcome with technical progress and require careful consideration when (1) preparing technical aspects of the VE, and (2) planning and moderating the meeting. Considering our sample, these insights apply particularly to users who are new to having meetings in VR. First, as part of the technical setup, an onboarding tutorial before the meeting must ensure that all attendants are familiar with the application and that their system is up-to-date. Users with previous experience should have had the opportunity to explore features in the VE before the meeting starts so it would not distract them. Further, the spatial audio should be configured for plenum situations in which speakers are audible by everyone anywhere but also cater for private (group) conversations with privacy-protecting sound bubbles and little distraction from distant background conversations. Acoustic parameters must always be transparent to users. The application’s overall visual style and professional appearance should match the seriousness of the occasion and the attendees’ preferences. We recommend disabling permissions to insert distracting objects or special effects to the scene when not required for the meeting goal.

Second, to adapt the meeting format to the platform, a code of conduct should be agreed on to ensure a comfortable environment for everyone. Among other conventions, it should address proxemics, muting, recognizability, and modifications of the shared scene. In line with related work, we found it important to designate a person responsible for technical support and facilitation, such as amplifying or muting attendees. Generally, the format and goals of a meeting should be adapted to the advantages of social encounters in VR. Ideally, the reason for gathering is interpersonal interaction in dynamic group constellations with rich opportunities for socialization and creative exchange. Our participants appeared to appreciate situations before and after the meetings that afford the CoFIRe steps proposed by Erickson et al. (2011): coalescence into small groups, focused interactions without distractions, and eventually remixing with others. We recommend making use of spatial advantages and meaningful movements, such as forming queues, circles, and groups, flying when appropriate, moderating and taking turns with gaze and gestures, as well as organizing meeting content systematically within the space as demonstrated by Luo et al. (2022) for augmented reality.

We learned most of these lessons only as the study progressed. Initially, videoconferences were a substitute for in-person meetings due to the COVID-19 pandemic. The VR platforms were experimental substitutes for the new status quo of video conferencing. The reason for using any of these systems, however, was not to replicate in-person meetings but to achieve the purpose of the meetings. How the group aimed to achieve these goals was organically shaped within the conditions of the respective platforms. Nevertheless, the meeting format was not sufficiently adapted to the benefits and restrictions of social VR platforms. Technology does not break habits. And they appear to have been strong from the influence of a traditional meeting concept manifested over many years. The adjustment on-the-fly was not sufficient. Instead, a more conscious and targeted adaptation would have been required. Similarly, McVeigh-Schultz and Isbister. (2021b) argue for using XR to deliberately deviate from direct replication of face-to-face meetings, such as with augmentation of social behavior (Roth et al., 2018; Roth et al., 2019) or “social superpowers” (McVeigh-Schultz and Isbister, 2021a).

A potential direction for the future could be a seamless integration of the different platform types to allow dynamic rearrangement according to the meeting situation, which often changes depending on the ongoing activities. This would allow using the most powerful medium for each case. For example, interactivity, small group interaction, poster sessions, or social events could take place in immersive XR environments, and detailed presentations or one-to-many announcements through video conferencing channels–with immediate transition in between. Similar to the hybrid concept of Gather Town, this proposal attempts to combine the best of two solutions, although not limited to one device or a 2D world.

Overall, our case study provides authentic insights into conducting team meetings on off-the-shelf virtual reality platforms that can inform the appropriate choices and configurations of the platform, adaptations of the meeting format, and future requirements of social VR platforms.

Data availability statement

The quantitative dataset presented in this article can be found at\url (https://doi.org/10.17605/OSF.IO/YG7HK). The qualitative dataset presented in this article is not publicly available because the participants in the case study can easily be identified by their affiliation. Therefore, we assure them of the complete confidentiality of the collected data. Only the authors, excluding the group manager, could access the data to protect participants’ privacy and encourage them to respond authentically without fear of repercussions.

Ethics statement

The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

All authors developed the study design. MB prepared the test environments together with AR and coordinated the project. MB, AR, SP, YL, DA, RM, and TD carried out the data acquisition after the meetings. MB, AR, SP, YL, and TD were mainly responsible for data analysis. MB, AR, SP, and TD were responsible for drafting the manuscript. MB, AR, SP, and TD revised and finalized the manuscript.

Funding

This research was partially funded by the Klaus Tschira Stiftung by financing the scholarship of MB. The open access publication fees were covered by Staats-und Universitätsbibliothek Bremen.

Acknowledgments

We want to thank all team members for their commitment, feedback, and patience in this study. Also, we want to thank the Engage team, who provided us with a free trial for larger groups to run our study. A preprint of this paper is available on arXiv.org (Bonfert et al., 2022).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frvir.2022.945791/full#supplementary-material

Footnotes

1https://www.virbela.com/customer-stories/exp-realty.

2https://news.microsoft.com/innovation-stories/microsoft-mesh.

3https://about.fb.com/news/2021/08/introducing-horizon-workrooms-remote-collaboration-reimagined.

4https://secondlife.com.

5https://altvr.com.

6https://engagevr.io.

7https://zoom.us.

8https://starleaf.com.

9https://gather.town.

10https://blog.zoom.us/introducing-zoom-immersive-viewhttps://www.skype.com/en/blogs/2021-05-together-mode.

References

Abdullah, A., Kolkmeier, J., Lo, V., and Neff, M. (2021). Videoconference and embodied VR: Communication patterns across task and medium. Proc. ACM Human-Computer Interact. 5, 129–29. doi:10.1145/3479597

CrossRef Full Text | Google Scholar

Ahn, S. J. G., Levy, L., Eden, A., Won, A. S., MacIntyre, B., and Johnsen, K. (2021). IEEEVR2020: Exploring the first steps toward standalone virtual conferences. Front. Virtual Real 2, 648575. doi:10.3389/frvir.2021.648575

CrossRef Full Text | Google Scholar

Bailenson, J. N., Beall, A. C., Loomis, J., Blascovich, J., and Turk, M. (2004). Transformed social interaction: Decoupling representation from behavior and form in collaborative virtual environments. Presence Teleoperators Virtual Environ. 13, 428–441. doi:10.1162/1054746041944803

CrossRef Full Text | Google Scholar

Bailenson, J. N., Blascovich, J., Beall, A. C., and Loomis, J. M. (2001). Equilibrium theory revisited: Mutual gaze and personal space in virtual environments. Presence Teleoperators Virtual Environ. 10, 583–598. doi:10.1162/105474601753272844

CrossRef Full Text | Google Scholar

Bailenson, J. N. (2021). Nonverbal overload: A theoretical argument for the causes of Zoom fatigue. Technol. Mind, Behav. 2. doi:10.1037/tmb0000030

CrossRef Full Text | Google Scholar

Bailenson, J. N., Swinth, K., Hoyt, C., Persky, S., Dimov, A., and Blascovich, J. (2005). The independent and interactive effects of embodied-agent appearance and behavior on self-report, cognitive, and behavioral markers of copresence in immersive virtual environments. Presence Teleoperators Virtual Environ. 14, 379–393. doi:10.1162/105474605774785235

CrossRef Full Text | Google Scholar

Barreda-Ángeles, M., and Hartmann, T. (2022). Psychological benefits of using social virtual reality platforms during the Covid-19 pandemic: The role of social and spatial presence. Comput. Hum. Behav. 127, 107047. doi:10.1016/j.chb.2021.107047

PubMed Abstract | CrossRef Full Text | Google Scholar

Bente, G., Rüggenberg, S., Krämer, N. C., and Eschenburg, F. (2008). Avatar-mediated networking: Increasing social presence and interpersonal trust in net-based collaborations. Hum. Commun. Res. 34, 287–318. doi:10.1111/j.1468-2958.2008.00322.x

CrossRef Full Text | Google Scholar

Blackwell, L., Ellison, N., Elliott-Deflo, N., and Schwartz, R. (2019b). Harassment in social virtual reality: Challenges for platform governance. Proc. ACM Human-Computer Interact. 3, 1125–25. doi:10.1145/3359202

CrossRef Full Text | Google Scholar

Blackwell, L., Ellison, N., Elliott-Deflo, N., and Schwartz, R. (2019a). “Harassment in social VR: Implications for design,” in 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR), Osaka, Japan, 23-27 March 2019 (IEEE), 854–855. doi:10.1109/VR.2019.8798165

CrossRef Full Text | Google Scholar

Bleakley, A., Rough, D., Edwards, J., Doyle, P., Dumbleton, O., Clark, L., et al. (2022). Bridging social distance during social distancing: Exploring social talk and remote collegiality in video conferencing. Human–Computer Interact. 37, 404–432. doi:10.1080/07370024.2021.1994859

CrossRef Full Text | Google Scholar

Bonfert, M., Reinschluessel, A. V., Putze, S., Lai, Y., Alexandrovsky, D., Malaka, R., et al. (2022). Seeing the faces is so important” – experiences from online team meetings on commercial virtual reality platforms. arXiv. doi:10.48550/arXiv.2210

CrossRef Full Text | Google Scholar

Brucks, M. S., and Levav, J. (2022). Virtual communication curbs creative idea generation. Nature 605, 108–112. doi:10.1038/s41586-022-04643-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Burgoon, J. K., Bonito, J. A., Bengtsson, B., Ramirez, A., Dunbar, N. E., and Miczo, N. (1999). Testing the interactivity model: Communication processes, partner assessments, and the quality of collaborative work. J. Manag. Inf. Syst. 16, 33–56. doi:10.1080/07421222.1999.11518255

CrossRef Full Text | Google Scholar

Byun, B., Awasthi, A., Chou, P. A., Kapoor, A., Lee, B., and Czerwinski, M. (2011). “Honest signals in video conferencing,” in 2011 IEEE International Conference on Multimedia and Expo, Barcelona, Spain, 11-15 July 2011 (IEEE), 1–6. doi:10.1109/ICME.2011.6011855

CrossRef Full Text | Google Scholar

Cha, H. S., and Im, C. H. (2022). Performance enhancement of facial electromyogram-based facial-expression recognition for social virtual reality applications using linear discriminant analysis adaptation. Virtual Real. 26, 385–398. doi:10.1007/s10055-021-00575-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Daft, R. L., and Lengel, R. H. (1986). Organizational information requirements, media richness and structural design. Manag. Sci. 32, 554–571. doi:10.1287/mnsc.32.5.554

CrossRef Full Text | Google Scholar

Dzardanova, E., Kasapakis, V., Gavalas, D., and Sylaiou, S. (2021). Virtual reality as a communication medium: A comparative study of forced compliance in virtual reality versus physical world. Virtual Real 26, 737–757. doi:10.1007/s10055-021-00564-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Erickson, T., Shami, N. S., Kellogg, W. A., and Levine, D. W. (2011). “Synchronous interaction among hundreds: An evaluation of a conference in an avatar-based virtual environment,” in Proceedings of the SIGCHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery), 503–512. doi:10.1145/1978942.1979013

CrossRef Full Text | Google Scholar

Fägersten, K. (2010). “Using discourse analysis to assess social co-presence in the video conference environment,” in Cases on online discussion and interaction: Experiences and outcomes (Hershey, PA: IGI Global), 19.

Google Scholar

Finn, K. E., Sellen, A. J., and Wilbur, S. (1997). Video-mediated communication. Hillsdale, NJ: Lawrence Erlbaum Associates.

Google Scholar

Freeman, G., Zamanifard, S., Maloney, D., and Acena, D. (2022). Disturbing the peace: Experiencing and mitigating emerging harassment in social virtual reality. Proc. ACM Hum. Comput. Interact. 6, 1–30. doi:10.1145/3512932

CrossRef Full Text | Google Scholar

Galegher, J., Kraut, R. E., and Egido, C. (2013). Intellectual teamwork: Social and technological foundations of cooperative work. New York: Psychology Press. doi:10.4324/9781315807645

CrossRef Full Text | Google Scholar

Ginkel, S. v., Gulikers, J., Biemans, H., Noroozi, O., Roozen, M., Bos, T., et al. (2019). Fostering oral presentation competence through a virtual reality-based task for delivering feedback. Comput. Educ. 134, 78–97. doi:10.1016/j.compedu.2019.02.006

CrossRef Full Text | Google Scholar

Gonzalez-Franco, M., Maselli, A., Florencio, D., Smolyanskiy, N., and Zhang, Z. (2017). Concurrent talking in immersive virtual reality: On the dominance of visual speech cues. Sci. Rep. 7, 3817. doi:10.1038/s41598-017-04201-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Hamedi, M., Salleh, S. H., Ting, C. M., Astaraki, M., and Noor, A. M. (2018). Robust facial expression recognition for MuCI: A comprehensive neuromuscular signal analysis. IEEE Trans. Affect. Comput. 9, 102–115. doi:10.1109/TAFFC.2016.2569098

CrossRef Full Text | Google Scholar

Hecht, H., Welsch, R., Viehoff, J., and Longo, M. R. (2019). The shape of personal space. Acta Psychol. 193, 113–122. doi:10.1016/j.actpsy.2018.12.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Hinds, P. J. (1999). The cognitive and interpersonal costs of video. Media Psychol. 1, 283–311. doi:10.1207/s1532785xmep0104_1

CrossRef Full Text | Google Scholar

Kang, S. H., and Watt, J. H. (2013). The impact of avatar realism and anonymity on effective communication via mobile devices. Comput. Hum. Behav. 29, 1169–1181. doi:10.1016/j.chb.2012.10.010

CrossRef Full Text | Google Scholar

Kern, A. C., and Ellermeier, W. (2020). Audio in VR: Effects of a soundscape and movement-triggered step sounds on presence. Front. Robotics AI 7, 20. doi:10.3389/frobt.2020.00020

PubMed Abstract | CrossRef Full Text | Google Scholar

Kirchner, K., and Nordin Forsberg, B. (2021). “A conference goes virtual: Lessons from creating a social event in the virtual reality,” in Innovations for community services. Editors U. R. Krieger, G. Eichler, C. Erfurth, and G. Fahrnberger (Cham: Springer International Publishing), 123–134. doi:10.1007/978-3-030-75004-6_9

CrossRef Full Text | Google Scholar

Koseki, N., Iijima, M., Yamamoto, Y., and Seiki, K. (2020). “Identifying communication problems in the use of video conferencing systems,” in 2020 9th International Congress on Advanced Applied Informatics (IIAI-AAI), Kitakyushu, Japan, 01-15 September 2020 (IEEE), 651–658. doi:10.1109/IIAI-AAI50415.2020.00133

CrossRef Full Text | Google Scholar

Kramer, A. D. I., Oh, L. M., and Fussell, S. R. (2006). “Using linguistic features to measure presence in computer-mediated communication,” in Proceedings of the SIGCHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery), 913–916. doi:10.1145/1124772.1124907

CrossRef Full Text | Google Scholar

Kyrlitsias, C., and Michael-Grigoriou, D. (2022). Social interaction with agents and avatars in immersive virtual environments. A Surv. 2, 786665. doi:10.3389/frvir.2021.786665

CrossRef Full Text | Google Scholar

Lahlou, S., Pea, R., Heitmayer, M., Russell, G. M., Schimmelpfennig, R., Yamin, P., et al. (2021). “Are we ’Beyond being there’ yet?: Towards better interweaving epistemic and social aspects of virtual reality conferencing,” in Extended abstracts of the 2021 CHI conference on human factors in computing systems (Yokohama Japan: ACM), 1–6. doi:10.1145/3411763.3451579.00000

CrossRef Full Text | Google Scholar

Liu, Q., and Steed, A. (2021). Social virtual reality platform comparison and evaluation using a guided group walkthrough method. Front. Virtual Real. 2, 668181. doi:10.3389/frvir.2021.668181

CrossRef Full Text | Google Scholar

Lou, J., Wang, Y., Nduka, C., Hamedi, M., Mavridou, I., Wang, F. Y., et al. (2020). Realistic facial expression reconstruction for VR HMD users. IEEE Trans. Multimedia 22, 730–743. doi:10.1109/TMM.2019.2933338

CrossRef Full Text | Google Scholar

Luo, W., Lehmann, A., Widengren, H., and Dachselt, R. (2022). “Where should we put it? Layout and placement strategies of documents in augmented reality for collaborative sensemaking,” in CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery), 1–16. doi:10.1145/3491102.3501946

CrossRef Full Text | Google Scholar

Mack, K., Das, M., Jain, D., Bragg, D., Tang, J., Begel, A., et al. (2021). “Mixed abilities and varied experiences: A group autoethnography of a virtual summer internship,” in The 23rd international ACM SIGACCESS conference on computers and accessibility (New York, NY, USA: Association for Computing Machinery), 1–13. doi:10.1145/3441852.3471199

CrossRef Full Text | Google Scholar

Maloney, D., Freeman, G., and Wohn, D. Y. (2020). Talking without a voice. Talk. without a voice” Underst. non-verbal Commun. Soc. virtual Real. 4, 1125–25. doi:10.1145/3415246

CrossRef Full Text | Google Scholar

McVeigh-Schultz, J., and Isbister, K. (2021b). A “beyond being there” for VR meetings: Envisioning the future of remote work. Human–Computer Interact. 37, 433–453. doi:10.1080/07370024.2021.1994860

CrossRef Full Text | Google Scholar

McVeigh-Schultz, J., and Isbister, K. (2021a). “The case for “weird social” in VR/XR: A vision of social superpowers beyond meatspace,” in Extended abstracts of the 2021 CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery), 1–10.

Google Scholar

McVeigh-Schultz, J., Kolesnichenko, A., and Isbister, K. (2019). “Shaping pro-social interaction in VR: An emerging design framework,” in Proceedings of the 2019 CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery). doi:10.1145/3290605.3300794

CrossRef Full Text | Google Scholar

McVeigh-Schultz, J., Márquez Segura, E., Merrill, N., and Isbister, K. (2018). “What’s it mean to ”Be social” in VR? Mapping the social VR design ecology,” in Proceedings of the 2018 ACM conference companion publication on designing interactive systems (New York, NY, USA: Association for Computing Machinery), DIS ’18 Companion), 289–294. doi:10.1145/3197391.3205451

CrossRef Full Text | Google Scholar

Microsoft (2022). Work trend index 2022. great expectations: Making hybrid work work. Annual Report.

Google Scholar

Milgram, S., and Gudehus, C. (1978). Obedience to authority. Harper & Row.

Google Scholar

Moustafa, F., and Steed, A. (2018). “A longitudinal study of small group interaction in social virtual reality,” in Proceedings of the 24th ACM symposium on virtual reality software and technology (New York, NY, USA: Association for Computing Machinery), 1–10. doi:10.1145/3281505.3281527

CrossRef Full Text | Google Scholar

Nadler, R. (2020). Understanding “Zoom fatigue”: Theorizing spatial dynamics as third skins in computer-mediated communication. Comput. Compos. 58, 102613. doi:10.1016/j.compcom.2020.102613

CrossRef Full Text | Google Scholar

Nilles, J. (1975). Telecommunications and organizational decentralization. IEEE Trans. Commun. 23, 1142–1147. doi:10.1109/TCOM.1975.1092687

CrossRef Full Text | Google Scholar

Nind, M., Coverdale, A., and Meckin, R. (2021). Changing social research practices in the context of covid-19: Rapid evidence review. Working Paper, NCRM. doi:10.5258/NCRM/NCRM.00004458

CrossRef Full Text | Google Scholar

Nowak, K. (2001). “Defining and differentiating copresence, social presence and presence as transportation,” in 4 th annual international workshop (Philadelphia: Lawrence Erlbaum Associates).

Google Scholar

Nowak, K. L., and Biocca, F. (2003). The effect of the agency and anthropomorphism on users’ sense of telepresence, copresence, and social presence in virtual environments. Presence Teleoperators Virtual Environ. 12, 481–494. doi:10.1162/105474603322761289

CrossRef Full Text | Google Scholar

Ochsman, R. B., and Chapanis, A. (1974). The effects of 10 communication modes on the behavior of teams during Co-operative problem-solving. Int. J. Man. Mach. Stud. 6, 579–619. doi:10.1016/S0020-7373(74)80019-2

CrossRef Full Text | Google Scholar

Oculus (2021). Introducing ’Horizon Workrooms’: Remote collaboration reimagined. Menlo Park, CA: Meta Platforms Inc.

Google Scholar

OECD (2021). “Teleworking in the COVID-19 pandemic: Trends and prospects,” in Tech. rep. (Paris, France: Organisation for Economic Co-operation and Development).

Google Scholar

Oh, C. S., Bailenson, J. N., and Welch, G. F. (2018). A systematic review of social presence: Definition, antecedents, and implications. Front. Robotics AI 5, 114. doi:10.3389/frobt.2018.00114

PubMed Abstract | CrossRef Full Text | Google Scholar

Olson, J. S., Olson, G. M., and Meader, D. (1997). “Face-to-face group work compared to remote group work with and without video,” in Video-mediated communication (Mahwah, NJ, US: Lawrence Erlbaum Associates Publishers).

Google Scholar

Otte, A., Menzner, T., Gesslein, T., Gagel, P., Schneider, D., and Grubert, J. (2019). “Towards utilizing touch-sensitive physical keyboards for text entry in virtual reality,” in 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR), Osaka, Japan, 23-27 March 2019 (IEEE), 1729–1732. doi:10.1109/VR.2019.8797740

CrossRef Full Text | Google Scholar

Poeschl, S., Wall, K., and Doering, N. (2013). “Integration of spatial sound in immersive virtual environments an experimental study on effects of spatial sound on presence,” in 2013 IEEE Virtual Reality (VR), Lake Buena Vista, FL, USA, 18-20 March 2013 (IEEE). doi:10.1109/VR.2013.6549396

CrossRef Full Text | Google Scholar

Raghuram, S., Hill, N. S., Gibbs, J. L., and Maruping, L. M. (2019). Virtual work: Bridging research clusters. Acad. Manag. Ann. 13, 308–341. doi:10.5465/annals.2017.0020

CrossRef Full Text | Google Scholar

Rogers, B., Masoodian, M., and Apperley, M. (2018). “A virtual cocktail party: Supporting informal social interactions in a virtual conference,” in Proceedings of the 2018 international conference on advanced visual interfaces (New York, NY, USA: Association for Computing Machinery), 1–3. doi:10.1145/3206505.3206569

CrossRef Full Text | Google Scholar

Roth, D., Bente, G., Kullmann, P., Mal, D., Purps, C. F., Vogeley, K., et al. (2019). “Technologies for social augmentations in user-embodied virtual reality,” in 25th ACM symposium on virtual reality software and technology (New York, NY, USA: Association for Computing Machinery), 1–12. doi:10.1145/3359996.3364269

CrossRef Full Text | Google Scholar

Roth, D., Klelnbeck, C., Feigl, T., Mutschler, C., and Latoschik, M. E. (2018). “Beyond replication: Augmenting social behaviors in multi-user virtual realities,” in 2018 IEEE Conference on Virtual Reality and 3D User Interfaces (VR), Tuebingen/Reutlingen, Germany, 18-22 March 2018 (IEEE), 215–222. doi:10.1109/VR.2018.8447550

CrossRef Full Text | Google Scholar

Rzeszewski, M., and Evans, L. (2020). Virtual place during quarantine – A curious case of VRChat. Rozw. Reg. i Polityka Reg. 57, 57–75. doi:10.14746/rrpr.2020.51.06

CrossRef Full Text | Google Scholar

Saatçi, B., Akyüz, K., Rintel, S., and Klokmose, C. N. (2020). Re)Configuring hybrid meetings: Moving from user-centered design to meeting-centered design. Comput. Support. Coop. Work (CSCW) 29, 769–794. doi:10.1007/s10606-020-09385-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Samrose, S., McDuff, D., Sim, R., Suh, J., Rowan, K., Hernandez, J., et al. (2021). “Meetingcoach: An intelligent dashboard for supporting effective & inclusive meetings,” in Proceedings of the 2021 CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery). doi:10.1145/3411764.3445615

CrossRef Full Text | Google Scholar

Samrose, S., Zhao, R., White, J., Li, V., Nova, L., Lu, Y., et al. (2018). Coco: Collaboration coach for understanding team dynamics during video conferencing. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, 1–24. doi:10.1145/3161186

CrossRef Full Text | Google Scholar

Sarkar, A., Rintel, S., Borowiec, D., Bergmann, R., Gillett, S., Bragg, D., et al. (2021). “The promise and peril of parallel chat in video meetings for work,” in Extended abstracts of the 2021 CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery). doi:10.1145/3411763.3451793.00003

CrossRef Full Text | Google Scholar

Scavarelli, A., Arya, A., and Teather, R. J. (2021). Virtual reality and augmented reality in social learning spaces: A literature review. Virtual Real. 25, 257–277. doi:10.1007/s10055-020-00444-8

CrossRef Full Text | Google Scholar

Schwartz, G., Wei, S. E., Wang, T. L., Lombardi, S., Simon, T., Saragih, J., et al. (2020). The eyes have it: An integrated eye and face model for photorealistic facial animation. ACM Trans. Graph. 39, 91. doi:10.1145/3386569.3392493

CrossRef Full Text | Google Scholar

Shockley, K. M., Gabriel, A. S., Robertson, D., Rosen, C. C., Chawla, N., Ganster, M. L., et al. (2021). The fatiguing effects of camera use in virtual meetings: A within-person field experiment. J. Appl. Psychol. 106, 1137–1155. doi:10.1037/apl0000948

PubMed Abstract | CrossRef Full Text | Google Scholar

Short, J., Williams, E., and Christie, B. (1976). The social psychology of telecommunications. Toronto; London; New York: Wiley.

Google Scholar

Slater, M., and Sanchez-Vives, M. V. (2016). Enhancing our lives with immersive virtual reality. Front. Robot. 3, 74. doi:10.3389/frobt.2016.00074

CrossRef Full Text | Google Scholar

Ståhl, O. (1999). “Meetings for real–experiences from a series of vr-based project meetings,” in Proceedings of the ACM symposium on virtual reality software and technology (New York, NY, USA: Association for Computing Machinery), 164–165. doi:10.1145/323663.323691

CrossRef Full Text | Google Scholar

Suh, H., Shahriaree, N., Hekler, E. B., and Kientz, J. A. (2016). “Developing and validating the user burden scale: A tool for assessing user burden in computing systems,” in Proceedings of the 2016 CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery), 3988–3999.

Google Scholar

Sykownik, P., Graf, L., Zils, C., and Masuch, M. (2021). “The most social platform ever? A survey about activities amp; motives of social VR users,” in 2021 IEEE Virtual Reality and 3D User Interfaces (VR), Lisboa, Portugal, 27 March 2021 - 01 April 2021 (IEEE). doi:10.1109/VR50410.2021

CrossRef Full Text | Google Scholar

Tanenbaum, T. J., Hartoonian, N., and Bryan, J. (2020). “How do I make this thing smile?”: An inventory of expressive nonverbal communication in commercial social virtual reality platforms,” in Proceedings of the 2020 CHI conference on human factors in computing systems (Honolulu HI USA: ACM), 1–13. doi:10.1145/3313831.3376606

CrossRef Full Text | Google Scholar

Torro, O., Jalo, H., and Pirkkalainen, H. (2021). Six reasons why virtual reality is a game-changing computing and communication platform for organizations. Commun. ACM 64, 48–55. doi:10.1145/3440868

CrossRef Full Text | Google Scholar

Wigham, C. R., and Chanier, T. (2013). A study of verbal and nonverbal communication in second life – The ARCHI21 experience. ReCALL 25, 63–84. doi:10.1017/S0958344012000250

CrossRef Full Text | Google Scholar

Wilcox, L. M., Allison, R. S., Elfassy, S., and Grelik, C. (2006). Personal space in virtual reality. ACM Trans. Appl. Percept. 3, 412–428. doi:10.1145/1190036.1190041

CrossRef Full Text | Google Scholar

Williamson, J., Li, J., Vinayagamoorthy, V., Shamma, D. A., and Cesar, P. (2021). “Proxemics and social interactions in an instrumented virtual reality workshop,” in Proceedings of the 2021 CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery). doi:10.1145/3411764.3445729

CrossRef Full Text | Google Scholar

Williamson, J. R., O’Hagan, J., Guerra-Gomez, J. A., Williamson, J. H., Cesar, P., and Shamma, D. A. (2022). “Digital proxemics: Designing social and collaborative interaction in virtual environments,” in CHI conference on human factors in computing systems (New York, NY, USA: Association for Computing Machinery), 1–12. doi:10.1145/3491102.3517594

CrossRef Full Text | Google Scholar

Yee, N., Bailenson, J. N., Urbanek, M., Chang, F., and Merget, D. (2007). The unbearable likeness of being digital: The persistence of nonverbal social norms in online virtual environments. CyberPsychology Behav. 10, 115–121. doi:10.1089/cpb.2006.9984

PubMed Abstract | CrossRef Full Text | Google Scholar

Yoshimura, A., and Borst, C. W. (2021). A study of class meetings in VR: Student experiences of attending lectures and of giving a project presentation. Front. Virtual Real. 2, 648619. doi:10.3389/frvir.2021.648619

CrossRef Full Text | Google Scholar

Zibrek, K., and McDonnell, R. (2019). “Social presence and place illusion are affected by photorealism in embodied VR,” in Motion, interaction and games (New York, NY, USA: Association for Computing Machinery). doi:10.1145/3359566.3360064

CrossRef Full Text | Google Scholar

Keywords: CSCW, virtual reality, social VR, remote collaboration, virtual meetings, video conferencing, autoethnography, case study

Citation: Bonfert M, Reinschluessel AV, Putze S, Lai  Y, Alexandrovsky D, Malaka  R and Döring T (2023) Seeing the faces is so important—Experiences from online team meetings on commercial virtual reality platforms. Front. Virtual Real. 3:945791. doi: 10.3389/frvir.2022.945791

Received: 16 May 2022; Accepted: 23 November 2022;
Published: 11 January 2023.

Edited by:

Daniel Zielasko, University of Trier, Germany

Reviewed by:

Mark Billinghurst, University of South Australia, Australia
Myungho Lee, Pusan National University, South Korea

Copyright © 2023 Bonfert, Reinschluessel, Putze, Lai , Alexandrovsky, Malaka  and Döring. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Michael Bonfert , bonfert@uni-bremen.de

Download