A Comparison of Conventional and Technology-Mediated Selection Interviews With Regard to Interviewees’ Performance, Perceptions, Strain, and Anxiety

Melchers, Klaus G.; Petrig, Amadeus; Basch, Johannes M.; Sauer, Juergen

doi:10.3389/fpsyg.2020.603632

ORIGINAL RESEARCH article

Front. Psychol., 12 January 2021

Sec. Organizational Psychology

Volume 11 - 2020 | https://doi.org/10.3389/fpsyg.2020.603632

A Comparison of Conventional and Technology-Mediated Selection Interviews With Regard to Interviewees’ Performance, Perceptions, Strain, and Anxiety

Klaus G. Melchers^1*

Amadeus Petrig²

Johannes M. Basch¹

Juergen Sauer³

¹Institut für Psychologie und Pädagogik, Universität Ulm, Ulm, Germany
²Migros-Genossenschafts-Bund, Zurich, Switzerland
³Department of Psychology, University of Fribourg, Fribourg, Switzerland

Organizations increasingly use technology-mediated interviews. However, only limited research is available concerning the comparability of different interview media and most of the available studies stem from a time when technology-mediated interviews were less common than in the present time. In an experiment using simulated selection interviews, we compared traditional face-to-face (FTF) interviews with telephone and videoconference interviews to determine whether ratings of interviewees’ performance, their perceptions of the interview, or their strain and anxiety are affected by the type of interview. Before participating in the actual interview, participants had a more positive view of FTF interviews compared to technology-mediated interviews. However, fairness perceptions did not differ anymore after the interview. Furthermore, there were no differences between the three interview media concerning psychological and physiological indicators of strain or interview anxiety. Nevertheless, ratings of interviewees’ performance were lower in the technology-mediated interviews than in FTF interviews. Thus, differences between different interview media can still be found nowadays even though most applicants are much more familiar with technology-mediated communication than in the past. The results show that organizations should take this into account and therefore avoid using different interview media when they interview different applicants for the same job opening.

Introduction

Over the past decades, technological progress has considerably changed how organizations recruit and select applicants (Tippins and Adler, 2011; Ryan et al., 2015; Ployhart et al., 2017). The computer and telecommunication technology now available allows organizations to use web-based and computer-administered selection tools at all stages of the selection process. Furthermore, given the COVID-19 pandemic, many organizations had to change their selection processes to web-based or technology-mediated procedures to be able to evaluate candidates even during times of physical distancing. To do so, rather diverse tools have been introduced such as multimedia simulation tests that are administered via the internet (Oostrom et al., 2010), internet-based testing that can be completed using a computer (Tippins, 2009) or even on a smartphone (e.g., Arthur et al., 2018), and many other procedures (cf. Tippins and Adler, 2011; Ryan et al., 2015; Woods et al., 2019).

The increasing use of technology-based selection is also accompanied by significant changes in how selection interviews are administered with organizations increasingly making use of technology-mediated interviews. Furthermore, there is not only an increase in the number of interviews that are administered via the internet, but even before the COVID-19 pandemic the number of interviews that are conducted via telephone has risen compared to earlier levels from the last millennium (Amoneit et al., 2020). This increased use of technology-mediated selection interviews in addition to, or instead of, the traditional face-to-face (FTF) interviews, raises important questions concerning the comparability of the different ways in which interviews can be conducted. However, although these questions have been previously examined, most of the relevant studies were conducted when telephone-based interviews were less common than nowadays and when easy-to-use videoconference systems like Skype, Google Hangouts, or Zoom were not commonplace. Hence, it is likely that participants in previous studies may not have been as familiar and comfortable with these systems as they may be now. Consequently, it is unclear whether these earlier results still generalize to current generations of applicants.

The purpose of the present study is to respond to repeated calls (e.g., Huffcutt and Culbertson, 2011; Levashina et al., 2014; Ryan and Ployhart, 2014; Blacksmith et al., 2016) for more research on technology-mediated interviews and to compare FTF interviews with telephone and videoconference interviews on a broad range of variables. Ratings of interviewees’ performance (i.e., of their answers to specific interview questions and/or to the interview as a whole) are central in this regard. However, interviews do not only have a diagnostic function (i.e., to identify the best applicant for a given job), but also serve an important recruiting function because organizations aim to present themselves in an attractive way regarding the applicants (Dipboye et al., 2012; Wilhelmy et al., 2016). Therefore, it is also important to examine interviewee perceptions of the different kinds of interviews.

Background

Selection Interviews

Meta-analytic evidence has shown that selection interviews can be highly valid selection tools that may even reach similar levels of criterion-related validity as tests of general mental ability (e.g., Huffcutt and Arthur, 1994; Huffcutt et al., 2014). However, such high levels of validity can only be obtained when structured interviews are used. In a seminal article, Campion et al. (1997) discussed several factors that affect the degree of interview structure and reviewed the existing related literature. These factors can be categorized as being related to consistency (e.g., asking the same questions to all applicants, all interviews are conducted by the same interviewer), question sophistication (e.g., asking questions that minimize probing and prompting of the interviewee), and evaluation standardization (e.g., using descriptively-anchored scales to rate the interviewees’ responses to each question or interviewers receiving training about how to rate applicant performance, cf. Chapman and Zweig, 2005).

Traditionally, interviews have been conducted face-to-face. However, as a consequence of the advancement of telecommunication technology – and recently also of the COVID-19 pandemic – a growing number of organizations do not only rely on traditional FTF interviews but also make use of technology-mediated interviews. These interviews can be administered via telephone or videoconference systems (Chapman et al., 2003) or they might even be conducted without an actual interviewer when organizations use asynchronous video interviewing technology. In these asynchronous video interviews, interviewees are shown the interview questions on the computer screen, record their answers with their webcam, and submit them via an online platform, so that the videotaped answers can be evaluated later (e.g., Brenner et al., 2016; Langer et al., 2017).

If one uses different administration media for different applicants who apply for the same job, this reduces the consistency with which the interview is administered and thus influences the degree of standardization. However, this factor was not yet considered by Campion et al. (1997). Furthermore, as pointed out by Huffcutt et al. (2011) in their model on interviewee performance, technology-mediated interviews might increase interviewees’ apprehensiveness when they are not familiar with a given interview medium (also cf. Lukacik et al., 2020). Furthermore, Huffcutt et al. (2011) raised the question whether using technology-mediated interviews impairs the identification of social cues that are sent from the interviewer. Therefore, it is an important question to determine whether the administration medium influences ratings of interviewees’ performance in the interview, interviewees’ perceptions of the interview, or their reactions to the interview procedure.

Relevant Theories in the Context of Technology-Mediated Interviews

With regard to technology-mediated selection interviews, several general theoretical approaches are relevant that have been developed to describe preferences and the suitability of different media for communication with others. The first of these theories is media richness theory (e.g., Daft and Lengel, 1986), which is concerned with communication media (e.g., email, telephone, or videoconference) and the extent to which these media allow or limit the transmission of information. The theory assumes that the use of different channels of information transmission (verbal, non-verbal, and para-verbal) reduces the ambiguity of a message and thus also the uncertainty of the communication partners. There have been concerns that the conveyance of (subtle) social cues is impaired during technology-mediated communication (e.g., Straus and McGrath, 1994; Chapman and Webster, 2001; Huffcutt et al., 2011). However, the extent to which such social cues can be perceived depends on the kind of technology used. While videoconferencing represents the upper end of communication bandwidth in technology-mediated communication (since it conveys auditory and visual information but also non- and para-verbal cues), telephone conversations may be positioned in the middle of the continuum while email communication would be at the lower end of it. Based on the media richness theory, one would expect that high bandwidth communication would be more advantageous in selection interviews (even though it should be acknowledged that low bandwidth communication might be beneficial in some other situations, e.g., Sauer et al., 2000). Accordingly, FTF interviews would be more beneficial than videoconference interviews because of the more comprehensive use of different channels of information transmission and videoconference interviews would be more beneficial than telephone interviews, which allow for the most limited information transmission.

Similar to media richness theory, social presence theory (Short et al., 1976) can also be used to explain the effects associated with computer-mediated communication. It assumes that communication media differ in the level to which communication partners experience the presence of each other, including the perception of the conversation partner’s gestures, gaze, and facial expressions. Furthermore, higher social presence is associated with more positive communication outcomes such as mutual attraction, trust, and enjoyment (e.g., Lee et al., 2006). In line with this, a study by Croes et al. (2016), for example, found that face-to-face communication led to more social presence, which in turn influenced the level of interpersonal attraction.

Social presence theory makes similar predictions as media richness theory but explains the phenomena in a different way. Thus, the importance of the degree of psychological awareness of another person is stressed rather than the level of communication bandwidth. With regard to selection interviews, FTF interviews should be the most beneficial medium in terms of perceived social presence. Although the interviewee is in a different location than the interviewer both during videoconference interviews and telephone interviews, both conversation partners see each other in videoconference interviews and are probably able to perceive more social presence, whereas social presence in telephone interviews is limited to the voice.

A third theoretical approach that helps to differentiate between interview media is Potosky’s (2008) framework of media attributes. According to Potosky, media communication can vary in four general attributes: social bandwidth, interactivity, transparency, and surveillance. Social bandwidth means that communication is easier in general when more communication paths are used. Therefore, this attribute is relatively similar to what is assumed in media richness theory. Interactivity defines the extent of interaction that is possible during an interview. Transparency refers to the fact that one is aware of technology-mediation during the interview. And surveillance describes the feeling that an interview could be surveilled or recorded by a third party. The first three attributes are beneficial in the context of technology-mediated interviews and the fourth attribute is negative.

Taken together, all three theoretical approaches agree that there are several advantages of FTF interviews compared to both kinds of technology-mediated interviews. This is either because of the most complete transmission of subtle cues (according to media richness theory), because of the actual presence of the conversation partners (according to social presence theory) or because of the highest level of transparency and the lowest risk of surveillance (according to Potosky’s media attributes). Furthermore, according to all the different approaches, videoconference interviews should be more advantageous than telephone interviews because the lack of visual information in the latter might make it more difficult to correctly interpret ambiguous social cues and might also impair the social nature of the interview. Furthermore, the lack of non-verbal behavior should result in interaction impairments (because non-verbal signals cannot be used as additional sources of conversation information) and impairments of transparency (see also Morelli et al., 2017).

In addition to theories that were developed to describe the suitability of different media for communication with others, it has also been suggested to consider an evolutionary perspective to better understand differences between FTF and technology-mediated interaction (e.g., Kock, 2004; Piazza and Bering, 2009; Abraham et al., 2013). According to evolutionary psychology (e.g., Buss, 2005), human behavior can be understood from the view of evolutionary adaptations that occurred over hundreds of thousands of years in response to only slowly changing environmental conditions. With regard to technology-mediated interaction, the evolutionary perspective assumes that humans have acquired competencies in FTF communication over very many centuries while the acquisition of competencies in technology-mediated communication is only very recent in the development of humankind. Furthermore, positive social interactions with others that have occurred FTF throughout the vast majority of human evolution also serve to satisfy the underlying need for belongingness (Baumeister and Leary, 1995). However, with the advent of technology-mediated communication, interactions do not always have to be FTF, which can impair the satisfaction of social belongingness needs of the communication partners who have been used to FTF interaction throughout evolution. In line with this, Sacco and Ismail (2014), for example, found that FTF interactions satisfied the need for social belongingness of their participants better than virtual interactions or compared to a control group without interaction. With regard to selection interviews, it is therefore also conceivable that the interview medium could negatively affect the social character of these interviews because of its impact on the lower satisfaction of belongingness needs. Thus, for technology-mediated interviews, evolutionary approaches converge with predictions from the other theoretical approaches.

Previous Research on Technology-Mediated Interviews

Most of the earlier research on technology-mediated interviews has focused on the effects of the different interview media on ratings of interviewees’ performance and on interviewees’ perceptions of the different interviews. Furthermore, most of this research focused on telephone and videoconference interviews in which an interviewer and an interviewee interacted directly. These two kinds of technology-mediated interviews are also the main focus of our study.

Concerning telephone and videoconference interviews, an obvious issue is that videoconferencing has seen considerable technological progress over the past decades, whereas fewer changes were observed for telephone interviews. This implies that research on telephone interviews that was conducted some time ago (e.g., Silvester et al., 2000; Straus et al., 2001) may still be relevant to this day. However, despite the absence of huge technological change affecting communication bandwidth, the prevalence of telephone interviews has also increased in recent years (e.g., Amoneit et al., 2020). This may be of importance as experience with telephone interviews has been identified as a moderating variable that may affect performance in these interviews (Silvester et al., 2000).

Increased familiarity with videoconferencing systems might not only affect the relevance of previous research (e.g., many students and employees are now familiar with videoconference systems like Skype, Google Hangouts, or Zoom) but also the considerable technological progress over recent years. Thus, some of the previous findings may be less relevant today because they were obtained with technology that is now considered to be obsolete (Ployhart et al., 2017). Due to rapid progress in computing power and the availability of high-speed internet and high-definition cameras, today’s communication bandwidth is considerably more extensive than it used to be 10–15 years ago. Therefore, performance impairments in videoconference interviews reported in studies published more than a decade ago should be considerably reduced when using contemporary technology. Accordingly, potential differences between FTF and videoconference interviews that were attributed to lower communication bandwidth may have become smaller, now. Nevertheless, in a series of three qualitative studies McColl and Michelotti (2019) still found that interviewers reported several limitations of videoconference interviews in comparison to FTF interviews. In addition to some remaining technical issues, these limitations included aspects such as impairments of non-verbal communication (e.g., eye contact, perceptions of hand gestures) and problems of the setting (e.g., lighting and noise).

Interview Performance Ratings in Technology-Mediated vs. FTF Interviews

Concerning the effects of the interview medium on interview performance ratings, a recent meta-analysis of previous studies, which have all been published at least 9 years before the meta-analysis, found that interviewees generally receive better ratings in FTF interviews than in technology-mediated interviews (Blacksmith et al., 2016, but see Chapman and Rowe, 2001, or Straus et al., 2001, for exceptions). Furthermore, the meta-analytic effects were of intermediate size and were comparable for telephone and for videoconference interviews. In addition, interviewees in previous studies also had higher outcome expectations in FTF interviews than in telephone or videoconference interviews, which means that they also considered their performance to be better in FTF interviews (Chapman et al., 2003) and assumed to be able to use more impression management in FTF interviews than in technology-mediated interviews (Basch et al., 2020a). However, despite using modern videoconference technology, two recent studies still found higher interview performance ratings (Sears et al., 2013; Basch et al., 2020b) in FTF interviews compared to videoconference interviews with mean differences of intermediate size.

A limitation of the meta-analysis by Blacksmith et al. (2016) beyond its reliance on relatively old primary studies is that the empirical basis for effects concerning specific types of interviews is also rather sparse (e.g., the meta-analytic estimate for the comparison for videoconference vs. FTF interviews is based on an N of only 103 individuals). In addition, the effects from the corresponding primary studies vary considerably so that specific aspects of the few primary studies and the respective interviews have more impact on the meta-analytic estimate than is usually the case in other meta-analyses. Furthermore, earlier primary studies usually also only compared FTF interviews with one kind of technology-mediated interviews (e.g., Silvester et al., 2000; Sears et al., 2013; Basch et al., 2020b). Thus, it is unclear whether the differences between FTF and telephone or videoconference interviews are indeed comparable when the content and other aspects of the interviews beyond the interview medium are held consistent. This is a relevant question, because in comparison to FTF interviews one would expect to find larger performance differences for telephone interviews than for videoconference interviews according to the different theoretical approaches because telephone interviews should go hand in hand with lower media richness, lower social presence and so on.

Another limitation of previous research is that it remains unclear whether the lower performance ratings were in fact related to lower interviewee performance or whether they were due to effects on the side of the interviewers. This issue is important in light of a study by Van Iddekinge et al. (2006) who found higher performance ratings when interviewees were rated on the basis of FTF interviews than when the same interviewees were rated on the basis of videotaped interviews. Thus, it is possible that the interview medium does not lead to lower performance by interviewees but to lower evaluations of this performance by raters.

Interviewee Perceptions of Technology-Mediated Interviews

Similar to the evidence regarding interview performance ratings, the majority of the previous studies also found that interviewees had a preference for FTF interviews in comparison to telephone or videoconference interviews and perceived them as fairer (e.g., Kroeck and Magnusen, 1997; Chapman et al., 2003; Sears et al., 2013). Furthermore, interviewees in Straus et al.‘s (2001) study felt more comfortable in FTF interviews than in videoconference interviews. In line with this, Blacksmith et al.’s (2016) meta-analysis found that, overall, interviewees react negatively to technology-mediated interviews in comparison to FTF interviews. Furthermore, evidence from a recent study by Basch et al. (2020b) that compared perceptions of FTF and videoconference interviews confirmed that social presence is a mediator of the effects of the interview medium on interviewees’ fairness perceptions.

The negative perceptions of technology-mediated interviews by interviewees are especially relevant for organizations because of evidence that lower fairness perceptions are accompanied by lower perceptions of organizational attractiveness (Bauer et al., 2004; Langer et al., 2020) and lower intentions to accept a job offer (Chapman et al., 2003). Thus, using technology-mediated interviews may have negative effects for the recruitment function of employment interviews (e.g., Wilhelmy et al., 2016).

Unfortunately, there are several limitations concerning previous research on interviewee perceptions of the different kinds of interviews. First, in contrast to previous research concerning interview performance, the available database related to interviewee perceptions of technology-mediated interviews is considerably smaller so that the meta-analytic results by Blacksmith et al. (2016) for the different interview media are based on an even more limited empirical basis with only two or three primary studies for each comparison. Furthermore, the primary studies that considered telephone as well as videoconference interviews either could not ensure that the interviews per se were comparable concerning the different interview media (Chapman et al., 2003) or the available videoconference technology that was used in the primary studies was much more plagued by impaired conversation flow in comparison to present technology (Straus et al., 2001). Finally, in some more recent studies, participants did not take part in actual interviews but only had to answer survey questions after a description of the different interviews (e.g., Basch et al., 2020a), they only observed videos of technology-mediated interviews (e.g., Langer et al., 2020), or the study did not consider telephone interviews (Basch et al., 2020b).

Strain and Anxiety as Reactions to the Interview

Even though fairness perceptions are the most commonly investigated aspect of applicant reactions, other reactions to interviews are also relevant. For example, Straus et al.’s (2001) finding that interviewees felt more comfortable in FTF interviews than in videoconference interviews already indicated that emotional reactions might play a role when different interview media are compared.

In this context, it is important to realize that selection interviews can be a strong psychosocial stressor. This stressor may lead to interviewee strain that manifests itself not only at the subjective level (e.g., by increased anxiety) but also at the psychophysiological level (e.g., by increased transpiration and heartbeat). In line with this, stress researchers even used the analogy of an employment interview when they designed stressful situations for their research (Kirschbaum et al., 1993). Furthermore, people with higher interview anxiety show lower interview performance (e.g., McCarthy and Goffin, 2004; Powell et al., 2018), which is consistent with general evidence that applicants who experience higher test anxiety achieve lower test scores (Hausknecht et al., 2004). The latter is of particular concern in the present context because there are considerable differences between interviewees regarding interview anxiety. Thus, given that applicants are affected to different degrees by interview anxiety, this may influence the selection decision in the form of reduced chances to receive a job offer for those applicants who suffer from strong interview anxiety. Beyond the obvious negative consequences for applicants, there may also be negative effects for organizations because they may reject suitable applicants when these are unable to show their true performance level during the interview due to excessive anxiety levels (cf. McCarthy and Goffin, 2004).

There is little work on test anxiety that has examined strain beyond self-report measures. While it has not been uncommon to use psychophysiological indicators of strain in many research areas within work and organizational psychology (e.g., Åkerstedt, 1990; Zeier et al., 1996), such measures have rarely been employed in personnel selection research and none of the previous studies on technology-mediated interviews has used such indicators. Their use would allow us to obtain a broader picture of the multiple effects of stressors because it would not be necessary to rely on self-report measures and performance indicators alone. Of the many psychophysiological indicators available, heart rate variability (HRV) may be particularly suitable for the present study, as it is sensitive to changes in mental strain and negative affect (Kettunen and Keltikangas-Järvinen, 2001).

The Present Study

The literature reviewed above has raised the central question to what extent technology-mediated interviews can influence interviewees’ performance and their perceptions of and reactions to the interview procedure. Until now, the research related to this question has mainly found that interviewees received lower performance ratings in technology-mediated interviews than in FTF interviews and that applicant perceptions of technology-mediated interviews were less positive (Blacksmith et al., 2016). However, most of the previous work was conducted at a time when a different generation of technology-mediated communication tools was used when the fidelity of the technology was much lower. At that time, interviewees were also less familiar with these communication tools. Therefore, it is unclear to what degree previous evidence can still be generalized to current applicants. Furthermore, there is a lack of research concerning strain and anxiety in the different types of interviews.

To address these issues, we set up an experiment to compare three different ways in which interviews can be conducted: In the traditional FTF manner, via telephone, or by using a videoconference system. In the latter two conditions, which used technology-mediated interviews, the interviewer and the interviewee did not meet in person. We compared the three conditions with regard to ratings of interviewees’ performance and to their perceptions of the interviews, but also with regard to the anxiety and the subjective as well as the physiological strain that they experienced.

Methods

Participants

A total of 95 German-speaking final year students and recent graduates from a Swiss university took part in the study. The data of 7 participants had to be discarded because they did not consent to using videotapes of their interview performance (see below). Thus, the final sample consisted of 88 participants (36 males and 52 females; age: M = 25.09 years; SD = 4.05). Post hoc power analyses using G^∗Power 3.1 (Faul et al., 2009) on the basis of our sample size revealed that we had a power of 0.53 to detect a medium-sized effect (using an alpha-level of 0.05) in a one-way analysis of variance and of 0.59 for t-tests for mean differences between two of the three groups.

The participants were from a broad range of subjects, with larger groups majoring in communication sciences (23.9%), psychology (15.9%), and economics (12.5%). On average, participants had been enrolled for 3.82 years (SD = 2.09). Participants were recruited via an email that was sent to all final year students and that invited them to take part in a simulated selection interview, allowing them to gain experience regarding selection procedures and to receive feedback on their performance. All the data were collected before the advent of the COVID-19 pandemic.

Procedure

To register for the study, participants had to complete an online registration form. After registration, they were asked to complete a questionnaire that contained questions concerning demographic and personal data. This was followed by a questionnaire measuring participants’ fairness expectations and favorability ratings for the three interview media that were examined (FTF, telephone, and videoconference). Finally, participants could make an appointment with the experimenter to come to the laboratory for the interview.

The interviews in the three experimental conditions were always conducted in the same room. This room was equipped for telephone as well as for videoconference interviews. When participants arrived in the room, they were told that the present study was investigating subjective and physiological reactions to selection procedures. Then, the heart rate monitoring system was attached to the participants’ chest and wrist. Participants were asked to sit quietly for 5 min while watching a relaxing video. The reason for this was that previous studies indicated that body movements may influence HRV measurement by introducing artificial variability (Jorna, 1992; Bernardi et al., 2000). The physiological data were collected during the last 2 min of this 5-min period and were used as a baseline for the experimental HRV measure. After the resting stage, participants completed the KAB, a short questionnaire to measure their current subjective strain levels.

Next, participants completed a short general mental ability (GMA) test. During this test, HRV data were again collected during the first 2 min of the test administration. Directly after the GMA test, participants completed the strain questionnaire for a second time.

Participants were then randomly assigned to one of the three interview conditions: FTF (n = 30), telephone (n = 26), and videoconference (n = 32). In all three conditions, the experimenter instructed participants to remain seated at all times (which also meant they should not get up in the FTF condition when the interviewer entered the room) to prevent contamination of HRV by movement artifacts. Furthermore, the participants were asked to imagine that they had applied for an attractive leadership position in their field of study and that they would have direct reports in this position.

After the experimenter had left the room, the actual interview began. In the FTF condition, the interviewer entered the room. To make the interview situation more authentic, the interviewer wore a suit and a tie in all conditions. For the telephone condition, a typical office telephone was used. The interview began by the interviewer calling the participant on the telephone. In the videoconference condition, a program (Skype) was used that allowed voice calls to be made over the internet with an additional videoconference function. The interviewer began the interview by calling the participant via computer, upon which a window opened on the participant’s screen, with the interviewer being shown in full screen mode on a stand-alone 22-inch computer screen. The interviewer and the interviewee used the built-in microphones to talk to each other.

To ensure that participants could have eye contact with the interviewer in the videoconference condition, the webcam that was used to record the interviewer was fixed in the middle of his screen. In contrast to this, the webcam used to record the participants was fixed on top of the screen.

At the beginning of the interview, the interviewer introduced himself. Then, the participant was asked to prepare a short self-introduction in which he/she should introduce himself/herself to the interviewer. Participants were told that this self-introduction should not last for more than 3 min and they were given 4 min to prepare their answer, which allowed us to measure HRV during the last 2 min of the preparation phase. After the self-introduction, the interviewer asked a set of standardized questions. This set consisted of seven past-behavior questions and four future-oriented questions (see below). All questions were asked in the same order and no probing or follow-up questions were used. The interviewer took notes and evaluated the participants’ answers before asking the next question.

After the interview, participants were asked for a self-evaluation of their overall interview performance. Then, they were asked to complete a questionnaire to rate their current level of interview anxiety and the perceived fairness of the interview they had just experienced, and to complete the strain questionnaire again.

During the interview, all participants were videotaped with a hidden video camera so that their performance could be evaluated by a second rater after the completion of the study. This hidden camera was necessary because in the telephone condition in particular the feeling of being observed might have changed interviewees’ perception of the situation. After the last strain questionnaire, participants were debriefed and informed that a video recording had been made. They were asked to grant permission that these video recordings can be used for later analyses. As noted above, seven participants did not grant permission so that their video recordings were deleted by the experimenter in the presence of the participant. Thus, no data from these participants were used for later analyses.

Measures

Interview Performance Ratings

As noted above, the interview consisted of two parts, a short self-introduction and a set of standardized questions containing seven past-behavior questions and four future-oriented questions. The past-behavior questions asked interviewees to recall situations they had previously experienced and to describe what exactly they had done in those situations. The future-oriented questions asked interviewees to describe what they would do in hypothetical situations presented to them. Two of the questions targeted Cooperation, three targeted Information Management, five targeted Leadership, and one targeted Systematic Planning. All interview questions were taken from previous studies in which participants stem from similar populations (Melchers et al., 2009, 2012). The questions were suitable for university graduates applying for management trainee positions. An example question can be found in the Appendix.

Several ratings of interviewees’ performance were collected. First, the interviewer and a second rater (two Master students) rated interviewees’ performance in the two main parts of the interview (i.e., the self-introduction and the 11 structured interview questions). For both parts of the interview, these ratings were made on descriptively-anchored 5-point rating scales, ranging from 1 = poor through 3 = average to 5 = good. To ensure that both raters had a common frame of reference for their evaluations, each rating scale had descriptive anchors. These anchors were similar to BARS (Smith and Kendall, 1963) and described what behavior of an interviewee would be considered as poor, average, or good (see the example in the Appendix). The rating scales were employed in previous studies (Melchers et al., 2009, 2012) after they had undergone extensive pretesting. After the interview, the interviewer and the second rater evaluated the participants’ overall interview performance on another rating scale, again ranging from 1 = poor to 5 = good. Finally, participants also provided self-ratings of their overall interview performance on a 6-point rating scale ranging from 1 = very poor to 6 = very good.

Prior to the first interview, both raters were trained over a period of 2 days. During this training, they were introduced to the self-introduction and the different interview questions as well as to definitions and behavioral examples for answers to the different interview questions. In order to develop a consistent frame of reference for rating the interviewees’ answers, raters received specific training on the purpose of each interview question, typical errors committed by raters, and the idea behind descriptively-anchored rating scales (Melchers et al., 2011; Roch et al., 2012). Furthermore, it was emphasized that it was crucial to conduct all interviews in the same prescribed manner, that is, to read the interview questions as printed on the interview forms and not to rephrase them or give additional cues. After completing the training, one of the students was assigned the role of the interviewer during the experiment while the other one served as the second rater and also as the experimenter, who welcomed participants and looked after them.

For our later analyses, we averaged the ratings from both raters for the interviewees’ overall interview performance and for the self-introduction, respectively, and also the means from both raters across all the structured interview questions. To determine the inter-rater reliability of the ratings from the interviewer and the second rater, we calculated intraclass correlations (ICC 2,1). Mean inter-rater reliabilities (i.e., the reliability of each rater) were 0.81 (overall interview performance), 0.71 (self-introduction), and 0.96 (overall mean across the structured questions).

Interviewee Perceptions

We used two interviewee perception measures. First, participants rated the favorability for each of three interview media on one item (“If you applied for a job, how much would you like to go through each of the following selection procedures?”) on a scale ranging from 1 = not at all to 6 = a lot. And second, participants were asked twice to provide ratings of interview fairness. Prior to the interview, they rated the expected fairness of each interview medium (FTF, telephone, and videoconference) on a one-item 6-point rating scale (“Please rate the fairness of the following selection procedures?” ranging from 1 = very unfair to 6 = very fair). After the interview, participants rated the fairness of the interview again, but this time only the interview medium they had experienced during the study (“How fair did you feel that the experienced interview was as a selection procedure?”), using the same 6-point rating scale.

Strain and Anxiety

We used two ways to measure participants’ strain. First, we used the KAB (Müller and Basler, 1993), a short German-language questionnaire, to measure participants’ current subjective strain. In this questionnaire, participants had to indicate how they were feeling “right now” on six bipolar adjective pairs (e.g., tense – relaxed) using a 6-point scale. In our study, internal consistencies ranged between 0.80 and 0.86 for the different measurements.

Second, we measured heart rate variability as a psychophysiological indicator of participants’ strain. HRV describes the variation of the interval between two successive heart beats. By using spectral analysis, the main components of HRV can be separated and analyzed individually. The high frequency (HF) band represents the sympathetic activation and the low frequency (LF) band describes the parasympathetic activation (Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology, 1996). The relation of these two components (i.e., the LF/HF ratio) is an indicator of psychosocial strain with a low ratio indicating elevated levels of strain (Bosch et al., 2003).

The Polar S810iTM heart rate monitor (Polar S810iTM, Kempele, Finland) was used to measure HRV in a non-intrusive manner. Specifically, each participant was equipped with a belt, containing a pulse monitor, worn around the chest. The heart rate data was transferred to a watch (worn in addition to the belt) via an infra-red connection. The data were subsequently analyzed by using the Kubios software (Niskanen et al., 2004).

In addition to strain, we also measured participants’ state anxiety during the interview. To do so, we used the MASI (Measure of Anxiety in Selection Interviews, McCarthy and Goffin, 2004) and converted the items into a state measure, which allowed us to measure the impact of interview medium on current levels of interview anxiety. The conversion involved the translation into German, but also the removal of 4 of the original 30 items because they did not apply to our study (e.g., “When meeting a job interviewer, I worry that my handshake will not be correct” because in two of the three conditions the interviewee did not meet the interviewer in person). Furthermore, changes in wording were made so that all items referred to the specific interview situation rather than the experience of being interviewed in general.

The MASI items covered aspects such as communication anxiety (e.g., “During this interview, I often couldn’t think of a thing to say”), appearance anxiety (e.g., “I often felt uneasy about my appearance during this interview”), social anxiety (e.g., “I was worrying about whether the interviewer was liking me as a person”), performance anxiety (e.g., “During this interview, I got very nervous about whether my performance was good enough”), and behavioral anxiety (e.g., “I felt sick to my stomach during the interview”). Participants rated all items on a 5-point Likert scale ranging from 1 = strongly disagree to 6 = strongly agree). In line with McCarthy and Goffin (2004), the mean across all items was calculated as an overall measure of participants’ current interview anxiety. Coefficient alpha for the MASI was 0.88.

Additional Variables

We used a short but extensively validated test from a consultancy to measure GMA. This test represents a commonly used measure of cognitive ability and contained 50 items to assess verbal, arithmetic, and spatial reasoning. Participants had to complete as many items as possible within 12 min.

In addition, participants provided demographic information and answered questions concerning their previous experience with face-to-face selection interviews and also with telephone and videoconference interviews. For each of these interviews, they were asked to indicate the number of previous interviews that they had experienced in the past. Furthermore, participants also had to indicate their body size and weight to determine their body-mass index and to answer questions concerning their activity levels because these variables might influence HRV.

Results

Preliminary Analysis

Correlations and descriptive data for all study variables are shown in Table 1. Inspection of this table shows that participants’ performance in the interview (except for the self-introduction) was significantly related to interview state anxiety and to perceived psychological strain during the interview, all rs > | −0.21|, all ps < 0.05. However, HRV was not related to interviewees’ performance.

TABLE 1

Table 1. Descriptive information and intercorrelations for study variables.

Concerning the comparability of the experimental conditions, the preliminary analyses revealed that participants in the three experimental groups did not differ with regard to age, sex, experience with technology-mediated interviews, body size, or weight, but that they differed with regard to their previous experience of selection interviews in general, F(2,85) = 4.60, p < 0.05, η² = 0.10, (M = 3.20, SD = 1.88, for the FTF condition, M = 4.42, SD = 1.94, for the telephone condition, and M = 3.19, SD = 1.38, for the videoconference condition). Therefore, we used interview experience as a covariate for all later analyses that compared the three experimental groups.