The relationship between subconstructs of empathy and general cognitive ability in the context of policing

Purpose Empathy has been widely theorized as an important ability in professions such as policing, in which to perform well individuals require multiple and interacting abilities, not least when resolving conflict situations. Even so, there are few studies investigating how subconstructs of empathy relate to other constructs such as general cognitive ability. The purpose of this paper is to establish, after evaluating psychometric properties, relationships among measures of empathy and cognitive ability in a sample of Swedish police students (n = 157). Design/methodology/approach Multiple latent variable models of how the different measures work to predict tasks that can be seen as proxies for the ability to understand another person’s situation and intentions are evaluated to determine the most robust relationship(s) within the data. Findings We find support for the psychometric properties reported in previous studies with the used instruments. We also find support for perspective-taking, a cognitive empathy subconstruct predicting the ability to recognize emotions, and also the affective part of empathy, predicting general cognitive ability. These findings are discussed at length in the paper. Originality/value This research adds more knowledge to the issue of how general cognitive ability relates to cognitive empathy and other subconstructs of empathy or Theory of Mind.


Introduction
The ability to understand others' situations and intentions (i.e., empathy or Theory of Mind (ToM)) involves being able to perceive events or phenomena from the point of view of another person (Baron-Cohen et al., 2015). Possessing this capacity is important when dealing with difficult situations, such as when dealing with a victim or a suspected perpetrator at a crime scene (Beauregard et al., 2007;Maddox et al., 2011) or when trying to de-escalate a conflict (Abanonu, 2018).
Understandably, empathy and/or ToM is associated with general cognitive abilities (Gore et al., 2010;Rakoczy et al., 2012). However, there is little evidence of how empathy relates to higherorder constructs, such as intelligence, because it may have more to do with memory, that is if the observer has previous experiences of what a person might be experiencing in specific situations (Baron-Cohen et al., 2001). Hence, this paper examines the relationship of empathy to a number of related mental constructs (i.e., fluid intelligence, emotion recognition, intention detection, and working memory) in the context of police education, which emphasizes the importance of empathy. The study contributes to the field by showing that affective empathy has more to do with fluid intelligence, while cognitive perspective-taking related to emotion recognition.

Understanding empathy
Research into the process of understanding another person's intentions or experiences as a facilitator of daily encounters can refer to related processes (Baron-Cohen et al., 2015;Launay et al., 2015). Cognitive empathy, perspectivetaking, or ToM are terms used to describe these processes (Baron-Cohen et al., 2015;Launay et al., 2015). However, empathy has two dimensions, one cognitive and one affective (Decety and Jackson, 2006;Jolliffe and Farrington, 2006). The cognitive dimension involves understanding another person's intentions or experiences and beliefs of others (Singer, 2006). The affective dimension includes the bodily sensation such as being able to mimic the expression of others' emotional states (Decety and Jackson, 2006). While rapport has been suggested as an essential component of interpersonal relations (Jakobsen, 2021), it is not clear whether this involves cognitive or affective components.
This means that cognitive intellectual abilities involved in empathy may not be sufficient to explain why or how empathy operates. The number of studies focusing on the relation between empathy and cognitive ability is limited. An early study used the revised version of the Reading the Mind in the Eyes test (RMET), which measures how a test-taker can understand the mental state of another person (Baron-Cohen et al., 2001). The study reported no statistically significant correlation between the RMET and IQ (r = 0.09, p = 0.60). However, a later meta-analysis investigating how the RMET correlated to intelligence found a positive significant correlation (r = 0.24) with a robust effect size (Baker et al., 2014). Launay et al. (2015) found that there was a positive association between higher forms of mentalizing and emotion recognition in the RMET (r = 0.21, p < 0.001, n = 279), but they did not find any relation between mentalizing scores and self-reported empathy, measured with the Empathy quotient (EQ) instrument (r = 0.022, p = 0.33, n = 282). Guo et al. 's (2019) study with undergraduate students (n = 518) reported a significant correlation between perspective-taking in the Interpersonal Reactivity Index (IRI) and fluid intelligence measured with Raven's Standard Progressive Matrices, but the correlation was stronger between affective empathic concern and fluid intelligence (r = 0.17 vs. r = 0.27). In a multiple mediation analysis, they reported cognitive ability had a positive effect on perspective-taking (β = 0.22, SE = 0.03, p < 0.001) and also on empathic concern (β = 0.26, SE = 0.03, p < 0.001).
From this research, it would appear that the affective aspect of empathy is dependent on overall intelligence, but self-reported cognitive perspective-taking may be quite independent. However, further research with normal populations is needed to establish the nature of empathy outside clinical groups (Richell et al., 2003).
A related capacity to intelligence that may be related to empathy is executive functioning (EF). Yan et al. (2020) found a significant correlation between cognitive empathy and subcomponents of EF such as inhibitory control (r = 0.23, p < 0.001) and working memory (r = 0.20, p < 0.001) was found, while the affective dimension of empathy had a weaker relation. Another recent meta-analysis (Martingano and Konrath, 2022) found a positive association between the cognitive part of empathy and rational thinking as expected with both empirical data and meta-analysis.
Another issue related to the measurement of empathy is the difficulty individuals have in providing valid information on their empathic ability (Murphy and Lilienfeld, 2019). Murphy and Lilienfeld (2019) suggest that the weak relations between empathyrelated self-reports and task performance might be a consequence of measures that do not have acceptable measurement quality. In line with these reflections Baker et al. (2014) argue that the correlation would probably be even stronger if tests measuring these types of constructs had levels of reliability that were as high as intelligence tests.

Empathic policing
Empathy has been identified as an important ability when working as a police officer (Perez, 2010;Inzunza, 2015a). It is particularly relevant for professions where interaction with the public often involves difficult circumstances (Skogan, 2006;Rahr and Rice, 2015;Inzunza, 2015a). In the field, a police officer needs to understand how others perceive a complex critical situation, and at the same time have the capacity to regulate their own perceptions in order to act professionally. When developing models that include these types of processes they are often Frontiers in Psychology 03 frontiersin.org multi-dimensional models that include both a cognitive and an affective dimension (Davis, 1983;Inzunza, 2015b). The importance, and indeed the necessity, of including other elements such as regulating processes has also been discussed and incorporated in some of these models. This is especially relevant when applied to professions such as policing, where the complexity of some daily encounters may require robust self-regulatory processes (Inzunza, 2015a). In this context, the major goal is to end encounters without harm for any party. This requires the ability to predict how a situation may develop based on the various actors' intentions in a social situation (Abanonu, 2018). Policing, therefore, requires an adequate level of intelligence and resilience (Romosiou et al., 2019), as well as both cognitive and affective empathy. This perspective-taking and empathy capability is important for police, especially when conducting interrogations. There are several ways offenders act to make a story sound truthful (i.e., offender lie-telling strategies; Strömwall and Willén, 2011). Such strategies can involve impression management strategies or information management strategies, both of which require close attention from the police interrogator. Interrogators need to pay attention to minute details and use their verbal communication skills to retrieve information so that they can understand the intentions of the offender. Here the primary goal is to increase the probability of solving complex crime events. All parts are necessary when approaching a crime within the framework of the choices and decisions made by alleged offenders before, during, and after committing a specific type of crime (Leclerc, 2017).
Police need to demonstrate empathy also when working with crime victims (e.g., Maddox et al., 2011;Inzunza, 2022). This seems to be a somewhat challenging ability to learn for police students because the identity formation process that takes place within the profession struggles to treat vulnerable citizens appropriately (Charman, 2019).
A small-scale (n = 60) study with police students found no statistically significant relations when studying the total scores between a subjective self-report measure of empathy (i.e., an adapted version of the Empathy Assessment Index; EAI) and objective tasks aimed at measuring similar constructs (Inzunza et al., 2019). However, those self-reporting highly on perspectivetaking also performed better on the objective tasks. Other studies have noted the importance of understanding the relation of subconstructs of empathy to other constructs.
The practical value of researching this area can be seen both in the context of selection and the education and training of future police officers. The construct of empathy has been identified as crucial for general interactions in terms of social skills but also when conducting training in specific units within police organizations such as de-escalating conflict situations (Abanonu, 2018). In a recent Danish study of how official political goals have been translated to measurable criteria, the construct of empathy played a crucial role (Bloksgaard and Prieur, 2021). The study concluded that, within Danish police, police officers need to express emotions in a controlled way so that these communicative skills counterbalance their authority and ability to act; indicating the importance of empathy in modern policing.
Training of police is an important tool to their development of de-escalating strategies. Performance associated with empathy (e.g., understanding the situation of the other) is at focus when implementing specific programs aimed at improving police officers' interviews with victims (Darwinkel et al., 2013).
A more detailed understanding of how aspects of empathy relate to other cognitive abilities is important to the design of police education programs. Hence, this study focuses on empathy and its subconstructs, how they can be measured and how they relate to other selected measures within police education populations. Two hypotheses were investigated: H1: There is a relation between subconstructs of empathy and cognitive ability.
H2: Subconstructs of empathy have a positive relation to tasks measuring the corresponding abilities.

Materials and methods
Thus, the goal of this paper is to establish if there are relationships between measures of empathy and cognitive ability in a sample of Swedish police students. A prioritization was to collect information from the population of interest and support the validity argument of presenting a valid interpretation of the results from this specific group of test-takers (Kane, 2010). A key feature of this study is that it integrates self-reports and objective tasks. Multiple models of how the different measures work to predict tasks that can be seen as proxies for the ability to understand another person's situations and intentions are evaluated to determine the most robust relationship(s) within the data.

Participants
The participants in this study were police students enrolled in three of the Swedish national police programs during fall of 2017 (i.e., Umeå, n = 54; Södertörn, n = 48; and Linnaeus, n = 55). The approximate percentage of males was 72-73%, and their mean age in years was 25.7 (Umeå), 26.2 (Södertörn), and 25.4 (Linnaeus). Compared to the total group of police students in Sweden, this group of 157 students is relatively representative in terms of age and gender distribution (polistidningen.se). Through voluntary participation requirements, the obtained response rate varied by location (Umeå, 87%; Södertörn, 22%; and Linnaeus, 22%). The amount of police students differs between the three locations where Södertörn and Linnaeus have more students enrolled in their programs and it is more difficult to reach all students. Since the allocation of Swedish National Police students is centrally organized, the different police programs have a similar profile Frontiers in Psychology 04 frontiersin.org regarding individuals recruited, and they experience similar content throughout their training. It is noteworthy that Swedish police education is a two-year undergraduate university qualification plus 6 months on-the-job apprenticeship, rather than a brief sub-baccalaureate qualification as seen in some other jurisdictions. Of the 157 participants, complete data on all constructs ranged between 154 and 157, suggesting a very high rate of data recovery. Note that no pressure was put on the respondents to participate or to perform well on the different measures, meaning score means and relations might be different if the scores were used to select candidates for entry to the police.

Instruments
A battery of instruments was deployed to measure subconstructs of empathy and cognitive ability. The three instruments used to measure empathy and ToM (i.e., Swe-EAI, Eyes test, and Swe-IMT) have all been used in the police context in previous studies (see Inzunza, 2015b;Inzunza et al., 2019). Intellectual ability was measured with the Raven APM Set II measures of fluid intelligence (also referred to as cognitive ability).

Empathy assessment index (Swe-EAI)
The Swedish version of the EAI (Swe-EAI) is a self-report instrument that examines test-taker perceptions of their own ability to empathize. The Swe-EAI differentiates four aspects of empathy (Inzunza, 2015b), being affective response (AR, with six items), perspective-taking (PT, with six items), self-other awareness (SOA, with four items), and emotion regulation (ER, with four items). Participants respond to the items by choosing between six alternatives on a scale from never to always. A previous study (Inzunza, 2015b) indicated that the four-factor model had acceptable fit and levels of reliability for the four dimensions. The reported estimate of internal consistency for the complete scale was α = 0.74, for the affective dimension with the AR factor α = 0.70, for the cognitive dimension including the factors PT, SOA, and ER α = 0.73 (Inzunza, 2015b). Since SOA and ER are part of the regulating system in the process of empathy (meaningful when all subscales are used to measure empathy, but not when making the distinction between affective and cognitive empathy) as proposed and discussed in a previous study by Inzunza (2015a), they were omitted from the inter-correlation and relationship analyses to the other measures.

Imposing memory task (Swe-IMT)
The Swedish version of the IMT (Swe-IMT) was initially developed to assess how respondents could deduce the intentions and actions of others by understanding their mental states-this is also referred to as mentalizing ability (Kinderman et al., 1998). This test consists of three short stories about social situations with several characters, in which the test-taker, after reading the stories during a short period of time, answers questions and statements in an answer booklet. The questions and statements, answered in a true/false format, measure either intention (ToM) or memory. The outcome of the test can be reported as a total score or as sub-scores for the intention items, the memory items, or the mean number of errors. In a previous study (Inzunza et al., 2019) with 78 Swedish national police students participating, the mean error for 30 intention items was 5.8 (SD = 2.5) and the mean error for the 36 memory items was 4.8 (SD = 2.5). The intention and memory scores had a statistically significant correlation to each other (Spearman's ρ = 0.42, p < 0.01). To avoid fatigue, two stories rather than three were used, producing a test of 20 intention and 24 memory items.

Reading the mind in the eyes test (Eyes test)
The Eyes test requires the test-taker to infer the mental state of another person based solely on a picture of their eyes (Vellante et al., 2013). The underlying assumption of the test is that the mental state of a person can be determined by the appearance of their eyes. The test consists of 36 images of different persons expressing a mental state and a four-option multiple-choice questions for each image about the intention or emotion of the picture. Several previous studies with different populations have reported mean scores ranging from 26.2 (SD = 3.6) to 28.0 (SD = 3.5; Baron-Cohen et al., 2001;Söderstrand and Almkvist, 2012;Inzunza et al., 2019). Reliability in terms of alpha has not been frequently reported in the original version of the Reading the Mind in the Eyes Test; instead, there are examples of Guttman' split-half with a value of 0.77 with the Eyes test or test-retest reliability by intraclass correlation coefficient at 0.833 (Vellante et al., 2013). Not reporting the alpha reliability has been the common procedure with tests designed to assess emotion recognition due to the complexity involved in those tasks when correlating different items (Fernández-Abascal et al., 2013). On the other hand, there are other benefits with this type of test. In contrast to self-reports, a test of this sort is insensitive to socially desirable responses (Van Prooijen and van Dijk, 2014). Another dimension is the distinction between reliability and homogeneity, where the latter is not helpful for validity when high (Cattell and Tsujioka, 1964). The main attention in previous studies has been given to investigating the validity of the test, i.e., to what degree it captures different mental states and expressions and measures the respondent's abilities to distinguish between these.

Raven APM set II fluid intelligence ability (Ravens)
The Raven APM Set II measures the fluid intelligence ability to solve problems without reference to previous information or knowledge (Carpenter et al., 1990;Bors and Stokes, 1998). The items consist of matrices of eight related patterns presented across three rows, where the test-taker has to complete the matrix pattern by choosing the correct ninth option from a set of eight alternatives. The entire Set II consists of 36 problems which require approximately 40 min to complete. The reported mean score from a sample of 506 first-year university students was 22.17 (SD = 5.60), and reported internal consistency in terms of alpha was 0.84 (Bors and Stokes, 1998). In another study with university students, the reported mean score was 23.7 (SD = 5.7; Arthur Jr and Day, 1994). However, in this study, each participant answered just 18 items Frontiers in Psychology 05 frontiersin.org from either the odd or even numbered items to avoid fatigue. Given that items are of increasing difficulty this generates two parallel tests of fluid intelligence. This type of instrument was expected to be familiar to the participants in our research since intelligence tests are used in the selection procedure for applicants to the Swedish police program (Annell et al., 2014).

Procedure
The police students that participated in this study were informed that their participation was voluntary in an oral introduction given at the police academies and also in a cover letter. The participants were assured that all data would be treated according to prevailing research practices regarding confidentiality and ethics. The same procedures were at work in each of the three universities where data were collected. The police students completed the battery of instruments as described earlier. Data were collected from groups at each academy, with the participants sitting at individual tables under invigilated examination conditions.

Statistical analyses
Consistent with the two-step protocol (Anderson and Gerbing, 1988), measurement characteristics of each instrument were determined before inter-construct modeling took place. A combination of confirmatory factor analysis (CFA) and scale reliability estimation was used to establish the best-fitting set of items for each scale. Once scales were validated, inter-relationships among scales were examined using multiple structural equation models. Tested models included (a) simple inter-correlated scales, and (b) structural equation models (SEM). Analyses were conducted with latent variable software Mplus version 8 Muthén, 1998-2017). STATA version 16.1 (2020) was also used for calculating Raykov's factor reliability coefficient.
Swe-EAI. Because the Swe-EAI has four latent factors, CFA was used to evaluate its structure.
Swe-IMT, Eyes test, and Ravens. These scales use classical test theory scoring protocols, summing all items scored as correct according to the scoring key and thereafter calculating reliability coefficients. The test and sub-test scores are evaluated for reliability with scale reliability estimation, which allows calculation of the standard error of measurement (i.e., SEm = SD*√(1-α)).
Data preparation. Because the sample size is small relative to the total number of items (155,118), SEM analysis used parceled scale scores for the Swe-IMT intention and memory, Eyes test, and Ravens test to give a more appropriate cases-to-items ratio (155:24) (Costello and Osborne, 2005).
Model fit. The fit of CFA and SEM models was assessed with several fit indices using conventional thresholds (Browne and Cudeck, 1992;Hu and Bentler, 1999;Albright and Park, 2009). Because the Chi-square fit index is overly sensitive for large degrees of freedom (Wheaton et al., 1977), the ratio of χ 2 /df is accepted as fit evidence if it has a statistically non-significant p > 0.05. The comparative fit index (CFI) indicates acceptable fit if it is above 0.90 and good fit above 0.95. We also included the more stable gamma hat index, interpreted in the same way as the CFI (Fan and Sivo, 2007). Both the root mean square error of approximation (RMSEA) and standardized root mean residual (SRMR) indicate acceptable fit if they are below 0.08 and good fit when below 0.05 (Browne and Cudeck, 1992;Hu and Bentler, 1999;Albright and Park, 2009).

Measurement models
A four-factor structure with 20 items of the Swe-EAI showed acceptable fit to the data (χ 2 (164) = 262.30, χ 2 /df = 1.60, p = 0.21; CFI = 0.88; gamma hat index = 0.93, RMSEA = 0.070, 90% CI [0.056, 0.083]; SRMR = 0.08). The relatively low CFI value probably arises from model complexity (Fan and Sivo, 2007) and can be discounted in light of the more robust values for the more stable indices (i.e., gamma hat, SRMR). This administration of the Swe-EAI had comparable results to those seen in previous studies (Inzunza, 2015b;Inzunza et al., 2019). Table 1 below shows descriptive scale statistics, as well as three estimates of reliability for the two empathy subconstructs Affective Response (AR) and Perspective-Taking (PT) showing similar information concerning scale characteristics.
The Swe-IMT worked as intended in its shorter version. The mean number of errors was almost twice as high in the items measuring intentions compared to items measuring memory, suggesting that the ToM component is a more demanding skill ( Table 1). The correlation between the number of errors was similar to previous studies (Spearman's ρ = 0.47, p < 0.01; Pearson r = 0.61, p < 0.01). The scale estimates of reliability were lower than normally expected, but high scale reliabilities may reflect item homogeneity (Cattell and Tsujioka, 1964), which is not the case for the IMT Intention and Memory scales that are more heterogeneous in content. These values are deemed sufficient for research purposes.
The Eyes test mean (Table 1) is within the mean ranges reported previously. The conventional estimates of scale reliability were low, but this is expected with binary-scored items. Nevertheless, the SEm was relatively small, indicating that score differences of ≥7 are statistically significant at a 95% confidence level. The mean score for the odd Ravens items was slightly higher (d = 0.60, 90% CI = 0.27-0.92) than the mean score of the even items (Table 1). However, the estimates of reliability were robust, producing quite small SEm relative to means.

Scale inter-relationships
The factor inter-correlations for participants with complete data showed that few of the linear relations had statistical significance (Table 2). Of interest here is the weak positive relationship of the two EAI and two IMT scales to the Eyes test. In contrast, only the AR and IMT Intention scales had a similar strength of relationship with the Ravens scale. The withinconstruct correlations for the Swe-EAI were moderate and strong for the Swe-IMT scales.
The fully correlated model, with six items each for PT and AR, had poor fit to the data (χ 2 (93) = 178.34, χ 2 /df = 1.97, p < 0.01; CFI = 0.79; gamma hat = 0.93., RMSEA = 0.081, 90% CI [0.063, 0.098]; SRMR = 0.079). Trimming two items each from PT and AR for weak loadings generated a better fitting model (χ 2 (43) = 62.34, χ 2 /df = 1.82, p < 0.05; CFI = 0.933; gamma hat = 0.97, RMSEA = 0.058, 90% CI [0.024, 0.085]; SRMR = 0.055). Instead of treating each scale as an independent construct, the three measures reflecting processes within the cognitive dimension (i.e., Ravens, IMT Intention, and IMT Memory) were treated as indicators of a latent trait for cognitive processes measured as tasks. Given that the Eyes test is meant to reflect and thus indirectly measure the mental state of an individual, it was also positioned as an indicator of overall cognitive processes measured with cognitive tasks. Rather than simply correlate the scales, both PT and AR were situated as predictors of the three cognitive processes and the Eyes test to test the hypotheses. That model had good fit (χ 2 (51) = 78.35, χ 2 /df = 1.54, p = <0.05; CFI = 0.91; gamma hat = 0.97, RMSEA = 0.058, 90% CI [0.030, 0.083]; SRMR = 0.063.) and showed that PT had a modest standardized regression weight to the Eye test and AR had a weaker regression weight to Ravens (Figure 1).

Discussion
The aim of the present study was to shed light on the relation between measures of empathy and higher-order constructs of cognitive ability. There are few studies within this area and even fewer dealing with the specific population investigated in this study, that is police students. Within the police profession, it is often emphasized that to function well in daily work routines it is important to be prepared for complex and emotionally challenging situations with citizens in different contexts (Romosiou et al., 2019). Some examples of such situations include de-escalating conflict situations (Abanonu, 2018), interacting with victims of crime at crime scenes (Maddox et al., 2011), interrogating individuals with hidden agendas (Strömwall and Willén, 2011), or conducting investigative interviews of victims (Jakobsen, 2021). When facing these complex contact situations, there are several processes at work, both cognitive and emotional, and it is valuable to understand better how they are related. This knowledge is certainly necessary when seeking to implement different programs for improving police performance, or when allocating police officers for different missions.
In our study, we found that the psychometric properties from each instrument were similar to the ones seen in previous research. The multi-dimensional self-reported instrument of empathy Swe-EAI could differentiate between the two areas of cognitive and affective dimensions (Inzunza, 2015b;Inzunza et al., 2019). All task instruments developed to be used as proxies for the different abilities of interest also worked in a similar way as in previous studies (see Bors and Stokes, 1998;Kinderman et al., 1998;Baron-Cohen et al., 2001;Söderstrand and Almkvist, 2012;Inzunza et al., 2019).
After conducting several analyses, the findings that emerge from our data indicate that the affective dimension (AR) and the cognitive dimension (PT) do predict performance in different tasks (Figure 1). The preset hypotheses were not fully supported, they were supported to some level. Regarding the relation between subconstructs of empathy and cognitive ability, there is a significant prediction from the affective dimension, affective response (AR), to the shortened version of the Raven fluid test, which is a proxy for fluid intelligence ability: this is in line with some of the associations found in previous studies (e.g., Guo et al., 2019). The second hypothesis, if there is a positive relation between the subconstructs of empathy and tasks, the results show that perspective-taking ability is associated with how well a testtaker will perform in the Eyes test, which is a proxy for the ability to understand the mental state of others.
Another finding of interest is the non-significant relation between fluid intelligence and the Eyes test, which concerns the ability of a respondent to correctly identify the emotional states of others (Baron-Cohen et al., 2001;Baker et al., 2014). There are multiple reasons for there not being a relation since fluid intelligence may be something distinct from where an individual shows interest in another individual in the sense of what they may be experiencing. With the chosen sample in this study, we found a relation between self-reported perspective-taking and the Eyes test but no relation between the Eyes test and the shortened Ravens test measuring fluid intelligence. This finding is in line with the theories suggesting that ToM is associated with cognitive empathy (Singer, 2006). Further, the standardized loading from perspective-taking was considerably greater than from the latent cognitive task ability, suggesting that the Eyes test reflected more of the ability to empathically relate to how another person feels rather than a strictly intellectual ability. This gives some support to the notion that such relation is probably not direct, or is so weak that it is not beyond mere chance. Although the memory part of the IMT had a weak correlation with perspective-taking (Table 1), the more robust latent variable model (Figure 1) showed that after controlling for measurement error this association was not greater than chance. The memory part of the IMT showed no correlation to the shortened Ravens test, which was an unexpected result considering that previous studies have shown a relationship between working memory and episodic memory and fluid intelligence (Fukuda et al., 2010;Brewer and Unsworth, 2012). The memory part from the IMT was developed to control for memory not being an issue when measuring the intention part of the test, and it may not be a developed memory test per se.
In practice, the findings indicate that it is important to consider the interest of future applicants to professions dealing with people since performing well in tasks requiring general cognitive ability is different from performing well in tasks requiring the understanding of the other. Programs focusing at developing performance requiring these abilities (see Darwinkel et al., 2013) may require a different pedagogical approach before moving into the actual task training. The findings are also transferable and valuable to closely related areas such as emotional intelligence where instruments have been developed to be used in leadership training within police organizations following the principle of policing by consent such as in the United Kingdom (McDowall et al., 2019).

Limitations
Some of the limitations of this study stem from the small sample size of participants. It may be difficult to predict performance on different tasks. It would be valuable to conduct a similar study with a larger sample, including professionals from within the police organization, rather than just police students. Such a study might reveal different relations once police students gain experience in the field. More importantly, the current results have to be seen as exploratory and require replication to validate the observed relations. While the project team will be able to map cognitive and affective empathy to emotion recognition, intention detection, and working memory in future studies, obtaining information with fluid intelligence measures in similar contexts is more challenging. It may be possible to test these findings with a sample from the general population of university students but that is currently out of scope for the project. Furthermore, such a study would not provide robust information about the population of concern, that is the police.
Generalizability to other police educational organizations is limited by the fact of having collected only Swedish data; hence, Latent variable model representing tested relations between self-reported an task measures. pt. = perspective-taking; ar = affective response; cogtasks = cognitive tasks; eyetotal = Eyes test; Raven 18 = Ravens fluid test; in_2stories = IMT intentions; and me_2stories = IMT memory.
Frontiers in Psychology 08 frontiersin.org replication studies in different international contexts are warranted. This study was conducted among police students, but findings of the structure of the EAI have been used in an evaluation with in-service police officers (Hansson et al., 2021). The current measures treat each of these interactive aspects of empathy and cognition separately. Perhaps, a dynamic integrated assessment, along the lines of the objective structured clinical examinations used in medical education (Sloan et al., 1995), would improve our understanding of how these constructs inter-relate.

Conclusion
Our conclusions are that abilities beyond fluid intelligence are needed to understand emotional states and the intentions of others. Policing involves navigating complex and highly charged situations, where culturally specific knowledge of how individuals communicate is also required (Günthner, 2007)-this is a matter of increasing importance in the rapidly changing European demographic context (Ilie, 2019). In policing situations, officers need to remember relevant event information, express empathy, and identify possible intentions from fleeting facial expressions; clearly, this is a complex and dynamic process, and police training needs to prepare recruits for it as fully as possible, including in terms of empathy. Our findings which also are in line with previous studies are inherently valuable since there is higher expectancy from society that the police will act empathically and treat citizens professionally: while good performance in contact situations can have a weak positive effect on police trustworthiness, poor performance has a strong negative effect, essentially meaning that trustworthiness is hard to achieve but easy to lose (Oliveira et al., 2020). Thus, this study adds to our understanding of how we might improve the empathy of police students as they prepare to enter a very demanding profession.

Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, on reasonable requests.

Ethics statement
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. The patients/participants provided their written informed consent to participate in this study.

Author contributions
MI, TS, and CW contributed to conception and design of the study. MI organized the database and wrote the first draft of the manuscript. MI and GB performed statistical analysis. GB, TS, and CW wrote sentences of the manuscript. All authors contributed to the article and approved the submitted version.

Funding
This work is part of a series of studies investigating constructs of relevance in police training and education. It was funded by the Swedish Research Council via Grant .