The Self-esteem Stability Scale (SESS) for Cross-Sectional Direct Assessment of Self-esteem Stability

Altmann, Tobias; Roth, Marcus

doi:10.3389/fpsyg.2018.00091

ORIGINAL RESEARCH article

Front. Psychol., 13 February 2018

Sec. Personality and Social Psychology

Volume 9 - 2018 | https://doi.org/10.3389/fpsyg.2018.00091

The Self-esteem Stability Scale (SESS) for Cross-Sectional Direct Assessment of Self-esteem Stability

Tobias Altmann^*

Marcus Roth

Institute of Psychology, University of Duisburg-Essen, Essen, Germany

Self-esteem stability describes fluctuations in the level of self-esteem experienced by individuals over a brief period of time. In recent decades, self-esteem stability has repeatedly been shown to be an important variable affecting psychological functioning. However, measures of self-esteem stability are few and lacking in validity. In this paper, we present the Self-Esteem Stability Scale (SESS), a unidimensional and very brief scale to directly assess self-esteem stability. In four studies (total N = 826), we describe the development of the SESS and present evidence for its validity with respect to individual outcomes (life satisfaction, neuroticism, and vulnerable narcissism) and dyadic outcomes (relationship satisfaction in self- and partner ratings) through direct comparisons with existing measures. The new SESS proved to be a stronger predictor than the existing scales and had incremental validity over and above self-esteem level. The results also showed that all cross-sectional measures of self-esteem stability were only moderately associated with variability in self-esteem levels assessed longitudinally with multiple administrations of the Rosenberg Self-Esteem Scale. We discuss this validity issue, arguing that direct and indirect assessment approaches measure relevant, yet different aspects of self-esteem stability.

Introduction

The majority of self-esteem research has focused on the global level of self-esteem (i.e., “the individual's positive or negative attitudes toward the self as a totality”; (Rosenberg et al., 1995), p. 141). According to Rosenberg (1979), an individual with a high level of self-esteem can be characterized as follows: “He has self-respect, considers himself a person of worth. Appreciating his own merits, he nonetheless recognizes his faults […]. The term ‘low self-esteem’ […] means that the individual lacks respect for himself, considers himself unworthy, inadequate, or otherwise seriously deficient as a person” (p. 54; this description can be assumed to be true for all sexes). This statement shows that self-esteem is conceptualized more or less as an individual trait, with day-to-day fluctuations in feelings of self-worth dismissed as measurement error. However, a growing number of studies in recent decades have expanded the meaning of self-esteem by differentiating between the global level of self-esteem in general and self-esteem stability.

Self-esteem stability has been defined as the extent to which an individual experiencesshort-termfluctuations in self-esteem (e.g., Kernis, 2005). Even though substantial correlations are usually found between self-esteem stability and self-esteem level (see Okada, 2010), recent studies have consistently found an incremental validity of self-esteem stability over and above self-esteem level in predicting variables relevant for psychological adjustment or functioning. In general, research has shown that a higher degree of self-esteem stability is associated with better adjustment or functioning. This is true both for individual concepts such as neuroticism/emotional stability (Butler et al., 1994), depression (Kim and Cicchetti, 2009), and vulnerable narcissism (Campbell et al., 2002), as well as dyadic concepts such as emotional responsiveness (Rhodewalt et al., 1998), attachment (Foster et al., 2007), and dysfunctional coping strategies (e.g., alcohol abuse; Bentall et al., 2011). Self-esteem stability is thus related to an individual's general life satisfaction (Oosterwegel et al., 2001) and can be assumed to be associated with satisfaction in interactions, such as dyadic relationship satisfaction, as well.

Despite the relevance and growing interest in self-esteem stability, only a few inventories of the construct are currently available. This might be because previous research on cross-sectional self-esteem stability inventories has shown only small to medium correlations with longitudinal measures of variation in self-esteem (Chabrol et al., 2006). Hence, there is a need to develop new competing measures that can predict outcomes better than what is currently available.

In the present paper, we review the existing self-esteem stability inventories and present a new brief inventory developed on the basis of this review. We report data on construct and criterion validity and compare the new instrument to existing measures.

Measuring Self-esteem Stability

There are two general approaches to assessing self-esteem stability. The first is a cross-sectional direct assessment via a scale that is administered once; the second is an indirect assessment in which self-esteem level is measured multiple times, usually with Rosenberg's Self-Esteem Scale (RSES; Rosenberg, 1965), and the standard deviation of the means is calculated (Kernis et al., 1989; Kernis, 2005). The latter is considered to provide the most valid assessment and is hence the “gold standard” (Chabrol et al., 2006, p. 137) against which newly developed scales have to be measured.

Although the latter procedure assesses variability in a naturalistic context, it requires participants to invest considerably more time and effort because they have to fill out the RSES repeatedly and without prompting from the researchers, and then return the questionnaires. These issues might keep researchers from applying this procedure in their studies.

When cross-sectional measures are used, participants are asked to directly rate any fluctuations in self-esteem they tend to experience on a single measurement occasion. Such measures are therefore much more economical. Two of their major limitations, however, are memory bias and validity. First, there is the risk of memory biases since direct assessments require subjects to retrospect over their past experiences which is prone to memory distortion effects (e.g., Schacter, 1999). Second, direct self-esteem stability measures typically only havemedium-sized correlations with the aforementioned “gold standard.” In the following section, we briefly review the existing direct measures.

Current Measures of Direct Stability

To our knowledge, three cross-sectional measures assessing self-esteem stability with a direct approach are available.

The most recently published of these is the Instability of Self-Esteem Scale (ISES) by Chabrol et al. (2006). Participants are asked to indicate their degree of agreement with the following four items, all of which are very similar in structure and phrasing:

• Item 1: Sometimes I feel worthless; at other times I feel that I am worthwhile.

• Item 2: Sometimes I feel happy with myself; at other times I feel very unhappy with myself.

• Item 3: Sometimes I feel useless; at other times I feel very useful.

• Item 4: Sometimes I feel very bad about myself; at other times I feel very good about myself.

Of course, internal consistency can be expected to be very high due to this extreme degree of overlap.

The scale's developers reported that the correlation between their scale and the SD of repeated assessments using the RSES was 0.81. As this is almost identical to the internal consistency of the ISES, this result suggests that the two measures assess exactly the same construct with equivalent validity, despite their very different approaches. Given that, to our knowledge, other self-esteem stability studies have consistently reported much lower correlations between cross-sectional and longitudinal stability measures (Kernis et al., 1989, 1992; Marsh, 1993; Webster et al., 2017), this result is very surprising.

The second inventory is a derivative of the RSES presented by Kernis et al. (1992). Participants have to estimate “how much they thought they would change their (dis)agreement on a day-to-day basis with each of the items on Rosenberg's Self-Esteem Scale” (p. 628). Of course, this requires participants to have unrealistically high self-reflection abilities. Accordingly, the authors did not find that this measure was significantly correlated with the SD of repeated RSES measurements. Consequently, they evaluated their scale as insufficient, and we did not include it in the present research.

The third and oldest scale is the five-item Stability of Self Scale (RSS) by Rosenberg (1965). Existing research on this scale has shown only weak correlations with longitudinally measured short-term (Kernis et al., 1989, 1992) and long-term self-esteem stability (Marsh, 1993). Only Webster et al. (2017) were able to obtain a moderate mean correlation of 0.31 in a recent meta-analysis. Although they incorporated only eight articles as well as almost 50% unpublished data in their analysis, we agree with the authors that the RSS “deserves a second look” (p. 12) and thus included it in the present research.

In recent decades, scales on state self-esteem (e.g., the State Self-Esteem Scale by Heatherton and Polivy, 1991) have been developed and used to measure temporary fluctuations in self-esteem (Linton and Marriott, 1996). These scales show high similarity with the RSES. First, the item content of current state self-esteem measures is quite similar to the RSES (e.g., “I feel confident about my abilities” from the aforementioned inventory and “I am able to do things as well as most other people” from the RSES). Second, the procedure to assess stability, making the instructions time-bound to the present moment (e.g., “How true are these statements for you RIGHT NOW”) and then having respondents complete the measure repeatedly, is the same as that of Kernis and colleagues, who applied the more prominent RSES with similar changes to the instructions (as described above). Therefore, applying state self-esteem scales is likely to yield comparable results.

From this brief review, we concluded that self-esteem stability is a relevant variable for psychological functioning. However, it is not always possible for researchers to apply the most valid assessment method, presenting the RSES multiple times in order to calculate the standard deviation. Therefore, a reliable and valid direct measurement approach would make an important contribution to improving research on self-esteem stability. A few measures exist but are either insufficient or their psychometric qualities are dubious.

In the present research, we developed a new measure (Study 1) on the basis of the critical issues described above and validated it with relation to individual outcomes (Studies 2 and 3) and dyadic outcomes (Study 4) in comparison with previous measures. To maximize ecological validity, different types of assessments and samples were used: both paper-and-pencil and online assessments, with individuals and couples, and workers and students, who filled out the measure independently at home and in the laboratory. All assessments were anonymous. All analyses were conducted with German samples; therefore, the German versions of the measures were used (see Appendix A2). Data and material of the presented studies are openly accessible at osf.io/sy59r.

Study 1

The aim of Study 1 was to develop a new, brief scale to directly assess self-esteem stability while overcoming the shortcomings of the existing measures described above. We expected to find unidimensionality and at least satisfactory psychometric properties for the new scale, which we called the Self-Esteem Stability Scale (SESS).

Method

Procedure

We constructed 18 items in German with a balance between positive and negative phrasing. The language style was kept similar to the RSES to enable joint administration. The content of the items was chosen to mirror the major aspects of the RSES (e.g., having a positive attitude toward oneself or having a positive evaluation of one's own abilities compared with others) to ensure content validity. Six of those 18 initial items were chosen (see Supplementary Material) on the basis of a discussion by a group of four personality assessment experts and administered to the sample in an online questionnaire. A 6-point Likert scale (1 = “Does not apply to me” to 6 = “Does apply to me”) was used. Higher scores represent higher reported stability.

Sample

Participants were recruited via online postings in social media and via email forwarding. Digital leaflets with information about the study and including a call for volunteers were posted in Facebook groups for students of all fields and in groups for workers in a number of different fields such as healthcare and engineering. Email forwarding was used for people who showed interest in the study but were aware of the potential hypotheses (e.g., psychology students) and therefore had to be excluded. They were asked to email the information about the study to friends and relatives. The total sample was N = 215 (70.2% female), aged 18 to 67 (M = 32.9, SD = 9.0), of which 63% were workers, 32% students, and 5% unemployed or retired. Participants provided informed consent (concerning the purpose of the study, the scientific use of the data, and anonymity) as required by the University of Duisburg-Essen Psychological Institute's Ethics Committee, which approved of the study.

Results and Discussion

Dimensionality

Parallel analysis (PA; with 1,000 sets, 95th percentile; see Horn, 1965) of the six items resulted in a one-factor solution (KMO = 0.81; eigenvalues for original data and random PA data for the six possible factors were 2.67/1.23, 0.99/1.11, 0.67/1.03, 0.66/0.95, 0,54/0.87, and 0.44/0.78). Factor analysis was used to exclude items that explained variance of the common latent factor also explained by other items. In contrast to the 0.30 minimal loading often used for larger scales (Costello and Osborne, 2005), we decided to use the much stricter criterion of 0.60 for item inclusion to ensure sufficient internal consistency due to the small number of items on this very brief scale. We thus included three items in the scale (see descriptive statistics Table 1 and factor loadings Table S1), which explained 65.4% of the variance in the factor analysis.

TABLE 1

Table 1. Psychometric properties of the SESS.

Psychometric Properties

The means and standard deviations of the three SESS items as well as their response probabilities and the part-whole corrected item-total correlations are shown in Table 1. Cronbach's α was 0.73, which is satisfactory considering that Cronbach's α strongly depends on item number. The results indicated that the SESS has at least satisfactory psychometric properties.

Study 2

The aim of Study 2 was to evaluate the validity of the SESS, developed in Study 1, in comparison to the other aforementioned direct scales (i.e., ISES and RSS).

As argued above, the best evidence of the validity of a direct measure is the extent to which it is able to predict the indirect longitudinal measurement of the same construct. Furthermore, it has been argued that high self-esteem stability should be associated with higher life satisfaction scores as a general indicator of psychological functioning. Therefore, we analyzed the relations between the direct measures and the indirect self-esteem assessment (the “gold standard”) as well as life satisfaction as general indicators of validity.

We expected the new SESS to explain significantly more variance than the ISES or RSS for both the indirect measure of stability and life satisfaction.