BRIEF RESEARCH REPORT article

Front. Psychol., 27 February 2026

Sec. Quantitative Psychology and Measurement

Volume 17 - 2026 | https://doi.org/10.3389/fpsyg.2026.1595258

Construction and validation of the ultra-short version of the Parenting Scale (PS-4)

  • 1. Medical Psychology and Medical Sociology, University Medical Center of the Johannes Gutenberg University, Mainz, Germany

  • 2. Institute of Delinquency and Crime Prevention, Zurich University of Applied Sciences, Zurich, Switzerland

  • 3. Faculty of Law, University of Zurich, Zürich, Switzerland

  • 4. Criminological Research Institute of Lower Saxony, Hannover, Germany

  • 5. Clinic and Polyclinic for Psychosomatic Medicine and Psychotherapy, Rostock University Medical Center, Rostock, Germany

  • 6. Department of Psychosomatic Medicine and Psychotherapy, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany

  • 7. Ernst-Abbe-Hochschule Jena – University of Applied Sciences, Jena, Germany

Article metrics

View details

161

Views

41

Downloads

Abstract

Parenting behavior is a central determinant of childhood development and is thus deserving of more scientific attention. In the present article, we constructed an ultra-short scale for the assessment of parenting styles, the Parenting Scale 4 (PS-4). To this end, we analyzed large samples of parent–child dyads—one representative of the German general population (Sample 1), the other representative for the German federal state Lower Saxony (Sample 2). We applied an algorithm-based scale-shortening technique in Sample 1 and confirmed the resulting model in Sample 2, finding excellent model fit and—given the extreme brevity—acceptable reliability. Furthermore, we show that the model is invariant across parent and child genders. Correlations with the Strengths and Difficulties Questionnaire remain virtually unchanged compared to a longer version of the Parenting Scale, which is evidence for the PS-4’s validity. Overall, the PS-4 can be recommended for the assessment of parenting behavior, particularly in large-scale surveys with time constraints.

Introduction

Parenting is one of the most important determinants of childhood development (Belsky, 1984; Sulik et al., 2015; Tran et al., 2017). A myriad of crucial outcomes is shaped by the way parents treat and teach their children: health, life satisfaction, and success later in life among many others (Abubakar et al., 2015; Hamilton, 2016; McLeod and Shanahan, 1993; Milevsky et al., 2007; Raboteg-Saric and Sakic, 2014; Steinberg et al., 1989; Waters, 2015).

The Parenting Scale (PS; Arnold et al., 1993) is one of the most widely applied parenting behavior assessment tools - also in the evaluation of trainings and prevention programs (de Graaf et al., 2008; Salari et al., 2012). The original PS consists of 30 items and can be divided into two factors: Overreactivity and Laxness. A third factor (Verbosity) was discussed in the beginning but has been refuted by subsequent research (for a review see Salari et al., 2012). Various shortforms of the PS have been developed over the years (Lorber et al., 2014; Reitman et al., 2001; Rhoades and O’Leary, 2007). The shortest that we are aware of is the 8-item version by Kliem et al. (2019). While eight items is very brief, even shorter scales—such as the Patient Health Questionnaire 4 (Kroenke et al., 2009; Löwe et al., 2010)—have been developed and enjoy great popularity. They are particularly useful in survey research, where every item counts and minimizing time investment is of the utmost importance.

The present study thus served the purpose of shortening the PS-8 to the ultra-short screening version PS-4. To this end, we first examined the psychometric properties of the PS-8 items and constructed a subset of 2 items per scale. We then tested this subset in a second, independent sample using confirmatory factor analysis (CFA). In addition, we tested for measurement invariance between genders and tested correlations with the Strengths and Difficulties Questionnaire (SDQ; Goodman, 2001). Since the PS is supposed to capture problematic parenting tendencies, we expect positive correlations between the PS-4 subscales and the difficulties subscales of the SDQ and positive correlations with the strengths subscale.

Method

Procedure and participants

Sample 1 was collected as part of a representative survey of the German general population in 2013. The demographic consulting company (USUMA, Berlin, Germany) which was responsible for data collection, initially contacted a total of 4,360 individuals based on random selection. The selection procedure uses three steps: 258 geographic regions in Germany, households within a specific region, and individuals in a specific household. Of those, many refused to provide any information (n = 591), were not present upon visitation (n = 563), refused to be interviewed (n = 540), or were not included for other pre-defined reasons (n = 158). A trained interviewer visited all participants. After making sure that they possessed sufficient command of the German language, the interviewer informed them of the general purpose of the study and obtained informed consent. In the case of minors, consent of the parents was also obtained. Following a sociodemographic interview, the participants completed self-report questionnaires in the presence of (but without any interference from) the interviewer. The final study sample consisted of n = 2,508 individuals. Out of those, 539 reported currently raising a child. Eight of these observations were excluded because of missing values. Thus, our exploratory analysis sample consisted of 531 observations. No systematic differences between participants with complete and incomplete data were observed. Given the low and largely unsystematic proportion of missingness, no imputation procedures were applied. The study was authorized by the Ethics Committee of the Medical Faculty of the University of Leipzig (Az.: 050/13-03.05.2013).

Sample 2, was collected as part of a representative survey of fourth graders in the German federal state of Lower Saxony, carried out by the Criminological Research Institute of Lower Saxony (Kriminologisches Forschungsinstitut Niedersachsen – KFN) as part of an evaluation study of the “Klasse2000” universal prevention program (Kliem et al., 2020a,b) and was funded by the Federal Centre for Health Education (Bundeszentrale für gesundheitliche Aufklärung – BZgA). The survey involved a random sample of year four classrooms in Lower Saxony from all classrooms taught in the 2016/2017 school year. In total, K = 740 classes were selected for the survey. The final sample was reduced to k = 528 classrooms based on refusals to participate from school principals or teachers; this corresponds to a classroom participation rate of 71.4%. Of the N = 9,880 students attending the participating classes, n = 6,376 students participated in the survey, which corresponds to a student participation rate of 64.5%. All 6,376 participating children received a questionnaire for their parents by “satchel post.” From 3,324 of these parents a completed questionnaire was sent back to the KFN. This corresponds to a response rate of 52.1%. In addition, we removed observations with missing values. Thus, our final sample consists of n = 3,133 observations. Only children whose parents returned a signed consent form to the teacher were allowed to take part in the interview. The ethics committee of the Georg Elias Müller Institute for Psychology at the University of Göttingen (N0. 184/2018) has given a positive ethics statement for this procedure. Furthermore, the survey was approved by the regional school authorities (Landesschulbehörde Niedersachsen H1Rb-81402-09-2017). During the entire survey, the children were guided through the eleven-page questionnaire by the test leaders using a manual. Participation is entirely voluntary among principals, teachers, parents, and children, and nonparticipation does not lead to any negative consequences. Children whose parents have not provided written informed consent are not surveyed. Teachers administered the written informed consent forms, did not forward them to the researchers, and destroyed them 2 weeks after the survey was conducted. Since the study design provides for complete anonymization, no signature of the participating adults could be obtained for their own participation. The sociodemographic variables of both samples are displayed in Table 1.

Table 1

Sample 1 (n = 531)Sample 2 (n = 3,133)
n%n%
Parent gender
Male22041.434511.1
Female31158.62,77688.9
Parent age
M, SD39.568.3042.085.54
Child gender
Male27852.71,53149.0
Female25047.31,59251.0
Child age
M, SD8.925.232.690.58

Sociodemographic variables of both samples.

Instruments

The Parenting Scale 8 (PS-8, Kliem et al., 2019) consists of two subscales - overreactivity and laxness - measured with four items each. Respondents describe their typical reactions to certain behaviors a child can exhibit. All items are bi-polar, giving two potential extremes of the reaction spectrum. Respondents rate on a scale of 1 to 7 to what extent these responses are typical of them. Reliability coefficients for the present study are reported in Table 2.

Table 2

123456789
1. Overreactivity 40.622
2. Laxness 40.3290.546
3. Overreactivity 80.847/0.790+0.2710.645
4. Laxness 80.3870.895/0.854+0.3260.714
5. Emotional Symptoms (SDQ)0.1650.0930.1640.1090.696
6. Conduct Problems (SDQ)0.2830.1550.3120.1760.2950.554
7. Hyperactivity (SDQ)0.2090.1120.2440.1370.3290.4980.791
8. Peer Problems (SDQ)0.1470.1070.1140.1380.3830.3070.2910.676
9. Prosocial Behavior (SDQ)−0.203−0.104−0.233−0.138−0.098−0.400−0.277−0.1760.660

Correlation matrix of the Parenting Scales 4 and 8 as well as the strengths and difficulties questionnaire (Sample 2).

Internal consistency is displayed on the diagonal, +adjusted for overlap in accordance with Levy (1967).

Sample 2 also included the Strengths and Difficulties Questionnaire (SDQ; Goodman, 2001; Klasen et al., 2003). This questionnaire consists of 25 items that assess a parent’s view of their child’s particular strengths and weaknesses in terms of behavior, self-regulation, and social behavior. There are five subscales: Conduct Problems, Emotional Symptoms, Hyperactivity, Peer Problems, and Prosocial Behavior. Response options range from 1 (Not true) to 3 (Certainly true). Reliability coefficients are reported in Table 2.

Statistical analyses

We conducted all analyses in R, using the EFAutilities, ezCutoffs, lavaan, matrixStats, moments, multilevel, paran, semTools, and stuart packages (Bengtsson, 2020; Bliese, 2016; Dinno, 2018; Jorgensen et al., 2020; Komsta and Novomestky, 2015; Rosseel, 2012; Schmalbach et al., 2019; Schultze, 2020; Zhang et al., 2020). We excluded those respondents from the analyses who had not answered the PS-8 scale or had missing data on one or more of the items. We applied exploratory factor analysis and parallel analysis (PA; Horn, 1965) to Sample 1. The purpose of these procedures was to check how many components should reasonably be extracted from the eight indicators, and how the items load on them. We consider loadings of 0.50 and greater as sufficient. In addition, there should be no substantial cross-loadings (0.20). After this initial check we used stuart’s bruteforce algorithm to find the best 2 × 2 solution among all possible item subsets. “Best solution” here means best model fit and best composite reliability.

We then fitted the resulting model in Sample 2 using robust diagonally weighted squares estimation Li (2016) and delta parameterization. In, addition to the -test, we utilized the Comparative Fit Index (CFI), the Tucker-Lewis Index (TLI), the Root Mean Squared Error of Approximation (RMSEA), and the Standardized Root Mean Squared Residual (SRMR) to evaluate model fit. We judged these indices based on the traditionally recommended cutoff values: the p-value of the 2-test should be greater than 0.05 (0.01), CFI/TLI greater than 0.97 (0.95), RMSEA smaller than 0.05 (0.08), and SRMR smaller than 0.05 (0.10), respectively, to indicate good (acceptable) fit between theoretical model and empirical data (Schermelleh-Engel et al., 2003). To supplement these traditional fixed cutoff values, we employed the simulation-based approach introduced by Schmalbach et al. (2019). Here, the empirically derived fit values are compared to the fit index distributions obtained by simulating a large number of data sets with the same underlying parameter table and overall same properties as the empirical one. As generally recommended, we estimate reliability using McDonald’s (Dunn et al., 2014; Green and Yang, 2009, Formula 21). To test for measurement invariance between genders, we followed the guidelines provided by Wu and Estabrook (2016) and evaluated the comparisons using the recommended procedure (Chen, 2007). That is, we successively constrained item thresholds, factor loadings, item intercepts, and item residual variances to be equal between groups. For each of these steps 2 should be non-significant, CFI should be smaller than 0.01 and RMSEA should be smaller than 0.015.

Results

Item descriptive statistics and exploratory analyses (Sample 1)

We report item descriptive statistics in Table 3. All items were approximately normally distributed upon inspection of skewness and kurtosis (Kim, 2013). Parallel analysis showed clear evidence for two substantial components in the PS-8. The first three empirical eigenvalues were 2.62, 1.57, and 0.72 - compared to 1.24, 1.16, and 1.10 in the randomly generated data sets. Loadings from the subsequent exploratory factor analysis are also reported in Table 3. It should be noted that Items 5 and 8 had comparatively low loadings. In addition, Item 8 evinced a cross-loading. Next, we entered all items into the stuart to find the optimal two-factor solution among the possible 36 subsets. The best solution, which had excellent model fit, 2(1) = 0.667, p = 0.414, CFI = 1, TLI = 1.006, RMSEA = 0, SRMR = 0.006, is marked in Table 3.

Table 3

MSDSkewnessKurtosisλ F1λ F2
Item PS1O5.301.48−0.662.58−0.0360.688
Item PS2O, 45.791.18−1.053.900.0920.737
Item PS3L, 45.191.45−0.672.980.7630.055
Item PS4L5.131.51−0.783.060.653−0.043
Item PS5R, O4.331.75−0.162.00−0.1070.484
Item PS6L, 45.661.24−1.084.180.7690.030
Item PS7R, O, 45.821.43−1.344.310.0640.536
Item PS8R, L5.571.35−0.943.580.4660.240

Item descriptive statistics of the Parenting Scale 8 (Sample 1).

M, Mean; SD, Standard Deviation; λ, Standardized factor loadings on factors 1 and 2 in the exploratory factor analysis; R, Reverse-coded item; O, Overreactivity scale; L, Laxness scale; 4, Part of the PS-4.

Confirmatory analyses (Sample 2)

We then tested the model generated in the EFA sample in the independent CFA sample. As can be seen in Table 4, fit measures indicated very good fit for the two-factor model and the 2-test was not significant at p = 0.134. Even the simulated cutoffs, which are in this instance much stricter than the traditional cutoffs, were not violated. The two latent variables correlated strongly, r = 0.581, p < 0.001. Reliability was = 0.622 [0.601, 0.643] and = = 0.546 [0.522, 0.568], respectively. In general, these coefficients would be considered small, even unacceptable. However, it should be noted that ultra-short scales tend to yield smaller reliability coefficients. Based on the Spearman-Brown prophecy formula, we can estimate that two four-item scales of the same general characteristics would evince s of 0.767 and 0.706, respectively. In order to confirm the two-dimensionality of the PS-4 scale, we then also tested the uni-dimensional version of the model. However, this model evinced unacceptable fit by all accounts.

Table 4

chi2CFITLIRMSEASRMR
2FM empirical fit2.2471.0000.9970.0200.006
2FM simulated cutoffs3.9840.9980.9900.0310.009
1FM empirical fit242.5160.9140.7410.1960.060
1FM simulated cutoffs5.8300.9980.9940.0250.011

Model fit statistics of the Parenting Scale 4 (Sample 2).

Instances in which the empirical fit measure is worse than the simulated cutoff should be considered evidence of model misfit.

Next, we tested for measurement invariance between genders. This means that we considered both the child’s and the parent’s gender as well as the match/mismatch between the two. The results for these analyses are summarized in Table 1. As is evident from the non-significant increases in 2 - in addition to the virtually unchanged values for CFI and RMSEA - from step to step, the model can be considered to be invariant across groups of child gender, parent gender, and child–parent gender match/mismatch. Merely the RMSEA for the configural model of parent gender is substantially higher than in the more constrained models. This can probably be explained by the low number of free parameters, which is known to inflate RMSEA (Kenny et al., 2015). Overall, however, we find strong evidence for gender invariance. This is an important requisite for comparisons of both latent and observed means and variances between these populations (Gregorich, 2006).

Correlations with the SDQ

Overall, we observed the expected pattern of correlations between the PS-4/−8 and the SDQ scales (see Table 2). Specifically, there were small to moderate positive correlations between the two subscales of the PS-4/−8 and the four problem-oriented scales of the SDQ. In contrast, the SDQ prosocial behavior scale correlated negatively with both PS subscales. In order to gauge the difference between the 8-item PS and the ultra-short PS-4, we first examined the correlations between the two versions of the scales. As is evident with the very high correlations, the majority of the PS-8 s variance is still explained by the PS-4. In investigating the differences between correlations of the long and short versions of the PS and the SDQ subscales this similarity becomes even clearer: For Overreactivity, the average difference was z > −0.001, p = 0.999, while for Laxness the average difference was z = −0.012, p = 0.992. This means, one can use half the items and apply the PS-4 instead of the PS-8 with a negligible loss of information and validity.

Discussion

The aim of the research at hand was the construction of an ultra-short version of the PS. This was achieved by drawing on two large samples - one representative of the German general population, the other representative for the German federal state of Lower Saxony.

While a loss of reliability is close to inevitable while eliminating items, we observed exactly the decrease that was to be expected for a reduction from four to two items per scale. Model fit, on the other hand, improved greatly. All criteria evinced excellent goodness-of-fit. Furthermore, we found evidence for measurement invariance between genders: This means that the observed means can be compared both between different gendered children and parents. In addition, we examined whether the matching of same /different gendered children and parents could result in a lack of invariance, which was not the case. As is evident with the correlation matrix of the PS-4, PS-8, and SDQ, information loss was minimal in eliminating four items of the PS-8. Specifically, we observe the same pattern of correlations and correlational magnitudes for both the 4- and 8-item version: Positive correlations for the Prosocial Behavior scale and negative correlations for the other four scales, which deal with problematic tendencies.

Limitations

The present study is limited in that we have not applied the final four PS-4 items in isolation. It appears unlikely that the outcomes of the psychometric investigation should change, but future studies should strive to replicate our findings to be sure.

As in all survey studies, the data may have been falsified by the respondents due to processes of social desirability or deliberate deception, whereby such processes can be avoided by informing the participants in detail about the underlying data protection mechanisms (e.g., anonymous survey as well as in sample 2 no feedback to parents, school or teachers) should be kept to a minimum. Response rates were = 57.5%/57.6%, which is quite common in general population studies (e.g., Kliem et al., 2014, 2015, 2018, 2021, 2024, 2025; Schmalbach et al., 2019) as well as in school surveys (Beckmann et al., 2021; Kliem et al., 2020a,b; Rehbein et al., 2015). However, it cannot be excluded that participants/students with a high level of stress often did not participate in the surveys. A further limitation concerns potential partner and co-parent influences in families with separated parents. Although the study focuses on the primary caregiver, child outcomes are likely shaped by complex interactions with non-resident parents and new partners (e.g., co-parenting quality, conflict, parenting practices in the other household). These influences could not be reliably measured or disentangled in the present design and may have introduced unobserved heterogeneity. Future studies should incorporate multi-informant and multi-household assessments to better capture interparental dynamics and their impact on child outcomes. Furthermore, Germany, as well as most other countries in northern Europe, has abolished any form of parental violence (physical as well as emotional, i.e., corporal punishment) as an accepted form of parental educational method. As of the year 2000 corporal punishment is considered abusive and hence a criminal offence under German law (§ 1,631 Abs. 2 Bürgerliches Gesetzbuch [German Civil Code]) which is not the case in several US states. It can therefore be assumed that German parents are more prone to giving socially desirable answers than their US counterparts. Parenting behaviors and disciplinary practices are embedded within cultural, legal, and normative contexts. The PS-4 was developed and validated using samples from Western, high-income countries, where parenting norms and legal regulations surrounding child-rearing are relatively similar. Its generalizability to cultural contexts with different normative parenting practices, legal frameworks, or social expectations cannot therefore be assumed. Future research should explicitly test cross-cultural measurement invariance and cultural validity before applying the PS-4 in non-Western or legally divergent contexts.

Lastly, the ultra-short format of the PS-4 entails a trade-off between measurement efficiency and content coverage. While the scale captures core dimensions of dysfunctional parenting, it cannot fully reflect the behavioral breadth and nuance assessed by longer versions of the Parenting Scale. Accordingly, the PS-4 is not intended to replace longer instruments such as the PS-8, nor is it suitable for individual-level assessment or clinical decision-making. Rather, it should be understood as an ultra-short screening instrument designed for contexts in which assessment time and respondent burden must be minimized, such as large-scale surveys and population-based research. Although the PS-4 demonstrates good model fit and evidence of construct validity given its minimal item count, longer versions remain preferable when more detailed assessment is required. Due to its limited reliability, associations involving the PS-4 may be attenuated and should be interpreted with appropriate caution.

Conclusion

In sum, the PS-4 can be recommended as an ultra-short, yet valid and (given its extreme brevity) reliable assessment tool of parenting behavior. It is particularly useful in large-scale surveys where time is of the essence.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Committee of the Medical Faculty of the University of Leipzig Ethics committee of the Georg Elias Müller Institute for Psychology at the University of Göttingen. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

BS: Writing – original draft, Formal analysis, Methodology, Data curation. DB: Writing – review & editing. YK: Writing – review & editing. EB: Writing – review & editing, Project administration. SK: Writing – review & editing, Conceptualization, Project administration.

Funding

The author(s) declared that financial support was received for this work and/or its publication. Open access funding by Zurich University of Applied Sciences (ZHAW).

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author SK declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1

    AbubakarA.Van de VijverF. J. R.SuryaniA. O.HandayaniP.PandiaW. S. (2015). Perceptions of parenting styles and their associations with mental health and life satisfaction among urban Indonesian adolescents. J. Child Fam. Stud.24, 26802692. doi: 10.1007/s10826-014-0070-x

  • 2

    ArnoldD. S.O’LearyS. G.WolffL. S.AckerM. M. (1993). The parenting scale: a measure of dysfunctional parenting in discipline situations. Psychol. Assess.5, 137144. doi: 10.1037/1040-3590.5.2.137

  • 3

    BeckmannL.BergmannM. C.KriegY.KliemS. (2021). Associations between classroom normative climate and the perpetration of teen dating violence among secondary school students. J. Interpers. Violence36:NP11291–NP11321. doi: 10.1177/0886260519888207,

  • 4

    BelskyJ. (1984). The determinants of parenting: a process model. Child Dev.55, 8396. doi: 10.1111/j.1467-8624.1984.tb00275.x

  • 5

    BengtssonH. (2020). matrixStats: functions that apply to rows and columns of matrices (and to vectors). R package version 0.57.0. Available online at: https://CRAN.R-project.org/package=matrixStats (Accessed February 16, 2026).

  • 6

    BlieseP. (2016). multilevel: multilevel functions. R package version 2.6. Available online at: https://CRAN.R-project.org/package=multilevel (Accessed February 16, 2026).

  • 7

    ChenF. F. (2007). Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct. Equ. Model. Multidiscip. J.14, 464504. doi: 10.1080/10705510701301834

  • 8

    de GraafI.SpeetjensP.SmitF.de WolffM.TavecchioL. (2008). Effectiveness of the triple p positive parenting program on behavioral problems in children: a meta-analysis. Behav. Modif.32, 714735. doi: 10.1177/0145445508317134,

  • 9

    DinnoA. (2018). Paran: horn’s test of principal components/factors (R package version 1.4.2). Available online at: https://CRAN.R-project.org/package=paran (Accessed February 16, 2026).

  • 10

    DunnT. J.BaguleyT.BrunsdenV. (2014). From alpha to omega: a practical solution to the pervasive problem of internal consistency estimation. Br. J. Psychol.105, 399412. doi: 10.1111/bjop.12046

  • 11

    GoodmanR. (2001). Psychometric properties of the strengths and difficulties questionnaire. J. Am. Acad. Child Adolesc. Psychiatry40, 13371345. doi: 10.1097/00004583-200111000-00015

  • 12

    GreenS. B.YangY. (2009). Reliability of summed item scores using structural equation modeling: an alternative to coefficient alpha. Psychometrika74, 155167. doi: 10.1007/s11336-008-9099-3

  • 13

    GregorichS. E. (2006). Do self-report instruments allow meaningful comparisons across diverse population groups? Testing measurement invariance using the confirmatory factor analysis framework. Med. Care44, S78S94. doi: 10.1097/01.mlr.0000245454.12228.8f,

  • 14

    HamiltonL. T. (2016). Parenting to a degree: how family matters for college women’s success. Chicago: University of Chicago Press.

  • 15

    HornJ. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30, 179185. doi: 10.1007/BF02289447

  • 16

    JorgensenT. D.PornprasertmanitS.SchoemannA. M.RosseelY. (2020). semTools: useful tools for structural equation modeling. R package version 0.5-3. Available online at: https://CRAN.R-project.org/package=semTools (Accessed February 16, 2026).

  • 17

    KennyD. A.KaniskanB.McCoachD. B. (2015). The performance of RMSEA in models with small degrees of freedom. Sociol. Methods Res.44, 486507. doi: 10.1177/0049124114543236

  • 18

    KimH. Y. (2013). Statistical notes for clinical researchers: assessing normal distribution (2) using skewness and kurtosis. Restor. Dent. Endod.38, 5254. doi: 10.5395/rde.2013.38.1.52,

  • 19

    KlasenH.WoernerW.RothenbergerA.GoodmanR. (2003). Die deutsche Fassung des Strengths and Difficulties Questionnaire (SDQ-Deu)--Ubersicht und Bewertung erster Validierungs- und Normierungsbefunde [German version of the Strength and Difficulties Questionnaire (SDQ-German)--overview and evaluation of initial validation and normative results]. Prax. Kinderpsychol. Kinderpsychiatr.52, 491502. doi: 10.23668/psycharchives.11726

  • 20

    KliemS.KriegY.BellerJ.BrählerE.BaierD. (2021). Psychometric properties of the somatic symptom scale 8 (SSS-8) in a representative sample of German adolescents. J. Psychosom. Res.149:110593. doi: 10.1016/j.jpsychores.2021.110593

  • 21

    KliemS.KriegY.LohmannA.MößleT. (2020a). Evaluation of the universal prevention program Klasse2000 in fourth grade primary school children: protocol for a propensity score-matching approach. JMIR Res. Protoc.9:e14371. doi: 10.2196/14371,

  • 22

    KliemS.LohmannA.MößleT.BrählerE. (2018). Psychometric properties and measurement invariance of the Beck hopelessness scale (BHS): results from a German representative population sample. BMC Psychiatry18:110. doi: 10.1186/s12888-018-1646-6,

  • 23

    KliemS.LohmannA.MößleT.ForanH. M.HahlwegK.ZengerM.et al. (2019). Development and validation of a parenting scale short form (PS-8) in a representative population sample. J. Child Fam. Stud.28, 3041. doi: 10.1007/s10826-018-1257-3

  • 24

    KliemS.LohmannA.NeumannM.GlaubitzC.HaselbachS.BergmannM. C.et al. (2020b). Factor structure of the inventory of callous-unemotional traits in a representative sample of German 9th grade students. J. Abnorm. Child Psychol.48, 4355. doi: 10.1007/s10802-019-00590-x,

  • 25

    KliemS.MößleT.RehbeinF.HellmannD. F.ZengerM.BrählerE. (2015). A brief form of the perceived social support questionnaire (F-SozU) was developed, validated, and standardized. J. Clin. Epidemiol.68, 551562. doi: 10.1016/j.jclinepi.2014.11.003,

  • 26

    KliemS.MößleT.ZengerM.BrählerE. (2014). Reliability and validity of the Beck depression inventory-fast screen for medical patients in the general German population. J. Affect. Disord.156, 236239. doi: 10.1016/j.jad.2013.11.024,

  • 27

    KliemS.SachserC.LohmannA.BaierD.BrählerE.FegertJ. M.et al. (2025). Psychometric evaluation and community norms of the GAD-7, based on a representative German sample. Front. Psychol.16:1526181. doi: 10.3389/fpsyg.2025.1526181,

  • 28

    KliemS.SachserC.LohmannA.BaierD.BrählerE.GündelH.et al. (2024). Psychometric evaluation and community norms of the PHQ-9, based on a representative German sample. Front. Psych.15:1483782. doi: 10.3389/fpsyt.2024.1483782,

  • 29

    KomstaL.NovomestkyF. (2015). Moments: moments, cumulants, skewness, kurtosis and related tests (R package version 0.14). Available online at: https://CRAN.R-project.org/package=moments (Accessed February 16, 2026).

  • 30

    KroenkeK.SpitzerR. L.WilliamsJ. B.LöweB. (2009). An ultra-brief screening scale for anxiety and depression: the PHQ-4. Psychosomatics50, 613621. doi: 10.1176/appi.psy.50.6.613,

  • 31

    LevyP. (1967). The correction for spurious correlation in the evaluation of short-form tests. J. Clin. Psychol.23, 8486. doi: 10.1002/1097-4679(196701)23:1<84::AID-JCLP2270230123>3.0.CO;2-2

  • 32

    LiC. H. (2016). The performance of ML, DWLS, and ULS estimation with robust corrections in structural equation models with ordinal variables. Psychol. Methods21, 369387. doi: 10.1037/met0000093,

  • 33

    LorberM. F.XuS.SlepA. M. S.BullingL.O’LearyS. G. (2014). A new look at the psychometrics of the parenting scale through the lens of item response theory. J. Clin. Child Adolesc. Psychol.43, 613626. doi: 10.1080/15374416.2014.900717,

  • 34

    LöweB.WahlI.RoseM.SpitzerC.GlaesmerH.WingenfeldK.et al. (2010). A 4-item measure of depression and anxiety: validation and standardization of the patient health Questionnaire-4 (PHQ-4) in the general population. J. Affect. Disord.122, 8695. doi: 10.1016/j.jad.2009.06.019,

  • 35

    McLeodJ. D.ShanahanM. J. (1993). Poverty, parenting, and children's mental health. Am. Sociol. Rev.58, 351366. doi: 10.2307/2095905

  • 36

    MilevskyA.SchlechterM.NetterS.KeehnD. (2007). Maternal and paternal parenting styles in adolescents: associations with self-esteem, depression, and life satisfaction. J. Child Fam. Stud.16, 3947. doi: 10.1007/s10826-006-9066-9065

  • 37

    Raboteg-SaricZ.SakicM. (2014). Relations of parenting styles and friendship quality to self-esteem, life satisfaction and happiness in adolescents. Appl. Res. Qual. Life9, 749765. doi: 10.1007/s11482-013-9268-0

  • 38

    RehbeinF.KliemS.BaierD.MößleT.PetryN. M. (2015). Prevalence of internet gaming disorder in German adolescents: diagnostic contribution of the nine DSM-5 criteria in a state-wide representative sample. Addiction110, 842851. doi: 10.1111/add.12849,

  • 39

    ReitmanD.CurrierR. O.HuppS. D.RhodeP. C.MurphyM. A.O’CallaghanP. M. (2001). Psychometric characteristics of the parenting scale in a head start population. J. Clin. Child Psychol.30, 514524. doi: 10.1207/S15374424JCCP3004_08

  • 40

    RhoadesK. A.O’LearyS. G. (2007). Factor structure and validity of the parenting scale. J. Clin. Child Adolesc. Psychol.36, 137146. doi: 10.1080/15374410701274157

  • 41

    RosseelY. (2012). Lavaan: an R package for structural equation modeling. J. Stat. Softw., 48, 136. Available online at: http://www.jstatsoft.org/v48/i02/ (Accessed February 16, 2026).

  • 42

    SalariR.TerrerosC.SarkadiA. (2012). Parenting scale: which version should we use?J. Psychopathol. Behav. Assess.34, 268281. doi: 10.1007/s10862-012-9281-x

  • 43

    Schermelleh-EngelK.MoosbruggerH.MüllerH. (2003). Evaluating the fit of structural equation models: tests of significance and descriptive goodness-of-fit measures. Methods Psychol. Res.8, 2374. doi: 10.23668/psycharchives.12784

  • 44

    SchmalbachB.IrmerJ. P.SchultzeM. (2019). ezCutoffs: fit measure cutoffs in SEM. R package version 1.0.2. Available online at: https://CRAN.R-project.org/package=ezCutoffs (Accessed February 16, 2026).

  • 45

    SchultzeM. (2020). Stuart: subtests using algorithmic rummaging techniques. R package version 0.9.1. Available online at: https://CRAN.R-project.org/package=stuart (Accessed February 16, 2026).

  • 46

    SteinbergL.ElmenJ. D.MountsN. S. (1989). Authoritative parenting, psychosocial maturity, and academic success among adolescents. Child Dev.60, 14241436. doi: 10.1111/j.1467-8624.1989.tb04014.x,

  • 47

    SulikM. J.BlairC.Mills-KoonceR.BerryD.GreenbergM.Family Life Project Investigators (2015). Early parenting and the development of externalizing behavior problems: longitudinal mediation through children's executive function. Child Dev.86, 15881603. doi: 10.1111/cdev.12386,

  • 48

    TranT. D.LuchtersS.FisherJ. (2017). Early childhood development: impact of national human development, family poverty, parenting practices and access to early childhood education. Child Care Health Dev.43, 415426. doi: 10.1111/cch.12395,

  • 49

    WatersL. (2015). Strength-based parenting and life satisfaction in teenagers. Adv. Soc. Sci. Res. J.2, 158173. doi: 10.14738/assrj.211.1551

  • 50

    WuH.EstabrookR. (2016). Identification of confirmatory factor analysis models of different levels of invariance for ordered categorical outcomes. Psychometrika81, 10141045. doi: 10.1007/s11336-016-9506-0,

  • 51

    ZhangG.JiangG.HattoriM.TrichtingerL. (2020). EFAutilities: utility functions for exploratory factor analysis. R package version 2.1.1. Available online at: https://CRAN.R-project.org/package=EFAutilities (Accessed February 16, 2026).

Summary

Keywords

assessment, child development, mental health, parenting behavior, scale development

Citation

Schmalbach B, Baier D, Krieg Y, Brähler E and Kliem S (2026) Construction and validation of the ultra-short version of the Parenting Scale (PS-4). Front. Psychol. 17:1595258. doi: 10.3389/fpsyg.2026.1595258

Received

17 March 2025

Revised

03 February 2026

Accepted

04 February 2026

Published

27 February 2026

Volume

17 - 2026

Edited by

Cesar Merino-Soto, Monterrey Institute of Technology and Higher Education (ITESM), Mexico

Reviewed by

Nicola Lovecchio, University of Bergamo, Italy

Aikaterini Vasiou, University of Crete, Greece

Updates

Copyright

*Correspondence: Dirk Baier,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics