Leveraging Random Forests explainability for predictive modeling of children's conduct problems: insights from individual and family factors

Romero, Estrella; González-González, Jaime; Álvarez-Voces, María; Costa-Montenegro, Enrique; Díaz-Vázquez, Beatriz; Busto-Castiñeira, Andrea; Villar, Paula; López-Romero, Laura

doi:10.3389/fpubh.2025.1526413

ORIGINAL RESEARCH article

Front. Public Health, 12 June 2025

Sec. Children and Health

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1526413

This article is part of the Research TopicAdvances in Artificial Intelligence Applications that Support Psychosocial HealthView all 7 articles

Leveraging Random Forests explainability for predictive modeling of children's conduct problems: insights from individual and family factors

Estrella Romero¹

Jaime González-González²^*

María Álvarez-Voces¹

Enrique Costa-Montenegro²

Beatriz Díaz-Vázquez¹

Andrea Busto-Castiñeira²

Paula Villar¹

Laura López-Romero¹

¹Department of Clinical Psychology and Psychobiology, Institute of Psychology (IPsiUS), University of Santiago de Compostela, Campus Vida, Santiago de Compostela, Spain
²atlanTTic, Information Technologies Group, Universidade de Vigo, Vigo, Spain

Conduct problems are among the most complex, impairing, and prevalent challenges affecting the mental health of children and adolescents. Due to their multifaceted nature, it is important to develop predictive models that capture the intricate interactions among contributing factors. This longitudinal study aims to: (1) evaluate the utility and effectiveness of Random Forest models for classifying children with varying levels of conduct problems, (2) analyze the interactions between individual and family variables in predicting high levels of conduct problems, and (3) determine the most relevant factors or combinations for accurate child classification. The sample was drawn from the ELISA study, and consisted of 1,352 children assessed twice within a 1-year frame. The use of Random Forest and its inherent structure allowed to identify subsets of variables with the capability of predicting Conduct Problems in children. This research demonstrates the effectiveness of integrating psychological insights with advanced computational techniques to address critical concerns in children's mental health, emphasizing the need for enhanced screening and tailored interventions.

1 Introduction

Child conduct problems (CP) comprise different forms of behavioral maladjustment, including aggressive, oppositional, defiant, deceitful, and rule-breaking behavior that violates the rights of others and conflicts with societal norms (1, 2). Both at clinical and subclinical levels, CP represents one of the most relevant, impairing, and prevalent problems across childhood and adolescence, being the primary reason for referring children to psychoeducational and mental health services (3, 4).

Epidemiologic studies have shown prevalence rates of CP in community samples ranging from 2% to more than 10% (2), around 8%–10% engage in the more severe childhood-onset form of CP, and another 25% initiate clinically significant levels of CP during adolescence (5). The severity of CP is endorsed by the wide range of deleterious outcomes linked to early manifestations of CP, which include a host of social, emotional, and academic problems such as anxiety and depression, substance use and abuse, academic failure and underachievement, impaired family well-being, problems in peers and intimate relationships, and legal system involvement (5–7).

Of note, a recent study conferred unique importance to CP as the leading cause of burden among all mental disorders identified up to age 14 (8). However, CP consequences are not only limited to the individual or family levels, with long-term societal [e.g., higher levels of service usage (9)] and economic burden (10) that place CP as an issue of public concern.

1.1 Understanding child CP: disentangling involved factors

Considering the wide range of personal and social implications related to CP, many efforts have been made to better understand the causes behind this phenomenon and to identify all the factors involved in its development and stability over time. Research conducted in this field has resulted in a long list of dispositional and contextual indicators that may impact the development of CP.

At the child (individual) level, developmental models have listed, based on previous research, some temperamental, personality, and socio-emotional variables with proven influence in the emergence of CP. Among the temperamental variables, widely analyzed in the context of child CP [e.g., (11)], two facets of temperament have demonstrated strong prediction for later maladjustment (12). When present at adaptive levels, emotion regulation, which refers to the ability to regulate emotional experience and expression in favor of adaptive behavior (13), and effortful control, defined as the self-regulatory aspect of temperament that involves attentional and inhibitory control mechanisms (14, 15), have evidenced their role as protective factors that may buffer the development of child CP [e.g., (16)]. Conversely, emotional and behavioral dysregulation would stem from emotional lability, negative affect, and poor effortful control, linked with an increased risk for later CP (12, 17).

Beyond the temperamental influence on child CP, personality traits also play an essential role in the early manifestation and later development of CP. In recent years, there has been an increasing interest in examining traits encompassing interpersonal [e.g., grandiose-deceitful (GD)], affective [e.g., callous-unemotional (CU)], and behavioral features [e.g., impulsive-need for stimulation (INS)] that resemble adult psychopathic personality (18).

Psychopathic traits in childhood have proven their usefulness and predictive value in the prediction of more severe and persistent pathways of CP (19, 20), even when controlling for other relevant risk factors [e.g., (21)]. This influence is particularly noteworthy when all psychopathic dimensions are present, reinforcing the importance of interpersonal (GD) and behavioral (INS) traits beyond the well-known influence of CU traits on child CP (22, 23).

Related to the aforementioned temperamental and personality variables, socioemotional competence has shown a particular ability to promote or restrain child CP based on their adaptive or maladaptive functioning. Deficits in emotion regulation linked to CU traits may drive deficits in the ability to understand and resonate with other's emotions. Empathy has been defined as a multi-component process involving at least affective (i.e., automatic emotional reaction to someone else's emotions) and cognitive domains (i.e., ability to recognize, understand, and share other people's emotions and perspectives), both of them related with multiple forms of CP in childhood and adolescence (24, 25).

Beyond the individual factors, the complexity underlying the CP should be interpreted in the context of the dynamic interplay between the individual and environmental domains. In this regard, individual vulnerabilities unfold in relational contexts, where the child interacts with peers, adults, and systems around them (26).

At early developmental stages, the family constitutes the most proximal context for child development, building social and emotional skills that will support later adjustment (27). Therefore, it is unsurprising that a large set of family variables, with the ability to promote or restrain child adjustment, were repeatedly placed in developmental models of CP (28–30).

From the family context, variables focused on the parent-child relationship have received much attention. The influence of parenting practices, including both positive (e.g., parental responsiveness and warmth) and ineffective parenting (e.g., permissiveness, harsh parenting), in the development of child CP has been extensively recognized in previous research (31). Ineffective and negative parenting, which a difficult child temperament might fuel, is expected to lead to tensions between parents and children, increasing conflictive interactions in the dyad that, in turn, may place the child at risk for CP [e.g., (32)].

Instability in parent-child relationships could also increase the risk for parenting stress, also triggered by an overwhelmed feeling from the multiple challenges and demands required in parenting, which were already related to the emergence and maintenance of CP (33, 34). Additional sources of parental stress (e.g., financial problems, job demands, mental health issues) have also been recognized as potential risk factors for child CP [e.g., (35, 36)]. In this regard, different forms of non-severe psychopathology, including perceived stress, anxiety, and depressive symptoms, have also shown their influence on the development of CP, being considered one of the most relevant early predictors of persistence in the most severe forms of later problematic behavior (37).

1.2 Integrating individual and family factors in the prediction of CP: a machine learning approach

Understanding child CP in-depth implies considering its complexity, largely represented by the large set of individual and environmental factors spanned in previous research. As occurs with other mental health problems, most of these factors repeatedly emerged in different studies investigating a reduced number of factors or domains at a time, resulting in a vast array of predictors of CP collected in previous literature but with limited information about their relative importance in prediction (38). Also, predictive models of CP have benefited when a cumulative risk perspective was assumed instead of focusing on one or two risk factors [e.g., (39)], suggesting that more complex and varied approaches, allowing to handle a large number of predictors, are needed to improve classification and prediction of child CP.

However, dealing with the multi-factor nature of CP through traditional data-driven approaches (e.g., regression models) has come with several limitations related to mass univariate testing, non-linearity, interaction effects, or overfitting, among others (40, 41), raising the need for new advanced and more sophisticated methodological procedures.

1.3 Why machine learning in longitudinal data

We face several challenges in trying to predict some health topics based on multiple features. As previously mentioned, these aspects of prediction have been traditionally addressed by methods such as regression. In this sense, Machine Learning (ML), still in its infancy in longitudinal data, can provide some valuable advantages (42).

Firstly, incorporating numerous repeated measurements of exposures presents significant challenges because traditional regression models are typically not designed to handle many covariates (43). Secondly, these regression models often assume a linear relationship between each exposure and the outcome, with few or no specified interactions between exposures. However, these assumptions are often unverifiable; if they are incorrect, they can lead to erroneous conclusions. Despite this, these assumptions are frequently overlooked or violated, which can bias the study results (44, 45).

In this sense, ML provides a solution to these limitations of traditional statistical methods because ML can process vast amounts of data with numerous exposures, automatically developing models that accurately predict outcomes and identify the most critical predictive exposures (46, 47). Importantly, ML techniques generally do not assume a specific functional form for the model; instead, they attempt to derive the model directly from the data to maximize prediction accuracy (48).

1.4 Related works

ML approaches allow for classification and prediction in a multivariate context, projecting data into a multi-dimensional nature space where a classifier can create a decision boundary that optimally separates individuals of different classes within this space, whilst all the factors and their interactions are simultaneously considered (49).

Random Forest (RF), a ML algorithm based on Decision Trees (DT), works under these premises, allowing classification and prediction by handling a large number of predictors that can be distilled based on their relative importance. RF models generate multiple DTs to produce aggregated predictions, detecting interactions and non-linear patterns and reducing overfitting, which overcomes most of the limitations observed in traditional regression models (41).

RF has been increasingly used in the prediction of different mental health problems [e.g., (38, 50)], although its use for predicting and classifying child CP is, to date, limited. Alternative ML classifiers have been used to classify CP children based on parenting practices (51) and emotion recognition abilities (49). At the same time, only one study has included multiple risk factors from multiple domains (i.e., biological, psychological, and social) in predicting child CP over a 2-year period (52). Remarkably, none of the previous studies have addressed the classification of individuals according to levels of CP.

The literature on the application of ML in psychology and healthcare often addresses the idea of explainability (53, 54). According to DARPA's Explainable Artificial Intelligence (XAI) program, explainability refers to an artificial intelligence (AI) system's ability to articulate its reasoning comprehensibly to human users. In recent years, XAI gained relevance within the domains of psychology and psychiatric diagnosis, where decision-making should always be supervised by specialists (55).

In contrast to opaque ML systems, the transparency XAI offers enables health experts to verify AI-assisted classifications, ensuring they are fair, unbiased, and ethically sound. This mitigates misclassification by revealing the most influential factors contributing to a potential diagnosis. Notable applications of explainability and ML include early detection of ADHD (56), schizophrenia diagnosis (57), prediction of anxiety and depression (58, 59), and Alzheimer's disease classification (60). However, to our knowledge, this is the first study addressing the detection of CP using ML with explainability techniques.

1.5 The current study

Given the multifaceted nature of CP in children, it is essential to explore predictive models that can account for the intricate interplay of its contributing variables. This study leverages a data-driven approach and, based on longitudinal data collected with a 1-year interval, conducts a ML approach for classifying children according to their levels of CP, considering both individual and family predictors.

This study is based on RF, a supervised ML method that combines the results of many different DTs. The longitudinal perspective enables the use of supervised methods, making possible the prediction of future CP using current information about individual and family factors. This also helps improve prediction accuracy by identifying interactions and patterns that traditional statistical models, like linear relationships, cannot capture.

Specifically, this study aims to focus on the following objectives: (1) To examine and describe the utility and performance of RF models for classifying children with different levels of CP based on individual and family factors; (2) To analyze the interplay of individual and family variables for predicting high CP; (3) To determine which factors, or combination of them, are more relevant for child classification, especially to classify children high on CP; (4) To compare decision-making models based on the presence or absence of a prior measure of CP, analyzing the predictive power of other variables when no prior CP are present.

2 Materials and methods

2.1 Participants

The sample consisted of 1,352 children (51.2% girls) from the longitudinal ELISA project, which is carried out in Galicia (northwestern Spain). For this study, two waves of the project were utilized: T1 (2022; M_age = 9.20, SD = 1.04, age range = 7–11 years) and T2 (2023; M_age = 10.24, SD = 1.05, age range = 8–12 years). Most of the children were Spanish (~95%).

Regarding the parents' level of education, 11.8% of the mothers and 25.4% of the fathers had attained the highest level of compulsory school education. Additionally, 8.4 and 10% of the mothers and fathers, respectively, had completed a higher level of non-compulsory education. A total of 26.3 and 29.9% had completed vocational training studies, 44.4 and 27.7% had completed university studies, and 8.9 and 6% had completed postgraduate studies. In T1 (2022), 87.1% of mothers and 92.7% of fathers worked outside the home, 5.3 and 2.7% were unemployed or on temporary layoff, 1 and 3.7% were retired/disabled, or unable to work, 0.9 and 0.4% were students, and 5.7 and 0.7% were responsible for domestic duties. In terms of financial well-being, 56.3% of families reported being financially comfortable, 37.4% reported barely getting by, and 6.4% reported having difficulty or serious problems making ends meet. Regarding concern over financial obligations, 39.3% of families reported never worrying, 25.8% worried less than once a month, 31.4% worried at least monthly, and 3.5% worried almost every day.

2.2 Measures

2.2.1 Children's predictors (T1)

2.2.1.1 Children's temperament variables

Attention focusing and inhibitory control were measured with the Temperament in Middle Childhood Scale (TMCQ) subscales of the same name (61). The attention-focusing subscale comprised seven items (e.g., “Has a hard time paying attention”; α = 0.92), and inhibitory control comprised eight items (e.g., “Can stop him/herself from doing things too quickly”; α = 0.73). The response scale of the questionnaire is a 5-point Likert-type scale, from 1 (totally false) to 5 (totally true).

2.2.1.2 Children's psychopathic traits

Children's psychopathic traits were examined using the Child Problematic Traits Inventory (CPTI) (18). This instrument consists of 28 items with a 4-point Likert response scale ranging from 1 (does not apply very well) to 4 (applies very well). Eight items are used to measure GD (e.g., “Lies often to avoid problems”; α = 0.84), 10 to measure CU (e.g., “Does not become upset when others are being hurt”; α = 0.88) and 10 to measure INS (e.g., “Often has difficulties with awaiting his or her turn”; α = 0.86).

2.2.1.3 Socioemotional competence

Emotional regulation and lability/negativity were measured through the Emotion Regulation Checklist (ERC) (62). This scale consists of 24 items with a 4-point Likert-type response scale ranging from 1 (never) to 4 (almost always). The emotional regulation subscale consists of eight items (e.g., “Responds positively to neutral or friendly overtures by adults”; α = 0.70) and the lability/negativity subscale of 16 items [e.g., “Is impulsive (cannot control him/herself)”; α = 0.83].

Children's cognitive and affective empathy were measured by items based on Griffith's scale (63). Both variables were measured by three items: cognitive empathy (e.g., “Does not seem to understand why people get upset”; α = 0.83) and affective empathy (e.g., “Gets sad when he sees movies or something sad on TV”; α = 0.75). The response scale was a 4-point Likert-type scale ranging from 0 (strongly disagree) to 3 (strongly agree).

2.2.1.4 Children's CP

The Conduct Problems Scale (18) was used to measure CP in children. This questionnaire consists of 10 items (e.g., “Threatens others”; α = 0.87) with a 5-point Likert-type response scale ranging from 1 (never) to 5 (almost always).

2.2.2 Parenting predictors (T1)

2.2.2.1 Dysfunctional parenting practices

The Parenting Scale Short Form (PS-8) (64) was employed to assess parental overreactivity and laxness. This questionnaire comprises eight items and two subscales, each comprising four items. The first subscale measures overreactivity (e.g., “When my child misbehaves, I raise my voice or shout”; α = 0.74), while the second subscale concerns laxness (e.g., “I am the kind of mother/father who lets her child do what they want”; α = 0.75). The response scale of the questionnaire is a 5-point Likert-type scale ranging from 1 (never) to 5 (always).

2.2.2.2 Child-parent conflict

The Child-Parent Relationship Scale-Short Form [CPRS-SF; (65)] was employed to assess child-parent conflict. The scale comprises eight items (e.g., “My child easily becomes angry at me”; α = 0.85), with a 5-point Likert-type response scale from 1 (definitely does not apply) to 5 (definitely applies).

2.2.2.3 Parental warmth

Parental warmth was measured by six items based on the Child Rearing Scale (66). The items (e.g., “We shared pleasant and loving moments together”; α = 0.84) had a 5-point Likert-type response scale ranging from 1 (never) to 5 (always).

2.2.2.4 Parenting stress

Parenting stress was measured using a scale based on the Parental Stress Scale (PSS) (67). This scale consisted of five items (e.g., “I feel overwhelmed by the responsibilities of being a parent”; α = 0.73) with a 5-point Likert-type response scale ranging from 1 (strongly disagree) to 5 (strongly agree).

2.2.3 Parents characteristics (T1)

2.2.3.1 Parental anxiety and depressive symptoms

The Patient Health Questionnaire-4 (PHQ-4) (68) was used to measure the presence of anxiety and depressive symptoms. This brief scale consists of four items from two different subscales with two items each: anxiety (e.g., “Feeling nervous, anxious or on edge”; α = 0.84) and depression (e.g., “Feeling down, depressed or hopeless”; α = 0.81). The response scale is a 4-point Likert scale ranging from 0 (not at all) to 3 (almost every day).

2.2.3.2 Parents' perceived general stress

Perceived stress was measured using the Perceived Stress Scale-Short Form (PSS-4) (69). This four-item scale (e.g., “In the last month, how often have you felt that you were unable to control the important things in your life?”; α = 0.75) has a 5-point Likert-type response scale ranging from 0 (never) to 4 (very often).

2.2.3.3 Parents' perceived support

Emotional support and instrumental support received were measured through the subscales of the BRIEF-2 Social Support Scale (70). Both subscales were measured through three items each: emotional support received (e.g., “When I am feeling down, there is someone I can lean on”; α = 0.95) and instrumental support received (e.g., “There is someone who can help me fulfill my responsibilities when I am unable to”; α = 0.89). The response scale of the questionnaire was a 6-point Likert-type scale ranging from 0 (never) to 5 (always).

2.2.4 Outcome (T2)

2.2.4.1 Children's CP

Children's CP were measured through the Conduct Problem Scale (18), see Section 2.2.1 for more information. The children were divided into three groups based on their score on the Conduct Problem Scale at T2. The classification was as follows: children scoring below or equal to –0.5 SD of the mean were classified as low, those scoring between –0.5 SD below and 0.5 SD above the mean were classified as medium, and those scoring 0.5 SD or above the mean were classified as high.

2.3 Procedure

The longitudinal ELISA project, initiated in 2017, is an ongoing study that has been continuously conducted up to the present day. This study has been approved by the Spanish Ministry of Economy and Competitiveness and by the University of Santiago de Compostela Bioethics Committee. Initially, 126 schools across urban, suburban, and rural areas in the Autonomous Community of Galicia (northwestern Spain) were contacted. Of these, 72 public (79.2%), charter (18.1%), and private (2.8%) schools agreed to participate. Subsequently, the families of the children were invited to participate, with ~25%–50% of families in each school agreeing to take part.

The child's primary caregiver (i.e., mother, father, or primary caregiver) completed a questionnaire at each collection point. Most respondents were mothers (87.3%). School teachers were responsible for supervising the distribution and collection of the questionnaires. Every part involved used a personal keycode to access and identify the questionnaires in order to ensure data confidentiality. Informed consent was obtained from the primary caregiver prior to each data collection. There was no financial compensation for participation.

To the extent possible, an attempt was made to standardize the administration of the questionnaires (from the order of presentation of the scales to the time and place where the questionnaires were administered) across the diverse range of schools included.¹

2.4 Analyses

From the original sample of 1,352 children, discarding all the entries that miss any of the variables required is mandatory. Table 1 describes the final set of 20 variables featured in the analysis. Table 2 includes a definition of each variable extracted from the construction/validation of the instrument.

Table 1

Table 1. Names and descriptive labels of the variables included in the Random Forest analyses.

Table 2

Table 2. Names and definitions of the variables included in the Random Forest analyses.

All the variables have been rescaled from their original ranges to a standard scale (from 1 to 5) to ease the interpretation of the results. As this is a linear transformation, the original information of the rescaled variables remains unaffected.

Once the data set was filtered, the ML algorithm was trained using a 10-fold cross-validation strategy widely used in the literature (71). Following this method, the data set is partitioned into ten equal batches of samples distributed randomly. Then, a ten-step iteration process begins, where nine batches are used to train the model and the remaining one to test the algorithm. On each iteration, the batch is used for testing changes. Once the ten sets have been tested, the results can be assessed.

This study defined two different sets of metrics depending on the aspect being evaluated: regression or classification. For the regression, we chose the Pearson correlation coefficient (PCC) to assess its accuracy regarding the original values of the target variable:

\begin{array}{l} ρ_{X, Y} = \frac{c o v (X, Y)}{X Y} & (1) \end{array}

We also observed the Mean Absolute Error (MAE) of the model to understand the magnitude of the errors in the predictions:

\begin{array}{l} M A E = \frac{\sum_{i = 1}^{n} | y_{i} - x_{i} |}{n} & (2) \end{array}

For the classification, once the samples were distributed into their corresponding groups according to their predicted score, we could observe the confusion matrix of the model, the global precision and recall, and, in particular, the recall for the group of high CP.

\begin{array}{l} P r e c i s i o n = \frac{T r u e P o s i t i v e s}{T r u e P o s i t i v e s + F a l s e P o s i t i v e s} & (3) \end{array}

\begin{array}{l} R e c a l l = \frac{T r u e P o s i t i v e s}{T r u e P o s i t i v e s + F a l s e N e g a t i v e s} & (4) \end{array}

A set of widely used ML models were tested to find the most accurate. Both performance and interpretability were taken into account.

We conducted these tests two times:

• Case A: using the previous CP score (CP.1) as a predictor.

• Case B: without using CP.1 as a predictor.

With these cases, we expect to conclude the real relevance of having previously measured CP for prediction of later CP scores.

3 Results

In this section, we present the statistical analysis results for the two scenarios proposed, with and without the use of the variable CP on T1. Only 1,095 out of the 1,352 children have the complete data in the required variables described in Table 1. Table 3 presents each variable's average and standard deviation with the remaining children on a scale from 1 to 5.

Table 3

Table 3. Average score and standard deviation for studied variables.

Distributing the 1,095 children in the three groups previously mentioned, we have 249 high cases, 434 medium cases, and 412 low cases on CP.2. As this imbalance affects the representation of high cases, we opted for balancing the samples used for training. To establish a benchmark, we tested a set of models widely used in the literature for regression purposes such as: Linear Regressor (LR),² Bayesian Ridge Regressor (BRR),³ Support Vector Regressor (SVR),⁴ Gradient Boosting Regressor (GBR),⁵ and a Multi-Layer Perceptron Regressor (MLPR) implemented using Tensorflow python library.⁶ All these models were trained using optimal hyperparameters.

We then found the best possible setting for each model using the GridSearchCV function from the Scikit-Learn python library.⁷ This function searches the optimal set of ML hyperparameters. Tables 4, 5 show the regression results for both cases.

Table 4

Table 4. System results for Case A (sorted by PCC).

Table 5

Table 5. System results for Case B (sorted by PCC).

With the results of the regressions, the system can draw the children into the three categories established for this study. On Tables 4, 5 we can observe the precision and recall of each model. Results for RF are similar or better than the rest of the models validating its selection for this study.

Although RF implies a greater computational cost and complexity compared to other models such as LR or BRR, its internal structure brings a singular advantage which makes it the optimal choice for studying the interactions between the variables studied. Table 6 contains the parameters that were tested for RF during the grid search phase.

Table 6

Table 6. Random Forest hyperparameters.

For Case A, the best criterion for splitting the samples into different branches is the Poisson criterion, being the absolute error criterion for Case B. Both approaches share the rest of the optimal parameters, needing 250 estimators, with a maximum tree depth of 6 nodes, a sample split, and a leaf threshold of 0.001 (or 0.1% of the original samples). Both models were trained using their optimal configuration and a bootstrapping technique, which selects a random subsample of the data set for the training of each estimator, enhancing the accuracy of the regression.

Observing the PCC in both Tables 4, 5, we can see that the availability of records from previous years (CP.1) grants a better performance for the prediction of CP.2. In terms of MAE, Case A has an average error of 0.197 (48% of the SD of CP.2) vs. the 0.234 (57% of the SD of CP.2) from Case B. Figure 1 contains the confusion matrices for both scenarios with RF.

Figure 1

Figure 1. Heatmap representation of the confusion matrices for both cases. (a) Case A. (b) Case B.

As we can see, the prediction for Case A offers a much better distribution of samples, with almost $\frac{2}{3}$ of each category correctly classified. This scenario has a global rate of success of 62.92%, 8% better than Case B.

Once both models have been trained and tested, it is time to study their internal structures and observe the most frequent paths in the DTs that form the RF. Figure 2 includes an example of a visual representation of one of the 250 DTs conforming the RFs and all RF paths and thresholds are in Supplementary material.

Figure 2

Figure 2. Graphical representation of a DT.

Each node on the tree (except the leaf nodes) evaluates the value of one variable for all the samples that reach that node. The node sorts the samples depending on whether their magnitude for that particular variable is lower, equal, or higher regarding a threshold.

The field Value of each node represents the regression of the target variable (CP.2) for the set of samples at that particular node. The leaf nodes exist at the end of each branch and determine the final value of the regression for the samples that reach them.

This part of the analysis studies the presence or absence of the 19 (or 20 including CP.1) variables used for training the model, paying particular attention to their hierarchical distribution and combinations. When studying the structure of the trees, the level on which a variable appears may determine how relevant it is for the regression, as it implies its capability to classify larger groups of samples (e.g., the root node will be the one that can divide the complete set of samples most effectively).

A first approach, summarized in Tables 7, 8, observes the frequency of appearance of each variable on each level of the tree from 0 (root node) to 5, depending on whether the variable CP.1 was available or not.

Table 7

Table 7. Depths for Case A.

Table 8

Table 8. Depths for Case B.

As we can observe, CP.1 is the most relevant variable to predict CP.2, being the root node at every tree in the RF. It is also the predominant variable for levels 1 and 2. This means that, by establishing multiple thresholds for CP.1, the algorithm can execute a coarse classification of samples.

We should expect that the relevant variables in Case B would be the first ones to appear in Case A, too. As it is true for GD.1, we can see that variables like LAB_NEG.1 or CONFLICT.1, which are two of the most frequent root variables for Case B, are not that relevant compared to others like TEMP_INHIB.1 or CU.1 in Case A. These first results can offer powerful insights on which variables can be targeted when studying CP.

When studying the structure of the trees, it is important to observe not only the position of the variables but also how they combine to form paths in the RF. These combinations can highlight smaller subsets of variables reliable for classification. Shorter paths with strong performances may simplify future longitudinal studies by reducing the necessary number of variables needed to provide accurate predictions on CP.

Table 9 shows a concise tabular representation of the paths contained in the trees described above. As stated earlier, all the paths studied from now on lead to high values of CP.2.

Table 9

Table 9. Example of the tabular representation of a path.

A path can have a maximum of six nodes. Each row contains the variables evaluated in each node and the thresholds established to sort the samples left (variable equal or lower than the threshold) or right (variable higher than the threshold) in the trees. The thresholds are averaged if there are more than one occurrence of a particular path.

If the variable's name is presented in capital letters, it means that the samples present a higher value than the threshold for that particular variable. Otherwise (with a lower or equal score), the variable is presented in lowercase letters (i.e., Table 9 shows a path leading to high CP.2 where temp_inhib.1 was lower than 3.07; GD.1 was higher than 2.07; LAB_NEG.1 was higher than 2.17; CONFLICT.1 was higher than 3.57 and EMP_AF.1 was higher than 1.67, all of them scored over five points).

Additionally, we present the average number of samples that met all the conditions of a particular path and the precision that it accomplishes for high CP.2 prediction.

We opted to study the common roots to obtain representative subsets of variables instead of full paths. We define these roots as a set of the first N nodes of a path. This allows to merge similar paths into prototypical groups of variables.

We chose the size N that gave us roots with at least ten occurrences each in order to ensure meaningful results. For Case A, our roots contain four nodes, while for Case B, only three nodes are necessary. Tables 10, 11 present the top five most frequent roots for both Case A and Case B.

Table 10

Table 10. Five most frequent roots for Case A.

Table 11

Table 11. Five most frequent roots for Case B.

The column Avg. Length presents the average length of the paths that start with the roots presented in the tables. The column Count presents the number of paths that contain a particular root in all the 250 DTs included in the model. The column Avg. Leaf samples describes the average number of samples sorted in the leaves of the paths that start with said roots. Finally, the column Precision is the average precision of all the paths starting with the same root.

Ideally, relevant roots should be highly frequent, with high precision and, as a lesser requirement, leading to shorter paths. That would mean that only N variables would be needed to predict CP and thus would ease the data gathering process.

As shown in Table 10, the most frequent roots in Case A combine the previous measure of CP with several individual (i.e., inhibitory control, GD, emotional regulation) and family (parents' perceived stress, parent-child conflict) variables. In Case B (Table 11), psychopathic traits (INS, GD, CU) tend to appear combined with different levels of other individual variables (lability/negativity) and also with parent-child conflict.

However, we wanted to know the best roots for each case. We defined an ad-hoc metric for assessing the quality of the roots. The metric is defined as follows:

1. The root must have a Precision equal to or higher than the global precision of the model.

2. The root Count must have at least 10 samples.

3. The value is obtained using the harmonic mean of the normalized values of the Precision and the Count of each row as follows:

\begin{array}{l} P = \frac{p r e c i s i o n}{m a x (P r e c i s i o n)} & (5) \end{array}

\begin{array}{l} C = \frac{c o u n t}{m a x (C o u n t)} & (6) \end{array}

\begin{array}{l} M = \frac{2 * P * C}{P + C} & (7) \end{array}

Thus, the best roots are those that most frequently appear and, at the same time, have a higher success rate in the classification task. Tables 12, 13 present the five best roots for Case A and Case B, respectively.

Table 12

Table 12. Five best roots for Case A.

Table 13

Table 13. Five best roots for Case B.

The best roots when previous CP is included (Case A, Table 10) replicated, to a great extent, the patterns of the most frequent roots: different levels of CP cluster together with inhibitory control, GD, and emotional regulation. CP also appears combined to high CU and with parents' perceived stress. For Case B (Table 11), psychopathic traits again appear in combination with lability/negativity and with family variables (parent-child conflict and parenting stress). The highest precision (92.86%) is achieved by high levels of GD in association with high conflict and parenting stress. Additionally, high levels of lability/negativity and low emotional regulation define one of the most accurate paths (86.96%) for this case.

4 Discussion

This study explored the utility of RF models for predicting high CP in children based on individual and family factors measured in a longitudinal design. Many decades of research have suggested a plethora of variables potentially involved in the development of CP, with studies arising from a variety of theoretical and empirical perspectives. In this research, the use of ML (and, particularly, RF) has allowed us to go beyond previous approaches by using a more flexible, data-driven approach that can process a high volume of data with fewer statistical assumptions.

The specific focus of this study was to identify the most relevant individual and family predictors and to capture interactions among them. In the field of individual factors, we included both dispositional traits (e.g., temperament, psychopathic traits) and socioemotional competence (e.g., emotional regulation, empathy); as for family factors, we considered not only variables of family functioning (e.g., conflict, parenting practices) but also parents' personal conditions (e.g., stress, depression) that have been previously proposed as sources of behavioral disturbances in children [e.g., (35, 37)], thus covering a wide array of factors distilled from more traditional studies on CP.

This study evaluated two different models: Case A included the previous measure of CP as a predictor, while Case B removed the previous measure of CP, to determine how individual and family factors could be “purer” predictors without prior data on CP. Results showed that both overall prediction and specific prediction of high CP performed better in Case A, where previous CP was incorporated. This expected result reflects a well-known behavioral science principle: past behavior is the best predictor of future acts [e.g., (72)]. Within the specific field of CP in children, this is also a commonly found pattern [e.g., (73, 74)], which reflects the relative stability of CP over time (75), and, in practical terms, it shows that information about past problems is a strong indicator of risk for later CP. Nevertheless, the results also suggest that, beyond the information provided by the previous measure of CP, additional factors emerge as relevant for the classification of children according to their probability of future CP.

4.1 Identification of the most relevant predictors

Among the variables used for model training, some stand out for predicting future CP, considering their positions in the tree structures. A first noteworthy result is that individual factors play a more decisive role than family factors, regardless of whether previous CP are considered. This is consistent with findings from other studies that compared factors from different domains to predict mental health outcomes in youth (38).

This pattern of findings can also be considered in the light of broad theoretical frameworks in behavioral sciences like Bronfenbrenner's (76, 77) Bioecological Model. According to this model, human development results from several intertwining layers of influence, including the individual and progressively larger systems (e.g., family, community, cultural values, etc.); despite the powerful influence of the family contexts, individual characteristics are more directly related to behavior, and, ultimately, filter the experiences and interactions with the other layers.

Within the individual factors, our results highlight the role of temperament and psychopathic traits in the pathways to children's CP. Particularly, the results reinforce the role of effortful control (attentional focusing and, especially, inhibitory control), in alignment with other studies (78), that highlight self-regulatory dispositions as predictors of behavioral disturbances. Our results also point to emotional reactivity (lability/negativity) as a significant contributor to the prediction of CP. This result is consistent with a strong line of research on the role of negative affect (79), emotional instability (80), and irritability (81) in the configuration of the children's most common CP. Thus, RF models endorsed both the tendency to display intense emotionality and the difficulties for self-control as key factors in the prediction of CP.

Our results also bolster the role of the so-called psychopathic traits (82) for the identification of children at risk of high CP. The significance of psychopathic traits remains evident, regardless of whether prior CP are included as a predictor. Results show that CU, often considered the core of psychopathic personality (83), is located within the highest positions in the decision trees. Thus, emotional coldness, lack of guilt, and insensitivity to other needs, which define the concept of CU, are corroborated as powerful aspects for the detection of risk for high CP. However, results indicate that other psychopathic variables are also important. GD ranks even higher than CU both in Case A and Case B; it seems that interpersonal components of the psychopathic domain (e.g., arrogance, manipulativeness, disposition to cheat or lie for the own benefit) play a standing role in the prediction of CP. Also, at least in Case B, the behavioral facets of the psychopathic traits (INS), encompassing impulsivity, irresponsibility, and need for stimulation, are part of the most influential predictors. Therefore, results bring support for a burgeoning research line claiming that, besides CU traits, interpersonal and behavioral psychopathic traits should be considered for a more precise account and prognosis of high CP (21, 22).

In the family realm, results underscore the role of conflict in the relations between parents and children. This result is in keeping with previous research showing how developmental outcomes are strongly influenced by the family atmosphere, family strain, and difficult interactions between parents and children [e.g., (84, 85)]. In our study, although some other family variables (e.g., parenting stress) seem decisive in the highest nodes, conflict stands out among the other family factors in both Case A and B. The power of this dimension may respond to the fact that conflictive relations can inherently reflect other relevant family aspects, like ineffective parenting and a compromised emotional connection among the family members (86, 87), thus being an efficient indicator of broader dysfunction in the family setting. Therefore, we found that the most important variables align with findings from previous literature on CP. The RF models underscore a specific set of predictors, both at the individual and family levels, that have been emphasized in psychological research and theory. This convergence between computational findings and existing research is noteworthy, as it not only validates the identified predictors, but also reinforces some previous advances in developmental psychopathology. At the same time, the RF models offer valuable insights for further exploration. Future research should keep expanding beyond the central focus on CU traits when studying psychopathic tendencies at early ages, and place greater emphasis on the core role of parent-child conflict among the various family factors typically considered in the development of CP.

4.2 Relevant combinations of predictive factors

In addition to studying the most relevant variables for prediction of CP, we also sought to identify clusters of variables that tend to act together, in an attempt to capture the interaction patterns that emerge from the RF analyses. Both the most frequent and the most successful combinations were analyzed, and similar combinations were identified in both approaches. Some sets of variables bring together different kinds of individual factors. Specifically, psychopathic and temperamental predictors tend to appear together in a relatively high number of samples and are also part of the most accurate paths. For example, GD and lability/negativity align together frequently and successfully. Some other times, several psychopathic traits are combined with lability/negativity.

Also, the tandem of GD and social competence (emotional regulation) can be identified, particularly when CP are also included as predictors. These results suggest the need for further studying the interactions between psychopathic and affective/self-regulatory traits [see, for example, (88, 89)] for a better depiction of the psychological mechanisms underlying high CP.

The analysis of predictor combinations also points to some individual and family factors working together in the classification task. For example, inhibitory control and parents' stress show up as a common and accurate cluster, a pattern of results that reflects the relevance of studying the interactions between temperament and family, as several previous research lines have suggested (27, 90). Also, our results show that psychopathic traits and family dimensions repeatedly appear as sets of intertwined variables. Specifically, interactions identified by the RF models highlight the predictive power of high levels of parent-child conflict when combined with CU or INS. Additionally, the coexistence of high GD, parent-child conflict, and stress in the parental role defines a particularly risky setting for the development of CP. As previous research has shown, psychopathic factors could trigger family dysfunction (91), or, conversely, the family dynamics could affect the development of psychopathic traits (92); psychopathic traits could also moderate the impact of family factors on developmental outcomes (93). Our results suggest that the dynamics of psychopathic traits and family relations (particularly, the strained relations between parents and children) should be a prioritized research field, so that the predictive and etiological processes underlying CP can be fully understood.

4.3 Theoretical and practical implications

Results from RF can provide valuable orientations for theory building and refinement in the field of early CP. The findings on individual dispositions (temperament and psychopathic traits) suggest that models on CP should account not only for these factors but also for their joint effect and their interaction with family variables. Emotional reactivity and self-regulatory skills are highlighted as two main axes of temperamental influence; both axes have already been present in classical models on “difficult temperament” (94) and in theoretical models specifically designed for antisocial behavior [e.g., (95)]. Results from RF corroborate the impact of such dimensions and support their utility in the design of broader models of CP. Also, psychopathic traits should occupy a significant space in theories on stability, chronicity, and severity of CP; within this framework, not only does affective insensitivity (CU) seem relevant, but behavioral traits (INS) and, especially, the interpersonal aspects (GD; manipulativeness, narcissism, dishonest charm) should also be considered (21, 82). These dispositions should be addressed in interaction with the family function; particularly, according to our results, and in line with coercion theory (66) conflictive interactions between parents and children, possibly within a psychosocially stressed family, may be a key factor for etiological models of children's CP.

Additionally, insights from RF provide support to the developmental heterogeneity of CP (96). As shown by our results, CP can be predicted on the basis of different combinations of factors; in some of the trees, individual factors (e.g., GD and lability/negativity) seem to be dominant, while in others, both individual and family factors play a pivotal role in prediction roots (e.g., family conflict, GD and CU). In this sense, RF models can be a useful way to explore the variety and complexity of pathways leading to CP, resonating with the principles of equifinality and multifinality proposed by developmental psychopathology (97).

On a practical level, our results can help in the early identification of at-risk children by recognizing the main predictive variables, the more powerful clusters of factors, the thresholds and the itineraries for making classificatory decisions. Beyond the previous levels of CP, individual and family factors can aid in distinguishing children who may present future behavioral difficulties. This can guide social and health policies so that targeted interventions can be designed to curb the problematic pathways. Additionally, our results yield meaningful information for guiding prevention programs by highlighting major domains for intervention like interpersonal sensitivity, coping with frustration and negative emotions, executive functioning skills, and parent-child relational patterns. Some interventions targeting these factors have demonstrated their efficacy for prevention of CP, thereby reinforcing the relevance of these predictors as key contributors to the development of CP. For instance, some interventions have built on the child's individual characteristics to promote cognitive flexibility and constructive coping strategies [e.g., Problem-Solving Skills Training (PSST)] (98). Other interventions, like the Triple P-Positive Parenting Program (99) have focused on family dysfunction, and have shown significant success in addressing CP. Moreover, the combined influence of individual and familial predictors in CP prediction further supports the need for a multicomponent approach that targets a broader range of risk factors. Such an approach may be particularly beneficial in CP treatment, as demonstrated by programs like Incredible Years (100), which have shown strong short- and long-term efficacy. Based on our results, further research should advance in the design of individual- and family-oriented programs that consider the diversity of needs of CP children. In particular, modular programs [e.g., (101)], matching the intervention components to the strengths and difficulties of each child, would show promise for dealing with the heterogeneity of pathways in children's CP.

4.4 Limitations and suggestions for further research

The results of this study should be viewed in light of some limitations. First, despite the longitudinal nature of the design, which enables prospective predictions not attainable by cross-sectional research, the follow-up (1 year) was relatively short. We analyzed data from children across the elementary school years, a critical period for the development of the most stable patterns of CP; however, prediction through a longer time frame, including the preschool years, would be particularly useful for identifying the earliest predictors and informing preventive practices in early childhood.

Second, while the ELISA sample was relatively large and reasonably heterogeneous, it was community-based and drawn from a specific sociocultural setting, which may restrict full generalizability to other samples and settings. Additionally, in studies like ours, the likelihood of involving children with very high CP is moderate unless there is an oversampling of participants exhibiting significant difficulties at baseline. Therefore, caution is advised, as the results might not be fully applicable to other groups. Further studies in diverse populations and social backgrounds are needed to evaluate the robustness and generalizability of our findings. Such research will help clarify which predictors of CP are consistent across different contexts and provide a more refined perspective on CP prediction. It is also worth noting that in this study, we excluded children who missed any data collection points. Additionally, previous studies associated with the ELISA project (32, 102), as well as existing literature (103), have shown that children with incomplete assessments typically have a lower socioeconomic status (SES) compared to those with complete data. This may introduce bias and limit the generalizability of the findings to children from lower SES backgrounds. Therefore, it is recommended that future studies place a stronger emphasis on retaining participants from disadvantaged backgrounds, as they are more likely to drop out of community-based longitudinal studies.

Third, our selection of family variables prioritized malleable factors, which are particularly useful for informing the design of preventive interventions. Consequently, certain variables, such as socioeconomic background, parental education level, and the presence of serious mental disorders in parents, were not included in the model. Research has shown that these factors can significantly affect developmental outcomes, including CP, often indirectly [e.g., (104)]. Incorporating such variables into future models could help further elucidate their role within a more comprehensive framework of risk factors.

Future research should address these limitations. As suggested above, applying RF to long-term longitudinal data will enhance the opportunity for predictions across extended periods of the lifespan. Additionally, examining predictors measured at different ages (e.g., preschool and primary school) could help identify which factors are more significant at different developmental stages and depict how the pathways to CP unfold through time. This approach would enhance prediction and theory building and improve prevention by highlighting critical ages for timely intervention and indicating which factors should be addressed at different developmental times.

Future investigations should also refine the RF models by incorporating a wider range of potential predictors. While individual and family factors are recognized as fundamental domains underlying CP [e.g., (105)], other variables from the community, school and peer contexts could boost the predictive power of the RF and provide new insights on the interactions that drive CP.

Also, RF models in this field can be fine-tuned by considering how the pathways to CP may be conditioned by gender. Higher prevalences of CP in boys are commonly found (106), although when CP are developed in girls, the behavioral difficulties may be particularly detrimental [i.e., the “gender paradox”; (107)]. Nevertheless, the differential factors involved in the development of CP in boys and girls are poorly understood (108). Longitudinal studies with RF can be a useful tool to unravel the distinctive routes to CP across genders.

4.5 Concluding remarks

This study illustrates how explainable ML can integrate predictive accuracy with psychological understanding, elucidating not only the key variables, but also the ways in which they interact across multiple levels of influence to contribute to maladaptative developmental trajectories. Data-driven models based on RF algorithms represent a promising approach to address the complexities of predicting and explaining CP. Our results emphasize the influence of individual and family factors and demonstrate how these elements are intricately combined to shape heterogeneous pathways. Moreover, findings from this longitudinal study can inform more effective screening processes and tailored interventions; they also highlight the value of bridging psychological insights and advanced computational methods to better address one of the most prevalent and impairing challenges in children's mental health.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Bioethics Committee from the University of Santiago de Compostela. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians.

Author contributions

ER: Conceptualization, Funding acquisition, Investigation, Writing – original draft, Writing – review & editing. JG-G: Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. MÁ-V: Data curation, Writing – original draft, Writing – review & editing. EC-M: Funding acquisition, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing. BD-V: Data curation, Writing – original draft, Writing – review & editing. AB-C: Methodology, Writing – original draft, Writing – review & editing. PV: Funding acquisition, Investigation, Project administration, Writing – review & editing. LL-R: Conceptualization, Data curation, Investigation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was partially supported by (i) projects TED2021-130824B-C21 and TED2021-130824B-C22 funded by MICIU/AEI/10.13039/501100011033 and by the UE NextGenerationEU/PRTR; (ii) project PID2019-107897RB-I00 funded by MICIU/AEI/10.13039/501100011033; (iii) grant ED431C (2022/17) funded by Xunta de Galicia; (iv) grant ED481A-2023-090 (JG-G) funded by Galician Regional Ministry of Education, Science, Universities and Vocational training; and by FSE+; (v) grant FPU21/00552 (MÁ-V) funded by MICIU/AEI/10.13039/501100011033 and by FSE+; (vi) grant FPU22/02200 (BD-V) funded by MICIU/AEI/10.13039/501100011033 and by FSE+; (vii) grant FPU21/00798 (AB-C) funded by MICIU/AEI/10.13039/501100011033 and by FSE+; and (viii) grant RYC2021-032890-I (LL-R) funded by MICIU/AEI/10.13039/501100011033 and European Union NextGenerationEU/PRTR.

Acknowledgments

This study was made possible thanks to the participants of the ELISA project—children, families and schools—, as well as to the research team who have worked on it over the years.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1526413/full#supplementary-material

Footnotes

1. ^See http://www.personalitydevelopmentcollaborative.org/project-page-elisa/ for additional details.

2. ^Available at: https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html, October 2024.

3. ^Available at: https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.BayesianRidge.html, October 2024.

4. ^Available at: https://scikit-learn.org/stable/modules/generated/sklearn.svm.SVR.html, October 2024.

5. ^Available at: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.GradientBoostingRegressor.html, October 2024.

6. ^Available at: https://www.tensorflow.org/api_docs/python/tf/keras, October 2024.

7. ^Available at: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.GridSearchCV.html, October 2024.

References

1. McMahon RJ, Frick PJ. Conduct and Oppositional Disorders. New York, NY: The Guilford Press (2019). p. 102–172.

Google Scholar

2. Association AP. Diagnostic and Statistical Manual of Mental Disorders, 5th Edn text revision ed. Washington, DC: American Psychiatric Association (2022).

Google Scholar

3. Fairchild G, Hawes DJ, Frick PJ, Copeland WE, Odgers CL, Franke B, et al. Conduct disorder. Nat Rev Dis Primers. (2019) 5:43. doi: 10.1038/s41572-019-0095-y

PubMed Abstract | Crossref Full Text | Google Scholar

4. Merikangas KR, Nakamura EF, Kessler RC. Epidemiology of mental disorders in children and adolescents. Dialogues Clin Neurosci. (2009) 11:7–20. doi: 10.31887/DCNS.2009.11.1/krmerikangas

PubMed Abstract | Crossref Full Text | Google Scholar

5. Burt SA, Hyde LW, Frick PJ, Jaffee SR, Shaw DS, Tremblay R. Commentary: Childhood conduct problems are a public health crisis and require resources: a commentary on Rivenbark et.al. (2018). J Child Psychol Psychiatry. (2018) 59:711–3. doi: 10.1111/jcpp.12930

PubMed Abstract | Crossref Full Text | Google Scholar

6. Odgers CL, Moffitt TE, Broadbent JM, Dickson N, Hancox RJ, Harrington H, et al. Female and male antisocial trajectories: from childhood origins to adult outcomes. Dev Psychopathol. (2008) 20:673–716. doi: 10.1017/S0954579408000333

PubMed Abstract | Crossref Full Text | Google Scholar

7. Bevilacqua L, Hale D, Barker ED, Viner R. Conduct problems trajectories and psychosocial outcomes: a systematic review and meta-analysis. Eur Child Adolesc Psychiatry. (2018) 27:1239–60. doi: 10.1007/s00787-017-1053-4

PubMed Abstract | Crossref Full Text | Google Scholar

8. Ferrari AJ, Santomauro DF, Herrera AMM, Shadid J, Ashbaugh C, Erskine HE, et al. Global, regional, and national burden of 12 mental disorders in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Psychiatry. (2022) 9:1115–28. doi: 10.1016/S2215-0366(21)00395-3

PubMed Abstract | Crossref Full Text | Google Scholar

9. Rivenbark JG, Odgers CL, Caspi A, Harrington HL, Hogan S, Houts RM, et al. The high societal costs of childhood conduct problems: evidence from administrative records up to age 38 in a longitudinal birth cohort. J Child Psychol Psychiatry. (2018) 59:703–10. doi: 10.1111/jcpp.12850

PubMed Abstract | Crossref Full Text | Google Scholar

10. Goulter N, Hur YS, Jones DE, Godwin J, McMahon RJ, Dodge KA, et al. Kindergarten conduct problems are associated with monetized outcomes in adolescence and adulthood. J Child Psychol Psychiatry. (2024) 65:328–39. doi: 10.1111/jcpp.13837

PubMed Abstract | Crossref Full Text | Google Scholar

11. Lahey BB, Waldman ID. A developmental propensity model of the origins of conduct problems during childhood and adolescence. In: Lahey BB, Moffitt TE, Caspi A, editors. Causes of Conduct Disorder and Juvenile Delinquency. New York, NY: The Guilford Press (2003). p. 76–117.

Google Scholar

12. Halvorson MA, King KM, Lengua LJ. Examining interactions between negative emotionality and effortful control in predicting preadolescent adjustment problems. J Appl Dev Psychol. (2022) 79:101374. doi: 10.1016/j.appdev.2021.101374

PubMed Abstract | Crossref Full Text | Google Scholar

13. Gross JJ, Thompson RA. Emotion regulation: conceptual foundations. In: Gross JJ, editor. Handbook of Emotion Regulation. New York, NY: The Guilford Press (2007). p. 3–24.

Google Scholar

14. Rothbart MK, Evans DE, Ahadi SA. Temperament and personality: origins and outcomes. J Pers Soc Psychol. (2000) 78:122–35. doi: 10.1037//0022-3514.78.1.122

PubMed Abstract | Crossref Full Text | Google Scholar

15. Rothbart MK, Bates JE. Temperament. In: Eisenberg N, Damon W, Lerner RM, editors. Handbook of Child Psychology. Hoboken, NJ: Wiley. (2006). doi: 10.1002/9780470147658.chpsy0303

Crossref Full Text | Google Scholar

16. Eisenberg N, Spinrad TL, Eggum ND. Emotion-related self-regulation and its relation to children's maladjustment. Annu Rev Clin Psychol. (2010) 6:495–525. doi: 10.1146/annurev.clinpsy.121208.131208

PubMed Abstract | Crossref Full Text | Google Scholar

17. Mitchison GM, Liber JM, Hannesdottir DK, Njardvik U. Emotion dysregulation, ODD and conduct problems in a sample of five and six-year-old children. Child Psychiatry Hum Dev. (2020) 51:71–9. doi: 10.1007/s10578-019-00911-7

PubMed Abstract | Crossref Full Text | Google Scholar

18. Colins OF, Andershed H, Frogner L, Lopez-Romero L, Veen V, Andershed AK, et al. new measure to assess psychopathic personality in children: the child problematic traits inventory. J Psychopathol Behav Assess. (2014) 36:4–21. doi: 10.1007/s10862-013-9385-y

PubMed Abstract | Crossref Full Text | Google Scholar

19. Fanti KA, Kyranides MN, Lordos A, Colins OF, Andershed H. Unique and interactive associations of callous-unemotional traits, impulsivity and grandiosity with child and adolescent conduct disorder symptoms. J Psychopathol Behav Assess. (2018) 40:40–9. doi: 10.1007/s10862-018-9655-9

Crossref Full Text | Google Scholar

20. Frogner L, Andershed AK, Andershed H. Psychopathic personality works better than CU traits for predicting fearlessness and ADHD symptoms among children with conduct problems. J Psychopathol Behav Assess. (2018) 40:1115–28. doi: 10.1007/s10862-018-9651-0

PubMed Abstract | Crossref Full Text | Google Scholar

21. López-Romero L, Colins OF, Fanti K, Salekin RT, Romero E, Andershed H. Testing the predictive and incremental validity of callous-unemotional traits versus the multidimensional psychopathy construct in preschool children. J Crim Justice. (2022) 80:101744. doi: 10.1016/j.jcrimjus.2020.101744

Crossref Full Text | Google Scholar

22. Colins OF, López-Romero L, Romero E, Andershed H. The prognostic usefulness of multiple specifiers for subtyping conduct problems in early childhood. J Am Acad Child Adolesc Psychiatry. (2024) 63:443–53. doi: 10.1016/j.jaac.2023.05.022

PubMed Abstract | Crossref Full Text | Google Scholar

23. López-Romero L, Andershed H, Romero E, Cervin M. In search of conceptual clarity about the structure of psychopathic traits in children: a network-based proposal. Child Psychiatry Hum Dev. (2024). doi: 10.1007/s10578-023-01649-z

PubMed Abstract | Crossref Full Text | Google Scholar

24. Levantini V, Muratori P, Bertacchi I, Grilli V, Marzano A, Masi G, et al. The “measure of empathy in early childhood”: psychometric properties and associations with externalizing problems and callous unemotional traits. Child Psychiatry Hum Dev. (2024). doi: 10.1007/s10578-024-01673-7

PubMed Abstract | Crossref Full Text | Google Scholar

25. Moul C, Hawes DJ, Dadds MR. Mapping the developmental pathways of child conduct problems through the neurobiology of empathy. Neurosci Biobehav Rev. (2018) 91:34–50. doi: 10.1016/j.neubiorev.2017.03.016

PubMed Abstract | Crossref Full Text | Google Scholar

26. Viding E, McCrory E. Disruptive behavior disorders: the challenge of delineating mechanisms in the face of heterogeneity. Am J Psychiatry. (2020) 177:811–7. doi: 10.1176/appi.ajp.2020.20070998

PubMed Abstract | Crossref Full Text | Google Scholar

27. Coe JL, Micalizzi L, Huffhines L, Seifer R, Tyrka AR, Parade SH. Effortful control, parent-child relationships, and behavior problems among preschool-aged children experiencing adversity. J Child Fam Stud. (2024) 33:663–72. doi: 10.1007/s10826-023-02741-7

PubMed Abstract | Crossref Full Text | Google Scholar

28. Reid JB, Patterson G, Snyder J. Antisocial behavior in children and adolescents: a developmental analysis and model for intervention. Am Psychol Assoc. (2002). doi: 10.1037/10468-000

Crossref Full Text | Google Scholar

29. Farrington DP. Childhood origins of antisocial behavior. Clin Psychol Psychother. (2005) 12:177–90. doi: 10.1002/cpp.448

Crossref Full Text | Google Scholar

30. Moffitt TE. Life-course-persistent versus adolescence-limited antisocial behavior. In: Cicchetti D, Cohen DJ, editors. Developmental Psychopathology: Volume Three: Risk, Disorder, and Adaptation. Hoboken, NJ: Wiley (2006). p. 570–98. doi: 10.1002/9780470939406.ch15

Crossref Full Text | Google Scholar

31. Pinquart M. Associations of parenting dimensions and styles with externalizing problems of children and adolescents: an updated meta-analysis. Dev Psychol. (2017) 53:873–932. doi: 10.1037/dev0000295

PubMed Abstract | Crossref Full Text | Google Scholar

32. Fanti KA, Mavrommatis I, Díaz-Vázquez B, López-Romero L, Romero E, Álvarez Voces M, et al. Fearlessness as an underlying mechanism leading to conduct problems: testing the INTERFEAR model in a community sample in Spain. Children. (2024) 11:546. doi: 10.3390/children11050546

PubMed Abstract | Crossref Full Text | Google Scholar

33. Lee SJ, Pace GT, Lee JY, Knauer H. The association of fathers' parental warmth and parenting stress to child behavior problems. Child Youth Serv Rev. (2018) 91:1–10. doi: 10.1016/j.childyouth.2018.05.020

PubMed Abstract | Crossref Full Text | Google Scholar

34. Neece CL, Green SA, Baker BL. Parenting stress and child behavior problems: a transactional relationship across time. Am J Intellect Dev Disabil. (2012) 117:48–66. doi: 10.1352/1944-7558-117.1.48

PubMed Abstract | Crossref Full Text | Google Scholar

35. Anthony LG, Anthony BJ, Glanville DN, Naiman DQ, Waanders C, Shatter S. The relationships between parenting stress, parenting behaviour and preschoolers' social competence and behaviour problems in the classroom. Infant Child Dev. (2005) 14:133–54. doi: 10.1002/icd.385

PubMed Abstract | Crossref Full Text | Google Scholar

36. Mäntymaa M, Puura K, Luoma I, Latva R, Salmelin RK, Tamminen T. Predicting internalizing and externalizing problems at five years by child and parental factors in infancy and toddlerhood. Child Psychiatry Hum Dev. (2012) 43:153–70. doi: 10.1007/s10578-011-0255-0

PubMed Abstract | Crossref Full Text | Google Scholar

37. Basto-Pereira M, Farrington DP. Developmental predictors of offending and persistence in crime: a systematic review of meta-analyses. Aggress Violent Behav. (2022) 65:1–11. doi: 10.1016/j.avb.2022.101761

Crossref Full Text | Google Scholar

38. Rothenberg WA, Bizzego A, Esposito G, Lansford JE, Al-Hassan SM, Bacchini D, et al. Predicting adolescent mental health outcomes across cultures: a machine learning approach. J Youth Adolesc. (2023) 52:1595–619. doi: 10.1007/s10964-023-01767-w

PubMed Abstract | Crossref Full Text | Google Scholar

39. Trentacosta CJ, Hyde LW, Goodlett BD, Shaw DS. Longitudinal prediction of disruptive behavior disorders in adolescent males from multiple risk domains. Child Psychiatry Hum Dev. (2013) 44:561–72. doi: 10.1007/s10578-012-0349-3

PubMed Abstract | Crossref Full Text | Google Scholar

40. Dwyer DB, Falkai P, Koutsouleris N. Machine learning approaches for clinical psychology and psychiatry. Annu Rev Clin Psychol. (2018) 14:91–118. doi: 10.1146/annurev-clinpsy-032816-045037

PubMed Abstract | Crossref Full Text | Google Scholar

41. Fife DA, D'Onofrio J. Common, uncommon, and novel applications of random forest in psychological research. Behav Res Methods. (2023) 55:2447–66. doi: 10.3758/s13428-022-01901-9

PubMed Abstract | Crossref Full Text | Google Scholar

42. Rose S. Intersections of machine learning and epidemiological methods for health services research. Int J Epidemiol. (2020) 49:1763–70. doi: 10.1093/ije/dyaa035

PubMed Abstract | Crossref Full Text | Google Scholar

43. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning, Second Edition. New York, NY: Springer. (2009). doi: 10.1007/978-0-387-84858-7

Crossref Full Text | Google Scholar

44. Jorm LR. Towards machine learning-enabled epidemiology. Int J Epidemiol. (2020) 49:1770–3. doi: 10.1093/ije/dyaa242

PubMed Abstract | Crossref Full Text | Google Scholar

45. Mahmoud HFF. Parametric versus semi and nonparametric regression models. Int J Stat Probab. (2021) 10:90–108. doi: 10.5539/ijsp.v10n2p90

Crossref Full Text | Google Scholar

46. Beam AL, Kohane IS. Big data and machine learning in health care. JAMA. (2018) 319:1317–8. doi: 10.1001/jama.2017.18391

PubMed Abstract | Crossref Full Text | Google Scholar

47. Bi Q, Goodman KE, Kaminsky J, Lessler J. What is machine learning? A primer for the epidemiologist. Am J Epidemiol. (2019) 188:2222–39. doi: 10.1093/aje/kwz189

PubMed Abstract | Crossref Full Text | Google Scholar

48. Breiman L. Statistical modeling: the two cultures (with comments and a rejoinder by the author). Statist Sci. (2002) 16:199–231. doi: 10.1214/ss/1009213726

Crossref Full Text | Google Scholar

49. Pauli R, Kohls G, Tino P, Rogers JC, Baumann S, Ackermann K, et al. Machine learning classification of conduct disorder with high versus low levels of callous-unemotional traits based on facial emotion recognition abilities. Eur Child Adolesc Psychiatry. (2023) 32:589–600. doi: 10.1007/s00787-021-01893-5

PubMed Abstract | Crossref Full Text | Google Scholar

50. Vedechkina M, Holmes J. Cognitive difficulties following adversity are not related to mental health: findings from the ABCD study. Dev Psychopathol. (2023). doi: 10.31234/osf.io/mtra9

Crossref Full Text | Google Scholar

51. Pauli R, Tino P, Rogers JC, Baker R, Clanton R, Birch P, et al. Positive and negative parenting in conduct disorder with high versus low levels of callous-unemotional traits. Dev Psychopathol. (2021) 33:980–91. doi: 10.1017/S0954579420000279

PubMed Abstract | Crossref Full Text | Google Scholar

52. Chan L, Simmons C, Tillem S, Conley M, Brazil IA, Baskin-Sommers A. Classifying conduct disorder using a biopsychosocial model and machine learning method. Biol Psychiatry Cogn Neurosci Neuroimaging. (2023) 8:599–608. doi: 10.1016/j.bpsc.2022.02.004

PubMed Abstract | Crossref Full Text | Google Scholar

53. Benzinger L, Ursin F, Balke WT, Kacprowski T, Salloch S. Should Artificial Intelligence be used to support clinical ethical decision-making? A systematic review of reasons. BMC Medical Ethics. (2023) 24:48. doi: 10.1186/s12910-023-00929-6

PubMed Abstract | Crossref Full Text | Google Scholar

54. Karimian G, Petelos E, Evers SMAA. The ethical issues of the application of artificial intelligence in healthcare: a systematic scoping review. AI Ethics. (2022) 2:539–51. doi: 10.1007/s43681-021-00131-7

Crossref Full Text | Google Scholar

55. Blease C, Kharko A, Annoni M, Gaab J, Locher C. Machine learning in clinical psychology and psychotherapy education: a mixed methods pilot survey of postgraduate students at a Swiss University. Front Public Health. (2021) 4:623088. doi: 10.3389/fpubh.2021.623088

PubMed Abstract | Crossref Full Text | Google Scholar

56. Khare SK, Acharya UR. An explainable and interpretable model for attention deficit hyperactivity disorder in children using EEG signals. Comput Biol Med. (2023) 155:106676. doi: 10.1016/j.compbiomed.2023.106676

PubMed Abstract | Crossref Full Text | Google Scholar

57. Shivaprasad S, Chadaga K, Saldanha C, Sampathila N, Prabhu S. An interpretable schizophrenia diagnosis framework using machine learning and explainable artificial intelligence. Syst Sci Control Eng. (2024) 12:2364033. doi: 10.1080/21642583.2024.2364033

Crossref Full Text | Google Scholar

58. Byeon H. Advances in machine learning and explainable artificial intelligence for depression prediction. Int J Adv Comput Sci Appl. (2023) 14:520–26. doi: 10.14569/IJACSA.2023.0140656

Crossref Full Text | Google Scholar

59. Benito GV, Goldberg X, Brachowicz N, Castaño-Vinyals G, Blay N, Espinosa A, et al. Machine learning for anxiety and depression profiling and risk assessment in the aftermath of an emergency. Artif Intell Med. (2024) 157:102991. doi: 10.1016/j.artmed.2024.102991

PubMed Abstract | Crossref Full Text | Google Scholar

60. Viswan V, Shaffi N, Mahmud M, Subramanian K, Hajamohideen F. Explainable artificial intelligence in Alzheimer's disease classification: a systematic review. Cognit Comput. (2024) 16:1–44. doi: 10.1007/s12559-023-10192-x

PubMed Abstract | Crossref Full Text | Google Scholar

61. Simonds J, Rothbart MK. The Temperament in Middle Childhood Questionnaire (TMCQ): a computerized self-report measure of temperament for ages 7–10. In: Occasional Temperament Conference. Athens, GA (2004). doi: 10.1037/t70081-000

Crossref Full Text | Google Scholar

62. Shields A, Cicchetti D. Emotion regulation among school-age children: the development and validation of a new criterion Q-sort scale. Dev Psychol. (1997) 33:906–16. doi: 10.1037//0012-1649.33.6.906

PubMed Abstract | Crossref Full Text | Google Scholar

63. Dadds MR, Hunter K, Hawes DJ, Frost ADJ, Vassallo S, Bunn P, et al. A measure of cognitive and affective empathy in children using parent ratings. Child Psychiatry Hum Dev. (2008) 39:111–22. doi: 10.1007/s10578-007-0075-4

PubMed Abstract | Crossref Full Text | Google Scholar

64. Kliem S, Lohmann A, Mößle T, Foran HM, Hahlweg K, Zenger M, et al. Development and validation of a parenting scale short form (PS-8) in a representative population sample. J Child Fam Stud. (2019) 28:30–41. doi: 10.1007/s10826-018-1257-3

Crossref Full Text | Google Scholar

65. Driscoll K, Pianta RC. Mothers' and fathers' perceptions of conflict and closeness in parent-child relationships during early childhood. J Early Child Infant Psychol. (2011) 7:1–24.

PubMed Abstract | Google Scholar

66. Paterson G, Sanson A. The association of behavioural adjustment to temperament, parenting and family characteristics among 5-year-old children. Soc Dev. (1999) 8:293–309. doi: 10.1111/1467-9507.00097

Crossref Full Text | Google Scholar

67. Berry JO, Jones WH. The parental stress scale: initial psychometric evidence. J Soc Pers Relat. (1995) 12:472. doi: 10.1177/0265407595123009

Crossref Full Text | Google Scholar

68. Kroenke K, Spitzer RL, Williams JBW, Löwe B. An ultra-brief screening scale for anxiety and depression: the PHQ-4. Psychosomatics. (2009) 50:613–21. doi: 10.1016/S0033-3182(09)70864-3

Crossref Full Text | Google Scholar

69. Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. (1983) 24:385–96. doi: 10.2307/2136404

Crossref Full Text | Google Scholar

70. Obst P, Shakespeare-Finch J, Krosch DJ, Rogers EJ. Reliability and validity of the Brief 2-Way Social Support Scale: an investigation of social support in promoting older adult well-being. SAGE Open Med. (2019) 7:1–10. doi: 10.1177/2050312119836020

PubMed Abstract | Crossref Full Text | Google Scholar

71. Berrar D. Cross-validation. In: Ranganathan S, Gribskov M, Nakai K, Christian Schönbach C, editors. Reference Module in Life Sciences Encyclopedia of Bioinformatics and Computational Biology, Vol. 1. Amsterdam: Elsevier (2018). p. 542–5. doi: 10.1016/B978-0-12-809633-8.20349-X

Crossref Full Text | Google Scholar

72. Ouellette JA, Wood W. Habit and intention in everyday life: the multiple processes by which past behavior predicts future behavior. Psychol Bull. (1998) 124:54. doi: 10.1037//0033-2909.124.1.54

Crossref Full Text | Google Scholar

73. Stone LL, Otten R, Engels RCME, Kuijpers RCWM, Janssens JMAM. Relations between internalizing and externalizing problems in early childhood. Child Youth Care Forum. (2015) 44:635–53. doi: 10.1007/s10566-014-9296-4

Crossref Full Text | Google Scholar

74. Hare MM, Trucco EM, Hawes SW, Villar M, Zucker RA. Pathways to substance use: examining conduct problems and parenting behaviors from preschool to adolescence. Dev Psychopathol. (2024) 36:454–66. doi: 10.1017/S0954579422001328

PubMed Abstract | Crossref Full Text | Google Scholar

75. Niv S, Tuvblad C, Raine A, Baker LA. Aggression and rule-breaking: heritability and stability of antisocial behavior problems in childhood and adolescence. J Crim Justice. (2013) 41:285–91. doi: 10.1016/j.jcrimjus.2013.06.014

PubMed Abstract | Crossref Full Text | Google Scholar

76. Bronfenbrenner U. Interacting systems in human development: research paradigms: present and future. In: Bolger N, Caspi A, Downey G, Moorehouse M, editors. Persons in Context. Developmental Processes. Cambridge: Cambridge University Press (1988). p. 25–49. doi: 10.1017/CBO9780511663949.003

Crossref Full Text | Google Scholar

77. Bronfenbrenner U. Making Human Beings Human: Bioecological Perspectives on Human Development. London: Sage Publications Ltd. (2005).

Google Scholar

78. Robson DA, Allen MS, Howard SJ. Self-regulation in childhood as a predictor of future outcomes: a meta-analytic review. Psychol Bull. (2020) 146:324–54. doi: 10.1037/bul0000227

PubMed Abstract | Crossref Full Text | Google Scholar

79. Martel MM, Gremillion ML, Roberts B. Temperament and common disruptive behavior problems in preschool. Pers Individ Dif . (2012) 53:874–9. doi: 10.1016/j.paid.2012.07.011

PubMed Abstract | Crossref Full Text | Google Scholar

80. Caprara GV, Barbaranelli C, Incatasciato M, Pastorelli C, Rabasca A. Emotional instability, physical and verbal aggression, and prosocial behavior as precursors of scholastic achievement and social adjustment. In: Feshbach S, Zagrodzka J, editors. Aggression. The Plenum Series in Social/Clinical Psychology. Boston, MA: Springer (1997). p. 111–20. doi: 10.1007/978-1-4615-5883-5_7

Crossref Full Text | Google Scholar

81. Leibenluft E, Stoddard J. The developmental psychopathology of irritability. Dev Psychopathol. (2013) 25:1473–87. doi: 10.1017/S0954579413000722

PubMed Abstract | Crossref Full Text | Google Scholar

82. Salekin RT. Research review: what do we know about psychopathic traits in children? J Child Psychol Psychiatry. (2017) 58:1180–200. doi: 10.1111/jcpp.12738

PubMed Abstract | Crossref Full Text | Google Scholar

83. Frick PJ, Ray JV. Evaluating callous-unemotional traits as a personality construct. J Pers. (2015) 83:710–22. doi: 10.1111/jopy.12114

PubMed Abstract | Crossref Full Text | Google Scholar

84. LoBraico EJ, Bray BC, Feinberg ME, Fosco GM. Constellations of family risk for long-term adolescent antisocial behavior. J Fam Psychol. (2020) 34:587. doi: 10.1037/fam0000640

PubMed Abstract | Crossref Full Text | Google Scholar

85. Savell SM, Saini R, Ramos M, Wilson MN, Lemery-Chalfant K, Shaw DS. Family processes and structure: longitudinal influences on adolescent disruptive and internalizing behaviors. Fam Relat. (2023) 72:361–82. doi: 10.1111/fare.12728

PubMed Abstract | Crossref Full Text | Google Scholar

86. Kim KJ. Parent-adolescent conflict, negative emotion, and estrangement from the family of origin. Res Hum Dev. (2006) 3:45–58. doi: 10.1207/s15427617rhd0301_5

Crossref Full Text | Google Scholar

87. Moed A. Mothers' Aversion sensitivity and reciprocal negativity in mother-child interactions: implications for coercion theory. Dev Psychol. (2022) 58:2239–51. doi: 10.1037/dev0001427

PubMed Abstract | Crossref Full Text | Google Scholar

88. Kochanska G, Barry RA, Jimenez NB, Hollatz AL, Woodard J. Guilt and effortful control: two mechanisms that prevent disruptive developmental trajectories. J Pers Soc Psychol. (2009) 97:322–33. doi: 10.1037/a0015471

PubMed Abstract | Crossref Full Text | Google Scholar

89. Waschbusch DA, Baweja R, Babinski DE, Mayes SD, Waxmonsky JG. Irritability and limited prosocial emotions/callous-unemotional traits in elementary-school-age children. Behav Ther. (2020) 51:223–37. doi: 10.1016/j.beth.2019.06.007

PubMed Abstract | Crossref Full Text | Google Scholar

90. Chen N, Deater-Deckard K, Bell MA. The role of temperament by family environment interactions in child maladjustment. J Abnorm Child Psychol. (2014) 42:1251–62. doi: 10.1007/s10802-014-9872-y

PubMed Abstract | Crossref Full Text | Google Scholar

91. Tuvblad C, Bezdjian S, Raine A, Baker LA. Psychopathic personality and negative parent-to-child affect: a longitudinal cross-lag twin study. J Crim Justice. (2013) 41:331–41. doi: 10.1016/j.jcrimjus.2013.07.001

PubMed Abstract | Crossref Full Text | Google Scholar

92. Joyner B, Beaver KM. Examining the potential link between child maltreatment and callous-unemotional traits in children and adolescents: a multilevel analysis. Child Abuse Neglect. (2021) 122:105327. doi: 10.1016/j.chiabu.2021.105327

PubMed Abstract | Crossref Full Text | Google Scholar

93. Yeh MT, Chen P, Raine A, Baker LA, Jacobson KC. Child psychopathic traits moderate relationships between parental affect and child aggression. J Am Acad Child Adolesc Psychiatry. (2011) 50:1054–64. doi: 10.1016/j.jaac.2011.06.013

PubMed Abstract | Crossref Full Text | Google Scholar

94. Thomas A, Chess S. The role of temperament in the contributions of individuals to their development. In: Lerner RM, Busch-Rossnagel, editors. Individuals As Producers of their Development. A Life-Span Perspective. Amsterdam: Elsevier (1981). p. 231–55. doi: 10.1016/B978-0-12-444550-5.50016-X

Crossref Full Text | Google Scholar

95. DeLisi M, Vaughn MG. Foundation for a temperament-based theory of antisocial behavior and criminal justice system involvement. J Crim Justice. (2014) 42:10–25. doi: 10.1016/j.jcrimjus.2013.11.001

Crossref Full Text | Google Scholar

96. Dugré JR, Potvin S. Multiple developmental pathways underlying conduct problems: a multitrajectory framework. Dev Psychopathol. (2022) 34:1115–24. doi: 10.1017/S0954579420001650

PubMed Abstract | Crossref Full Text | Google Scholar

97. Cicchetti D, Toth SL. The past achievements and future promises of developmental psychopathology: the coming of age of a discipline. J Child Psychol Psychiatry. (2009) 50:16–25. doi: 10.1111/j.1469-7610.2008.01979.x

PubMed Abstract | Crossref Full Text | Google Scholar

98. Kazdin AE, Bass D, Siegel T, Thomas C. Cognitive-behavioral therapy and relationship therapy in the treatment of children referred for antisocial behavior. J Consult Clin Psychol. (1989) 57:522–35. doi: 10.1037//0022-006X.57.4.522

PubMed Abstract | Crossref Full Text | Google Scholar

99. Sanders MR, Markie-Dadds C, Turner KMT. Practitioner's Manual for Enhanced Triple P. Huntington Beach, CA: Families International (1998).

PubMed Abstract | Google Scholar

100. Webster-Stratton C, Reid J. In: Kazdin AE, Bass D, Weisz JR, editors. The incredible years parents, teachers, and children training series: a multifaceted treatment approach for young children with conduct problems. In: Kazdin AE, Weisz JR, editors. Evidence-based Psychotherapies for Children and Adolescents. New York, NY: Guildford Publications (2010). p. 224–40.

Google Scholar

101. Harmon SL, Price MA, Corteselli KA, Lee EH, Metz K, Bonadio FT, et al. Evaluating a modular approach to therapy for children with anxiety, depression, trauma, or conduct problems (MATCH) in school-based mental health care: study protocol for a randomized controlled trial. Front Psychol. (2021) 12:639493. doi: 10.3389/fpsyg.2021.639493

PubMed Abstract | Crossref Full Text | Google Scholar

102. Álvarez Voces M, Díaz-Vázquez B, López-Romero L, Villar P, Romero E. Gender differences in co-developmental trajectories of internalizing and externalizing problems: a 7-year longitudinal study from ages 3 to 12. Child Psychiatry Hum Dev. (2024). doi: 10.1007/s10578-024-01771-6

PubMed Abstract | Crossref Full Text | Google Scholar

103. Young AF, Powers JR, Bell SL. Attrition in longitudinal studies: who do you lose? Aust N Z J Public Health. (2006) 30:353–61. doi: 10.1111/j.1467-842X.2006.tb00849.x

PubMed Abstract | Crossref Full Text | Google Scholar

104. Devenish B, Hooley M, Mellor D. The pathways between socioeconomic status and adolescent outcomes: a systematic review. Am J Community Psychol. (2017) 59:219–38. doi: 10.1002/ajcp.12115

PubMed Abstract | Crossref Full Text | Google Scholar

105. Heberle AE, Thomas YM, Briggs-Gowan MJ, Wagmiller RL, Carter AS. The impact of neighborhood, family, and individual risk factors on toddlers' disruptive behavior. Child Dev. (2014) 85:2046–61. doi: 10.1111/cdev.12251

PubMed Abstract | Crossref Full Text | Google Scholar

106. Sacco R, Camilleri N, Eberhardt J, Umla-Runge K, Newbury-Birch D. A systematic review and meta-analysis on the prevalence of mental disorders among children and adolescents in Europe. Eur Child Adolesc Psychiatry. (2024) 33:2877–94. doi: 10.1007/s00787-022-02131-2

PubMed Abstract | Crossref Full Text | Google Scholar

107. Eme RF. Selective females affliction in the developmental disorders of childhood: a literature review. J Clin Child Psychol. (1992) 21:354–64. doi: 10.1207/s15374424jccp2104_5

Crossref Full Text | Google Scholar

108. Gushue K, Mccuish EC, Corrado RR. Developmental offending patterns: female offending beyond the reference category. Crim Justice Behav. (2021) 48:139–56. doi: 10.1177/0093854820966729

Crossref Full Text | Google Scholar

Keywords: conduct problems, childhood, Random Forest, family variables, individual variables, explainability

Citation: Romero E, González-González J, Álvarez-Voces M, Costa-Montenegro E, Díaz-Vázquez B, Busto-Castiñeira A, Villar P and López-Romero L (2025) Leveraging Random Forests explainability for predictive modeling of children's conduct problems: insights from individual and family factors. Front. Public Health 13:1526413. doi: 10.3389/fpubh.2025.1526413

Received: 07 January 2025; Accepted: 19 May 2025;
Published: 12 June 2025.

Edited by:

Michelle Plusquin, University of Hasselt, Belgium

Reviewed by:

Sergio Alejandro Rodriguez Jerez, Sergio Arboleda University, Colombia
Cristian Villanueva-Bonilla, Corporación Universitaria Empresarial Alexander von Humboldt, Colombia

Copyright © 2025 Romero, González-González, Álvarez-Voces, Costa-Montenegro, Díaz-Vázquez, Busto-Castiñeira, Villar and López-Romero. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jaime González-González, amFpbWVnb256YWxlekBndGkudXZpZ28uZXM=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.