- 1Department of Biostatistics, National Institute of Mental Health and Neuro Sciences, Bengaluru, Karnataka, India
- 2Department of Clinical Psychology, National Institute of Mental Health and Neuro Sciences, Bengaluru, Karnataka, India
- 3Department of Psychiatry, National Institute of Mental Health and Neuro Sciences, Bengaluru, Karnataka, India
Commonly used linear regression focuses only on the effect on the mean value of the dependent variable and may not be useful in situations where relationships across the distribution are of interest. This study aimed to appraise the utility of Quantile Regression (QR), a technique that can model any quantile value of the dependent variable. The primary aim of this study is to provide an overview of the QR method and its practical applications in psychiatric research. We demonstrated this with an exploratory analysis of the data on neuropsychological test performance among 119 subjects with obsessive-compulsive disorder (OCD). The varying effects of age, education, sex, antipsychotic use, and symptom severity between extreme quantiles were highlighted using simple and multiple QR models. While linear regression is easy to employ and interpret, QR is not only on par in performance but also more flexible in identifying a set of factors that may differ depending on the quantile of interest. QR analysis is a potent tool in applications where the effect of the independent variable varies depending on the values of the outcome variable. The results of this exploratory study suggest that the QR approach could potentially help explore inconsistent findings, generate future hypotheses, and/or provide possible interpretive frameworks for inconsistencies observed in neuropsychological research in OCD. As the QR offers a complete distributional analysis, it is valuable in providing new insights, especially in situations where the usual regression assumptions are violated or when interested the in extreme values of the outcome of interest.
1 Introduction
Linear regression, the most widely used method among a broad family of regression methods, models a continuous dependent variable (DV) as a function of categorical or continuous independent variables (IV). Ordinary least squares (OLS), a technique for estimating regression coefficients, is the most appealing because of its simplicity in interpretation, practical utility, and well-developed theoretical framework (1). However, the validity and reliability of the results with OLS regression are largely dependent on assumptions such as normality and homogeneity (equal variances) of error terms. The estimates can be misleading, and the statistical conclusions drawn may be invalid in OLS regression if these assumptions are not met (2, 3). The use of the OLS method without testing assumptions and despite violations of assumptions is common (4) as it is commonly available to employ in popular statistical software packages.
In general, conditional mean regression models, such as linear regression, reveal the effect of IVs only on the mean (representing the central location of a distribution) change in the DV but not on other values/locations of its distribution. Sometimes, it is important to see the effects of IV on non-central locations like quartiles or a certain specific extreme part of the distribution, such as the 5th or 95th percentile. Because conditional mean models fail to model quantiles, they provide an incomplete picture of the possibly different magnitude of the effect of an IV across the distribution of the DV. They invariably assume the same effect throughout the distribution; however, it is not necessary for a variable to have an equal effect on all parts, as it may depend on the values of the response variable. While assuming so may lead to missing out on important differential effects, treating them to have variable effects may reveal a different set of covariates for different parts of the distribution (5, 6).
Alternatives to traditional linear regression continue to intrigue applied statisticians and methodological researchers because of the limitations of the classical linear regression technique (7). Based on Mosteller and Tukey’s (8) idea of extending conditional mean models, Koenker and Bassett (9) proposed a concept of Quantile Regression (QR) to fit regression curves to specified parts of the distribution other than mean. QR is a natural extension of conditional mean models to other locational parameters, and it aims to estimate and conduct inference about any conditional quantile values in place of the mean (9). A special case of QR is called median regression, which is most appropriate in the case of a skewed distribution. QR coefficients describe how a specified quantile of the response changes per unit change in the predictor, as opposed to the mean change in the traditional OLS regression.
QR effectively deals with the problem of expressing the relationship with only one regression curve, as in OLS. This is especially useful when the DV is heavy-tailed/non-normally distributed, if the effects of covariates differ at different tails of the distribution, and when the extreme values (low and high) are important and/or are non-ignorable. For example, the data on neuropsychological performance of individuals often violate assumptions as they show higher variability, and it is difficult to apply many of the standard statistical procedures in such situations (10). It has also been reported that the results involving such variables are inconsistent across different studies (11). For instance, neuropsychological research on obsessive–compulsive disorder (OCD) is marked by inconsistent and heterogeneous findings despite decades of study (12). Given the emphasis placed on neuropsychological functions in clinical and functional recovery in psychiatric disorders (13), statistical methods may be useful in exploring potential sources of inconsistencies across studies. Neuropsychological dysfunction in OCD is significant and may persist despite clinical remission (12, 14) and predict treatment outcomes (15, 16). Neuropsychological findings are linked to the cortico-striatal-thalamo-cortical circuit, which is posited to underpin the clinical symptoms of OCD (17).
Some studies (18–20) have emphasized the utility of QR in behavioral sciences using real-life and simulated datasets. However, older methods, such as OLS regression, predominate the literature despite their limited applicability in specific situations. The main objective of this study is to provide an overview of the QR method and its practical applications using open-source R software. The secondary aim is to demonstrate the utility of this method with a strictly exploratory analysis of the data from neuropsychological test performances among individuals with OCD. This manuscript is primarily intended for methodologists and methodologically oriented clinical researchers; therefore, the clinical analyses are presented strictly as an illustrative case study.
2 Methodology
2.1 Estimation in QR
The unknown parameters in linear regression models are usually estimated using methods such as OLS, based on minimizing the sums of squared residuals. The idea of minimizing squared residuals can be extended to minimize absolute residuals to yield a line with minimum absolute error. QR methods offer an equally convenient mechanism for estimating models for the full range of conditional quantiles. The minimization of QR is reformulated as a linear programming problem and solved using algorithms. Several existing algorithms have been efficiently implemented using new-generation computers and software (21). For any quantile τ ∈ (0, 1) defined on the distribution of the DV, the quantity β(τ) is called the τth regression quantile.
2.2 Statistical inferences in QR
The methods for computing confidence intervals and statistical inferences can be classified as direct methods based on the asymptotic normality of the estimated regression quantiles, the rank-score method based on the inversion of the rank-score test, and the resampling methods, which use the bootstrap technique and other procedures. The widely used goodness of fit of a linear regression using R2 was extended to QR models by Koenker and Machado (22) via a pseudoR2 measure keeping in view of the difference in the estimation technique. Similar to the usual R2, it ranges between 0 and 1, with a higher value indicating a better model fit, but it only pertains to the particular quantile; hence, each fitted QR model will have a separate pseudo R2 value. The same measure can be used to test the significant difference between nested models, which will yield an estimate of the relative pseudoR2. In addition, information criteria such as Akaike (AIC) and Bayes (BIC) can be used to compare the overall model fit. The Likelihood Ratio Test (LRT) can be used to measure and verify the inclusion/exclusion of one or a set of variables under the QR framework, unlike in OLS, where different thresholds, such as p-values, are used to develop the best model.
Although the QR method addresses the issue of varying covariate effects across quantiles by estimating different regression lines, it is also important to test the equivalence of a pair of QR lines across quantiles corresponding to the same covariate. This test can determine whether all of the conditional quantile functions have the same intercept/slope parameter. An Analysis of Variance (ANOVA) type of scenarios is derived to test hypotheses about the equivalence of estimates across quantiles using asymptotic results (21). There are also newer methods that look beyond traditional p-value based interpretations, such as quantile loss approaches, which measure the model performance by evaluating calibration, discrimination, and overall performance of quantile predictions from the fitted model (23).
2.3 Data and study variables
Real-life data from a previously published cross-sectional study were used to demonstrate the utility of the QR approach. The participants were recruited from the OCD Clinic, Behavioral Medicine Unit, and Department of Psychiatry, National Institute of Mental Health and Neuro Sciences, Bengaluru, India, between January 2008 and 2010 (24). The subjects were diagnosed with OCD according to the DSM-IV criteria based on a detailed clinical interview using a semi-structured pro forma and the Mini International Neuropsychiatric Interview Plus (MINI Plus version 5.0). For this exploratory analysis, selected variables from a comprehensive battery of neuropsychological tests, such as attention, executive functions, memory, and intelligence, were used. Participants with complete data on all sociodemographic, clinical variables, and test scores (n = 119) were included in the current study. The exclusion and inclusion criteria for the participants and variable operationalization/definitions for the data utilized in this study are described in the supplemental section and elsewhere (24). Although the data were collected between 2008 and 2010, they were used here to address the primary objective of demonstrating the methodological application of Quantile Regression rather than to derive time-sensitive clinical conclusions.
2.4 Comparison of regression approaches
Linear regression is an obvious method of choice for identifying the factors that affect performance. Because of the skewed distribution of the DV and the importance of assessing the extreme parts of the DV, both conditional mean regressions using OLS and QR models at every 10th quantile ranging from 0.1 to 0.9 along with the 5th, 25th, 75th, and 95th quantiles were fitted. Bivariate and multivariate QR models were fitted with each neuropsychological performance as the dependent variable. Bootstrap-based confidence intervals were calculated for the coefficients and tested for their statistical significance. The models also included current age and sex as they are common influencing factors for neuropsychological scores (10), along with other variables such as years of education, age of onset of OCD, symptom severity scores, and drug use. Conditional normality and homogeneity of variances for the OLS model were tested using the Shapiro–Wilk and Breush–Pagan tests, respectively. An ANOVA-based approach was used to test the equality of the QR coefficient fit for two different quantiles. Model fits were assessed using measures such as AIC and pseudo-R2.
All the estimations and inference procedures stated above for the empirical analysis in this study were carried out using open-source software R (25), where most QR-related functions are available in the quantreg package (26).
3 Results
The patients’ profiles in terms of sociodemographic and selected clinical characteristics are displayed in Supplementary Table 1. The scores on the selected tests showed a high range of variability and skewed distributions, as shown in Supplementary Figure 1 and Supplementary Table 2. Many subjects scored very high/low, deviating from the average score, and a formal Shapiro–Wilk’s test for these indicated a non-normal distribution in most cases.
The results of the model with only age as an IV in modeling digit span test (DST) scores are presented in Figures 1A, B, whereas Table 1 shows the results in numbers along with results of other variables. The variable DST was selected to explain the results of the QR model, and a similar explanation holds for the other variables. The scatterplot of age with DST scores shows the red line indicating the OLS regression fit with its estimated confidence intervals superimposed with QR lines for each selected quantile. Figure 1B shows the estimates and confidence intervals of the regression coefficients across quantiles, which helps examine the trend via visualization. The non-significant regression coefficient using OLS (b = −0.09, p = 0.09) regression indicates a marginal decrease in DST scores for a yearly increase in age, but the test of residuals indicates a violation of the assumption of normality (p = 0.002). In contrast, the coefficients from the QR model change from a smaller, non-significant but positive value at the 5th quantile (b = 0.05, p = 0.11) to a higher, significant negative value at the 95th (b = −0.36, p =<0.001). The effect of age on DST is averaged out in OLS and QR, revealing that an increase in age leads to a greater decline in DST scores along the 95th quantile of DST values than along the 5th quantile.
Figure 1. (A) Simple regression models with effect of age on DST scores. (B) Change in QR coefficients across various quantiles.
Table 1. Simple QR model—comparison of OLS and QR across selected quantiles for the model with age as independent variable.
The differential effect of age with respect to other test scores as dependent variables is displayed in Table 1. Selected quantiles are displayed for brevity, although 13 different QR models (every 10th quantile along with the 5th, 25th, 75th, and 95th) along with OLS were fit for comparison and visualization. Supplementary Figure 2 summarizes the results of all such QR and OLS models, where most of the OLS models violate the assumption of normality of residuals and homoskedasticity. Variables such as DST, CTT2, and WCST_PR showed an increasing absolute effect of age along the higher quantiles. Age had no effect on WCST_TTCC1 at either extremes, with variations showing only in the middle quantiles. The OLS regression coefficients miss these differential effects and provide an incomplete picture by averaging them out, as can be clearly seen from these results.
Although we can fit as many QR as we may need, the coefficients may not be statistically significant between the two fitted quantiles. To test the equivalence and statistical significance between a pair of QR coefficients corresponding to the same covariate, an ANOVA approach was used, and the results for such tests conducted between the 5th vs. 95th and 25th vs. 75th quantiles for the model with age as a predictor are presented in Table 2 (also includes results from multivariable models). Although the tested pairs of coefficients are not significantly different for most of the variables, the significant difference in DST scores for both pairs is highlighted. This further supports the interpretation of the change in the QR coefficient values across quantiles. It is also interesting to note that if a binary predictor variable, such as gender, is used in a simple QR model, a complete distributional comparison is possible, as opposed to a comparison of only the means (analogous to an independent samples t-test) in the case of OLS.
Table 2. Testing for equality of slope estimates across selected quantiles for bivariate model and between models with same variables across selected quantiles for multivariable models.
As the next step in the case of other regression model building processes, different sociodemographic and clinical factors that might affect neuropsychological performance, such as sex, age, education, duration of illness, OCD severity (YBOCS), depression rating scores (HDRS), CGI scores, and use of antipsychotics and benzodiazepines, were included in a multivariable model. Comparative graphical presentations of coefficients under separate bivariate and multivariable models for DST scores are shown in Figure 2. The right panel shows the coefficients from the adjusted model, and the left panel shows those from the unadjusted model. Adjusted for other variables, the difference in DST scores between sexes increased at higher quantiles, as did the effect of education and antipsychotic intake. The adjusted effects of the YBOCS, HDRS, and CGI were mostly non-significant, which showed a significant effect individually. The effects of education were predominant in both scenarios, indicating a strong association with the scores. Specifically, the positive effects of education are high among higher quantile values. The differential effects of medications were also apparent in the QR analysis. Participants with higher scores were found to have a larger effect with the use of antipsychotic drugs. Furthermore, the supplementary sections (Supplementary Figure 3; Supplementary Table 3) with the results for models with other test scores show similar trends. The pairwise testing between the two QR models is found to be significantly different between the two extreme quantiles, that is, the 5th vs 95th, as shown in Table 2. Consequently, the set of significant variables in a lower quantile can differ from that in a higher quantile.
Figure 2. Comparison of change in regression coefficients across quantiles between (A) simple and (B) multivariable models for DST.
The model fit was assessed using pseudo R2, and the values for each model are graphically represented in Supplementary Figure 4. The aim of this study here was not to establish a superior model but to illustrate how the QR can complement traditional mean-based approaches. Most models showed moderate goodness of fit, indicating the usefulness of IVs in explaining the variation in test scores. DST shows a poor fit at lower quantiles, but the fitness gradually increases for higher quantiles. Relatively better model fitness values were obtained for variables such as CTT1, CTT2, CFT_IR, and WCST_TTCC1 at higher quantiles.
4 Discussion
The overall purpose of this study was to employ, explore, and highlight the utilities of the QR method over traditional linear regression in exploring relationships. It is common to use linear regression even when the assumptions are violated (27) because of its ease of use and interpretation, alongside its availability in commonly used software packages. Using QR is advantageous in situations where the assumptions of linear regression are not met, and it is a default choice when the research question itself centers around the effect on the specific percentile of the variable of interest.
It is well known that most variables observed in empirical research fail to follow normality and are skewed in nature, necessitating the use of methods that do not assume normality. This is especially true in the case of social and behavioral sciences (10), where the expected phenotypic variability is high. The QR analysis in the present study showed that it may be useful for better understanding relationships, specifically non-normally distributed outcome variables. The simple QR model showed that the assumption of normality of errors was violated, and the effects of IV varied depending on the quantile being estimated. For example, age may have a significant effect on patients with higher DST scores but not on those with low scores. Linear regression fails to capture this difference because it averages these effects. Although it is easy to fit multiple QR models, it is important to assess the differences in QR coefficients statistically. The regression coefficients showed positive or negative trends in most cases, even though some of them were not statistically significant individually. This differential effect highlights the use of QR, and their significance is testable. However, significance testing in QR mostly relies on computer resampling or asymptotic approximations resting on distributional assumptions. The difficulty in interpreting multiple effects compared to a single measure of effect, as in OLS regression, is unavoidable (5). However, commonly used goodness-of-fit measures, such as the pseudo R-square and testing between two QR models via the ANOVA approach, as shown in the results, can be useful in model building and variable selection.
Most multivariable models showed different sets of significant predictors in our analysis. Supplementing well-established conditional mean models with QR to estimate an entire family of conditional quantile functions offers a much more complete statistical analysis of the relationships. Multivariable QR models can help address the fact that the variables that affect the lower parts of the distribution may not necessarily affect individuals at higher quantiles (28). It is worthwhile to explicitly find the set of covariates that affect a specific percentile (low or high) of the distribution to adopt a more specific approach based on the model. For example, if different treatments are to be compared for their effects on an outcome of interest, it is possible to determine whether the treatment effect is dependent on the level of the outcome itself. Such understanding can also help develop better models for predicting treatment response, as discussed by Katja et al. (16), and aid in deciding specific treatment approaches.
As a strictly illustrative case study of specific uses of QR in Psychiatry research, neuropsychological test performance in OCD has been explored. The neuropsychological literature on OCD is characterized by notable inconsistencies (11, 12, 29) that have not been satisfactorily explained by several clinical and demographic moderators such as age, education, gender, symptom subtypes, and severity of illness. While the variable CFT IR is most consistently known to be impaired in OCD, several others have shown inconsistent findings across studies. Our analyses show that, although CFT IR looks homogenous, varying QR coefficients across quantiles may help generate hypotheses about why CFT_IR appears consistently implicated across several studies. While the diagnosis of OCD itself may be linked to lower scores on the CFT IR compared to healthy controls (30), the impact of the severity of OCD (YBOCS score) seems to be most prominent at the lowest quantiles, possibly explaining why this finding seems to be the most consistently replicated across studies. In contrast, DST is not often reduced in OCD patients compared to controls (12). However, the QR suggests that the severity of OCD impacts attention in individuals performing at the highest quantiles. While attention may be impacted in subtle ways for most individuals with OCD, the impact of OCD severity may be most pronounced in high performers. The variables like DST, CTT2, and WCST_PR show an increasing absolute effect of age along the higher quantiles. The effect of age on WCST_TTCC1 appears in variations only in the middle quantiles. Despite symptomatic recovery, many individuals with OCD continue to experience cognitive deficits and accompanying socio-occupational dysfunction, necessitating targeted and personalized interventions for cognition. Despite some attempts at cognitive training in OCD, there has been limited success in generalizing improvements to socio-occupational functions (31). Analyses such as the QR presented here could potentially help explore the inconsistent findings, generate future hypotheses, and/or provide possible interpretive frameworks for such inconsistencies. A fine-grained, adequately powered analysis of neuropsychological findings using QR may generate a conceptual framework that provides richer insights that may help further research in this area. Furthermore, QR analyses may be better suited for the examination of individual differences in behavioral research and may make a meaningful contribution to the advancement of neuropsychological theory and the development of cognitive remediation/training interventions.
QR is a practical tool for analyzing complex relationships between variables because it extends beyond the scope of the central part of the distribution. While QR provides a complete picture of functional relationships, using QR to explore beyond the mean effects presents many new possibilities for statistical analysis and interpretation of relationships. There is an increasing trend in the use of QR in various fields of research, as can be seen by a simple PubMed search, and using QR for analyzing behavioral data can add to the existing knowledge substantially. Although it has begun to be used in many scientific applications, many researchers remain unaware of it because it was developed relatively recently and is rarely taught in statistics courses (32). The popularity of a statistical method is also dependent on its availability in software packages, and only a handful of software packages, such as R, SAS, and Stata, provide options to fit QR models. Advancements in computation techniques and the development of more user-friendly packages in open-source software, such as R, have made it easy to implement these methods.
5 Limitations
Several limitations should be considered when interpreting the findings of this study, considering its illustrative and exploratory nature. The use of a cross-sectional design and restricted geographic scope limits the ability to infer any causal relationships and the generalizability of the findings to broader or more diverse populations. Caution should be exercised when interpreting the effects of variables across quantiles, as cross-sectional studies typically reveal only conditional associations and not causal effects. The modest sample size (n = 119) may limit the statistical power to detect differences across quantiles, along with the stability of the estimates and generalizability of the findings. Furthermore, multiple testing corrections were not applied in this study, because of its exploratory objective. However, such corrections, such as simultaneous quantile regression (QR) or methods based on family wise error rate (FWER) or false discovery rate (FDR), should be considered when the goal is to identify the best-fitting model. Despite these limitations, the study provides an illustrative demonstration of QR applied to real clinical data and may help guide future methodological and empirical work, particularly longitudinal studies or those including broader and more diverse samples with comprehensive control of confounding factors and an appropriate sample size.
Data availability statement
The data analyzed in this study is a secondary dataset from a previous study at the National Institute of Mental Health and Neuro Sciences (NIMHANS), Bengaluru, India and it was used with necessary institutional permissions. The dataset is not readily available because of the sensitive nature of the information provided by the participants. Requests to access these datasets should be directed to Dr. Ravi GS at cmF2aWdzMTk4OEBnbWFpbC5jb20=.
Ethics statement
The studies involving human data were approved by National Institute of Mental Health and Neuro Sciences, NIMHANS. Bengaluru (NIMH:ES:Ph.D(BS): 2012-15:RGS). The study was conducted in accordance with the local legislation and institutional requirements.
Author contributions
RS: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing, Project administration, Resources, Supervision, Validation. TK: Conceptualization, Methodology, Supervision, Writing – review & editing. HK: Data curation, Investigation, Validation, Writing – review & editing. YR: Data curation, Investigation, Validation, Writing – review & editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Acknowledgments
Research fellowship from the Indian Council of Medical Research for the PhD degree of the first author as this submission is a part of his PhD work.
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1632001/full#supplementary-material
References
1. Montgomery D and Vining PE. Introduction to Linear Regression Analysis:. SE - Wiley series in probability and statistics. Hoboken, New Jersey: John Wiley & Sons, Inc (2013).
4. Ernst AF and Albers CJ. Regression assumptions in clinical psychology research practice-a systematic review of common misconceptions. PeerJ. (2017) 5:e3323. doi: 10.7717/peerj.3323
5. Michael AJ, Antonisamy B, Gowri SM, and Prakash R. Modelling the risk factors for birth weight in twin gestations: A quantile regression approach. J Biosoc Sci. (2017) 49(6):744–56. doi: 10.1017/S0021932017000050
6. Castellano KE and Ho AD. Contrasting OLS and quantile regression approaches to student “Growth” Percentiles. J Educ Behav Statistics. (2013) 38:190–215. doi: 10.3102/1076998611435413
7. Nevitt J and Tam HP. A comparison of robust and nonparametric estimators under the simple linear regression model. Multiple Linear Regression Viewpoints. (1998) 25:54–69.
8. Mosteller F and Tukey J. Data analysis and regression: a second course in statistics. In Addison-Wesley Series in Behavioral Science: Quantitative Methods (1st ed.)
9. Koenker RW and Bassett G. Regression quantiles. Econometrica. (1978) 46:33–50. doi: 10.2307/1913643
10. Sherwood B, Zhou AXH, Weintraub S, and Wang L. Using quantile regression to create baseline norms for neuropsychological tests. Alzheimer’s Dementia: Diagn Assess Dis Monitor. (2016) 2:12–8. doi: 10.1016/j.dadm.2015.11.005
11. Abramovitch A, Abramowitz JS, and Mittelman A. The neuropsychology of adult obsessive-compulsive disorder: A meta-analysis. Clin Psychol Rev. (2013) 33(8):1163–71. doi: 10.1016/j.cpr.2013.09.004
12. Kashyap H and Abramovitch A. Neuropsychological research in obsessive-compulsive disorder: current status and future directions. Front Psychiatry. (2021) 12:1864. doi: 10.3389/fpsyt.2021.721601
13. Woo YS, Rosenblat JD, Kakar R, Bahk WM, and McIntyre RS. Cognitive deficits as a mediator of poor occupational function in remitted major depressive disorder patients. Clin Psychopharmacol Neurosci. (2016) 14:1–16. doi: 10.9758/cpn.2016.14.1.1
14. Suhas S and Rao N. Neurocognitive deficits in obsessive-compulsive disorder: A selective review. Indian J Psychiatry. (2019) 61:S30–6. doi: 10.4103/psychiatry.IndianJPsychiatry_517_18
15. D’Alcante CC, Diniz JB, Fossaluza V, Batistuzzo MC, Lopes AC, Shavitt RG, et al. Neuropsychological predictors of response to randomized treatment in obsessive-compulsive disorder. Prog Neuropsychopharmacol Biol Psychiatry. (2012) 39:310–7. doi: 10.1016/j.pnpbp.2012.07.002
16. Hybel KA, Mortensen EL, Lambek R, Højgaard DRMA, and Thomsen PH. Executive function predicts cognitive-behavioral therapy response in childhood obsessive-compulsive disorder. Behav Res Ther. (2017) 99:11–8. doi: 10.1016/j.brat.2017.08.009
17. van den Heuvel OA, van Wingen G, Soriano-Mas C, Alonso P, Chamberlain SR, Nakamae T, et al. Brain circuitry of compulsivity. Eur Neuropsychopharmacol. (2016) 26:810–27. doi: 10.1016/j.euroneuro.2015.12.005
18. Davino C and Vistocco D. Quantile regression for the evaluation of student satisfaction. Statistica Applicata. (2008) 20:179–96.
19. Radford S, Eames C, Brennan K, Lambert G, Crane C, Williams JMG, et al. Trait mindfulness as a limiting factor for residual depressive symptoms: An explorative study using quantile regression. PloS One. (2014) 9(7):e100022. doi: 10.1371/journal.pone.0100022
20. Kazhungil F, Kumar KJ, Viswanath B, Shankar RG, Kandavel T, Math SB, et al. Neuropsychological profile of schizophrenia with and without obsessive compulsive disorder. Asian J Psychiatr. (2017) 29:30–4. doi: 10.1016/j.ajp.2017.04.004
21. Davino C, Furno M, and Dominico V. Quanitle regression theory and applications. United Kingdom: Wiley (2014).
22. Koenker R and MaChado JAF. Goodness of fit and related inference processes for Quantile Regression. J Americal Stat Assoc. (1994) 94:1296–310. doi: 10.1080/01621459.1999.10473882
23. Mabire-Yon R. Unravelling the nuances of data with quantile regression: A comprehensive tutorial. Int J Psychol. (2025) 60:e70006. doi: 10.1002/ijop.70006
24. Kashyap H, Kumar JK, Kandavel T, and Reddy YCJ. Neuropsychological correlates of insight in obsessive–compulsive disorder. Acta Psychiatr Scand. (2012) 126:106–14. doi: 10.1111/j.1600-0447.2012.01845.x
27. Petscher Y and Logan JAR. Quantile regression in the study of developmental sciences. Child Dev. (2014) 85:861–81. doi: 10.1111/cdev.12190
28. Brennan A, Cross PC, and Creel S. Managing more than the mean: Using quantile regression to identify factors related to large elk groups. J Appl Ecol. (2015) 52:1656–64. doi: 10.1111/1365-2664.12514
29. Snyder HR, Kaiser RH, Warren SL, and Heller W. Obsessive-compulsive disorder is associated with broad impairments in executive function: A meta-analysis. Clin psychol Sci. (2015) 3(2):301–30. doi: 10.1177/2167702614534210
30. Kashyap H, Kumar JK, Kandavel T, and Reddy YCJ. Neuropsychological functioning in obsessive-compulsive disorder: Are executive functions the key deficit? Compr Psychiatry. (2013) 54:533–40. doi: 10.1016/j.comppsych.2012.12.003
31. Bhattacharya M, Kashyap H, and Reddy YCJ. Cognitive training in obsessive-compulsive disorder: A systematic review. Indian J psychol Med. (2024) 46:110–8. doi: 10.1177/02537176231207781
Keywords: quantile regression, linear regression, neuropsychological tests, obsessive-compulsive disorder, non-normal distribution
Citation: Shankar RG, Kandavel T, Kashyap H and Reddy YCJ (2026) A case study on using quantile regression in psychiatry research. Front. Psychiatry 16:1632001. doi: 10.3389/fpsyt.2025.1632001
Received: 18 June 2025; Accepted: 24 November 2025; Revised: 24 November 2025;
Published: 05 January 2026.
Edited by:
Samer El Hayek, American Center for Psychiatry and Neurology, United Arab EmiratesReviewed by:
Mauro P. Olivera, Santo Tomás University, ChileCynthia Marisol Vargas, UAT UAMRA, Mexico
Renaud Mabire-Yon, Université de Lille, France
Copyright © 2026 Shankar, Kandavel, Kashyap and Reddy. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ravi G. Shankar, cmF2aWdzMTk4OEBnbWFpbC5jb20=