- 1Sport Science School, Beijing Sport University, Beijing, China
- 2Sports and Health Laboratory, Beijing Xiaomi Mobile Software Co., Ltd., Beijing, China
- 3School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
Objective: This study aimed to investigate whether pre-sleep heart rate variability (HRV) could predict chronic insomnia (CI) and sleep quality in male national-level team-based athletes.
Methods: A total of 174 athletes participated in this study, including 98 with CI and 76 exhibiting normal sleeping patterns. Pre-sleep HRV was assessed using heart rate chest straps, and sleep quality was evaluated through polysomnography (PSG) before a single night’s sleep. Binary logistic regression was first used to predict CI. Multiple linear regression and multi-layer perceptron (MLP) neural network models were then used to predict measures of sleep quality.
Results: Binary logistic regression revealed that measures of pre-sleep HRV accurately predict CI (R2 = 0.902 and 96% accuracy, AUC = 0.997). Multiple linear regression showed that pre-sleep HRV had a moderate predictive capacity for time awake (R2 = 0.526, P < 0.001) and sleep efficiency (R2 = 0.481, P < 0.001). The multiple linear regression model’s predicted values for sleep onset latency (r = 0.459, P < 0.01), sleep efficiency (r = 0.554, P < 0.001), and deep sleep time (r = 0.536, P < 0.001) showed moderate positive correlations with the corresponding actual values, whereas the MLP neural network’s predictions were not significantly correlated with the actual values. In contrast, the MLP neural network model was superior at predicting time awake when compared to the multiple linear regression model (MLP: mean absolute percentage error = 0.182 vs. Multiple linear regression: mean absolute percentage error = 0.516).
Conclusion: The present findings support the use of pre-sleep HRV not only to predict CI, but also some sleep continuity measures in national level athletes.
1 Introduction
Chronic insomnia (CI) is the most common sleep disorder, affecting approximately 10% of the population (Morin and Jarrin, 2022). It is characterized by persistent difficulty falling or staying asleep at least three times per week, for a period of 3 months or longer (Perlis et al., 2005). Elite athletes have a higher prevalence of sleep disorders than the general population due to overtraining, inadequate recovery, travel and performance pressure (Walsh et al., 2021). It is estimated that 13%–70% of athletes experience some form of sleep disruption, with 22%–26% suffering from severe sleep issues (Gupta et al., 2017). This is highly problematic given that poor sleep impairs athletic performance by reducing endurance, strength, speed, and reaction time, while increasing perceived effort and the risk of injury (Charest and Grandner, 2022). It also disrupts cognitive function, such as attention and decision-making, whilst also slowing recovery by altering hormonal balance and immune responses (Walsh et al., 2011). Therefore, finding strategies that assist with the early detection of CI and poor sleep quality remains a key focus for coaches, practitioners and sport scientists.
Commonly used tools for detecting sleep disturbances in athletes include the Pittsburgh Sleep Quality Index (PSQI) and actigraphy (Walsh et al., 2021). While these methods are practical and offer useful insights into sleep quality, polysomnography (PSG) remains the “gold standard” for the accurate diagnosis of sleep disorders as it provides more precise measurements of sleep architecture and disturbances through directly recording brain activity, eye movements, and muscle tone during sleep (Sateia, 2014). However, solely relying on sleep monitoring may delay the early diagnosis and treatment of long-term sleep conditions, such as CI (Garbarino and Bragazzi, 2024). Therefore, establishing prediction models for CI based on PSG data is essential, as they may offer a promising tool for the early diagnosis of CI. This would enable the personalized timing of CI treatment for individual athletes before significant declines in athletic performance occur.
Heart rate variability serves as a useful measure of autonomic nervous system function (Lehrer and Gevirtz, 2014), particularly in understanding sleep-related neural regulation through the balance between the sympathetic and parasympathetic branches (Li et al., 2024). Enhanced parasympathetic activity, reflected by higher high-frequency components of HRV, has been associated with the onset of sleep and a higher proportion of deep sleep (Grimaldi et al., 2019). In contrast, increased sympathetic activity is linked to difficulties in falling asleep and a higher proportion of light sleep (Michael et al., 2017). Long-term exercise training at appropriate frequencies, intensities, and volumes can lead to adaptive changes in HRV, whereas overtraining has been shown to have maladaptive effects on HRV (Marzbani et al., 2016). Notably, pre-sleep HRV biofeedback or relaxation techniques, such as controlled breathing and meditation, have been found to improve sleep quality by modulating HRV (Marzbani et al., 2016). These findings suggest that HRV may not only influence sleep regulation but could also have a role as a physiological predictor of sleep quality and CI.
This study aimed to investigate whether pre-sleep HRV could predict CI and various indicators of sleep quality in male national level team-sport athletes. Specifically, we sought to determine whether pre-sleep HRV is associated with the presence of CI and with key indicators of sleep quality. Additionally, we aimed to compare the predictive effectiveness of different modeling approaches. It was hypothesized that measures of pre-sleep HRV would serve as reliable predictors of CI and sleep quality outcomes.
2 Methods
2.1 Ethical approval
The study was approved by the Ethics Committee of the Beijing Sport University Sports Science Laboratory (Ethics Approval Number: 2023277H). All participants were fully informed of the experimental requirements and procedures and provided written informed consent prior to participation. This study was carried out in accordance with the principles outlined in the Declaration of Helsinki, except for prior registration in a database.
2.2 Participants
The sample size for this study was determined based on a multiple linear regression approach with a fixed model (G*Power 3.1), with 12 predictors R2 representing the deviation from 0. Calculations were based on correlations between sleep onset latency (SOL), time awake (TA), and pre-sleep HRV (Li et al., 2022). The study was powered a priori to detect effect sizes of 0.3, with α = 0.05 and β = 0.95. This resulted in a minimum sample size of 23 for SOL and 82 for TA. To account for attrition, recruitment was increased by 20%, leading to a required minimum sample size of 102.
Only male national-level athletes (Tier 3) (McKay et al., 2022), between the ages of 18–25 years, who participated in team sports were recruited for the study. The inclusion criteria for CI athletes were based on the International Classification of Sleep Disorders-Third Edition (ICSD-3) (Sateia, 2014) and a Pittsburgh Sleep Quality Index (PSQI) score of >7. In addition, athletes were required to have no recent history of psychological stress (e.g., family accidents, unemployment, major social conflicts), and potential sleep disturbances caused by emotional disorders. Anxiety and depression were assessed through the Hamilton Anxiety Rating Scale (HAMA) and Hamilton Depression Rating Scale (HAMD) with scores <7 required to enroll onto the study. Participants were not taking psychiatric medications (e.g., sleeping medication), did not habitually nap in the day, and were regularly training.
In total, 174 male national-level athletes were enrolled (age: 20 ± 1 year; body mass: 76 ± 9 kg; height: 183 ± 8 cm; training: 4–5 times/week; sport-specific training duration: 470 ± 30 min/week) onto the study. The types of team sports included basketball, soccer, volleyball, rugby and ice hockey. Data collection was conducted between November 2023 and November 2024 as this was off season and therefore avoided long-distance travel, which would cause jet lag or circadian rhythm disruption, and irregular training schedules.
2.3 Experimental protocol
Participants completed inclusion and exclusion screening, informed consent forms, and were briefed on experimental procedures. To avoid non-standard sleep patterns, participants slept 1–2 nights in the sleep laboratory with monitoring equipment prior to commencing the study (Hu et al., 2024). To ensure participants were not experiencing sleep disturbances prior to the study, which included two laboratory familiarization nights, we compared PSQI scores from the post-familiarization period to those at enrollment, with each score reflecting the preceding 1-month period. On testing days, athletes avoided intense physical activity for 24 h. Athletes were also advised to refrain from consuming foods and beverages known to disrupt sleep, such as caffeine and spicy foods and to limit excessive screen time by avoiding the use of devices such as mobile phones and laptops over the 4 h before bedtime. Participants arrived at the sleep laboratory 45–60 min before their usual bedtime. Prior to sleep, anthropometric data were collected and HRV was measured before the PSG was attached for overnight sleep monitoring. During sleep monitoring, participants were not restricted to specific sleep or wake times and were instructed to fall asleep and wake up naturally.
2.4 Sleep quality monitoring and evaluation
2.4.1 Subjective sleep quality
The PSQI was used to assess sleep quality (BUYSSE and REYNOLDS, 1989) and included seven components: subjective sleep quality, sleep latency, sleep duration, sleep efficiency, sleep disturbances, use of sleeping medication, and daytime dysfunction. Each component is scored on a scale from 0 to three and the total PSQI score was calculated by summing the scores of all components. As such, scores can range from 0 to 21, with higher scores indicating greater sleep impairment (Mollayeva et al., 2016).
2.4.2 Objective sleep quality
A PSG system (Natus Medical Inc., San Carlos, Canada) recorded sleep using electroencephalogram (EEG) (C4), electrooculogram (EOG) (below the left and above the right eye), and electromyogram (EMG) (bilateral submental muscles) signals at 256 Hz. Sleep analysis was performed with RemLogic-E™ software, following AASM Version 2.0 criteria (Berry et al., 2012). Data were segmented into 30-s epochs, with signals filtered at 0.3–35 Hz (EEG), 10–100 Hz (EOG, EMG), and a 50 Hz notch filter applied. CI criteria followed ICSD-3 standards (SOL >30 min and/or TA >30 min) (Sateia, 2014). Sleep quality measures included: SOL (time from “start” to sleep onset), TST (total sleep time, including 4–6 cycles), LST (time in stages 1 and 2 NREM), DST (time in stages 3 and 4 NREM), REM (time in REM), AT (time awake after sleep onset), and SE (sleep efficiency, ratio of actual sleep time to time in bed) (Berry et al., 2012).
2.4.3 HRV measurement and analysis
Heart rate variability is commonly used to assess the balance between sympathetic and parasympathetic nervous activity (Santos-De-Araújo et al., 2022). In accordance with established guidelines, HRV was measured at the same time of day before sleep for a period of 5 min in a quiet room, whilst participants were awake, lying supine and breathing spontaneously (Laborde et al., 2017). R–R intervals (the time between R waves in two QRS complexes) were recorded for no less than 10 min using a Polar H10 heart rate chest strap (Polar Electro, Finland). The data were processed and analyzed using the Kubios HRV-standard software, extracting 5-min segments to calculate HRV and assess autonomic nervous status. This included measures of Mean R-R (MRR), Root Mean Square of Successive RR Interval Differences (RMSSD), SDNN (Standard Deviation of NN Intervals), TINN (Triangular Interpolation of NN Interval Histogram), PNN50% (Percentage of Successive NN Intervals that Differ by More than 50 m), High Frequency (HF), Mean Heart Rate (MHR), Stress Index (SI), Low Frequency (LF), LF/HF Ratio (Ratio between sympathetic and parasympathetic nervous system activity). Approximate Entropy (ApEn) and Sample Entropy (SampEn): Non-linear measures that evaluate the complexity and irregularity of HRV patterns.
2.4.4 HAMA and HAMD scales
The HAMA (Thompson, 2015) and the HAMD (Zimmerman et al., 2013) scales are commonly used clinical instruments for assessing levels of anxiety and depression. In this study, these scales were used to exclude participants with sleep-related complications caused by anxiety or depressive symptoms. The HAMA consists of 14 items, and the HAMD consists of 17 items. Both scales use a five-point Likert scoring system (0–4), corresponding to “no symptoms,” “mild symptoms,” “moderate symptoms,” “marked symptoms,” and “severe symptoms,” respectively.
2.5 Model development
Binary logistic regression model: CI was initially diagnosed based on the ICSD-3 criteria, alongside PSG data. CI status (1 = CI present, 0 = no CI) was treated as the dependent variable, and pre-sleep HRV variables, including HF, LF, LF/HF Ratio; Time-domain: MRR, RMSSD, SDNN, TINN, PNN50%, MHR, SI; Non-linear: ApEn and SampEn were used as independent variables for binary logistic regression analysis. Data were randomly divided into a training set (80%) and a validation set (20%) to perform a train-validation split. The stepwise method is employed for variable selection. In the model formula, p represents the probability of CI (0/1 = 1), and 1−p represents the probability of no CI (0/1 = 0). The Hosmer-Lemeshow (HL) test was used to evaluate the model’s goodness-of-fit. R2 values indicate the fit: The goodness-of-fit of the binary logistic regression model was assessed through using the R2 value. Potential multicollinearity among the independent variables was assessed using variance inflation factors (VIF), and variables with VIF values exceeding the threshold of 10 were excluded from the analysis to ensure the robustness of the model. To evaluate how well independent variables explained the dependent variable, model fit was classified using R2 as follows: poor fit (<0.2), low fit (0.2–0.5), moderate fit (0.5–0.8) and high level of fit (>0.8).
Multiple linear regression model: Data were first normalized (X-Min)/(Max-Min). HRV measures were set as independent variables, and measures of sleep quality were treated as dependent variables. Data were randomly divided into a training set (80%) and a validation set (20%) to perform a train-validation split. The R2 value ranges from 0 to 1, with higher values indicating better fit. The tolerance of the prediction equation must exceed 0.1, and the VIF must be less than 10, indicating no multicollinearity. The Durbin-Watson test values ranged from 0 to 4, demonstrating independence between variables and the absence of autocorrelation.
Neural network model: The Multi-Layer Perceptron (MLP) neural network model was constructed by randomly dividing data into a training set (80%) and a validation set (20%) to perform a train-validation split. The training data were normalized (X-Min)/(Max-Min) to ensure uniform data scaling. The activation function was ReLU, and weights were optimized using the Adam optimizer with an initial learning rate of 0.001. L2 regularization was set at 0.0001, the maximum number of iterations was 200, and the optimization tolerance was 0.0001. The current model architecture employed a single hidden layer comprising 100 neurons, which is a similar approach to earlier studies involving comparable data structures and sample sizes (Sheela and Deepa, 2013). To evaluate the robustness of this configuration, we also tested alternative architectures with varying numbers of hidden neurons (e.g., 50, 150, and 200) as well as models with multiple hidden layers. However, these modifications did not further enhance performance.
2.5.1 Model construction and validation
A train-validation split of the binary logistic regression, multiple linear regression, and neural network models were carried out by firstly randomly dividing data into a training set (80%) and validation set (20%). The validation methodology for both the MLP neural network and multiple linear regression models was executed as follows: The training data set was first used to construct the models, and the validation data set (HRV) was then incorporated into the formula obtained from the training set to generate the predicted values. Parameters of sleep quality measured by the PSG validation set were tested for consistency by comparing them to the sleep parameters predicted by the validation data set. The performance of multiple linear regression and MLP neural network models was evaluated and compared using the following metrics: Mean Absolute Error (MAE), Root Mean Squared Error (RMSE) and Mean Absolute Percentage Error (MAPE). Using the reserved 20% validation subset, the binary logistic regression model (fitted on the training set) was validated by applying its coefficients to the HRV data to compute event probabilities. These probabilities were converted to binary classifications using a decision threshold of 1 (1 = CI present, 0 = no CI). Model discrimination was assessed via the Area Under the Receiver Operating Characteristic Curve (AUC-ROC).
2.6 Statistical analysis
Data normality was first assessed using the Kolmogorov-Smirnov test. Pearson correlation coefficients were used to evaluate the relationship between observed and predicted sleep quality. The magnitudes of the correlations were classified as follows: trivial (<0.1), small (0.1–0.3), moderate (0.3–0.5), large (0.5–0.7), very large (0.7–0.9), and extremely large (>0.9) (Hopkins et al., 2009). Bland-Altman analyses were used to assess reliability. The mean and difference between observed and predicted sleep quality values were then calculated to establish the limits of agreement (95% distribution range), and 95% limits of agreement (95% LoA) were constructed (Giavarina, 2015). Paired t-tests were used to compare the adaptation of PSQI, while independent t-tests were used to compare the differences in HRV and sleep quality (PSG) between groups. Data are reported as mean ± standard deviation (M ± SD). The skew distribution is expressed as the median (upper quartile to lower quartile). Statistical significance was accepted as α < 0.05. All statistical analyses were performed using SPSS 26.0.
3 Results
3.1 Sleep and pre-sleep HRV characteristics
The data of 12 participants were excluded due to electrode detachment and erroneous PSG data, and nine did not complete the experiment. There were no significant differences between CI and normal sleep groups in age, height, or weight (P > 0.05). The absence of significant differences in PSQI scores between the screening phase and the month prior to study participation indicated that there were no differences in sleep disturbances and sleep quality prior to commencing the study. The PSQI scores of the CI group were significantly higher than those of the normal sleep group (CI group: 10.96 ± 2.13 vs. normal sleep group: 4.04 ± 1.69, P < 0.01), with our PSG data indicating that the prevalence of CI among male athletes in this study was 56% (95% CI: 49%–64%). When compared to the normal sleep group, the CI group showed an increased duration of SOL (P < 0.001, mean difference = 11.94 min, d = 0.68), and TA (P < 0.001, mean difference = 41.68 min, d = 2.28). In addition, there was also a decrease in the duration of TST (P < 0.001, mean difference = −42.94 min, d = 0.84), SE (P < 0.001, mean difference = −10.4%, d = 2.18), and DST (P < 0.001, mean difference = −3.7%, d = 1.133). Athletes with CI also exhibited lower values for a range of pre-sleep HRV measures when compared to the normal sleep group, including MRR (P < 0.001, mean difference = −213.68 m, d = 1.68), RMSSD (P < 0.001, mean difference = −49.35 m, d = 2.01), PNN50 (P < 0.001, mean difference = −41.84%, d = 2.86), ApEn (P < 0.001, mean difference = −0.23, d = 1.66), and SampEn (P < 0.001, mean difference = −0.35, d = 1.49) (Table 1).
3.2 Binary logistic regression model for the prediction of CI
Binary logistic regression analysis was conducted using measures of HRV as independent variables and the presence of CI (0/1) as the dependent variable (Table 2). The model automatically filters out RMSSD, PNN50, MHR, and SampEn, as they are all negatively correlated with CI (R2 = 0.902). The HL goodness-of-fit test indicated a good model fit (P = 1.000 > 0.05). The final model equation is as follows: ln (p/1-p) = 28.317–0.073*RMSSD-0.222*PNN50-11.436*SampEn. The accuracy of using HRV measures to predict CI was 96%, respectively (Table 3). Using the validation set, the integrated diagnostic model achieved an accuracy of 97.14% and an excellent discriminative ability (AUC = 0.997). At the optimal threshold (0.493), both sensitivity and specificity exceeded 96%, indicating strong clinical applicability. In contrast, individual HRV metrics (RMSSD, PNN50, and SampEn) performed at chance level, with AUCs ranging from 0.023 to 0.072, highlighting their limited diagnostic value when used in isolation.

Table 3. Accuracy of the binary logistic regression model for predicting insomnia based on pre-sleep HRV.
3.3 Multiple linear regression model for predicting sleep quality
A multiple linear regression model was developed using pre-sleep HRV measures as independent variables and sleep quality as the dependent variable (Table 4). All model variables satisfied the conditions of variable importance in projection (VIP <10), tolerance >0.1, and D-W statistics within the range of 0–4, indicating no multicollinearity issues. The model demonstrated a moderate fit for predicting TA (R2 = 0.526, P < 0.001) and SE (R2 = 0.481, P < 0.001). The remaining measures of TST, SOL, LST, DST and REM exhibited poor predictive values, with R2 values below 0.2.
The predictive equation was validated by applying HRV data from a validation set of 35 cases (20%) (Table 5). The actual and predicted values of SOL demonstrated a moderate positive correlation (P < 0.01), while the actual and predicted values of TA, SE, and DST showed a strong positive correlation (P < 0.01). No correlations were observed between the predicted and actual values for LST and REM (P > 0.05). The Bland-Altman scatterplot (Figure 1) revealed that the probability of agreement within the limits of consistency for all predicted values ranged from 91%–97%.

Figure 1. Bland-Altman Scatter Plot for Predicting Sleep Structure Using Multiple Linear Regression Models of Pre-sleep HRV. Abbreviations: (A) total sleep time, TST; (B) sleep onset latency, SOL; (C) time awake, TA; (D) sleep efficiency, SE; (E) light sleep time, LST; (F) deep sleep time, DST; (G) rapid eye movement, REM. The red line represents the upper limit of the 95% LoA (1.96 SD), while the blue line represents the lower limit of the 95% LoA (1.96 SD).
3.4 Multiple linear regression model for predicting sleep quality
The pre-sleep TA prediction values had a strong positive correlation with the measured values (P < 0.01), while the prediction values for other sleep parameters showed no significant correlation with their measured values (P > 0.05) (Table 6). The mean absolute error range between the measured and predicted values was 0.120–0.269, the root mean square error range was 0.157–0.299, the mean absolute percentage error range was 0–1.059, and the bias range was −0.174 to 0.028. The results of the consistency test indicated that the boundary probability ranges were between 94% and 97% (Figure 2).

Figure 2. Bland-Altman Scatter Plot for Predicting Sleep Quality Using Neural Network Model of Pre-sleep HRV Abbreviations: (A) total sleep time, TST; (B) sleep onset latency, SOL; (C) time awake, TA; (D) sleep efficiency, SE; (E) light sleep time, LST; (F) deep sleep time, DST; (G) rapid eye movement, REM. The red line represents the upper limit of the 95% LoA (1.96 SD), while the blue line represents the lower limit of the 95% LoA (1.96 SD).
4 Discussion
This study aimed to investigate whether pre-sleep HRV could predict CI and a range of sleep quality indicators in male national level team-sport athletes. We hypothesized that pre-sleep HRV would reliably predict both CI and sleep quality outcomes. Consistent with our hypothesis, the analysis using pre-sleep HRV measures within a binary logistic regression model demonstrated an accurate prediction of CI. Furthermore, stepwise multiple linear regression moderately predicted sleep continuity measures of time awake and SE, but was poor at predicting sleep stages, such as LST, DST and REM. In general, the stepwise multiple linear regression outperformed the current neural network model at predicting measures of sleep quality, but the neural network model was superior at predicting TA. Taken together, these findings demonstrate that pre-sleep HRV accurately predicts CI and sleep continuity in male national level athletes.
The present study suggests that the prevalence of CI among Chinese national level athletes is between 49% and 64%, which is substantially higher than the rates reported in previous studies using elite athletes (5%–33%) (Gerber et al., 2022; Gupta et al., 2017). Such large discrepancies are likely due to PSG being more accurate than subjective measures for detecting CI (Sateia, 2014). For example, our earlier research, which utilized sleep-monitoring mattresses, revealed that the incidence of sleep disorders among Chinese elite bobsleigh athletes was ∼63% (Li et al., 2022). As expected, CI was accompanied by poor sleep quality in the form of reductions in TST, SE, and DST and increases in SOL and TA (Table 1). These observations are particularly important considering that CI is prevalent among athletes and has been linked with impairments in physical performance, attention, concentration and memory (Leger et al., 2005). The present study also observed that pre-sleep HRV parameters, RMSSD, PNN50, ApEn, and SampEn were reduced in athletes suffering from CI (Table 1). This suggests an imbalance in autonomic nervous system regulation, specifically a downregulation of parasympathetic and an upregulation of sympathetic activity. Since autonomic dysfunction is known to contribute to overreaching (Achten and Jeukendrup, 2003), these findings imply that CI may increase an athlete’s vulnerability to overtraining which can impair athletic performance and increase the risk of injury and illness (Soligard et al., 2016).
To our knowledge, this is the first study to demonstrate that pre-sleep HRV can predict CI in national level athletes using a binary logistic regression model. This is of importance as predicting CI across a range of different team-sports is challenging due to the complex interplay of individual-level factors, including sport type, performance level, training workload, sleep habits and personal beliefs (Walsh et al., 2021). Binary logistic regression models have been shown to predict CI in response to cognitive behavioral therapy with accuracies ranging from 60%–71% when used with the general public (Holler et al., 2024). In contrast, we showed a considerably higher accuracy of 96% through using pre-sleep HRV. However, caution is advised when interpreting these findings given the high predictive performance for CI as it may partly reflect hidden overlap between the training and validation datasets. Although we used separate data sets, these data were drawn from similar athlete populations (e.g., age, sex, team-based sports and national level) under similar measurement conditions, therefore the accuracy of our model could have been inflated. This being said, the higher predictive accuracy of pre-sleep HRV may also reflect that parasympathetic and sympathetic systems responding more clearly to stress, fatigue, or psychological strain in national-level athletes, making HRV a more sensitive marker of CI in this population. In this regard, our previous research showed that national level athletes frequently exhibit negative emotional states (Li et al., 2022), and there is well-established link between negative emotions and HRV (Di Simplicio et al., 2012).
The current study also demonstrated that multiple linear regression could predict parameters of TA and SE, which are key indicators of sleep continuity. These results align to the small body of research that has investigated whether pre-sleep HRV is related to sleep quality in healthy young and middle-aged adults (Jung et al., 2017; Fantozzi et al., 2019; Werner et al., 2015). For instance, Fantozzi et al. (2019) found that measures of parasympathetic activity, such as RMSSD and PNN50 predicted a shorter wake after sleep onset time. Similarly, in the present study, PNN50 was an integral component of the multiple linear regression model to predict both TA and SE, in addition to RMSSD for the prediction of SE. We expanded on previous work by identifying additional pre-sleep HRV predictors for sleep continuity (TA: PNN50, MHR, ApEn; SE: PNN50, MHR, RMSSD). However, pre-sleep HRV was less effective in predicting sleep phases (TST, LST, and REM). Nevertheless, the multiple linear regression model outperformed our MLP neural network model for predicting SOL, SE, and DST. This suggests that a linear relationship exists between pre-sleep HRV and some sleep related outcomes. Although the MLP neural network model was superior at predicting TA, which could also indicate that this type of model is better at solving more complex nonlinear problems. Whilst these results are promising, collectively they indicate that a single physiological signal, such as HRV, may restrict the predictive potential of a model when assessing measures of sleep quality which are multidimensional and complex in nature.
5 Limitations and future directions
While this study offers important insights into sleep prediction among athletes, several limitations should be considered. All regression models are inherently susceptible to overfitting, in which the model captures spurious patterns in the training data, thereby reducing its generalizability to independent datasets. As our work was considered exploratory, we performed a train-validation split to efficiently provide an estimate of model performance on unseen data, helping to detect overfitting. However, future work should use cross-validation to evaluate the models across multiple splits, providing a more stable and accurate estimate of their performance. The sample of participants in the current study was limited to male Chinese athletes involved in team sports, which may influence the broader applicability of our findings to other athletic or non-athletic populations, including female athletes or those in individual sports. Additionally, the study utilized HRV as the sole physiological indicator, without accounting for other psychophysiological (e.g., circadian rhythm, blood pressure, stress and anxiety), which could play a key role in sleep regulation. To strengthen the validity and generalizability of future models, research should include more diverse cohorts and a wider range of physiological and psychological variables. Incorporating alternative sleep assessment tools, such as actigraphy and subjective measures like sleep diaries, may also enhance ecological validity and provide a more holistic understanding of sleep in athletic populations.
6 Conclusion
The current binary logistic regression model accurately predicted CI in male national level athletes. In addition, our multiple linear regression model was effective at predicting sleep continuity measures (TA and SE), but was poor at predicting sleep phases (TST, LST, DST and REM). Based on these findings, pre-sleep HRV could be a potential approach for practitioners and coaches to accurately predict CI, enabling the timely implementation of strategies to enhance sleep quality and, consequently, maintain optimal athletic performance.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Sports Science Ethics Committee of Beijing Sport University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
QL: Formal Analysis, Methodology, Software, Visualization, Writing – original draft. XL: Data curation, Project administration, Resources, Visualization, Writing – review and editing. WY: Data curation, Investigation, Project administration, Writing – review and editing. CS: Conceptualization, Data curation, Formal Analysis, Methodology, Supervision, Writing – review and editing. YZ: Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review and editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
Author XL was employed by Beijing Xiaomi Mobile Software Co., Ltd.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Achten J., Jeukendrup A. E. (2003). Heart rate monitoring: applications and limitations. Sports Med. 33, 517–538. doi:10.2165/00007256-200333070-00004
Berry R. B., Budhiraja R., Gottlieb D. J., Gozal D., Iber C., Kapur V. K., et al. (2012). Rules for scoring respiratory events in sleep: update of the 2007 AASM manual for the scoring of Sleep and Associated Events. Deliberations of the Sleep Apnea Definitions Task Force of the American Academy of Sleep Medicine. Sleep. Med. 8 (5), 597–619. doi:10.5664/jcsm.2172
Buysse D. J., Reynolds C. F., Berman S. R., Kupfer D. J. (1989). The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research. Psychiatry Res. 28 (2), 193–213. doi:10.1016/0165-1781(89)90047-4
Charest J., Grandner M. A. (2022). Sleep and athletic performance: impacts on physical performance, mental performance, injury risk and recovery, and mental health: an update. Sleep. Med. Clin. 17 (2), 263–282. doi:10.1016/j.jsmc.2022.03.006
DI Simplicio M., Costoloni G., Western D., Hanson B., Taggart P., Harmer C. J. (2012). Decreased heart rate variability during emotion regulation in subjects at risk for psychopathology. Psychol. Med. 42 (8), 1775–1783. doi:10.1017/S0033291711002479
Fantozzi M. T., Artoni F., Faraguna U. (2019). “Heart rate variability at bedtime predicts subsequent sleep features[C],” in 2019 41st annual international conference of the IEEE engineering in medicine and biology Society (EMBC). IEEE, 6784–6788.
Garbarino S., Bragazzi N. L. (2024). Revolutionizing sleep health: the emergence and impact of personalized sleep medicine. J. Personalized Med. 14 (6), 598. doi:10.3390/jpm14060598
Gerber M., Kellmann M., Lang C. (2022). Potential prevalence, pattern, and risk factors of insomnia symptoms in adolescent elite athletes. Z. für Sportpsychol. 356. doi:10.1026/1612-5010/a000366
Giavarina D. (2015). Understanding Bland Altman analysis. Biochem. Medica Cas. Hrvat. Drustva Med. Biokem. 25 (2), 141–151. doi:10.11613/BM.2015.015
Grimaldi D., Papalambros N. A., Reid K. J., Abbott S. M., Malkani R. G., Gendy M., et al. (2019). Strengthening sleep-autonomic interaction via acoustic enhancement of slow oscillations. Sleep 42 (5), zsz036. doi:10.1093/sleep/zsz036
Gupta L., Morgan K., Gilchrist S. (2017). Does elite sport degrade sleep quality? A systematic review. Sports Med. 47 (7), 1317–1333. doi:10.1007/s40279-016-0650-6
Holler E., Du Y., Barboi C., Owora A. (2024). Prognostic models for predicting insomnia treatment outcomes: a systematic review. J. Psychiatric Res. 170, 147–157. doi:10.1016/j.jpsychires.2023.12.017
Hopkins W. G., Marshall S. W., Batterham A. M., Hanin J. (2009). Progressive statistics for studies in sports medicine and exercise science. Med. Sci. Sports Exerc. 41 (1), 3–13. doi:10.1249/MSS.0b013e31818cb278
Hu S., Shi L., Li Z., Ma Y., Li J., Bao Y., et al. (2024). First-night effect in insomnia disorder: a systematic review and meta-analysis of polysomnographic findings. J. Sleep Res. 33 (1), e13942. doi:10.1111/jsr.13942
Jung D. W., Lee Y. J., Jeong D.-U., Park K. S. (2017). New predictors of sleep efficiency. Chronobiology Int. 34 (1), 93–104. doi:10.1080/07420528.2016.1241802
Laborde S., Mosley E., Thayer J. F. (2017). Heart rate variability and cardiac vagal tone in psychophysiological research - recommendations for experiment planning, data analysis, and data reporting. Front. Psychol. 8, 213. doi:10.3389/fpsyg.2017.00213
Leger D., Metlaine A., Choudat D. (2005). Insomnia and sleep disruption: relevance for athletic performance. Clin. sports Med. 24 (2), 269–285. doi:10.1016/j.csm.2004.12.011
Lehrer P. M., Gevirtz R. (2014). Heart rate variability biofeedback: how and why does it work? Front. Psychol. 5, 756. doi:10.3389/fpsyg.2014.00756
Li Q., Steward C. J., Cullen T., Che K., Zhou Y. (2022). Presleep heart-rate variability biofeedback improves mood and sleep quality in Chinese Winter olympic Bobsleigh athletes. Int. J. Sports Physiology Perform. 17 (10), 1516–1526. doi:10.1123/ijspp.2022-0037
Li Q., Shi M., Steward C. J., Che K., Zhou Y. (2024). A comparison between pre-sleep heart rate variability biofeedback and electroencephalographic biofeedback training on sleep in national level athletes with sleep disturbances. Appl. Psychophysiol. Biofeedback 49 (1), 115–124. doi:10.1007/s10484-023-09604-3
Marzbani H., Marateb H. R., Mansourian M. (2016). Neurofeedback: a comprehensive review on System design, methodology and clinical applications. Basic Clin. Neurosci. 7 (2), 143–158. doi:10.15412/J.BCN.03070208
Mckay A. K. A., Stellingwerff T., Smith E. S., Martin D. T., Mujika I., Goosey-Tolfrey V. L., et al. (2022). Defining training and performance caliber: a participant classification framework. Int. J. Sports Physiology Perform. 17 (2), 317–331. doi:10.1123/ijspp.2021-0451
Michael S., Graham K. S., Davis G. M. O. (2017). Cardiac autonomic responses during exercise and post-exercise recovery using heart rate variability and systolic time Intervals-A review. Front. Physiology 8, 301. doi:10.3389/fphys.2017.00301
Mollayeva T., Thurairajah P., Burton K., Mollayeva S., Shapiro C. M., Colantonio A. (2016). The Pittsburgh sleep quality index as a screening tool for sleep dysfunction in clinical and non-clinical samples: a systematic review and meta-analysis. Sleep. Med. Rev. 25, 52–73. doi:10.1016/j.smrv.2015.01.009
Morin C. M., Jarrin D. C. (2022). Epidemiology of insomnia: prevalence, course, risk factors, and public health burden. Sleep. Med. Clin. 17 (2), 173–191. doi:10.1016/j.jsmc.2022.03.003
Perlis M. L., Smith M. T., Pigeon W. R. (2005). Etiology and pathophysiology of insomnia. Princ. Pract. sleep Med. 4, 714–725. doi:10.1016/b0-72-160797-7/50067-7
Santos-De-AraúJO A. D., Shida-Marinho R., Pontes-Silva A. (2022). Heart rate variability (HRV): checklist for observational and experimental studies. Autoimmun. Rev. 21 (11), 103190. doi:10.1016/j.autrev.2022.103190
Sateia M. J. (2014). International classification of sleep disorders-third edition: highlights and modifications. Chest 146 (5), 1387–1394. doi:10.1378/chest.14-0970
Sheela K. G., Deepa S. N. (2013). Review on methods to fix number of hidden neurons in neural networks. Math. Problems Eng. 2013 (pt.7), 425741–11. doi:10.1155/2013/425740
Soligard T., Schwellnus M., Alonso J.-M., Bahr R., Clarsen B., Dijkstra H. P., et al. (2016). How much is too much?(Part 1) International Olympic Committee consensus statement on load in sport and risk of injury. Br. J. Sports Med. 50 (17), 1030–1041. doi:10.1136/bjsports-2016-096581
Thompson E. (2015). Hamilton rating Scale for anxiety (HAM-A). Occup. Med. Oxf. Engl. 65 (7), 601. doi:10.1093/occmed/kqv054
Walsh N. P., Gleeson M., Shephard R. J., Gleeson M., Woods J. A., Bishop N. C., et al. (2011). Position statement. part one: immune function and exercise. Exerc. Immunol. Rev. 17, 6–63.
Walsh N. P., Halson S. L., Sargent C., Roach G. D., Nédélec M., Gupta L., et al. (2021). Sleep and the athlete: narrative review and 2021 expert consensus recommendations. Br. J. Sports Med. 55(7), 356–368. doi:10.1136/bjsports-2020-102025
Werner G. G., Ford B. Q., Mauss I. B., Schabus M., Blechert J., Wilhelm F. H. (2015). High cardiac vagal control is related to better subjective and objective sleep quality. Biol. Psychol. 106, 79–85. doi:10.1016/j.biopsycho.2015.02.004
Keywords: chronic insomnia, athlete, heart rate variability, autonomic nervous system, linear regression model
Citation: Li Q, Lei X, Yu W, Steward CJ and Zhou Y (2025) Pre-sleep heart rate variability predicts chronic insomnia and measures of sleep continuity in national-level athletes. Front. Physiol. 16:1627287. doi: 10.3389/fphys.2025.1627287
Received: 12 May 2025; Accepted: 28 August 2025;
Published: 18 September 2025.
Edited by:
Mallikarjuna Korivi, Zhejiang Normal University, ChinaReviewed by:
Mohamed Kerkeni, High Institute of Sport and Physical Education of Sfax, TunisiaJoão Barreira, University of Maia, Portugal
Rubén Portes Sánchez, European University of Madrid, Spain
Copyright © 2025 Li, Lei, Yu, Steward and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Charles J. Steward, Y2hhcmxlcy5zdGV3YXJkQG5vdHRpbmdoYW0uYWMudWs=; Yue Zhou, emhvdXlAYnN1LmVkdS5jbg==