Work Habit-Related Sleep Debt; Insights From Factor Identification Analysis of Actigraphy Data

The present study investigates the factors of “Weekday sleep debt (WSD)” by comparing activity data collected from persons with and without WSD. Since it has been reported that the amount of sleep debt as well the difference between the social clock and the biological clock is associated with WSD, specifying the factors of WSD other than chronotype may contribute to sleep debt prevention. We recruited 324 healthy male employees working at the same company and collected their 1-week wrist actigraphy data and answers to questionnaires. Because 106 participants were excluded due to measurement failure of the actigraphy data, the remaining 218 participants were included in the analysis. All participants were classified into WSD or non-WSD groups, in which persons had WDS if the difference between their weekend sleep duration and the mean weekday sleep duration was more than 120 min. We evaluated multiple measurements derived from the collected actigraphy data and trained a classifier that predicts the presence of WSD using these measurements. A support vector machine (SVM) was adopted as the classifier. In addition, to evaluate the contribution of each indicator to WSD, permutation feature importance was calculated based on the trained classifier. Our analysis results showed significant importance of the following three out of the tested 32 factors: (1) WSD was significantly related to persons with evening tendency. (2) Daily activity rhythms and sleep were less stable in the WSD group than in the non-WSD group. (3) A specific day of the week had the highest importance in our data, suggesting that work habit contributes to WSD. These findings indicate some WSD factors: evening chronotype, instability of the daily activity rhythm, and differences in work habits on the specific day of the week. Thus, it is necessary to evaluate the rhythms of diurnal activities as well as sleep conditions to identify the WSD factors. In particular, the diurnal activity rhythm influences WSD. It is suggested that proper management of activity rhythm may contribute to the prevention of sleep debt.

The present study investigates the factors of "Weekday sleep debt (WSD)" by comparing activity data collected from persons with and without WSD. Since it has been reported that the amount of sleep debt as well the difference between the social clock and the biological clock is associated with WSD, specifying the factors of WSD other than chronotype may contribute to sleep debt prevention. We recruited 324 healthy male employees working at the same company and collected their 1-week wrist actigraphy data and answers to questionnaires. Because 106 participants were excluded due to measurement failure of the actigraphy data, the remaining 218 participants were included in the analysis. All participants were classified into WSD or non-WSD groups, in which persons had WDS if the difference between their weekend sleep duration and the mean weekday sleep duration was more than 120 min. We evaluated multiple measurements derived from the collected actigraphy data and trained a classifier that predicts the presence of WSD using these measurements. A support vector machine (SVM) was adopted as the classifier. In addition, to evaluate the contribution of each indicator to WSD, permutation feature importance was calculated based on the trained classifier. Our analysis results showed significant importance of the following three out of the tested 32 factors: (1) WSD was significantly related to persons with evening tendency. (2) Daily activity rhythms and sleep were less stable in the WSD group than in the non-WSD group. (3) A specific day of the week had the highest importance in our data, suggesting that work habit contributes to WSD. These findings indicate some WSD factors: evening chronotype, instability of the daily activity rhythm, and differences in work habits on the specific day of the week. Thus, it is necessary to evaluate the rhythms of diurnal activities as well as sleep conditions to identify the WSD factors. In particular, the diurnal activity rhythm influences WSD. It is suggested that proper management of activity rhythm may contribute to the prevention of sleep debt.
Keywords: weekday sleep debt, actigraphy, machine learning, feature importance, support vector machine INTRODUCTION Sleep debt has deleterious effects on work or academic performance and also may impair various other psychological and physical functions such as memory, learning, metabolism, and immunity (1). However, a convenient method for quantitively evaluating sleep debt has not yet been established. According to the International Classification of Sleep Disorders, Third Edition (ICSD-3), getting enough sleep duration for at least 7 days before polysomnography (PSG) should be performed for the sake of sleep debt resolution (2); however, inventories to measure the degree of sleep debt have not been developed.
We focus on "weekday sleep debt (WSD), " which refers to taking longer sleep on the weekend to compensate for lack of sleep during the weekdays (3,4). WSD is defined as the difference between the weekend and weekday sleep durations (5).
It has been well-known that chronotype affects weekday and weekend sleep timing and duration (6). One cause of WSD may be social jetlag, which is the discrepancy between biological and social clocks (7,8). WSD may not occur when persons live following their born chronotype determined by genetic factors. However, it is difficult for most persons to live a daily social life by following their chronotype due to work or school. The factors of WSD other than chronotype should be specified to prevent sleep debt because the chronotype are reported to be influenced by genetic factors (5) as well as age, and personality (9).
It is assumed that persons with WSD may not get enough sleep during the weekdays and take longer sleeps on the weekend than during the weekdays to compensate for lack of weekday sleep. Actigraph devices have been widely used for long-term circadian activity measurement because they are lightweight, easy-to-wear, and non-invasive (10)(11)(12). The long-term activity measurement enables the evaluation of the discrepancy between biological and social rhythms clocks.
In this study, we aim to investigate the factors of WSD by using 1-week wrist actigraphy data. The actigraphy data and answers to questionnaires of persons with and without WSD were collected from 324 male employees at a Japanese wholesale company, and 218 participants were analyzed.
Before analysis, we validated the collected actigraphy data through comparison with sleep diaries recorded by the participants. In addition, the definition of WSD was examined with the chorotype of the participants determined by the Morningness-Eveningness Questionnaire (MEQ).
We trained a classifier that predicts the presence of WSD from the collected actigraphy data by utilizing a machine learning (ML) technique and calculated feature importance based on the trained classifier to specify the factors of WSD in addition to statistical analysis of the answers to questionnaires.

Measurements From Actigraphy Data
We calculated the following 40 parameters listed in Table 1 from the actigraphy data, whose details are described below.  The Cole-Kripke sleep/wake identification method was adopted to estimate wake-up and sleep-onset times from the actigraphy data (13). It has been reported that the agreement rate between the Cole-Kripke method and PSG was 88% (13,14). It is difficult to estimate sleep/awake timing using only the Cole-Kripke method because it uses 1-min activity counts data measured by means of actigraphy; therefore, we introduce a sleep/wake function r(i) where x i is a sleep/wake state at the time i estimated by means of the Cole-Kripke method: x i = +1 and x i = −1 denote sleep and awake, respectively. r(i) becomes large when the sleep state dominates and becomes small when the wake state dominates. The extreme points of r(i) may be times that the sleep/awake states change. Thus, its maximum and minimum points can be defined as the wake-up time and the sleeping time. If there are multiple maximum and minimum points within a day, we adopt the latest one as the wake-up time or the sleeping time.
Using the sleep/wake function r(i), the following measurements, which represent the sleep habits, are estimated: wake-up time (WU), sleep-onset time (SO), mid-sleep (MS), and sleep duration (SD). WU, SO and MS are expressed as the minutes elapsed from midnight. Their means and standard deviations were used in addition to these values for each weekday.
• Sleep Rhythm: Sleep regularity index (SRI) and Sleep timing index (STI) Sleep regularity index (SRI) (15) and sleep timing index (STI) (16) represent sleep rhythm. SRI is the agreement between sleep/wake states at any two-time points separated by 24 h. A large SRI means that the sleep rhythm is regular. SRI can be calculated based on the Cole-Kripke sleep/wake identification method. N and p are the numbers of measurement days and samples per day, respectively. s k,j = 1 is satisfied if a person is determined to be in the sleep state at time j on the kth day, and s k,j = 0 is satisfied if a person is determined to be in the awake state at time j on the kth day. SRI is defined as where δ i,j = 1 if i = j and δ i,j = 0 otherwise. STI expresses the mean sleep midpoint during the measurement days. When the time from 0:00 to 24:00 are associated with the angles θ = [0, 2π], a time point j is expressed as θ j = 2πj/p. Using this relationship, STI is defined as where i is the imaginary unit, and is the argument of the sum of e iθ j over the sleep states. Figure 1 shows a schematic diagram of STI, in which the triangles denote the times estimated as the sleep states, and e iθ j is the composition of the vectors directing these triangles. Thus, the time corresponding to the argument of e iθ j is STI.
• Activity Rhythm: Inter-daily stability (IS) and Intra-daily variability (IV) Inter-daily stability (IS) and intra-daily variability (IV) are wellknown parameters for activity rhythm evaluation based on the actigraphy data (17), which are defined as follows: where A i is the activity data at time i, A j is the average of the activity data at time j over different days, and A is the average of all activity data. p is the number of activity data per day, and M is the total number of collected activity data. IS is the ratio of the variance of all the collected activity data to the variance of the average activity data at time j over different days. IS becomes large when the fluctuation of the diurnal activity data recorded at the same time of different days becomes large. IV is the ratio of the variance of the collected activity data to the sum of squares of the differences in activity data between a time point and the previous time point. IS and IV are measured for the synchronization of diurnal activity rhythm and the frequency of midday naps and nocturnal awakening, respectively (18). IS and IV can be calculated from both the activity counts and illuminance in the actigraphy data, which are denoted as IS act , IS light , IV act , IV light , respectively. • Rest-Activity Rhythm: Average activity during the most active 10-h period (M10) and Average activity during the least active 5-h period (L5) Although IV represents intra-daily activity rhythm, M10 and L5 can be calculated from the actigraphy data to emphasize the restactivity circadian rhythm. M10 and L5 are defined as the average activities during the most active 10-h period and during the least active 5-h period, respectively (17). M10 represents activity during the most active period within the day, which may be influenced by daytime napping. L5 represents movement activity during sleep. Although sleep during the weekdays does not reflect the congenital biological clock much due to the effect of the social clock, weekend sleep significantly reflects the congenital biological clock. Chronotype is classified into three types based on MSF: morning type (MSF < 3:00), intermediate type (3:00 ≤ MSF ≤ 4 : 00), and evening type (MSF > 4:00) (19). In addition, social jetlag (SJL) is the difference between the biological clock and the social clock, which is defined as where MSW is the meantime of mid-sleep on the weekdays. Although the Morningness Eveningness Questionnaire (MEQ) (20) or the Munich Chronotype Questionnaire (MCTQ) (21) is usually used for calculating MSF and SJL, we used the actigraphy data for calculating them instead of the questionnaires in this study.

Measurements From Questionnaires
In addition to the measurements derived from the actigraphy data, we collected answers to the following, modified for the Japanese population.

Definition of Weekday Sleep Debt
Sleep duration (SD) of each day can be calculated using the sleep/wake function r(i). By using the means of SD on the weekend (SD on a free day; SDF) (min), and SD on the weekday (SDW) (min), sleep rebound on the weekend (SRW) (min) (30) is defined as A person may have chronic sleep deprivation during the weekdays when SRW ≥ 120 min (31). Based on this criterion, we judge that a person has WSD when SRW ≥ 120 min.

Machine Learning-based Factor Identification
In this research, we adopt machine learning (ML) technologies to find the factors of WSD. A classifier that predicts whether a person has WSD is trained from the measurements described above or not, and feature importance is calculated based on the trained classifier. Feature importance is a well-known method for identifying which input features in a classifier contribute to output (32). Although various classifier training methods have been used in ML, we adopted a support vector machine (SVM) in this work. SVM is a classical nonlinear classification technique that was originally developed for classifying data into two classes (33). SVM has been widely used for various applications such as spam mail filtering, bioinformatics, and object recognition (34)(35)(36). Thus, SVM is a reliable ML technique. We used a permutation feature importance method to calculate feature importance, which is defined as the decrease in prediction performance when a single input feature is randomly shuffled (37, 38). E is the prediction performance of the trained classifier when the original feature set is input to the classifier, and E j is the prediction performance of the classifier when a feature x j is exchanged randomly into another feature. The feature importance of x j is calculated as We can judge that the feature x j contributes to a prediction when Pimp x j is large.

Data Description
A cross-sectional survey was conducted in 324 employees at a drug wholesale company in Osaka, Japan, from January 26, 2004, to December 19, 2005. All of the participants were male. It becomes difficult to focus on the relationship between work habits and activity rhythm when female employees are included in the survey. There are more factors affecting sleep and rhythm in females than in males, such as having small children or not, menstrual cycle, unmarried or married (39,40). However, some studies have already reported that there were no significant differences in daytime activities between males and females (41,42). Thus, it is expected that our analysis is applicable to females as well as males.
The workdays were from Monday to Friday, and the holidays were Saturday and Sunday, which are typical for Japanese companies and government offices. We used a wrist-type actigraph device (Actiwatch AW-Light, Mini-Mitter) for data recording, which measures the activity counts and illuminance within every one-minute bin. Illuminance is recorded by a photodiode equipped in the actigraph device. Its measurement range is 0.1 to 150,000 Lux with 0.1 Lux resolution, which is suitable for both interior and exterior illuminance measurement. The actigraph device worked for 2-3 months per one battery. Thus, data missing did not occur during data collection. In addition, we collected answers to the questionnaires and sleep diaries during the survey. The protocol of this study was approved by the Ethics Committee of the Shiga University of Medical Science (R2020-026).
Since all participants wore the actigraph device from around 5:00 pm on a Monday to around 8:00 am on the next Monday, the Monday (the first day) actigraphy data were not recorded in their entirety. The Monday data were excluded from the analysis except for wake-up and sleep-onset time estimation. On the other hand, sleep-onset time, mid-sleep, and sleep duration of Friday were not able to be calculated because sleep in the Friday night sleep is regarded as a weekend.
One hundred and six participants who removed the actigraph device for more than 30 min during the survey term were excluded from the analysis so that 218 participants were included in the analysis. The characteristics of the subjects are illustrated in Table 2, and a detailed description of the data used in this study is available in Kadotani    Our dataset consisted of 1-week actigraph data, which seems to be rather short for analysis. However, Tienoven et al. calculated SRM of healthy persons from their 1-week activity data (45). Fonseca et al. evaluated RM of stroke patients and healthy persons based on 1-week activity data (46). In addition, the sleep extension term required before performing PSG is also 1 week according to ICSD-3 (2). These indicate that 1-week activity data can be used for sleep rhythm evaluation.
An example of sleep/awake state estimation is shown in Figure 2. All participants were classified into the WSD group or the non-WSD group based on the SRW calculated from the actigraphy data. The numbers of persons with and without WSD were 89 and 129, respectively. Figure 3 shows the distribution of SRW in this data.

Analysis Procedure
Before analysis, we validated the accuracy of the actigraphy data through a comparison between sleep durations derived from the actigraphy and the sleep diaries recorded by participants themselves. In addition, the answers of MEQ item 19, which asks self-awareness of chronotype were compared to verify whether the definition of WSD adopted in this study was supported by self-awareness of chronotype.
To specify the factors of WSD based on the feature importance, we needed to construct a classifier that predicts the presence of WSD. In general, the numbers of positive and negative examples for training should be balanced in binary classification problems to construct a good classifier (47). We randomly discarded non-WSD data so that the numbers of persons with WSD and non-WSD became even by means of random under-sampling because the number of persons without WSD was larger than that of persons with WSD.
As the input features of the WSD classifier, we used 40 features listed in Table 1, derived from the actigraphy data. The Gaussian kernel with a parameter γ was adopted in SVM, of which the parameter γ was tuned by means of 10-fold cross-validation. The WSD classifier training and permutation feature importance computation were repeated 100 times by changing the training and the test samples at random, in which the ratio of the number of training samples to that of test samples was fixed at 7:3. Finally, the mean of the computed permutation feature importance was calculated. In this study, we focus on features occupying the top 20% of the sum of importance, which does not necessarily mean only these factors are important (or statistically significant). We used the top 20% just for limiting the number of factors to discuss in detail, in which the cut-off value can be varied.
In addition, we examined whether the presence of WSD may relate to occupational categories or not. In this survey, employees were classified into the following categories: clerical, managers, professional and technical sales, service, transportation and communication, manufacturing, and others (43).
We compared each of the 40 input features and each of the collected answers to the questionnaires between the WSD and non-WSD groups by means of a statistical test.

Statistical Analysis
We used the t-test for comparison between the WSD and non-WSD groups. The significance level was set to p < 0.05. To consider the multiple comparisons of the total of 40measurements from the actigraphy data and 14-measurements from the questionnaires, the Bonferroni correction was adopted. That is, the significance levels were corrected as p < 0.

RESULTS
First, we checked the validity of the actigraphy data. Figure 4 is the Bland-Altman plot between sleep durations derived from the  (48) reported that there were errors of >1.5 h between sleep durations derived from the actigraphy and the sleep diaries in 79% of healthy adults. Short et al. (49) confirmed that the average errors between sleep durations derived from the actigraphy and the sleep diaries were 87 min in healthy adolescents. According to these studies, the sleep duration errors observed in this study was appropriate. The participants distribution of MEQ item 19 is shown in Table 3, and a significant difference was confirmed in MEQ item 19 through the χ 2 test (p = 0.12 × 10 −7 ). On the other hand, there was no significant difference in the total score of MEQ (p = 0.12). Thus, the definition of WSD adopted in this study was associated with self-awareness of chronotype.
We compared the answers to the questionnaires between the WSD and non-WSD groups using the t-test with the Bonferroni correction. There were no questionnaires that had a significant difference between the WSD and non-WSD groups, as shown in Table 4.
The calculated mean of the permutation feature importance is shown in Figure 5, which shows that the estimated WSD factors are as follows: sleep duration on Thursday (SD Thu ), sleep timing index (STI), the standard deviation of sleep duration (SD std ), sleep onset on Thursday (SO Thu ), inter-daily stability calculated by the activity counts (IS act ), mid-sleep on free days (MSF), and social jetlag (SJL). Table 5 shows the comparison result of input features between the WSD and non-WSD groups by the t-test with the Bonferroni correction, in which features displayed by bold fonts have a significant difference between the two groups. In addition, the last column in Table 5 indicates the estimated WSD factors by the permutation feature importance. According to the t-test, MSF and SJL were significantly different, as well as some features about the weekday sleep condition, including SD Thu and SO Thu . The results of the t-test almost agreed with those of the permutation importance. Thus, these results indicate that sleep on Thursday, activity rhythm and biological clock might contribute to WSD. Figure 6 shows the numbers of employees with/without WSD in each occupational category. The additional χ 2 test examining the effect of occupational categories on WSD showed that the occupational category was not associated with the presence of WSD in this dataset (p = 0.87).
We constructed additional two classifiers using the actigraphy data collected from two groups: a salesperson group (N = 119) and a non-salesperson group (N = 205). The calculated means of the permutation feature importance derived from each classifier were shown in Figure 7. The top five WSD factors were the same, which shows that the factors of WSD were not related to specific occupational categories.

Chronotype and WSD
There was a significant difference in MEQ item 19 between WSD and non-WSD groups (p = 0.12 × 10 −7 ) although a significant difference in the total score of MEQ was not confirmed (p = 0.12).
Turco et al. (50) assessed the reliability of MEQ item 19 in comparison with the total score of MEQ, the time of subjective sleepiness, and real-life sleep timing variables. They found that significant differences in sleep-wake timing between the answers of MEQ item 19. In addition, such differences were still observed when sleep-wake habits were analyzed separately

Day-to-Day Variance
The weekday sleep conditions were different between the WSD and non-WSD groups, according to Table 2. Sleep duration during the weekdays of the WSD group was significantly shorter than that of the non-WSD group except for Wednesday (p < 0.05), which may be plausible from the viewpoint of the assumption that WSD compensates for sleep deprivation during the weekdays. SD Thu and SO Thu were important features for WSD based on the analysis result of permutation importance analysis, which indicated that Thursday was associated with WSD.

Morning Evening Tendency
There were significant differences in MSF and SJL, the biological clock features (p < 0.05). MSF and SJL of the WSD group were later than those of the non-WSD group. That is, persons with WSD have eveningness tendencies. In addition, MSF and SJL were also important to WSD according to the permutation importance analysis. Since all of the participants worked at the same company, they might live and work following the same social clock. Thus, this result indicated that disagreement between the social clock and the individual biological clock might contribute to WSD, which is consistent with the previous study (5).

Activity Levels Variance
The inter-daily stability by the activity counts (IS act ) evaluates the synchronization of diurnal activity rhythm, and it was also an important feature for WSD according to the permutation importance analysis, although the significant difference of IS act between the WSD and the non-WSD groups was not confirmed. Figure 5 shows the 24-h activity counts during the weekdays of two participants with and without WSD. Inter-daily synchronization of activity rhythm, including sleep conditions, might affect WSD.
On the other hand, intra-daily variability based on the activity counts (IV act ) was not significantly different between the WSD and non-WSD groups, and its feature importance was low. Since IV act denotes the frequency of midday naps and nocturnal awakening, its low importance suggests that sleep disorders, such as narcolepsy or sleep apnea, might not contribute to WSD. In addition, IS and IV by illuminance (IS light and IV light ) were not significantly different and of low importance, implying that illuminance might not affect WSD.
It was of note that Thursday was more important for WSD presence than other weekdays. This may be related to the participants' working conditions. All participants worked at the same drug wholesale company, which provides various medical supplies to hospitals and clinics. Since private clinics in Japan tend to close on Thursday evening (52), work habits on Thursdays are different from other workdays at the drug wholesale company. For example, employees might tend to do desk work rather than outside work or to go home earlier than other workdays. Such differences in work schedules might affect their social clock and disturb the synchronization of diurnal activity rhythm, which is consistent with the causes of WSD (5). Figure 8 clearly illustrates that the activity of a participant with WSD on Thursday was significantly different from those of other days while the activity of a participant without WSD synchronized even on Thursday. This difference might indicate increased desk time and reduced outside work on Thursday.
Thus, it is reasonable that Thursday was an important factor of WSD at the company. This analysis indicates that the work style differences on a specific day of the week may cause WSD.

Occupational Category
In this survey, participants were classified into a salesperson group and a non-salesperson group, and the top five permutation feature importance derived from each group was the same, as shown in Figure 6. This result indicated that the factors of WSD were not related to specific occupational categories, which also suggests that our analysis may be applicable to data collected from other companies.
The limitations of this study include the collected data; all of the participants were male and Japanese. Hence, we could consider neither gender nor racial differences in this study. In addition, the participants worked at the same company; it was difficult to investigate the effects of other days of the week on WDS since we did not compare our data with persons working at companies that have different work habits on a specific day of the week. Accordingly, we need to collect data from employees working in various types of industries to confirm our results. In addition, we were not able to analyze the Monday activity data due to the constraint of the data collection, which might affect the analysis results.

CONCLUSION
In this study, we collected actigraphy data from 324 healthy male employees at a drug wholesaler and calculated permutation feature importance based on SVM to identify the factors of WSD. We compared the answers to questionnaires between the WSD and the non-WSD groups.
Our analysis results indicated that sleep duration during the weekdays and the individual biological clock might affect WSD, which is consistent with previous studies. In addition, we demonstrated a new finding that turbulence of diurnal activity rhythm synchronization as well as nocturnal sleep rhythm, even for 1 day, is associated with WSD, which is a new finding of this work.
In the future, we will develop a quantitative evaluation methodology for sleep debt based on this study.

DATA AVAILABILITY STATEMENT
The data analyzed in this study is subject to the following licenses/restrictions: The actigraphy data will be made available by the corresponding author to colleagues who propose a reasonable scientific request after approval by the institutional review board of the SUMS Hospital. Requests to access these datasets should be directed to Hiroshi Kadotani, kadotanisleep@gmail.com.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by the Ethics Committee of the Shiga University of Medical Science. The patients/participants provided their written informed consent to participate in this study.