Application of Machine Learning Methods to Ambulatory Circadian Monitoring (ACM) for Discriminating Sleep and Circadian Disorders

Rodriguez-Morilla, Beatriz; Estivill, Eduard; Estivill-Domènech, Carla; Albares, Javier; Segarra, Francisco; Correa, Angel; Campos, Manuel; Rol, Maria Angeles; Madrid, Juan Antonio

doi:10.3389/fnins.2019.01318

ORIGINAL RESEARCH article

Front. Neurosci., 10 December 2019

Sec. Sleep and Circadian Rhythms

Volume 13 - 2019 | https://doi.org/10.3389/fnins.2019.01318

Application of Machine Learning Methods to Ambulatory Circadian Monitoring (ACM) for Discriminating Sleep and Circadian Disorders

Beatriz Rodriguez-Morilla¹

Eduard Estivill²

Carla Estivill-Domènech³

Javier Albares⁴

Francisco Segarra²

Angel Correa⁵

Manuel Campos^1,6

Maria Angeles Rol^1*

Juan Antonio Madrid¹

¹Laboratory of Chronobiology, IMIB-Arrixaca, Department of Physiology, Centro de Investigación Biomédica en Red de Fragilidad y Envejecimiento Saludable, Instituto de Salud Carlos III, University of Murcia, Murcia, Spain
²Clínica del Sueño Estivill, Barcelona, Spain
³Fundación Estivill Sueño, Barcelona, Spain
⁴Medicina del Sueño Doctor Albares, Centro Médico Teknon, Barcelona, Spain
⁵Department of Experimental Psychology, Faculty of Psychology, University of Granada, Granada, Spain
⁶Department of Computing and Systems, Faculty of Computer Science, University of Murcia, Murcia, Spain

The present study proposes a classification model for the differential diagnosis of primary insomnia (PI) and delayed sleep phase disorder (DSPD), applying machine learning methods to circadian parameters obtained from ambulatory circadian monitoring (ACM). Nineteen healthy controls and 242 patients (PI = 184; DSPD = 58) were selected for a retrospective and non-interventional study from an anonymized Circadian Health Database (https://kronowizard.um.es/). ACM records wrist temperature (T), motor activity (A), body position (P), and environmental light exposure (L) rhythms during a whole week. Sleep was inferred from the integrated variable TAP (from temperature, activity, and position). Non-parametric analyses of TAP and estimated sleep yielded indexes of interdaily stability (IS), intradaily variability (IV), relative amplitude (RA), and a global circadian function index (CFI). Mid-sleep and mid-wake times were estimated from the central time of TAP-L5 (five consecutive hours of lowest values) and TAP-M10 (10 consecutive hours of maximum values), respectively. The most discriminative parameters, determined by ANOVA, Chi-squared, and information gain criteria analysis, were employed to build a decision tree, using machine learning. This model differentiated between healthy controls, DSPD and three insomnia subgroups (compatible with onset, maintenance and mild insomnia), with accuracy, sensitivity, and AUC >85%. In conclusion, circadian parameters can be reliably and objectively used to discriminate and characterize different sleep and circadian disorders, such as DSPD and OI, which are commonly confounded, and between different subtypes of PI. Our findings highlight the importance of considering circadian rhythm assessment in sleep medicine.

Introduction

The study of circadian rhythms, especially when applicable to clinical practice, has generated increasing interest over the last few years (Sadeh and Acebo, 2002; Ancoli-Israel et al., 2003). This is partially due to lifestyle changes in developed societies that directly affect the quality of circadian rhythms, with the subsequent impact on sleep (Shochat, 2012). As reviewed by Goel et al. (2014), this so-called 24/7 society is characterized by an increasing proportion of work and leisure activities at night, exposure to light at aberrant times of the day, growing use of technological devices, and social jetlag, i.e., a circadian misalignment between free and work days (Roenneberg et al., 2003). All these factors lead to desynchronization between internal rhythms and the environmental day–night cycle. Moreover, the use of light-emitting electronic devices at night has been associated with delayed, reduced, and fragmented sleep (Owens et al., 1999; Van Den Bulck, 2004; Shochat et al., 2010).

In turn, sleep disorders usually lead to circadian disruption. On the one hand, sleep timing and duration modulate exposure to environmental cues that synchronize the circadian system (e.g., environmental light, food schedules, social activities, etc.). On the other hand, common symptoms of sleep disorders such as sleep fragmentation or short sleep duration, apart from contributing to excessive daytime sleepiness, affect the amplitude of other rhythms (Martinez-Nicolas et al., 2011) and alter the balance between homeostatic and circadian processes involved in sleep–wake regulation (Borbély et al., 2016).

This bidirectional relationship between sleep and circadian alterations highlights why evaluating circadian rhythms is also important for sleep medicine. In the current study, we aimed to test the diagnostic usefulness of ACM, based on wearable technology which combines the simultaneous recording of several circadian output and input signals, namely wrist temperature (T) (Sarabia et al., 2008), motor activity (A), body position (P) (Sadeh and Acebo, 2002; Ancoli-Israel et al., 2003), and exposure to environmental light (L) rhythms. Actigraphy is accepted by the American Academy of Sleep Medicine (AASM) as clinically appropriate for studying sleep and circadian disorders, as it has been used for >20 years (Sadeh and Acebo, 2002; Ancoli-Israel et al., 2003). In addition, wrist temperature has been shown as a reliable circadian marker (Sarabia et al., 2008; Martinez-Nicolas et al., 2013), showing a close relationship with dim light melatonin onset (Bonmatí-Carrión et al., 2014) and being therefore proposed as a non-intrusive circadian marker of choice in a consensus document (Mullington et al., 2016). Its usefulness for diagnostic purposes has been also reported for multiple conditions (Zornoza-Moreno et al., 2013; Martinez-Nicolas et al., 2014, 2018). The variables T, A, and P can be combined into the integrated variable TAP, which indicates general activation and has been previously validated for estimating sleep, showing greater reliability than any of the individual variables by themselves according to both sleep diaries (Ortiz-Tudela et al., 2010) and polysomnographic assessment (Ortiz-Tudela et al., 2014). Recently, this method has been successfully employed for chronotype identification (Martinez-Nicolas et al., 2019). In the present study, the usefulness of ACM as a diagnostic tool has mainly focused on PI and DSPD, due to their high prevalence and the overlapping of their symptoms (Gradisar and Crowley, 2013; Sivertsen et al., 2013).

Insomnia may be both a symptom and a disorder. Briefly, it implies difficulty for initiating or maintaining sleep, or non-restorative sleep, when a favorable opportunity and circumstances arise. It is considered a disorder when these symptoms are present at least three times a week, for at least 1 month (Roth, 2007). As such, it is the most common sleep disorder, but its prevalence varies depending on the definition used and the population studied. The strictest diagnostic criteria suggest a prevalence rate of around 5–7%, although insomnia symptoms rise to nearly 30% (Roth, 2007).

Delayed sleep phase disorder refers to normal sleep (in terms of quality and structure) that shows significantly delayed onset and wake-up times, with respect to those externally imposed, or desired by the patient according to standard schedules (American Academy of Sleep Medicine, 2005; World Health Organization [WHO], 2008). This also implies difficulties for initiating sleep, as in the case of insomnia, as well as problems for waking up early in the morning. The accomplishment of standard wake times results in partial sleep deprivation, which in turn deteriorates daytime performance (Gradisar and Crowley, 2013). Studies on the general population yielded prevalence rates of between 0.13 and 0.17% (Schrader et al., 1993; Yazaki et al., 1999), but when focusing on adolescence, prevalence increases to 16% (Lovato et al., 2013). Nevertheless, all estimations until now are uncertain, since this disorder is likely underestimated (Gradisar et al., 2011) and probably confounded with OI (Weitzman et al., 1981). Thus, tools are needed that allow their differentiation, in order to select the most appropriate interventions.

Prediction and classification methods based on machine learning are rapidly expanding as diagnostic tools for a wide spectrum of pathologies (Kubota et al., 2016; Dagliati et al., 2017; Duda et al., 2017; Kim et al., 2017; Mossotto et al., 2017; Serrano et al., 2017). The aim of this field of computational sciences is to develop algorithms capable of detecting predictable patterns in complex data samples. In general terms, supervised machine learning permits classification when the output is a label or nominal value, and prediction when the output is a numeric value. In this study, we used a classification model known as decision tree: a top-down classification algorithm that splits data into hierarchical nested classes, by selecting the variable in each step that better subdivides the sample (Rokach and Maimon, 2005). Comparing the classification obtained with an expert criterion (in this case, the initial categories of pathologies) allows estimating both the sensitivity and specificity of the generated model (Kubota et al., 2016).

Thus, the aim of this study was to assess the usefulness of machine learning methods to differentially diagnosis PI from DSPD using the circadian rhythms of wrist temperature, motor activity, body position, and exposure to environmental light, recorded by a ACM device.

Materials and Methods

Participants

A total of 242 patients suffering from either PI (n = 184) or DSPD (58), and 19 healthy control subjects were included in this study. Patients and healthy controls were retrospectively selected from the Circadian Health Database of the Chronobiology Laboratory at the University of Murcia¹. Chronobiological and sleep parameters from subjects attendants to Dr. Estivill Sleep Clinic (Barcelona, Spain) were anonymously analyzed and coded by the Kronowizard platform which was approved by the Ethics Committee of the University of Murcia. All subjects gave written informed consent before their diagnosis tests according to the model provided in the Kronowizard platform. The selection criteria for patients were: (1) having been diagnosed of either PI or DSPD by sleep experts from the above mentioned sleep clinic, (2) including ACM as part of routine clinical diagnosis and treatment of sleep and circadian disorders, (3) not being diagnosed of any other primary circadian or sleep disorders (discarded by PSG). The available information from the database did not differentiate between onset, maintenance, or terminal insomnia. The selection criterion for healthy controls was not being diagnosed of any sleep or circadian disorder. Exclusion criteria for both patients and healthy controls were any organic, metabolic, endocrine, or psychiatric disorder, so as the abuse of alcohol or illicit drugs.

Clinical Interview

The diagnostic classification was based on the clinical history. To this end, patients were interviewed about the nature of their complaints, including questions aimed at addressing the following aspects:

– Whether the problem referred to sleep onset, maintenance, non-restorative sleep, or a combination of these.

– Duration, frequency, and severity of the symptoms.

– Daytime functioning and associated symptoms.

– Other possible symptoms (snoring, apnea, nocturia, parasomnias, symptoms associated with other disorders, such as restless legs syndrome, periodic limb movement disorder, etc.).

– Sleep–wake habits: work, school, food and social schedules, physical activity, use of technological devices at night, naps during daytime, caffeine intake, routines before bedtime, etc.

– Sleep conditions and routines: conditions of the bedroom (light, noise, etc.), whether the patient sleeps alone or with someone else (and, in this case, possible disturbances from the bedmate), the presence and use of technological devices once in bed (TV, computer/laptop, light-emitting e-books, radio, etc.).

– Time preferences for sleep onset, wakeup, activities schedule, etc.

– Intake of psychoactive substances (alcohol, stimulating or relaxing substances, legal drugs, illicit drugs, etc.).

Ambulatory Circadian Monitoring Device

The multichannel device Kronowise^® (Chronobiology Laboratory, University of Murcia, Murcia, Spain) employed to evaluate the circadian rhythms of the selected sample integrates several sensors: (1) The Thermochron^® iButtonDS1921H data logger (Dallas, Maxim), placed on the skin of the non-dominant wrist, assesses distal temperature every 10 min; (2) an actimeter (Hobo^® Pendant G Acceleration Data Logger), placed on the non-dominant arm, records body position as the tilt (°) of the vertical axis parallel to the arm (axis X, Figure 1) and motor activity as its acceleration (m/seg²), every 30 s; and (3) a luxometer (Hobo^® Pendant Light-Temperature Data Logger) registers environmental light (in luxes) every 30 s. All subjects wore this equipment 24 h a day for a week, including work and non-work days, under normal-living conditions as already described (Refinetti et al., 2013; Martinez-Nicolas et al., 2019). During the registration period, participants were instructed to follow their usual lifestyle: since sleep assessment was performed within a clinical context, the ACM study was intended to be conducted in conditions as faithful as possible to their normal living at the moment of their complaints.

FIGURE 1

Figure 1. Schematic arrangement of the ambulatory circadian monitoring device Kronowise^®, composed of a luxometer and a temperature data logger placed on the wrist, and an accelerometer placed on the arm (Chronobiology Laboratory, University of Murcia).

Data Analysis

Non-parametric Analysis

The integrated variable TAP was calculated for each subject, from his/her rhythms of skin temperature, motor activity, and body position, as described in Ortiz-Tudela et al. (2010). To this end, the values of skin temperature were normalized between 0 and 1 using the 5th and 95th percentile and inverted; the values of motor activity were normalized with respect to the actimeter placement. Normalized activity values ranged from 0 (total immobility) to 1 (percentile 95th, used as maximal acceleration reference). Arm position was also normalized between 0 (horizontal) to 1 (vertical position). The integrated variable TAP expressed the level of general activation from 0 (minimal activation, high temperature, low activity, and horizontal arm position) to 1 (maximal activation, low temperature, high activity, and vertical arm position). This was used to estimate sleep probability, from a threshold dynamically established for each subject, depending on his/her own rhythms (Ortiz-Tudela et al., 2010).

All these variables, including TAP and estimated sleep, were subsequently subjected to non-parametric analyses (Madrid-Navarro et al., 2019) using the software Circadianware^® implemented in the website Kronowizard^®² (Chronobiology Laboratory, University of Murcia, Murcia, Spain), yielding the following indexes:

• M10v and M5v, and L10v and L5v refer to the mean values of 10 or 5 consecutive hours of maximal (M) and lowest (L) values of the variable. M5v of temperature and L5v of activity, position, and TAP have been used as indexes of sleep depth, while L10v of temperature and M10v of activity, position, and TAP indicate general activation levels during wakefulness. The midpoint timing of these periods (M10h and M5h, and L10h and L5h) was also obtained and used as indexes of circadian phase.

• IS quantifies the repetitiveness of the rhythm across consecutive days. It is obtained according to the following formula:

IS = \frac{n \sum_{h = 1}^{n} {({\bar{x}}_{h} - \bar{x})}^{2}}{p \sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}}

n: total number of data,

p: number of data per day,

$\bar{x}$ : mean value of all the data,

${\bar{x}}_{h}$ : mean value of the data at a specific time of day,

IS ranges from 0 (maximum noise) to 1 (perfect IS, i.e., when the daily wave repeats exactly across days).

Intradaily variability indicates the rhythm fragmentation, which depends on the frequency and extension of the transitions between low and high values within the cycle, according to the following formula:

IV = \frac{n \sum_{i = 2}^{n} {(x_{i} - x_{i - 1})}^{2}}{(n - 1) \sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}}

n: total number of data,

$\bar{x}$ : mean value of all the data.

Intradaily variability values are close to 0 in the case of a perfect sinusoid wave, and approach 2 in the case of Gaussian noise.

• Relative amplitude is a marker of the rhythm amplitude or contrast between wakefulness and sleep values, i.e., the difference between the maximum and minimum values, according to the following formula for motor activity, body position, TAP, and environmental light:

RA = \frac{M10v - L5v}{M10v + L5v}

As wrist temperature and estimated sleep show opposite profiles, RA was calculated as follows:

RA = \frac{M5v - L10v}{L10v + M5v}

The RA for WT was multiplied by 10 in order to amplify its values to range from 0 to 1. Values near 0 in this index indicate null contrast between wakefulness and sleep, while values near 1 express maximal contrast.

• The CFI was developed to characterize rhythm robustness with a single score (Ortiz-Tudela et al., 2010). It integrates normalized values between 0 and 1 for IS, IV, and RA, with IV values being inverted. Accordingly, the CFI ranges from 0 (null circadian rhythmicity) to 1 (maximally robust circadian rhythm).

Machine Learning Analysis

All subjects included in our study were classified by machine learning analysis, using a decision tree, performed on the indexes described above. This analysis was carried out using the software Orange Canvas© (Demšar et al., 2013).

Attribute selection

The selection of attributes for the classification model was based on the TAP variable and sleep probability estimated from it, due to its greater validity over the individual variables (Ortiz-Tudela et al., 2010, 2014). Attribute selection was guided by the expert criterion of including those indexes providing complementary information to one another. Therefore, we aimed to select indexes describing circadian phase and rhythm and sleep quality.

With this in mind, the discriminative potential of the candidate attributes was obtained according to the criterion of information gain (based on entropy reduction) and the statistical criteria of ANOVA (maximization of differences between classes) and Chi-squared (maximization of internal correlation within each class) measures.

Discretization

The selected attributes were reconverted from continuous to discrete values, since classification models built on discrete attributes are more exact than those built on continuous ones (Demšar et al., 2013). Discretization was based on data splits that met the criteria proposed by Maslove et al. (2013) and Liu and Hussain (2002): (1) reflecting the original distribution of the continuous attribute; (2) maintaining the attribute patterns without adding additional spurious patterns; and (3) making sense and being interpretable according to expert criteria.

The discretization method used in our study was minimum description length (MDL) (Fayyad and Irani, 1993). This is a top-down technique that recursively splits the attribute maximizing information gain to the point where a new split would not add any new information to the predictions.

Model validation

The model was evaluated through 10-fold cross-validation, obtaining the following indexes:

• Sensitivity, true positive rate, or recall: probability of correctly classifying a case. It is obtained according to this formula:

Sensitivity = \frac{TP}{TP + FN}

TP: number of true positives,

FN: number of false negatives.

• Accuracy, predictive value: probability of a specific case actually belonging to the class assigned by the model, according to the following formula:

Accuracy = \frac{TP}{TP + FP}

TP: number of true positives,

FP: number of false positives.

• F1 score indicates sensitivity and accuracy together, as the harmonic mean of the two, i.e.,

F 1 = 2 \frac{Sensitivity x Accuracy}{Sensitivity + Accuracy}

• Specificity or true negative rate is the probability of correctly excluding a case from a class where it does not belong, obtained as follows:

Specificity = \frac{TN}{TN + FP}

TN: number of true negatives,

FP: number of false positives.

• False positive rate: probability of erroneously assigning a case to a class where it does not belong, calculated as:

False positives rate = 1 - Specificity

• Receiver operating characteristic (ROC) curve and AUC: the ROC curve is obtained from the graphic representation of sensitivity and false positives rate together. The AUC is a discriminating measure that indicates the capacity of the model to discriminate values of different classes.

Results

Non-parametric Analyses

The circadian markers obtained from ACM and the circadian indexes obtained from non-parametric analyses for every original class (i.e., diagnostic categories based on the clinical history previous to machine learning classification) are summarized in Table 1.

TABLE 1

Table 1. Mean value (standard deviations in brackets) for every index obtained from non-parametric analysis for wrist temperature, motor activity, TAP, estimated sleep, and environmental light.

Attribute Selection

The attributes selected for building the decision tree classification model, according to the criteria of information gain, ANOVA, and Chi-squared were: TAP-L5h as sleep phase marker, TAP-M10h as wakefulness phase marker, CFI of estimated sleep as an indicator of global quality of the rhythm, and TAP-RA, which apart from being an index for rhythm quality, provides information relative to sleep depth and general activation during wakefulness together. Table 2 shows the magnitudes of the criteria used for each of the selected attributes.

TABLE 2

Table 2. Information gain, ANOVA F, and χ² for every attribute selected for building the decision tree (see Figure 1 for legend).

Decision Tree

Subjects were classified as shown in Figure 2, based on the attributes described above. The sleep phase marker TAP-L5h (a marker of mid-sleep time) allows for discriminating between pathologies characterized by sleep onset problems (TAP-L5h later than 5:27 h) and the remaining classes. Subjects with earlier TAP-L5h were then divided according to their TAP-RA, i.e., the contrast between rest and activity period levels. Those with lower TAP-RA (<0.629), classified by the model as insomnia, could be considered as a MI subtype according to its characteristics (explained below). Next, subjects with higher levels of TAP-RA were divided according to their sleep-CFI. This resulted in a group with higher scores (≥0.852), indicating more robust rhythms; these were classified as healthy controls, while a group with lower rhythm robustness, classified by the model as insomnia, could be considered a mild type. On the other hand, the group characterized by a delay in mid-sleep time was subsequently divided according to their TAP-M10h, i.e., the central time of maximal activation. This allowed for differentiating between DSPD (TAP-M10h later than 16:07 h) and a third group of insomnia (TAP-M10 earlier than 16:07 h) compatible with an insomnia onset subtype. In summary, this model generated five final classes: healthy controls, DSPD (with both delayed mid-sleep and central time of maximal activation), and three subtypes of insomnia, henceforth referred to as OI (delayed mid-sleep time, but not central time of maximal activation), MI (normal sleep phase and low RA and CFI), and mild insomnia (which only differed from the controls in terms of their sleep rhythm robustness).

FIGURE 2

Figure 2. Decision tree and cut values used by the model to generate five final classes: DSPD (with both delayed mid-sleep and central time of maximal activation), OI (delayed mid-sleep time, but not central time of maximal activation), MI (normal sleep phase and low relative amplitude and CFI), mild insomnia (which only differed from the controls in terms of their sleep rhythm robustness), and healthy controls. TAP-L5h, midpoint timing of the five consecutive hours of lowest TAP values; TAP-M10h, midpoint timing of the 10 consecutive hours of maximal TAP values; TAP-AR, relative amplitude of TAP rhythm; Sleep CFI, circadian function index of sleep rhythm.

Each class obtained by the decision tree was characterized by a specific circadian profile (Figure 3). Healthy controls presented: (a) deeper sleep: indicated by higher night wrist temperature and sleep probability, and lower motor activity and TAP levels during sleep, and (b) lower daytime temperature, pointing to higher activation levels than the remaining classes. DSPD was characterized by: (a) an evident delay of the nocturnal phase of all variables; (b) low activation during the morning (high temperature, low motor activity, and TAP); and (c) low levels of environmental light during the morning, with no differences with respect to the remaining classes from 17:00 to 23:00 h, and prolonged exposure to environmental light during the night (until 6:00 h, on average).

FIGURE 3

Figure 3. Mean circadian waveforms for (A) wrist temperature, (B) motor activity, (C) environmental light, (D) TAP, and (E) estimated sleep probability of the three main classes: healthy controls (purple), insomnia (red), and DSPD (green). Data are expressed as mean ± SEM.

Figure 4, right panels, shows specific circadian profiles of the different subtypes of insomnia yielded by the model, labeled as onset, maintenance, or mild insomnia according to their characteristics. In consonance with the classification model, OI differed from the maintenance and mild insomnia by later sleep onset and wakeup times, while these two latter cases differed from each other in their TAP-RA. In the mild subtype, TAP showed lower night levels (lower general activation and, therefore, deeper sleep) and higher daytime levels (higher diurnal activation). Figure 4, left panels, shows specific circadian profiles of DSPD and OI. DSPD showed a sleep phase delay when compared to OI and, importantly, a different pattern of diurnal activation and exposure to light. In particular, OI exhibited higher general activation during the morning and more marked postprandial sleep propensity, while DSPD showed higher activation during the evening and early hours of the night. In addition, DSPD subjects were exposed to lower environmental light during the morning than the OI group, and higher levels after noon.

FIGURE 4

Figure 4. Mean circadian waveforms (from top to bottom) for TAP, estimated sleep, and exposure to environmental light rhythms, for insomnia subtypes (left) and onset insomnia vs. DSPD (right). Data are expressed as mean ± SEM.

Model Validation

The model was evaluated through 10-fold cross-validation. Table 3 shows the accuracy, sensitivity, specificity, F1 index, and AUC of the model for each class, while false positive rates are displayed on the ROC curve, together with sensitivity (Figures 5A–C). This validation allowed for obtaining a confusion matrix, the data for which are shown in Figure 5D.

TABLE 3

Table 3. Indexes for model validation.

FIGURE 5

Figure 5. ROC curves for healthy controls (A), insomnia (B), and DSPD (C). (D) The graphical representation of the confusion matrix data, as a function of the attributes TAP-RA (ranging from 0 to 1, Y-axis) and TAP-L5h (time of day, X-axis). Solid dots indicate true positives, and empty dots indicate false negatives. Cases classified as DSPD (green dots) were characterized by later mid-sleep time (TAP-L5h) than the other classes, while their TAP-RA scored within a wide range. Insomnia (red dots) exhibited a mid-sleep time between 3:00 and 5:00 h and, as in the case of DSPD, a wide range of TAP-RA scores. Healthy controls showed a mid-sleep time primarily centered around 4:00 h and predominantly high values of TAP-RA (0.65–0.7).

Discussion

The present study proposes a novel approach for clinically detecting sleep disorders using circadian markers. To this end, we evaluated the diagnostic potential of an ACM technique based on the simultaneous assessment of wrist temperature, motor activity, body position, and environmental light in a sample of patients suffering from either insomnia or DSPD, and a control group consisting of subjects without any circadian or sleep pathology. The potential of this technique for differential diagnostic was explored through machine learning analysis.

Machine learning methods are revolutionizing the field of clinical research as a support for differential diagnosis. They have recently been applied to widely different pathologies, such as diabetes (Dagliati et al., 2017), autism and attention-deficit hyperactivity disorder (ADHD) (Duda et al., 2017), glaucoma (Kim et al., 2017), Parkinson’s disease (Kubota et al., 2016), and pediatric inflammatory bowel disease (Mossotto et al., 2017).

To apply this technique in our study, we mainly based our analysis on the integrated variable TAP, obtained from wrist temperature, motor activity, and body position, since it showed higher accuracy for estimating sleep than any of the individual variables integrating it (Ortiz-Tudela et al., 2010, 2014). The circadian parameters of sleep estimated from TAP were also considered. A decision tree was built from just four of the indexes, calculated by non-parametric analyses (Refinetti et al., 2013). They included a sleep phase marker (TAP-L5h), a wakefulness phase marker (TAP-M10h), a global index of circadian rhythms robustness (sleep CFI), and TAP-RA. Besides quantifying the RA of the rhythms, which is a direct measure of rhythms quality, TAP-RA indirectly yields information on sleep depth and diurnal activation. Indeed, this index showed more discriminative power in our sample than the sleep depth estimators (TAP and motor activity L5v) themselves, probably because information on daytime is also considered.

Apart from discriminating among the three original categories, i.e., insomnia, DSPD, and healthy controls, the decision tree yielded three different groups of insomnia, which was likely facilitated by the large sample size and the consequent variability of the original insomnia group. One of them was characterized by a later mid-sleep time than the others, which pointed to sleep onset difficulties. Therefore, this subtype was labeled as OI. According to its late mid-sleep time, this insomnia subgroup emerged from the same branch as DSPD. The discrimination between these two pathologies thus relied on their central time of maximal activation, which was later in DSPD. This fits the expected circadian profile for DSPD, as it can be considered to be the clinical manifestation of extreme evening-types (Lack et al., 2009). In contrast, and according to our results, this would not be the case for OI, since it is not a circadian alteration. This finding is noteworthy, given the overlapping symptoms of these two pathologies (Gradisar and Crowley, 2013; Sivertsen et al., 2013; Richardson et al., 2015), especially when the diagnosis is based only on interviews. Our approach strongly supports the objective consideration of sleep–wake patterns and highlights the relevance of taking into account circadian rhythms when clinically evaluating sleep.

Interestingly, DSPD also differed from the other classes (including OI) in terms of its exposure rhythm to environmental light. In particular, it was characterized by null light exposure during the early hours of the morning, together with light exposure during a large part of the night. However, our model does not yield any information about causal relationships. Thus, we cannot infer whether their phase delay was partially a consequence or a cause of their light exposure pattern, since sleep–wake schedules modulate the light exposure schedules. In any case, it seems clear that there is a feedback between both factors and that this pattern of light exposure may contribute to maintaining the sleep and circadian alteration (Auger et al., 2011).

The subjects not showing delayed mid-sleep time were first divided by the RA of their rhythms, yielding another insomnia subgroup. This low contrast between sleep and wakefulness activation is congruent with fractioned sleep and daytime sleepiness. Therefore, this subgroup was considered to be compatible with MI. Among the remaining subjects, i.e., those with a high relative rhythm amplitude, another subgroup of insomnia emerged, which barely differed from healthy controls. The criterion used to differentiate between them was the robustness of the sleep rhythm, but their circadian profile in terms of the mean circadian waveforms was very similar. Thus, this subgroup of insomnia most likely consists of patients with low alteration, and it was therefore labeled as mild insomnia. It is important to note that our labeling of the insomnia subgroups was not meant as a diagnosis labeling, but as a characterization of the different groups obtained.

The most relevant limitation of our study was the large difference in the sample size of the initial categories and, specifically, the small sample size of healthy controls. This was due to the fact that participants were retrospectively selected from a clinical database, thus subjects suffering sleep and/or circadian disorders were more frequent than healthy controls. Such disparity of sample sizes may affect the quality of the model, as it would give more weight to the attributes that would better discriminate among the classes with larger sample size (in our case, insomnia), at the expense of the smaller classes, with the aim of maximizing the number of true positives. The most evident consequence of this limitation is the relatively high probability of misclassifying healthy participants as insomnia, in comparison with other groups. But this entails the risk of also increasing the rate of false positives in the largest group. In other words, the model would make decisions aimed at increasing sensitivity to insomnia, at the expense of specificity. Indeed, according to our results, the specificity for insomnia was the lowest, while it was very high in the other two categories (>95% in the case of DSPD and >99% in the case of the control group).

Despite this limitation, in general, our results showed high rates of sensitivity, accuracy, and specificity, thus confirming that our model was highly reliable for discriminating between the pathologies studied. However, future studies should address this limitation by applying this method to larger, and in particular, more balanced samples.

Another limitation would be that, since the present study follows a retrospective design, the device employed at the time of the data collection would result a little bit old fashioned nowadays. Nonetheless, it had previously shown to be highly useful and accurate for assessing circadian rhythms and sleep (Ortiz-Tudela et al., 2010, 2014). In fact, currently our Chronobiology Laboratory has implemented all temperature, motor activity, and environmental light sensors in a unique device, more modern and sophisticated that assesses simultaneously 15 different variables with a sampling rate of 10 Hz (Madrid-Navarro et al., 2019). It has been recently validated for Parkinson’s disease and allows also an accurate detection of the light type, intensity, and timing the subject is exposed under ambulatory conditions (Arguelles-Prieto et al., 2019).

So far, the results of the present study can be highly useful in clinical practice, since they permit a better characterization of insomnia and DSPD. Furthermore, our results inspire further research to address other classification models for various disorders, such as obstructive sleep apnea/hypopnea (OSAH) or advanced sleep phase disorder (ASPD). In addition, the validation of the ACM technique also encourages its application in research, as it would seem to be suitable for sample selection, evaluation, and even the monitoring of habits in order to offer recommendations (e.g., on light exposure and a sedentary lifestyle). The combination of wearable devices, which make it possible to record millions of data points per subject, and the development of methods based on massive data management and data mining (e.g., machine learning methods) facilitate “digital health,” providing new diagnostic tools and allowing personalized therapies in broad population samples and at an affordable cost.

Data Availability Statement

The original datasets generated for this study are available on request to the corresponding author.

Ethics Statement

The study was retrospective, included no intervention but the usual standard practice at the clinic. A personal dissociated database was used in order to comply with the Spanish Personal Data Protecting Law.

Author Contributions

JM, MR, AC, and BR-M: study concept and design. EE, JA, and FS: acquisition of data. CE-D and BR-M: coordination and managing of database. MC, BR-M, and JM: analysis and interpretation of data. BR-M, JM, and MR: drafting of manuscript. BR-M, EE, CE-D, JA, FS, AC, JM, and MR: critical revision of the manuscript.

Funding

This work was supported by the Ministry of Economy and Competitiveness, the Instituto de Salud Carlos III through CIBERFES (CB16/10/00239) and grants 19899/GERM/15 awarded to JM and the Ministry of Science Innovation and Universities RTI2018-093528-B-I00 to MR (all of them co-financed by FEDER).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We are grateful to the Spanish Sleep Society for the financial support through the grant to BR-M.

Abbreviations

ACM, ambulatory circadian monitoring; AUC, area under curve; CFI, circadian function index; DSPD, delayed sleep phase disorder; IS, interdaily stability; IV, intradaily variability; MI, maintenance insomnia; OI, onset insomnia; PI, primary insomnia; RA, relative amplitude.

Footnotes

References

American Academy of Sleep Medicine, (2005). The International Classification of Sleep Disorders: Diagnostic & Coding Manual, 2nd Edn, Westchester IL: American Academy of Sleep Medicine.

Google Scholar

Ancoli-Israel, S., Cole, R., Alessi, C., Chambers, M., Moorcroft, W., and Pollak, C. P. (2003). The role of actigraphy in the study of sleep and circadian rhythms. Sleep 26, 342–392. doi: 10.1093/sleep/26.3.342

PubMed Abstract | CrossRef Full Text | Google Scholar

Arguelles-Prieto, R., Bonmati-Carrion, M. A., Rol, M. A., and Madrid, J. A. (2019). Determining light intensity, timing and type of visible and circadian light from an ambulatory circadian monitoring device. Front. Physiol. 10:822. doi: 10.3389/fphys.2019.00822

PubMed Abstract | CrossRef Full Text | Google Scholar

Auger, R. R., Burgess, H. J., Dierkhising, R. A., Sharma, R. G., and Slocumb, N. L. (2011). Light exposure among adolescents with delayed sleep phase disorder: a prospective cohort study. Chronobiol. Int. 28, 911–920. doi: 10.3109/07420528.2011.619906

PubMed Abstract | CrossRef Full Text | Google Scholar

Bonmatí-Carrión, M. Á, Middleton, B., Revell, V. L., Skene, D. J., Rol, M. Á, and Madrid, J. A. (2014). Circadian phase assessment by ambulatory monitoring in humans: correlation with dim light melatonin onset. Chronobiol. Int. 31, 31–57.

Google Scholar

Borbély, A. A., Daan, S., Wirz-Justice, A., and Deboer, T. (2016). The two-process model of sleep regulation: a reappraisal. J. Sleep Res. 25, 131–143. doi: 10.1111/jsr.12371

PubMed Abstract | CrossRef Full Text | Google Scholar

Dagliati, A., Marini, S., Sacchi, L., Cogni, G., Marsida, T., Tibollo, V., et al. (2017). Machine learning methods to predict diabetes complications. J. Diabetes Sci. Technol. 12, 295–302. doi: 10.1177/1932296817706375

PubMed Abstract | CrossRef Full Text | Google Scholar

Demšar, J., Curk, T., Erjavec, A., Hočevar, T., Milutinovič, M., Možina, M., et al. (2013). Orange: data mining toolbox in python. J. Mach. Learn. Res. 14, 2349–2353.

Google Scholar

Duda, M., Haber, N., Daniels, J., and Wall, D. P. (2017). Crowdsourced validation of a machine-learning classification system for autism and ADHD. Transl. Psychiatr. 7:e1133. doi: 10.1038/tp.2017.86

PubMed Abstract | CrossRef Full Text | Google Scholar

Fayyad, U. M., and Irani, K. B. (1993). “Multi-interval discretization of continuous-valued attributes for classification learning,” in Proceedings of the 13th International Joint Conference on Artificial Intelligence, Chambery.

Google Scholar

Goel, N., Basner, M., Rao, H., and Dinges, D. F. (2014). Circadian rhythms, sleep deprivation, and human performance. Prog. Mol. Biol. Transl. Sci. 119, 155–190. doi: 10.1016/B978-0-12-396971-2.00007-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Gradisar, M., and Crowley, S. J. (2013). Delayed sleep phase disorder in youth. Curr. Opin. Psychiatr. 26, 580–585. doi: 10.1097/YCO.0b013e328365a1d4

PubMed Abstract | CrossRef Full Text | Google Scholar

Gradisar, M., Gardner, G., and Dohnt, H. (2011). Recent worldwide sleep patterns and problems during adolescence: a review and meta-analysis of age, region, and sleep. Sleep Med. 12, 110–118. doi: 10.1016/j.sleep.2010.11.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, S. J., Cho, K. J., and Oh, S. (2017). Development of machine learning models for diagnosis of glaucoma. PLoS One 12:e0177726. doi: 10.5061/dryad.q6ft5

PubMed Abstract | CrossRef Full Text | Google Scholar

Kubota, K. J., Chen, J. A., and Little, M. A. (2016). Machine learning for large-scale wearable sensor data in Parkinson’s disease: concepts, promises, pitfalls, and futures. Mov. Disord. 31, 1314–1326. doi: 10.1002/mds.26693

PubMed Abstract | CrossRef Full Text | Google Scholar

Lack, L., Bailey, M., Lovato, N., and Wright, H. (2009). Chronotype differences in circadian rhythms of temperature, melatonin, and sleepiness as measured in a modified constant routine protocol. Nat. Sci. Sleep 1, 1–8. doi: 10.2147/NSS.S6234

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, H., and Hussain, F. (2002). Discretization: an enabling technique. Data Min. Knowl. Disc. 6, 393–423.

Google Scholar

Lovato, N., Gradisar, M., Short, M., Dohnt, H., and Micic, G. (2013). Delayed sleep phase disorder in an Australian school-based sample of adolescents. J. Clin. Sleep Med. 15, 939–944. doi: 10.5664/jcsm.2998

PubMed Abstract | CrossRef Full Text | Google Scholar

Madrid-Navarro, C. J., Javier, F., Cuesta, P., Escamilla-Sevilla, F., Campos, M., and Abellán, F. R. (2019). Validation of a device for the ambulatory monitoring of sleep patterns: a pilot study on Parkinson’s disease. Front. Neurol. 10:356. doi: 10.3389/fneur.2019.00356

PubMed Abstract | CrossRef Full Text | Google Scholar

Martinez-Nicolas, A., Madrid, J. A., García, F. J., Campos, M., Moreno-Casbas, M. T., Almaida-Pagan, P. F., et al. (2018). Circadian monitoring as an aging predictor. Sci. Rep. 8:15027. doi: 10.1038/s41598-018-33195-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Martinez-Nicolas, A., Madrid, J. A., and Rol, M. A. (2014). Day–night contrast as source of health for the human circadian system. Chronobiol. Int. 31, 382–393. doi: 10.3109/07420528.2013.861845

PubMed Abstract | CrossRef Full Text | Google Scholar

Martinez-Nicolas, A., Martinez-Madrid, M. J., Almaida-Pagan, P. F., Bonmati-Carrion, M. A., Madrid, J. A., and Rol, M. A. (2019). Assessing chronotypes by ambulatory circadian monitoring. Front. Physiol. 10:1396. doi: 10.3389/fphys.2019.01396

CrossRef Full Text | Google Scholar

Martinez-Nicolas, A., Ortiz-Tudela, E., Madrid, J. A., and Rol, M. A. (2011). Crosstalk between environmental light and internal time in humans. Chronobiol. Int. 28, 617–629. doi: 10.3109/07420528.2011.593278

PubMed Abstract | CrossRef Full Text | Google Scholar

Martinez-Nicolas, A., Ortiz-Tudela, E., Rol, M. Á, and Madrid, J. A. (2013). Uncovering different masking factors on wrist skin temperature rhythm in free-living subjects. PLoS One 8:e61142. doi: 10.1371/journal.pone.0061142

PubMed Abstract | CrossRef Full Text | Google Scholar

Maslove, D. M., Podchiyska, T., and Lowe, H. J. (2013). Discretization of continuous features in clinical datasets. J. Am. Med. Inform. Assoc. 20, 544–553. doi: 10.1136/amiajnl-2012-000929

PubMed Abstract | CrossRef Full Text | Google Scholar

Mossotto, E., Ashton, J. J., Coelho, T., Beattie, R. M., MacArthur, B. D., and Ennis, S. (2017). Classification of paediatric inflammatory bowel disease using machine learning. Sci. Rep. 7:2427. doi: 10.1038/s41598-017-02606-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Mullington, J. M., Abbott, S. M., Carroll, J. E., Davis, C. J., Dijk, D.-J., Dinges, D. F., et al. (2016). Developing biomarker arrays predicting sleep and circadian-coupled risks to health. Sleep 39, 727–736. doi: 10.5665/sleep.5616

PubMed Abstract | CrossRef Full Text | Google Scholar

Ortiz-Tudela, E., Martinez-Nicolas, A., Albares, J., Segarra, F., Campos, M., Estivill, E., et al. (2014). Ambulatory circadian monitoring (ACM) based on thermometry, motor activity and body position (TAP): a comparison with polysomnography. Physiol. Behav. 126, 30–38. doi: 10.1016/j.physbeh.2013.12.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Ortiz-Tudela, E., Martinez-Nicolas, A., Campos, M., Rol, M. Á, and Madrid, J. A. (2010). A new integrated variable based on thermometry, actimetry and body position (TAP) to evaluate circadian system status in humans. PLoS Comput. Biol. 6:e1000996. doi: 10.1371/journal.pcbi.1000996

PubMed Abstract | CrossRef Full Text | Google Scholar

Owens, J., Maxim, R., McGuinn, M., Nobile, C., Msall, M., and Alario, A. (1999). Television-viewing habits and sleep disturbance in school children. Pediatrics 104:e27. doi: 10.1542/peds.104.3.e27

PubMed Abstract | CrossRef Full Text | Google Scholar

Refinetti, R., Lissen, G. C., and Halberg, F. (2013). Procedures for numerical analysis of circadian rhythms. Biol. Rhythm Res. 38, 275–325. doi: 10.1080/09291010600903692

PubMed Abstract | CrossRef Full Text | Google Scholar

Richardson, C. E., Gradisar, M., and Barbero, S. C. (2015). Are cognitive “insomnia” processes involved in the development and maintenance of delayed sleep wake phase disorder? Sleep Med. Rev. 26, 1–8. doi: 10.1016/j.smrv.2015.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Roenneberg, T., Wirz-Justice, A., and Merrow, M. (2003). Life between clocks: daily temporal patterns of human chronotypes. J. Biol. Rhythm. 18, 80–90. doi: 10.1177/0748730402239679

PubMed Abstract | CrossRef Full Text | Google Scholar

Rokach, L., and Maimon, O. (2005). Top-down induction of decision trees classifiers—a survey. IEEE Trans. Syst. Man Cyber. 35, 476–487. doi: 10.1109/TSMCC.2004.84324731

CrossRef Full Text | Google Scholar

Roth, T. (2007). Insomnia: definition, prevalence, etiology, and consequences. J. Clin. Sleep Med. 3(5 Suppl.), 3–6. doi: 10.1378/chest.14-0970

PubMed Abstract | CrossRef Full Text | Google Scholar

Sadeh, A., and Acebo, C. (2002). The role of actigraphy in sleep medicine. Sleep Med. Rev. 6, 113–124. doi: 10.1053/smrv.2001.0182

PubMed Abstract | CrossRef Full Text | Google Scholar

Sarabia, J. A., Rol, M. Á, Mendiola, P., and Madrid, J. A. (2008). Circadian rhythm of wrist temperature in normal-living subjects. A candidate of new index of the circadian system. Physiol. Behav. 95, 570–580. doi: 10.1016/j.physbeh.2008.08.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Schrader, H., Bovim, G., and Sand, T. (1993). The prevalence of advanced and delayed sleep phase syndromes. J. Sleep Res. 2, 51–55. doi: 10.1111/j.1365-2869.1993.tb00061.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Serrano, J. I., Romero, J. P., Castillo, M. D., del Rocon, E., Louis, E. D., and Benito-León, J. (2017). A data mining approach using cortical thickness for diagnosis and characterization of essential tremor. Sci. Rep. 7:2190. doi: 10.1038/s41598-017-02122-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Shochat, T. (2012). Impact of lifestyle and technology developments on sleep. Nat. Sci. Sleep 4, 19–31. doi: 10.2147/NSS.S18891

PubMed Abstract | CrossRef Full Text | Google Scholar

Shochat, T., Flint-Bretler, O., and Tzischinsky, O. (2010). Sleep patterns, electronic media exposure and daytime sleep-related behaviours among Israeli adolescents. Acta Paediatr. Int. J. Paediatr. 99, 1396–1400. doi: 10.1111/j.1651-2227.2010.01821.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Sivertsen, B., Pallesen, S., Stormark, K. M., Bøe, T., Lundervold, A. J., and Hysing, M. (2013). Delayed sleep phase syndrome in adolescents: prevalence and correlates in a large population based study. BMC Public Health 13:1163. doi: 10.1186/1471-2458-13-1163

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Den Bulck, J. (2004). Television viewing, computer game playing, and internet use and self-reported time to bed and time out of bed in secondary-school children. Sleep 27, 101–104. doi: 10.1093/sleep/27.1.101

PubMed Abstract | CrossRef Full Text | Google Scholar

Weitzman, E., Czeisler, C., Coleman, R., Spielman, A., Zimmerman, J., Dement, W., et al. (1981). Delayed sleep phase syndrome: a chronobiological disorder with sleep-onset insomnia. Arch. Gen. Psychiatry 38, 737–746.

PubMed Abstract | Google Scholar

World Health Organization [WHO] (2008). International Statistical Classification of Diseases and Related Health Problems. Geneva: World Health Organization.

Google Scholar

Yazaki, M., Shirakawa, S., Okawa, M., and Takahashi, K. (1999). Demography of sleep disturbances associated with circadian rhythm disorders in Japan. Psychiatr. Clin. Neurosci. 53, 267–268. doi: 10.1046/j.1440-1819.1999.00533.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Zornoza-Moreno, M., Fuentes-Hernández, S., Prieto-Sánchez, M. T., Blanco, J. E., Pagán, A., and Rol, M. -Á, et al. (2013). Influence of gestational diabetes on circadian rhythms of children and their association with fetal adiposity. Diabetes Metab. Res. Rev. 29, 483–491. doi: 10.1002/dmrr.2417

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: circadian rhythms, wrist temperature, actigraphy, light exposure, insomnia, delayed sleep phase, decision tree, digital health

Citation: Rodriguez-Morilla B, Estivill E, Estivill-Domènech C, Albares J, Segarra F, Correa A, Campos M, Rol MA and Madrid JA (2019) Application of Machine Learning Methods to Ambulatory Circadian Monitoring (ACM) for Discriminating Sleep and Circadian Disorders. Front. Neurosci. 13:1318. doi: 10.3389/fnins.2019.01318

Received: 29 March 2019; Accepted: 25 November 2019;
Published: 10 December 2019.

Edited by:

Zhi-Li Huang, Fudan University, China

Reviewed by:

Karen L. Gamble, The University of Alabama at Birmingham, United States
Thomas Penzel, Charité Medical University of Berlin, Germany

Copyright © 2019 Rodriguez-Morilla, Estivill, Estivill-Domènech, Albares, Segarra, Correa, Campos, Rol and Madrid. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Maria Angeles Rol, YW5nZXJvbEB1bS5lcw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.