<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioeng. Biotechnol.</journal-id>
<journal-title>Frontiers in Bioengineering and Biotechnology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioeng. Biotechnol.</abbrev-journal-title>
<issn pub-type="epub">2296-4185</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fbioe.2015.00114</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioengineering and Biotechnology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Comparison of Standard and Novel Signal Analysis Approaches to Obstructive Sleep Apnea Classification</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Roebuck</surname> <given-names>Aoife</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/186019"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Clifford</surname> <given-names>Gari D.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/266185"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Engineering Science, University of Oxford</institution>, <addr-line>Oxford</addr-line>, <country>UK</country></aff>
<aff id="aff2"><sup>2</sup><institution>The Sleep &#x00026; Circadian Neuroscience Institute, University of Oxford</institution>, <addr-line>Oxford</addr-line>, <country>UK</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Biomedical Informatics, Emory University</institution>, <addr-line>Atlanta, GA</addr-line>, <country>USA</country></aff>
<aff id="aff4"><sup>4</sup><institution>Department of Biomedical Engineering, Georgia Institute of Technology</institution>, <addr-line>Atlanta, GA</addr-line>, <country>USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Pedro Gomez-Vilda, Universidad Polit&#x000E9;cnica de Madrid, Spain</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jes&#x000FA;s B. Alonso-Hern&#x000E1;ndez, University of Las Palmas de Gran Canaria, Spain; Jose Luis Blanco, Universidad Polit&#x000E9;cnica de Madrid, Spain</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: Aoife Roebuck, Department of Engineering Science, IBME, University of Oxford, Roosevelt Drive, Oxford OX3 7DQ, UK, <email>aoife.roebuck&#x00040;gmail.com</email></corresp>
<fn fn-type="other" id="fn001"><p>Specialty section: This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Bioengineering and Biotechnology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>27</day>
<month>08</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date><volume>3</volume>
<elocation-id>114</elocation-id>
<history>
<date date-type="received">
<day>23</day>
<month>04</month>
<year>2015</year>
</date>
<date date-type="accepted">
<day>27</day>
<month>07</month>
<year>2015</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2015 Roebuck and Clifford.</copyright-statement>
<copyright-year>2015</copyright-year>
<copyright-holder>Roebuck and Clifford</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Obstructive sleep apnea (OSA) is a disorder characterized by repeated pauses in breathing during sleep, which leads to deoxygenation and voiced chokes at the end of each episode. OSA is associated with daytime sleepiness and an increased risk of serious conditions such as cardiovascular disease, diabetes, and stroke. Between 2 and 7% of the adult population globally has OSA, but it is estimated that up to 90% of those are undiagnosed and untreated. Diagnosis of OSA requires expensive and cumbersome screening. Audio offers a potential non-contact alternative, particularly with the ubiquity of excellent signal processing on every phone. Previous studies have focused on the classification of snoring and apneic chokes. However, such approaches require accurate identification of events. This leads to limited accuracy and small study populations. In this work, we propose an alternative approach which uses multiscale entropy (MSE) coefficients presented to a classifier to identify disorder in vocal patterns indicative of sleep apnea. A database of 858 patients was used, the largest reported in this domain. Apneic choke, snore, and noise events encoded with speech analysis features were input into a linear classifier. Coefficients of MSE derived from the first 4&#x02009;h of each recording were used to train and test a random forest to classify patients as apneic or not. Standard speech analysis approaches for event classification achieved an out-of-sample accuracy (Ac) of 76.9% with a sensitivity (Se) of 29.2% and a specificity (Sp) of 88.7% but high variance. For OSA severity classification, MSE provided an out-of-sample Ac of 79.9%, Se of 66.0%, and Sp&#x02009;&#x0003D;&#x02009;88.8%. Including demographic information improved the MSE-based classification performance to Ac&#x02009;&#x0003D;&#x02009;80.5%, Se&#x02009;&#x0003D;&#x02009;69.2%, and Sp&#x02009;&#x0003D;&#x02009;87.9%. 
These results indicate that audio recordings could be used in screening for OSA, but are generally under-sensitive.</p>
</abstract>
<kwd-group>
<kwd>audio</kwd>
<kwd>MSE</kwd>
<kwd>OSA</kwd>
<kwd>LPC</kwd>
<kwd>MFCCs</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="7"/>
<equation-count count="0"/>
<ref-count count="40"/>
<page-count count="9"/>
<word-count count="7605"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<label>1</label> <title>Introduction</title>
<p>Obstructive Sleep Apnea (OSA) is a disorder that causes breathing to be interrupted repeatedly during sleep. An interruption in breathing results in deoxygenation, leading to a brief arousal (a period of light sleep or wakefulness which is usually not noticed by the individual). Repeated arousals lead to short-term problems, such as daytime sleepiness and poor concentration. OSA is also associated with increased risk of serious chronic conditions, including cardiovascular disease (Monahan and Redline, <xref ref-type="bibr" rid="B26">2011</xref>) and diabetes (Rosenfeld, <xref ref-type="bibr" rid="B33">2014</xref>).</p>
<p>OSA is relatively common, affecting 2&#x02013;7% of the global adult population; the prevalence is similar in the developed and developing world, with a high variance across ethnic groups (Young et al., <xref ref-type="bibr" rid="B40">1993</xref>; Bearpark et al., <xref ref-type="bibr" rid="B2">1995</xref>; Ip et al., <xref ref-type="bibr" rid="B15">2001</xref>, <xref ref-type="bibr" rid="B16">2004</xref>; Kim et al., <xref ref-type="bibr" rid="B22">2004</xref>; Udwadia et al., <xref ref-type="bibr" rid="B36">2004</xref>; Sharma et al., <xref ref-type="bibr" rid="B34">2006</xref>; Lam et al., <xref ref-type="bibr" rid="B23">2007</xref>). It is usually diagnosed on the basis of an overnight sleep study, where data including photoplethysmography (PPG), respiratory effort, electrocardiography, audio, and activity are typically recorded (Roebuck et al., <xref ref-type="bibr" rid="B32">2014</xref>). Because of the amount of data that are collected and analyzed, the screening process can be expensive, typically costing more than &#x00024;1000 (Epstein and Dorlac, <xref ref-type="bibr" rid="B11">1998</xref>). In addition, the screening process removes patients from their normal sleeping environment, preventing repeatable unbiased studies. It is estimated that up to 90% of individuals with OSA are undiagnosed and untreated (Young et al., <xref ref-type="bibr" rid="B39">1997</xref>). Screening of OSA is particularly poor in developing countries, where the resources required for conventional screening and diagnosis are often unavailable. While conventional screening of OSA is expensive, some treatment for those diagnosed with the condition can be relatively cheap; oral appliances, which attempt to enlarge the pharyngeal airway during sleep by holding the tongue or jaw forward, are available for as little as &#x000A3;35 (Somno-Guard 3, Tomed, Bensheim, Germany). 
Lifestyle changes such as losing weight can often resolve mild to moderate OSA, and oral appliances are available for more serious cases. A cheap method of screening for OSA could therefore greatly reduce the burden of OSA on the healthcare system, particularly in developing countries where sleep lab facilities are very limited.</p>
<p>Since audio recordings are particularly easy to collect, and most mobile phones have in-built high quality audio recording capabilities, this article explores whether audio analysis alone is sufficient to diagnose OSA. This is a comparison between two contrasting methods: a standard approach from speech analysis, which has been used before, and a novel approach that captures non-stationary periodicities. There are a number of approaches that are used to determine whether an event is apneic or not, or whether a subject suffers from OSA. Events can be analyzed using LPC (Ng et al., <xref ref-type="bibr" rid="B27">2008a</xref>), mel-frequency cepstrum coefficients (MFCCs) (Cavusoglu et al., <xref ref-type="bibr" rid="B5">2007</xref>), power spectrum (Ng et al., <xref ref-type="bibr" rid="B28">2008b</xref>), energy distribution (Jones et al., <xref ref-type="bibr" rid="B21">2005</xref>, <xref ref-type="bibr" rid="B19">2006a</xref>,<xref ref-type="bibr" rid="B20">b</xref>; Cavusoglu et al., <xref ref-type="bibr" rid="B5">2007</xref>), and pitch (Abeyratne et al., <xref ref-type="bibr" rid="B1">2005</xref>). Active devices (Michaelson et al., <xref ref-type="bibr" rid="B25">2006</xref>) and audio recorded while the subject is awake (Goldshtein et al., <xref ref-type="bibr" rid="B13">2011</xref>) have both been used to determine whether a subject has apnea or not. Although event detection is more common, it is also more laborious, as it needs a gold standard for comparison, which involves annotating the entire night&#x02019;s recording for each subject in the data set. Active devices and speech signals allow the subject to be diagnosed, rather than individual events to be identified. LPC and MFCCs were chosen as the standard approach from speech analysis because they are two of the most common methods used to determine whether an event is apneic or not. 
From the literature, LPC separates apneic snores from benign snores with Se&#x02009;&#x0003D;&#x02009;88% and Sp&#x02009;&#x0003D;&#x02009;82% (Ng et al., <xref ref-type="bibr" rid="B27">2008a</xref>), while MFCCs achieve Se&#x02009;&#x0003D;&#x02009;82% (Cavusoglu et al., <xref ref-type="bibr" rid="B5">2007</xref>). LPC can provide accurate estimates of speech parameters while not being computationally intensive (Rabiner and Schafer, <xref ref-type="bibr" rid="B29">2007</xref>). However, it assumes stationarity, which may not hold over the event duration. MFCCs were used (where the frequency bands are equally spaced on the mel scale) as they approximate the human auditory system&#x02019;s response more closely than the linearly spaced frequency bands used in the normal cepstrum. This frequency warping can allow for better representation of sound.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<label>2</label> <title>Materials and Methods</title>
<sec id="S2-1">
<label>2.1</label> <title>Data</title>
<p>The data used in this study were provided retrospectively by collaborators at the Respiratory Medicine Group at the Churchill Hospital (Oxford, UK)<xref ref-type="fn" rid="fn1"><sup>1</sup></xref>. Each subject used a portable home sleep study device, Grey Flash (Stowood Scientific Instruments Ltd., Oxford, UK), which recorded a finger PPG from which oxygen saturation and pulse rate were derived, nasal airflow and nasal sound from a nasal cannula, body movement and body position from an accelerometer, and audio from a microphone placed on the nasal cannula. It should be noted that the subject was in charge of connecting themselves to the device, and that each device was calibrated to have approximately constant gain for the audio signal.</p>
<p>A total of 1354 overnight recordings with associated diagnoses were reviewed, of which only 858 were used in this study; 496 recordings were excluded for being too short (less than 4&#x02009;h) or for having a diagnosis unrelated to OSA, such as lung disease and asthma. The demographics for the subjects can be found in Table <xref ref-type="table" rid="T1">1</xref>, broken down by severity of OSA. The table also shows the apnea-hypopnea index (AHI), the oxygen desaturation index (ODI), and the Epworth sleepiness score (ESS). The AHI is the average number of apneas (cessations of breathing) and hypopneas (episodes of shallow breathing) per hour. The ODI is the average number of oxygen desaturations per hour [where a desaturation is defined as at least a 4% decrease below the previous high; no desaturation can be scored unless the saturation subsequently rises by more than 3% (Stradling and Crosby, <xref ref-type="bibr" rid="B35">1990</xref>)]. The AHI and ODI were automatically calculated by the software used to analyze the data in the clinic (Visi-Download, Stowood Scientific Instruments Ltd., Oxford, UK), which applies the ODI definition given above. The ESS was derived from a standardized questionnaire designed to measure daytime sleepiness (Johns, <xref ref-type="bibr" rid="B17">1991</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Subject demographics for each sub-group: normal, snorer, mild OSA, moderate OSA, and severe OSA (mean&#x02009;&#x000B1;&#x02009;&#x003C3;)</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Group</th>
<th align="center">Normal</th>
<th align="center">Snorer</th>
<th align="center">Mild</th>
<th align="center">Moderate</th>
<th align="center">Severe</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Gender</td>
<td align="center">80 m, 75 f</td>
<td align="center">166 m, 91 f</td>
<td align="center">79 m, 28 f</td>
<td align="center">94 m, 30 f</td>
<td align="center">167 m, 48 f</td>
</tr>
<tr>
<td align="left">Age (years)</td>
<td align="center">45.9&#x02009;&#x000B1;&#x02009;17.1</td>
<td align="center">46.5&#x02009;&#x000B1;&#x02009;12.0</td>
<td align="center">50.5&#x02009;&#x000B1;&#x02009;11.4</td>
<td align="center">53.1&#x02009;&#x000B1;&#x02009;12.4</td>
<td align="center">52.5&#x02009;&#x000B1;&#x02009;12.6</td>
</tr>
<tr>
<td align="left">Neck (cm)</td>
<td align="center">39.4&#x02009;&#x000B1;&#x02009;4.6</td>
<td align="center">41.4&#x02009;&#x000B1;&#x02009;4.3</td>
<td align="center">41.9&#x02009;&#x000B1;&#x02009;4.1</td>
<td align="center">42.9&#x02009;&#x000B1;&#x02009;3.8</td>
<td align="center">45.0&#x02009;&#x000B1;&#x02009;4.8</td>
</tr>
<tr>
<td align="left">Height (cm)</td>
<td align="center">171.2&#x02009;&#x000B1;&#x02009;10.7</td>
<td align="center">173.5&#x02009;&#x000B1;&#x02009;10.4</td>
<td align="center">174.2&#x02009;&#x000B1;&#x02009;9.9</td>
<td align="center">173.0&#x02009;&#x000B1;&#x02009;9.7</td>
<td align="center">175.0&#x02009;&#x000B1;&#x02009;9.1</td>
</tr>
<tr>
<td align="left">Weight (kg)</td>
<td align="center">77.7&#x02009;&#x000B1;&#x02009;23.0</td>
<td align="center">96.0&#x02009;&#x000B1;&#x02009;24.2</td>
<td align="center">212.0&#x02009;&#x000B1;&#x02009;48.8</td>
<td align="center">221.2&#x02009;&#x000B1;&#x02009;49.5</td>
<td align="center">247.3&#x02009;&#x000B1;&#x02009;74.4</td>
</tr>
<tr>
<td align="left">AHI (events/h)</td>
<td align="center">4.4&#x02009;&#x000B1;&#x02009;7.5</td>
<td align="center">6.4&#x02009;&#x000B1;&#x02009;7.4</td>
<td align="center">10.6&#x02009;&#x000B1;&#x02009;9.0</td>
<td align="center">21.5&#x02009;&#x000B1;&#x02009;11.6</td>
<td align="center">47.5&#x02009;&#x000B1;&#x02009;24.5</td>
</tr>
<tr>
<td align="left">ODI (events/h)</td>
<td align="center">3.7&#x02009;&#x000B1;&#x02009;3.5</td>
<td align="center">6.0&#x02009;&#x000B1;&#x02009;5.2</td>
<td align="center">10.3&#x02009;&#x000B1;&#x02009;7.0</td>
<td align="center">22.0&#x02009;&#x000B1;&#x02009;11.6</td>
<td align="center">56.8&#x02009;&#x000B1;&#x02009;32.4</td>
</tr>
<tr>
<td align="left">BMI (kg/m<sup>2</sup>)</td>
<td align="center">29.6&#x02009;&#x000B1;&#x02009;7.9</td>
<td align="center">32.0&#x02009;&#x000B1;&#x02009;8.4</td>
<td align="center">31.9&#x02009;&#x000B1;&#x02009;7.9</td>
<td align="center">33.8&#x02009;&#x000B1;&#x02009;8.5</td>
<td align="center">36.9&#x02009;&#x000B1;&#x02009;11.2</td>
</tr>
<tr>
<td align="left">ESS</td>
<td align="center">11.0&#x02009;&#x000B1;&#x02009;5.6</td>
<td align="center">12.0&#x02009;&#x000B1;&#x02009;5.2</td>
<td align="center">12.2&#x02009;&#x000B1;&#x02009;4.7</td>
<td align="center">12.7&#x02009;&#x000B1;&#x02009;4.7</td>
<td align="center">14.1&#x02009;&#x000B1;&#x02009;5.3</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Neck, neck circumference; m, male; f, female</italic>.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S2-2">
<label>2.2</label> <title>Annotation and segmentation of data</title>
<p>Twenty-two subjects had specific events identified and labeled using the Visi-Download software; their demographics can be found in Table <xref ref-type="table" rid="T2">2</xref>. These subjects were chosen because they were the first subjects collected that met the selection criteria, i.e., they were diagnosed as normal, snorer, or mild/moderate/severe OSA, and their recordings were longer than 4&#x02009;h with all signals present. The labeling of events followed a protocol that involved dragging an event marker across the relevant section of data. The annotations were made by a clinical research fellow with 2&#x02009;years of experience in sleep medicine. Each event was labeled with one of three classes: F, first breath after apnea (or choke); S, snoring; and N, noise events. Only 22 records were annotated due to time and financial constraints (annotating a single record took at least 1&#x02009;h). A total of 175 choke/first breath (F) events, 201 snoring (S) events, and 190 noise (N) events were annotated. The entire event, regardless of duration, was annotated.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Demographics of annotated subjects (mean&#x02009;&#x000B1;&#x02009;&#x003C3;), m, male; f, female</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Parameter</th>
<th align="center">Subjects (mean&#x02009;&#x000B1;&#x02009;&#x003C3;)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Gender</td>
<td align="center">17 m, 5 f</td>
</tr>
<tr>
<td align="left">Age (years)</td>
<td align="center">48.9&#x02009;&#x000B1;&#x02009;15.3</td>
</tr>
<tr>
<td align="left">Neck (cm)</td>
<td align="center">45.7&#x02009;&#x000B1;&#x02009;3.8</td>
</tr>
<tr>
<td align="left">Height (cm)</td>
<td align="center">177.3&#x02009;&#x000B1;&#x02009;10.7</td>
</tr>
<tr>
<td align="left">Weight (kg)</td>
<td align="center">107.4&#x02009;&#x000B1;&#x02009;24.4</td>
</tr>
<tr>
<td align="left">AHI (events/h)</td>
<td align="center">32.4&#x02009;&#x000B1;&#x02009;31.6</td>
</tr>
<tr>
<td align="left">ODI (events/h)</td>
<td align="center">35.7&#x02009;&#x000B1;&#x02009;34.5</td>
</tr>
<tr>
<td align="left">BMI (kg/m<sup>2</sup>)</td>
<td align="center">34.3&#x02009;&#x000B1;&#x02009;8.9</td>
</tr>
<tr>
<td align="left">ESS</td>
<td align="center">11.7&#x02009;&#x000B1;&#x02009;5.3</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="S2-3">
<label>2.3</label> <title>Methods</title>
<p>The analysis is separated into two approaches. First, linear discriminant analysis (LDA) was used to build a classifier to differentiate between choke/first breath events (F) and snoring or noise events (S/N) using standard features taken from the literature [linear predictive coding (LPC) and cepstral coefficient analysis]. Detection of candidate events was performed by hand. These two feature sets were chosen as they are the standard state of the art and can be used as a baseline for comparison. Then, multiscale entropy (MSE), a novel approach for feature extraction (at least in audio analysis) that requires no segmentation of the data, was applied to the raw audio. Both LDA and a random forest (RF) were then trained to classify patients as either requiring treatment or not. Before describing the experimental set-up, a brief overview of the techniques is given.</p>
<sec id="S2-3-1">
<label>2.3.1</label> <title>Linear Predictive Coding</title>
<p>Linear predictive coding is a commonly used speech analysis technique as it provides an accurate representation of speech. Specification of the parameters of the linear predictor is a wide topic and a large number of approaches have been proposed. Of these, the autocorrelation method is the most common (Rabiner and Schafer, <xref ref-type="bibr" rid="B29">2007</xref>); it is used, for example, for speech coding in the GSM standard. Full details can be found in Chapter 6 of Rabiner and Schafer (<xref ref-type="bibr" rid="B29">2007</xref>).</p>
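<p>As an illustration of the autocorrelation method mentioned above, the following minimal sketch estimates the predictor coefficients of a single frame using the Levinson&#x02013;Durbin recursion. It is not the implementation used in this study: the function names, the absence of windowing and pre-emphasis, and the choice of predictor order are all assumptions made for the example.</p>
<preformat><![CDATA[
```python
def autocorrelation(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of one frame (no windowing applied)."""
    n = len(x)
    return [sum(x[t] * x[t + lag] for t in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc_autocorr(x, order):
    """Autocorrelation-method LPC via the Levinson-Durbin recursion.

    Returns (a, err), where a[k - 1] is the coefficient a_k of the predictor
    x[t] ~ sum_k a_k * x[t - k], and err is the residual prediction energy.
    Hypothetical helper written for illustration only.
    """
    r = autocorrelation(x, order)
    a = [0.0] * (order + 1)  # a[0] is unused so indices match the maths
    err = r[0]
    for i in range(1, order + 1):
        if err == 0.0:  # silent frame: nothing left to predict
            break
        # Reflection coefficient for model order i.
        k_i = (r[i] - sum(a[k] * r[i - k] for k in range(1, i))) / err
        updated = a[:]
        updated[i] = k_i
        for k in range(1, i):
            updated[k] = a[k] - k_i * a[i - k]
        a = updated
        err *= 1.0 - k_i * k_i
    return a[1:], err


if __name__ == "__main__":
    # A decaying exponential is (almost) perfectly predicted by one coefficient.
    frame = [0.9 ** t for t in range(200)]
    print(lpc_autocorr(frame, 2))
```
]]></preformat>
<p>In practice, each frame would normally be windowed before the autocorrelation is computed, and the resulting coefficients (or quantities derived from them) would form the feature vector for an event.</p>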
</sec>
<sec id="S2-3-2">
<label>2.3.2</label> <title>Cepstral Analysis</title>
<p>Bogert et al. (<xref ref-type="bibr" rid="B3">1963</xref>) defined the <italic>cepstrum</italic> as the inverse Fourier transform (IFT) of the log magnitude spectrum of a signal. Cepstral analysis makes use of the source-filter model of speech production and is widely used in speech processing, particularly for pitch estimation (Rabiner and Schafer, <xref ref-type="bibr" rid="B29">2007</xref>). Several variants exist: the complex cepstrum, the real cepstrum, the power cepstrum, and the phase cepstrum. The power cepstrum is often used in the analysis of human speech. Weighted cepstrum distance measures have a direct equivalent interpretation in terms of distance in the frequency domain. This is important in models for human perception of sound, which are based on frequency analysis carried out in the inner ear (Rabiner and Schafer, <xref ref-type="bibr" rid="B29">2007</xref>). Davis and Mermelstein (<xref ref-type="bibr" rid="B9">1980</xref>) used this fact as the basis of the MFCCs. The idea behind MFCCs is to compute a frequency analysis based on a filter bank with approximately critical band spacing of the filters and bandwidths. The frequency bands are equally spaced on the mel scale, which approximates the human auditory system&#x02019;s response better than the linearly spaced bands of the normal cepstrum. See Chapter 5 of Rabiner and Schafer (<xref ref-type="bibr" rid="B29">2007</xref>) for more details.</p>
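<p>The definition of the cepstrum given above can be sketched in a few lines. The example below computes only the plain real cepstrum of one frame (it does not include the mel filter bank needed for MFCCs) and uses a naive discrete Fourier transform so that it has no dependencies. The function names and the small floor applied before taking the logarithm are assumptions made for illustration.</p>
<preformat><![CDATA[
```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def real_cepstrum(frame, floor=1e-12):
    """Inverse DFT of the log magnitude spectrum of one frame.

    The floor (an assumption of this sketch) avoids log(0) for empty bins.
    """
    spectrum = dft(frame)
    log_magnitude = [math.log(max(abs(s), floor)) for s in spectrum]
    return [c.real for c in dft(log_magnitude, inverse=True)]


if __name__ == "__main__":
    # An impulse followed by a half-amplitude echo 8 samples later:
    # the echo delay appears as a peak in the cepstrum at quefrency 8.
    frame = [0.0] * 64
    frame[0], frame[8] = 1.0, 0.5
    cepstrum = real_cepstrum(frame)
    print(max(range(1, 33), key=lambda q: cepstrum[q]))
```
]]></preformat>
<p>The echo example shows why the cepstrum is useful for periodicity and pitch estimation: a repetition in the time domain becomes a ripple in the log spectrum, which the inverse transform turns into a peak at the corresponding delay.</p>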
</sec>
<sec id="S2-3-3">
<label>2.3.3</label> <title>Multiscale Entropy</title>
<p>Multiscale entropy is a method of measuring the complexity of a finite length time series (Goldberger et al., <xref ref-type="bibr" rid="B12">2000</xref>; Costa et al., <xref ref-type="bibr" rid="B8">2003</xref>; Vrhovec, <xref ref-type="bibr" rid="B37">2009</xref>). MSE has been applied to heart rate and movement data, which have similar issues to the audio signal, such as non-stationarity. Costa et al. (<xref ref-type="bibr" rid="B7">2002</xref>) noted that traditional algorithms indicated that certain pathological processes had a higher complexity than healthy dynamics with long-range correlations. The authors suggested that this paradox arose because conventional algorithms fail to account for the multiple time scales inherent in healthy physiological dynamics. On the basis of this hypothesis, MSE was developed and was found to robustly separate healthy and pathological groups. Our motivation for using the technique here is that it may capture the low entropy at short time scales (a few minutes) and the fractured sleep (and hence higher entropy at longer time scales, of the order of hours) inherent in OSA.</p>
<p>MSE is calculated as follows: given an <italic>N</italic>-point time series {<italic>x</italic><sub>1</sub>, &#x02026;, <italic>x<sub>i</sub></italic>, &#x02026;, <italic>x<sub>N</sub></italic>}, a consecutive coarse-grained time series can be constructed by averaging a successively increasing number of data points in non-overlapping windows.</p>
<p>Each element of the coarse-grained time series, <italic>y<sub>j</sub></italic><sup>(<italic>&#x003C4;</italic>)</sup>, is calculated according to the equation: <inline-formula><mml:math id="M1"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>&#x003C4;</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>&#x0003D;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>&#x003C4;</mml:mi></mml:mfrac><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy='false'>)</mml:mo><mml:mi>&#x003C4;</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>&#x003C4;</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula> where <italic>&#x003C4;</italic> represents the scale factor and {1&#x02009;&#x02264;&#x02009;<italic>j</italic>&#x02009;&#x02264;&#x02009;<italic>N/&#x003C4;</italic>}. The length of each coarse-grained time series is <italic>N/&#x003C4;</italic>. For scale <italic>&#x003C4;</italic>&#x02009;&#x0003D;&#x02009;1, the coarse-grained time series is simply the original time series. The sample entropy (&#x0210B;<sub>s</sub>) is then calculated for each of the time series and can be plotted as a function of the scale factor. &#x0210B;<sub>s</sub> quantifies the regularity of a time series and is the negative natural logarithm of the probability that two sequences similar for <italic>m</italic> points remain similar at the next point, where self-matches are not included. 
Given <italic>N</italic> data points from a time series <italic>x</italic>(<italic>n</italic>)&#x02009;&#x0003D;&#x02009;<italic>x</italic>(1), <italic>x</italic>(2),&#x02026;, <italic>x</italic>(<italic>N</italic>), the algorithm forms <italic>N</italic>&#x02009;&#x02212;&#x02009;<italic>m</italic>&#x02009;&#x0002B;&#x02009;1 vectors <italic>X</italic>(1),&#x02026;, <italic>X</italic>(<italic>N</italic>&#x02009;&#x02212;&#x02009;<italic>m</italic>&#x02009;&#x0002B;&#x02009;1) defined by <italic>X</italic>(<italic>i</italic>)&#x02009;&#x0003D;&#x02009;[<italic>x</italic>(<italic>i</italic>), <italic>x</italic>(<italic>i</italic>&#x02009;&#x0002B;&#x02009;1),&#x02026;, <italic>x</italic>(<italic>i</italic>&#x02009;&#x0002B;&#x02009;<italic>m</italic>&#x02009;&#x02212;&#x02009;1)], for {1&#x02009;&#x02264;&#x02009;<italic>i</italic>&#x02009;&#x02264;&#x02009;<italic>N</italic>&#x02009;&#x02212;&#x02009;<italic>m</italic>&#x02009;&#x0002B;&#x02009;1}. The vectors, <italic>X</italic>, represent <italic>m</italic> consecutive values of the signal, commencing with the <italic>i</italic>th point. 
The distance between <italic>X</italic>(<italic>i</italic>) and <italic>X</italic>(<italic>&#x02009;j</italic>), <italic>d</italic>&#x02009;&#x0003D;&#x02009;[<italic>X</italic>(<italic>i</italic>), <italic>X</italic>(<italic>&#x02009;j</italic>)], is then calculated as the maximum absolute difference between their respective scalar components as follows: <italic>d</italic>[<italic>X</italic>(<italic>i</italic>), <italic>X</italic>(<italic>j</italic>)]&#x02009;&#x0003D;&#x02009;<italic>max<sub>k</sub></italic><sub>&#x0003D; 1,2,&#x02026;,</sub> <italic><sub>m</sub></italic>(&#x0007C;<italic>x</italic>(<italic>i</italic>&#x02009;&#x0002B;&#x02009;<italic>k</italic>)&#x02009;&#x02212;&#x02009;<italic>x</italic>(<italic>j</italic>&#x02009;&#x0002B;&#x02009;<italic>k</italic>)&#x0007C;) For a given <italic>X</italic>(<italic>i</italic>), the number of <italic>j</italic>&#x02019;s {1&#x02009;&#x02264;&#x02009;<italic>j</italic>&#x02009;&#x02264;&#x02009;<italic>N</italic>&#x02009;&#x02212;&#x02009;<italic>m, i</italic> &#x02260;<italic>j</italic>} are counted, such that the distance between <italic>X</italic>(<italic>i</italic>) and <italic>X</italic>(<italic>j</italic>) is less than or equal to <italic>r</italic> SDs and the following function is calculated: <inline-formula><mml:math id="M2"><mml:mrow><mml:msubsup><mml:mi>B</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0003D;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>m</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac><mml:mstyle 
displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02260;</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:mo>&#x00398;</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:mi>r</mml:mi><mml:mo>.</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>[</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>,</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>j</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>]</mml:mo><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula> where &#x00398; is the Heaviside function (&#x00398; (<italic>z</italic>&#x02009;&#x02265;&#x02009;0)&#x02009;&#x0003D;&#x02009;1) and (&#x00398; (<italic>z</italic>&#x02009;&#x02264;&#x02009;0)&#x02009;&#x0003D;&#x02009;1), &#x003C3; is the standard deviation of the signal <italic>x</italic>(<italic>n</italic>) and <italic>r</italic> is a tolerance window. 
<inline-formula><mml:math id="M3"><mml:mrow><mml:msubsup><mml:mi>B</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is calculated as follows: <inline-formula><mml:math id="M4"><mml:mrow><mml:msubsup><mml:mi>B</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo>&#x0003D;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:mfrac><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:msubsup><mml:mi>B</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula>. The dimension is then increased to <italic>m</italic>&#x02009;&#x0002B;&#x02009;1 and <inline-formula><mml:math id="M5"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:math></inline-formula> is calculated as follows: <inline-formula><mml:math id="M6"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0003D;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>m</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfrac><mml:mstyle 
displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02260;</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:mtext>&#x00398;</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:mi>r</mml:mi><mml:mo>.</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>d</mml:mi><mml:mo stretchy='false'>[</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>,</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>j</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>]</mml:mo><mml:mo stretchy='false'>)</mml:mo><mml:mspace width="0.5em" class="nbsp"/><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula> is then given by: <inline-formula><mml:math id="M7"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo>&#x0003D;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:mfrac><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x0003D;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></inline-formula> and the sample entropy is given by the negative logarithm of the ratio of <inline-formula><mml:math 
id="M8"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mspace width="0.2em" class="nbsp"/><mml:mtext>to</mml:mtext><mml:mspace width="0.2em" class="nbsp"/><mml:msubsup><mml:mi>B</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup><mml:mo>:</mml:mo><mml:msub><mml:mi>&#x0210B;</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo stretchy='false'>(</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>N</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0003D;</mml:mo><mml:mo>&#x02212;</mml:mo><mml:mi mathvariant="italic">ln</mml:mi><mml:mspace width="0.2em" class="nbsp"/><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi>B</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. From these equations, it is clear that &#x0210B;<sub>s</sub>, and hence MSE, is a function of three parameters: <italic>m</italic>, <italic>r</italic> and <italic>N</italic>.</p>
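<p>As an illustration only, the sample entropy calculation described above can be sketched in Python. This is a minimal sketch and not the implementation used in this study: the function name <monospace>sample_entropy</monospace> is hypothetical, the distance <italic>d</italic> is assumed to be the maximum absolute difference between template components, and the population standard deviation is assumed for the tolerance. Because the same normalizing factors appear in both template counts, the sketch takes the ratio of raw match counts.</p>
<preformat><![CDATA[
```python
import math
import statistics


def sample_entropy(x, m=2, r=0.2):
    """Sample entropy of sequence x for template length m and tolerance r * std(x)."""
    n = len(x)
    tol = r * statistics.pstdev(x)  # tolerance r * sigma

    def count_matches(dim):
        # The same N - m starting points are used for both template lengths,
        # so the normalising factors cancel in the ratio A / B.
        templates = [x[i:i + dim] for i in range(n - m)]
        count = 0
        for i, ti in enumerate(templates):
            for j, tj in enumerate(templates):
                if i == j:
                    continue  # self-matches are excluded (j != i)
                # Assumed distance: maximum absolute component difference.
                if max(abs(a - b) for a, b in zip(ti, tj)) <= tol:
                    count += 1
        return count

    b = count_matches(m)      # matches between templates of length m
    a = count_matches(m + 1)  # matches between templates of length m + 1
    if a == 0 or b == 0:
        return float("inf")   # ratio undefined; reported here as infinity
    return -math.log(a / b)
```
]]></preformat>
<p>A perfectly regular sequence such as an alternating 0, 1 pattern gives an entropy of zero under this sketch, whereas an irregular sequence gives a larger value. The double loop is quadratic in the series length and is intended for clarity rather than speed.</p>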
</sec>
<sec id="S2-3-4">
<label>2.3.4</label> <title>Classification</title>
<p>Two classifiers were compared for the estimation of OSA severity: a simple linear classifier and a RF. These methods were chosen as standard linear and non-linear benchmark classifiers. Since there were fewer than 1000 labeled events for classifying choke/first breath vs. snore/noise, only a linear approach was used on those data.</p>
<sec id="S2-3-4-1">
<label>2.3.4.1</label> <title>Linear discriminant analysis</title>
<p>Linear discriminant analysis is a well-known method for drawing a linear boundary between the values of a feature set, and has been used in a variety of applications such as image retrieval and face recognition. Classical LDA projects the data onto a lower-dimensional vector space such that the ratio of the between-class distance to the within-class distance is maximized, thus achieving maximum discrimination. The optimal projection can be computed by applying the eigen-decomposition on the covariance matrices (Ye et al., <xref ref-type="bibr" rid="B38">2004</xref>).</p>
</sec>
<sec id="S2-3-4-2">
<label>2.3.4.2</label> <title>Random forest classification</title>
<p>Random forests are a type of ensemble classifier based on decision trees (Breiman, <xref ref-type="bibr" rid="B4">2001</xref>). Decision trees form a predictive model which uses a set of binary rules to calculate a target value. Training data are passed to the decision tree, which builds a model determining which variable to split on at a given node, what the value of the split is, whether to stop or to split again and when to assign a terminal node to a class. When a large number of trees have been generated, they vote for the most popular class. For the <italic>k<sup>th</sup></italic> tree, a random vector &#x00398;<italic><sub>k</sub></italic> is generated, independent of the past random vectors &#x00398;<sub>1</sub>,&#x02026;,&#x00398;<italic><sub>k</sub></italic><sub>&#x02013;1</sub> but with the same distribution. A tree is grown using the training set and &#x00398;<italic><sub>k</sub></italic>, resulting in a classifier <italic>h</italic>(<bold>x</bold>, &#x00398;<italic><sub>k</sub></italic>) where <bold>x</bold> is an input vector. A RF is a classifier consisting of a collection of tree-structured classifiers <italic>h</italic>(<bold>x</bold>, &#x00398;<italic><sub>k</sub></italic>), <italic>k</italic>&#x02009;&#x0003D;&#x02009;1,&#x02026; where the &#x00398;<italic><sub>k</sub></italic> are independent identically distributed random vectors and each tree casts a unit vote for the most popular class at input <bold>x</bold>.</p>
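<p>The voting step in this definition can be illustrated with a short Python sketch. It assumes that each tree is available as a callable that maps an input vector to a class label; the three single-split trees below are hypothetical stand-ins for <italic>h</italic>(<bold>x</bold>, &#x00398;<italic><sub>k</sub></italic>) and are not trained on any data.</p>
<preformat><![CDATA[
```python
from collections import Counter


def forest_vote(trees, x):
    """Each tree casts one vote; the most common class label wins."""
    votes = Counter(tree(x) for tree in trees)
    return votes.most_common(1)[0][0]


# Three hypothetical single-split trees standing in for h(x, Theta_k).
example_trees = [
    lambda x: int(x[0] > 1.0),
    lambda x: int(x[1] > 0.0),
    lambda x: int(x[0] + x[1] > 5.0),
]
```
]]></preformat>
<p>For the input (2.0, 1.0), two of the three example trees vote for class 1, so the sketch returns class 1. Ties are broken here by the order in which labels are first seen, which is an arbitrary choice.</p>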
<p>The RF used in this work is a tree-based classifier developed by Johnson et al. (<xref ref-type="bibr" rid="B18">2012</xref>) using a Bayesian framework. The algorithm has many advantages, including high overall performance and automatic handling of missing data, outliers, and normalization. Each tree selects a subset of observations via two regression splits. These observations are then given a contribution, equal to a random constant times the observation&#x02019;s value for a chosen feature plus a random intercept. The tree also assigns a contribution to missing values for this chosen feature based upon a scaled surrogate. The contributions across all trees are summed to provide the contribution for a single &#x0201C;forest,&#x0201D; where a &#x0201C;forest&#x0201D; refers to a group of trees plus an intercept term. The predicted probability output by the forest is the inverse logit of the sum of each tree&#x02019;s contribution plus the intercept term. The intercept term is set to the logit of the mean observed outcome.</p>
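<p>The way a forest turns tree contributions into a probability can be sketched as follows. This is a simplified illustration under stated assumptions, not the published algorithm: the dictionary layout of a tree and the names <monospace>tree_contribution</monospace> and <monospace>forest_probability</monospace> are hypothetical, an observation is assumed to be selected when it lies at or above both split values, and the handling of missing values is omitted.</p>
<preformat><![CDATA[
```python
import math


def logit(p):
    return math.log(p / (1.0 - p))


def inv_logit(z):
    return 1.0 / (1.0 + math.exp(-z))


def tree_contribution(x, tree):
    """Contribution of one tree to observation x (0.0 if x is not selected)."""
    (f1, t1), (f2, t2) = tree["splits"]
    # Assumption: the subset is defined by x being at or above both split values.
    if x[f1] >= t1 and x[f2] >= t2:
        return tree["slope"] * x[tree["feature"]] + tree["offset"]
    return 0.0


def forest_probability(x, trees, mean_outcome):
    """Inverse logit of the intercept plus the summed tree contributions."""
    intercept = logit(mean_outcome)  # logit of the mean observed outcome
    total = intercept + sum(tree_contribution(x, t) for t in trees)
    return inv_logit(total)
```
]]></preformat>
<p>With no trees, the sketch returns the mean observed outcome itself, which matches the role of the intercept term as the starting point of the model.</p>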
<p>The core of the model is the custom Markov chain Monte Carlo (MCMC) sampler which iteratively optimizes the forest. This sampling process has a user-defined number of iterations and a user-defined number of resets (each reset involves reinitializing the forest and restarting the iterative process). After mapping the training data onto the quantiles of a normal distribution, the forest is initialized to a null model, with no contributions assigned for any observations.</p>
<p>At each iteration, the algorithm selects two trees in the forest and randomizes their structure. That is, it randomly reselects the first two features which the tree uses for splitting, the value at which the tree splits those features, the third feature used for contribution calculation, and the multiplicative and additive constants applied to the third feature. The total forest contribution is then recalculated and a Metropolis-Hastings acceptance step is used to determine if the update is accepted. The Metropolis-Hastings algorithm is an MCMC method<xref ref-type="fn" rid="fn2"><sup>2</sup></xref> for obtaining a sequence of random samples from a probability distribution for which direct sampling is difficult (Metropolis et al., <xref ref-type="bibr" rid="B24">1953</xref>; Hastings, <xref ref-type="bibr" rid="B14">1970</xref>). If the update is accepted, the two randomized trees are kept in the forest; otherwise, the changes are discarded and the forest remains unchanged. The first 20% of the total number of iterations, known as the burn-in period, allow the forest to learn the target distribution. After the burn-in period, the algorithm begins storing forests at a fixed interval, i.e., once every set number of iterations. Once the user-defined number of iterations is reached, the forest is reinitialized as before and the iterative process restarts. Again, after the set burn-in period, the forests begin to be saved at a fixed interval. The final result of this algorithm is a set of forests, each of which will contribute to the final model prediction.</p>
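<p>The acceptance step, burn-in period, and fixed-interval storage can be illustrated with a generic Metropolis-Hastings loop. The sketch below is not the custom sampler described here: it samples a single number rather than a forest, it assumes a symmetric proposal, and the function name and the <monospace>keep_every</monospace> parameter are hypothetical. Only the 20% burn-in fraction is taken from the text.</p>
<preformat><![CDATA[
```python
import math
import random


def metropolis_hastings(log_target, propose, init, n_iter,
                        burn_in_frac=0.2, keep_every=10, seed=0):
    """Return the states stored after burn-in, one every keep_every iterations."""
    rng = random.Random(seed)
    state, state_logp = init, log_target(init)
    burn_in = int(burn_in_frac * n_iter)
    kept = []
    for it in range(n_iter):
        candidate = propose(state, rng)
        cand_logp = log_target(candidate)
        # Symmetric proposal assumed: accept with probability min(1, p_new / p_old).
        accept_prob = math.exp(min(0.0, cand_logp - state_logp))
        if rng.random() < accept_prob:
            state, state_logp = candidate, cand_logp  # update accepted
        # Otherwise the candidate is discarded and the state is unchanged.
        if it >= burn_in and (it - burn_in) % keep_every == 0:
            kept.append(state)
    return kept
```
]]></preformat>
<p>For example, calling the function with a standard normal log-density, <monospace>lambda z: -0.5 * z * z</monospace>, and a random-walk proposal yields stored samples whose mean is close to zero. In the forest setting, the state would be the whole forest and the proposal would be the randomization of two trees.</p>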
</sec>
</sec>
</sec>
<sec id="S2-4">
<label>2.4</label> <title>Analysis protocol</title>
<sec id="S2-4-5">
<label>2.4.1</label> <title>Event classification</title>
<p>For LPC and MFCC analysis, an assumption was made that an event detector would be used to find the sections of interest, or that the entire night would be analyzed on an approximately second-to-second basis. Therefore, only a specified amount of time for each event was analyzed, i.e., the first 0.5, 1, 2, or 3&#x02009;s of an event. These window sizes are sufficient as only snores/chokes/noise events are being analyzed, and not apneas which, by definition, last &#x0003E;10&#x02009;s. If an event was shorter than the specified window size, it was not included in the analysis. This meant that as the window size increased, fewer events were analyzed; Table <xref ref-type="table" rid="T3">3</xref> shows the number of each event type at the different window sizes.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>The number of each event type at the four different window sizes used</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Window</th>
<th align="center">0.5&#x02009;s</th>
<th align="center">1&#x02009;s</th>
<th align="center">2&#x02009;s</th>
<th align="center">3&#x02009;s</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">F</td>
<td align="center">175</td>
<td align="center">175</td>
<td align="center">155</td>
<td align="center">82</td>
</tr>
<tr>
<td align="left">S</td>
<td align="center">201</td>
<td align="center">201</td>
<td align="center">201</td>
<td align="center">159</td>
</tr>
<tr>
<td align="left">N</td>
<td align="center">190</td>
<td align="center">189</td>
<td align="center">185</td>
<td align="center">167</td>
</tr>
</tbody>
</table>
</table-wrap>
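<p>The windowing rule that produces the counts in Table 3 can be sketched in a few lines of Python. The function name <monospace>first_window</monospace> is hypothetical, and each event is assumed to be a sequence of audio samples recorded at sampling frequency <monospace>fs</monospace>.</p>
<preformat><![CDATA[
```python
def first_window(events, window_s, fs):
    """Keep the first window_s seconds of each event; drop events that are too short."""
    n = int(round(window_s * fs))  # window length in samples
    return [event[:n] for event in events if len(event) >= n]
```
]]></preformat>
<p>Because shorter events are dropped rather than padded, the number of retained events can only stay the same or fall as the window grows.</p>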
<sec id="S2-4-5-3">
<label>2.4.1.1</label> <title>LPC</title>
<p>From speech analysis, a general rule of thumb is that, for voiced sounds, two coefficients provide information about each formant frequency. It has been suggested that voiced sounds are identifiable from the first two or three formants (Rabiner and Schafer, <xref ref-type="bibr" rid="B29">2007</xref>). Using a filter order of 8 ensures that the first three formants can be estimated, which is useful in identifying sections of speech in the audio signal. The work of Ng et al. (<xref ref-type="bibr" rid="B27">2008a</xref>) looked at the first three formants, and achieved promising results in distinguishing between apneic and non-apneic snoring. Figure <xref ref-type="fig" rid="F1">1</xref> shows the pole-zero plots for a choke and a snoring event. There are clear differences between the events, indicating that LPC could be used to distinguish between them.</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption><p><bold>Pole-zero plots for a choke event and a snoring event</bold>. There are clear differences in the locations of the poles between the two event types, indicating that it might be possible to distinguish between the two. <bold>(A)</bold> Pole-zero plot for a choke event, where the poles are indicated by the blue crosses. <bold>(B)</bold> Pole-zero plot for a snoring event, where the poles are indicated by the blue crosses.</p></caption>
<graphic xlink:href="fbioe-03-00114-g001.tif"/>
</fig>
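<p>For readers unfamiliar with LPC, the coefficients of an all-pole filter of a given order can be estimated with the autocorrelation method and the Levinson-Durbin recursion, sketched below. This is a generic textbook formulation and may differ from the routine used in this study; the function name <monospace>lpc</monospace> is hypothetical. An order of 8 yields eight coefficients, i.e., up to four complex-conjugate pole pairs, in line with the rule of thumb of two coefficients per formant.</p>
<preformat><![CDATA[
```python
def lpc(x, order):
    """Return ([1, a1, ..., a_order], prediction error) via Levinson-Durbin."""
    n = len(x)
    # Autocorrelation at lags 0..order.
    r = [sum(x[i] * x[i + lag] for i in range(n - lag)) for lag in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]  # a zero error (perfectly predictable signal) is not handled here
    for k in range(1, order + 1):
        acc = r[k] + sum(a[j] * r[k - j] for j in range(1, k))
        refl = -acc / err  # reflection coefficient for this order
        updated = a[:]
        for j in range(1, k):
            updated[j] = a[j] + refl * a[k - j]
        updated[k] = refl
        a = updated
        err *= 1.0 - refl * refl
    return a, err
```
]]></preformat>
<p>The returned coefficients define the polynomial whose roots are the poles shown in a pole-zero plot. Root finding is left out of the sketch because it is not available in the Python standard library.</p>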
</sec>
<sec id="S2-4-5-4">
<label>2.4.1.2</label> <title>MFCCs</title>
<p>The sampling frequency of the audio data was 4&#x02009;kHz. Each audio window of interest was detrended and then multiplied by a Hamming window of the same length. A filterbank with 24 filters was used and the entire length of each event was taken to be a single frame, resulting in 12 MFCCs per event.</p>
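<p>The MFCC steps listed above (detrending, Hamming window, 24-filter mel filterbank, and 12 coefficients from a single frame) can be sketched as follows. Several details are assumptions rather than statements about this study: the mel scale formula, triangular filters spanning 0 to half the sampling frequency, the use of the power spectrum, the small constant added before the logarithm, and the choice to return cosine-transform coefficients 1 to 12 (omitting the zeroth). All function names are hypothetical.</p>
<preformat><![CDATA[
```python
import cmath
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def detrend(x):
    """Remove the least-squares straight line from x."""
    n = len(x)
    t_mean, x_mean = (n - 1) / 2.0, sum(x) / n
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    slope = sum((t - t_mean) * (v - x_mean) for t, v in enumerate(x)) / sxx
    return [v - (x_mean + slope * (t - t_mean)) for t, v in enumerate(x)]


def power_spectrum(x):
    """One-sided power spectrum from a direct (slow) discrete Fourier transform."""
    n = len(x)
    return [abs(sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))) ** 2
            for k in range(n // 2 + 1)]


def mfcc(x, fs=4000.0, n_filters=24, n_coeffs=12):
    """MFCCs of one frame covering the whole event."""
    n = len(x)
    x = detrend(x)
    x = [v * (0.54 - 0.46 * math.cos(2.0 * math.pi * t / (n - 1))) for t, v in enumerate(x)]
    spec = power_spectrum(x)
    freqs = [k * fs / n for k in range(len(spec))]
    # Filter edges equally spaced on the mel scale between 0 and fs / 2.
    top = hz_to_mel(fs / 2.0)
    edges = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    log_energies = []
    for i in range(1, n_filters + 1):
        lo, mid, hi = edges[i - 1], edges[i], edges[i + 1]
        energy = 0.0
        for f, p in zip(freqs, spec):
            if lo < f <= mid:
                energy += p * (f - lo) / (mid - lo)   # rising edge of the triangle
            elif mid < f < hi:
                energy += p * (hi - f) / (hi - mid)   # falling edge of the triangle
        log_energies.append(math.log(energy + 1e-12))  # small floor avoids log(0)
    # Discrete cosine transform (type II), skipping the zeroth coefficient.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_energies))
            for c in range(1, n_coeffs + 1)]
```
]]></preformat>
<p>The direct Fourier transform keeps the sketch dependency-free but scales quadratically with the window length; a fast Fourier transform would normally be used for 2 or 3&#x02009;s windows.</p>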
</sec>
<sec id="S2-4-5-5">
<label>2.4.1.3</label> <title>Classification</title>
<p>Five-fold cross-validation was performed. Different combinations of features (LPC, MFCCs, and demographics) were used to fix a boundary between the two classes (F vs. S and N, i.e., identifying apneic sounds from non-apneic sounds). The folds were stratified by subject: dividing the folds by event would mean that events from a single subject could appear in both the training and test sets, which could bias the results. For LDA, the discriminant function fitted a multivariate normal density to each group, with pooled estimates of a diagonal covariance matrix (essentially a naive Bayes classifier).</p>
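<p>Two ideas in this paragraph, subject-wise folds and a discriminant with a pooled diagonal covariance, can be sketched in Python. The sketch is illustrative only: the function names are hypothetical, subjects are assigned to folds in a simple round-robin order rather than at random, and class priors are assumed to be the observed class proportions.</p>
<preformat><![CDATA[
```python
import math


def subject_folds(subject_ids, n_folds=5):
    """Give each event a fold index so that one subject never spans two folds."""
    subjects = sorted(set(subject_ids))
    fold_of = {s: i % n_folds for i, s in enumerate(subjects)}  # round-robin
    return [fold_of[s] for s in subject_ids]


def fit_diag_lda(features, labels):
    """Class means, class priors, and one pooled variance per feature."""
    classes = sorted(set(labels))
    n, d = len(features), len(features[0])
    means, priors = {}, {}
    for c in classes:
        rows = [x for x, y in zip(features, labels) if y == c]
        means[c] = [sum(col) / len(rows) for col in zip(*rows)]
        priors[c] = len(rows) / n
    pooled = [0.0] * d
    for x, y in zip(features, labels):
        for k in range(d):
            pooled[k] += (x[k] - means[y][k]) ** 2
    pooled = [v / (n - len(classes)) for v in pooled]  # pooled within-class variance
    return classes, means, priors, pooled


def predict_diag_lda(model, x):
    """Return the class with the largest Gaussian log-posterior (up to a constant)."""
    classes, means, priors, pooled = model

    def score(c):
        dist = sum((xi - mu) ** 2 / v for xi, mu, v in zip(x, means[c], pooled))
        return math.log(priors[c]) - 0.5 * dist

    return max(classes, key=score)
```
]]></preformat>
<p>Because the covariance is diagonal and shared between classes, each feature contributes independently to the score, which is why the classifier behaves like a naive Bayes model with a linear decision boundary.</p>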
</sec>
</sec>
<sec id="S2-4-6">
<label>2.4.2</label> <title>OSA severity classification</title>
<sec id="S2-4-6-6">
<label>2.4.2.1</label> <title>Feature extraction</title>
<p>Only 240&#x02009;min of the audio data were analyzed, beginning 30&#x02009;min into the recording (to remove wakefulness and light sleep at the start of the recording) and ending at 4.5&#x02009;h. This maximized the number of subjects that could be used, while providing a significant number of events for any given subject. The 4&#x02009;h window was also recommended by a clinical expert (personal communication; 2010 conversation between Dr. John Stradling and Aoife Roebuck), as OSA will almost certainly manifest within this period if the subject has OSA. The data were preprocessed by taking the variance every 0.5, 1, or 2&#x02009;s and then the natural logarithm of that time series was taken. This process exaggerated the peaks in the signal, but suppressed large spikes which could overwhelm any metric. Nine MSE coefficients were calculated per subject (<italic>&#x003C4;</italic>&#x02009;&#x0003D;&#x02009;1, 2, 4, 8, 16, 32, 65, 130, 180) for <italic>m</italic>&#x02009;&#x0003D;&#x02009;1: 1: 8 and <italic>r</italic>&#x02009;&#x0003D;&#x02009;0.1: 0.05: 0.25. The scales chosen attempted to capture the time scales that occur during repeated apneas at both short and long time scales and were fixed per earlier studies (Roebuck and Clifford, <xref ref-type="bibr" rid="B31">2012</xref>; Roebuck, <xref ref-type="bibr" rid="B30">2014</xref>). The values used for <italic>m</italic> and <italic>r</italic> are based on reasonable ranges for physiological data taken from Costa et al. (<xref ref-type="bibr" rid="B8">2003</xref>).</p>
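<p>The preprocessing and the coarse-graining that underlies each MSE scale can be sketched as follows. The sketch assumes non-overlapping variance windows, the population variance, a small floor to avoid taking the logarithm of zero, and the usual MSE coarse-graining by averaging non-overlapping blocks of &#x003C4; samples; the function names are hypothetical.</p>
<preformat><![CDATA[
```python
import math
import statistics


def log_variance_series(audio, fs, window_s):
    """Natural log of the variance in consecutive, non-overlapping windows."""
    n = int(round(window_s * fs))
    out = []
    for start in range(0, len(audio) - n + 1, n):
        var = statistics.pvariance(audio[start:start + n])
        out.append(math.log(max(var, 1e-12)))  # assumed floor for silent windows
    return out


def coarse_grain(series, tau):
    """Average non-overlapping blocks of tau samples (scale tau of MSE)."""
    n_blocks = len(series) // tau
    return [sum(series[k * tau:(k + 1) * tau]) / tau for k in range(n_blocks)]
```
]]></preformat>
<p>Sample entropy would then be computed on <monospace>coarse_grain(series, tau)</monospace> for each of the nine scales. With eight values of <italic>m</italic> and four values of <italic>r</italic>, this gives 32 parameter settings per preprocessing window.</p>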
</sec>
<sec id="S2-4-6-7">
<label>2.4.2.2</label> <title>Classifier training and testing</title>
<p>Again, five-fold cross-validation was carried out on the data. Each time, one fold was held out as the test set, while the other four folds were used as the training data set.</p>
<p>The training set was further divided (five times) into training and validation data sets (in the ratio 70:30) in order to find the optimal MSE downsampling rate (<italic>dsr</italic>), <italic>m</italic> value and <italic>r</italic> value. This was performed by a grid search over every possible combination of <italic>dsr</italic>, <italic>m</italic>, <italic>r</italic> and noting the classification accuracy for LDA and the RF separately. The best overall combination was taken to be the one that was chosen most often in the five iterations. Once this combination had been found, the classifier was trained using the entire training set, and tested on the unused test data, for two different feature sets: MSE alone, and MSE plus demographics.</p>
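<p>The selection procedure in this paragraph can be sketched as a repeated grid search with a majority vote. The sketch makes several assumptions: the <italic>dsr</italic> values are taken to be 0.5, 1, and 2, the splits are random, and <monospace>score</monospace> is a hypothetical user-supplied function that trains on one subset and returns the validation accuracy for a given parameter combination.</p>
<preformat><![CDATA[
```python
import itertools
import random
from collections import Counter


def parameter_grid():
    """Every (dsr, m, r) combination; the dsr values are assumed."""
    dsr_values = [0.5, 1, 2]
    m_values = range(1, 9)             # m = 1:1:8
    r_values = [0.1, 0.15, 0.2, 0.25]  # r = 0.1:0.05:0.25
    return list(itertools.product(dsr_values, m_values, r_values))


def select_by_vote(subjects, grid, score, n_repeats=5, train_frac=0.7, seed=0):
    """Repeat a 70:30 split, pick the best grid point each time, return the mode."""
    rng = random.Random(seed)
    winners = []
    for _ in range(n_repeats):
        shuffled = list(subjects)
        rng.shuffle(shuffled)
        cut = int(round(train_frac * len(shuffled)))
        train, validation = shuffled[:cut], shuffled[cut:]
        winners.append(max(grid, key=lambda p: score(p, train, validation)))
    # The combination chosen most often wins; ties go to the earliest winner.
    return Counter(winners).most_common(1)[0][0]
```
]]></preformat>
<p>Under these assumptions the grid contains 96 combinations. The selected combination would then be used to retrain the classifier on the full training set before evaluation on the held-out fold.</p>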
<p>For the RF, 500 trees were used with each tree split on three variables/features. The process was repeated twice with a new seed for 2&#x02009;&#x000D7;&#x02009;10<sup>6</sup> iterations.</p>
</sec>
</sec>
</sec>
</sec>
<sec id="S3">
<label>3</label> <title>Results</title>
<p>As a comparison, Table <xref ref-type="table" rid="T4">4</xref> shows the performance when using the common clinical thresholds on a variety of features. It is worth noting that both the AHI and ODI are associated with multiple thresholds used for classifying subjects into different categories. A threshold of 5 separates normal subjects and snorers (below the threshold) from subjects with mild, moderate, or severe OSA; a threshold of 10 or 15 separates normal/snorer/mild from moderate/severe; and a threshold of 20 or 30 separates normal/snorer/mild/moderate from severe.</p>
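<p>The statistics reported for a single threshold (Se, Sp, PPV, NPV, and Ac) follow from the four confusion-matrix counts, as the sketch below illustrates. The function name is hypothetical, and a subject is assumed to be classified as positive when the index is at or above the threshold. A ratio with a zero denominator is returned as NaN, which is one way a NaN can arise for PPV when no positive predictions are made.</p>
<preformat><![CDATA[
```python
def threshold_metrics(index_values, has_condition, threshold):
    """Se, Sp, PPV, NPV and Ac (in %) for the rule: index >= threshold is positive."""
    tp = fp = tn = fn = 0
    for value, truth in zip(index_values, has_condition):
        predicted = value >= threshold
        if predicted and truth:
            tp += 1
        elif predicted and not truth:
            fp += 1
        elif truth:
            fn += 1
        else:
            tn += 1

    def pct(num, den):
        return 100.0 * num / den if den else float("nan")

    return {
        "Se": pct(tp, tp + fn),
        "Sp": pct(tn, tn + fp),
        "PPV": pct(tp, tp + fp),
        "NPV": pct(tn, tn + fn),
        "Ac": pct(tp + tn, tp + fp + tn + fn),
    }
```
]]></preformat>
<p>Raising the threshold generally trades sensitivity for specificity, which is the pattern visible across the AHI and ODI rows of Table 4.</p>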
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p><bold>Statistics when using clinical thresholds on the demographics, AHI and ODI where both AHI and ODI were automatically calculated by the software</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Feature</th>
<th align="left">Threshold</th>
<th align="left">Se (%)</th>
<th align="left">Sp (%)</th>
<th align="left">PPV (%)</th>
<th align="left">NPV (%)</th>
<th align="left">Ac (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Gender</td>
<td align="left">male</td>
<td align="left">77.5</td>
<td align="left">36.7</td>
<td align="left">45.1</td>
<td align="left">70.8</td>
<td align="left">53.1</td>
</tr>
<tr>
<td align="left">Age</td>
<td align="left">50.0</td>
<td align="left">61.7</td>
<td align="left">59.0</td>
<td align="left">50.2</td>
<td align="left">69.6</td>
<td align="left">60.0</td>
</tr>
<tr>
<td align="left">Neck</td>
<td align="left">40.0</td>
<td align="left">84.9</td>
<td align="left">40.4</td>
<td align="left">51.7</td>
<td align="left">78.1</td>
<td align="left">59.5</td>
</tr>
<tr>
<td align="left">BMI</td>
<td align="left">35.0</td>
<td align="left">45.0</td>
<td align="left">73.8</td>
<td align="left">53.4</td>
<td align="left">66.8</td>
<td align="left">62.3</td>
</tr>
<tr>
<td align="left">ESS</td>
<td align="left">15.0</td>
<td align="left">46.4</td>
<td align="left">66.4</td>
<td align="left">48.5</td>
<td align="left">64.5</td>
<td align="left">58.3</td>
</tr>
<tr>
<td align="left">AHI</td>
<td align="left">5.0</td>
<td align="left">97.4</td>
<td align="left">55.3</td>
<td align="left">59.2</td>
<td align="left">96.9</td>
<td align="left">72.1</td>
</tr>
<tr>
<td align="left">AHI</td>
<td align="left">10.0</td>
<td align="left">92.9</td>
<td align="left">80.1</td>
<td align="left">75.6</td>
<td align="left">94.4</td>
<td align="left">85.2</td>
</tr>
<tr>
<td align="left">AHI</td>
<td align="left"><bold>15.0</bold></td>
<td align="left"><bold>83.5</bold></td>
<td align="left"><bold>87.6</bold></td>
<td align="left"><bold>81.7</bold></td>
<td align="left"><bold>88.8</bold></td>
<td align="left"><bold>86.0</bold></td>
</tr>
<tr>
<td align="left">AHI</td>
<td align="left">20.0</td>
<td align="left">71.4</td>
<td align="left">94.4</td>
<td align="left">89.5</td>
<td align="left">83.2</td>
<td align="left">85.2</td>
</tr>
<tr>
<td align="left">AHI</td>
<td align="left">30.0</td>
<td align="left">53.0</td>
<td align="left">97.9</td>
<td align="left">94.4</td>
<td align="left">75.8</td>
<td align="left">80.0</td>
</tr>
<tr>
<td align="left">ODI</td>
<td align="left">5.0</td>
<td align="left">97.6</td>
<td align="left">54.2</td>
<td align="left">58.7</td>
<td align="left">97.2</td>
<td align="left">71.6</td>
</tr>
<tr>
<td align="left">ODI</td>
<td align="left">10.0</td>
<td align="left">94.0</td>
<td align="left">81.1</td>
<td align="left">76.8</td>
<td align="left">95.3</td>
<td align="left">86.3</td>
</tr>
<tr>
<td align="left">ODI</td>
<td align="left"><bold>15.0</bold></td>
<td align="left"><bold>85.3</bold></td>
<td align="left"><bold>90.9</bold></td>
<td align="left"><bold>86.2</bold></td>
<td align="left"><bold>90.3</bold></td>
<td align="left"><bold>88.7</bold></td>
</tr>
<tr>
<td align="left">ODI</td>
<td align="left">20.0</td>
<td align="left">74.3</td>
<td align="left">96.0</td>
<td align="left">92.5</td>
<td align="left">84.9</td>
<td align="left">87.3</td>
</tr>
<tr>
<td align="left">ODI</td>
<td align="left">30.0</td>
<td align="left">56.2</td>
<td align="left">98.6</td>
<td align="left">96.4</td>
<td align="left">77.2</td>
<td align="left">81.6</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The metrics in bold are the baseline to beat as this is the classification problem being addressed: normal/snorer/mild OSA vs. moderate OSA/severe OSA</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>The results of the standard speech-based event classification analysis can be found in Table <xref ref-type="table" rid="T5">5</xref>, while the results of the MSE analysis using LDA and a RF can be found in Tables <xref ref-type="table" rid="T6">6</xref> and <xref ref-type="table" rid="T7">7</xref> respectively. The best results for each analysis are in bold font in the tables.</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p><bold>Performance when using standard speech analysis techniques and LDA on the test data</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Window</th>
<th align="left">Data</th>
<th align="left">Se (%)</th>
<th align="left">Sp (%)</th>
<th align="left">PPV (%)</th>
<th align="left">NPV (%)</th>
<th align="left">Ac (%)</th>
<th align="left">AUC</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">0.5&#x02009;s</td>
<td align="left">L</td>
<td align="left">2.6&#x02009;&#x000B1;&#x02009;3.6</td>
<td align="left">96.3&#x02009;&#x000B1;&#x02009;2.5</td>
<td align="left">12.4&#x02009;&#x000B1;&#x02009;17.0</td>
<td align="left">69.7&#x02009;&#x000B1;&#x02009;8.6</td>
<td align="left">68.0&#x02009;&#x000B1;&#x02009;8.2</td>
<td align="left">0.58&#x02009;&#x000B1;&#x02009;0.13</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C</td>
<td align="left">5.1&#x02009;&#x000B1;&#x02009;5.0</td>
<td align="left">89.7&#x02009;&#x000B1;&#x02009;7.4</td>
<td align="left">17.3&#x02009;&#x000B1;&#x02009;17.4</td>
<td align="left">68.8&#x02009;&#x000B1;&#x02009;9.2</td>
<td align="left">64.1&#x02009;&#x000B1;&#x02009;7.8</td>
<td align="left">0.51&#x02009;&#x000B1;&#x02009;0.10</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and C</td>
<td align="left">12.6&#x02009;&#x000B1;&#x02009;4.0</td>
<td align="left">81.4&#x02009;&#x000B1;&#x02009;17.3</td>
<td align="left">31.5&#x02009;&#x000B1;&#x02009;17.2</td>
<td align="left">67.0&#x02009;&#x000B1;&#x02009;5.4</td>
<td align="left">60.0&#x02009;&#x000B1;&#x02009;10.7</td>
<td align="left">0.53&#x02009;&#x000B1;&#x02009;0.14</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and D</td>
<td align="left">29.8&#x02009;&#x000B1;&#x02009;38.9</td>
<td align="left">66.2&#x02009;&#x000B1;&#x02009;28.7</td>
<td align="left">34.6&#x02009;&#x000B1;&#x02009;41.7</td>
<td align="left">71.1&#x02009;&#x000B1;&#x02009;6.2</td>
<td align="left">56.7&#x02009;&#x000B1;&#x02009;12.7</td>
<td align="left">0.57&#x02009;&#x000B1;&#x02009;0.09</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C and D</td>
<td align="left">18.2&#x02009;&#x000B1;&#x02009;20.0</td>
<td align="left">80.2&#x02009;&#x000B1;&#x02009;12.2</td>
<td align="left">29.0&#x02009;&#x000B1;&#x02009;16.8</td>
<td align="left">70.8&#x02009;&#x000B1;&#x02009;12.1</td>
<td align="left">60.7&#x02009;&#x000B1;&#x02009;4.0</td>
<td align="left">0.57&#x02009;&#x000B1;&#x02009;0.14</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L, C, and D</td>
<td align="left">38.8&#x02009;&#x000B1;&#x02009;37.7</td>
<td align="left">69.9&#x02009;&#x000B1;&#x02009;27.3</td>
<td align="left">33.5&#x02009;&#x000B1;&#x02009;31.8</td>
<td align="left">75.6&#x02009;&#x000B1;&#x02009;17.6</td>
<td align="left">57.0&#x02009;&#x000B1;&#x02009;13.5</td>
<td align="left">0.61&#x02009;&#x000B1;&#x02009;0.17</td>
</tr>
<tr>
<td align="left">1&#x02009;s</td>
<td align="left">L</td>
<td align="left">0.9&#x02009;&#x000B1;&#x02009;1.3</td>
<td align="left">97.8&#x02009;&#x000B1;&#x02009;2.2</td>
<td align="left">NaN&#x02009;&#x000B1;&#x02009;NaN</td>
<td align="left">69.0&#x02009;&#x000B1;&#x02009;9.8</td>
<td align="left">68.0&#x02009;&#x000B1;&#x02009;9.3</td>
<td align="left">0.48&#x02009;&#x000B1;&#x02009;0.07</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C</td>
<td align="left">6.9&#x02009;&#x000B1;&#x02009;6.1</td>
<td align="left">90.6&#x02009;&#x000B1;&#x02009;5.5</td>
<td align="left">21.7&#x02009;&#x000B1;&#x02009;15.9</td>
<td align="left">69.6&#x02009;&#x000B1;&#x02009;11.7</td>
<td align="left">66.1&#x02009;&#x000B1;&#x02009;12.6</td>
<td align="left">0.50&#x02009;&#x000B1;&#x02009;0.08</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and C</td>
<td align="left">5.6&#x02009;&#x000B1;&#x02009;4.7</td>
<td align="left">86.1&#x02009;&#x000B1;&#x02009;5.6</td>
<td align="left">19.4&#x02009;&#x000B1;&#x02009;19.4</td>
<td align="left">68.7&#x02009;&#x000B1;&#x02009;12.7</td>
<td align="left">62.1&#x02009;&#x000B1;&#x02009;9.0</td>
<td align="left">0.49&#x02009;&#x000B1;&#x02009;0.07</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and D</td>
<td align="left">26.4&#x02009;&#x000B1;&#x02009;27.9</td>
<td align="left">72.4&#x02009;&#x000B1;&#x02009;22.0</td>
<td align="left">29.7&#x02009;&#x000B1;&#x02009;27.9</td>
<td align="left">69.4&#x02009;&#x000B1;&#x02009;11.2</td>
<td align="left">61.2&#x02009;&#x000B1;&#x02009;14.0</td>
<td align="left">0.53&#x02009;&#x000B1;&#x02009;0.15</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C and D</td>
<td align="left">20.1&#x02009;&#x000B1;&#x02009;11.9</td>
<td align="left">75.4&#x02009;&#x000B1;&#x02009;6.9</td>
<td align="left">26.6&#x02009;&#x000B1;&#x02009;18.3</td>
<td align="left">69.3&#x02009;&#x000B1;&#x02009;2.6</td>
<td align="left">59.2&#x02009;&#x000B1;&#x02009;4.4</td>
<td align="left">0.54&#x02009;&#x000B1;&#x02009;0.15</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L, C, and D</td>
<td align="left">18.5&#x02009;&#x000B1;&#x02009;16.5</td>
<td align="left">80.2&#x02009;&#x000B1;&#x02009;17.8</td>
<td align="left">20.6&#x02009;&#x000B1;&#x02009;15.2</td>
<td align="left">70.1&#x02009;&#x000B1;&#x02009;12.9</td>
<td align="left">63.2&#x02009;&#x000B1;&#x02009;12.9</td>
<td align="left">0.53&#x02009;&#x000B1;&#x02009;0.08</td>
</tr>
<tr>
<td align="left">2&#x02009;s</td>
<td align="left">L</td>
<td align="left">9.1&#x02009;&#x000B1;&#x02009;7.2</td>
<td align="left">93.9&#x02009;&#x000B1;&#x02009;5.8</td>
<td align="left">NaN&#x02009;&#x000B1;&#x02009;NaN</td>
<td align="left">73.1&#x02009;&#x000B1;&#x02009;16.3</td>
<td align="left">69.8&#x02009;&#x000B1;&#x02009;13.9</td>
<td align="left">0.57&#x02009;&#x000B1;&#x02009;0.06</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C</td>
<td align="left">25.8&#x02009;&#x000B1;&#x02009;12.3</td>
<td align="left">85.6&#x02009;&#x000B1;&#x02009;6.4</td>
<td align="left">42.6&#x02009;&#x000B1;&#x02009;13.7</td>
<td align="left">74.4&#x02009;&#x000B1;&#x02009;4.5</td>
<td align="left">68.6&#x02009;&#x000B1;&#x02009;5.9</td>
<td align="left">0.67&#x02009;&#x000B1;&#x02009;0.08</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and C</td>
<td align="left">26.0&#x02009;&#x000B1;&#x02009;14.7</td>
<td align="left">86.2&#x02009;&#x000B1;&#x02009;11.7</td>
<td align="left">45.3&#x02009;&#x000B1;&#x02009;15.5</td>
<td align="left">73.7&#x02009;&#x000B1;&#x02009;10.3</td>
<td align="left">67.7&#x02009;&#x000B1;&#x02009;10.2</td>
<td align="left">0.65&#x02009;&#x000B1;&#x02009;0.11</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and D</td>
<td align="left">31.5&#x02009;&#x000B1;&#x02009;34.8</td>
<td align="left">80.8&#x02009;&#x000B1;&#x02009;20.8</td>
<td align="left">NaN&#x02009;&#x000B1;&#x02009;NaN</td>
<td align="left">75.6&#x02009;&#x000B1;&#x02009;11.5</td>
<td align="left">64.9&#x02009;&#x000B1;&#x02009;6.9</td>
<td align="left">0.62&#x02009;&#x000B1;&#x02009;0.07</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C and D</td>
<td align="left">31.0&#x02009;&#x000B1;&#x02009;19.2</td>
<td align="left">83.0&#x02009;&#x000B1;&#x02009;11.2</td>
<td align="left">37.5&#x02009;&#x000B1;&#x02009;31.2</td>
<td align="left">76.4&#x02009;&#x000B1;&#x02009;11.2</td>
<td align="left">68.5&#x02009;&#x000B1;&#x02009;8.1</td>
<td align="left">0.65&#x02009;&#x000B1;&#x02009;0.09</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L, C, and D</td>
<td align="left">39.4&#x02009;&#x000B1;&#x02009;19.6</td>
<td align="left">85.0&#x02009;&#x000B1;&#x02009;7.6</td>
<td align="left">48.5&#x02009;&#x000B1;&#x02009;20.5</td>
<td align="left">77.6&#x02009;&#x000B1;&#x02009;10.6</td>
<td align="left">70.6&#x02009;&#x000B1;&#x02009;7.0</td>
<td align="left">0.73&#x02009;&#x000B1;&#x02009;0.03</td>
</tr>
<tr>
<td align="left">3&#x02009;s</td>
<td align="left">L</td>
<td align="left">6.4&#x02009;&#x000B1;&#x02009;6.1</td>
<td align="left">93.9&#x02009;&#x000B1;&#x02009;4.9</td>
<td align="left">25.5&#x02009;&#x000B1;&#x02009;27.7</td>
<td align="left">80.4&#x02009;&#x000B1;&#x02009;7.5</td>
<td align="left">77.0&#x02009;&#x000B1;&#x02009;8.2</td>
<td align="left">0.61&#x02009;&#x000B1;&#x02009;0.08</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C</td>
<td align="left">10.0&#x02009;&#x000B1;&#x02009;13.7</td>
<td align="left">94.5&#x02009;&#x000B1;&#x02009;5.0</td>
<td align="left">NaN&#x02009;&#x000B1;&#x02009;NaN</td>
<td align="left">81.8&#x02009;&#x000B1;&#x02009;9.3</td>
<td align="left">78.9&#x02009;&#x000B1;&#x02009;10.5</td>
<td align="left">0.62&#x02009;&#x000B1;&#x02009;0.08</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and C</td>
<td align="left">28.7&#x02009;&#x000B1;&#x02009;25.6</td>
<td align="left">93.6&#x02009;&#x000B1;&#x02009;7.4</td>
<td align="left">45.3&#x02009;&#x000B1;&#x02009;41.0</td>
<td align="left">86.0&#x02009;&#x000B1;&#x02009;13.0</td>
<td align="left">82.1&#x02009;&#x000B1;&#x02009;11.0</td>
<td align="left">0.71&#x02009;&#x000B1;&#x02009;0.19</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L and D</td>
<td align="left">17.3&#x02009;&#x000B1;&#x02009;19.7</td>
<td align="left">86.3&#x02009;&#x000B1;&#x02009;13.0</td>
<td align="left">14.2&#x02009;&#x000B1;&#x02009;8.5</td>
<td align="left">81.0&#x02009;&#x000B1;&#x02009;13.3</td>
<td align="left">73.8&#x02009;&#x000B1;&#x02009;15.4</td>
<td align="left">0.68&#x02009;&#x000B1;&#x02009;0.17</td>
</tr>
<tr>
<td align="left"/>
<td align="left">C and D</td>
<td align="left">34.8&#x02009;&#x000B1;&#x02009;19.1</td>
<td align="left">89.7&#x02009;&#x000B1;&#x02009;10.6</td>
<td align="left">52.6&#x02009;&#x000B1;&#x02009;19.4</td>
<td align="left">84.4&#x02009;&#x000B1;&#x02009;6.5</td>
<td align="left">77.6&#x02009;&#x000B1;&#x02009;4.9</td>
<td align="left">0.76&#x02009;&#x000B1;&#x02009;0.10</td>
</tr>
<tr>
<td align="left"/>
<td align="left">L, C, and D</td>
<td align="left">29.2&#x02009;&#x000B1;&#x02009;17.4</td>
<td align="left">88.7&#x02009;&#x000B1;&#x02009;8.9</td>
<td align="left">49.1&#x02009;&#x000B1;&#x02009;29.9</td>
<td align="left">83.3&#x02009;&#x000B1;&#x02009;9.1</td>
<td align="left"><bold>76.9&#x02009;&#x000B1;&#x02009;10.6</bold></td>
<td align="left"><bold>0.73&#x02009;&#x000B1;&#x02009;0.11</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>L&#x02009;&#x0003D;&#x02009;LPC, C&#x02009;&#x0003D;&#x02009;MFCC, D&#x02009;&#x0003D;&#x02009;demographics. NaN indicates that the classifier never identified a true positive. The metrics in bold indicate the best performance</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p><bold>Performance of LDA when using MSE and demographics on the training and test data</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Features</th>
<th align="left">Data set</th>
<th align="center">Se (%)</th>
<th align="center">Sp (%)</th>
<th align="center">PPV (%)</th>
<th align="center">NPV (%)</th>
<th align="center">Ac (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">MSE<italic><sub>aud</sub></italic></td>
<td align="left">Train</td>
<td align="left">42.0&#x02009;&#x000B1;&#x02009;14.0</td>
<td align="left">81.8&#x02009;&#x000B1;&#x02009;9.5</td>
<td align="left">62.5&#x02009;&#x000B1;&#x02009;7.2</td>
<td align="left">68.7&#x02009;&#x000B1;&#x02009;2.8</td>
<td align="left">66.2&#x02009;&#x000B1;&#x02009;1.1</td>
<td align="left">0.66&#x02009;&#x000B1;&#x02009;0.01</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Test</td>
<td align="left">41.1&#x02009;&#x000B1;&#x02009;14.3</td>
<td align="left">78.5&#x02009;&#x000B1;&#x02009;11.7</td>
<td align="left">58.8&#x02009;&#x000B1;&#x02009;15.6</td>
<td align="left">67.1&#x02009;&#x000B1;&#x02009;8.2</td>
<td align="left">63.3&#x02009;&#x000B1;&#x02009;5.2</td>
<td align="left">0.64&#x02009;&#x000B1;&#x02009;0.03</td>
</tr>
<tr>
<td align="left">MSE<italic><sub>aud</sub></italic>&#x02009;&#x0002B;&#x02009;demos</td>
<td align="left">Train</td>
<td align="left">57.0&#x02009;&#x000B1;&#x02009;4.0</td>
<td align="left">78.7&#x02009;&#x000B1;&#x02009;3.3</td>
<td align="left">63.8&#x02009;&#x000B1;&#x02009;1.3</td>
<td align="left">73.7&#x02009;&#x000B1;&#x02009;1.6</td>
<td align="left">70.2&#x02009;&#x000B1;&#x02009;1.3</td>
<td align="left">0.76&#x02009;&#x000B1;&#x02009;0.01</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Test</td>
<td align="left">59.1&#x02009;&#x000B1;&#x02009;7.7</td>
<td align="left">77.5&#x02009;&#x000B1;&#x02009;2.8</td>
<td align="left">64.2&#x02009;&#x000B1;&#x02009;5.8</td>
<td align="left">73.1&#x02009;&#x000B1;&#x02009;7.5</td>
<td align="left">69.6&#x02009;&#x000B1;&#x02009;3.4</td>
<td align="left">0.74&#x02009;&#x000B1;&#x02009;0.03</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T7">
<label>Table 7</label>
<caption><p><bold>Performance of the RF when using MSE and demographics on the test data</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Features</th>
<th align="center">Se (%)</th>
<th align="center">Sp (%)</th>
<th align="center">PPV (%)</th>
<th align="center">NPV (%)</th>
<th align="center">Ac (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">MSE</td>
<td align="center">66.0&#x02009;&#x000B1;&#x02009;6.8</td>
<td align="center">88.8&#x02009;&#x000B1;&#x02009;1.8</td>
<td align="center">79.0&#x02009;&#x000B1;&#x02009;5.1</td>
<td align="center">80.1&#x02009;&#x000B1;&#x02009;4.1</td>
<td align="center">80.0&#x02009;&#x000B1;&#x02009;3.2</td>
<td align="center">0.86&#x02009;&#x000B1;&#x02009;0.04</td>
</tr>
<tr>
<td align="left">MSE&#x02009;&#x0002B;&#x02009;demos</td>
<td align="center">69.2&#x02009;&#x000B1;&#x02009;5.9</td>
<td align="center">87.9&#x02009;&#x000B1;&#x02009;3.9</td>
<td align="center">79.0&#x02009;&#x000B1;&#x02009;5.3</td>
<td align="center">81.2&#x02009;&#x000B1;&#x02009;5.8</td>
<td align="center"><bold>80.5&#x02009;<italic>&#x000B1;</italic>&#x02009;4.9</bold></td>
<td align="center"><bold>0.88&#x02009;<italic>&#x000B1;</italic>&#x02009;0.04</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The metrics in bold indicate the best performance</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>Note that the standard speech analysis techniques have an extremely low sensitivity. Adding demographics to the analysis almost always increases the sensitivity, but only by a moderate amount. Increasing the window size has a moderate effect, but the sensitivity remains low and the variance is high. For the classic speech analysis approach, the best results were consistently obtained when using a combination of LPC, MFCCs, and demographics, for a 3-s window size, achieving Ac&#x02009;&#x0003D;&#x02009;76.9%, Se&#x02009;&#x0003D;&#x02009;29.2%, Sp&#x02009;&#x0003D;&#x02009;88.7%, AUC&#x02009;&#x0003D;&#x02009;0.73.</p>
<p>As can be seen in Tables <xref ref-type="table" rid="T6">6</xref> and <xref ref-type="table" rid="T7">7</xref>, the best performance is obtained when <italic>MSE</italic>&#x02009;&#x0002B;&#x02009;<italic>demos</italic> are used in the analysis; the RF achieved Ac&#x02009;&#x0003D;&#x02009;80.5% and AUC&#x02009;&#x0003D;&#x02009;0.88 during validation. However, using <italic>MSE</italic> alone achieves comparable results within the variance over the folds. There is a better balance between Se (66.0%) and Sp (88.8%), unlike in the event classification approach, and the AUC is also higher (0.86 compared to 0.84).</p>
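<p>The statement that the two feature sets are comparable within the variance over the folds can be checked informally from the Table 7 values. The sketch below is only illustrative: the function name and the yardstick (difference in means no larger than the smaller SD) are assumptions made here, not a significance test and not necessarily the criterion used in the study.</p>
<preformat>
```python
# Mean and SD over folds as reported in Table 7 (RF, test data).
RESULTS = {
    "MSE": {"Ac": (80.0, 3.2), "AUC": (0.86, 0.04)},
    "MSE + demos": {"Ac": (80.5, 4.9), "AUC": (0.88, 0.04)},
}


def within_fold_variance(a, b):
    """Return True if the two means differ by no more than the smaller SD.

    Informal yardstick chosen for illustration only (an assumption).
    """
    mean_a, sd_a = a
    mean_b, sd_b = b
    return min(sd_a, sd_b) >= abs(mean_a - mean_b)


for metric in ("Ac", "AUC"):
    same = within_fold_variance(RESULTS["MSE"][metric],
                                RESULTS["MSE + demos"][metric])
    print(metric, same)
```
</preformat>
<p>Under this assumed yardstick, both Ac and AUC for <italic>MSE</italic> alone fall within one SD of the <italic>MSE</italic>&#x02009;&#x0002B;&#x02009;<italic>demos</italic> values.</p>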
</sec>
<sec id="S4" sec-type="discussion">
<label>4</label> <title>Discussion</title>
<p>As can be seen in Table <xref ref-type="table" rid="T4">4</xref>, demographics or questionnaires alone perform poorly when classifying OSA severity; they are either sensitive or specific, never both, and accuracy is only slightly better than random chance (ranging from 53 to 62%). This is not unexpected, particularly when subjects note down these figures themselves rather than having them measured and recorded by a healthcare professional; studies have shown that people are poor at self-reporting height and weight (Engstrom et al., <xref ref-type="bibr" rid="B10">2003</xref>). It would therefore be prudent to move away from the use of such information and perhaps rely on objective signals only, if they provide a lower error rate. Both the AHI and ODI are good classification features. It is clear from Table <xref ref-type="table" rid="T4">4</xref> that a threshold of 15 provides the best classification. This threshold separates normal/snorer/mild from moderate/severe subjects, which is the classification problem addressed in this work. Using it gives performance statistics in the high-80% to low-90% range.</p>
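<p>The thresholding step described above can be sketched as follows. This is a minimal illustration: the function name, the example values, and the choice to treat a value exactly equal to the threshold as positive are all assumptions, not details taken from the study.</p>
<preformat>
```python
# Threshold from the text: an index of 15 separates normal/snorer/mild
# from moderate/severe. Treating the boundary value itself as positive
# is an assumption made here.
THRESHOLD = 15.0


def is_moderate_or_severe(index_value, threshold=THRESHOLD):
    """Binary label from an AHI or ODI value (events per hour)."""
    return index_value >= threshold


# Hypothetical index values, invented purely for illustration.
example_values = [2.1, 9.8, 15.0, 31.4]
labels = [is_moderate_or_severe(v) for v in example_values]
print(labels)  # [False, False, True, True]
```
</preformat>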
<p>It is clear that the data used for the standard speech-based classifier approach are heterogeneous, and that each fold is significantly different, as evidenced by the large SD values for some of the metrics. This is likely due to the lack of annotations, i.e., there are insufficient data for this analysis or there are too many features. It should be noted that in the training data set, there are up to 30 features (when LPC&#x02009;&#x0002B;&#x02009;MFCCs&#x02009;&#x0002B;&#x02009;demos are combined) but only 56 subjects. This causes the problem to be under-specified, and leads to poor results on the test data. In addition, using three or more annotators would make the annotations more robust: only those events on which two or more annotators agree would be used in the analysis, which could improve the performance of the classifier.</p>
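<p>The proposed agreement rule (retain an event only when two or more annotators assign the same label) might be implemented along the following lines. The function names, event identifiers, and example labels are hypothetical; only the two class names (choke and noise/snore) come from the text.</p>
<preformat>
```python
from collections import Counter


def consensus_label(labels, min_agree=2):
    """Return the label given by at least min_agree annotators, else None.

    A tie for the most frequent label is treated as no consensus.
    """
    if not labels:
        return None
    ranked = Counter(labels).most_common(2)
    top_label, top_count = ranked[0]
    tied = len(ranked) > 1 and ranked[1][1] == top_count
    if tied or min_agree > top_count:
        return None
    return top_label


def filter_events(annotations, min_agree=2):
    """Keep only events with a consensus label.

    annotations maps a hypothetical event id to the list of labels
    assigned by the individual annotators.
    """
    kept = {}
    for event_id, labels in annotations.items():
        label = consensus_label(labels, min_agree)
        if label is not None:
            kept[event_id] = label
    return kept


# Invented example: e1 and e2 have three annotators, e3 only two.
example = {
    "e1": ["choke", "choke", "noise/snore"],
    "e2": ["noise/snore", "noise/snore", "noise/snore"],
    "e3": ["choke", "noise/snore"],
}
print(filter_events(example))  # {'e1': 'choke', 'e2': 'noise/snore'}
```
</preformat>
<p>Event e3 is dropped because its two annotators disagree, which is the behavior the agreement rule is meant to enforce.</p>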
<p>The standard speech-based approach was unable to exceed an Ac of 82% when classifying by subject. Table <xref ref-type="table" rid="T5">5</xref> indicates that using LDA results in a very specific classifier (in the high 90s) with very low sensitivity (less than 40%). In addition, the PPV could not be computed for some of the combinations of features, because the classifier never identified a true positive.</p>
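<p>For reference, the reported metrics follow from the confusion-matrix counts in the usual way, and the PPV becomes undefined when a classifier makes no positive predictions at all. The sketch below illustrates this under the assumption that the NaN entries arise from such a zero denominator; the function names and the example counts are invented for illustration.</p>
<preformat>
```python
import math


def safe_ratio(numerator, denominator):
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def classification_metrics(tp, fp, tn, fn):
    """Se, Sp, PPV, NPV and Ac (as fractions) from confusion-matrix counts."""
    return {
        "Se": safe_ratio(tp, tp + fn),
        "Sp": safe_ratio(tn, tn + fp),
        "PPV": safe_ratio(tp, tp + fp),
        "NPV": safe_ratio(tn, tn + fn),
        "Ac": safe_ratio(tp + tn, tp + fp + tn + fn),
    }


# Invented counts: a classifier that labels every event as negative.
m = classification_metrics(tp=0, fp=0, tn=90, fn=10)
print(m["Se"], m["Sp"], math.isnan(m["PPV"]))  # 0.0 1.0 True
```
</preformat>
<p>In this invented case the accuracy is still 0.9 even though no positive case is found, which shows why Ac alone can be misleading when the classes are imbalanced.</p>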
<p>The fact that the combination of LPC and MFCC consistently provided the best accuracy in determining whether an event was a choke or noise/snore, regardless of window size, indicates that the LPC coefficients and the MFCCs provided complementary information. The addition of demographics also improved performance. It is worth noting that fewer events were used at 3&#x02009;s than at 0.5&#x02009;s (a decrease from 391 noise/snore events and 175 chokes at 0.5&#x02009;s to 326 noise/snore events and 82 chokes at 3&#x02009;s). The ratio of noise/snore events to chokes therefore changed substantially, from roughly 2.2:1 at 0.5&#x02009;s to roughly 4:1 at 3&#x02009;s.</p>
<p>There are a number of limitations to this approach. Annotating the data is labor intensive and, ideally, there should be three annotators to ensure the quality of the annotation. In this work, one clinical research fellow with 2&#x02009;years of training labeled the data, which is not optimal.</p>
<p>Assuming that the data were recorded in a low noise environment, the performance achieved is insufficient for screening subjects. Even with the performance metrics as good as they are here, the accuracy needs to be much higher (in the high 80% range) for this approach to be clinically acceptable (Collop et al., <xref ref-type="bibr" rid="B6">2007</xref>), although the ability of portable monitors to correctly diagnose subjects with OSA needs further validation.</p>
<p>The results presented here are lower than those reported in the literature [Se&#x02009;&#x0003D;&#x02009;88% and Sp&#x02009;&#x0003D;&#x02009;82% for LPC (Ng et al., <xref ref-type="bibr" rid="B27">2008a</xref>); Se&#x02009;&#x0003D;&#x02009;82% for MFCCs (Cavusoglu et al., <xref ref-type="bibr" rid="B5">2007</xref>)]. However, many approaches consider in-sample classification, which leads to a significant over-estimation of out-of-sample performance. In addition, the literature considers a different classification problem, i.e., thresholding on a given feature to differentiate apneic snores from benign snores, whereas in the analysis above the first breath after an apnea was differentiated from benign snores and noise, which will also influence performance. The poorer performance could also be due to the lack of annotations in this analysis, resulting in an under-specified system. Moreover, none of the snoring events used in this analysis were graded, and the availability of a grading may have improved performance. It is worth noting that the classification performance is reported for the classes of sounds themselves, and not for a subject overall.</p>
<p>Finally, the need for an event detector, followed by downstream classification to turn the classified events into a diagnosis, is likely to degrade the performance further.</p>
<p>In contrast to the speech-based approaches and events classification, the MSE approach yielded a much higher Se, PPV, and AUC with marginal drops in Ac and Sp. Moreover, the addition of demographics made little difference to the classification performance, indicating that the technique may be more suitable as a general tool. The use of an RF applied to MSE coefficients requires very little preprocessing and no segmentation or event detection. While the sensitivity does not surpass 70%, this technique may be suitable as a post-screening test after screening with a pulse oximeter, particularly in settings where there is significant noise from external sources.</p>
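<p>To illustrate why the MSE approach needs no segmentation or event detection, the sketch below computes multiscale entropy features directly from a raw series, following the coarse-graining and sample entropy steps of Costa et al. (2002). It is a minimal pure-Python illustration, not the implementation used in this work: the parameter values (m&#x02009;&#x0003D;&#x02009;2, a tolerance of 0.15 times the SD of the original series), the number of scales, the inclusive tolerance test, and the synthetic white-noise input are all assumptions.</p>
<preformat>
```python
import math
import random


def coarse_grain(series, scale):
    """Average consecutive, non-overlapping windows of length scale."""
    n_windows = len(series) // scale
    return [
        sum(series[i * scale:(i + 1) * scale]) / scale
        for i in range(n_windows)
    ]


def sample_entropy(series, m, r):
    """Sample entropy -ln(A / B), where B and A count template pairs of
    length m and m + 1 whose Chebyshev distance is at most r."""
    n = len(series)

    def count_matches(length):
        # Use the same n - m template start points for both lengths.
        templates = [series[i:i + length] for i in range(n - m)]
        count = 0
        for i in range(len(templates)):
            for j in range(i + 1, len(templates)):
                dist = max(abs(a - b)
                           for a, b in zip(templates[i], templates[j]))
                if r >= dist:
                    count += 1
        return count

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        return math.nan  # undefined when no matches are found
    return -math.log(a / b)


def multiscale_entropy(series, max_scale, m=2, r_factor=0.15):
    """Sample entropy of the coarse-grained series at scales 1..max_scale.

    The tolerance r is fixed from the SD of the original series;
    m and r_factor are assumed defaults.
    """
    mean = sum(series) / len(series)
    sd = math.sqrt(sum((x - mean) ** 2 for x in series) / len(series))
    r = r_factor * sd
    return [
        sample_entropy(coarse_grain(series, scale), m, r)
        for scale in range(1, max_scale + 1)
    ]


# Synthetic white noise stands in for an audio segment (illustration only).
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(600)]
features = multiscale_entropy(noise, max_scale=3)
print(features)
```
</preformat>
<p>The resulting list of per-scale entropy values is the kind of feature vector that could then be passed, with or without demographics, to a classifier such as an RF.</p>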
</sec>
<sec id="S5">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S6">
<title>Funding</title>
<p>This research was supported by the RCUK Digital Economy Programme grant &#x00023; EP/G036861/1 (Oxford Centre for Doctoral Training in Healthcare Innovation) and the University of Oxford Sleep and Circadian Neuroscience Institute (SCNi), Centre Grant &#x00023; 098461/Z/12/Z.</p>
</sec>
</body>
<back>
<ack>
<p>The authors are deeply grateful to Dr. John Stradling, Professor of Respiratory Medicine and the late Dr. Lyn Davies, Stowood Scientific Instruments Ltd., Beckley, Oxford, for their insights, advice, and support.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abeyratne</surname> <given-names>U. R.</given-names></name> <name><surname>Wakwella</surname> <given-names>A. S.</given-names></name> <name><surname>Hukins</surname> <given-names>C.</given-names></name></person-group> (<year>2005</year>). <article-title>Pitch jump probability measures for the analysis of snoring sounds in apnea</article-title>. <source>Physiol. Meas.</source> <volume>26</volume>, <fpage>779</fpage>&#x02013;<lpage>798</lpage>.<pub-id pub-id-type="doi">10.1088/0967-3334/26/5/016</pub-id><pub-id pub-id-type="pmid">16088068</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bearpark</surname> <given-names>H.</given-names></name> <name><surname>Elliott</surname> <given-names>L.</given-names></name> <name><surname>Grunstein</surname> <given-names>R.</given-names></name> <name><surname>Cullen</surname> <given-names>S.</given-names></name> <name><surname>Schneider</surname> <given-names>H.</given-names></name> <name><surname>Althaus</surname> <given-names>W.</given-names></name> <etal/></person-group> (<year>1995</year>). <article-title>Snoring and sleep apnea. A population study in Australian men</article-title>. <source>Am. J. Respir. Crit. Care Med.</source> <volume>151</volume>, <fpage>1459</fpage>&#x02013;<lpage>1465</lpage>.<pub-id pub-id-type="doi">10.1164/ajrccm.151.5.7735600</pub-id><pub-id pub-id-type="pmid">7735600</pub-id></citation></ref>
<ref id="B3"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Bogert</surname> <given-names>B. P.</given-names></name> <name><surname>Healy</surname> <given-names>M. J. R.</given-names></name> <name><surname>Tukey</surname> <given-names>J. W.</given-names></name></person-group> (<year>1963</year>). &#x0201C;<article-title>The quefrency alanysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking</article-title>,&#x0201D; in <conf-name>Proceedings of the Symposium on Time Series Analysis</conf-name>, (<conf-loc>New York</conf-loc>: <conf-sponsor>Wiley</conf-sponsor>), <fpage>209</fpage>&#x02013;<lpage>243</lpage>.</citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname> <given-names>L.</given-names></name></person-group> (<year>2001</year>). <article-title>Random forests</article-title>. <source>Mach. Learn.</source> <volume>45</volume>, <fpage>5</fpage>&#x02013;<lpage>32</lpage>.<pub-id pub-id-type="doi">10.1023/A:1017934522171</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cavusoglu</surname> <given-names>M.</given-names></name> <name><surname>Kamasak</surname> <given-names>M.</given-names></name> <name><surname>Erogul</surname> <given-names>O.</given-names></name> <name><surname>Ciloglu</surname> <given-names>T.</given-names></name> <name><surname>Serinagaoglu</surname> <given-names>Y.</given-names></name> <name><surname>Akcam</surname> <given-names>T.</given-names></name></person-group> (<year>2007</year>). <article-title>An efficient method for snore/nonsnore classification of sleep sounds</article-title>. <source>Physiol. Meas.</source> <volume>28</volume>, <fpage>841</fpage>&#x02013;<lpage>853</lpage>.<pub-id pub-id-type="doi">10.1088/0967-3334/28/8/007</pub-id><pub-id pub-id-type="pmid">17664676</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collop</surname> <given-names>N. A.</given-names></name> <name><surname>McDowell Anderson</surname> <given-names>W.</given-names></name> <name><surname>Boehlecke</surname> <given-names>B.</given-names></name> <name><surname>Claman</surname> <given-names>D.</given-names></name> <name><surname>Goldberg</surname> <given-names>R.</given-names></name> <name><surname>Gottlieb</surname> <given-names>D. J.</given-names></name> <etal/></person-group> (<year>2007</year>). <article-title>Clinical guidelines for the use of unattended portable monitors in the diagnosis of obstructive sleep apnea in adult patients</article-title>. <source>J. Clin. Sleep Med.</source> <volume>3</volume>, <fpage>737</fpage>&#x02013;<lpage>747</lpage>.<pub-id pub-id-type="pmid">18198809</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Costa</surname> <given-names>M.</given-names></name> <name><surname>Goldberger</surname> <given-names>A. L.</given-names></name> <name><surname>Peng</surname> <given-names>C. K.</given-names></name></person-group> (<year>2002</year>). <article-title>Multiscale entropy analysis of complex physiologic time series</article-title>. <source>Phys. Rev. Lett.</source> <volume>89</volume>, <fpage>068102</fpage>.<pub-id pub-id-type="doi">10.1103/PhysRevLett.89.068102</pub-id><pub-id pub-id-type="pmid">12190613</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Costa</surname> <given-names>M.</given-names></name> <name><surname>Peng</surname> <given-names>C. K.</given-names></name> <name><surname>Goldberger</surname> <given-names>A. L.</given-names></name> <name><surname>Hausdorff</surname> <given-names>J. M.</given-names></name></person-group> (<year>2003</year>). <article-title>Multiscale entropy analysis of human gait dynamics</article-title>. <source>Physica A</source> <volume>330</volume>, <fpage>53</fpage>&#x02013;<lpage>60</lpage>.<pub-id pub-id-type="doi">10.1016/j.physa.2003.08.022</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davis</surname> <given-names>S.</given-names></name> <name><surname>Mermelstein</surname> <given-names>P.</given-names></name></person-group> (<year>1980</year>). <article-title>Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences</article-title>. <source>IEEE Trans. Acoust.</source> <volume>28</volume>, <fpage>357</fpage>&#x02013;<lpage>366</lpage>.</citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Engstrom</surname> <given-names>J. L.</given-names></name> <name><surname>Paterson</surname> <given-names>S. A.</given-names></name> <name><surname>Doherty</surname> <given-names>A.</given-names></name> <name><surname>Trabulsi</surname> <given-names>M.</given-names></name> <name><surname>Speer</surname> <given-names>K. L.</given-names></name></person-group> (<year>2003</year>). <article-title>Accuracy of self-reported height and weight in women: an integrative review of the literature</article-title>. <source>J. Midwifery Womens Health</source> <volume>48</volume>, <fpage>338</fpage>&#x02013;<lpage>345</lpage>.<pub-id pub-id-type="doi">10.1016/S1526-9523(03)00281-2</pub-id><pub-id pub-id-type="pmid">14526347</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Epstein</surname> <given-names>L. J.</given-names></name> <name><surname>Dorlac</surname> <given-names>G. R.</given-names></name></person-group> (<year>1998</year>). <article-title>Cost-effectiveness analysis of nocturnal oximetry as a method of screening for sleep apnea-hypopnea syndrome</article-title>. <source>Chest</source> <volume>113</volume>, <fpage>97</fpage>&#x02013;<lpage>103</lpage>.<pub-id pub-id-type="doi">10.1378/chest.113.1.97</pub-id><pub-id pub-id-type="pmid">9440575</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goldberger</surname> <given-names>A. L.</given-names></name> <name><surname>Amaral</surname> <given-names>L. A. N.</given-names></name> <name><surname>Glass</surname> <given-names>L.</given-names></name> <name><surname>Hausdorff</surname> <given-names>J. M.</given-names></name> <name><surname>Ivanov</surname> <given-names>P. C.</given-names></name> <name><surname>Mark</surname> <given-names>R. G.</given-names></name> <etal/></person-group> (<year>2000</year>). <article-title>PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals</article-title>. <source>Circulation</source> <volume>101</volume>, <fpage>e215</fpage>&#x02013;<lpage>e220</lpage>.<pub-id pub-id-type="doi">10.1161/01.CIR.101.23.e215</pub-id><pub-id pub-id-type="pmid">10851218</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goldshtein</surname> <given-names>E.</given-names></name> <name><surname>Tarasiuk</surname> <given-names>A.</given-names></name> <name><surname>Zigel</surname> <given-names>Y.</given-names></name></person-group> (<year>2011</year>). <article-title>Automatic detection of obstructive sleep apnea using speech signals</article-title>. <source>IEEE Trans. Biomed. Eng.</source> <volume>58</volume>, <fpage>1373</fpage>&#x02013;<lpage>1382</lpage>.<pub-id pub-id-type="doi">10.1109/TBME.2010.2100096</pub-id><pub-id pub-id-type="pmid">21172747</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hastings</surname> <given-names>W. K.</given-names></name></person-group> (<year>1970</year>). <article-title>Monte Carlo sampling methods using Markov chains and their applications</article-title>. <source>Biometrika</source> <volume>57</volume>, <fpage>97</fpage>&#x02013;<lpage>109</lpage>.<pub-id pub-id-type="doi">10.1093/biomet/57.1.97</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ip</surname> <given-names>M. S.</given-names></name> <name><surname>Lam</surname> <given-names>B.</given-names></name> <name><surname>Lauder</surname> <given-names>I. J.</given-names></name> <name><surname>Tsang</surname> <given-names>K. W. T.</given-names></name> <name><surname>Chung</surname> <given-names>K. F.</given-names></name> <name><surname>Mok</surname> <given-names>Y. W.</given-names></name> <etal/></person-group> (<year>2001</year>). <article-title>A community study of sleep-disordered breathing in middle-aged Chinese men in Hong Kong</article-title>. <source>Chest</source> <volume>119</volume>, <fpage>62</fpage>&#x02013;<lpage>69</lpage>.<pub-id pub-id-type="doi">10.1378/chest.119.1.62</pub-id><pub-id pub-id-type="pmid">11157585</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ip</surname> <given-names>M. S.</given-names></name> <name><surname>Lam</surname> <given-names>B.</given-names></name> <name><surname>Tang</surname> <given-names>L. C.</given-names></name> <name><surname>Lauder</surname> <given-names>I. J.</given-names></name> <name><surname>Ip</surname> <given-names>T. Y.</given-names></name> <name><surname>Lam</surname> <given-names>W. K.</given-names></name></person-group> (<year>2004</year>). <article-title>A community study of sleep-disordered breathing in middle-aged Chinese women in Hong Kong: prevalence and gender differences</article-title>. <source>Chest</source> <volume>125</volume>, <fpage>127</fpage>&#x02013;<lpage>134</lpage>.<pub-id pub-id-type="doi">10.1378/chest.125.1.127</pub-id><pub-id pub-id-type="pmid">14718431</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johns</surname> <given-names>M. W.</given-names></name></person-group> (<year>1991</year>). <article-title>A new method for measuring daytime sleepiness: the Epworth sleepiness scale</article-title>. <source>Sleep</source> <volume>14</volume>, <fpage>540</fpage>&#x02013;<lpage>545</lpage>.</citation></ref>
<ref id="B18"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Johnson</surname> <given-names>A.</given-names></name> <name><surname>Dunkley</surname> <given-names>N.</given-names></name> <name><surname>Mayaud</surname> <given-names>L.</given-names></name> <name><surname>Tsanas</surname> <given-names>A.</given-names></name> <name><surname>Kramer</surname> <given-names>A.</given-names></name> <name><surname>Clifford</surname> <given-names>G.</given-names></name></person-group> (<year>2012</year>). &#x0201C;<article-title>Patient specific predictions in the intensive care unit using a Bayesian ensemble</article-title>,&#x0201D; in <source>Computing in Cardiology (CinC), 2012</source> ed. <person-group person-group-type="editor"><name><surname>Murray</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Krakow</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>249</fpage>&#x02013;<lpage>252</lpage>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>T. M.</given-names></name> <name><surname>Ho</surname> <given-names>M. S.</given-names></name> <name><surname>Earis</surname> <given-names>J. E.</given-names></name> <name><surname>Swift</surname> <given-names>A. C.</given-names></name></person-group> (<year>2006a</year>). <article-title>Acoustic parameters of snoring sound to assess the effectiveness of the M&#x000FC;ller Manoeuvre in predicting surgical outcome</article-title>. <source>Auris Nasus Larynx</source> <volume>33</volume>, <fpage>409</fpage>&#x02013;<lpage>416</lpage>.<pub-id pub-id-type="doi">10.1016/j.anl.2006.05.003</pub-id><pub-id pub-id-type="pmid">16887312</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>T. M.</given-names></name> <name><surname>Walker</surname> <given-names>P.</given-names></name> <name><surname>Ho</surname> <given-names>M.</given-names></name> <name><surname>Earis</surname> <given-names>J. E.</given-names></name> <name><surname>Swift</surname> <given-names>A. C.</given-names></name> <name><surname>Charters</surname> <given-names>P.</given-names></name></person-group> (<year>2006b</year>). <article-title>Acoustic parameters of snoring sound to assess the effectiveness of sleep nasendoscopy in predicting surgical outcome</article-title>. <source>Otolaryngol. Head Neck Surg.</source> <volume>135</volume>, <fpage>269</fpage>&#x02013;<lpage>275</lpage>.<pub-id pub-id-type="doi">10.1016/j.otohns.2005.11.051</pub-id><pub-id pub-id-type="pmid">16890081</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>T. M.</given-names></name> <name><surname>Swift</surname> <given-names>A. C.</given-names></name> <name><surname>Calverley</surname> <given-names>P. M. A.</given-names></name> <name><surname>Ho</surname> <given-names>M. S.</given-names></name> <name><surname>Earis</surname> <given-names>J. E.</given-names></name></person-group> (<year>2005</year>). <article-title>Acoustic analysis of snoring before and after palatal surgery</article-title>. <source>Eur. Respir. J.</source> <volume>25</volume>, <fpage>1044</fpage>&#x02013;<lpage>1049</lpage>.<pub-id pub-id-type="doi">10.1183/09031936.05.00101703</pub-id><pub-id pub-id-type="pmid">15929960</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>J. K.</given-names></name> <name><surname>In</surname> <given-names>K. H.</given-names></name> <name><surname>Kim</surname> <given-names>J. H.</given-names></name> <name><surname>You</surname> <given-names>S. H.</given-names></name> <name><surname>Kang</surname> <given-names>K. H.</given-names></name> <name><surname>Shim</surname> <given-names>J. J.</given-names></name> <etal/></person-group> (<year>2004</year>). <article-title>Prevalence of sleep-disordered breathing in middle-aged Korean men and women</article-title>. <source>Am. J. Respir. Crit. Care Med.</source> <volume>170</volume>, <fpage>1108</fpage>&#x02013;<lpage>1113</lpage>.<pub-id pub-id-type="doi">10.1164/rccm.200404-519OC</pub-id><pub-id pub-id-type="pmid">15347562</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lam</surname> <given-names>B.</given-names></name> <name><surname>Lam</surname> <given-names>D. C. L.</given-names></name> <name><surname>Ip</surname> <given-names>M. S. M.</given-names></name></person-group> (<year>2007</year>). <article-title>Obstructive sleep apnoea in Asia</article-title>. <source>Int. J. Tuberc. Lung Dis.</source> <volume>11</volume>, <fpage>2</fpage>&#x02013;<lpage>11</lpage>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Metropolis</surname> <given-names>N.</given-names></name> <name><surname>Rosenbluth</surname> <given-names>A. W.</given-names></name> <name><surname>Rosenbluth</surname> <given-names>M. N.</given-names></name> <name><surname>Teller</surname> <given-names>A. H.</given-names></name> <name><surname>Teller</surname> <given-names>E.</given-names></name></person-group> (<year>1953</year>). <article-title>Equation of state calculations by fast computing machines</article-title>. <source>J. Chem. Phys.</source> <volume>21</volume>, <fpage>1087</fpage>.<pub-id pub-id-type="doi">10.1063/1.1699114</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michaelson</surname> <given-names>P. G.</given-names></name> <name><surname>Allan</surname> <given-names>P.</given-names></name> <name><surname>Chaney</surname> <given-names>J.</given-names></name> <name><surname>Mair</surname> <given-names>E. A.</given-names></name></person-group> (<year>2006</year>). <article-title>Validations of a portable home sleep study with twelve-lead polysomnography: comparisons and insights into a variable gold standard</article-title>. <source>Ann. Otol. Rhinol. Laryngol.</source> <volume>115</volume>, <fpage>802</fpage>&#x02013;<lpage>809</lpage>.<pub-id pub-id-type="doi">10.1177/000348940611501102</pub-id><pub-id pub-id-type="pmid">17165661</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Monahan</surname> <given-names>K.</given-names></name> <name><surname>Redline</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>Role of obstructive sleep apnea in cardiovascular disease</article-title>. <source>Curr. Opin. Cardiol.</source> <volume>26</volume>, <fpage>541</fpage>&#x02013;<lpage>547</lpage>.<pub-id pub-id-type="doi">10.1097/HCO.0b013e32834b806a</pub-id><pub-id pub-id-type="pmid">21993356</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>A. K.</given-names></name> <name><surname>Koh</surname> <given-names>T. S.</given-names></name> <name><surname>Baey</surname> <given-names>E.</given-names></name> <name><surname>Lee</surname> <given-names>T. H.</given-names></name> <name><surname>Abeyratne</surname> <given-names>U. R.</given-names></name> <name><surname>Puvanendran</surname> <given-names>K.</given-names></name></person-group> (<year>2008a</year>). <article-title>Could formant frequencies of snore signals be an alternative means for the diagnosis of obstructive sleep apnea?</article-title> <source>Sleep Med.</source> <volume>9</volume>, <fpage>894</fpage>&#x02013;<lpage>898</lpage>.<pub-id pub-id-type="doi">10.1016/j.sleep.2007.07.010</pub-id><pub-id pub-id-type="pmid">17825609</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>A. K.</given-names></name> <name><surname>San Koh</surname> <given-names>T.</given-names></name> <name><surname>Puvanendran</surname> <given-names>K.</given-names></name> <name><surname>Abeyratne</surname> <given-names>U. R.</given-names></name></person-group> (<year>2008b</year>). <article-title>Snore signal enhancement and activity detection via translation-invariant wavelet transform</article-title>. <source>IEEE Trans. Biomed. Eng.</source> <volume>55</volume>, <fpage>2332</fpage>&#x02013;<lpage>2342</lpage>.<pub-id pub-id-type="doi">10.1109/TBME.2008.925682</pub-id><pub-id pub-id-type="pmid">18838358</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabiner</surname> <given-names>L. R.</given-names></name> <name><surname>Schafer</surname> <given-names>R. W.</given-names></name></person-group> (<year>2007</year>). <article-title>Introduction to digital speech processing</article-title>. <source>Found. Trends Signal Process.</source> <volume>1</volume>, <fpage>1</fpage>&#x02013;<lpage>194</lpage>.<pub-id pub-id-type="doi">10.1561/2000000001</pub-id></citation></ref>
<ref id="B30"><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Roebuck</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <source>Comparative Analysis of Polysomnographic Signals for Classifying Obstructive Sleep Apnoea</source>. PhD thesis, <publisher-name>University of Oxford, Department of Engineering Science</publisher-name>, <publisher-loc>Oxford</publisher-loc>.</citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roebuck</surname> <given-names>A.</given-names></name> <name><surname>Clifford</surname> <given-names>G.</given-names></name></person-group> (<year>2012</year>). <article-title>Multiscale entropy applied to audio data for classifying obstructive sleep apnoea patients</article-title>. <source>Am. J. Respir. Crit. Care Med.</source> <volume>185</volume>, <fpage>A3841</fpage>.<pub-id pub-id-type="doi">10.1164/ajrccm-conference.2012.185.1_meetingabstracts.a3841</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roebuck</surname> <given-names>A.</given-names></name> <name><surname>Monasterio</surname> <given-names>V.</given-names></name> <name><surname>Gederi</surname> <given-names>E.</given-names></name> <name><surname>Osipov</surname> <given-names>M.</given-names></name> <name><surname>Behar</surname> <given-names>J.</given-names></name> <name><surname>Malhotra</surname> <given-names>A.</given-names></name> <etal/></person-group> (<year>2014</year>). <article-title>A review of signals used in sleep analysis</article-title>. <source>Physiol. Meas.</source> <volume>35</volume>, <fpage>R1</fpage>&#x02013;<lpage>R57</lpage>.<pub-id pub-id-type="doi">10.1088/0967-3334/35/1/R1</pub-id><pub-id pub-id-type="pmid">24346125</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosenfeld</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>Sleep dysfunction, diabetes, and pain: a troublesome triad</article-title>. <source>J. Fam. Pract.</source> <volume>63</volume>, <fpage>S19</fpage>&#x02013;<lpage>S24</lpage>.<pub-id pub-id-type="pmid">25061630</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname> <given-names>S. K.</given-names></name> <name><surname>Kumpawat</surname> <given-names>S.</given-names></name> <name><surname>Banga</surname> <given-names>A.</given-names></name> <name><surname>Goel</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Prevalence and risk factors of obstructive sleep apnea syndrome in a population of Delhi, India</article-title>. <source>Chest</source> <volume>130</volume>, <fpage>149</fpage>&#x02013;<lpage>156</lpage>.<pub-id pub-id-type="doi">10.1378/chest.130.1.149</pub-id><pub-id pub-id-type="pmid">16840395</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stradling</surname> <given-names>J. R.</given-names></name> <name><surname>Crosby</surname> <given-names>J. H.</given-names></name></person-group> (<year>1990</year>). <article-title>Relation between systemic hypertension and sleep hypoxaemia or snoring: analysis in 748 men drawn from general practice</article-title>. <source>BMJ</source> <volume>300</volume>, <fpage>75</fpage>&#x02013;<lpage>78</lpage>.<pub-id pub-id-type="doi">10.1136/bmj.300.6717.75</pub-id><pub-id pub-id-type="pmid">2105777</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Udwadia</surname> <given-names>Z. F.</given-names></name> <name><surname>Doshi</surname> <given-names>A. V.</given-names></name> <name><surname>Lonkar</surname> <given-names>S. G.</given-names></name> <name><surname>Singh</surname> <given-names>C. I.</given-names></name></person-group> (<year>2004</year>). <article-title>Prevalence of sleep-disordered breathing and sleep apnea in middle-aged urban Indian men</article-title>. <source>Am. J. Respir. Crit. Care Med.</source> <volume>169</volume>, <fpage>168</fpage>&#x02013;<lpage>173</lpage>.<pub-id pub-id-type="doi">10.1164/rccm.200302-265OC</pub-id><pub-id pub-id-type="pmid">14604837</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vrhovec</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Evaluating the progress of the labour with sample entropy calculated from the uterine EMG activity</article-title>. <source>Elektrotehniski vestnik-Electrotechnical Rev.</source> <volume>76</volume>, <fpage>165</fpage>&#x02013;<lpage>170</lpage>.</citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ye</surname> <given-names>J.</given-names></name> <name><surname>Janardan</surname> <given-names>R.</given-names></name> <name><surname>Li</surname> <given-names>Q.</given-names></name></person-group> (<year>2004</year>). <article-title>Two-dimensional linear discriminant analysis</article-title>. <source>Adv. Neural. Inf. Process. Syst.</source> <volume>17</volume>, <fpage>1569</fpage>&#x02013;<lpage>1576</lpage>.</citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>T.</given-names></name> <name><surname>Evans</surname> <given-names>L.</given-names></name> <name><surname>Finn</surname> <given-names>L.</given-names></name> <name><surname>Palta</surname> <given-names>M.</given-names></name></person-group> (<year>1997</year>). <article-title>Estimation of the clinically diagnosed proportion of sleep apnea syndrome in middle-aged men and women</article-title>. <source>Sleep</source> <volume>20</volume>, <fpage>705</fpage>&#x02013;<lpage>706</lpage>.<pub-id pub-id-type="pmid">9406321</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>T.</given-names></name> <name><surname>Palta</surname> <given-names>M.</given-names></name> <name><surname>Dempsey</surname> <given-names>J.</given-names></name> <name><surname>Skatrud</surname> <given-names>J.</given-names></name> <name><surname>Weber</surname> <given-names>S.</given-names></name> <name><surname>Badr</surname> <given-names>S.</given-names></name></person-group> (<year>1993</year>). <article-title>The occurrence of sleep-disordered breathing among middle-aged adults</article-title>. <source>N. Engl. J. Med.</source> <volume>328</volume>, <fpage>1230</fpage>&#x02013;<lpage>1235</lpage>.<pub-id pub-id-type="doi">10.1056/NEJM199304293281704</pub-id><pub-id pub-id-type="pmid">8464434</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn1"><p><sup>1</sup>This study was approved by the NHS HRC National Research Ethics Service (NRES) South West REC Centre, Bristol, UK (REC reference SW/12/0211).</p></fn>
<fn id="fn2"><p><sup>2</sup>Markov chain Monte Carlo (MCMC) methods sample from a probability distribution by constructing a Markov chain (a memoryless stochastic process whose transition to the next state depends only on its current state) that has the desired distribution as its equilibrium distribution.</p></fn>
</fn-group>
</back>
</article>