Technology Report ARTICLE
The Harvard Automated Processing Pipeline for Electroencephalography (HAPPE): standardized processing software for developmental and high-artifact data
- 1Division of Developmental Medicine, Boston Children's Hospital, Harvard University, United States
- 2Department of Neurology, Boston Children's Hospital, Harvard University, United States
Electroenchephalography (EEG) recordings collected with developmental populations present particular challenges from a data processing perspective. These EEGs have a high degree of artifact contamination and often short recording lengths. As both sample sizes and EEG channel densities increase, traditional processing approaches like manual data rejection are becoming unsustainable. Moreover, such subjective approaches preclude standardized metrics of data quality, despite the heightened importance of such measures for EEGs with high rates of initial artifact contamination. There is presently a paucity of automated resources for processing these EEG data and no consistent reporting of data quality measures. To address these challenges, we propose the Harvard Automated Processing Pipeline for EEG (HAPPE) as a standardized, automated pipeline compatible with EEG recordings of variable lengths and artifact contamination levels, including high-artifact and short EEG recordings from young children or those with neurodevelopmental disorders. HAPPE processes event-related and resting-state EEG data from raw files through a series of filtering, artifact rejection, and re-referencing steps to processed EEG suitable for time-frequency-domain analyses. HAPPE also includes a post-processing report of data quality metrics to facilitate the evaluation and reporting of data quality in a standardized manner. Here, we describe each processing step in HAPPE, perform an example analysis with EEG files we have made freely available, and show that HAPPE outperforms seven alternative, widely-used processing approaches. HAPPE removes more artifact than all alternative approaches while simultaneously preserving greater or equivalent amounts of EEG signal in almost all instances. We also provide distributions of HAPPE’s data quality metrics in an 867 file dataset as a reference distribution and in support of HAPPE’s performance across EEG data with variable artifact contamination and recording lengths. HAPPE software is freely available under the terms of the GNU General Public License at https://github.com/lcnhappe/happe.
Keywords: EEG, Electroencephalography, automated, pipeline, artifact removal, data quality, EEG processing, development
Received: 02 Oct 2017;
Accepted: 06 Feb 2018.
Edited by:Arnaud Delorme, UMR5549 Centre de Recherche Cerveau et Cognition (CerCo), France
Reviewed by:Andras Eke, Semmelweis University, Hungary
Camillo Porcaro, Istituto di Scienze e Tecnologie della Cognizione (ISTC) - CNR, Italy
Fabrizio De Carli, Istituto di Bioimmagini e Fisiologia Molecolare (CNR), Italy
Andrey R. Nikolaev, KU Leuven, Belgium
Copyright: © 2018 Gabard-Durnam, Mendez Leal, Wilkinson and Levin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Dr. Laurel J. Gabard-Durnam, Boston Children's Hospital, Harvard University, Division of Developmental Medicine, 1 Autumn Street, Boston, 02115, MA, United States, firstname.lastname@example.org