ORIGINAL RESEARCH article

Front. Digit. Health

Sec. Connected Health

Volume 7 - 2025 | doi: 10.3389/fdgth.2025.1610772

This article is part of the Research TopicAdvancing Vocal Biomarkers and Voice AI in Healthcare: Multidisciplinary Focus on Responsible and Effective Development and UseView all 9 articles

Influence of Recording Instrumentation on Measurements of Voice in Sentence Contexts: Use of Smartphones and Tablets

Provisionally accepted
  • 1University of Central Florida, Orlando, Florida, United States
  • 2Morsani College of Medicine, USF Health, Tampa, Florida, United States
  • 3University of South Florida, Tampa, United States

The final, formatted version of the article will be published soon.

analysis; acoustic analysis; mobile devices; cepstral analysis; spectral tilt background noise, frequency responseVoice recordings from 24 individuals representing a wide range of typical and disordered voices were analyzed. Two CAPE-V sentences were recorded via a head-and-torso model using (1) a research quality, clinical standard microphone/preamplifier/audio interface and (2) smartphones and tablets using their internal microphones and an attached external headset microphone.Mouth-to-microphone distances and environmental noise levels were controlled. Measures of fundamental frequency (F0) and spectral and cepstral measures of voice quality valid for use in sentence contexts were analyzed across recording conditions.Cepstral peak prominence (CPP) values were sensitive to microphone type, noise, and sentence type conditions. Nevertheless, strong linear relationships were observed across recording methods compared to the clinical standard. Measures of F0 obtained using autocorrelation correlated strongly across recording methods, whereas F0 measures obtained from the CPP (CPP F0) were highly variable and poorly correlated across recording methods and noise conditions.The L/H ratio (a measure of spectral tilt) was significantly affected by recording condition but Measurements of Voice in Sentence Contexts not background noise, and measures of L/H ratio were also observed to correlate strongly across recording methods and noise conditions.Current findings revealed that different recording methods can produce significantly different acoustic measures of voice with sentence-level materials. Since microphone characteristics (e.g., frequency response; use of noise cancellation), mouth-to-microphone distances, and background noise conditions can have significant effects on spectral and cepstral assessment of voice, it is essential that recording methods and conditions are explicitly described when designing voice data collection projects and comparing datasets as it may have an impact on voice analysis.Future investigations should evaluate consistency of results among multiple examples of the same device.

Keywords: microphone frequency response, Cepstral analysis, spectral analysis, sentences, F0, L/H ratio

Received: 12 Apr 2025; Accepted: 20 Jun 2025.

Copyright: © 2025 Awan, Bensoussan, Watts, Boyer, Budinsky and Bahr. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Ruth H Bahr, University of South Florida, Tampa, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.