Browsing by Department "Centro Avanzado de Ingeniería Eléctrica y Electrónica - AC3E"
Now showing 1 - 19 of 19
- Results Per Page
- Sort Options
- Some of the metrics are blocked by yourconsent settings
Publication Acoustic coupling during incomplete glottal closure and its effect on the inverse filtering of oral airflow(2013-06-19); ;Julio C. Ho ;Daryush D. Mehta ;Robert E. HillmanGeorge R. WodickaInverse filtering of oral airflow using closed-phase linear prediction is expected to preserve the effects of source-filter interactions in the glottal airflow pulse. Under incomplete glottal closure, the glottal airflow estimation is more challenging due to a lowered glottal impedance, increased subglottal coupling, and violated all-pole assumption. To account for these effects, a model-based inverse filtering scheme allowing for coupling between glottis and upper and lower airways was developed. Acoustic transmission in the tracts used a frequency-domain transmission line. A linearized, time-varying expression was used for the glottal impedance, along with a dipole representation. Synthetic vowels sounds and actual recordings were used to evaluate the proposed scheme. Subject-specific model parameters were obtained from simultaneous aerodynamic, acoustic, and high-speed videoendoscopic recordings of normal subjects uttering vowels with various degrees of glottal closure. Results illustrated that, even under incomplete glottal closure, the airflow entering the vocal tract preserved source-filter interactions and was comparable to that obtained using closed-phase linear prediction. The scheme also yielded an uncoupled glottal airflow that exhibited a clear pulse de-skewing, making it proportional to the glottal area. Cases with larger glottal gaps exhibited lower mean impedances and less pulse skewing, with airflow estimates proportional to the transglottal pressure drop. - Some of the metrics are blocked by yourconsent settings
Publication An Euler–Bernoulli-type beam model of the vocal folds for describing curved and incomplete glottal closure patterns(2023-11-01) ;Serry, Mohamed A. ;Alzamendi, Gabriel A.; Peterson, Sean D.Incomplete glottal closure is a laryngeal configuration wherein the glottis is not fully obstructed prior to phonation. It has been linked to inefficient voice production and voice disorders. Various incomplete glottal closure patterns can arise and the mechanisms driving them are not well understood. In this work, we introduce an Euler-Bernoulli composite beam vocal fold (VF) model that produces qualitatively similar incomplete glottal closure patterns as those observed in experimental and high-fidelity numerical studies, thus offering insights into the potential underlying physical mechanisms. Refined physiological insights are pursued by incorporating the beam model into a VF posturing model that embeds the five intrinsic laryngeal muscles. Analysis of the combined model shows that co-activating the lateral cricoarytenoid (LCA) and interarytenoid (IA) muscles without activating the thyroarytenoid (TA) muscle results in a bowed (convex) VF geometry with closure at the posterior margin only; this is primarily attributed to the reactive moments at the anterior VF margin. This bowed pattern can also arise during VF compression (due to extrinsic laryngeal muscle activation for example), wherein the internal moment induced passively by the TA muscle tissue is the predominant mechanism. On the other hand, activating the TA muscle without incorporating other adductory muscles results in anterior and mid-membranous glottal closure, a concave VF geometry, and a posterior glottal opening driven by internal moments induced by TA muscle activation. In the case of initial full glottal closure, the posterior cricoarytenoid (PCA) muscle activation cancels the adductory effects of the LCA and IA muscles, resulting in a concave VF geometry and posterior glottal opening. Furthermore, certain maneuvers involving co-activation of all adductory muscles result in an hourglass glottal shape due to a reactive moment at the anterior VF margin and moderate internal moment induced by TA muscle activation. These findings have implications regarding potential laryngeal maneuvers in patients with voice disorders involving imbalances or excessive tension in the laryngeal muscles such as muscle tension dysphonia. - Some of the metrics are blocked by yourconsent settings
Publication An updated theoretical framework for vocal hyperfunction(2020-11-01) ;Hillman, Robert E. ;Stepp, Cara E. ;Van Stan, Jarrad H.; Mehta, Daryush D.Purpose The purpose of this viewpoint article is to facilitate research on vocal hyperfunction (VH). VH is implicated in the most commonly occurring types of voice disorders, but there remains a pressing need to increase our understanding of the etiological and pathophysiological mechanisms associated with VH to improve the prevention, diagnosis, and treatment of VH-related disorders. Method A comprehensive theoretical framework for VH is proposed based on an integration of prevailing clinical views and research evidence. Results The fundamental structure of the current framework is based on a previous (simplified) version that was published over 30 years ago (Hillman et al., 1989). A central premise of the framework is that there are two primary manifestations of VH—phonotraumatic VH and nonphonotraumatic VH—and that multiple factors contribute and interact in different ways to cause and maintain these two types of VH. Key hypotheses are presented about the way different factors may contribute to phonotraumatic VH and nonphonotraumatic VH and how the associated disorders may respond to treatment. Conclusions This updated and expanded framework is meant to help guide future research, particularly the design of longitudinal studies, which can lead to a refinement in knowledge about the etiology and pathophysiology of VH-related disorders. Such new knowledge should lead to further refinements in the framework and serve as a basis for improving the prevention and evidence-based clinical management of VH. - Some of the metrics are blocked by yourconsent settings
Publication Bayesian estimation of vocal function measures using laryngeal high-speed videoendoscopy and glottal airflow estimates: An in vivo case study(2020-05-01) ;Alzamendi, Gabriel A. ;Manríquez, Rodrigo ;Hadwin, Paul J. ;Deng, Jonathan J. ;Peterson, Sean D. ;Erath, Byron D. ;Mehta, Daryush D. ;Hillman, Robert E.This study introduces the in vivo application of a Bayesian framework to estimate subglottal pressure, laryngeal muscle activation, and vocal fold contact pressure from calibrated transnasal high-speed videoendoscopy and oral airflow data. A subject-specific, lumped-element vocal fold model is estimated using an extended Kalman filter and two observation models involving glottal area and glottal airflow. Model-based inferences using data from a vocally healthy male individual are compared with empirical estimates of subglottal pressure and reference values for muscle activation and contact pressure in the literature, thus providing baseline error metrics for future clinical investigations. - Some of the metrics are blocked by yourconsent settings
Publication Comprehensive, Population-Based Sensitivity Analysis of a Two-Mass Vocal Fold Model(2016-02-01) ;Robertson, Daniel; ;Cook, DouglasHoward NusbaumPrevious vocal fold modeling studies have generally focused on generating detailed data regarding a narrow subset of possible model configurations. These studies can be interpreted to be the investigation of a single subject under one or more vocal conditions. In this study, a broad population-based sensitivity analysis is employed to examine the behavior of a virtual population of subjects and to identify trends between virtual individuals as opposed to investigating a single subject or model instance. Four different sensitivity analysis techniques were used in accomplishing this task. Influential relationships between model input parameters and model outputs were identified, and an exploration of the model's parameter space was conducted. Results indicate that the behavior of the selected two-mass model is largely dominated by complex interactions, and that few input-output pairs have a consistent effect on the model. Results from the analysis can be used to increase the efficiency of optimization routines of reduced-order models used to investigate voice abnormalities. Results also demonstrate the types of challenges and difficulties to be expected when applying sensitivity analyses to more complex vocal fold models. Such challenges are discussed and recommendations are made for future studies. - Some of the metrics are blocked by yourconsent settings
Publication Direct measurement and modeling of intraglottal, subglottal, and vocal fold collision pressures during phonation in an individual with a hemilaryngectomy(2021-08-02) ;Mehta, Daryush D. ;Kobler, James B. ;Zeitels, Steven M.; ;Ibarra, Emiro J. ;Alzamendi, Gabriel A. ;Manriquez, Rodrigo ;Erath, Byron D. ;Peterson, Sean D. ;Petrillo, Robert H.Hillman, Robert E.The purpose of this paper is to report on the first in vivo application of a recently developed transoral, dual-sensor pressure probe that directly measures intraglottal, subglottal, and vocal fold collision pressures during phonation. Synchronous measurement of intraglottal and subglottal pressures was accomplished using two miniature pressure sensors mounted on the end of the probe and inserted transorally in a 78-year-old male who had previously undergone surgical removal of his right vocal fold for treatment of laryngeal cancer. The endoscopist used one hand to position the custom probe against the surgically medialized scar band that replaced the right vocal fold and used the other hand to position a transoral endoscope to record laryngeal high-speed videoendoscopy of the vibrating left vocal fold contacting the pressure probe. Visualization of the larynx during sustained phonation allowed the endoscopist to place the dual-sensor pressure probe such that the proximal sensor was positioned intraglottally and the distal sensor subglottally. The proximal pressure sensor was verified to be in the strike zone of vocal fold collision during phonation when the intraglottal pressure signal exhibited three characteristics: an impulsive peak at the start of the closed phase, a rounded peak during the open phase, and a minimum value around zero immediately preceding the impulsive peak of the subsequent phonatory cycle. Numerical voice production modeling was applied to validate model-based predictions of vocal fold collision pressure using kinematic vocal fold measures. The results successfully demonstrated feasibility of in vivo measurement of vocal fold collision pressure in an individual with a hemilaryngectomy, motivating ongoing data collection that is designed to aid in the development of vocal dose measures that incorporate vocal fold impact collision and stresses. - Some of the metrics are blocked by yourconsent settings
Publication Estimating vocal fold contact pressure from raw laryngeal high-speed videoendoscopy using a Hertz contact model(2019-06-01) ;Díaz-Cádiz, Manuel E. ;Peterson, Sean D. ;Galindo, Gabriel E. ;Espinoza, Víctor M. ;Motie-Shirazi, Mohsen ;Erath, Byron D.The development of trauma-induced lesions of the vocal folds (VFs) has been linked to a high collision pressure on the VF surface. However, there are no direct methods for the clinical assessment of VF collision, thus limiting the objective assessment of these disorders. In this study, we develop a video processing technique to directly quantify the mechanical impact of the VFs using solely laryngeal kinematic data. The technique is based on an edge tracking framework that estimates the kinematic sequence of each VF edge with a Kalman filter approach and a Hertzian impact model to predict the contact force during the collision. The proposed formulation overcomes several limitations of prior efforts since it uses a more relevant VF contact geometry, it does not require calibrated physical dimensions, it is normalized by the tissue properties, and it applies a correction factor for using a superior view only. The proposed approach is validated against numerical models, silicone vocal fold models, and prior studies. A case study with high-speed videoendoscopy recordings provides initial insights between the sound pressure level and contact pressure. Thus, the proposed method has a high potential in clinical practice and could also be adapted to operate with laryngeal stroboscopic systems. - Some of the metrics are blocked by yourconsent settings
Publication Estimation of auditory steady-state responses based on the averaging of independent EEG epochs(2019-01-01) ;Prado-Gutierrez, Pavel ;Martínez-Montes, Eduardo; ; Manuel S. MalmiercaThe amplitude of auditory steady-state responses (ASSRs) generated in the brainstem of rats exponentially decreases over the sequential averaging of EEG epochs. This behavior is partially due to the adaptation of the ASSR induced by the continuous and monotonous stimulation. In this study, we analyzed the potential clinical relevance of the ASSR adaptation. ASSR were elicited in eight anesthetized adult rats by 8-kHz tones, modulated in amplitude at 115 Hz. We called independent epochs to those EEG epochs acquired with sufficiently long inter-stimulus interval, so the ASSR contained in any given epoch is not affected by the previous stimulation. We tested whether the detection of ASSRs is improved when the response is computed by averaging independent EEG epochs, containing only unadapted auditory responses. The improvements in the ASSR detection obtained with standard, weighted and sorted averaging were compared. In the absence of artifacts, when the ASSR was elicited by continuous acoustic stimulation, the computation of the ASSR amplitude relied upon the averaging method. While the adaptive behavior of the ASSR was still evident after the weighting of epochs, the sorted averaging resulted in under-estimations of the ASSR amplitude. In the absence of artifacts, the ASSR amplitudes computed by averaging independent epochs did not depend on the averaging procedure. Averaging independent epochs resulted in higher ASSR amplitudes and halved the number of EEG epochs needed to be acquired to achieve the maximum detection rate of the ASSR. Acquisition protocols based on averaging independent EEG epochs, in combination with appropriate averaging methods for artifact reduction might contribute to develop more accurate hearing assessments based on ASSRs. - Some of the metrics are blocked by yourconsent settings
Publication Improved Subglottal Pressure Estimation from Neck-Surface Vibration in Healthy Speakers Producing Non-Modal Phonation(2020-02-01) ;Lin, Jon Z. ;Espinoza, Víctor M. ;Marks, Katherine L.; Mehta, Daryush D.Subglottal air pressure plays a major role in voice production and is a primary factor in controlling voice onset, offset, sound pressure level, glottal airflow, vocal fold collision pressures, and variations in fundamental frequency. Previous work has shown promise for the estimation of subglottal pressure from an unobtrusive miniature accelerometer sensor attached to the anterior base of the neck during typical modal voice production across multiple pitch and vowel contexts. This study expands on that work to incorporate additional accelerometer-based measures of vocal function to compensate for non-modal phonation characteristics and achieve an improved estimation of subglottal pressure. Subjects with normal voices repeated /p/-vowel syllable strings from loud-to-soft levels in multiple vowel contexts (/a/, /i/, and /u/), pitch conditions (comfortable, lower than comfortable, higher than comfortable), and voice quality types (modal, breathy, strained, and rough). Subject-specific, stepwise regression models were constructed using root-mean-square (RMS) values of the accelerometer signal alone (baseline condition) and in combination with cepstral peak prominence, fundamental frequency, and glottal airflow measures derived using subglottal impedance-based inverse filtering. Five-fold cross-validation assessed the robustness of model performance using the root-mean-square error metric for each regression model. Each cross-validation fold exhibited up to a 25% decrease in prediction error when the model incorporated multi-dimensional aspects of the accelerometer signal compared with RMS-only models. Improved estimation of subglottal pressure for non-modal phonation was thus achievable, lending to future studies of subglottal pressure estimation in patients with voice disorders and in ambulatory voice recordings. - Some of the metrics are blocked by yourconsent settings
Publication Modeling the influence of COVID-19 protective measures on the mechanics of phonation(2022-05-01) ;Deng, Jonathan J. ;Serry, Mohamed A.; ;Erath, Byron D.Peterson, Sean D.In an effort to mitigate the 2019 novel coronavirus disease pandemic, mask wearing and social distancing have become standard practices. While effective in fighting the spread of the virus, these protective measures have been shown to deteriorate speech perception and sound intensity, which necessitates speaking louder to compensate. The goal of this paper is to investigate via numerical simulations how compensating for mask wearing and social distancing affects measures associated with vocal health. A three-mass body-cover model of the vocal folds (VFs) coupled with the sub- and supraglottal acoustic tracts is modified to incorporate mask and distance dependent acoustic pressure models. The results indicate that sustaining target levels of intelligibility and/or sound intensity while using these protective measures may necessitate increased subglottal pressure, leading to higher VF collision and, thus, potentially inducing a state of vocal hyperfunction, a progenitor to voice pathologies. - Some of the metrics are blocked by yourconsent settings
Publication Nonlinearities in block-type reduced-order vocal fold models with asymmetric tissue properties(2013-06-19) ;Byron D. Erath ;David E. Sommer ;Sean D. PetersonModeling the vocal fold structure as a reduced-order system is an attractive approach for exploring the dynamics of both normal and pathological phonation. This approach has been used ubiquitously in scientific speech investigations due to its relatively high order of accuracy and low computational cost. In addition, good agreement can also be found between model and clinical data. In the case of pathological speech complex vocal fold dynamics may exist, exhibiting phenomenon such as bifurcation, and chaos. The ability to capture these features in reduced-order vocal fold models is a much celebrated feature. However, the question has arisen whether these nonlinearities arise due to the physics, or if they are merely an artifact of the model and its sensitivity to initial and boundary conditions. We explore the sensitivity of commonly-employed reduced-order vocal fold models to both contact mechanics, and the geometric prescription of the vocal fold model. Nonlinearities arising from asymmetric vocal fold tensioning are investigated. Nonlinearity in the vocal fold dynamics is identified by determining the predictive capability of linear and nonlinear Volterra-Weiner-Korenberg series. Nonlinearities in the vocal fold oscillations are shown to be highly-dependent upon model formulation and implementation, as opposed to physical features of speech. - Some of the metrics are blocked by yourconsent settings
Publication Real-time estimation of aerodynamic features for ambulatory voice biofeedback(2015-07-01) ;Llico, Andrés F.; ;González, Agustín J. ;Wodicka, George R. ;Mehta, Daryush D. ;Van Stan, Jarrad H.Hillman, Robert E.The development of ambulatory voice monitoring devices has the potential to improve the diagnosis and treatment of voice disorders. In this proof-of-concept study, real-time biofeedback is incorporated into a smartphone-based platform that records and processes neck surface acceleration. The focus is on utilizing aerodynamic measures of vocal function as a basis for biofeedback. This is done using regressed Z-scores to compare recorded values to normative estimates based on sound pressure level and fundamental frequency. Initial results from the analysis of different voice qualities suggest that accelerometer-based estimates of aerodynamic parameters can be used for real-time ambulatory biofeedback. - Some of the metrics are blocked by yourconsent settings
Publication Robust fundamental frequency estimation in sustained vowels: Detailed algorithmic comparisons and information fusion with adaptive Kalman filtering(2014-01-01) ;Tsanas, Athanasios; ;Little, Max A. ;Fox, Cynthia ;Ramig, Lorraine O.Clifford, Gari D.There has been consistent interest among speech signal processing researchers in the accurate estimation of the fundamental frequency (F0) of speech signals. This study examines ten F0 estimation algorithms (some well-established and some proposed more recently) to determine which of these algorithms is, on average, better able to estimate F0 in the sustained vowel /a/. Moreover, a robust method for adaptively weighting the estimates of individual F0 estimation algorithms based on quality and performance measures is proposed, using an adaptive Kalman filter (KF) framework. The accuracy of the algorithms is validated using (a) a database of 117 synthetic realistic phonations obtained using a sophisticated physiological model of speech production and (b) a database of 65 recordings of human phonations where the glottal cycles are calculated from electroglottograph signals. On average, the sawtooth waveform inspired pitch estimator and the nearly defect-free algorithms provided the best individual F0 estimates, and the proposed KF approach resulted in a ∼16% improvement in accuracy over the best single F0 estimation algorithm. These findings may be useful in speech signal processing applications where sustained vowels are used to assess vocal quality, when very accurate F0 estimation is required. - Some of the metrics are blocked by yourconsent settings
Publication The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation(2019-05-01) ;Mehta, Daryush D. ;Espinoza, Víctor M. ;Van Stan, Jarrad H.; Hillman, Robert E.Miniature high-bandwidth accelerometers on the anterior neck surface are used in laboratory and ambulatory settings to obtain vocal function measures. This study compared the widely applied L1–L2 measure (historically, H1–H2)—the difference between the log-magnitude of the first and second harmonics—computed from the glottal airflow waveform with L1–L2 derived from the raw neck-surface acceleration signal in 79 vocally healthy female speakers. Results showed a significant correlation (r = 0.72) between L1–L2 values estimated from both airflow and accelerometer signals, suggesting that raw accelerometer-based estimates of L1–L2 may be interpreted as reflecting glottal physiological parameters and voice quality attributes during phonation. - Some of the metrics are blocked by yourconsent settings
Publication The impact of glottal area discontinuities on block-type vocal fold models with asymmetric tissue properties(2013-03-01) ;Sommer, David E. ;Erath, Byron D.; Peterson, Sean D.Block-type lumped-element models of the vocal folds are widely used for speech investigations due in part to the rich dynamics exhibited over a range of input parameters, particularly for asymmetric tissue properties. While self-consistent in derivation and application, block-type models of the vocal fold masses are inherently susceptible to non-physical aerodynamic loading conditions when vocal fold motion is highly asymmetric. A standard block-type model is compared against two modified models that disallow the non-physiological loading condition. These minor modifications toward a more physiologically relevant aerodynamic model alter the specific vibration regimes and prevalence of chaos, though bifurcations still exist. - Some of the metrics are blocked by yourconsent settings
Publication Toward development of a vocal fold contact pressure probe: Bench-Top validation of a Dual-Sensor Probe using excised human larynx models(2019-10-01) ;Mehta, Daryush D. ;Kobler, James B. ;Zeitels, Steven M.; ;Erath, Byron D. ;Motie-Shirazi, Mohsen ;Peterson, Sean D. ;Petrillo, Robert H.Hillman, Robert E.A critical element in understanding voice production mechanisms is the characterization of vocal fold collision, which is widely considered a primary etiological factor in the development of common phonotraumatic lesions such as nodules and polyps. This paper describes the development of a transoral, dual-sensor intraglottal/subglottal pressure probe for the simultaneous measurement of vocal fold collision and subglottal pressures during phonation using two miniature sensors positioned 7.6 mm apart at the distal end of a rigid cannula. Proof-of-concept testing was performed using excised whole-mount and hemilarynx human tissue aerodynamically driven into self-sustained oscillation, with systematic variation of the superior–inferior positioning of the vocal fold collision sensor. In the hemilarynx experiment, signals from the pressure sensors were synchronized with an acoustic microphone, a tracheal-surface accelerometer, and two high-speed video cameras recording at 4000 frames per second for top–down and en face imaging of the superior and medial vocal fold surfaces, respectively. As expected, the intraglottal pressure signal exhibited an impulse-like peak when vocal fold contact occurred, followed by a broader peak associated with intraglottal pressure build-up during the de-contacting phase. As subglottal pressure was increased, the peak amplitude of the collision pressure increased and typically reached a value below that of the average subglottal pressure. Results provide important baseline vocal fold collision pressure data with which computational models of voice production can be developed and in vivo measurements can be referenced. - Some of the metrics are blocked by yourconsent settings
Publication Toward development of a vocal fold contact pressure probe: Sensor characterization and validation using synthetic vocal fold models(MDPI AG, 2019-01-01) ;Motie-Shirazi, Mohsen; ;Peterson, Sean D. ;Mehta, Daryush D. ;Kobler, James B. ;Hillman, Robert E.Erath, Byron D.Excessive vocal fold collision pressures during phonation are considered to play a primary role in the formation of benign vocal fold lesions, such as nodules. The ability to accurately and reliably acquire intraglottal pressure has the potential to provide unique insights into the pathophysiology of phonotrauma. Difficulties arise, however, in directly measuring vocal fold contact pressures due to physical intrusion from the sensor that may disrupt the contact mechanics, as well as difficulty in determining probe/sensor position relative to the contact location. These issues are quantified and addressed through the implementation of a novel approach for identifying the timing and location of vocal fold contact, and measuring intraglottal and vocal fold contact pressures via a pressure probe embedded in the wall of a hemi-laryngeal flow facility. The accuracy and sensitivity of the pressure measurements are validated against ground truth values. Application to in vivo approaches are assessed by acquiring intraglottal and VF contact pressures using a synthetic, self-oscillating vocal fold model in a hemi-laryngeal configuration, where the sensitivity of the measured intraglottal and vocal fold contact pressure relative to the sensor position is explored. - Some of the metrics are blocked by yourconsent settings
Publication Towards a Corpus (and Language)-Independent Screening of Parkinson’s Disease from Voice and Speech through Domain Adaptation(MDPI AG, 2023-11-01) ;Ibarra, Emiro J. ;Arias-Londoño, Julián D.; Godino-Llorente, Juan I.End-to-end deep learning models have shown promising results for the automatic screening of Parkinson’s disease by voice and speech. However, these models often suffer degradation in their performance when applied to scenarios involving multiple corpora. In addition, they also show corpus-dependent clusterings. These facts indicate a lack of generalisation or the presence of certain shortcuts in the decision, and also suggest the need for developing new corpus-independent models. In this respect, this work explores the use of domain adversarial training as a viable strategy to develop models that retain their discriminative capacity to detect Parkinson’s disease across diverse datasets. The paper presents three deep learning architectures and their domain adversarial counterparts. The models were evaluated with sustained vowels and diadochokinetic recordings extracted from four corpora with different demographics, dialects or languages, and recording conditions. The results showed that the space distribution of the embedding features extracted by the domain adversarial networks exhibits a higher intra-class cohesion. This behaviour is supported by a decrease in the variability and inter-domain divergence computed within each class. The findings suggest that domain adversarial networks are able to learn the common characteristics present in Parkinsonian voice and speech, which are supposed to be corpus, and consequently, language independent. Overall, this effort provides evidence that domain adaptation techniques refine the existing end-to-end deep learning approaches for Parkinson’s disease detection from voice and speech, achieving more generalizable models. - Some of the metrics are blocked by yourconsent settings
Publication Using ambulatory voice monitoring to investigate common voice disorders: Research update(Frontiers Media SA, 2015-01-01) ;Mehta, Daryush D. ;Van Stan, Jarrad H.; ;Ghassemi, Marzyeh ;Guttag, John V. ;Espinoza, Víctor M. ;Cortés, Juan P. ;Cheyne, Harold A.Hillman, Robert E.Many common voice disorders are chronic or recurring conditions that are likely to result from inefficient and/or abusive patterns of vocal behavior, referred to as vocal hyperfunction. The clinical management of hyperfunctional voice disorders would be greatly enhanced by the ability to monitor and quantify detrimental vocal behaviors during an individual’s activities of daily life. This paper provides an update on ongoing work that uses a miniature accelerometer on the neck surface below the larynx to collect a large set of ambulatory data on patients with hyperfunctional voice disorders (before and after treatment) and matched-control subjects. Three types of analysis approaches are being employed in an effort to identify the best set of measures for differentiating among hyperfunctional and normal patterns of vocal behavior: (1) ambulatory measures of voice use that include vocal dose and voice quality correlates, (2) aerodynamic measures based on glottal airflow estimates extracted from the accelerometer signal using subject-specific vocal system models, and (3) classification based on machine learning and pattern recognition approaches that have been used successfully in analyzing long-term recordings of other physiological signals. Preliminary results demonstrate the potential for ambulatory voice monitoring to improve the diagnosis and treatment of common hyperfunctional voice disorders.