Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals

被引:10
|
作者
Lei, Zhengdong [1 ,3 ]
Kennedy, Evan [2 ]
Fasanella, Laura [1 ]
Li-Jessen, Nicole Yee-Key [2 ]
Mongeau, Luc [1 ]
机构
[1] McGill Univ, Dept Mech Engn, Montreal, PQ H3A 0G4, Canada
[2] McGill Univ, Sch Commun Sci & Disorders, Montreal, PQ H3A 0G4, Canada
[3] 845 Sherbrooke St West, Montreal, PQ H3A 0G4, Canada
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 07期
基金
加拿大健康研究院; 美国国家卫生研究院;
关键词
neck-surface vibration; machine learning; voice type discrimination; AUDITORY-PERCEPTUAL EVALUATION; QUALITY;
D O I
10.3390/app9071505
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs' reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy.
引用
收藏
页数:18
相关论文
共 7 条
  • [1] CLASSIFICATION OF VOICE MODES USING NECK-SURFACE ACCELEROMETER DATA
    Borsky, Michal
    Cocude, Marion
    Mehta, Daryush D.
    Zanartu, Matias
    Gudnason, Jon
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5060 - 5064
  • [2] Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders
    Cortes, Juan P.
    Lin, Jon Z.
    Marks, Katherine L.
    Espinoza, Victor M.
    Ibarra, Emiro J.
    Zanartu, Matias
    Hillman, Robert E.
    Mehta, Daryush D.
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [3] Classification of Voice Quality Using Neck-Surface Acceleration: Comparison With Glottal Flow and Radiated Sound
    Wlodarczak, Marcin
    Ludusan, Bogdan
    Sundberg, Johan
    Heldner, Mattias
    JOURNAL OF VOICE, 2025, 39 (01) : 10 - 24
  • [4] Improved Subglottal Pressure Estimation From Neck-Surface Vibration in Healthy Speakers Producing Non-Modal Phonation
    Lin, Jon Z.
    Espinoza, Victor M.
    Marks, Katherine L.
    Zanartu, Matias
    Mehta, Daryush D.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (02) : 449 - 460
  • [5] The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation
    Mehta, Daryush D.
    Espinoza, Victor M.
    Van Stan, Jarrad H.
    Zanartu, Matias
    Hillman, Robert E.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (05): : EL386 - EL392
  • [6] Characterization of Source-Filter Interactions in Vocal Vibrato Using a Neck-Surface Vibration Sensor: A Pilot Study
    Lester-Smith, Rosemary A.
    Derrick, Elaina
    Larson, Charles R.
    JOURNAL OF VOICE, 2024, 38 (01) : 1 - 9
  • [7] Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model
    Ibarra, Emiro J.
    Parra, Jesus A.
    Alzamendi, Gabriel A.
    Cortes, Juan P.
    Espinoza, Victor M.
    Mehta, Daryush D.
    Hillman, Robert E.
    Zanartu, Matias
    FRONTIERS IN PHYSIOLOGY, 2021, 12