Automatic gender recognition in normal and pathological speech

被引:0
|
作者
Gomez-Garcia, J. A. [1 ]
Godino-Llorente, J., I [1 ]
Castellanos-Dominguez, G. [2 ]
机构
[1] Univ Politecn Madrid, Madrid, Spain
[2] Univ Nacl Colombia, Manizales, Colombia
关键词
gender recognition; pathological voice; inverse filtering; SUSTAINED VOWELS; VOICE; PARAMETERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of voice pathology automatic detection systems is gaining attention in the last few years for presenting advantages compared to traditional diagnosis methods. However, the performance of these systems is influenced by aspects related to the inter-speaker variability, and specially to the heterogeneity introduce by gender differences. To overcome that, a gender recognizer may be employed as a preprocessing stage in order to stratify the speakers and further adjust the detectors to the specific characteristics of each target group. Nevertheless, the reliability of gender recognizers on pathological speech has not been investigated. Having this in mind, the present paper studies the effectiveness of an automatic gender recognizer, based on mel frequency cepstral coefficients and gaussian mixture models, on normal and pathological speech. The analysis is carried out parameterizing the speech, the glottal waveform extracted from speech via inverse filtering, and a vocal tract model. The experiments were carried out using sustained vowels taken from the Saarbriicken and UPM voice disorders databases, and suggest that the gender might be effectively classified when using the proposed methodology. They also suggest that gender recognizers can be successfully employed as a preprocessing stage for a more accurate design of gender-dependent pathology detection systems.
引用
收藏
页码:1706 / 1710
页数:5
相关论文
共 50 条
  • [1] Gender domain adaptation for automatic speech recognition
    Sokolov, Artem
    Savchenko, Anclrey V.
    [J]. 2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 413 - 417
  • [2] Gender Independent Bangla Automatic Speech Recognition
    Hassan, Foyzul
    Kotwal, Mohammed Rokibul Alam
    Khan, Mohammad Saiful Alam
    Huda, Mohammad Nurul
    [J]. 2012 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2012, : 143 - 148
  • [3] AUTOMATIC SPEECH RECOGNITION FOR ACOUSTICAL ANALYSIS AND ASSESSMENT OF CANTONESE PATHOLOGICAL VOICE AND SPEECH
    Lee, Tan
    Liu, Yuanyuan
    Huang, Pei-Wen
    Chien, Jen-Tzung
    Lam, Wang Kong
    Yeung, Yu Ting
    Law, Thomas K. T.
    Lee, Kathy Y. S.
    Kong, Anthony Pak-Hin
    Law, Sam-Po
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6475 - 6479
  • [4] Study of Time and Frequency Variability in Pathological Speech and Error Reduction Methods for Automatic Speech Recognition
    Saz, Oscar
    Miguel, Antonio
    Lleida, Eduardo
    Ortega, Alfonso
    Buera, Luis
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 993 - 996
  • [5] SPEECH ERRORS IN NORMAL AND PATHOLOGICAL SPEECH
    TALO, ES
    [J]. FOLIA PHONIATRICA, 1976, 28 (4-5): : 297 - 297
  • [6] SPEECH ERRORS IN NORMAL AND PATHOLOGICAL SPEECH
    SODERPALM, EA
    [J]. FOLIA PHONIATRICA, 1980, 32 (03): : 243 - 243
  • [7] Automatic speech recognition
    O'Shaughnessy, Douglas
    [J]. 2015 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON), 2015, : 417 - 424
  • [8] AUTOMATIC SPEECH RECOGNITION
    IVALL, T
    [J]. ELECTRONICS & WIRELESS WORLD, 1984, 90 (1581): : 73 - 76
  • [9] AUTOMATIC RECOGNITION OF SPEECH
    MARILL, T
    [J]. IRE TRANSACTIONS ON HUMAN FACTORS IN ELECTRONICS, 1961, HFE2 (01): : 34 - +
  • [10] AUTOMATIC SPEECH RECOGNITION
    RAO, PVS
    PALIWAL, KK
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1986, 9 : 85 - 120