Gender recognition from vocal source

被引:16
|
作者
Sorokin, V. N. [1 ]
Makarov, I. S. [1 ]
机构
[1] Russian Acad Sci, Inst Informat Transmiss Problems, Moscow 101447, Russia
关键词
D O I
10.1134/S1063771008040192
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Efficiency of automatic recognition of male and female voices based on solving the inverse problem for glottis area dynamics and for waveform of the glottal airflow volume velocity pulse is studied. The inverse problem is regularized through the use of analytical models of the voice excitation pulse and of the dynamics of the glottis area, as well as the model of one-dimensional glottal airflow. Parameters of these models and spectral parameters of the volume velocity pulse are considered. The following parameters are found to be most promising: the instant of maximum glottis area, the maximum derivative of the area, the slope of the spectrum of the glottal airflow volume velocity pulse, the amplitude ratios of harmonics of this spectrum, and the pitch. On the plane of the first two main components in the space of these parameters, an almost twofold decrease in the classification error relative to that for the pitch alone is attained. The male voice recognition probability is found to be 94.7%, and the female voice recognition probability is 95.9%.
引用
收藏
页码:571 / 578
页数:8
相关论文
共 50 条
  • [1] Gender recognition from vocal source
    V. N. Sorokin
    I. S. Makarov
    Acoustical Physics, 2008, 54 : 571 - 578
  • [2] Gender Differences in the Recognition of Vocal Emotions
    Lausen, Adi
    Schacht, Annekathrin
    FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [3] Vocal Source Contribution to Speaker Recognition
    Sorokin V.N.
    Sorokin, V.N. (vns@iitp.ru), 2018, Pleiades journals (28) : 546 - 556
  • [4] Speaker recognition using vocal source model
    Sorokin V.N.
    Tananykin A.A.
    Trunov V.G.
    Pattern Recognition and Image Analysis, 2014, 24 (1) : 156 - 173
  • [5] Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features
    Wang, Ning
    Ching, P. C.
    Zheng, Nengheng
    Lee, Tan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 196 - 205
  • [6] Robust speaker recognition using both vocal source and vocal tract features estimated from noisy input utterances
    Wang, Ning
    Ching, P. C.
    Zheng, N. H.
    Lee, Tan
    2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 886 - 891
  • [7] RECOGNITION OF EMOTION FROM VOCAL CUES
    JOHNSON, WF
    EMDE, RN
    SCHERER, KR
    KLINNERT, MD
    ARCHIVES OF GENERAL PSYCHIATRY, 1986, 43 (03) : 280 - 283
  • [8] EFFECTS OF PSEUDO-PERIODICITY OF VOCAL SOURCE PARAMETERS ON VOWEL RECOGNITION
    KAKUSHO, O
    HIRATO, N
    MACHIDA, F
    KATO, K
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1965, 48 (12): : 76 - &
  • [9] Using Haar transformed vocal source information for automatic speaker recognition
    Zheng, NH
    Ching, PC
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 77 - 80
  • [10] Recognition of Vocal Emotions from Acoustic Profile
    Asawa, Krishna
    Verma, Vikrant
    Agrawal, Ankit
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 710 - 716