Gender recognition from vocal source

Cited by: 16

Authors:

Sorokin, V. N. [1]

Makarov, I. S. [1]

Affiliations:

[1] Russian Acad Sci, Inst Informat Transmiss Problems, Moscow 101447, Russia

Source:

ACOUSTICAL PHYSICS | 2008 / Vol. 54 / No. 4

Keywords:

DOI:

10.1134/S1063771008040192

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

The efficiency of automatic recognition of male and female voices is studied, based on solving the inverse problem for the glottis area dynamics and for the waveform of the glottal airflow volume velocity pulse. The inverse problem is regularized by using analytical models of the voice excitation pulse and of the glottis area dynamics, together with a model of one-dimensional glottal airflow. Parameters of these models and spectral parameters of the volume velocity pulse are considered. The most promising parameters are found to be the instant of maximum glottis area, the maximum derivative of the area, the slope of the spectrum of the glottal airflow volume velocity pulse, the amplitude ratios of the harmonics of this spectrum, and the pitch. On the plane of the first two principal components in the space of these parameters, the classification error is almost halved relative to that obtained with the pitch alone. The recognition probability is 94.7% for male voices and 95.9% for female voices.
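
To make two of the spectral parameters named in the abstract concrete, the sketch below computes harmonic amplitude ratios and a spectral slope for a single period of a synthetic flow pulse. It is an illustration under stated assumptions, not the authors' method: the abstract does not give the analytical pulse model or the slope definition, so the half-sine pulse shape, the 16 kHz sample rate, the 125 Hz pitch, the 0.6 open fraction, and the dB-per-octave least-squares fit are all hypothetical choices.

```python
import cmath
import math


def glottal_pulse(n_samples, open_fraction=0.6):
    """One period of a toy flow pulse (assumed shape, not from the paper):
    a half-sine while the glottis is open, zero while it is closed."""
    n_open = int(n_samples * open_fraction)
    return [math.sin(math.pi * i / n_open) if i < n_open else 0.0
            for i in range(n_samples)]


def harmonic_amplitudes_db(period, n_harmonics):
    """Levels in dB of harmonics 1..n_harmonics of a periodic signal,
    taken from the DFT of exactly one period."""
    n = len(period)
    levels = []
    for k in range(1, n_harmonics + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(period))
        levels.append(20.0 * math.log10(abs(coeff) / n))
    return levels


def spectral_slope_db_per_octave(levels_db):
    """Least-squares slope of harmonic level (dB) against log2(harmonic number)."""
    xs = [math.log2(k) for k in range(1, len(levels_db) + 1)]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(levels_db) / len(levels_db)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, levels_db))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


sample_rate = 16000   # Hz, assumed
pitch_hz = 125.0      # Hz, assumed
period = glottal_pulse(int(sample_rate / pitch_hz))

levels = harmonic_amplitudes_db(period, n_harmonics=8)
h1_minus_h2 = levels[0] - levels[1]            # amplitude ratio of harmonics 1 and 2, in dB
slope = spectral_slope_db_per_octave(levels)   # overall spectral tilt

print(f"H1-H2: {h1_minus_h2:.1f} dB, slope: {slope:.1f} dB/octave")
```

In a real pipeline the pulse would come from inverse filtering of recorded speech rather than from a closed-form shape, and these values would be combined with the pitch and the area-based parameters before the principal component projection described in the abstract.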


Pages: 571-578

Page count: 8
