SPEAKER NORMALIZATION OF STATIC AND DYNAMIC VOWEL SPECTRAL FEATURES

Cited by: 12

Authors:

ZAHORIAN, SA

JAGHARGHI, AJ

Affiliation:

[1] Department of Electrical and Computer Engineering, Old Dominion University, Norfolk, Virginia

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 1991 / Vol. 90 / No. 01

Keywords:

DOI:

10.1121/1.402350

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Two methods are described for speaker normalizing vowel spectral features: one is a multivariable linear transformation of the features and the other is a polynomial warping of the frequency scale. Both normalization algorithms minimize the mean-square error between the transformed data of each speaker and vowel target values obtained from a "typical speaker." These normalization techniques were evaluated both for formants and a form of cepstral coefficients (DCTCs) as spectral parameters, for both static and dynamic features, and with and without fundamental frequency (F0) as an additional feature. The normalizations were tested with a series of automatic classification experiments for vowels. For all conditions, automatic vowel classification rates increased for speaker-normalized data compared to rates obtained for nonnormalized parameters. Typical classification rates for vowel test data for nonnormalized and normalized features, respectively, are as follows: static formants, 69%/79%; formant trajectories, 76%/84%; static DCTCs, 75%/84%; DCTC trajectories, 84%/91%. The linear transformation methods increased the classification rates slightly more than the polynomial frequency warping. The addition of F0 improved the automatic recognition results for nonnormalized vowel spectral features by as much as 5.8%. However, the addition of F0 to speaker-normalized spectral features resulted in much smaller increases in automatic recognition rates.
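To make the two normalization ideas in the abstract concrete, the following is a minimal sketch in Python with NumPy. It is not the authors' implementation: this record does not give the parameterization of either method, so the affine form of the linear transformation, the polynomial degree, the function names, and the formant values below are all illustrative assumptions. Both fits minimize the mean-square error between one speaker's transformed features and a set of per-vowel target values.

```python
import numpy as np

def fit_linear_normalizer(features, targets):
    """Least-squares fit of an affine map so that [features, 1] @ W is close to targets.

    Assumption: the "multivariable linear transformation" is modeled here as a
    full matrix plus a bias term; the paper's exact form is not in this record.
    """
    X = np.hstack([features, np.ones((features.shape[0], 1))])
    W, *_ = np.linalg.lstsq(X, targets, rcond=None)
    return W  # shape: (n_features + 1, n_features)

def apply_linear_normalizer(W, features):
    """Apply the fitted affine map to a (n_tokens, n_features) array."""
    X = np.hstack([features, np.ones((features.shape[0], 1))])
    return X @ W

def fit_frequency_warp(freqs_hz, target_freqs_hz, degree=2):
    """Least-squares polynomial mapping a speaker's frequencies to target frequencies.

    Assumption: one polynomial is shared by all formants; degree 2 is arbitrary.
    """
    return np.polyfit(np.ravel(freqs_hz), np.ravel(target_freqs_hz), degree)

def apply_frequency_warp(coeffs, freqs_hz):
    """Evaluate the warping polynomial elementwise on an array of frequencies."""
    return np.polyval(coeffs, freqs_hz)

# Hypothetical (F1, F2) targets in Hz for five vowels of a "typical speaker".
# These numbers are placeholders chosen for illustration, not data from the paper.
targets = np.array([
    [270.0, 2290.0],
    [530.0, 1840.0],
    [730.0, 1090.0],
    [570.0, 840.0],
    [300.0, 870.0],
])

# Synthetic speaker whose formants are uniformly scaled and shifted.
speaker = 1.15 * targets + 20.0

W = fit_linear_normalizer(speaker, targets)
normalized = apply_linear_normalizer(W, speaker)

coeffs = fit_frequency_warp(speaker, targets, degree=2)
warped = apply_frequency_warp(coeffs, speaker)

# Mean-square error against the targets before and after each normalization.
print("MSE raw:   ", np.mean((speaker - targets) ** 2))
print("MSE linear:", np.mean((normalized - targets) ** 2))
print("MSE warped:", np.mean((warped - targets) ** 2))
```

Because the synthetic speaker differs from the targets by an exact scale and shift, both fits recover the targets almost perfectly here. With real speech data the residual error would be nonzero, and the transformation would be estimated on training tokens and then applied to held-out tokens before classification.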

Pages: 67-75

Number of pages: 9
