SPEAKER NORMALIZATION OF STATIC AND DYNAMIC VOWEL SPECTRAL FEATURES

被引:12
|
作者
ZAHORIAN, SA
JAGHARGHI, AJ
机构
[1] Department of Electrical and Computer Engineering, Old Dominion University, Norfolk, Virginia
来源
关键词
D O I
10.1121/1.402350
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Two methods are described for speaker normalizing vowel spectral features: one is a multivariable linear transformation of the features and the other is a polynomial warping of the frequency scale. Both normalization algorithms minimize the mean-square error between the transformed data of each speaker and vowel target values obtained from a "typical speaker." These normalization techniques were evaluated both for formants and a form of cepstral coefficients (DCTCs) as spectral parameters, for both static and dynamic features, and with and without fundamental frequency (FO) as an additional feature. The normalizations were tested with a series of automatic classification experiments for vowels. For all conditions, automatic vowel classification rates increased for speaker-normalized data compared to rates obtained for nonnormalized parameters. Typical classification rates for vowel test data for nonnormalized and normalized features respectively are as follows: static formants-69%/79%; formant trajectories -76%/84%; static DCTCs 75%/84%; DCTC trajectories-84%/91%. The linear transformation methods increased the classification rates slightly more than the polynomial frequency warping. The addition of F0 improved the automatic recognition results for nonnormalized vowel spectral features as much as 5.8%. However, the addition of F0 to speaker-normalized spectral features resulted in much smaller increases in automatic recognition rates.
引用
收藏
页码:67 / 75
页数:9
相关论文
共 50 条
  • [1] Speaker normalization for Chinese vowel recognition in cochlear implants
    Luo, X
    Fu, QH
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2005, 52 (07) : 1358 - 1361
  • [2] A Bayesian Approach to Speaker Normalization Using Vowel Formant Frequency
    Ram, Dhananjay
    Kundu, Debasis
    Hegde, Rajesh M.
    [J]. 2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
  • [3] Speaker Dependent Changes in Formants Based on Normalization of Vowel Triangle
    Stanek, Miroslav
    Sigmund, Milan
    [J]. 2013 23RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2013, : 329 - 333
  • [4] VOWEL NORMALIZATION BY FREQUENCY WARPED SPECTRAL MATCHING
    MATSUMOTO, H
    WAKITA, H
    [J]. SPEECH COMMUNICATION, 1986, 5 (02) : 239 - 251
  • [5] Assamese Dialect Identification Using Static and Dynamic Features from Vowel
    Das, Hem Chandra
    Bhattacharjee, Utpal
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (02) : 306 - 321
  • [6] Normalization of modulation features for speaker recognition
    Thiruvaran, Tharmarajah
    Ambikairajah, Eliathamby
    Epps, Julien
    [J]. PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 599 - +
  • [7] Vowel identification by cochlear implant users: Contributions of static and dynamic spectral cues
    Donaldson, Gail S.
    Rogers, Catherine L.
    Cardenas, Emily S.
    Russell, Benjamin A.
    Hanna, Nada H.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (04): : 3021 - 3028
  • [8] Vowel normalization and the perception of speaker changes: An exploration of the contextual tuning hypothesis
    Barreda, Santiago
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (05): : 3453 - 3464
  • [9] EFFECTS OF VARIOUS TYPES OF SPEAKER NORMALIZATION ON AN AUTOMATIC VOWEL RECOGNITION SCHEME
    SROKA, SA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S63 - S63
  • [10] SOME RESULTS FROM NORMALIZATION OF SPEAKER DIFFERENCES IN A MECHANICAL VOWEL RECOGNIZER
    HEMDAL, JF
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (06): : 1594 - &