Acoustic analysis and recognition of whispered speech

被引:0
|
作者
Itoh, T [1 ]
Takeda, K [1 ]
Itakura, F [1 ]
机构
[1] Nagoya Univ, Ctr Integrated Acoust Informat Res, Nagoya, Aichi 4648603, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, acoustic properties and the recognition method of whispered speech are discussed. A whispered speech database that consists of whispered speech, normal speech and their corresponding facial video images of more than 6,000 sentences from 100 speakers was prepared. The comparison between whispered and normal utterances show that 1) the cepstrum distance between them is 4 dB for voiced and 2 dB for unvoiced phonemes, respectively, 2) the spectral tilt of the whispered speech is less sloped than the normal speech and 3) the frequency of the lower formants (below 1.5 kHz) is lower than that of the normal speech. Acoustic models (HMM) trained by the whispered speech database attain an accuracy of 60% in syllable recognition experiments. This accuracy can be improved to 63% when MLLR adaptation is applied, while the normal speech HMM adapted with the whispered speech attain only 56% syllable accuracy.
引用
收藏
页码:429 / 432
页数:4
相关论文
共 50 条
  • [1] Acoustic analysis and recognition of whispered speech
    Itoh, T
    Takeda, K
    Itakura, F
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 389 - 392
  • [2] Analysis and recognition of whispered speech
    Ito, T
    Takeda, K
    Itakura, F
    [J]. SPEECH COMMUNICATION, 2005, 45 (02) : 139 - 152
  • [3] Acoustic analysis of consonants in whispered speech
    Jovicic, Slobodan T.
    Saric, Zoran
    [J]. JOURNAL OF VOICE, 2008, 22 (03) : 263 - 274
  • [4] ACOUSTIC ANALYSIS FOR SPEAKER IDENTIFICATION OF WHISPERED SPEECH
    Fan, Xing
    Hansen, John H. L.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5046 - 5049
  • [5] Acoustic features of whispered speech
    Jovicic, ST
    Dordevic, MM
    [J]. ACUSTICA, 1996, 82 : S228 - S228
  • [6] Acoustic Analysis of Whispered Speech for Phoneme and Speaker Dependency
    Fan, Xing
    Godin, Keith W.
    Hansen, John H. L.
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 188 - 191
  • [7] Study on the Emotion Recognition of Whispered Speech
    Jin, Yun
    Zhao, Yan
    Huang, Chengwei
    Zhao, Li
    [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 242 - 246
  • [8] Tone Recognition of Chinese Whispered Speech
    Gong Chenghui
    Zhao Heming
    [J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 401 - +
  • [9] RECOGNITION OF WORD TONES IN WHISPERED SPEECH
    JENSEN, MK
    [J]. WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1958, 14 (2-3): : 187 - 196
  • [10] Maturation of Speech-in-Speech Recognition for Whispered and Voiced Speech
    Buss, Emily
    Miller, Margaret K.
    Leibold, Lori J.
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (08): : 3117 - 3128