Acoustic analysis and recognition of whispered speech

被引：0

作者：

Itoh, T ^{[1
]}

Takeda, K ^{[1
]}

Itakura, F ^{[1
]}

机构：

[1] Nagoya Univ, Ctr Integrated Acoust Informat Res, Nagoya, Aichi 4648603, Japan

来源：

ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS | 2001年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, acoustic properties and the recognition method of whispered speech are discussed. A whispered speech database that consists of whispered speech, normal speech and their corresponding facial video images of more than 6,000 sentences from 100 speakers was prepared. The comparison between whispered and normal utterances show that 1) the cepstrum distance between them is 4 dB for voiced and 2 dB for unvoiced phonemes, respectively, 2) the spectral tilt of the whispered speech is less sloped than the normal speech and 3) the frequency of the lower formants (below 1.5 kHz) is lower than that of the normal speech. Acoustic models (HMM) trained by the whispered speech database attain an accuracy of 60% in syllable recognition experiments. This accuracy can be improved to 63% when MLLR adaptation is applied, while the normal speech HMM adapted with the whispered speech attain only 56% syllable accuracy.

引用

页码：429 / 432

页数：4

共 50 条

[1] Acoustic analysis and recognition of whispered speech
Itoh, T
Takeda, K
Itakura, F
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 389 - 392
[2] Analysis and recognition of whispered speech
Ito, T
Takeda, K
Itakura, F
[J]. SPEECH COMMUNICATION, 2005, 45 (02) : 139 - 152
[3] Acoustic analysis of consonants in whispered speech
Jovicic, Slobodan T.
Saric, Zoran
[J]. JOURNAL OF VOICE, 2008, 22 (03) : 263 - 274
[4] ACOUSTIC ANALYSIS FOR SPEAKER IDENTIFICATION OF WHISPERED SPEECH
Fan, Xing
Hansen, John H. L.
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5046 - 5049
[5] Acoustic features of whispered speech
Jovicic, ST
Dordevic, MM
[J]. ACUSTICA, 1996, 82 : S228 - S228
[6] Acoustic Analysis of Whispered Speech for Phoneme and Speaker Dependency
Fan, Xing
Godin, Keith W.
Hansen, John H. L.
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 188 - 191
[7] Study on the Emotion Recognition of Whispered Speech
Jin, Yun
Zhao, Yan
Huang, Chengwei
Zhao, Li
[J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 242 - 246
[8] Tone Recognition of Chinese Whispered Speech
Gong Chenghui
Zhao Heming
[J]. PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 401 - +
[9] RECOGNITION OF WORD TONES IN WHISPERED SPEECH
JENSEN, MK
[J]. WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1958, 14 (2-3): : 187 - 196
[10] Maturation of Speech-in-Speech Recognition for Whispered and Voiced Speech
Buss, Emily
Miller, Margaret K.
Leibold, Lori J.
[J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (08): : 3117 - 3128

← 1 2 3 4 5 →