A new speech enhancement: Speech stream segregation

被引：0

作者：

Okuno, HG

Nakatani, T

Kawabata, T

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech stream segregation is presented as a new speech enhancement for automatic speech recognition. Two issues an addressed: speech stream segregation from a mixture of sounds, and interfacing speech stream segregation with automatic speech recognition. Speech stream segregation is modeled as a process of extracting harmonic fragments, grouping these extracted harmonic fragments, and substituting non-harmonic residue for non-harmonic parts of groups. The main problem in interfacing speech stream segregation with HMM-based speech recognition is how to improve the degradation of recognition performance due to spectral distortion of segregated sounds, which is caused mainly by transfer function of a binaural input Our solution is to re-rain the parameters of HMM with training data binauralized for four directions. Experiments with 500 mixtures of two women's utterances of a word showed that the cumulative accuracy of word recognition up to the 10th candidate of each woman's utterance is, on average, 75%.

引用

页码：2356 / 2359

页数：4

共 50 条

[1] Harmonic sound stream segregation using localization and its application to speech stream segregation
Nakatani, T
Okuno, HG
[J]. SPEECH COMMUNICATION, 1999, 27 (3-4) : 209 - 222
[2] Testing the Ecological Validity of Infants' Speech Stream Segregation
Bernier, Dana E.
Soderstrom, Melanie
[J]. CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2013, 67 (04): : 291 - 291
[3] Speech enhancement and segregation based on human auditory mechanisms
Akagi, M
Mizumachi, M
Ishimoto, Y
Unoki, M
[J]. ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 186 - 196
[4] A NEW SPEECH ENHANCEMENT METHOD
李冲泥
胡光锐
[J]. Journal of Shanghai Jiaotong University(Science), 1999, (01) : 64 - 66
[5] A new speech enhancement method
Qin, LM
Hu, GR
Li, CN
[J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 92 - 94
[6] Visual Speech Enhancement Without A Real Visual Stream
Hegde, Sindhu B.
Prajwal, K. R.
Mukhopadhyay, Rudrabha
Namboodiri, Vinay
Jawahar, C., V
[J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1925 - 1934
[7] A New Speech Enhancement Algorithm with Generalized Gamma Speech Model
Zhao, Gaihua
Zhou, Bin
Zhang, Xiongwei
Sui Lu-ying
[J]. 2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,
[8] THE STREAM OF SPEECH
REMEZ, RE
RUBIN, PE
[J]. SCANDINAVIAN JOURNAL OF PSYCHOLOGY, 1983, 24 (01) : 63 - 66
[9] Speech enhancement for bandlimited speech
Heide, DA
Kang, GS
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 393 - 396
[10] Effect of age and hearing loss on auditory stream segregation of speech sounds
David, Marion
Tausend, Alexis N.
Strelcyk, Olaf
Oxenham, Andrew J.
[J]. HEARING RESEARCH, 2018, 364 : 118 - 128

← 1 2 3 4 5 →