A new speech enhancement: Speech stream segregation

被引:0
|
作者
Okuno, HG
Nakatani, T
Kawabata, T
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech stream segregation is presented as a new speech enhancement for automatic speech recognition. Two issues an addressed: speech stream segregation from a mixture of sounds, and interfacing speech stream segregation with automatic speech recognition. Speech stream segregation is modeled as a process of extracting harmonic fragments, grouping these extracted harmonic fragments, and substituting non-harmonic residue for non-harmonic parts of groups. The main problem in interfacing speech stream segregation with HMM-based speech recognition is how to improve the degradation of recognition performance due to spectral distortion of segregated sounds, which is caused mainly by transfer function of a binaural input Our solution is to re-rain the parameters of HMM with training data binauralized for four directions. Experiments with 500 mixtures of two women's utterances of a word showed that the cumulative accuracy of word recognition up to the 10th candidate of each woman's utterance is, on average, 75%.
引用
收藏
页码:2356 / 2359
页数:4
相关论文
共 50 条
  • [1] Harmonic sound stream segregation using localization and its application to speech stream segregation
    Nakatani, T
    Okuno, HG
    [J]. SPEECH COMMUNICATION, 1999, 27 (3-4) : 209 - 222
  • [2] Testing the Ecological Validity of Infants' Speech Stream Segregation
    Bernier, Dana E.
    Soderstrom, Melanie
    [J]. CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2013, 67 (04): : 291 - 291
  • [3] Speech enhancement and segregation based on human auditory mechanisms
    Akagi, M
    Mizumachi, M
    Ishimoto, Y
    Unoki, M
    [J]. ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 186 - 196
  • [4] A NEW SPEECH ENHANCEMENT METHOD
    李冲泥
    胡光锐
    [J]. Journal of Shanghai Jiaotong University(Science), 1999, (01) : 64 - 66
  • [5] A new speech enhancement method
    Qin, LM
    Hu, GR
    Li, CN
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 92 - 94
  • [6] Visual Speech Enhancement Without A Real Visual Stream
    Hegde, Sindhu B.
    Prajwal, K. R.
    Mukhopadhyay, Rudrabha
    Namboodiri, Vinay
    Jawahar, C., V
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1925 - 1934
  • [7] A New Speech Enhancement Algorithm with Generalized Gamma Speech Model
    Zhao, Gaihua
    Zhou, Bin
    Zhang, Xiongwei
    Sui Lu-ying
    [J]. 2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,
  • [8] THE STREAM OF SPEECH
    REMEZ, RE
    RUBIN, PE
    [J]. SCANDINAVIAN JOURNAL OF PSYCHOLOGY, 1983, 24 (01) : 63 - 66
  • [9] Speech enhancement for bandlimited speech
    Heide, DA
    Kang, GS
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 393 - 396
  • [10] Effect of age and hearing loss on auditory stream segregation of speech sounds
    David, Marion
    Tausend, Alexis N.
    Strelcyk, Olaf
    Oxenham, Andrew J.
    [J]. HEARING RESEARCH, 2018, 364 : 118 - 128