Speech enhancement using transient speech components

被引:0
|
作者
Tantibundhit, C. [1 ]
Boston, J. R. [1 ]
Li, C. C. [1 ]
Durrant, J. D. [1 ]
Shaiman, S. [1 ]
Kovacyk, K. [1 ]
El-Jaroudi, A. [1 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes an algorithm to decompose speech into tonal, transient, and residual components. The algorithm uses an MDCT-based hidden Markov chain model to isolate the tonal component and a wavelet-based hidden Markov tree model to isolate the transient component. We suggest that the auditory system, like the visual system, is probably sensitive to abrupt stimulus changes and that the transient component in speech may be particularly critical to speech perception. To test this suggestion, the transient component isolated by our algorithm was selectively amplified and recombined with the original speech to generate enhanced speech, with energy adjusted to be equal to the energy of the original speech. The intelligibility of the original and enhanced speech was evaluated in eleven human subjects by the modified rhyme protocol. The word recognition rates show that the enhanced speech can provide substantial improvement in speech intelligibility at low SNR levels (8% at -15 dB, 14% at -20dB, and 18% at -25 dB).
引用
收藏
页码:833 / 836
页数:4
相关论文
共 50 条
  • [1] Speech Enhancement using Transient Components in Frequency Domain
    Rezvani, Mohsen
    Kahaei, Mohammad Hossein
    [J]. 2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 241 - 244
  • [2] Speech enhancement based on transient speech information
    Yoo, S
    Boston, JR
    Durrant, JD
    Kovacyk, K
    Karn, S
    Shaiman, S
    El-Jaroudi, A
    Li, CC
    [J]. 2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 62 - 65
  • [3] SPEECH ENHANCEMENT IN TRANSIENT NOISE ENVIRONMENT USING DIFFUSION FILTERING
    Talmon, Ronen
    Cohen, Israel
    Gannot, Sharon
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4782 - 4785
  • [4] Multisensory speech enhancement using lower-frequency components from bone-conducted speech
    Rahman, M. Shahidur
    Saha, Atanu
    Shimamura, Tetsuya
    [J]. IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2019, 14 (11) : 1661 - 1666
  • [5] Robust distributed speech recognition using speech enhancement
    Flynn, Ronan
    Jones, Edward
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) : 1267 - 1273
  • [6] Robust recognition of noisy speech using speech enhancement
    Xu, YF
    Zhang, JJ
    Yao, KS
    Cao, ZG
    Ma, ZX
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 734 - 737
  • [7] Speech enhancement based on the decomposition of speech into deterministic and stochastic components and psychoacoustic model
    Jo, Seokhwan
    Yoo, Chang D.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 897 - +
  • [8] Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
    Tantibundhit, Charturong
    Pernkopf, Franz
    Kubin, Gernot
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1417 - 1428
  • [9] SINGLE-CHANNEL SPEECH ENHANCEMENT IN A TRANSIENT NOISE ENVIRONMENT BY EXPLOITING SPEECH HARMONICITY
    Wu, Kai
    Reju, V. G.
    Khong, Andy W. H.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5088 - 5092
  • [10] Using Deep Speech Recognition to Evaluate Speech Enhancement Methods
    Siddiqui, Shamoon
    Rasool, Ghulam
    Ramachandran, Ravi P.
    Bouaynaya, Nidhal C.
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,