Speech enhancement for bandlimited speech

被引:0
|
作者
Heide, DA [1 ]
Kang, GS [1 ]
机构
[1] USN, Res Lab, Washington, DC 20375 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Throughout the history of telecommunication, speech has rarely been transmitted with its full analog bandwidth (0 to 8 kHz or more) due to limitations in channel bandwidth. This impaired legacy continues with tactical voice communication. The passband of a voice terminal is typically 0 to 4 kHz. Hence, high-frequency speech components (4 to 8 kHz) are removed prior to transmission. As a result, speech intelligibility suffers, particularly for low-data-rate vocoders. In this paper, we describe our speech-processing technique, which permits some of the upperband speech components to be translated into the passband of the vocoder. According to our test results, speech intelligibility is improved by as much as three to four points even for the recently developed and excellent Department of Defense-standard Mixed Excitation Linear Predictor (MELP) 2.4 kb/s vocoder. Note that speech intelligibility is improved without expanding the transmission bandwidth or compromising interoperability with others.
引用
收藏
页码:393 / 396
页数:4
相关论文
共 50 条
  • [1] Wideband speech recovery from bandlimited speech using LP analysis/synthesis
    Yasukawa, H
    [J]. SIGNAL ANALYSIS & PREDICTION I, 1997, : 364 - 367
  • [2] IMPROVEMENT OF SPEECH RESIDUALS FOR SPEECH ENHANCEMENT
    Elshamy, Samy
    Fingscheidt, Tim
    [J]. 2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 219 - 223
  • [3] Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech
    Valentini-Botinhao, Cassia
    Yamagishi, Junichi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (08) : 1420 - 1433
  • [4] NETWORKS FOR SPEECH ENHANCEMENT AND AUTOMATIC SPEECH RECOGNITION
    Vu, Thanh T.
    Bigot, Benjamin
    Chng, Eng Siong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 499 - 503
  • [5] β-Masking MMSE Speech Enhancement for Speech Recognition
    You, Chang Huai
    Ma, Bin
    [J]. 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 341 - 345
  • [6] Speech enhancement based on transient speech information
    Yoo, S
    Boston, JR
    Durrant, JD
    Kovacyk, K
    Karn, S
    Shaiman, S
    El-Jaroudi, A
    Li, CC
    [J]. 2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 62 - 65
  • [7] A new speech enhancement: Speech stream segregation
    Okuno, HG
    Nakatani, T
    Kawabata, T
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2356 - 2359
  • [8] Speech Enhancement With Inventory Style Speech Resynthesis
    Xiao, X.
    Nickel, R. M.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1243 - 1257
  • [9] Speech enhancement using transient speech components
    Tantibundhit, C.
    Boston, J. R.
    Li, C. C.
    Durrant, J. D.
    Shaiman, S.
    Kovacyk, K.
    El-Jaroudi, A.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 833 - 836
  • [10] SPEECH ENHANCEMENT FOR TELEPHONY NAME SPEECH RECOGNITION
    You, Chang Huai
    Rahardja, Susanto
    Li, Haizhou
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 973 - 976