Post-filtering Technique Using Band Importance Function for Speech Intelligibility Enhancement

Cited by: 1
Authors
Lai, Ying-Hui [1]
Tang, Shih-Tsang [2]
Li, Pei-Chun [3]
Affiliations
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[2] Ming Chuan Univ, Dept Biomed Engn, Taoyuan, Taiwan
[3] Mackay Med Coll, Dept Audiol & Speech Language Pathol, New Taipei, Taiwan
Keywords
GMAPA algorithm; intelligibility-oriented speech enhancement; spectral restoration; COMPRESSION;
DOI
10.1109/BigMM.2016.90
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Conventional speech enhancement (SE) algorithms are mainly designed with the aim of improving signal-to-noise levels of noisy speech signals. However, many applications consider the enhancement of speech intelligibility as the goal for an SE system. In this study, we propose a maximum speech intelligibility (MSI) post-filter that aims to enhance the intelligibility of processed speech signals. The MSI post-filter is designed to specify a weight for each frequency band of the speech signal based on the critical band importance function. To evaluate the MSI post-filter, we combine it with a recently proposed generalized maximum a posteriori spectral amplitude estimation (GMAPA) SE algorithm. In previous studies, it has been verified that GMAPA outperforms several well-known spectral restoration approaches in terms of objective evaluations and speech recognition tests. Experimental results from the present study confirm that GMAPA also provides better results in a set of subjective intelligibility tests conducted with human subjects. Moreover, the integration of GMAPA and MSI can further improve the intelligibility scores over GMAPA alone under 10 dB to 5 dB signal-to-noise ratio conditions.
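The abstract describes the MSI post-filter only as assigning a weight to each frequency band according to a band importance function. The sketch below illustrates that general idea and is not the authors' implementation: the band edges, the importance values, the normalization by mean importance, and the function names `band_gains` and `apply_post_filter` are all hypothetical choices made for this example.

```python
from bisect import bisect_right


def band_gains(bin_freqs_hz, band_edges_hz, band_importance):
    """Return one gain per frequency bin, taken from the importance of its band.

    Gains are normalized by the mean importance so that a flat importance
    function leaves the spectrum unchanged (an assumption of this sketch).
    """
    if len(band_edges_hz) != len(band_importance) + 1:
        raise ValueError("need exactly one more band edge than importance values")
    mean_importance = sum(band_importance) / len(band_importance)
    gains = []
    for freq in bin_freqs_hz:
        # Index of the band whose [lower, upper) interval contains freq.
        idx = bisect_right(band_edges_hz, freq) - 1
        if 0 <= idx < len(band_importance):
            gains.append(band_importance[idx] / mean_importance)
        else:
            gains.append(1.0)  # outside every band: leave the bin unchanged
    return gains


def apply_post_filter(magnitudes, gains):
    """Scale each enhanced spectral magnitude by its band-derived gain."""
    return [m * g for m, g in zip(magnitudes, gains)]


# Hypothetical three-band layout and importance values, for illustration only.
EDGES_HZ = [0.0, 1000.0, 2000.0, 4000.0]
IMPORTANCE = [0.2, 0.5, 0.3]

gains = band_gains([500.0, 1500.0, 3000.0, 5000.0], EDGES_HZ, IMPORTANCE)
filtered = apply_post_filter([1.0, 2.0, 1.0, 1.0], gains)
```

In a real system the magnitudes would come from the output of the speech enhancement stage (GMAPA in the paper), and the importance values would come from a published band importance table rather than the placeholders used here.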
Pages: 487 - 491
Number of pages: 5
Related papers
50 in total
  • [1] COMPARISON OF POST-FILTERING METHODS FOR INTELLIGIBILITY ENHANCEMENT OF TELEPHONE SPEECH
    Jokinen, Emma
    Alku, Paavo
    Vainio, Martti
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2333 - 2337
  • [2] Utilization of the Lombard effect in post-filtering for intelligibility enhancement of telephone speech
    Jokinen, Emma
    Alku, Paavo
    Vainio, Martti
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 590 - 593
  • [3] Frequency-adaptive post-filtering for intelligibility enhancement of narrowband telephone speech
    Jokinen, Emma
    Takanen, Marko
    Alku, Paavo
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1178 - 1182
  • [4] Signal-to-noise ratio adaptive post-filtering method for intelligibility enhancement of telephone speech
    Jokinen, Emma
    Yrttiaho, Santeri
    Pulakka, Hannu
    Vainio, Martti
    Alku, Paavo
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (06): : 3990 - 4001
  • [5] Signal subspace speech enhancement with perceptual post-filtering
    Klein, M
    Kabal, P
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 537 - 540
  • [6] SINGLE-MICROPHONE SPEECH ENHANCEMENT USING MVDR FILTERING AND WIENER POST-FILTERING
    Fischer, Doerte
    Gerkmann, Timo
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 201 - 205
  • [7] Locally Linear Embedding Based Post-Filtering for Speech Enhancement
    Hwang, Hsin-Te
    Wu, Yi-Chiao
    Wang, Syu-Siang
    Hsu, Chin-Cheng
    Tsao, Yu
    Wang, Hsin-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2018, 34 (06) : 1469 - 1491
  • [8] An adaptive post-filtering method producing an artificial Lombard-like effect for intelligibility enhancement of narrowband telephone speech
    Jokinen, Emma
    Takanen, Marko
    Vainio, Martti
    Alku, Paavo
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (02): : 619 - 628
  • [9] A speech enhancement method based on phase-error and post-filtering
    Ma, Xiao-Hong
    Li, Rui
    Yin, Fu-Liang
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (09): : 1977 - 1981
  • [10] Speech Enhancement Based on Beamforming and Post-Filtering by Combining Phase Information
    Cheng, Rui
    Bao, Changchun
    [J]. INTERSPEECH 2020, 2020, : 4496 - 4500