Formant filters-based multi-band speech enhancement algorithm for intelligibility improvement

Cited by: 0
Authors
Jeeva, M. P. Actlin [1]
Nagarajan, T. [2]
Vijayalakshmi, P. [1]
Affiliations
[1] SSN Coll Engn, Dept Elect & Commun Engn, Madras, Tamil Nadu, India
[2] SSN Coll Engn, Dept Informat Technol, Madras, Tamil Nadu, India
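Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract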
Past speech enhancement algorithms have concentrated on improving speech quality; however, they do not necessarily improve the intelligibility of the enhanced speech. The current work focuses on improving both the quality and the intelligibility of the well-known multi-band spectral subtraction algorithm. To improve speech quality, a temporal-domain filtering-based approach is proposed to obtain ERB-based sub-bands. To improve intelligibility, it is necessary to identify the type of distortion (attenuation or amplification) that affects the intelligibility of the enhanced speech. An analysis is therefore performed on the enhanced speech at the phoneme level using segmental SNR, and it is observed that in high-SNR regions of the noisy speech (specifically in vowels, liquids, and nasals), intelligibility is reduced by amplification distortion. This may be due to the high spectral resolution of the temporal-domain ERB-based filters. Hence, to improve intelligibility, a set of formant-specific filters is proposed, based on a formant analysis carried out over vowels, liquids, and nasals. The proposed multi-band spectral subtraction algorithm is evaluated for quality and intelligibility using subjective (MOS) and objective (PESQ and CSII) measures, for speech corrupted by white, car, and babble noise at SNR levels from -5 to 15 dB. The proposed method improves speech quality and intelligibility over the conventional multi-band spectral subtraction method by around 0.1-0.5 in terms of PESQ and 2-10% in terms of CSII.
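The abstract relies on segmental SNR for its phoneme-level analysis but does not give the computation. The following is a minimal sketch of a generic frame-averaged segmental SNR, not the authors' implementation. The function name `segmental_snr`, the frame length, the hop size, and the clamping range of -10 to 35 dB are all illustrative assumptions (the clamping range is a commonly used convention, not a value taken from the paper).

```python
import numpy as np


def segmental_snr(clean, processed, frame_len=256, hop=128,
                  floor_db=-10.0, ceil_db=35.0):
    """Mean frame-wise SNR in dB between a clean reference and a processed signal.

    All default parameter values are assumptions for illustration only.
    """
    clean = np.asarray(clean, dtype=float)
    processed = np.asarray(processed, dtype=float)
    n = min(len(clean), len(processed))
    eps = 1e-12  # avoids division by zero and log of zero
    frame_snrs = []
    for start in range(0, n - frame_len + 1, hop):
        ref = clean[start:start + frame_len]
        err = ref - processed[start:start + frame_len]
        snr_db = 10.0 * np.log10((np.sum(ref ** 2) + eps) /
                                 (np.sum(err ** 2) + eps))
        # Clamp each frame so silent or perfectly matched frames
        # do not dominate the average.
        frame_snrs.append(np.clip(snr_db, floor_db, ceil_db))
    if not frame_snrs:
        raise ValueError("signal is shorter than one frame")
    return float(np.mean(frame_snrs))


if __name__ == "__main__":
    t = np.arange(4096)
    clean = np.sin(2 * np.pi * 0.01 * t)
    # A uniform 10% amplitude error gives roughly 20 dB in every frame.
    print(segmental_snr(clean, 1.1 * clean))
```

To study individual phonemes, one would restrict the frame loop to the sample range of each labelled segment; the segment boundaries would have to come from an external phonetic alignment, which this sketch does not provide.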
Pages: 6