Comparative Studies of Single-Channel Speech Enhancement Techniques

被引:0
|
作者
Kumar, Bittu [1 ]
Kumar, Neeraj [2 ]
Kumar, Manoj [3 ]
Prasad, S. V. S. [3 ]
Varma, Ashwini Kumar [1 ]
Ravi, Banoth [4 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept Elect & Commun Engn, Hyderabad, Telangana, India
[2] Indian Inst Informat Technol, Dept Elect Engn, Bhopal, India
[3] MLR Inst Technol, Dept Elect & Commun Engn, Hyderabad, India
[4] Indian Inst Informat Technol, Dept Elect Engn, Trichy, India
关键词
Spectral subtraction; MMSE; Speech enhancement; Compressive sensing; Noise estimation; Signal estimation; OBJECTIVE QUALITY MEASURES; NOISE-ESTIMATION ALGORITHM; SIGNAL RECOVERY; SPECTRAL SUBTRACTION;
D O I
10.1080/03772063.2023.2273299
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Several speech enhancement techniques like Spectral Subtraction, MMSE, Log-MMSE, $ \rbeta $ beta-order MMSE, adaptive $ \rbeta $ beta-order MMSE and compressive sensing methods are developed worldwide. Scientists, engineers and researchers have implemented, evaluated and tested all methods individually with the different speech corpus. However, we found few articles on comparative studies of various speech enhancement techniques. In the present paper, several speech enhancement techniques have been studied, and their performance in terms of speech quality measures is compared objectively and subjectively. The results have been evaluated not only through speech quality measures but also in terms of waveform and spectrogram for speech enhancement applications. For this, MATLAB is used for the simulation of all methods. After getting the enhanced speech signals, we evaluated their enhanced speech signals of the methods. Results in terms of objective evaluation parameters indicated that the adaptive $ \rbeta $ beta-order MMSE-based method produces good-quality speech signals compared to the other methods. Also, we evaluated their enhanced speech signals using a listening test, i.e. subjective evaluation. In the subjective quality test through mean opinion score (using the listening test), the performance of the adaptive $ \rbeta $ beta-order MMSE method and GOMP are equal. In the case of waveform and spectrogram, the visualisation of enhanced speech signal obtained from GOMP-based compressive algorithms is very close to clean speech signal.
引用
收藏
页码:5704 / 5720
页数:17
相关论文
共 50 条
  • [31] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [32] STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement
    Krawczyk, Martin
    Gerkmann, Timo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1931 - 1940
  • [33] SINGLE-CHANNEL SPEECH ENHANCEMENT IN A TRANSIENT NOISE ENVIRONMENT BY EXPLOITING SPEECH HARMONICITY
    Wu, Kai
    Reju, V. G.
    Khong, Andy W. H.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5088 - 5092
  • [34] A SPECTRAL CONVERSION BASED SINGLE-CHANNEL SINGLE-MICROPHONE SPEECH ENHANCEMENT
    Huy-Khoi Do
    Quang Vinh Thai
    [J]. FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 583 - +
  • [35] ON SPEECH QUALITY ES TIMATION OF PHASE-AWARE SINGLE-CHANNEL SPEECH ENHANCEMENT
    Gaich, Andreas
    Mowlaee, Pejman
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 216 - 220
  • [36] Phase Estimation in Single-Channel Speech Enhancement: Limits-Potential
    Mowlaee, Pejman
    Kulmer, Josef
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (08) : 1283 - 1294
  • [37] Glance and gaze: A collaborative learning framework for single-channel speech enhancement
    Li, Andong
    Zheng, Chengshi
    Zhang, Lu
    Li, Xiaodong
    [J]. APPLIED ACOUSTICS, 2022, 187
  • [38] Two-Stage Temporal Processing for Single-Channel Speech Enhancement
    Samui, Sunzan
    Chakrabarti, Indrajit
    Ghosh, Soumya Kanti
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3723 - 3727
  • [39] Single-channel speech enhancement using Kalman filtering in the modulation domain
    So, Stephen
    Wojcicki, Kamil K.
    Paliwal, Kuldip K.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 993 - 996
  • [40] Single-channel speech enhancement based on joint constrained dictionary learning
    Linhui Sun
    Yunyi Bu
    Pingan Li
    Zihao Wu
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2021