Single-Channel Speech Enhancement Using Single Dimension Change Accelerated Particle Swarm Optimization for Subspace Partitioning

被引:4
|
作者
Ghorpade, Kalpana [1 ]
Khaparde, Arti [2 ]
机构
[1] Cummins Coll Engn Women, Dept Elect & Telecommun, Pune, Maharashtra, India
[2] MIT World Peace Univ, Dept ECE, Pune, Maharashtra, India
关键词
Eigenvalue decomposition; Modified accelerated particle swarm optimization; Speech enhancement; Subspace method; Voice activity detection; DESIGN; NOISE;
D O I
10.1007/s00034-023-02324-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech signal gets contaminated by background noise affecting its quality and intelligibility. There are different sources of additive noise. This additive noise, either stationary or non-stationary, has a distinct distribution of noise energy in the frequency domain. Degraded speech affects the performance of speech-operated systems. Speech enhancement can reduce this additive noise. Here, we propose a subspace-based single-channel speech enhancement method using modified accelerated particle swarm optimization to optimize subspace partitioning. Principal components of noisy speech are partitioned into speech, speech plus noise, and noise only based on the signal-to-noise ratio of principal components. Voice activity detection is implemented to find the variance of additive noise. Modified accelerated particle swarm optimization optimizes the number of principal components in each partition and the weights of the components in each class. The proposed speech enhancement method gives better results for the quality and intelligibility measures of enhanced speech compared with conventional speech enhancement methods. We got 18.8% improvement in STOI for 0 dB restaurant noise, 20.5% improvement for 0 dB train noise, and 11.55% improvement for 0 dB exhibition noise. We got an improvement of 39.15% in PESQ for 0 dB babble noise, 41.57% for 0 dB car noise, and 31.79% increase for 0 dB airport noise. The average improvement in the segmental SNR of the enhanced speech is 8.32 dB for 0 dB noise. There is 4.4 dB improvement in SDR for the airport noise and 5.54 dB improvement for the station noise. We got this improvement with minimum speech distortion.
引用
收藏
页码:4343 / 4361
页数:19
相关论文
共 50 条
  • [41] Improved Particle Swarm Optimization for Dual-Channel Speech Enhancement
    Asl, Laleh Badri
    Nezhad, Vahid Majid
    2010 INTERNATIONAL CONFERENCE ON SIGNAL ACQUISITION AND PROCESSING: ICSAP 2010, PROCEEDINGS, 2010, : 13 - 17
  • [42] Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    Sun, Meng
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 714 - 717
  • [43] JOINT OPTIMIZATION OF AUDIBLE NOISE SUPPRESSION AND DEEP NEURAL NETWORKS FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Sun, Meng
    Yang, Jibin
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [44] Supervised single-channel speech enhancement using ratio mask with joint dictionary learning
    Zhang, Long
    Bao, Guangzhao
    Zhang, Jing
    Ye, Zhongfu
    SPEECH COMMUNICATION, 2016, 82 : 38 - 52
  • [45] Evaluation of Single-Channel Speech Enhancement Algorithms by Using Objective Quality and Intelligibility Measures
    Arslan, Ozkan
    Engin, Erkan Zeki
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [46] Single-channel speech enhancement using implicit Wiener filter for high-quality speech communication
    Jaiswal R.K.
    Yeduri S.R.
    Cenkeramaddi L.R.
    International Journal of Speech Technology, 2022, 25 (03) : 745 - 758
  • [47] Glance and gaze: A collaborative learning framework for single-channel speech enhancement
    Li, Andong
    Zheng, Chengshi
    Zhang, Lu
    Li, Xiaodong
    APPLIED ACOUSTICS, 2022, 187
  • [48] Phase Estimation in Single-Channel Speech Enhancement: Limits-Potential
    Mowlaee, Pejman
    Kulmer, Josef
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (08) : 1283 - 1294
  • [49] Two-Stage Temporal Processing for Single-Channel Speech Enhancement
    Samui, Sunzan
    Chakrabarti, Indrajit
    Ghosh, Soumya Kanti
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3723 - 3727
  • [50] Single-channel speech enhancement based on joint constrained dictionary learning
    Linhui Sun
    Yunyi Bu
    Pingan Li
    Zihao Wu
    EURASIP Journal on Audio, Speech, and Music Processing, 2021