Sparse Signal Recovery through Long Short-Term Memory Networks for Compressive Sensing-Based Speech Enhancement

被引:1
|
作者
Shukla, Vasundhara [1 ]
Swami, Preety D. [1 ]
机构
[1] Univ Inst Technol RGPV, Dept Elect & Commun Engn, Bhopal 462033, India
关键词
speech enhancement; compressive sensing; sparse recovery; LSTM; deep learning; UNDERDETERMINED SYSTEMS; LINEAR-EQUATIONS;
D O I
10.3390/electronics12143097
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel speech enhancement approach based on compressive sensing (CS) which uses long short-term memory (LSTM) networks for the simultaneous recovery and enhancement of the compressed speech signals. The advantage of this algorithm is that it does not require an iterative process to recover the compressed signals, which makes the recovery process fast and straight forward. Furthermore, the proposed approach does not require prior knowledge of signal and noise statistical properties for sensing matrix optimization because the used LSTM can directly extract and learn the required information from the training data. The proposed technique is evaluated against white, babble, and f-16 noises. To validate the effectiveness of the proposed approach, perceptual evaluation of speech quality (PESQ), short-time objective intelligibility (STOI), and signal-to-distortion ratio (SDR) were compared to other variants of OMP-based CS algorithms The experimental outcomes show that the proposed approach achieves the maximum improvements of 50.06%, 43.65%, and 374.16% for PESQ, STOI, and SDR respectively, over the different variants of OMP-based CS algorithms.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Compressive Sensing-Based Speech Enhancement
    Wang, Jia-Ching
    Lee, Yuan-Shan
    Lin, Chang-Hong
    Wang, Shu-Fan
    Shih, Chih-Hao
    Wu, Chung-Hsien
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2122 - 2131
  • [2] Compressive sensing-based speech enhancement in non-sparse noisy environments
    Wu, Dalei
    Zhu, Wei-Ping
    Swamy, M. N. S.
    IET SIGNAL PROCESSING, 2013, 7 (05) : 450 - 457
  • [3] MONAURAL SPEECH ENHANCEMENT BASED ON TWO STAGE LONG SHORT-TERM MEMORY NETWORKS
    Xian, Yang
    Sun, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
  • [4] Deep Long Short-Term Memory Networks for Speech Recognition
    Chien, Jen-Tzung
    Misbullah, Alim
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [5] RECONSTRUCTION OF SPARSE VECTORS IN COMPRESSIVE SENSING WITH MULTIPLE MEASUREMENT VECTORS USING BIDIRECTIONAL LONG SHORT-TERM MEMORY
    Palangi, Hamid
    Ward, Rabab
    Deng, Li
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 192 - 196
  • [6] Heart biometrics based on ECG signal by sparse coding and bidirectional long short-term memory
    Yefei Zhang
    Zhidong Zhao
    Yanjun Deng
    Xiaohong Zhang
    Yu Zhang
    Multimedia Tools and Applications, 2021, 80 : 30417 - 30438
  • [7] Heart biometrics based on ECG signal by sparse coding and bidirectional long short-term memory
    Zhang, Yefei
    Zhao, Zhidong
    Deng, Yanjun
    Zhang, Xiaohong
    Zhang, Yu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30417 - 30438
  • [8] Research on Long Short-Term Memory Networks Speech Separation Algorithm Based on Beamforming
    Lan Chaofeng
    Liu Yan
    Zhao Hongyun
    Liu Chundong
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (07) : 2531 - 2538
  • [9] Speech Inpainting Based on Multi-Layer Long Short-Term Memory Networks
    Shi, Haohan
    Shi, Xiyu
    Dogan, Safak
    FUTURE INTERNET, 2024, 16 (02)
  • [10] SPEECH ENHANCEMENT USING LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS FOR NOISE ROBUST SPEAKER VERIFICATION
    Kolbaek, Morten
    Tan, Zheng-Hua
    Jensen, Jesper
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 305 - 311