Sparse Signal Recovery through Long Short-Term Memory Networks for Compressive Sensing-Based Speech Enhancement

被引：1

作者：

Shukla, Vasundhara ^{[1
]}

Swami, Preety D. ^{[1
]}

机构：

[1] Univ Inst Technol RGPV, Dept Elect & Commun Engn, Bhopal 462033, India

来源：

ELECTRONICS | 2023年 / 12卷 / 14期

关键词：

speech enhancement; compressive sensing; sparse recovery; LSTM; deep learning; UNDERDETERMINED SYSTEMS; LINEAR-EQUATIONS;

D O I：

10.3390/electronics12143097

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a novel speech enhancement approach based on compressive sensing (CS) which uses long short-term memory (LSTM) networks for the simultaneous recovery and enhancement of the compressed speech signals. The advantage of this algorithm is that it does not require an iterative process to recover the compressed signals, which makes the recovery process fast and straight forward. Furthermore, the proposed approach does not require prior knowledge of signal and noise statistical properties for sensing matrix optimization because the used LSTM can directly extract and learn the required information from the training data. The proposed technique is evaluated against white, babble, and f-16 noises. To validate the effectiveness of the proposed approach, perceptual evaluation of speech quality (PESQ), short-time objective intelligibility (STOI), and signal-to-distortion ratio (SDR) were compared to other variants of OMP-based CS algorithms The experimental outcomes show that the proposed approach achieves the maximum improvements of 50.06%, 43.65%, and 374.16% for PESQ, STOI, and SDR respectively, over the different variants of OMP-based CS algorithms.

引用

页数：18

共 50 条

[1] Compressive Sensing-Based Speech Enhancement
Wang, Jia-Ching
Lee, Yuan-Shan
Lin, Chang-Hong
Wang, Shu-Fan
Shih, Chih-Hao
Wu, Chung-Hsien
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2122 - 2131
[2] Compressive sensing-based speech enhancement in non-sparse noisy environments
Wu, Dalei
Zhu, Wei-Ping
Swamy, M. N. S.
IET SIGNAL PROCESSING, 2013, 7 (05) : 450 - 457
[3] MONAURAL SPEECH ENHANCEMENT BASED ON TWO STAGE LONG SHORT-TERM MEMORY NETWORKS
Xian, Yang
Sun, Yang
Wang, Wenwu
Naqvi, Syed Mohsen
2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
[4] Deep Long Short-Term Memory Networks for Speech Recognition
Chien, Jen-Tzung
Misbullah, Alim
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[5] RECONSTRUCTION OF SPARSE VECTORS IN COMPRESSIVE SENSING WITH MULTIPLE MEASUREMENT VECTORS USING BIDIRECTIONAL LONG SHORT-TERM MEMORY
Palangi, Hamid
Ward, Rabab
Deng, Li
2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 192 - 196
[6] Heart biometrics based on ECG signal by sparse coding and bidirectional long short-term memory
Yefei Zhang
Zhidong Zhao
Yanjun Deng
Xiaohong Zhang
Yu Zhang
Multimedia Tools and Applications, 2021, 80 : 30417 - 30438
[7] Heart biometrics based on ECG signal by sparse coding and bidirectional long short-term memory
Zhang, Yefei
Zhao, Zhidong
Deng, Yanjun
Zhang, Xiaohong
Zhang, Yu
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30417 - 30438
[8] Research on Long Short-Term Memory Networks Speech Separation Algorithm Based on Beamforming
Lan Chaofeng
Liu Yan
Zhao Hongyun
Liu Chundong
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (07) : 2531 - 2538
[9] Speech Inpainting Based on Multi-Layer Long Short-Term Memory Networks
Shi, Haohan
Shi, Xiyu
Dogan, Safak
FUTURE INTERNET, 2024, 16 (02)
[10] SPEECH ENHANCEMENT USING LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS FOR NOISE ROBUST SPEAKER VERIFICATION
Kolbaek, Morten
Tan, Zheng-Hua
Jensen, Jesper
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 305 - 311

← 1 2 3 4 5 →