Build A Module for Improvement Real Time Speech enhancement using Long Short-term Memory Approach

Cited by: 0
Authors
Van Vo [1]
Bach Le Son [2]
Huy Vo Phuc [2]
Affiliations
[1] FPT Univ, Software Engn Dept, Hanoi, Vietnam
[2] FPT Univ, Informat Technol Specialized Dept, Hanoi, Vietnam
Keywords
Speech enhancement; Noise suppression; Deep Learning; Long Short-term Memory; Virtual Call Center; Customer Relationship Management System;
DOI
10.1145/3591569.3591614
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A good customer experience is essential for all businesses today, and customer support as a service brings the right people and processes together. When designing a system for audio communication and transmission, the influence of noise must be carefully considered. Improving the quality of phone calls in a smart virtual call center is essential for more effective customer care. This paper proposes a module for real-time speech enhancement of phone calls using long short-term memory (LSTM), an artificial neural network used in the fields of artificial intelligence and deep learning. LSTMs are designed to avoid the long-term dependency problem; remembering information for long periods is essentially their default behavior. The dataset used for this approach covers both English and Vietnamese, and the results show improvement on evaluation metrics such as PESQ, SI-SDR, and STOI.
Pages: 259-264
Page count: 6