Build A Module for Improvement Real Time Speech enhancement using Long Short-term Memory Approach

被引:0
|
作者
Van Vo [1 ]
Bach Le Son [2 ]
Huy Vo Phuc [2 ]
机构
[1] FPT Univ, Software Engn Dept, Hanoi, Vietnam
[2] FPT Univ, Informat Technol Specialized Dept, Hanoi, Vietnam
关键词
Speech enhancement; Noise suppression; Deep Learning; Long Short-term Memory; Virtual Call Center; Customer Relationship Management System;
D O I
10.1145/3591569.3591614
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An essential customer experience is required for all businesses today, and customer support as a service brings the right people and processes together. When designing a system for in the context of audio communication for transmission purposes, noise influences must be carefully considered. Improving the quality of phone calls for a smart virtual call center is essential for more effective customer care. This paper proposed a module for improving real-time speech enhancement of phone calls using Long short-term memory (LSTM), an artificial neural network used in the fields of artificial intelligence and deep learning. LSTMs are designed to revoke the long-term dependency issue, remembering information for long periods is generally their default way of behaving. The data set using for this approach is both in English and Vietnamese, the results also improve with evaluation metrics such as PESQ, SI-SDR, STOI.
引用
收藏
页码:259 / 264
页数:6
相关论文
共 50 条
  • [1] Time Series-based Spoof Speech Detection Using Long Short-term Memory and Bidirectional Long Short-term Memory
    Mirza, Arsalan R.
    Al-Talabani, Abdulbasit K.
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2024, 12 (02): : 119 - 129
  • [2] Speech Dereverberation Using Long Short-Term Memory
    Mimura, Masato
    Sakai, Shinsuke
    Kawahara, Tatsuya
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2435 - 2439
  • [3] MULTICHANNEL SPEECH ENHANCEMENT BASED ON TIME-FREQUENCY MASKING USING SUBBAND LONG SHORT-TERM MEMORY
    Li, Xiaofei
    Horaud, Radu
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 298 - 302
  • [4] Short-term traffic travel time forecasting using ensemble approach based on long short-term memory networks
    Jia, Xingli
    Zhou, Wuxiao
    Yang, Hongzhi
    Li, Shuangqing
    Chen, Xingpeng
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (06) : 1262 - 1273
  • [5] MONAURAL SPEECH ENHANCEMENT BASED ON TWO STAGE LONG SHORT-TERM MEMORY NETWORKS
    Xian, Yang
    Sun, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
  • [6] Long Short-term Memory for Tibetan Speech Recognition
    Wang, Weizhe
    Chen, Ziyan
    Yang, Hongwu
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1059 - 1063
  • [7] Real time anomalies detection in crowd using convolutional long short-term memory network
    Saba, Tanzila
    JOURNAL OF INFORMATION SCIENCE, 2023, 49 (05) : 1145 - 1152
  • [8] Predictive model for real-time energy disaggregation using long short-term memory
    Li, Bingbing
    Wu, Tongzi
    Bian, Shijie
    Sutherland, John W.
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2023, 72 (01) : 25 - 28
  • [9] Towards real-world objective speech quality and intelligibility assessment using speech-enhancement residuals and convolutional long short-term memory networks
    Dong, Xuan
    Williamson, Donald S.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 148 (05): : 3348 - 3359
  • [10] Emotion Recognition From Speech and Text using Long Short-Term Memory
    Venkateswarlu, Sonagiri China
    Jeevakala, Siva Ramakrishna
    Kumar, Naluguru Udaya
    Munaswamy, Pidugu
    Pendyala, Dhanalaxmi
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (04) : 11166 - 11169