Build A Module for Improvement Real Time Speech enhancement using Long Short-term Memory Approach

被引:0
|
作者
Van Vo [1 ]
Bach Le Son [2 ]
Huy Vo Phuc [2 ]
机构
[1] FPT Univ, Software Engn Dept, Hanoi, Vietnam
[2] FPT Univ, Informat Technol Specialized Dept, Hanoi, Vietnam
关键词
Speech enhancement; Noise suppression; Deep Learning; Long Short-term Memory; Virtual Call Center; Customer Relationship Management System;
D O I
10.1145/3591569.3591614
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An essential customer experience is required for all businesses today, and customer support as a service brings the right people and processes together. When designing a system for in the context of audio communication for transmission purposes, noise influences must be carefully considered. Improving the quality of phone calls for a smart virtual call center is essential for more effective customer care. This paper proposed a module for improving real-time speech enhancement of phone calls using Long short-term memory (LSTM), an artificial neural network used in the fields of artificial intelligence and deep learning. LSTMs are designed to revoke the long-term dependency issue, remembering information for long periods is generally their default way of behaving. The data set using for this approach is both in English and Vietnamese, the results also improve with evaluation metrics such as PESQ, SI-SDR, STOI.
引用
收藏
页码:259 / 264
页数:6
相关论文
共 50 条
  • [21] Language Modeling Using Part-of-speech and Long Short-Term Memory Networks
    Norouzi, Sanaz Saki
    Akbari, Ahmad
    Nasersharif, Babak
    2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 182 - 187
  • [22] LOMBARD SPEECH SYNTHESIS USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS
    Bollepalli, Bajibabu
    Airaksinen, Manu
    Alku, Paavo
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5505 - 5509
  • [23] Modeling Speaker Variability Using Long Short-Term Memory Networks for Speech Recognition
    Li, Xiangang
    Wu, Xihong
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1086 - 1090
  • [24] Part of Speech Tagging for Indonesian Language using Bidirectional Long Short-Term Memory
    Handrata, Dellon
    Purwanto, Christian Nathaniel
    Chandra, Fransisca Haryanti
    Santoso, Joan
    Gunawan
    2019 1ST INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS), 2019, : 85 - 88
  • [25] A Speech Recognition Method Using Long Short-Term Memory Network in Low Resources
    Shu F.
    Qu D.
    Zhang W.
    Zhou L.
    Guo W.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2017, 51 (10): : 120 - 127
  • [26] Recognition of Spontaneous Conversational Speech using Long Short-Term Memory Phoneme Predictions
    Woellmer, Martin
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1946 - 1949
  • [27] The time needed to consolidate short-term memory to long-term memory
    Takeyama, E
    Takenoshita, M
    Nishimura, S
    Yoshiya, I
    ANESTHESIOLOGY, 1998, 89 (3A) : U317 - U317
  • [28] Incremental Face Recognition: Hybrid Approach Using Short-Term Memory and Long-Term Memory
    Kim, Sangwook
    Mallipeddi, Rammohan
    Lee, Minho
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 194 - 201
  • [29] Speech Perception Improvement Algorithm Based on a Dual-Path Long Short-Term Memory Network
    Koh, Hyeong Il
    Na, Sungdae
    Kim, Myoung Nam
    Ieracitano, Cosimo
    Zhang, Xuejun
    BIOENGINEERING-BASEL, 2023, 10 (11):
  • [30] Enhanced Deep Hierarchical Long Short-Term Memory and Bidirectional Long Short-Term Memory for Tamil Emotional Speech Recognition using Data Augmentation and Spatial Features
    Fernandes, Bennilo
    Mannepalli, Kasiprasad
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2021, 29 (04): : 2967 - 2992