TCNN: TEMPORAL CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SPEECH ENHANCEMENT IN THE TIME DOMAIN

Authors
Pandey, Ashutosh [1]
Wang, DeLiang [1, 2]
Affiliations
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
Keywords
noise-independent and speaker-independent speech enhancement; real-time implementation; time domain; temporal convolutional neural network; TCNN; NOISE;
DOI
10.1109/icassp.2019.8683634
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
This work proposes a fully convolutional neural network (CNN) for real-time speech enhancement in the time domain. The proposed CNN is an encoder-decoder based architecture with an additional temporal convolutional module (TCM) inserted between the encoder and the decoder. We call this architecture a Temporal Convolutional Neural Network (TCNN). The encoder in the TCNN creates a low-dimensional representation of a noisy input frame. The TCM uses causal and dilated convolutional layers to utilize the encoder output of the current and previous frames. The decoder uses the TCM output to reconstruct the enhanced frame. The proposed model is trained in a speaker- and noise-independent way. Experimental results demonstrate that the proposed model gives consistently better enhancement results than a state-of-the-art real-time convolutional recurrent model. Moreover, since the model is fully convolutional, it has far fewer trainable parameters than earlier models.
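The causal, dilated convolution that the abstract attributes to the TCM can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the kernel weights, the dilation rates (1, 2, 4), the signal length, and the helper names `causal_dilated_conv1d` and `receptive_field` are all assumptions chosen for illustration, since the abstract does not state the actual layer configuration.

```python
from typing import List, Sequence


def causal_dilated_conv1d(
    x: Sequence[float], kernel: Sequence[float], dilation: int
) -> List[float]:
    """Compute y[t] = sum_k kernel[k] * x[t - k * dilation].

    Samples before the start of the sequence are treated as zero, so the
    output at frame t depends only on the current and previous frames.
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(kernel):
            idx = t - k * dilation
            if idx >= 0:  # causal: never read a future or out-of-range sample
                acc += w * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Number of input frames that can influence one output frame."""
    return 1 + (kernel_size - 1) * sum(dilations)


# Hypothetical configuration (not taken from the paper).
dilations = [1, 2, 4]
kernel = [1.0, 1.0]

# Unit impulse at frame 3: its effect may only spread to later frames.
signal = [0.0] * 10
signal[3] = 1.0

y = signal
for d in dilations:
    y = causal_dilated_conv1d(y, kernel, d)

print(y)
print(receptive_field(len(kernel), dilations))
```

With these assumed settings, the impulse at frame 3 leaves frames 0 to 2 untouched, which is the causality property that permits frame-by-frame processing. Doubling the dilation at each layer grows the receptive field quickly while the number of weights per layer stays the same.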
Pages: 6875-6879
Page count: 5