TCNN: TEMPORAL CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SPEECH ENHANCEMENT IN THE TIME DOMAIN

被引:0
|
作者
Pandey, Ashutosh [1 ]
Wang, DeLiang [1 ,2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
关键词
noise-independent and speaker-independent speech enhancement; real-time implementation; time domain; temporal convolutional neural network; TCNN; NOISE;
D O I
10.1109/icassp.2019.8683634
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work proposes a fully convolutional neural network (CNN) for real-time speech enhancement in the time domain. The proposed CNN is an encoder-decoder based architecture with an additional temporal convolutional module (TCM) inserted between the encoder and the decoder. We call this architecture a Temporal Convolutional Neural Network (TCNN). The encoder in the TCNN creates a low dimensional representation of a noisy input frame. The TCM uses causal and dilated convolutional layers to utilize the encoder output of the current and previous frames. The decoder uses the TCM output to reconstruct the enhanced frame. The proposed model is trained in a speaker-and noise-independent way. Experimental results demonstrate that the proposed model gives consistently better enhancement results than a state-of-the-art real-time convolutional recurrent model. Moreover, since the model is fully convolutional, it has much fewer trainable parameters than earlier models.
引用
收藏
页码:6875 / 6879
页数:5
相关论文
共 50 条
  • [21] A FLOW-BASED NEURAL NETWORK FOR TIME DOMAIN SPEECH ENHANCEMENT
    Strauss, Martin
    Edler, Bernd
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5754 - 5758
  • [22] Real Time Speech Enhancement in the Waveform Domain
    Defossez, Alexandre
    Synnaeve, Gabriel
    Adi, Yossi
    INTERSPEECH 2020, 2020, : 3291 - 3295
  • [23] DCT based densely connected convolutional GRU for real-time speech enhancement
    Jannu, Chaitanya
    Vanambathina, Sunny Dayal
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (01) : 1195 - 1208
  • [24] RT-VENet: A Convolutional Network for Real-time Video Enhancement
    Zhang, Mohan
    Gao, Qiqi
    Wang, Jinglu
    Turbell, Henrik
    Zhao, David
    Yu, Jinhui
    Lu, Yan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4088 - 4097
  • [25] Real-Time Short-Term Voltage Stability Assessment using Temporal Convolutional Neural Network
    Adhikari, Ananta
    Naetiladdanon, Sumate
    Sangswang, Anawach
    2021 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT ASIA), 2021,
  • [26] Robust and Real-Time Visual Tracking with Triplet Convolutional Neural Network
    Kim, Jung Uk
    Kim, Hak Gu
    Ro, Yong Man
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 280 - 286
  • [27] A real-time and accurate convolutional neural network for fabric defect detection
    Xueshen Li
    Yong Zhu
    Complex & Intelligent Systems, 2024, 10 : 3371 - 3387
  • [28] FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Liu, Jia
    Zhou, Quan
    Qiang, Yong
    Kang, Bin
    Wu, Xiaofu
    Zheng, Baoyu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2373 - 2377
  • [29] Real-Time Fabric Defect Segmentation Based on Convolutional Neural Network
    Zhen Wang
    Jing Junfeng
    Zhang, Huanhuan
    Yan Zhao
    AATCC JOURNAL OF RESEARCH, 2021, 8 (1_SUPPL): : 92 - 97
  • [30] A real-time and accurate convolutional neural network for fabric defect detection
    Li, Xueshen
    Zhu, Yong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3371 - 3387