TCNN: TEMPORAL CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SPEECH ENHANCEMENT IN THE TIME DOMAIN

被引：0

作者：

Pandey, Ashutosh ^{[1
]}

Wang, DeLiang ^{[1
,2
]}

机构：

[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA

[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年

关键词：

noise-independent and speaker-independent speech enhancement; real-time implementation; time domain; temporal convolutional neural network; TCNN; NOISE;

D O I：

10.1109/icassp.2019.8683634

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This work proposes a fully convolutional neural network (CNN) for real-time speech enhancement in the time domain. The proposed CNN is an encoder-decoder based architecture with an additional temporal convolutional module (TCM) inserted between the encoder and the decoder. We call this architecture a Temporal Convolutional Neural Network (TCNN). The encoder in the TCNN creates a low dimensional representation of a noisy input frame. The TCM uses causal and dilated convolutional layers to utilize the encoder output of the current and previous frames. The decoder uses the TCM output to reconstruct the enhanced frame. The proposed model is trained in a speaker-and noise-independent way. Experimental results demonstrate that the proposed model gives consistently better enhancement results than a state-of-the-art real-time convolutional recurrent model. Moreover, since the model is fully convolutional, it has much fewer trainable parameters than earlier models.

引用

页码：6875 / 6879

页数：5

共 50 条

[21] A FLOW-BASED NEURAL NETWORK FOR TIME DOMAIN SPEECH ENHANCEMENT
Strauss, Martin
Edler, Bernd
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5754 - 5758
[22] Real Time Speech Enhancement in the Waveform Domain
Defossez, Alexandre
Synnaeve, Gabriel
Adi, Yossi
INTERSPEECH 2020, 2020, : 3291 - 3295
[23] DCT based densely connected convolutional GRU for real-time speech enhancement
Jannu, Chaitanya
Vanambathina, Sunny Dayal
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (01) : 1195 - 1208
[24] RT-VENet: A Convolutional Network for Real-time Video Enhancement
Zhang, Mohan
Gao, Qiqi
Wang, Jinglu
Turbell, Henrik
Zhao, David
Yu, Jinhui
Lu, Yan
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4088 - 4097
[25] Real-Time Short-Term Voltage Stability Assessment using Temporal Convolutional Neural Network
Adhikari, Ananta
Naetiladdanon, Sumate
Sangswang, Anawach
2021 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT ASIA), 2021,
[26] Robust and Real-Time Visual Tracking with Triplet Convolutional Neural Network
Kim, Jung Uk
Kim, Hak Gu
Ro, Yong Man
PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 280 - 286
[27] A real-time and accurate convolutional neural network for fabric defect detection
Xueshen Li
Yong Zhu
Complex & Intelligent Systems, 2024, 10 : 3371 - 3387
[28] FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
Liu, Jia
Zhou, Quan
Qiang, Yong
Kang, Bin
Wu, Xiaofu
Zheng, Baoyu
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2373 - 2377
[29] Real-Time Fabric Defect Segmentation Based on Convolutional Neural Network
Zhen Wang
Jing Junfeng
Zhang, Huanhuan
Yan Zhao
AATCC JOURNAL OF RESEARCH, 2021, 8 (1_SUPPL): : 92 - 97
[30] A real-time and accurate convolutional neural network for fabric defect detection
Li, Xueshen
Zhu, Yong
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3371 - 3387

← 1 2 3 4 5 →