Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation

被引:6
|
作者
Yun, Deokgyu [1 ]
Choi, Seung Ho [2 ]
机构
[1] Seoul Natl Univ Sci & Technol, Dept Elect Engn, Seoul 139743, South Korea
[2] Seoul Natl Univ Sci & Technol, Dept Elect & IT Media Engn, Seoul 139743, South Korea
关键词
audio data augmentation; dereverberation; deep learning; room impulse response;
D O I
10.3390/s22020592
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper proposes an audio data augmentation method based on deep learning in order to improve the performance of dereverberation. Conventionally, audio data are augmented using a room impulse response, which is artificially generated by some methods, such as the image method. The proposed method estimates a reverberation environment model based on a deep neural network that is trained by using clean and recorded audio data as inputs and outputs, respectively. Then, a large amount of a real augmented database is constructed by using the trained reverberation model, and the dereverberation model is trained with the augmented database. The performance of the augmentation model was verified by a log spectral distance and mean square error between the real augmented data and the recorded data. In addition, according to dereverberation experiments, the proposed method showed improved performance compared with the conventional method.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Data Augmentation for Deep Learning-Based Radio Modulation Classification
    Huang, Liang
    Pan, Weijian
    Zhang, You
    Qian, Liping
    Gao, Nan
    Wu, Yuan
    [J]. IEEE ACCESS, 2020, 8 : 1498 - 1506
  • [2] A Deep Learning-Based Coyote Detection System Using Audio Data
    Jung, Heesun
    Kwon, Bokyung
    Kim, Youngbin
    Lee, Yejin
    Park, Jihyeon
    Pegg, Griffin
    Wang, Yaqin
    Smith, Anthony H.
    [J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 170 - 175
  • [3] Deep Learning-based Anomaly Detection for Compressors Using Audio Data
    Mobtahej, Pooyan
    Zhang, Xulong
    Hamidi, Maryam
    Zhang, Jing
    [J]. 67TH ANNUAL RELIABILITY & MAINTAINABILITY SYMPOSIUM (RAMS 2021), 2021,
  • [4] Hapke Data Augmentation for Deep Learning-Based Hyperspectral Data Analysis With Limited Samples
    Qin, Kai
    Ge, Fangyuan
    Zhao, Yingjun
    Zhu, Ling
    Li, Ming
    Shi, Cong
    Li, Dong
    Zhou, Xichuan
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (05) : 886 - 890
  • [5] Influence of Autoencoder-Based Data Augmentation on Deep Learning-Based Wireless Communication
    Li, Linyu
    Zhang, Zhengming
    Yang, Luxi
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (09) : 2090 - 2093
  • [6] Deep learning-based gesture recognition for surgical applications: A data augmentation approach
    Santiago, Sofia Sorbet
    Cifuentes, Jenny Alexandra
    [J]. EXPERT SYSTEMS, 2024,
  • [7] Temporal and spatial satellite data augmentation for deep learning-based rainfall nowcasting
    Yesilkoy, Ozlem Baydaroglu
    Demir, Ibrahim
    [J]. JOURNAL OF HYDROINFORMATICS, 2024, 26 (03) : 589 - 607
  • [8] A LEARNING-BASED APPROACH TO DIRECTION OF ARRIVAL ESTIMATION IN NOISY AND REVERBERANT ENVIRONMENTS
    Xiao, Xiong
    Zhao, Shengkui
    Zhong, Xionghu
    Jones, Douglas L.
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2814 - 2818
  • [9] Deep Learning-Based SNR Estimation
    Zheng, Shilian
    Chen, Shurun
    Chen, Tao
    Yang, Zhuang
    Zhao, Zhijin
    Yang, Xiaoniu
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 4778 - 4796
  • [10] Deep Learning-Based Channel Estimation
    Soltani, Mehran
    Pourahmadi, Vahid
    Mirzaei, Ali
    Sheikhzadeh, Hamid
    [J]. IEEE COMMUNICATIONS LETTERS, 2019, 23 (04) : 652 - 655