Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network

被引:33
|
作者
Esmaeilpour, Mohammad [1 ]
Cardinal, Patrick [1 ]
Koerich, Alessandro Lameiras [1 ]
机构
[1] Univ Quebec, ETS, 1100 Notre Dame West, Montreal, PQ H3C 1K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Environmental sound classification; Generative Adversarial Network (GAN); Cycle-Consistent GAN; K-means plus; Random forests; QUALITY ASSESSMENT; AUDIO; RECOGNITION;
D O I
10.1016/j.asoc.2019.105912
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a novel environmental sound classification approach incorporating unsupervised feature learning via the spherical K-Means++ algorithm and a new architecture for high-level data augmentation. The audio signal is transformed into a 2D representation using a discrete wavelet transform (DWT). The DWT spectrograms are then augmented by a novel architecture for cycle-consistent generative adversarial network. This high-level augmentation bootstraps generated spectrograms in both intra-and inter-class manners by translating structural features from sample to sample. A codebook is built by coding the DWT spectrograms with the speeded-up robust feature detector and the K-Means++ algorithm. The Random forest is the final learning algorithm which learns the environmental sound classification task from the code vectors. Experimental results in four benchmarking environmental sound datasets (ESC-10, ESC-50, UrbanSound8k, and DCASE-2017) have shown that the proposed classification approach outperforms most of the state-of-the-art classifiers, including convolutional neural networks such as AlexNet and GoogLeNet, improving the classification rate between 3.51% and 14.34%, depending on the dataset. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Optimized self-attention based cycle-consistent generative adversarial network adopted melanoma classification from dermoscopic images
    Harini, P.
    Madhavi, N. Bindu
    Latha, S. Bhargavi
    Sasikumar, A. N.
    [J]. MICROSCOPY RESEARCH AND TECHNIQUE, 2024, 87 (06) : 1271 - 1285
  • [42] Unsupervised domain adaptation using modified cycle generative adversarial network for aerial image classification
    Ren, Jiehuang
    Jia, Liye
    Yue, Junhong
    Liu, Xueyu
    Sun, Lixin
    Wu, Yongfei
    Zhou, Daoxiang
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [43] Unsupervised adversarial and cycle consistent feature extraction network for intelligent fault diagnosis
    Wang, Yi-Die
    Chao, Pei-Pei
    Zhang, Rui-Yuan
    Hong, Tang
    Wei, Yu-Cheng
    Dai, Hong-Liang
    [J]. APPLIED SOFT COMPUTING, 2024, 165
  • [44] CBCT correction using a cycle-consistent generative adversarial network and unpaired training to enable photon and proton dose calculation
    Kurz, Christopher
    Maspero, Matteo
    Savenije, Mark H. F.
    Landry, Guillaume
    Kamp, Florian
    Pinto, Marco
    Li, Minglun
    Parodi, Katia
    Belka, Claus
    van den Berg, Cornelis A. T.
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2019, 64 (22):
  • [45] Mandarin Electro-Laryngeal Speech Enhancement Using Cycle-Consistent Generative Adversarial Networks
    Qian, Zhaopeng
    Xiao, Kejing
    Yu, Chongchong
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [46] Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization
    Yuan, Li
    Tay, Francis E. H.
    Li, Ping
    Zhou, Li
    Feng, Jiashi
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9143 - 9150
  • [47] AgriGAN: unpaired image dehazing via a cycle-consistent generative adversarial network for the agricultural plant phenotype
    Ding, Jin-Ting
    Peng, Yong-Yu
    Huang, Min
    Zhou, Sheng-Jun
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [48] Cycle-consistent 3D-generative adversarial network for virtual bowel cleansing in CT colonography
    Nappi, Janne J.
    Yoshida, Hiroyuki
    [J]. MEDICAL IMAGING 2019: IMAGE PROCESSING, 2019, 10949
  • [49] xAI-CycleGAN, a Cycle-Consistent Generative Assistive Network
    Sloboda, Tibor
    Hudec, Lukas
    Benesova, Wanda
    [J]. COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 403 - 411
  • [50] Hounsfield Unit Correction of Prostate CBCT Using Cycle-Consistent Generative Adversarial Networks (CycleGAN)
    Dona, O.
    Wang, Y.
    Xu, A.
    Wuu, C.
    [J]. MEDICAL PHYSICS, 2019, 46 (06) : E317 - E317