Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network

被引:33
|
作者
Esmaeilpour, Mohammad [1 ]
Cardinal, Patrick [1 ]
Koerich, Alessandro Lameiras [1 ]
机构
[1] Univ Quebec, ETS, 1100 Notre Dame West, Montreal, PQ H3C 1K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Environmental sound classification; Generative Adversarial Network (GAN); Cycle-Consistent GAN; K-means plus; Random forests; QUALITY ASSESSMENT; AUDIO; RECOGNITION;
D O I
10.1016/j.asoc.2019.105912
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a novel environmental sound classification approach incorporating unsupervised feature learning via the spherical K-Means++ algorithm and a new architecture for high-level data augmentation. The audio signal is transformed into a 2D representation using a discrete wavelet transform (DWT). The DWT spectrograms are then augmented by a novel architecture for cycle-consistent generative adversarial network. This high-level augmentation bootstraps generated spectrograms in both intra-and inter-class manners by translating structural features from sample to sample. A codebook is built by coding the DWT spectrograms with the speeded-up robust feature detector and the K-Means++ algorithm. The Random forest is the final learning algorithm which learns the environmental sound classification task from the code vectors. Experimental results in four benchmarking environmental sound datasets (ESC-10, ESC-50, UrbanSound8k, and DCASE-2017) have shown that the proposed classification approach outperforms most of the state-of-the-art classifiers, including convolutional neural networks such as AlexNet and GoogLeNet, improving the classification rate between 3.51% and 14.34%, depending on the dataset. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] On Translation and Reconstruction Guarantees of the Cycle-Consistent Generative Adversarial Networks
    Chakrabarty, Anish
    Das, Swagatam
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [22] Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks
    Yuan, Li
    Tay, Francis Eng Hock
    Li, Ping
    Feng, Jiashi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (10) : 2711 - 2722
  • [23] Unsupervised cycle-consistent adversarial attacks for visual object tracking
    Yao, Rui
    Zhu, Xiangbin
    Zhou, Yong
    Shao, Zhiwen
    Hu, Fuyuan
    Zhang, Yanning
    [J]. DISPLAYS, 2023, 80
  • [24] TSI-GAN: Unsupervised Time Series Anomaly Detection Using Convolutional Cycle-Consistent Generative Adversarial Networks
    Saravanan, Shyam Sundar
    Luo, Tie
    Ngo, Mao Van
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT I, 2023, 13935 : 39 - 54
  • [25] Towards Unsupervised Learning for Instrument Segmentation in Robotic Surgery with Cycle-Consistent Adversarial Networks
    Pakhomov, Daniil
    Shen, Wei
    Navab, Nassir
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8499 - 8504
  • [26] Colorization for anime sketches with cycle-consistent adversarial network
    Zhang, Guanghua
    Qu, Mengnan
    Jin, Yuhao
    Song, Qingpeng
    [J]. International Journal of Performability Engineering, 2019, 15 (03) : 910 - 918
  • [27] Endoscopic Ultrasound Image Synthesis Using a Cycle-Consistent Adversarial Network
    Grimwood, Alexander
    Ramalhinho, Joao
    Baum, Zachary M. C.
    Montana-Brown, Nina
    Johnson, Gavin J.
    Hu, Yipeng
    Clarkson, Matthew J.
    Pereira, Stephen P.
    Barratt, Dean C.
    Bonmati, Ester
    [J]. SIMPLIFYING MEDICAL ULTRASOUND, 2021, 12967 : 169 - 178
  • [28] CaGAN: A Cycle-Consistent Generative Adversarial Network With Attention for Low-Dose CT Imaging
    Huang, Zhiyuan
    Chen, Zixiang
    Zhang, Qiyang
    Quan, Guotao
    Ji, Min
    Zhang, Chengjin
    Yang, Yongfeng
    Liu, Xin
    Liang, Dong
    Zheng, Hairong
    Hu, Zhanli
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2020, 6 (06) : 1203 - 1218
  • [29] Speckle noise reduction for medical ultrasound images based on cycle-consistent generative adversarial network
    Liu, Jieyi
    Li, Changchun
    Liu, Liping
    Chen, Haobo
    Han, Hong
    Zhang, Bo
    Zhang, Qi
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [30] Attenuation Correction of PET/MR Using Cycle-Consistent Adversarial Network
    Gong, Kuang
    Yang, Jaewon
    Kim, Kyungsang
    El Fakhri, Georges
    Seo, Youngho
    Li, Quanzheng
    [J]. JOURNAL OF NUCLEAR MEDICINE, 2019, 60