A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION

被引:0
|
作者
Zhang, Yucong [1 ,2 ]
Liu, Juan [1 ]
Tian, Yao [3 ]
Liu, Haifeng [4 ]
Li, Ming [1 ,2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Duke Kunshan Univ, Suzhou Municipal Key Lab Multimodal Intelligent, Kunshan, Peoples R China
[3] OPPO, Data & AI Engn Syst, Beijing, Peoples R China
[4] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Anomalous sound detection; squeeze and excitation; frequency pattern analysis; temporal periodicity analysis;
D O I
10.1109/ICASSP48485.2024.10448126
中图分类号
学科分类号
摘要
In contrast to human speech, machine-generated sounds of the same type often exhibit consistent frequency characteristics and discernible temporal periodicity. However, leveraging these dual attributes in anomaly detection remains relatively under-explored. In this paper, we propose an automated dual-path framework that learns prominent frequency and temporal patterns for diverse machine types. One pathway uses a novel Frequency-and-Time Excited Network (FTE-Net) to learn the salient features across frequency and time axes of the spectrogram. It incorporates a Frequency-and-Time Chunkwise Encoder (FTC-Encoder) and an excitation network. The other pathway uses a 1D convolutional network for utterance-level spectrum. Experimental results on the DCASE 2023 task 2 dataset show the state-of-the-art performance of our proposed method. Moreover, visualizations of the intermediate feature maps in the excitation network are provided to illustrate the effectiveness of our method.
引用
收藏
页码:1266 / 1270
页数:5
相关论文
共 50 条
  • [31] Dual-path joint correction network for underwater image enhancement
    Zhang, Dehuan
    Shen, Jiaqi
    Zhou, Jingchun
    Chen, Erkang
    Zhang, Weishi
    OPTICS EXPRESS, 2022, 30 (18) : 33412 - 33432
  • [32] A Dual-Path Small Convolution Network for Hyperspectral Image Classification
    Dang, Lanxue
    Pang, Peidong
    Zuo, Xianyu
    Liu, Yang
    Lee, Jay
    REMOTE SENSING, 2021, 13 (17)
  • [33] DMPNet: dual-path and multi-scale pansharpening network
    Kaur, Gurpreet
    Malhotra, Manisha
    Singh, Dilbag
    Singhal, Sunita
    FRONTIERS IN COMPUTER SCIENCE, 2025, 6
  • [34] A DEEP DUAL-PATH NETWORK FOR IMPROVED MAMMOGRAM IMAGE PROCESSING
    Li, Heyi
    Chen, Dongdong
    Nailon, William H.
    Davies, Mike E.
    Laurenson, Dave
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1224 - 1228
  • [35] Dual-Path Network-Based Hyperspectral Image Classification
    Kang, Xudong
    Zhuo, Binbin
    Du, Puhong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (03) : 447 - 451
  • [36] Dual-Path Deep Fusion Network for Face Image Hallucination
    Jiang, Kui
    Wang, Zhongyuan
    Yi, Peng
    Lu, Tao
    Jiang, Junjun
    Xiong, Zixiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 378 - 391
  • [37] Dual-Path Attention Network for Compressed Sensing Image Reconstruction
    Sun, Yubao
    Chen, Jiwei
    Liu, Qingshan
    Liu, Bo
    Guo, Guodong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9482 - 9495
  • [38] Lightweight and efficient dual-path fusion network for iris segmentation
    Songze Lei
    Aokui Shan
    Bo Liu
    Yanxiao Zhao
    Wei Xiang
    Scientific Reports, 13
  • [39] A scene text detection based on dual-path feature fusion
    Zhao P.
    Xu B.-P.
    Yan S.
    Liu Z.-Y.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (09): : 2179 - 2186
  • [40] Dual-Path Network Model Fusing Frequency-Domain Features to Diagnose COVID-19
    Yang, Yuhang
    Lin, Min
    Wang, Changying
    Zhong, Yiwen
    Computer Engineering and Applications, 2024, 59 (05) : 321 - 327