Neural Network Model Based on the Tensor Network for Audio Tagging of Domestic Activities

Cited by: 0
Authors
Yang, LiDong [1]
Yue, RenBo [1]
Wang, Jing [2]
Liu, Min [3]
Affiliations
[1] Inner Mongolia Univ Sci & Technol, Sch Informat Engn, Baotou, Peoples R China
[2] Beijing Inst Technol, School Information Elect, Beijing, Peoples R China
[3] China Mobile Res Inst, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
tensor network; matrix product state (MPS); tensor train decomposition; audio tagging; neural network; SIGNAL;
DOI
10.3389/fphy.2022.863291
Chinese Library Classification
O4 [Physics];
Discipline classification code
0702;
Abstract
As populations age, monitoring domestic activities is becoming increasingly important. Audio tagging of domestic activities is well suited to this task when visual data are unavailable because of interference from lighting and the environment. To address this problem, a neural network model based on the tensor network is proposed for audio tagging of domestic activities; it is more interpretable than traditional neural networks. Introducing the tensor network compresses the network parameters and reduces the redundancy of the trained model while maintaining good performance. First, the important features of a Mel spectrogram of the input audio are extracted by convolutional neural networks (CNNs). These features are then mapped into the high-order space corresponding to the tensor network, where the matrix product state (MPS) further extracts and retains the spatial structure information and important features. When the tensor network is used, large patches of the feature data are divided into small, local, orderless patches. The final tagging results are obtained through the MPS layers, which are a tensor network structure based on the tensor train decomposition. The proposed method is evaluated on the DCASE 2018 Challenge Task 5 dataset for monitoring domestic activities. The average F1-score of the proposed model on the test set of the development dataset and validation dataset reached 87.7% and 85.9%, which are 3.2% and 2.8% higher than the baseline system, respectively. These results indicate that the proposed model performs better and more efficiently for audio tagging of domestic activities.
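The abstract attributes the parameter compression to the tensor train decomposition behind the MPS layers. The following is a minimal NumPy sketch of that decomposition on a toy 4-way array, not the authors' model: the function names `tt_decompose` and `tt_reconstruct`, the tensor shape, and the bond rank of 2 are all assumptions chosen for illustration.

```python
import numpy as np


def tt_decompose(tensor, max_rank):
    """Split a d-way array into a chain of 3-way cores via sequential truncated SVDs."""
    dims = tensor.shape
    cores = []
    rank_prev = 1
    remainder = tensor
    for n in dims[:-1]:
        # Fold the previous bond index and the current mode into the rows.
        remainder = remainder.reshape(rank_prev * n, -1)
        u, s, vt = np.linalg.svd(remainder, full_matrices=False)
        rank = min(max_rank, len(s))  # truncate the bond dimension
        cores.append(u[:, :rank].reshape(rank_prev, n, rank))
        # Carry the singular values forward into the rest of the chain.
        remainder = s[:rank, None] * vt[:rank]
        rank_prev = rank
    cores.append(remainder.reshape(rank_prev, dims[-1], 1))
    return cores


def tt_reconstruct(cores):
    """Contract the chain of cores back into the full array."""
    result = cores[0]
    for core in cores[1:]:
        result = np.tensordot(result, core, axes=([-1], [0]))
    return result.squeeze(axis=(0, -1))  # drop the dummy boundary bonds


# Toy data (assumption): a 4 x 4 x 4 x 4 array built to have bond rank 2.
rng = np.random.default_rng(0)
true_cores = [rng.standard_normal(shape)
              for shape in [(1, 4, 2), (2, 4, 2), (2, 4, 2), (2, 4, 1)]]
full = tt_reconstruct(true_cores)

cores = tt_decompose(full, max_rank=2)
n_full = full.size                      # 256 entries in the dense array
n_tt = sum(c.size for c in cores)       # 48 entries across the four cores
print(n_full, n_tt, np.allclose(tt_reconstruct(cores), full))
```

In this toy case the chain of cores stores 48 numbers instead of 256 and reproduces the array exactly, because the array was constructed with bond rank 2. For real feature maps the truncation is generally lossy, and the bond rank trades accuracy against parameter count.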
Pages: 10