MULTI-VIEW NETWORKS FOR MULTI-CHANNEL AUDIO CLASSIFICATION

被引:0
|
作者
Casebeer, Jonah [1 ]
Wang, Zhepei [1 ]
Smaragdis, Paris [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
Sound recognition; IoT sensing; neural networks;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we introduce the idea of multi-view networks for sound classification with multiple sensors. We show how one can build a multi-channel sound recognition model trained on a fixed number of channels, and deploy it to scenarios with arbitrary (and potentially dynamically changing) number of input channels and not observe degradation in performance. We demonstrate that at inference time you can safely provide this model all available channels as it can ignore noisy information and leverage new information better than standard baseline approaches. The model is evaluated in both an anechoic environment and in rooms generated by a room acoustics simulator. We demonstrate that this model can generalize to unseen numbers of channels as well as unseen room geometries.
引用
收藏
页码:940 / 944
页数:5
相关论文
共 50 条
  • [1] Multi-view video and multi-channel audio broadcasting system
    Oh, Kwan-Jung
    Kim, Manbae
    Yoon, Jae Sam
    Kim, Jongryool
    Park, Ilkwon
    Lee, Seungwon
    Lee, Cheon
    Heo, Jin
    Lee, Sang-Beom
    Park, Pil-Kyu
    Na, Sang-Tae
    Hyun, Myung-Han
    Kim, JongWon
    Byun, Hyeran
    Kim, Hong Kook
    Ho, Yo-Sung
    [J]. 2007 3DTV CONFERENCE, 2007, : 165 - +
  • [2] Parameter-based Multi-channel Audio Panning for Multi-view Broadcasting Systems
    Kim, Yong Guk
    Kim, Hong Kook
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING SYMPOSIA, VOLS 1-5, PROCEEDINGS, 2008, : 270 - 273
  • [3] MULTI-VIEW AUDIO AND MUSIC CLASSIFICATION
    Phan, Huy
    Le Nguyen, Huy
    Chen, Oliver Y.
    Pham, Lam
    Koch, Philipp
    McLoughlin, Ian
    Mertins, Alfred
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 611 - 615
  • [4] Multi-channel Pose-aware Convolution Neural Networks for Multi-view Facial Expression Recognition
    Liu, Yuanyuan
    Zeng, Jiabei
    Shan, Shiguang
    Zheng, Zhuo
    [J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 458 - 465
  • [5] Multi-view classification with convolutional neural networks
    Seeland, Marco
    Maeder, Patrick
    [J]. PLOS ONE, 2021, 16 (01):
  • [6] Multi-Channel Augmented Graph Embedding Convolutional Network for Multi-View Clustering
    Lin, Renjie
    Du, Shide
    Wang, Shiping
    Guo, Wenzhong
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (04): : 2239 - 2249
  • [7] Multi-channel audio statistical restoration
    Liu, Yinhong
    Godsill, Simon
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [8] MCMVDRP: a multi-channel multi-view deep learning framework for cancer drug response prediction
    Li, Xiangyu
    Shi, Xiumin
    Li, Yuxuan
    Wang, Lu
    [J]. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2024,
  • [9] Equivariant Multi-View Networks
    Esteves, Carlos
    Xu, Yinshuang
    Allen-Blanchette, Christine
    Daniilidis, Kostas
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1568 - 1577
  • [10] Multimodal correlation deep belief networks for multi-view classification
    Zhang, Nan
    Ding, Shifei
    Liao, Hongmei
    Jia, Weikuan
    [J]. APPLIED INTELLIGENCE, 2019, 49 (05) : 1925 - 1936