SALADNET: SELF-ATTENTIVE MULTISOURCE LOCALIZATION IN THE AMBISONICS DOMAIN

被引:8
|
作者
Grumiaux, Pierre-Amaury [1 ]
Kitic, Srdan [1 ]
Srivastava, Prerak [2 ]
Girin, Laurent [3 ]
Guerin, Alexandre [1 ]
机构
[1] Orange Labs, Cesson Sevigne, France
[2] Univ Lorraine, INRIA, Nancy, France
[3] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, Grenoble, France
关键词
Sound source localization; neural networks; self-attention; Ambisonics; parallel computing;
D O I
10.1109/WASPAA52581.2021.9632737
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we propose a novel self-attention based neural network for robust multi-speaker localization from Ambisonics recordings. Starting from a state-of-the-art convolutional recurrent neural network, we investigate the benefit of replacing the recurrent layers by self-attention encoders, inherited from the Transformer architecture. We evaluate these models on synthetic and real-world data, with up to 3 simultaneous speakers. The obtained results indicate that the majority of the proposed architectures either perform on par, or outperform the CRNN baseline, especially in the multisource scenario. Moreover, by avoiding the recurrent layers, the proposed models lend themselves to parallel computing, which is shown to produce considerable savings in execution time.
引用
收藏
页码:336 / 340
页数:5
相关论文
共 50 条
  • [41] Self-Attentive Contrastive Learning for Conditioned Periocular and Face Biometrics
    Ng, Tiong-Sik
    Chai, Jacky Chen Long
    Low, Cheng-Yaw
    Teoh, Andrew Beng Jin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3251 - 3264
  • [42] Sequential Recommendation with Self-Attentive Multi-Adversarial Network
    Ren, Ruiyang
    Liu, Zhaoyang
    Li, Yaliang
    Zhao, Wayne Xin
    Wang, Hui
    Ding, Bolin
    Wen, Ji-Rong
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 89 - 98
  • [43] Interactive Self-Attentive Siamese Network for Biomedical Sentence Similarity
    Li, Zhengguang
    Lin, Hongfei
    Zheng, Wei
    Tadesse, Michael M.
    Yang, Zhihao
    Wang, Jian
    IEEE ACCESS, 2020, 8 (08): : 84093 - 84104
  • [44] Self-Attentive Sequential Recommendation Models Enriched with More Features
    Trong Dang Huu Ho
    Sang Thi Thanh Nguyen
    PROCEEEDINGS OF 2024 8TH INTERNATIONAL CONFERENCE ON DEEP LEARNING TECHNOLOGIES, ICDLT 2024, 2024, : 49 - 55
  • [45] Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
    Zhu, Yingke
    Ko, Tom
    Snyder, David
    Mak, Brian
    Povey, Daniel
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3573 - 3577
  • [46] PROACTIVE: Self-Attentive Temporal Point Process Flows for Activity Sequences
    Gupta, Vinayak
    Bedathur, Srikanta
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 496 - 504
  • [47] Self-Attentive Classification-Based Anomaly Detection in Unstructured Logs
    Nedelkoski, Sasho
    Bogatinovski, Jasmin
    Acker, Alexander
    Cardoso, Jorge
    Kao, Odej
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1196 - 1201
  • [48] Self-attentive Pyramid Network for Single Image De-raining
    Guo, Taian
    Dai, Tao
    Li, Jiawei
    Xia, Shu-Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 390 - 401
  • [49] PAKT: A Position-Aware Self-attentive Approach for Knowledge Tracing
    Ouyang, Yuanxin
    Zhou, Yucong
    Zhang, Hongbo
    Rong, Wenge
    Xiong, Zhang
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT II, 2021, 12749 : 285 - 289
  • [50] TeaDiseaseNet: multi-scale self-attentive tea disease detection
    Sun, Yange
    Wu, Fei
    Guo, Huaping
    Li, Ran
    Yao, Jianfeng
    Shen, Jianbo
    FRONTIERS IN PLANT SCIENCE, 2023, 14