SALADNET: SELF-ATTENTIVE MULTISOURCE LOCALIZATION IN THE AMBISONICS DOMAIN

被引:8
|
作者
Grumiaux, Pierre-Amaury [1 ]
Kitic, Srdan [1 ]
Srivastava, Prerak [2 ]
Girin, Laurent [3 ]
Guerin, Alexandre [1 ]
机构
[1] Orange Labs, Cesson Sevigne, France
[2] Univ Lorraine, INRIA, Nancy, France
[3] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, Grenoble, France
关键词
Sound source localization; neural networks; self-attention; Ambisonics; parallel computing;
D O I
10.1109/WASPAA52581.2021.9632737
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we propose a novel self-attention based neural network for robust multi-speaker localization from Ambisonics recordings. Starting from a state-of-the-art convolutional recurrent neural network, we investigate the benefit of replacing the recurrent layers by self-attention encoders, inherited from the Transformer architecture. We evaluate these models on synthetic and real-world data, with up to 3 simultaneous speakers. The obtained results indicate that the majority of the proposed architectures either perform on par, or outperform the CRNN baseline, especially in the multisource scenario. Moreover, by avoiding the recurrent layers, the proposed models lend themselves to parallel computing, which is shown to produce considerable savings in execution time.
引用
收藏
页码:336 / 340
页数:5
相关论文
共 50 条
  • [31] SELF-ATTENTIVE SENTIMENTAL SENTENCE EMBEDDING FOR SENTIMENT ANALYSIS
    Lin, Sheng-Chieh
    Su, Wen-Yuh
    Chien, Po-Chuan
    Tsai, Ming-Feng
    Wang, Chuan-Ju
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1678 - 1682
  • [32] SAFE: Self-Attentive Function Embeddings for Binary Similarity
    Massarelli, Luca
    Di Luna, Giuseppe Antonio
    Petroni, Fabio
    Baldoni, Roberto
    Querzoni, Leonardo
    DETECTION OF INTRUSIONS AND MALWARE, AND VULNERABILITY ASSESSMENT (DIMVA 2019), 2019, 11543 : 309 - 329
  • [33] Self-Attentive Models for Real-Time Malware Classification
    Lu, Qikai
    Zhang, Hongwen
    Kinawi, Husam
    Niu, Di
    IEEE ACCESS, 2022, 10 : 95970 - 95985
  • [34] Global-Locally Self-Attentive Dialogue State Tracker
    Zhong, Victor
    Xiong, Caiming
    Socher, Richard
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1458 - 1467
  • [35] SANDGLASSET: A LIGHT MULTI-GRANULARITY SELF-ATTENTIVE NETWORK FOR TIME-DOMAIN SPEECH SEPARATION
    Lam, Max W. Y.
    Wang, Jun
    Su, Dan
    Yu, Dong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5759 - 5763
  • [36] A Self-Attentive Model with Gate Mechanism for Spoken Language Understanding
    Li, Changliang
    Li, Liang
    Qi, Ji
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3824 - 3833
  • [37] Self-Attentive Contrastive Learning for Conditioned Periocular and Face Biometrics
    Ng, Tiong-Sik
    Chai, Jacky Chen Long
    Low, Cheng-Yaw
    Beng Jin Teoh, Andrew
    IEEE Transactions on Information Forensics and Security, 2024, 19 : 3251 - 3264
  • [38] Self-Attentive Attributed Network Embedding Through Adversarial Learning
    Yu, Wenchao
    Cheng, Wei
    Aggarwal, Charu
    Zong, Bo
    Chen, Haifeng
    Wang, Wei
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 758 - 767
  • [39] Self-Attentive Recommendation for Multi-Source Review Package
    Chen, Pin-Yu
    Chen, Yu-Hsiu
    Shuai, Hong-Han
    Chang, Yung-Ju
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [40] SELF-ATTENTIVE NETWORKS FOR ONE-SHOT IMAGE RECOGNITION
    Fang, Pin
    Wang, Yisen
    Luo, Yuan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 934 - 939