SALADNET: SELF-ATTENTIVE MULTISOURCE LOCALIZATION IN THE AMBISONICS DOMAIN

被引：8

作者：

Grumiaux, Pierre-Amaury ^{[1
]}

Kitic, Srdan ^{[1
]}

Srivastava, Prerak ^{[2
]}

Girin, Laurent ^{[3
]}

Guerin, Alexandre ^{[1
]}

机构：

[1] Orange Labs, Cesson Sevigne, France

[2] Univ Lorraine, INRIA, Nancy, France

[3] Univ Grenoble Alpes, GIPSA Lab, CNRS, Grenoble INP, Grenoble, France

来源：

2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2021年

关键词：

Sound source localization; neural networks; self-attention; Ambisonics; parallel computing;

D O I：

10.1109/WASPAA52581.2021.9632737

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this work, we propose a novel self-attention based neural network for robust multi-speaker localization from Ambisonics recordings. Starting from a state-of-the-art convolutional recurrent neural network, we investigate the benefit of replacing the recurrent layers by self-attention encoders, inherited from the Transformer architecture. We evaluate these models on synthetic and real-world data, with up to 3 simultaneous speakers. The obtained results indicate that the majority of the proposed architectures either perform on par, or outperform the CRNN baseline, especially in the multisource scenario. Moreover, by avoiding the recurrent layers, the proposed models lend themselves to parallel computing, which is shown to produce considerable savings in execution time.

引用

页码：336 / 340

页数：5

共 50 条

[31] SELF-ATTENTIVE SENTIMENTAL SENTENCE EMBEDDING FOR SENTIMENT ANALYSIS
Lin, Sheng-Chieh
Su, Wen-Yuh
Chien, Po-Chuan
Tsai, Ming-Feng
Wang, Chuan-Ju
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1678 - 1682
[32] SAFE: Self-Attentive Function Embeddings for Binary Similarity
Massarelli, Luca
Di Luna, Giuseppe Antonio
Petroni, Fabio
Baldoni, Roberto
Querzoni, Leonardo
DETECTION OF INTRUSIONS AND MALWARE, AND VULNERABILITY ASSESSMENT (DIMVA 2019), 2019, 11543 : 309 - 329
[33] Self-Attentive Models for Real-Time Malware Classification
Lu, Qikai
Zhang, Hongwen
Kinawi, Husam
Niu, Di
IEEE ACCESS, 2022, 10 : 95970 - 95985
[34] Global-Locally Self-Attentive Dialogue State Tracker
Zhong, Victor
Xiong, Caiming
Socher, Richard
PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1458 - 1467
[35] SANDGLASSET: A LIGHT MULTI-GRANULARITY SELF-ATTENTIVE NETWORK FOR TIME-DOMAIN SPEECH SEPARATION
Lam, Max W. Y.
Wang, Jun
Su, Dan
Yu, Dong
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5759 - 5763
[36] A Self-Attentive Model with Gate Mechanism for Spoken Language Understanding
Li, Changliang
Li, Liang
Qi, Ji
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3824 - 3833
[37] Self-Attentive Contrastive Learning for Conditioned Periocular and Face Biometrics
Ng, Tiong-Sik
Chai, Jacky Chen Long
Low, Cheng-Yaw
Beng Jin Teoh, Andrew
IEEE Transactions on Information Forensics and Security, 2024, 19 : 3251 - 3264
[38] Self-Attentive Attributed Network Embedding Through Adversarial Learning
Yu, Wenchao
Cheng, Wei
Aggarwal, Charu
Zong, Bo
Chen, Haifeng
Wang, Wei
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 758 - 767
[39] Self-Attentive Recommendation for Multi-Source Review Package
Chen, Pin-Yu
Chen, Yu-Hsiu
Shuai, Hong-Han
Chang, Yung-Ju
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[40] SELF-ATTENTIVE NETWORKS FOR ONE-SHOT IMAGE RECOGNITION
Fang, Pin
Wang, Yisen
Luo, Yuan
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 934 - 939

← 1 2 3 4 5 →