Neural Network Adaptation and Data Augmentation for Multi-Speaker Direction-of-Arrival Estimation

被引:22
|
作者
He, Weipeng [1 ,2 ]
Motlicek, Petr [1 ]
Odobez, Jean-Marc [1 ,2 ]
机构
[1] Idiap Res Inst, CH-1920 Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
基金
欧盟地平线“2020”;
关键词
Data models; Adaptation models; Direction-of-arrival estimation; Neural networks; Location awareness; Data collection; Robots; DOA estimation; data augmentation; sound source localization; weakly-supervised learning; LOCALIZATION; CLASSIFICATION;
D O I
10.1109/TASLP.2021.3060257
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep neural networks have been successfully applied to sound direction-of-arrival estimation under challenging conditions. However, such a learning-based approach requires a large amount of labeled training data, which is difficult to acquire. To address this problem, we propose a novel approach for multi-speaker direction-of-arrival estimation with data augmentation and weakly-supervised domain adaptation. We generate source domain data with simulation, and collect real data annotated with the number of sound sources as the weak labels. The real data are further augmented by mixing single-source segments. Then, weakly-supervised domain adaptation is applied to models pre-trained on the simulated data. We define a loss function for the adaptation process which exploits the weak labels and the mixture component information in the augmented data. Experiments with real robot audio data show that our proposed approach achieves similar performance as if the fully-labeled real data are used. This paper suggests an effective development procedure for DOA estimation models applied to new types of microphone arrays with minimal data collection efforts.
引用
收藏
页码:1303 / 1317
页数:15
相关论文
共 50 条
  • [41] Deep Convolutional Network-Assisted Multiple Direction-of-Arrival Estimation
    Ma, Jie
    Wang, Min
    Chen, Yiyi
    Wang, Haiming
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 576 - 580
  • [42] Multi-Frequency Distributed Arrays for Underdetermined Direction-of-Arrival Estimation
    Wang, Yi
    Chen, Baixiao
    Yang, Minglei
    Ma, Yan
    [J]. 2016 CIE INTERNATIONAL CONFERENCE ON RADAR (RADAR), 2016,
  • [43] Multi-Task Bayesian Compressive Sensing for Direction-of-Arrival Estimation
    Carlin, M.
    Rocca, P.
    Oliveri, G.
    Massa, A.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON WIRELESS INFORMATION TECHNOLOGY AND SYSTEMS (ICWITS), 2012,
  • [44] Multi-Mode Antenna Specific Direction-of-Arrival Estimation Schemes
    Poehlmann, Robert
    Zhang, Siwei
    Yinusa, Kazeem A.
    Dammann, Armin
    [J]. 2017 IEEE 7TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2017,
  • [45] A Novel Three-Dimensional Direction-of-Arrival Estimation Approach Using a Deep Convolutional Neural Network
    Mylonakis, Constantinos M.
    Zaharis, Zaharias D.
    [J]. IEEE OPEN JOURNAL OF VEHICULAR TECHNOLOGY, 2024, 5 : 643 - 657
  • [46] Direction-of-Arrival Estimation over Sea Surface from Radar Scattering Based on Convolutional Neural Network
    Zhao, Xiuyi
    Yang, Ying
    Chen, Kun-Shan
    [J]. REMOTE SENSING, 2021, 13 (14)
  • [47] Estimation of range and bearing of RF emitters using direction-of-arrival data
    Wasylkiwskyj, W
    [J]. PROCEEDINGS ELMAR-2004: 46TH INTERNATIONAL SYMPOSIUM ELECTRONICS IN MARINE, 2004, : 22 - 29
  • [48] A new approach for coherent direction-of-arrival estimation
    Lai, WK
    Ching, PC
    [J]. ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D9 - D12
  • [49] HOS-BASED DIRECTION-OF-ARRIVAL ESTIMATION
    LEYMAN, AR
    DURRANI, TS
    [J]. ELECTRONICS LETTERS, 1994, 30 (07) : 540 - 542
  • [50] Direction-of-Arrival Estimation in the Presence of Phase Noise
    Lu, Rui
    Zhang, Ming
    Chen, Xiaoming
    Zhang, Anxue
    Svensson, Tommy
    [J]. IEEE COMMUNICATIONS LETTERS, 2020, 24 (08) : 1710 - 1714