Multiple Sound Source Localization Exploiting Robot Motion and Approaching Control

Cited by: 0
Authors
Wang, Zhiqing [1, 2]
Zou, Wei [1, 2]
Su, Hu [1, 2]
Guo, Yuxin [1, 2]
Li, Donghui [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bayesian theory; entropy theory; robot audition; robot motion; sound source localization (SSL); source approaching control; TRACKING;
DOI
10.1109/TIM.2023.3298406
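Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract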
Sound source localization (SSL) and source approaching are essential capabilities for robots with auditory sensing. However, most existing SSL methods provide only the direction of arrival (DoA) of a source, not its distance, and cannot cope with a time-varying number of sound sources. This article proposes a novel framework that integrates multisource state estimation and source approaching control to address these issues. First, an auditory probability hypothesis density (A-PHD) method is proposed, which estimates both source direction and distance by combining robot motion information with DoA estimates. A-PHD introduces a new state update and merge strategy based on the characteristics of auditory perception, enabling more accurate estimation of the number of sources and improved real-time performance. Second, a new source approaching control method is proposed that aims both to improve SSL accuracy and to approach the source, using entropy to quantify SSL uncertainty. This method establishes an explicit relationship between the entropy and the robot motion, achieving efficient and smooth source approaching while improving SSL accuracy. Third, the two methods are combined into a unified framework for source localization and approaching. In this framework, the A-PHD results serve as the input to the control method; in turn, the control method uses certain robot motions to improve the accuracy of A-PHD and thereby receives more precise source locations as input. Through this interaction, more accurate SSL and more efficient source approaching are achieved. Experiments on a mobile robot with a six-microphone array, covering multisource state estimation under various DoA errors, robot motions, and missing and false measurements, as well as source approaching, verify the effectiveness of the proposed methods.
Pages: 16
Related papers
50 in total
  • [21] A UNIFIED FRAMEWORK FOR MULTIPLE ARRAYS ON A ROBOT AND APPLICATION TO SOUND LOCALIZATION
    Madmoni, L.
    Barfuss, H.
    Rafaely, B.
    Kellermann, W.
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 66 - 70
  • [22] Interactive Sound Source Localization using Robot Audition for Tablet Devices
    Nakamura, Keisuke
    Sinapayen, Lana
    Nakadai, Kazuhiro
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 6137 - 6142
  • [23] Improved sound source localization in horizontal plane for binaural robot audition
    Ui-Hyun Kim
    Kazuhiro Nakadai
    Hiroshi G. Okuno
    Applied Intelligence, 2015, 42 : 63 - 74
  • [24] A Sound Source Localization Method Based on Microphone Array for Mobile Robot
    Liu, Guanqun
    Yuan, Shengrong
    Wu, Junwei
    Zhang, Rubo
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1621 - 1625
  • [25] A linear phase unwrapping method for binaural sound source localization on a robot
    Li, DF
    Levinson, SE
    2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 19 - 23
  • [26] An Intelligent Robot based on Sound Source Localization and Ultrasound Distance Detection
    Charlie
    Mickey
    Tina
    自动化技术与应用, 2008, (11) : 22 - 28
  • [27] Improved sound source localization in horizontal plane for binaural robot audition
    Kim, Ui-Hyun
    Nakadai, Kazuhiro
    Okuno, Hiroshi G.
    APPLIED INTELLIGENCE, 2015, 42 (01) : 63 - 74
  • [28] Self-organization of a sound source localization robot by perceptual cycle
    Nakashima, H
    Mukai, T
    Ohnishi, N
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 834 - 838
  • [29] Integration of Sound Source Localization and Separation to Improve Dialogue Management on a Robot
    Frechette, Maxime
    Letourneau, Dominic
    Valin, Jean-Marc
    Michaud, Francois
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 2358 - 2363
  • [30] Robust sound source localization using a microphone array on a mobile robot
    Valin, JM
    Michaud, F
    Rouat, J
    Létourneau, D
    IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 1228 - 1233