A Real-Time Sound Source Localization System for Robotic Vacuum Cleaners With a Microphone Array

Cited by: 0
Authors
Kim, Jun Hyung [1]
Kim, Taehan [1]
Kim, Seokhyun [2]
Song, Ju-Man [2]
Park, Yongjin [2]
Kim, Minook [2]
Son, Jungkwan [2]
Jeong, Jimann [2]
Park, Hyung-Min [1]
Affiliations
[1] Sogang Univ, Dept Elect Engn, Seoul 04107, South Korea
[2] LG Elect CTO, Seoul 06772, South Korea
Keywords
Speech enhancement; Robots; Real-time systems; Location awareness; Computational modeling; Correlation; Vacuum systems; Sensors; Direction-of-arrival estimation; Vectors; Deep neural networks (DNNs); ego-noise reduction; microphone array; real-time speech enhancement; sound source localization (SSL); tracking
DOI
10.1109/JSEN.2024.3500007
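Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809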
Abstract
With advances in artificial intelligence (AI) technology, home appliances are becoming more capable of enhancing our quality of life. Many smart devices support speech interfaces, including voice commands and user location tracking. However, robotic vacuum cleaners generate strong ego-noise that distorts the microphone signals, making it difficult to estimate the user's location. To solve this problem, we propose a real-time sound source localization (SSL) system for a robotic vacuum cleaner equipped with a microphone array. The system consists of speech enhancement, voice activity detection (VAD), and SSL modules. The speech enhancement module uses TRU-Net-Light, which requires less computation than the tiny recurrent U-Net (TRU-Net) while achieving similar speech enhancement performance. TRU-Net-Light reduces the number of channels to shrink the model and applies frequency-axis multihead self-attention to boost representational capacity. The finite-state-machine-based VAD detects voice-active periods from the output of the speech enhancement module. Furthermore, we present a mask-weighted difference correlation vector and singular value decomposition (SVD) with smoothed coherence transform (DSVD-SCOT) method that achieves robust localization performance in severely noisy environments. On the robotic vacuum cleaner used in the experiments, the SSL system achieved localization accuracies of 97.9% and 84.0% at signal-to-noise ratios (SNRs) of -3 and -8 dB, respectively. The proposed system ran in real time, with a real-time factor (RTF) of 0.378, on a single Kryo 585 Silver core of the RB5 platform. A demo of the proposed system is available at https://youtu.be/3d3Cr-cs9aY.
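As background for the SCOT weighting named in the abstract, the following is a minimal sketch of plain two-microphone generalized cross-correlation with smoothed coherence transform (GCC-SCOT) weighting. It is not the authors' DSVD-SCOT: it uses no mask weighting, difference correlation vector, or SVD. The function names, frame sizes, lag range, microphone spacing, and far-field geometry are all assumptions chosen for illustration.

```python
import numpy as np


def gcc_scot_delay(x, y, frame_len=512, hop=256, max_lag=16, eps=1e-12):
    """Estimate the delay of y relative to x, in samples, with GCC-SCOT.

    Illustrative only: frame_len, hop, and max_lag are assumed values.
    """
    window = np.hanning(frame_len)
    n_fft = 2 * frame_len  # zero-pad so the correlation does not wrap around
    n_bins = n_fft // 2 + 1
    cross = np.zeros(n_bins, dtype=complex)  # accumulated cross-spectrum
    pxx = np.zeros(n_bins)                   # accumulated auto-spectrum of x
    pyy = np.zeros(n_bins)                   # accumulated auto-spectrum of y

    # Accumulate spectra over frames; SCOT needs smoothed (averaged) spectra.
    for start in range(0, len(x) - frame_len + 1, hop):
        X = np.fft.rfft(window * x[start:start + frame_len], n_fft)
        Y = np.fft.rfft(window * y[start:start + frame_len], n_fft)
        cross += Y * np.conj(X)
        pxx += np.abs(X) ** 2
        pyy += np.abs(Y) ** 2

    # SCOT weighting: normalize by the geometric mean of the auto-spectra.
    weighted = cross / (np.sqrt(pxx * pyy) + eps)
    cc = np.fft.irfft(weighted, n_fft)

    # Reorder so the lags run from -max_lag to +max_lag, then pick the peak.
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    return int(np.argmax(cc)) - max_lag


def delay_to_angle(delay_samples, fs, mic_distance, c=343.0):
    """Convert a delay to a far-field arrival angle in degrees (two mics)."""
    s = np.clip(delay_samples / fs * c / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))
```

A positive return value from `gcc_scot_delay(x, y)` means the sound reached the microphone that recorded `y` later than the one that recorded `x`. Averaging the spectra over several frames is what distinguishes SCOT from phase-transform (PHAT) weighting here; with a single frame the two weightings coincide.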
Pages: 1243-1252
Page count: 10