Environment Compensation Using A Posteriori Statistics for Distant Speech-based Human-Robot Interaction

被引:0
|
作者
Gomez, Randy [1 ]
Nakamura, Keisuke [1 ]
机构
[1] Honda Res Inst Japan Ltd Co, Wako, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distant hands-free interaction with robot using voice is a difficult task. As the speech signal is reflected within the physical constraints of the room, the speech is contaminated by the environment and speech enhancement alone is not enough to rectify this mismatch in condition. The use of adaptation techniques may result to a robust model adapted to the new environment condition using some small amount of adaptation data. However, the dynamic nature of the interaction between the human and the robot (e.g., constant change in positions) implies a dynamic change in environment condition. Adaptation data collection corresponding to every change may become impractical. In this paper we propose a compensation mechanism short of being an adaptation technique. Our method can cope with the change in environment condition without the use of actual adaptation data. First, we estimate the a posteriori statistics, known in advance and empirically verified to perform well. Then, we infuse the a posteriori statistics to the problematic statistics (i.e., due to environment change) to give a boost to the latter. Weights are selected offline by minimizing the error of the envelopes between the posteriori and the problematic statistics for efficient estimation and fast update. Experimental results in real human-robot environment conditions show that our proposed method performs better than existing adaptation techniques in most cases without the need of adaptation data.
引用
收藏
页码:1211 / 1216
页数:6
相关论文
共 50 条
  • [1] Temporal Smearing Compensation in Reverberant Environment for Speech-based Human-Robot Interaction
    Gomez, Randy
    Nakamura, Keisuke
    Mizumoto, Takeshi
    Nakadai, Kazuhiro
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3347 - 3353
  • [2] Speech-based Human-Robot Interaction Robust to Acoustic Reflections in Real Environment
    Gomez, Randy
    Inoue, Koji
    Nakamura, Keisuke
    Mizumoto, Takeshi
    Nakadai, Kazuhiro
    [J]. 2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 1367 - 1373
  • [3] A tension-moderating mechanism for promoting speech-based human-robot interaction
    Kanda, T
    Iwase, K
    Shiomi, M
    Ishiguro, H
    [J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 527 - 532
  • [4] Human Interaction Smart Subsystem-Extending Speech-Based Human-Robot Interaction Systems with an Implementation of External Smart Sensors
    Podpora, Michal
    Gardecki, Arkadiusz
    Beniak, Ryszard
    Klin, Bartlomiej
    Lopez Vicario, Jose
    Kawala-Sterniuk, Aleksandra
    [J]. SENSORS, 2020, 20 (08)
  • [5] Multi-party Human-Robot Interaction with Distant-Talking Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    Nakamura, Keisuke
    Nakadai, Kazuhiro
    [J]. HRI'12: PROCEEDINGS OF THE SEVENTH ANNUAL ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2012, : 439 - 446
  • [6] Utilizing Visual Cues in Robot Audition for Sound Source Discrimination in Speech-based Human-Robot Communication
    Gomez, Randy
    Ivanchuk, Levko
    Nakamura, Keisuke
    Mizumoto, Takeshi
    Nakadai, Kazuhiro
    [J]. 2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 4216 - 4222
  • [7] Toward a quizmaster robot for speech-based multiparty interaction
    Nishimuta, Izaya
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Okuno, Hiroshi G.
    [J]. ADVANCED ROBOTICS, 2015, 29 (18) : 1205 - 1219
  • [8] Human-robot interaction based on human emotions extracted from speech
    Kirandziska, Vesna
    Ackovska, Nevena
    [J]. 2012 20TH TELECOMMUNICATIONS FORUM (TELFOR), 2012, : 1381 - 1384
  • [9] Research on multimodal human-robot interaction based on speech and gesture
    Deng Yongda
    Li Fang
    Xin Huang
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2018, 72 : 443 - 454
  • [10] Space, Speech, and Gesture in Human-Robot Interaction
    Mead, Ross
    [J]. ICMI '12: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2012, : 333 - 336