A Spoken Dialog System Speech Interface Based on a Microphone Array

Cited by: 0
Authors
Coelho, Gustavo Esteves [1]
Serralheiro, Antonio Joaquim [1]
Neto, Joao Paulo [1]
Affiliations
[1] INESC ID, Spoken Language Syst Lab L2F, P-1000029 Lisbon, Portugal
Keywords
Home Automation; Microphone Arrays; Automatic Speech Recognition
DOI
Not available
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present a Spoken Dialog System (SDS) with a Microphone Array (MA). Our goal is to create a hands-free home automation system with a speech interface to control home devices. The MA interface provides ubiquitous speech acquisition for the SDS. The implemented system allows any user, in any position in a room, to establish a dialog with a virtual butler that can control a wide range of home appliances (room lights, air conditioner, window shades and hi-fi features). The virtual butler has a 3D animated face that, while a dialog is in progress, turns toward the user's position and responds to his or her commands with synthesized speech. The presented results show that the MA performs well as a distant-talk interface and is a step towards more realistic human-machine interaction.
Pages: 21-30
Number of pages: 10