A speech-centric perspective for human-computer interface

被引:0
|
作者
Deng, L [1 ]
Acero, A [1 ]
Wang, Y [1 ]
Wang, K [1 ]
Hon, H [1 ]
Droppo, J [1 ]
Mahajan, M [1 ]
Huang, XD [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
关键词
human-computer; interaction; speech-centric multimodal interface; robust speech recognition; spoken language understanding; MiPad;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speech technology has been playing a central role in enhancing human-machine interactions, especially for small devices for which GUI has obvious limitations. The speech-centric perspective for human-computer interface advanced in this paper derives from the view that speech is the only natural and expressive modality to enable people to access information from and to interact with any device. In this paper, we describe the work conducted at Microsoft Research, in the project codenamed Dr. no, aimed at the development of enabling technologies for speech-centric multimodal human-computer interaction. In particular, we present MiPad as the first Dr. Who's application that addresses specifically the mobile user interaction scenario. MiPad is a wireless mobile PDA prototype that enables users to accomplish many common tasks using a multimodal spoken language interface and wireless-data technologies. It fully integrates continuous speech recognition and spoken language understanding, and provides a novel solution to the current prevailing problem of pecking with tiny styluses or typing on minuscule keyboards in today's PDAs or smart phones.
引用
收藏
页码:263 / 267
页数:5
相关论文
共 50 条
  • [1] A Speech-Centric Perspective for Human-Computer Interface: A Case Study
    Li Deng
    Dong Yu
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2005, 41 : 255 - 269
  • [2] A speech-centric perspective for human-computer interface: A case study
    Deng, L
    Yu, D
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2005, 41 (03): : 255 - 269
  • [3] Speech recognition in the human-computer interface
    Rebman, CM
    Aiken, MW
    Cegielski, CG
    [J]. INFORMATION & MANAGEMENT, 2003, 40 (06) : 509 - 519
  • [4] Design of an Embedded Speech-Centric Interface for Applications in Handheld Terminals
    Gallardo-Antolin, Ascension
    Garcia-Moral, Ana I.
    Pereiro-Estevan, Yago
    Diaz-de-Maria, Fernando
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2013, 28 (02) : 24 - 33
  • [5] Effects of Stereoscopy on a Human-Computer Interface for Network Centric Operations
    Zocco, Alessandro
    Cannone, Davide
    De Paolis, Lucio Tommaso
    [J]. PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 249 - 255
  • [6] Speech-centric multimodal interfaces
    Flanagan, JL
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2004, 21 (06) : 76 - 81
  • [7] Context-Centric Speech-Based Human-Computer Interaction
    Hung, Victor C.
    Gonzalez, Avelino J.
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2013, 28 (10) : 1010 - 1037
  • [8] A Paradigm for Mobile Speech-Centric Services
    Larsen, Lars Bo
    Jensen, Kasper L.
    Larsen, Soren
    Rasmussen, Morten
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2344 - 2347
  • [9] Enhanced Human-Computer Speech Interface Using Wavelet Computing
    Ayat, Saeed
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON VIRTUAL ENVIRONMENTS, HUMAN-COMPUTER INTERFACES AND MEASUREMENT SYSTEMS, 2008, : 37 - 40
  • [10] Human-computer interactions in speech therapy using a blowing interface
    Ruminski, Jacek
    Bujnowski, Adam
    Wtorek, Jerzy
    [J]. 2014 7TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI), 2014, : 178 - 181