A speech-centric perspective for human-computer interface

被引:0
|
作者
Deng, L [1 ]
Acero, A [1 ]
Wang, Y [1 ]
Wang, K [1 ]
Hon, H [1 ]
Droppo, J [1 ]
Mahajan, M [1 ]
Huang, XD [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
关键词
human-computer; interaction; speech-centric multimodal interface; robust speech recognition; spoken language understanding; MiPad;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speech technology has been playing a central role in enhancing human-machine interactions, especially for small devices for which GUI has obvious limitations. The speech-centric perspective for human-computer interface advanced in this paper derives from the view that speech is the only natural and expressive modality to enable people to access information from and to interact with any device. In this paper, we describe the work conducted at Microsoft Research, in the project codenamed Dr. no, aimed at the development of enabling technologies for speech-centric multimodal human-computer interaction. In particular, we present MiPad as the first Dr. Who's application that addresses specifically the mobile user interaction scenario. MiPad is a wireless mobile PDA prototype that enables users to accomplish many common tasks using a multimodal spoken language interface and wireless-data technologies. It fully integrates continuous speech recognition and spoken language understanding, and provides a novel solution to the current prevailing problem of pecking with tiny styluses or typing on minuscule keyboards in today's PDAs or smart phones.
引用
收藏
页码:263 / 267
页数:5
相关论文
共 50 条
  • [11] Human-Computer Interaction: Process and Principles of Human-Computer Interface Design
    Chao, Gong
    2009 INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING, PROCEEDINGS, 2009, : 230 - 233
  • [12] Design motivations 101: Perspective on human-computer interface in a larger context
    Mroczkiewicz, KJ
    Fisher, EM
    Ginn, JJH
    Demmon, TL
    EISTA '04: International Conference on Education and Information Systems: Technologies and Applications, Vol, 2, Proceedings: EDUCATION AND TRAINING SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 132 - 137
  • [13] OPTIMIZATION IN SPEECH-CENTRIC INFORMATION PROCESSING: CRITERIA AND TECHNIQUES
    He, Xiaodong
    Deng, Li
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5241 - 5244
  • [14] Line Spectral Frequency-based Noise Suppression for Speech-Centric Interface of Smart Devices
    Jang, Gil-Jin
    Park, Jeong-Sik
    Kim, Ji-Hwan
    Seo, Yong-Ho
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2011, 11 (04) : 3 - 8
  • [15] Toward multimodal human-computer interface
    Sharma, Rajeev
    Pavlovic, Vladimir I.
    Huang, Thomas S.
    Proceedings of the IEEE, 1998, 86 (5 pt 1): : 853 - 869
  • [16] A human-computer interface for a robotic system
    Pereira, J
    Gil, P
    Lopes, FM
    VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 146 - 151
  • [17] ENTERTAINMENT TECHNOLOGY AND THE HUMAN-COMPUTER INTERFACE
    GOODRUM, AA
    BULLETIN OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1994, 21 (01): : 18 - 19
  • [18] Toward multimodal human-computer interface
    Sharma, R
    Pavlovic, VI
    Huang, TS
    PROCEEDINGS OF THE IEEE, 1998, 86 (05) : 853 - 869
  • [19] INNOVATIVE INTERFACE FOR HUMAN-COMPUTER INTERACTION
    Rolshofen, W.
    Dietz, P.
    Schaefer, G.
    9TH INTERNATIONAL DESIGN CONFERENCE - DESIGN 2006, VOLS 1 AND 2, 2006, (36): : 611 - +
  • [20] Displays as Advanced Human-Computer Interface
    Nakatani, Yoshio
    IDW'11: PROCEEDINGS OF THE 18TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2011, : 437 - 440