A Paradigm for Mobile Speech-Centric Services

Authors
Larsen, Lars Bo [1]
Jensen, Kasper L. [1]
Larsen, Soren [1]
Rasmussen, Morten [1]
Affiliations
[1] Univ Aalborg, Dept Elect Syst, Aalborg, Denmark
Keywords
Distributed ASR; client-server architecture; multimodal interaction
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a new paradigm for speech interaction on mobile devices. It introduces a general framework for a distributed architecture, then discusses how to design multimodal interfaces that afford spoken input. The solution is an architecture that supports several alternative GUIs, e.g. with spoken input, stylus input, or a combination of the two. Speech GUIs are designed entirely without widgets that require stylus or button input; instead, they highlight parts of the text to create emphasis and steer the user's attention. The approach is illustrated with a prototype of a car rental application.
Pages: 2344-2347
Number of pages: 4