Speech Recognition and Spoken Language Understanding for Mobile Personal Assistants: a Case Study of "Shabette Concier"

被引:4
|
作者
Tsujino, Kosuke [1 ]
Nakashima, Yusuke [1 ]
Iizuka, Shinya [1 ]
Isoda, Yoshinori [2 ]
机构
[1] NTT DOCOMO Inc, Res Labs, Yokosuka, Kanagawa, Japan
[2] NTT DOCOMO Inc, Serv & Solut Dev Dept, Yokosuka, Kanagawa, Japan
关键词
speech recognition; natural language processing; spoken language understanding; big data;
D O I
10.1109/MDM.2013.98
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Recent success of mobile personal assistants deeply relies on the advancement in statistical automatic speech recognition (ASR) and spoken language understanding (SLU) technologies. This article describes practical design of ASR and SLU systems for "Shabette Concier", a voice-based personal assistant application commercially released by NTT docomo in Japan in March 2012. Utilization of a large amount of field speech data gathered from the actual service is focused and the potential of big data is discussed.
引用
收藏
页码:225 / 228
页数:4
相关论文
共 50 条
  • [1] SPOKEN LANGUAGE UNDERSTANDING WITHOUT SPEECH RECOGNITION
    Chen, Yuan-Ping
    Price, Ryan
    Bangalore, Srinivas
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6189 - 6193
  • [2] Language Modeling for Speech Recognition of Spoken Cantonese
    Yeung, Yu Ting
    Cao, Houwei
    Zheng, N. H.
    Lee, Tan
    Ching, P. C.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
  • [3] Spoken Language Understanding with a Novel Simultaneous Recognition Technique for Intelligent Personal Assistant Software
    Lee, Changsu
    Ko, Youngjoong
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (03)
  • [4] End-to-End Spoken Language Understanding for Generalized Voice Assistants
    Saxon, Michael
    Choudhary, Samridhi
    McKenna, Joseph P.
    Mouchtaris, Athanasios
    [J]. INTERSPEECH 2021, 2021, : 4738 - 4742
  • [5] Speech Recognition Research on Uyghur Accent Spoken Language
    Yang, Yating
    Ma, Bo
    Tang, Xinyu
    Turghun, Osman
    [J]. 2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 163 - 166
  • [6] A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
    Wang, Pu
    BabaAli, Bagher
    Van Hamme, Hugo
    [J]. INTERSPEECH 2021, 2021, : 36 - 40
  • [7] A STUDY OF THE SPOKEN LANGUAGE AND COLLOQUIAL SPEECH
    BORETTIDEMACCHIA, SH
    [J]. ESTUDIOS FILOLOGICOS, 1985, (20): : 115 - 126
  • [8] Japanese Personal Name and Location Search for Spoken Utterances by Using Hierarchical Language Model of Speech Recognition
    Hu, Xinhui
    Wu, Youzheng
    Kashioka, Hideki
    [J]. RECENT ADVANCES OF ASIAN LANGUAGE PROCESSING TECHNOLOGIES, 2008, : 193 - 198
  • [9] Predicting Interaction Quality of Conversational Assistants With Spoken Language Understanding Model Confidences
    Gao, Yue
    Piovano, Enrico
    Soliman, Tamer
    Moniruzzaman, Monir
    Kumar, Anoop
    Bradford, Melanie
    Nandi, Subhrangshu
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4581 - 4587
  • [10] A CONVERSATIONAL NEURAL LANGUAGE MODEL FOR SPEECH RECOGNITION IN DIGITAL ASSISTANTS
    Cho, Eunjoon
    Kumar, Shankar
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5784 - 5788