Speech Recognition and Spoken Language Understanding for Mobile Personal Assistants: a Case Study of "Shabette Concier"

Cited by: 4

Authors:

Tsujino, Kosuke [1]

Nakashima, Yusuke [1]

Iizuka, Shinya [1]

Isoda, Yoshinori [2]

Affiliations:

[1] NTT DOCOMO Inc, Res Labs, Yokosuka, Kanagawa, Japan

[2] NTT DOCOMO Inc, Serv & Solut Dev Dept, Yokosuka, Kanagawa, Japan

Source:

2013 IEEE 14TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2013), VOL 2 | 2013

Keywords:

speech recognition; natural language processing; spoken language understanding; big data

DOI:

10.1109/MDM.2013.98

Chinese Library Classification (CLC) number:

TN [Electronic technology, communication technology]

Discipline classification code:

0809

Abstract:

The recent success of mobile personal assistants relies heavily on advances in statistical automatic speech recognition (ASR) and spoken language understanding (SLU) technologies. This article describes the practical design of the ASR and SLU systems for "Shabette Concier", a voice-based personal assistant application commercially released by NTT docomo in Japan in March 2012. It focuses on the use of a large amount of field speech data gathered from the actual service and discusses the potential of big data.

Pages: 225-228

Page count: 4
