Adaptive categorical understanding for spoken dialogue systems

被引:10
|
作者
Potamianos, A [1 ]
Narayanan, S
Riccardi, G
机构
[1] Tech Univ Crete, Dept Elect & Comp Engn, Khania 73100, Greece
[2] Univ So Calif, Viterbi Sch Engn, Dept Elect Engn, Los Angeles, CA 90089 USA
[3] AT&T Labs Res, Florham Pk, NJ 07932 USA
来源
关键词
acoustic confidence scores; dialogue modeling; language model adaptation; natural language processing; n-gram models; speech understanding;
D O I
10.1109/TSA.2005.845836
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, the speech understanding problem in the context of a spoken dialogue system is formalized in a maximum likelihood framework. Off-line adaptation of stochastic language models that interpolate dialogue state specific and general application-level language models is proposed. Word and dialogue-state n-grams are used for building categorical understanding and dialogue models, respectively. Acoustic confidence scores are incorporated in the understanding formulation. Problems due to data sparseness and out-of-vocabulary words are discussed. The performance of the speech recognition and understanding language models are evaluated with the "Carmen Sandiego" multimodal computer game corpus. Incorporating dialogue models reduces relative understanding error rate by 15%-25%, while acoustic confidence scores achieve a further 10% error reduction for this computer gaming application.
引用
收藏
页码:321 / 329
页数:9
相关论文
共 50 条
  • [1] Evaluation of spoken language understanding and dialogue systems
    Hildebrandt, B
    Rautenstrauch, H
    Sagerer, G
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 685 - 688
  • [2] Adaptive language models for spoken dialogue systems
    Solsona, RA
    Fosler-Lussier, E
    Kuo, HKJ
    Potamianos, A
    Zitouni, I
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 37 - 40
  • [3] Deep Contextual Language Understanding in Spoken Dialogue Systems
    Liu, Chunxi
    Xu, Puyang
    Sarikaya, Ruhi
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 120 - 124
  • [4] Corpus-based discourse understanding in spoken dialogue systems
    Higashinaka, R
    Nakano, M
    Aikawa, K
    [J]. 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 240 - 247
  • [5] An Evaluation Framework for Natural Language Understanding in Spoken Dialogue Systems
    Gordon, Joshua B.
    Passonneau, Rebecca J.
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 72 - 77
  • [6] Spoken Dialogue Systems
    Foster, Mary Ellen
    [J]. COMPUTATIONAL LINGUISTICS, 2010, 36 (04) : 781 - 783
  • [7] Spoken dialogue systems
    Jokinen, Kristiina
    McTear, Michael
    [J]. Synthesis Lectures on Human Language Technologies, 2010, 2 (01): : 1 - 167
  • [8] Spoken Dialogue Systems
    Rosset, Sophie
    Vilnat, Anne
    [J]. TRAITEMENT AUTOMATIQUE DES LANGUES, 2010, 51 (01): : 151 - 154
  • [9] Ranking Multiple Dialogue States by Corpus Statistics to Improve Discourse Understanding in Spoken Dialogue Systems
    Higashinaka, Ryuichiro
    Nakano, Mikio
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (09): : 1771 - 1782
  • [10] Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems
    Janarthanam, Srinivasan
    Lemon, Oliver
    [J]. EMPIRICAL METHODS IN NATURAL LANGUAGE GENERATION: DATA-ORIENTED METHODS AND EMPIRICAL EVALUATION, 2010, 5790 : 67 - +