Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications

被引:0
|
作者
Yao Qian
Rutuja Ubale
Patrick Lange
Keelan Evanini
Vikram Ramanarayanan
Frank K. Soong
机构
[1] Educational Testing Service Research,
[2] Educational Testing Service Research,undefined
[3] University of California,undefined
[4] Microsoft Research Asia,undefined
来源
关键词
Spoken language understanding; Human-machine conversational systems; Computer assisted language learning; End-to-end modeling; Education;
D O I
暂无
中图分类号
学科分类号
摘要
Spoken language understanding (SLU) in human machine conversational systems is the process of interpreting the semantic meaning conveyed by a user’s spoken utterance. Traditional SLU approaches transform the word string transcribed by an automatic speech recognition (ASR) system into a semantic label that determines the machine’s subsequent response. However, the robustness of SLU results can suffer in the context of a human-machine conversation-based language learning system due to the presence of ambient noise, heavily accented pronunciation, ungrammatical utterances, etc. To address these issues, this paper proposes an end-to-end (E2E) modeling approach for SLU and evaluates the semantic labeling performance of a bidirectional LSTM-RNN with input at three different levels: acoustic (filterbank features), phonetic (subphone posteriorgrams), and lexical (ASR hypotheses). Experimental results for spoken responses collected in a dialog application designed for English learners to practice job interviewing skills show that multi-level BLSTM-RNNs can utilize complementary information from the three different levels to improve the semantic labeling performance. An analysis of results on OOV utterances, which can be common in a conversation-based dialog system, also indicates that using subphone posteriorgrams outperforms ASR hypotheses and incorporating the lower-level features for semantic labeling can be advantageous to improving the final SLU performance.
引用
收藏
页码:805 / 817
页数:12
相关论文
共 50 条
  • [21] Spoken language understanding
    Wang, YY
    Deng, L
    Acero, A
    IEEE SIGNAL PROCESSING MAGAZINE, 2005, 22 (05) : 16 - 31
  • [22] Semantics synchronous understanding for robust spoken language applications
    Wang, KS
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 640 - 645
  • [23] Decolonizing Language Resources in The Human-Machine Era
    Gammelgaard, Anna Osterskov
    Pedersen, Casper Gjodvad
    Kaspersen, Emilie Strudahl
    Thomsen, Marius Risbaek
    Samson, Jonathan Kok
    Fabricius, Anne H.
    INTERVENTIONS-INTERNATIONAL JOURNAL OF POSTCOLONIAL STUDIES, 2024, 26 (08): : 1189 - 1210
  • [24] MULTITASK LEARNING FOR LOW RESOURCE SPOKEN LANGUAGE UNDERSTANDING
    Meeus, Quentin
    Moens, Marie Francine
    Van Hamme, Hugo
    INTERSPEECH 2022, 2022, : 4073 - 4077
  • [25] A Joint Learning Framework With BERT for Spoken Language Understanding
    Zhang, Zhichang
    Zhang, Zhenwen
    Chen, Haoyuan
    Zhang, Zhiman
    IEEE ACCESS, 2019, 7 : 168849 - 168858
  • [26] Research on Spoken Language Understanding Based on Deep Learning
    Yanli Hui
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [27] Spoken language understanding using weakly supervised learning
    Wu, Wei-Lin
    Lu, Ru-Zhan
    Duan, Jian-Yong
    Liu, Hui
    Gao, Feng
    Chen, Yu-Quan
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 358 - 382
  • [28] An Active Learning Approach for Statistical Spoken Language Understanding
    Garcia, Fernando
    Hurtado, Lluis-F.
    Sanchis, Emilio
    Segarra, Encarna
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2011, 7042 : 565 - 572
  • [29] An analysis of observation length requirements for machine understanding of human behaviors from spoken language
    Chakravarthula, Sandeep Nallan
    Baucom, Brian R. W.
    Narayanan, Shrikanth
    Georgiou, Panayiotis
    COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [30] Identification of Spoken Language using Machine Learning Approach
    Shahriar, Md Asif
    Aziz, Iftekhar
    Banik, Shovan
    Sattar, Abdus
    2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,