Context Model Acquisition from Spoken Utterances

被引:2
|
作者
Weigelt, Sebastian [1 ]
Hey, Tobias [1 ]
Tichy, Walter F. [1 ]
机构
[1] Karlsruhe Inst Technol, Inst Program Struct & Data Org, D-76131 Karlsruhe, Germany
关键词
Spoken language interfaces; spoken language understanding; language model; programming in natural language; end-user programming; ontologies; natural language processing; knowledge representation; context model; context; natural language understanding;
D O I
10.1142/S0218194017400058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current systems with spoken language interfaces do not leverage contextual information. Therefore, they struggle with understanding speakers' intentions. We propose a system that creates a context model from user utterances to overcome this lack of information. It comprises eight types of contextual information organized in three layers: individual, conceptual, and hierarchical. We have implemented our approach as a part of the project PARSE. It aims at enabling laypersons to construct simple programs by dialog. Our implementation incrementally generates context including occurring entities and actions as well as their conceptualizations, state transitions, and other types of contextual information. Its analyses are knowledge- or rule-based (depending on the context type), but we make use of many well-known probabilistic NLP techniques. In a user study we have shown the feasibility of our approach, achieving F-1 scores from 72% up to 98% depending on the type of contextual information. The context model enables us to resolve complex identity relations. However, quantifying this effect is subject to future work. Likewise, we plan to investigate whether our context model is useful for other language understanding tasks, e.g. anaphora resolution, topic analysis, or correction of automatic speech recognition errors.
引用
收藏
页码:1439 / 1453
页数:15
相关论文
共 50 条
  • [21] A system for acquisition of noun concepts from utterances for images using the label acquisition rules
    Uchida, Yuzu
    Araki, Kenji
    AI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4830 : 798 - 802
  • [22] Towards Programming in Natural Language: Learning New Functions from Spoken Utterances
    Weigelt, Sebastian
    Steurer, Vanessa
    Hey, Tobias
    Tichy, Walter F.
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2020, 14 (02) : 249 - 272
  • [23] The Intonation of Assertive Utterances with Illocutionary Force or Exclamatory Utterances? Concerning Spanish Spoken in Colombia
    Velasquez Upegui, Eva Patricia
    MOENIA-REVISTA LUCENSE DE LINGUISTICA & LITERATURA, 2015, 21 : 111 - 130
  • [24] Model Order Estimation Using Bayesian NMF for Discovering Phone Patterns in Spoken Utterances
    Mirzaei, Sayeh
    Van Hamme, Hugo
    Norouzi, Yaser
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1716 - 1720
  • [25] Recognizing Words from Gestures: Discovering Gesture Descriptors Associated with Spoken Utterances
    Okada, Shogo
    Otsuka, Kazuhiro
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 430 - 437
  • [26] Probabilistic, multi-staged interpretation of spoken utterances
    Zukerman, Ingrid
    Niemann, Michael
    George, Sarah
    Marom, Yuval
    AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 1215 - +
  • [27] The elaboration of spoken utterances among Swedish learners of French
    Albépart-Ottesen, C
    MODERNA SPRAK, 2000, 94 (02): : 176 - 183
  • [28] Segmentation of spoken dialogue by interjections, disfluent utterances and pauses
    Takagi, K
    Itahashi, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 697 - 700
  • [29] Unsupervised Multi-Topic Labeling for Spoken Utterances
    Weigelt, Sebastian
    Keim, Jan
    Hey, Tobias
    Tichy, Walter F.
    2019 IEEE INTERNATIONAL CONFERENCE ON HUMANIZED COMPUTING AND COMMUNICATION (HCC 2019), 2019, : 38 - 45
  • [30] Towards a flexible and contextually appropriate generation of spoken utterances
    Larrey, P
    Vigouroux, N
    Pérennou, G
    1998 IEEE 4TH WORKSHOP INTERACTIVE VOICE TECHNOLOGY FOR TELECOMMUNICATIONS APPLICATIONS - IVTTA '98, 1998, : 124 - 129