CONTEXTUAL LANGUAGE MODELS FOR RANKING ANSWERS TO NATURAL LANGUAGE DEFINITION QUESTIONS

Cited by: 8
Authors
Figueroa, Alejandro [2 ]
Atkinson, John [1 ]
Affiliations
[1] Univ Concepcion, Dept Comp Sci, Concepcion, Chile
[2] Yahoo Res Latin Amer, Santiago, Chile
Keywords
context definition models; definition questions; feature analysis; lexicalized dependency paths; statistical language models; question answering
DOI
10.1111/j.1467-8640.2012.00426.x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Question-answering systems make good use of knowledge bases (KBs, e.g., Wikipedia) for responding to definition queries. Typically, systems extract relevant facts about the question from articles across KBs and then project them into the candidate answers. However, studies have shown that the performance of this kind of method drops sharply whenever KBs supply narrow coverage. This work describes a new approach to deal with this problem by constructing context models for scoring candidate answers, which are, more precisely, statistical n-gram language models inferred from lexicalized dependency paths extracted from Wikipedia abstracts. Unlike state-of-the-art approaches, context models are created by capturing the semantics of candidate answers (e.g., "novel", "singer", "coach", and "city"). This work is extended by investigating the impact on context models of extra linguistic knowledge such as part-of-speech tagging and named-entity recognition. Results showed the effectiveness of context models as n-gram lexicalized dependency paths and promising context indicators for the presence of definitions in natural language texts.
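The idea of scoring candidate answers with a per-context n-gram language model can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a bigram model with add-one smoothing, and the training sequences are invented stand-ins for lexicalized dependency paths (the record does not show the real path format). The names `BigramModel`, `rank`, and the `ENTITY` placeholder token are hypothetical.

```python
import math
from collections import Counter


class BigramModel:
    """Add-one smoothed bigram language model over token sequences.

    Assumption: each sequence stands in for one lexicalized dependency
    path belonging to a single context (here, "singer").
    """

    def __init__(self, sequences):
        self.context_counts = Counter()  # how often a token precedes another
        self.bigram_counts = Counter()
        self.vocab = {"</s>"}
        for seq in sequences:
            tokens = ["<s>"] + list(seq) + ["</s>"]
            self.vocab.update(tokens)
            for prev, cur in zip(tokens, tokens[1:]):
                self.context_counts[prev] += 1
                self.bigram_counts[(prev, cur)] += 1

    def log_prob(self, seq):
        """Average log-probability per transition (length-normalized)."""
        tokens = ["<s>"] + list(seq) + ["</s>"]
        v = len(self.vocab) + 1  # +1 reserves probability mass for unseen tokens
        total = 0.0
        for prev, cur in zip(tokens, tokens[1:]):
            numerator = self.bigram_counts[(prev, cur)] + 1
            denominator = self.context_counts[prev] + v
            total += math.log(numerator / denominator)
        return total / (len(tokens) - 1)


def rank(candidates, model):
    """Order candidate sequences from most to least likely under the model."""
    return sorted(candidates, key=model.log_prob, reverse=True)


# Toy training data for a hypothetical "singer" context model.
singer_model = BigramModel([
    ["ENTITY", "is", "singer"],
    ["ENTITY", "is", "singer", "born"],
    ["ENTITY", "was", "singer"],
])

definition_like = ["ENTITY", "is", "singer"]
unrelated = ["tickets", "sold", "ENTITY"]
ranked = rank([unrelated, definition_like], singer_model)
```

In this toy setup the definition-like sequence ranks first because all of its transitions were observed in training, while the unrelated one relies entirely on smoothed counts. Length normalization is a design choice of the sketch so that longer candidates are not penalized merely for having more transitions.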
Pages: 528-548
Page count: 21
Related papers
50 in total
  • [21] The language of evidence based medicine: Answers to common questions
    Degen, Ryan M.
    Hodgins, Justin L.
    Bhandari, Mohit
    INDIAN JOURNAL OF ORTHOPAEDICS, 2008, 42 (02) : 111 - 117
  • [22] Answers without questions: The emergence of fragments in child language
    Ginzburg, Jonathan
    Kolliakou, Dimitra
    JOURNAL OF LINGUISTICS, 2009, 45 (03) : 641 - 673
  • [23] Large Language Models are Not Models of Natural Language: They are Corpus Models
    Veres, Csaba
    IEEE ACCESS, 2022, 10 : 61970 - 61979
  • [24] Natural Language Understanding for Grading Essay Questions in Persian Language
    Mokhtari-Fard, Iman
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 144 - 153
  • [25] Bilattices and the Semantics of Natural Language Questions
    R. Nelken
    N. Francez
    Linguistics and Philosophy, 2002, 25 : 37 - 64
  • [26] Bilattices and the semantics of natural language questions
    Nelken, R
    Francez, N
    LINGUISTICS AND PHILOSOPHY, 2002, 25 (01) : 37 - 64
  • [27] The Journey of Language Models in Understanding Natural Language
    Liu, Yuanrui
    Zhou, Jingping
    Sang, Guobiao
    Huang, Ruilong
    Zhao, Xinzhe
    Fang, Jintao
    Wang, Tiexin
    Li, Bohan
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 331 - 363
  • [28] Statistics Corner: Questions and Answers about Language Testing Statistics
    Akbarian, Is'haaq
    JOURNAL OF ASIA TEFL, 2022, 19 (03) : 1138 - 1140
  • [29] Automatically Finding Answers to "Why" and "How to" Questions for Arabic Language
    Salem, Ziad
    Sadek, Jawad
    Chakkour, Fairouz
    Haskkour, Nadia
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT IV, 2010, 6279 : 586 - +
  • [30] Language and hybrids: too many answers for too few questions
    Bruner, Emiliano
    JOURNAL OF ANTHROPOLOGICAL SCIENCES, 2013, 91 : 245 - 247