Automatic Question Answering using the Web: Beyond the factoid

Cited by: 42
Authors
Soricut, R
Brill, E
Affiliations
[1] Univ So Calif, Inst Informat Sci, Marina Del Rey, CA 90292 USA
[2] Microsoft Res, Redmond, WA 98052 USA
Source
INFORMATION RETRIEVAL | 2006 / Vol. 9 / Issue 02
Keywords
Statistical Model; Data Structure; Information Theory; Search Engine; Language Model
DOI
10.1007/s10791-006-7149-y
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper we describe and evaluate a Question Answering (QA) system that goes beyond answering factoid questions. Our approach to QA assumes no restrictions on the type of questions that are handled, and no assumption that the answers to be provided are factoids. We present an unsupervised approach for collecting question and answer pairs from FAQ pages, which we use to collect a corpus of 1 million question/answer pairs from FAQ pages available on the Web. This corpus is used to train various statistical models employed by our QA system: a statistical chunker used to transform a natural language-posed question into a phrase-based query to be submitted for exact match to an off-the-shelf search engine; an answer/question translation model, used to assess the likelihood that a proposed answer is indeed an answer to the posed question; and an answer language model, used to assess the likelihood that a proposed answer is a well-formed answer. We evaluate our QA system in a modular fashion, by comparing the performance of baseline algorithms against our proposed algorithms for various modules in our QA system. The evaluation shows that our system achieves reasonable performance in terms of answer accuracy for a large variety of complex, non-factoid questions.
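The abstract names two scoring models: an answer/question translation model and an answer language model. The sketch below shows one way such scores could be combined to rank candidate answers. It is an illustration under stated assumptions, not the authors' implementation: the translation table `T`, the back-off value `FLOOR`, the IBM Model 1-style averaging, the add-one-smoothed unigram language model, and the summing of the two log scores are all hypothetical choices made for this example.

```python
import math
from collections import Counter

# Hypothetical toy translation table t(question_word | answer_word).
# A real table would be estimated from question/answer pairs.
T = {
    ("why", "because"): 0.6,
    ("sky", "sky"): 0.7,
    ("blue", "blue"): 0.7,
    ("blue", "scatters"): 0.2,
}
FLOOR = 1e-4  # assumed back-off probability for unseen word pairs


def log_p_question_given_answer(question, answer):
    """IBM Model 1-style score: each question word is explained by the
    average of t(q_word | a_word) over all words of the answer."""
    total = 0.0
    for q in question:
        avg = sum(T.get((q, a), FLOOR) for a in answer) / len(answer)
        total += math.log(avg)
    return total


def make_unigram_lm(corpus):
    """Build an add-one-smoothed unigram model from tokenized answers
    and return a function giving the log-probability of an answer."""
    counts = Counter(w for sentence in corpus for w in sentence)
    n = sum(counts.values())
    v = len(counts) + 1  # one extra slot for unknown words

    def log_p(answer):
        return sum(math.log((counts[w] + 1) / (n + v)) for w in answer)

    return log_p


def rank(question, candidates, log_p_answer):
    """Sort candidates by log P(question | answer) + log P(answer),
    best first (an assumed way of combining the two scores)."""
    scored = [
        (log_p_question_given_answer(question, c) + log_p_answer(c), c)
        for c in candidates
    ]
    return [c for _, c in sorted(scored, key=lambda s: s[0], reverse=True)]


if __name__ == "__main__":
    question = ["why", "sky", "blue"]
    on_topic = ["because", "sky", "scatters", "blue"]
    off_topic = ["the", "cat", "sat", "down"]
    lm = make_unigram_lm([on_topic, ["the", "sky"]])
    print(rank(question, [off_topic, on_topic], lm))
```

With these toy values the on-topic candidate ranks first, because its words both translate well into the question words and are more probable under the toy language model. The unigram model favors shorter answers, so candidates of very different lengths would need some form of length normalization.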
Pages: 191-206
Page count: 16