Sibyl, a Factoid Question-Answering System for Spoken Documents

被引:12
|
作者
Comas, Pere R. [1 ]
Turmo, Jordi [1 ]
Marquez, Lluis [1 ]
机构
[1] Tech Univ Catalonia, TALP Res Ctr, Barcelona, Spain
关键词
Algorithms; Experimentation; Question answering; spoken document retrieval;
D O I
10.1145/2328967.2328972
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we present a factoid question-answering system, Sibyl, specifically tailored for question answering (QA) on spoken-word documents. This work explores, for the first time, which techniques can be robustly adapted from the usual QA on written documents to the more difficult spoken document scenario. More specifically, we study new information retrieval (IR) techniques designed or speech, and utilize several levels of linguistic information for the speech-based QA task. These include named-entity detection with phonetic information, syntactic parsing applied to speech transcripts, and the use of coreference resolution. Sibyl is largely based on supervised machine-learning techniques, with special focus on the answer extraction step, and makes little use of handcrafted knowledge. Consequently, it should be easily adaptable to other domains and languages. Sibyl and all its modules are extensively evaluated on the European Parliament Plenary Sessions English corpus, comparing manual with automatic transcripts obtained by three different automatic speech recognition (ASR) systems that exhibit significantly different word error rates. This data belongs to the CLEF 2009 track for QA on speech transcripts. The main results confirm that syntactic information is very useful for learning to rank question candidates, improving results on both manual and automatic transcripts, unless the ASR quality is very low. At the same time, our experiments on coreference resolution reveal that the state-of-the-art technology is not mature enough to be effectively exploited for QA with spoken documents. Overall, the performance of Sibyl is comparable or better than the state-of-the-art on this corpus, confirming the validity of our approach.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] An Arabic Question-Answering system for factoid questions
    Brini, Wissal
    Ellouze, Mariem
    Mesfar, Slim
    Belguith, Lamia Hadrich
    [J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 417 - +
  • [2] A Non-Factoid Question-Answering Taxonomy
    Bolotova, Valeriia
    Blinov, Vladislav
    Scholer, Falk
    Croft, W. Bruce
    Sanderson, Mark
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1196 - 1207
  • [3] Using Dependency Parsing and Machine Learning for Factoid Question Answering on Spoken Documents
    Comas, Pere R.
    Turmo, Jordi
    Marquez, Lluis
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1265 - 1268
  • [4] Arabic factoid Question-Answering system for Islamic sciences using normalized corpora
    Maraoui, Hajer
    Haddar, Kais
    Romary, Laurent
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 69 - 79
  • [5] WestSearch Plus: A Non-factoid Question-Answering System for the Legal Domain
    McElvain, Gayle
    Sanchez, George
    Matthews, Sean
    Teo, Don
    Pompili, Filippo
    Custis, Tonya
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1361 - 1364
  • [6] A SENTIMENT BASED NON-FACTOID QUESTION-ANSWERING FRAMEWORK
    Ye, Qiaofei
    Misra, Kanishka
    Devarapalli, Hemanth
    Rayz, Julia Taylor
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 372 - 377
  • [7] Summarizing Answers in Non-Factoid Community Question-Answering
    Song, Hongya
    Ren, Zhaochun
    Liang, Shangsong
    Li, Piji
    Ma, Jun
    de Rijke, Maarten
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 405 - 414
  • [8] Question-answering system
    Stupina, A. A.
    Zhukov, E. A.
    Ezhemanskaya, S. N.
    Karaseva, M. V.
    Korpacheva, L. N.
    [J]. XII INTERNATIONAL SCIENTIFIC AND RESEARCH CONFERENCE TOPICAL ISSUES IN AERONAUTICS AND ASTRONAUTICS, 2016, 155
  • [9] QUESTION ANSWERING SYSTEM FOR FACTOID BASED QUESTION
    Ranjan, Prakash
    Balabantaray, Rakesh Chandra
    [J]. PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 221 - 224
  • [10] A Factoid Question Answering System for Vietnamese
    Phuong Le-Hong
    Duc-Thien Bui
    [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1049 - 1055