Sibyl, a Factoid Question-Answering System for Spoken Documents

Cited by: 12
Authors
Comas, Pere R. [1 ]
Turmo, Jordi [1 ]
Marquez, Lluis [1 ]
Affiliations
[1] Tech Univ Catalonia, TALP Res Ctr, Barcelona, Spain
Keywords
Algorithms; Experimentation; Question answering; Spoken document retrieval
DOI
10.1145/2328967.2328972
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this article, we present a factoid question-answering system, Sibyl, specifically tailored for question answering (QA) on spoken-word documents. This work explores, for the first time, which techniques can be robustly adapted from the usual QA on written documents to the more difficult spoken document scenario. More specifically, we study new information retrieval (IR) techniques designed for speech, and utilize several levels of linguistic information for the speech-based QA task. These include named-entity detection with phonetic information, syntactic parsing applied to speech transcripts, and the use of coreference resolution. Sibyl is largely based on supervised machine-learning techniques, with special focus on the answer extraction step, and makes little use of handcrafted knowledge. Consequently, it should be easily adaptable to other domains and languages. Sibyl and all its modules are extensively evaluated on the European Parliament Plenary Sessions English corpus, comparing manual with automatic transcripts obtained by three different automatic speech recognition (ASR) systems that exhibit significantly different word error rates. This data belongs to the CLEF 2009 track for QA on speech transcripts. The main results confirm that syntactic information is very useful for learning to rank question candidates, improving results on both manual and automatic transcripts, unless the ASR quality is very low. At the same time, our experiments on coreference resolution reveal that the state-of-the-art technology is not mature enough to be effectively exploited for QA with spoken documents. Overall, the performance of Sibyl is comparable to or better than the state of the art on this corpus, confirming the validity of our approach.
Pages: 40