Sibyl, a Factoid Question-Answering System for Spoken Documents

Cited by: 12
Authors
Comas, Pere R. [1 ]
Turmo, Jordi [1 ]
Marquez, Lluis [1 ]
Affiliation
[1] Tech Univ Catalonia, TALP Res Ctr, Barcelona, Spain
Keywords
Algorithms; Experimentation; Question answering; Spoken document retrieval
DOI
10.1145/2328967.2328972
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this article, we present a factoid question-answering system, Sibyl, specifically tailored for question answering (QA) on spoken-word documents. This work explores, for the first time, which techniques can be robustly adapted from the usual QA on written documents to the more difficult spoken document scenario. More specifically, we study new information retrieval (IR) techniques designed for speech, and utilize several levels of linguistic information for the speech-based QA task. These include named-entity detection with phonetic information, syntactic parsing applied to speech transcripts, and the use of coreference resolution. Sibyl is largely based on supervised machine-learning techniques, with special focus on the answer extraction step, and makes little use of handcrafted knowledge. Consequently, it should be easily adaptable to other domains and languages. Sibyl and all its modules are extensively evaluated on the European Parliament Plenary Sessions English corpus, comparing manual with automatic transcripts obtained by three different automatic speech recognition (ASR) systems that exhibit significantly different word error rates. This data belongs to the CLEF 2009 track for QA on speech transcripts. The main results confirm that syntactic information is very useful for learning to rank question candidates, improving results on both manual and automatic transcripts, unless the ASR quality is very low. At the same time, our experiments on coreference resolution reveal that the state-of-the-art technology is not mature enough to be effectively exploited for QA with spoken documents. Overall, the performance of Sibyl is comparable to or better than the state of the art on this corpus, confirming the validity of our approach.
Pages: 40