Mining causality from texts for question answering system

Cited by: 14
Authors
Pechsiri, Chaveevan [1]
Kawtrakul, Asanee [1]
Affiliation
[1] Kasetsart Univ, Bangkok, Thailand
Keywords
elementary discourse unit (EDU); multiple EDU causality extraction; causative antecedent; effective consequence; causality boundary identification; verb-pair rules extraction
DOI
10.1093/ietisy/e90-d.10.1523
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This research aims to develop automatic mining of causality knowledge from texts to support an automatic question answering (QA) system in answering 'why' questions, which are among the most crucial forms of questions. The outcome of this research will assist people in diagnosing problems in areas such as plant diseases, health, and industry. While previous work has extracted causality knowledge within only one or two adjacent EDUs (Elementary Discourse Units), this research focuses on mining causality knowledge that spans multiple EDUs, taking multiple causes and multiple effects into consideration, where the cause and the effect need not be adjacent. There are two main problems: how to identify the causality events of interest in documents, and how to identify the boundaries of the causative unit and the effective unit in terms of multiple EDUs. In addition, at least three main problems are involved in boundary identification: the implicit boundary delimiter, the nonadjacent cause-consequence, and the effect surrounded by causes. This research proposes using verb-pair rules, learnt by comparing the Naive Bayes classifier (NB) and the Support Vector Machine (SVM), to identify causality EDUs in the Thai agricultural and health news domains. The boundary identification problems are solved by utilizing verb-pair rules, Centering Theory, and a cue phrase set. The reason for emphasizing verbs in causality extraction is that they explicitly express, in a certain way, the consequent events of cause and effect, e.g. 'Aphids suck the sap from rice leaves. Then leaves will shrink. Later, they will become yellow and dry.' The results show that the verb-pair rules extracted from NB outperform those extracted from SVM when the corpus contains a high occurrence of each verb, while the results from SVM are better than those from NB when the corpus contains a lower occurrence of each verb.
The verb-pair rules extracted from NB for causality extraction have the highest precision (0.88), with a recall of 0.75, on the plant disease corpus, whereas those extracted from SVM have the highest precision (0.89), with a recall of 0.76, on bird flu news. For boundary determination, our methodology performs very well, with approximately 96% accuracy. In addition, the causality results extracted in this research can be generalized as laws in the Inductive-Statistical theory within Hempel's theory of explanation, which will be useful for QA and reasoning.
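To make the idea of learning verb-pair rules with Naive Bayes more concrete, the following is a minimal sketch, not the authors' implementation. It assumes each candidate EDU pair has already been reduced to one verb per EDU, and it uses a handful of invented English verb pairs (loosely modeled on the aphid example in the abstract) in place of the Thai corpora. The names `TRAIN`, `train_nb`, `log_posteriors`, and `classify`, as well as the labels `causal` and `non_causal`, are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data: (verb of first EDU, verb of second EDU, label).
# These pairs are invented for illustration; they are not from the paper.
TRAIN = [
    ("suck", "shrink", "causal"),
    ("suck", "dry", "causal"),
    ("infect", "die", "causal"),
    ("destroy", "shrink", "causal"),
    ("grow", "sell", "non_causal"),
    ("plant", "visit", "non_causal"),
    ("report", "grow", "non_causal"),
]


def train_nb(samples):
    """Count class frequencies and per-position verb frequencies."""
    class_counts = Counter()
    feat_counts = defaultdict(Counter)  # (label, position) -> verb counts
    vocab = set()
    for v1, v2, label in samples:
        class_counts[label] += 1
        feat_counts[(label, 0)][v1] += 1
        feat_counts[(label, 1)][v2] += 1
        vocab.update((v1, v2))
    return class_counts, feat_counts, vocab


def log_posteriors(model, v1, v2):
    """Unnormalized log P(label | v1, v2) with add-one smoothing."""
    class_counts, feat_counts, vocab = model
    total = sum(class_counts.values())
    scores = {}
    for label, n in class_counts.items():
        score = math.log(n / total)  # class prior
        for pos, verb in enumerate((v1, v2)):
            count = feat_counts[(label, pos)][verb]
            score += math.log((count + 1) / (n + len(vocab)))
        scores[label] = score
    return scores


def classify(model, v1, v2):
    """Return the label with the highest posterior score."""
    scores = log_posteriors(model, v1, v2)
    return max(scores, key=scores.get)


model = train_nb(TRAIN)
print(classify(model, "suck", "shrink"))
print(classify(model, "plant", "visit"))
```

In this sketch a verb pair that the classifier labels `causal` plays the role of a verb-pair rule. The EDU segmentation, the SVM comparison, and the boundary identification with Centering Theory and cue phrases described in the abstract are outside its scope.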
Pages: 1523-1533
Number of pages: 11