Multi-Column Convolutional Neural Networks with Causality-Attention for Why-Question Answering

被引:33
|
作者
Oh, Jong-Hoon [1 ]
Torisawa, Kentaro [1 ]
Kruengkrai, Canasai [1 ]
Iida, Ryu [1 ]
Kloetzer, Julien [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Kyoto, Japan
关键词
Question Answering; Convolutional Neural Network; Neural Attention; Causality; Why-Question Answering;
D O I
10.1145/3018661.3018737
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Why-question answering (why-QA) is a task to retrieve answers (or answer passages) to why-questions (e.g., "why are tsunamis generated?") from a text archive. Several previously proposed methods for why-QA improved their performance by automatically recognizing causalities that are expressed with such explicit cues as "because" in answer passages and using the recognized causalities as a clue for finding proper answers. However, in answer passages, causalities might be implicitly expressed, (i.e., without any explicit cues): "An earthquake suddenly displaced sea water and a tsunami was generated." The previous works did not deal with such implicitly expressed causalities and failed to find proper answers that included the causalities. We improve why-QA based on the following two ideas. First, implicitly expressed causalities in one text might be expressed in other texts with explicit cues. If we can automatically recognize such explicitly expressed causalities from a text archive and use them to complement the implicitly expressed causalities in an answer passage, we can improve why-QA. Second, the causes of similar events tend to be described with a similar set of words (e.g., "seismic energy" and "tectonic plates" for "the Great East Japan Earthquake" and "the 1906 San Francisco Earthquake"). As such, even if we cannot find in a text archive any explicitly expressed cause of an event (e.g., "the Great East Japan Earthquake") expressed in a question (e.g., "Why did the Great East Japan earthquake happen?"), we might be able to identify its implicitly expressed causes with a set of words (e.g., "tectonic plates") that appear in the explicitly expressed cause of a similar event (e.g., "the 1906 San Francisco Earthquake"). We implemented these two ideas in our multi-column convolutional neural networks with a novel attention mechanism, which we call causality attention. Through experiments on Japanese why-QA, we confirmed that our proposed method outperformed the state-of-the-art systems.
引用
收藏
页码:415 / 424
页数:10
相关论文
共 50 条
  • [1] Question Answering over Freebase with Multi-Column Convolutional Neural Networks
    Dong, Li
    Wei, Furu
    Zhou, Ming
    Xu, Ke
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 260 - 269
  • [2] BCA: Bilinear Convolutional Neural Networks and Attention Networks for legal question answering
    Zhang, Haiguang
    Zhang, Tongyue
    Cao, Faxin
    Wang, Zhizheng
    Zhang, Yuanyu
    Sun, Yuanyuan
    Vicente, Mark Anthony
    AI OPEN, 2022, 3 : 172 - 181
  • [3] Improving Event Causality Recognition with Multiple Background Knowledge Sources Using Multi-Column Convolutional Neural Networks
    Kruengkrai, Canasai
    Torisawa, Kentaro
    Hashimoto, Chikara
    Kloetzer, Julien
    Oh, Jong-Hoon
    Tanaka, Masahiro
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3466 - 3473
  • [4] Causality Analysis Method and Model Related to Why-Question Answering in Business Intelligence Context
    Guessoum, Meriem Amel
    Djiroun, Rahma
    Boukhalfa, Kamel
    ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2022, 513 : 15 - 26
  • [5] Image Inpainting via Generative Multi-column Convolutional Neural Networks
    Wang, Yi
    Tao, Xin
    Qi, Xiaojuan
    Shen, Xiaoyong
    Jia, Jiaya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Application of Multi-Column Heterogeneous Convolutional Neural Networks in image classification
    Wang, Guo-Zhen
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2019, 19 (02) : 307 - 316
  • [7] Multi-Organ Plant Identification With Multi-Column Deep Convolutional Neural Networks
    He, Anfeng
    Tian, Xinmei
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 2020 - 2025
  • [8] Image extrapolation based on multi-column convolutional attention network
    Zhang, Xiaofeng
    Wu, Songsong
    Ding, Hao
    Li, Zuoyong
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1938 - 1942
  • [9] Extended Multi-column Convolutional Neural Network for Crowd Counting
    Xue, Zhiyuan
    Shen, Jie
    Xiong, Xin
    Yuan, Chong
    Bian, Yinlong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 533 - 540
  • [10] Image Steganalysis via Multi-Column Convolutional Neural Network
    Qi Ke
    Liu DongMing
    Zhang Daxing
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 550 - 553