Multi-Column Convolutional Neural Networks with Causality-Attention for Why-Question Answering

被引:33
|
作者
Oh, Jong-Hoon [1 ]
Torisawa, Kentaro [1 ]
Kruengkrai, Canasai [1 ]
Iida, Ryu [1 ]
Kloetzer, Julien [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Kyoto, Japan
关键词
Question Answering; Convolutional Neural Network; Neural Attention; Causality; Why-Question Answering;
D O I
10.1145/3018661.3018737
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Why-question answering (why-QA) is a task to retrieve answers (or answer passages) to why-questions (e.g., "why are tsunamis generated?") from a text archive. Several previously proposed methods for why-QA improved their performance by automatically recognizing causalities that are expressed with such explicit cues as "because" in answer passages and using the recognized causalities as a clue for finding proper answers. However, in answer passages, causalities might be implicitly expressed, (i.e., without any explicit cues): "An earthquake suddenly displaced sea water and a tsunami was generated." The previous works did not deal with such implicitly expressed causalities and failed to find proper answers that included the causalities. We improve why-QA based on the following two ideas. First, implicitly expressed causalities in one text might be expressed in other texts with explicit cues. If we can automatically recognize such explicitly expressed causalities from a text archive and use them to complement the implicitly expressed causalities in an answer passage, we can improve why-QA. Second, the causes of similar events tend to be described with a similar set of words (e.g., "seismic energy" and "tectonic plates" for "the Great East Japan Earthquake" and "the 1906 San Francisco Earthquake"). As such, even if we cannot find in a text archive any explicitly expressed cause of an event (e.g., "the Great East Japan Earthquake") expressed in a question (e.g., "Why did the Great East Japan earthquake happen?"), we might be able to identify its implicitly expressed causes with a set of words (e.g., "tectonic plates") that appear in the explicitly expressed cause of a similar event (e.g., "the 1906 San Francisco Earthquake"). We implemented these two ideas in our multi-column convolutional neural networks with a novel attention mechanism, which we call causality attention. Through experiments on Japanese why-QA, we confirmed that our proposed method outperformed the state-of-the-art systems.
引用
收藏
页码:415 / 424
页数:10
相关论文
共 50 条
  • [31] Single-Image Crowd Counting via Multi-Column Convolutional Neural Network
    Zhang, Yingying
    Zhou, Desen
    Chen, Siqin
    Gao, Shenghua
    Ma, Yi
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 589 - 597
  • [32] Toward a Multi-Column Knowledge-Oriented Neural Network for Web Corpus Causality Mining
    Ali, Wajid
    Zuo, Wanli
    Wang, Ying
    Ali, Rahman
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [33] Generating 3D Faces using Multi-column Graph Convolutional Networks
    Li, Kun
    Liu, Jingying
    Lai, Yu-Kun
    Yang, Jingyu
    COMPUTER GRAPHICS FORUM, 2019, 38 (07) : 215 - 224
  • [34] Counting challenging crowds robustly using a multi-column multi-task convolutional neural network
    Yang, Biao
    Cao, Jinmeng
    Wang, Nan
    Zhang, Yuyu
    Zou, Ling
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 64 : 118 - 129
  • [35] Multi-source Multi-level Attention Networks for Visual Question Answering
    Yu, Dongfei
    Fu, Jianlong
    Tian, Xinmei
    Mei, Tao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [36] Multi-feature Counting of Dense Crowd Image Based on Multi-column Convolutional Neural Network
    Gong, Songchenchen
    Bourennane, El-Bay
    Gao, Junyu
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 215 - 219
  • [37] Multi-Column Deep Neural Networks for Offline Handwritten Chinese Character Classification
    Ciresan, Dan
    Meier, Ueli
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [38] Multi-modal spatial relational attention networks for visual question answering
    Yao, Haibo
    Wang, Lipeng
    Cai, Chengtao
    Sun, Yuxin
    Zhang, Zhi
    Luo, Yongkang
    IMAGE AND VISION COMPUTING, 2023, 140
  • [39] Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering
    Guo, Zihan
    Han, Dezhi
    SENSORS, 2020, 20 (23) : 1 - 15
  • [40] Sonar Image Detection Based on Multi-Scale Multi-Column Convolution Neural Networks
    Wang, Zhen
    Zhang, Shanwen
    IEEE ACCESS, 2019, 7 : 160755 - 160767