Multi-Column Convolutional Neural Networks with Causality-Attention for Why-Question Answering

被引:33
|
作者
Oh, Jong-Hoon [1 ]
Torisawa, Kentaro [1 ]
Kruengkrai, Canasai [1 ]
Iida, Ryu [1 ]
Kloetzer, Julien [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Kyoto, Japan
关键词
Question Answering; Convolutional Neural Network; Neural Attention; Causality; Why-Question Answering;
D O I
10.1145/3018661.3018737
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Why-question answering (why-QA) is a task to retrieve answers (or answer passages) to why-questions (e.g., "why are tsunamis generated?") from a text archive. Several previously proposed methods for why-QA improved their performance by automatically recognizing causalities that are expressed with such explicit cues as "because" in answer passages and using the recognized causalities as a clue for finding proper answers. However, in answer passages, causalities might be implicitly expressed, (i.e., without any explicit cues): "An earthquake suddenly displaced sea water and a tsunami was generated." The previous works did not deal with such implicitly expressed causalities and failed to find proper answers that included the causalities. We improve why-QA based on the following two ideas. First, implicitly expressed causalities in one text might be expressed in other texts with explicit cues. If we can automatically recognize such explicitly expressed causalities from a text archive and use them to complement the implicitly expressed causalities in an answer passage, we can improve why-QA. Second, the causes of similar events tend to be described with a similar set of words (e.g., "seismic energy" and "tectonic plates" for "the Great East Japan Earthquake" and "the 1906 San Francisco Earthquake"). As such, even if we cannot find in a text archive any explicitly expressed cause of an event (e.g., "the Great East Japan Earthquake") expressed in a question (e.g., "Why did the Great East Japan earthquake happen?"), we might be able to identify its implicitly expressed causes with a set of words (e.g., "tectonic plates") that appear in the explicitly expressed cause of a similar event (e.g., "the 1906 San Francisco Earthquake"). We implemented these two ideas in our multi-column convolutional neural networks with a novel attention mechanism, which we call causality attention. Through experiments on Japanese why-QA, we confirmed that our proposed method outperformed the state-of-the-art systems.
引用
收藏
页码:415 / 424
页数:10
相关论文
共 50 条
  • [21] Multi-scale and multi-column convolutional neural network for crowd density estimation
    Chen, Lei
    Wang, Guodong
    Hou, Guojia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (05) : 6661 - 6674
  • [22] Multi-scale and multi-column convolutional neural network for crowd density estimation
    Lei Chen
    Guodong Wang
    Guojia Hou
    Multimedia Tools and Applications, 2021, 80 : 6661 - 6674
  • [23] Multi-image Crowd Counting Using Multi-column Convolutional Neural Network
    Kurnaz, Oguzhan
    Hanilci, Cemal
    PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 223 - 232
  • [24] Multi-level Attention Networks for Visual Question Answering
    Yu, Dongfei
    Fu, Jianlong
    Mei, Tao
    Rui, Yong
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4187 - 4195
  • [25] Multi-view Attention Networks for Visual Question Answering
    Li, Min
    Bai, Zongwen
    Deng, Jie
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 788 - 794
  • [26] Convolutional Deep Neural Networks for Document-Based Question Answering
    Fu, Jian
    Qiu, Xipeng
    Huang, Xuanjing
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 790 - 797
  • [27] Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting
    Yang, Yifan
    Li, Guorong
    Du, Dawei
    Huang, Qingming
    Sebe, Nicu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1395 - 1407
  • [28] A Crowd Counting Method Based on Multi-column Dilated Convolutional Neural Network
    Wu, Weiqun
    Sang, Jun
    Alam, Mohammad S.
    Xia, Xiaofeng
    Tan, Jinghan
    PATTERN RECOGNITION AND TRACKING XXX, 2019, 10995
  • [29] Smart connected electronic gastroscope system for gastric cancer screening using multi-column convolutional neural networks
    Wang, Hao
    Ding, Shuai
    Wu, Desheng
    Zhang, Youtao
    Yang, Shanlin
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2019, 57 (21) : 6795 - 6806
  • [30] Incorporating Statistical Features in Convolutional Neural Networks for Question Answering with Financial Data
    Shijia, E.
    Xu, Shiyao
    Xiang, Yang
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1955 - 1959