Multi-Column Convolutional Neural Networks with Causality-Attention for Why-Question Answering

被引:33
|
作者
Oh, Jong-Hoon [1 ]
Torisawa, Kentaro [1 ]
Kruengkrai, Canasai [1 ]
Iida, Ryu [1 ]
Kloetzer, Julien [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Kyoto, Japan
来源
WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING | 2017年
关键词
Question Answering; Convolutional Neural Network; Neural Attention; Causality; Why-Question Answering;
D O I
10.1145/3018661.3018737
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Why-question answering (why-QA) is a task to retrieve answers (or answer passages) to why-questions (e.g., "why are tsunamis generated?") from a text archive. Several previously proposed methods for why-QA improved their performance by automatically recognizing causalities that are expressed with such explicit cues as "because" in answer passages and using the recognized causalities as a clue for finding proper answers. However, in answer passages, causalities might be implicitly expressed, (i.e., without any explicit cues): "An earthquake suddenly displaced sea water and a tsunami was generated." The previous works did not deal with such implicitly expressed causalities and failed to find proper answers that included the causalities. We improve why-QA based on the following two ideas. First, implicitly expressed causalities in one text might be expressed in other texts with explicit cues. If we can automatically recognize such explicitly expressed causalities from a text archive and use them to complement the implicitly expressed causalities in an answer passage, we can improve why-QA. Second, the causes of similar events tend to be described with a similar set of words (e.g., "seismic energy" and "tectonic plates" for "the Great East Japan Earthquake" and "the 1906 San Francisco Earthquake"). As such, even if we cannot find in a text archive any explicitly expressed cause of an event (e.g., "the Great East Japan Earthquake") expressed in a question (e.g., "Why did the Great East Japan earthquake happen?"), we might be able to identify its implicitly expressed causes with a set of words (e.g., "tectonic plates") that appear in the explicitly expressed cause of a similar event (e.g., "the 1906 San Francisco Earthquake"). We implemented these two ideas in our multi-column convolutional neural networks with a novel attention mechanism, which we call causality attention. Through experiments on Japanese why-QA, we confirmed that our proposed method outperformed the state-of-the-art systems.
引用
收藏
页码:415 / 424
页数:10
相关论文
共 50 条
  • [41] SPOKEN MULTIPLE-CHOICE QUESTION ANSWERING USING MULTIMODAL CONVOLUTIONAL NEURAL NETWORKS
    Luo, Shang-Bao
    Lee, Hung-Shin
    Chen, Kuan-Yu
    Wang, Hsin-Min
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 772 - 778
  • [42] Adversarial Entity Graph Convolutional Networks for multi-hop inference question answering
    Du, Yongping
    Yan, Rui
    Hou, Ying
    Pei, Yu
    Han, Honggui
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [43] Crowd density detection method based on crowd gathering mode and multi-column convolutional neural network
    Bai, Liu
    Wu, Cheng
    Xie, Feng
    Wang, Yiming
    IMAGE AND VISION COMPUTING, 2021, 105
  • [44] Image-Based Crowd Stability Analysis Using Improved Multi-Column Convolutional Neural Network
    Zhao, Rongyong
    Dong, Daheng
    Wang, Yan
    Li, Cuiling
    Ma, Yunlong
    Fuentes Enriquez, Veronica
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5480 - 5489
  • [45] Multi-modal co-attention relation networks for visual question answering
    Zihan Guo
    Dezhi Han
    The Visual Computer, 2023, 39 : 5783 - 5795
  • [46] Multi-Granularity Hierarchical Attention Fusion Networks for Reading Comprehension and Question Answering
    Wang, Wei
    Yan, Ming
    Wu, Chen
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1705 - 1714
  • [47] Multi-modal co-attention relation networks for visual question answering
    Guo, Zihan
    Han, Dezhi
    VISUAL COMPUTER, 2023, 39 (11): : 5783 - 5795
  • [48] ESTIMATION OF SIGNAL-DEPENDENT NOISE LEVEL FUNCTION USING MULTI-COLUMN CONVOLUTIONAL NEURAL NETWORK
    Yang, Jingyu
    Liu, Xin
    Song, Xiaolin
    Li, Kun
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2418 - 2422
  • [49] Rape seedling density estimation in-field conditions based on improved multi-column convolutional neural network
    Yang, Hai-Chao
    Yuan, Hao-Yu
    Wang, Yan-Li
    Li, Yi
    Yin, Zi-Qin
    AGRONOMY JOURNAL, 2024, 116 (03) : 810 - 825
  • [50] Enhancing Solar Energy Forecast Using Multi-Column Convolutional Neural Network and Multipoint Time Series Approach
    Kumar, Anil
    Kashyap, Yashwant
    Kosmopoulos, Panagiotis
    REMOTE SENSING, 2023, 15 (01)