Snowball: Extracting Causal Chains from Climate Change Text Corpora

被引:6
|
作者
Alashri, Saud [1 ]
Tsai, Jiun-Yi [2 ]
Koppela, Anvesh Reddy [3 ]
Davulcu, Hasan [1 ]
机构
[1] Arizona State Univ, CIDSE, Tempe, AZ 85287 USA
[2] No Arizona Univ, Sch Commun, Flagstaff, AZ 86011 USA
[3] Amazon Marketpl, Seattle, WA USA
关键词
Climate Change; Causal Relations; Causal Chains; Text Mining; Information Extraction; Natural Language Processing;
D O I
10.1109/ICDIS.2018.00045
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Unpacking causal relationships is essential for developing solutions for managing climate risks that threaten sociopolitical stability. However, the automatic discovery of complex causal chains among interlinked events and their participating actors within large corpora is not well studied. Previous studies on extracting causal relationships from text were based on laborious and incomplete hand developed lists of causal verbs, such as "causes" and "results in". Such approaches result in limited recall because standard causal verbs may not generalize well to accommodate surface variations in texts when different keywords and phrases are used to express similar causal effects. This paper presents a Snowball system to generalize <Subject, Verb, Object> triplets extracted from corpora of online news articles, and cluster them into higher-level concepts without drift. We start with a seed set of causal verbs and apply a concept generalization technique to extract causal chains and their participating actors. Our novel algorithms overcome surface variations in written expressions of causal relationships and discover the domino effects between climate events and human security. Unlike prior studies, our semi-supervised approach alleviates the need for labor intensive keyword list development and annotated datasets. Experimental evaluations by domain experts achieve an average precision of 82%, a significant improvement from prior work. Qualitative assessments of causal chains show that results are consistent with the 2014 IPCC report illuminating causal mechanisms underlying the linkages between climatic stresses and social instability.
引用
收藏
页码:234 / 241
页数:8
相关论文
共 50 条
  • [1] ExcavatorCovid: Extracting Events and Relations from Text Corpora for Temporal and Causal Analysis for COVID-19
    Min, Bonan
    Rozonoyer, Ben
    Qiu, Haoling
    Zamanian, Alex
    Xue, Nianwen
    MacBride, Jessica
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2021, : 63 - 71
  • [2] Extracting semantic representations from large text corpora
    Patel, M
    Bullinaria, JA
    Levy, JP
    [J]. 4TH NEURAL COMPUTATION AND PSYCHOLOGY WORKSHOP, LONDON, 9-11 APRIL 1997: CONNECTIONIST REPRESENTATIONS, 1997, : 199 - 212
  • [3] Snowball:: A prototype system for extracting relations from large text collections
    Agichtein, E
    Gravano, L
    Pavel, J
    Sokolova, V
    Voskoboynik, A
    [J]. SIGMOD RECORD, 2001, 30 (02) : 612 - 612
  • [4] A Study of Extracting Causal Relationships from Text
    Gujarathi, Pranav
    Reddy, Manohar
    Tayade, Neha
    Chakraborty, Sunandan
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, 2023, 544 : 807 - 828
  • [5] A Process for Extracting Knowledge Base for Chatbots from Text Corpora
    Krassmann, Aliane Loureiro
    Flach, Joao Marcos
    Cestari da Silva Grando, Anita Raquel
    Rockenbach Tarouco, Liane Margarida
    Bercht, Magda
    [J]. PROCEEDINGS OF 2019 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2019, : 322 - 329
  • [6] Automatic Extraction of Causal Chains from Text
    Huminski, Aliaksandr
    Bin, Ng Yan
    [J]. LIBRES-LIBRARY AND INFORMATION SCIENCE RESEARCH ELECTRONIC JOURNAL, 2019, 29 (02): : 99 - 108
  • [7] A lightweight tool for automatically extracting causal relationships from text
    Cole, Stephen V.
    Royal, Matthew D.
    Valtorta, Marco G.
    Huhns, Michael N.
    Bowles, John B.
    [J]. PROCEEDINGS OF THE IEEE SOUTHEASTCON 2006, 2006, : 125 - 129
  • [8] Tree pattern expression for extracting information from syntactically parsed text corpora
    Choi, Yong Suk
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 22 (1-2) : 211 - 231
  • [9] Tree pattern expression for extracting information from syntactically parsed text corpora
    Yong Suk Choi
    [J]. Data Mining and Knowledge Discovery, 2011, 22 : 211 - 231
  • [10] A new computing method for extracting contiguous phraseological sequences from academic text corpora
    Wei, Naixing
    Li, Jingjie
    [J]. INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2013, 18 (04) : 506 - 535