A lightweight and multilingual framework for crisis information extraction from Twitter data

Cited by: 0
Authors
Roberto Interdonato
Jean-Loup Guillaume
Antoine Doucet
Affiliations
[1] CIRAD
[2] TETIS
[3] Univ. of Montpellier
[4] APT
[5] Cirad
[6] CNRS
[7] Irstea
[8] L3I
[9] Université de La Rochelle
Keywords
Crisis management; Situational awareness; Informativeness ranking
DOI
Not available
Abstract
Obtaining relevant, timely information during crisis events is a challenging task that can be fundamental to handling the consequences of both unexpected events (e.g., terrorist attacks) and partially predictable ones (e.g., natural disasters). Even though microblogging-based online social networks (e.g., Twitter) have become an attractive data source in these emergency situations, overcoming the information overload deriving from mass events is not trivial. The aim of this work was to enable unsupervised extraction of relevant information from Twitter data during a crisis event, offering a lightweight alternative to learning-based approaches. The proposed crisis management framework integrates natural language processing and clustering techniques to produce a ranking of tweets relevant to a crisis situation based on their informativeness. Experiments carried out on six Twitter collections in two languages (English and French) demonstrated the significance and flexibility of the approach.
Related papers
50 in total
  • [1] A lightweight and multilingual framework for crisis information extraction from Twitter data
    Interdonato, Roberto
    Guillaume, Jean-Loup
    Doucet, Antoine
    SOCIAL NETWORK ANALYSIS AND MINING, 2019, 9 (01)
  • [2] Unsupervised Crisis Information Extraction from Twitter Data
    Interdonato, Roberto
    Doucet, Antoine
    Guillaume, Jean-Loup
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 579 - 580
  • [3] Traffic Condition Information Extraction From Twitter Data
    Herwanto, Guntur Budi
    Dewantara, Deny Prasetya
    2018 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICELTICS): INTELLIGENT DEVICES AND COMPUTING FOR ACCELERATING INDUSTRY 4.0 AND ENRICHING SMART SOCIETIES, 2018, : 95 - 100
  • [4] A FRAMEWORK FOR MASSIVE TWITTER DATA EXTRACTION AND ANALYSIS
    Cuesta, Alvaro
    Barrero, David F.
    R-Moreno, Maria D.
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2014, 27 (01) : 50 - 67
  • [5] FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction
    Nguyen, Minh Van
    Ngo, Nghia Trung
    Min, Bonan
    Nguyen, Thien Huu
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2022, : 131 - 139
  • [6] Lightweight Multilingual Entity Extraction and Linking
    Pappu, Aasish
    Blanco, Roi
    Mehdad, Yashar
    Stent, Amanda
    Thadani, Kapil
    WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 365 - 374
  • [7] Adaptive Information Extraction of Disaster Information from Twitter
    Regalado, Ralph Vincent J.
    Chua, Jenina L.
    Co, Justin L.
    Cheng, Herman C.
    Magpantay, Angelo Bruce L.
    Kalaw, Kristine Ma. Dominique F.
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2014, : 286 - 289
  • [9] Multilingual information framework for handling textual data in digital media
    Cruz-Lara, S
    Gupta, S
    García, JDF
    Romary, L
    Proceedings of the 2005 International Conference on Active Media Technology (AMT 2005), 2005, : 81 - 84
  • [10] Multilingual Open Information Extraction
    Gamallo, Pablo
    Garcia, Marcos
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 711 - 722