A lightweight and multilingual framework for crisis information extraction from Twitter data

被引:13
|
作者
Interdonato, Roberto [1 ]
Guillaume, Jean-Loup [2 ]
Doucet, Antoine [2 ]
机构
[1] Univ Montpellier, CIRAD, TETIS, APT,CNRS,Irstea, Montpellier, France
[2] Univ La Rochelle, L3I, La Rochelle, France
关键词
Crisis management; Situational awareness; Informativeness ranking; QUALITY;
D O I
10.1007/s13278-019-0608-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Obtaining relevant timely information during crisis events is a challenging task that can be fundamental to handle the consequences deriving from both unexpected events (e.g., terrorist attacks) and partially predictable ones (i.e., natural disasters). Even though microblogging-based online social networks (e.g., Twitter) have become an attractive data source in these emergency situations, overcoming the information overload deriving from mass events is not trivial. The aim of this work was to enable unsupervised extraction of relevant information from Twitter data during a crisis event, offering a lightweight alternative to learning-based approaches. The proposed lightweight crisis management framework integrates natural language processing and clustering techniques in order to produce a ranking of tweets relevant to a crisis situation based on their informativeness. Experiments carried out on six Twitter collections in two languages (English and French) proved the significance and the flexibility of our approach.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A lightweight and multilingual framework for crisis information extraction from Twitter data
    Roberto Interdonato
    Jean-Loup Guillaume
    Antoine Doucet
    [J]. Social Network Analysis and Mining, 2019, 9
  • [2] Unsupervised Crisis Information Extraction from Twitter Data
    Interdonato, Roberto
    Doucet, Antoine
    Guillaume, Jean-Loup
    [J]. 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 579 - 580
  • [3] Traffic Condition Information Extraction From Twitter Data
    Herwanto, Guntur Budi
    Dewantara, Deny Prasetya
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICELTICS): INTELLIGENT DEVICES AND COMPUTING FOR ACCELERATING INDUSTRY 4.0 AND ENRICHING SMART SOCIETIES, 2018, : 95 - 100
  • [4] A FRAMEWORK FOR MASSIVE TWITTER DATA EXTRACTION AND ANALYSIS
    AlvaroCuesta
    Barrero, David F.
    R-Moreno, Maria D.
    [J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2014, 27 (01) : 50 - 67
  • [5] FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction
    Nguyen, Minh Van
    Ngo, Nghia Trung
    Min, Bonan
    Nguyen, Thien Huu
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2022, : 131 - 139
  • [6] Lightweight Multilingual Entity Extraction and Linking
    Pappu, Aasish
    Blanco, Roi
    Mehdad, Yashar
    Stent, Amanda
    Thadani, Kapil
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 365 - 374
  • [7] Adaptive Information Extraction of Disaster Information from Twitter
    Rcgalado, Ralph Vincent J.
    Chua, Jenina L.
    Co, Justin L.
    Cheng, Herman C.
    Magpantay, Angelo Bruce L.
    Kalaw, Kristine Ma. Dominique F.
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2014, : 286 - 289
  • [9] Multilingual information framework for handling textual data in digital media
    Cruz-Lara, S
    Gupta, S
    García, JDF
    Romary, L
    [J]. Proceedings of the 2005 International Conference on Active Media Technology (AMT 2005), 2005, : 81 - 84
  • [10] Multilingual Open Information Extraction
    Gamallo, Pablo
    Garcia, Marcos
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 711 - 722