Location detection and disambiguation from twitter messages

被引:0
|
作者
Diana Inkpen
Ji Liu
Atefeh Farzindar
Farzaneh Kazemi
Diman Ghazi
机构
[1] University of Ottawa,School of Electrical Engineering and Computer Science
[2] NLP Technologies,undefined
关键词
Artificial intelligence; Information extraction; Machine learning; Natural language processing; Social media;
D O I
暂无
中图分类号
学科分类号
摘要
A remarkable amount of Twitter messages are generated every second. Detecting the location entities mentioned in these messages is useful in text mining applications. Therefore, techniques for extracting the location entities from the Twitter textual content are needed. In this work, we approach this task in a similar manner to the Named Entity Recognition (NER) task, but we focus only on locations, while NER systems detect names of persons, organizations, locations, and sometimes more (e.g., dates, times). But, unlike NER systems, we address a deeper task: classifying the detected locations into names of cities, provinces/states, and countries in order to map them into physical locations. We approach the task in a novel way, consisting in two stages. In the first stage, we train Conditional Random Fields (CRF) models that are able to detect the locations mentioned in the messages. We train three classifiers: one for cities, one for provinces/states, and one for countries, with various sets of features. Since a dataset annotated with this kind of information was not available, we collected and annotated our own dataset to use for training and testing. In the second stage, we resolve the remaining ambiguities, namely, cases when there exists more than one place with the same name. We proposed a set of heuristics able to choose the correct physical location in these cases. Our two-stage model will allow a social media monitoring system to visualize the places mentioned in Twitter messages on a map of the world or to compute statistics about locations. This kind of information can be of interest to business or marketing applications.
引用
收藏
页码:237 / 253
页数:16
相关论文
共 50 条
  • [1] Location detection and disambiguation from twitter messages
    Inkpen, Diana
    Liu, Ji
    Farzindar, Atefeh
    Kazemi, Farzaneh
    Ghazi, Diman
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (02) : 237 - 253
  • [2] Inferring the Location of Twitter Messages Based on User Relationships
    Davis, Clodoveu A., Jr.
    Pappa, Gisele L.
    Rocha de Oliveira, Diogo Renno
    Arcanjo, Filipe de L.
    TRANSACTIONS IN GIS, 2011, 15 (06) : 735 - 751
  • [3] Geocoding location expressions in Twitter messages: A preference learning method
    Zhang, Wei
    Gelernter, Judith
    JOURNAL OF SPATIAL INFORMATION SCIENCE, 2014, (09): : 37 - 70
  • [4] Detecting Locations from Twitter Messages
    Inkpen, Diana
    ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091
  • [5] Location Extraction from Twitter Messages using Bidirectional Long Short-Term Memory Model
    Chen, Zi
    Pokharel, Badal
    Li, Bingnan
    Lim, Samsung
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON GEOGRAPHICAL INFORMATION SYSTEMS THEORY, APPLICATIONS AND MANAGEMENT (GISTAM), 2020, : 45 - 50
  • [6] Predicting Signs of Depression from Twitter Messages
    Mahasiriakalayot, Suwaroj
    Senivongse, Twittie
    Taephant, Nattasuda
    2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022), 2022,
  • [7] Cluster-discovery of Twitter messages for event detection and trending
    Kaleel, Shakira Banu
    Abhari, Abdolreza
    JOURNAL OF COMPUTATIONAL SCIENCE, 2015, 6 : 47 - 57
  • [8] Infectious Twitter messages
    Lessky-Hoehl, Renate
    PADIATRIE UND PADOLOGIE, 2023, 58 (03): : 142 - 143
  • [9] Twitter Trending Topics Meaning Disambiguation
    Han, Soyeon Caren
    Chung, Hyunsuk
    Kim, Do Hyeong
    Lee, Sungyoung
    Kang, Byeong Ho
    KNOWLEDGE MANAGEMENT AND ACQUISITION FOR SMART SYSTEMS AND SERVICES, PKAW 2014, 2014, 8863 : 126 - 137
  • [10] Location Extraction from Social Media: Geoparsing, Location Disambiguation, and Geotagging
    Middleton, Stuart E.
    Kordopatis-Zilos, Giorgos
    Papadopoulos, Symeon
    Kompatsiaris, Yiannis
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2018, 36 (04)