Inferring the Location of Twitter Messages Based on User Relationships

被引:113
|
作者
Davis, Clodoveu A., Jr. [1 ]
Pappa, Gisele L. [1 ]
Rocha de Oliveira, Diogo Renno [1 ]
Arcanjo, Filipe de L. [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Ciencia Comp, Belo Horizonte, MG, Brazil
关键词
D O I
10.1111/j.1467-9671.2011.01297.x
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
User interaction in social networks, such as Twitter and Facebook, is increasingly becoming a source of useful information on daily events. The online monitoring of short messages posted in such networks often provides insight on the repercussions of events of several different natures, such as (in the recent past) the earthquake and tsunami in Japan, the royal wedding in Britain and the death of Osama bin Laden. Studying the origins and the propagation of messages regarding such topics helps social scientists in their quest for improving the current understanding of human relationships and interactions. However, the actual location associated to a tweet or to a Facebook message can be rather uncertain. Some tweets are posted with an automatically determined location (from an IP address), or with a user-informed location, both in text form, usually the name of a city. We observe that most Twitter users opt not to publish their location, and many do so in a cryptic way, mentioning non-existing places or providing less specific place names (such as "Brazil"). In this article, we focus on the problem of enriching the location of tweets using alternative data, particularly the social relationships between Twitter users. Our strategy involves recursively expanding the network of locatable users using following-follower relationships. Verification is achieved using cross-validation techniques, in which the location of a fraction of the users with known locations is used to determine the location of the others, thus allowing us to compare the actual location to the inferred one and verify the quality of the estimation. With an estimate of the precision of the method, it can then be applied to locationless tweets. Our intention is to infer the location of as many users as possible, in order to increase the number of tweets that can be used in spatial analyses of social phenomena. The article demonstrates the feasibility of our approach using a dataset comprising tweets that mention keywords related to dengue fever, increasing by 45% the number of locatable tweets.
引用
收藏
页码:735 / 751
页数:17
相关论文
共 50 条
  • [41] Detecting Locations from Twitter Messages
    Inkpen, Diana
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091
  • [42] Geotagging Twitter Messages in Crisis Management
    Ghahremanlou, Lida
    Sherchan, Wanita
    Thom, James A.
    [J]. COMPUTER JOURNAL, 2015, 58 (09): : 1937 - 1954
  • [43] TUMS: Twitter-Based User Modeling Service
    Tao, Ke
    Abel, Fabian
    Gao, Qi
    Houben, Geert-Jan
    [J]. SEMANTIC WEB: ESWC 2011 WORKSHOPS, 2012, 7117 : 269 - 283
  • [44] Text-Based Twitter User Geolocation Prediction
    Han, Bo
    Cook, Paul
    Baldwin, Timothy
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 49 : 451 - 500
  • [45] Automatic Sentiment Analysis of Twitter Messages
    Lima, Ana C. E. S.
    de Castro, Leandro N.
    [J]. 2012 FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ASPECTS OF SOCIAL NETWORKS (CASON), 2012, : 52 - 57
  • [46] Twitter User Clustering Based on Their Preferences and the Louvain Algorithm
    Lopez Sanchez, Daniel
    Revuelta, Jorge
    De la Prieta, Fernando
    Gil-Gonzalez, Ana B.
    Cach Dang
    [J]. TRENDS IN PRACTICAL APPLICATIONS OF SCALABLE MULTI-AGENT SYSTEMS, THE PAAMS COLLECTION, 2016, 473 : 349 - 356
  • [47] Twitter Message Recommendation Based on User Interest Profiles
    Makki, Raheleh
    Soto, Axel J.
    Brooks, Stephen
    Milios, Evangelos E.
    [J]. PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, 2016, : 406 - 410
  • [48] TURank: Twitter User Ranking Based on User-Tweet Graph Analysis
    Yamaguchi, Yuto
    Takahashi, Tsubasa
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    [J]. WEB INFORMATION SYSTEM ENGINEERING-WISE 2010, 2010, 6488 : 240 - 253
  • [49] User Anonymity on Twitter
    Peddinti, Sai Teja
    Ross, Keith W.
    Cappos, Justin
    [J]. IEEE SECURITY & PRIVACY, 2017, 15 (03) : 84 - 87
  • [50] Messages in Distance and Location
    魏芳云
    [J]. 前沿, 1995, (07) : 16 - 20