Strategies for combining Twitter users geo-location methods

被引:10
|
作者
Ribeiro, Silvio, Jr. [1 ]
Pappa, Gisele L. [1 ]
机构
[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil
关键词
Location inference; Twitter; Social networks; Geoinference; Methods combination;
D O I
10.1007/s10707-017-0296-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Twitter has become a major player in the social media scene with over half billion users and over 500 million tweets published daily. With this abundant data, researchers saw the opportunity to explore this data for monitoring events and tracking epidemics. In this type of application, knowing the location of the user is essential. However, most of the information about location self-reported by users is difficult to process, and barely 1% of all published tweets are geolocated. Hence, user location inference is often performed by analyzing public available information from the user profile and his tweets. In this work, we evaluate and compare 16 approaches for user location inference based on different information sources that include interaction networks and text from tweets. We show that methods working with the user friendship network obtain higher values of accuracy and recall when compared to the other methods. From these results, we verify the agreement of pairs of methods regarding the predicted location and the users they cover. We find out that most methods disagree in their inferences while covering different sets of users. These results open up an opportunity to combine different methods in order to improve location accuracy and user recall. We propose four methods for combining the outputs of the evaluated methods. Two of them, one based on a weighting vote scheme (GAVe) and another based on a meta decision tree cover at least 98% of the users in the dataset, while location 75% of them within a distance of 100 km from their real location.
引用
下载
收藏
页码:563 / 587
页数:25
相关论文
共 50 条
  • [41] Analysis of SAR image geo-location accuracy for mapping
    Gonçalves, JA
    SAR IMAGE ANALYSIS, MODELING, AND TECHNIQUES VI, 2004, 5236 : 190 - 199
  • [42] Geo-Location Data Privacy Issues - Key takeaways
    PROCEEDINGS OF THE 29TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2016), 2016, : 1051 - 1063
  • [43] Indoor Geo-location Approach for Dense Multipath Environments
    Kabir, Md. Humayun
    Randrianandraina, Holitiana
    Sugimoto, Chika
    Kohno, Ryuji
    2013 IEEE 78TH VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2013,
  • [44] Compressive Spectrum Sensing Augmented by Geo-location Database
    Qin, Zhijin
    Wei, Lin
    Gao, Yue
    Parini, Clive G.
    2015 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), 2015, : 170 - 175
  • [45] Practical Implementation of Geo-location TVWS Database for Ethiopia
    Hussien, Habib M.
    Katzis, Konstantinos
    Mfupe, Luzango P.
    Bekele, Ephrem T.
    Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST, 2021, 385 : 495 - 510
  • [46] Geo-location estimation from two shadow trajectories
    Wu, Lin
    Cao, Xiaochun
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 585 - 590
  • [47] Construction of a Geo-Location Service Utilizing Microblogging Platforms
    Bassi, Jonathan
    Manna, Sukanya
    Sun, Yu
    2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2016, : 162 - 165
  • [48] Vector map geo-location using GPS tracks
    Bao, Yuanlu
    Xu, Hao
    Liu, Zhenan
    GEOINFORMATICS 2007: CARTOGRAPHIC THEORY AND MODELS, 2007, 6751
  • [49] Missile geo-location using missile borne SAR
    Qin, Yu Liang
    Deng, Bin
    Wang, Hong Qiang
    Li, Xiang
    SECOND INTERNATIONAL CONFERENCE ON SPACE INFORMATION TECHNOLOGY, PTS 1-3, 2007, 6795
  • [50] The Challenge of Creating Geo-Location Markup for Digital Books
    Hinze, Annika
    Bainbridge, David
    Cunningham, Sally Jo
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, TPDL 2016, 2016, 9819 : 294 - 306