Strategies for combining Twitter users geo-location methods

被引:10
|
作者
Ribeiro, Silvio, Jr. [1 ]
Pappa, Gisele L. [1 ]
机构
[1] Univ Fed Minas Gerais, Comp Sci Dept, Belo Horizonte, MG, Brazil
关键词
Location inference; Twitter; Social networks; Geoinference; Methods combination;
D O I
10.1007/s10707-017-0296-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Twitter has become a major player in the social media scene with over half billion users and over 500 million tweets published daily. With this abundant data, researchers saw the opportunity to explore this data for monitoring events and tracking epidemics. In this type of application, knowing the location of the user is essential. However, most of the information about location self-reported by users is difficult to process, and barely 1% of all published tweets are geolocated. Hence, user location inference is often performed by analyzing public available information from the user profile and his tweets. In this work, we evaluate and compare 16 approaches for user location inference based on different information sources that include interaction networks and text from tweets. We show that methods working with the user friendship network obtain higher values of accuracy and recall when compared to the other methods. From these results, we verify the agreement of pairs of methods regarding the predicted location and the users they cover. We find out that most methods disagree in their inferences while covering different sets of users. These results open up an opportunity to combine different methods in order to improve location accuracy and user recall. We propose four methods for combining the outputs of the evaluated methods. Two of them, one based on a weighting vote scheme (GAVe) and another based on a meta decision tree cover at least 98% of the users in the dataset, while location 75% of them within a distance of 100 km from their real location.
引用
下载
收藏
页码:563 / 587
页数:25
相关论文
共 50 条
  • [1] Strategies for combining Twitter users geo-location methods
    Silvio Ribeiro
    Gisele L. Pappa
    GeoInformatica, 2018, 22 : 563 - 587
  • [2] GLDM: Geo-location prediction of twitter users with deep learning methods
    Al-Jamaan, Rawabe
    Yklef, Mourad
    Alothaim, Abdulrahman
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 2723 - 2734
  • [3] A machine learning based approach to identify geo-location of Twitter users
    Onan, Aytug
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, DATA AND CLOUD COMPUTING (ICC 2017), 2017,
  • [4] Automatic Event Geo-Location in Twitter
    Acampora, Giovanni
    Anastasio, Paolo
    Risi, Michele
    Tortora, Genoveffa
    Vitiello, Autilia
    IEEE ACCESS, 2020, 8 : 128213 - 128223
  • [5] Measurement and control of geo-location privacy on Twitter
    Luceri L.
    Andreoletti D.
    Tornatore M.
    Braun T.
    Giordano S.
    Online Social Networks and Media, 2020, 17
  • [6] GLDM: Geo-location prediction of twitter users with deep learning methods (vol 45, pg 2723, 2023)
    Al-Jamaan, R.
    Ykhlef, M.
    Alothaim, A.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 5491 - 5491
  • [7] Spatio-Temporal Multiple Geo-Location Identification on Twitter
    Ghoorchian, Kambiz
    Girdzijauskas, Sarunas
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3412 - 3421
  • [8] Visualization of Twitter Geo-location for Equalization Analysis of Smart Cities in Indonesia
    Juairiah, Ria Siti
    Ubaya, Huda
    Proceedings - 2nd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2020, 2020, : 278 - 283
  • [9] Femtocell Geo-location Challenge
    Abdullah, Labeeb Mohsin
    Baba, Mohd Dani
    Ali, Sinan Ghassan Abid
    Lim, Azman Osman
    Tan, Yasuo
    2014 IEEE 5TH CONTROL AND SYSTEM GRADUATE RESEARCH COLLOQUIUM (ICSGRC), 2014, : 239 - 241
  • [10] The Behaviors of Heterogeneous Users in a Geo-location Database-assisted Cognitive Network
    Li, Tong
    Xie, Gang
    Gao, JinChun
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 637 - 642