Detecting Marionette Microblog Users for Improved Information Credibility

被引:4
|
作者
Wu, Xian [1 ]
Fan, Wei [2 ]
Gao, Jing [3 ]
Feng, Zi-Ming [1 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai 200240, Peoples R China
[2] Baidu Res Big Data Lab, Sunnyvale, CA 94089 USA
[3] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14214 USA
关键词
marionette microblog user; information credibility; fake follower; fake retweet;
D O I
10.1007/s11390-015-1584-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose to detect a special group of microblog users: the "marionette" users, who are created or employed by backstage "puppeteers", either through programs or manually. Unlike normal users that access microblog for information sharing or social communication, the marionette users perform specific tasks to earn financial profits. For example, they follow certain users to increase their "statistical popularity", or retweet some tweets to amplify their "statistical impact". The fabricated follower or retweet counts not only mislead normal users to wrong information, but also seriously impair microblog-based applications, such as hot tweets selection and expert finding. In this paper, we study the important problem of detecting marionette users on microblog platforms. This problem is challenging because puppeteers are employing complicated strategies to generate marionette users that present similar behaviors as normal users. To tackle this challenge, we propose to take into account two types of discriminative information: 1) individual user tweeting behavior and 2) the social interactions among users. By integrating both information into a semi-supervised probabilistic model, we can effectively distinguish marionette users from normal ones. By applying the proposed model to one of the most popular microblog platforms (Sina Weibo) in China, we find that the model can detect marionette users with F-measure close to 0.9. In addition, we apply the proposed model to calculate the marionette ratio of the top 200 most followed microbloggers and the top 50 most retweeted posts in Sina Weibo. To accelerate the detecting speed and reduce feature generation cost, we further propose a light-weight model which utilizes fewer features to identify marionettes from retweeters.
引用
收藏
页码:1082 / 1096
页数:15
相关论文
共 50 条
  • [1] Detecting Marionette Microblog Users for Improved Information Credibility
    Xian Wu
    Wei Fan
    Jing Gao
    Zi-Ming Feng
    Yong Yu
    Journal of Computer Science and Technology, 2015, 30 : 1082 - 1096
  • [2] A Text Analysis Based Method for Obtaining Credibility Assessment of Chinese Microblog Users
    Ma, Zhaoyi
    Gao, Qin
    SOCIAL COMPUTING AND SOCIAL MEDIA: TECHNOLOGIES AND ANALYTICS, SCSM 2018, PT II, 2018, 10914 : 229 - 235
  • [3] Detecting prominent microblog users over crisis events phases
    Bizid, Imen
    Nayef, Nibal
    Boursier, Patrice
    Doucet, Antoine
    INFORMATION SYSTEMS, 2018, 78 : 173 - 188
  • [4] Modeling information diffusion on microblog networks based on users' behaviors
    Liu Hong-Li
    Huang Ya-Li
    Luo Chun-Hai
    Hu Hai-Bo
    ACTA PHYSICA SINICA, 2016, 65 (15)
  • [5] Incorporating message format into user evaluation of microblog information credibility: A nonlinear perspective
    Yin, Chunxiao
    Zhang, Xiaofei
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [6] Finding high-influence microblog users with an improved PSO algorithm
    Zhang, Biao
    Zhong, Shuai
    Wen, Kunmei
    Li, Ruixuan
    Gu, Xiwu
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2013, 18 (04) : 349 - 356
  • [7] Information Consolidation on Users of Social Networks to Determine Their Credibility
    Markovets, Oleksandr
    Albota, Solomiia
    Horpyniuk, Oksana
    COLINS 2021: COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS, VOL I, 2021, 2870
  • [8] Argumentation Graphical Model for Microblog Credibility Assessment
    Huang Q.-S.
    Dai D.
    Feng X.-P.
    Fu X.-D.
    Liu L.
    Liu L.-J.
    1600, Univ. of Electronic Science and Technology of China (46): : 392 - 398
  • [9] Trustworthiness Criteria for Supporting users to Assess the Credibility of Web Information
    Pattanaphanchai, Jarutas
    O'Hara, Kieron
    Hall, Wendy
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 1123 - 1130
  • [10] Information Credibility Evaluation in Presence of Users' Safety in New Retailing
    Wang, Dong
    Wang, Kehong
    Yan, Lemei
    Yue, Zeyu
    Zhang, Jiewen
    JOURNAL OF WEB ENGINEERING, 2021, 20 (03): : 641 - 667