Detecting Marionette Microblog Users for Improved Information Credibility

Cited by: 4
Authors
Wu, Xian [1]
Fan, Wei [2]
Gao, Jing [3]
Feng, Zi-Ming [1]
Yu, Yong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai 200240, Peoples R China
[2] Baidu Res Big Data Lab, Sunnyvale, CA 94089 USA
[3] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14214 USA
Keywords
marionette microblog user; information credibility; fake follower; fake retweet;
DOI
10.1007/s11390-015-1584-4
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose to detect a special group of microblog users: "marionette" users, who are created or employed by backstage "puppeteers", either through programs or manually. Unlike normal users, who use microblogs for information sharing or social communication, marionette users perform specific tasks for financial profit. For example, they follow certain users to inflate those users' "statistical popularity", or retweet certain tweets to amplify their "statistical impact". The fabricated follower and retweet counts not only lead normal users to wrong information but also seriously impair microblog-based applications such as hot tweet selection and expert finding. Detecting marionette users is challenging because puppeteers employ complicated strategies to generate marionette users that behave much like normal users. To tackle this challenge, we take into account two types of discriminative information: 1) individual user tweeting behavior and 2) the social interactions among users. By integrating both types of information into a semi-supervised probabilistic model, we can effectively distinguish marionette users from normal ones. Applying the proposed model to Sina Weibo, one of the most popular microblog platforms in China, we find that it detects marionette users with an F-measure close to 0.9. In addition, we apply the model to calculate the marionette ratio for the top 200 most followed microbloggers and the top 50 most retweeted posts in Sina Weibo. To speed up detection and reduce the cost of feature generation, we further propose a lightweight model that uses fewer features to identify marionettes among retweeters.
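The two evaluation quantities named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names are hypothetical, the F-measure is assumed to be the balanced F1 score, and the marionette ratio is assumed to be the fraction of an account's followers (or a post's retweeters) that a detector flags.

```python
def f_measure(tp: int, fp: int, fn: int) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    tp, fp, fn are counts of true positives, false positives and
    false negatives for the "marionette" class.
    """
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def marionette_ratio(flags: list[bool]) -> float:
    """Fraction of accounts flagged as marionettes (assumed definition).

    flags holds one detector decision per follower or retweeter.
    """
    if not flags:
        return 0.0
    return sum(flags) / len(flags)
```

For instance, a detector with 90 true positives, 10 false positives and 10 false negatives has precision and recall of 0.9 each, and therefore an F-measure of 0.9.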
Pages: 1082-1096
Number of pages: 15