Classifying Social Media Users with Machine Learning

被引:0
|
作者
Li G. [1 ]
Zhou H. [1 ]
Mao J. [1 ]
Chen S. [1 ]
机构
[1] Center for Studies of Information Resources, Wuhan University, Wuhan
关键词
Feature Extraction; Machine Learning; SVM; User Classification;
D O I
10.11925/infotech.2096-3467.2018.1207
中图分类号
学科分类号
摘要
[Objective] This paper uses multi-dimensional information of social media users to automatically classify them. [Methods] First, we defined social media users as individual, media, government, and organization. Then, we extracted the following features from user profiles: demographic characteristics, namings, and self-descriptions. Third, we created a user classification models based on machine learning algorithms and evaluated its performance with real Twitter dataset. [Results] Both precision and recall of the proposed model were greater than 83%. The naming, demographic characteristics, and self-description features posed increasing contributions to the classification model. [Limitations] The sample size needs to be expanded, which helps us better analyzed the characteristics of different users. [Conclusions] The proposed method could accurately identify four types of users, which benefits social media user classification research in the future. © 2019 The Author(s).
引用
收藏
页码:1 / 9
页数:8
相关论文
共 30 条
  • [1] Social Network Service
  • [2] Boyd D M, Ellison N B., Social Network Sites: Definition, History, and Scholarship[J], Journal of Computer Mediated Communication, 13, 1, pp. 210-230, (2008)
  • [3] Digital in 2018
  • [4] He Chaobo, Tang Yong, Mai Huiqiang, Et al., A Survey on Online Social Network Mining, Journal of Wuhan University: Natural Science Edition, 60, 3, pp. 189-200, (2014)
  • [5] Chen Jiawei, Exploring the Sense of Community for an Online Sport Community: A Case Study of Nippon Professional Baseball Club, (2006)
  • [6] Gomez-Rodriguez M, Leskovec J, Krause A., Inferring Network of Diffusion and Influence[C], Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1019-1028, (2010)
  • [7] Deng Sanhong, Liu Xiwen, Jiang Xun, Constructing Cases Knowledge Base of Emergency Based on Stakeholder’s Theory, Library & Information, 3, pp. 1-8, (2015)
  • [8] Mu Tao, Chen Wei, Chen Songjian, User Classification Method Based on Multi-Layer Network Traffic Analysis, Journal of Computer Applications, 37, 3, pp. 705-710, (2017)
  • [9] Su Zhaohui, Customer Relationship Management, pp. 14-16, (2016)
  • [10] He Chaobo, Yang Zhenxiong, Hong Shaowen, Et al., User Classification Method in Online Social Network Using Random Walks, Computer Science, 42, 2, pp. 198-202, (2015)