Discovering Communities with Self-adaptive k Clustering in Microblog Data

被引:3
|
作者
Huang, Ting [1 ]
Peng, Dunlu [1 ]
Cao, Lidong [1 ]
机构
[1] Shanghai Univ Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 201800, Peoples R China
关键词
microblogging; clustering; adaptive k; community recognition; social network;
D O I
10.1109/CGC.2012.92
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, microblogging has been a popular social network service whose population has incredibly increased in past few years. Many business companies regard microblogging service as an indispensable medium to directly obtain timely opinions from customers and potential customers. A community in social network refers to a crowd of people having similar interests or paying their attention on same things. User community recognition in microblogging social network service is very important for identifying hot topics or users' interests which are very helpful for companies to improve their marketing strategies. However, the massive non-structural tweet data brings tremendous challenge for efficiently mining the valuable communities hidden in it. Tweet data is characterized as containing massive information, being involved in large fields, short-length and non-structure. This makes tweets quite different from the conventional text documents. In order to analyze the data more effectively, in this paper, we propose a set of techniques to preprocess tweets, such as word identification, categories matching and data standardization. An unsupervised learning method has been presented to automatically cluster microblog users into different communities. In the method, an optimized CLARANS algorithm has been developed according to the characteristics of microblog data. During the process of clustering, the interactive relationship between tweets is also exploited to improve the clustering quality. In addition, a self-adaptive k strategy is employed to make the proposed approach more applicable. In order to investigate the performance of our approach from different aspects, we conducted a series of experiments with the microblog data collected from SINA Weibo.
引用
收藏
页码:383 / 390
页数:8
相关论文
共 50 条
  • [41] Self-Adaptive Two-phase Support Vector Clustering for multi-relational data mining
    Ling, Ping
    Wang, Yan
    Zhou, Chun-Guang
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 225 - 229
  • [42] Infrared Image Segmentation Algorithm Using Histogram-Based Self-adaptive K-means Clustering
    Zhao, Zhiqiang
    Ling, Xin
    Wu, Jian
    Rui, Xiaoyong
    [J]. PROCEEDINGS OF THE 2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND AUTOMATION ENGINEERING, 2016, 42 : 682 - 688
  • [43] Grain Classification using Hierarchical Clustering and Self-Adaptive Neural Network
    Chen Xiao
    Chen Tao
    Xun Yi
    Li Wei
    Tan Yuzhi
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 4415 - 4418
  • [44] A self-adaptive graph-based clustering method with noise identification
    Lin Li
    Xiang Chen
    Chengyun Song
    [J]. Pattern Analysis and Applications, 2023, 26 (3) : 907 - 916
  • [45] A self-adaptive graph-based clustering method with noise identification
    Li, Lin
    Chen, Xiang
    Song, Chengyun
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 907 - 916
  • [46] Research on Weibo Hotspot Finding Based on Self-Adaptive Incremental Clustering
    宋慧琳
    彭迪云
    黄欣
    冯俊
    [J]. Journal of Shanghai Jiaotong University(Science), 2019, 24 (03) : 364 - 371
  • [47] OBJECT RECOGNITION BASED ON ORB AND SELF-ADAPTIVE KERNEL CLUSTERING ALGORITHM
    Zhang, Yazhong
    Miao, Zhenjiang
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 1397 - 1402
  • [48] Fair Self-Adaptive Clustering for Hybrid Cellular-Vehicular Networks
    Garbiso, Julian
    Diaconescu, Ada
    Coupechoux, Marceau
    Leroy, Bertrand
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) : 1225 - 1236
  • [49] Research on Weibo Hotspot Finding Based on Self-Adaptive Incremental Clustering
    Song H.
    Peng D.
    Huang X.
    Feng J.
    [J]. Journal of Shanghai Jiaotong University (Science), 2019, 24 (03) : 364 - 371
  • [50] Image Fusion Algorithm Based on Self-adaptive Fuzzy Clustering Method
    Zhang Hong
    Sun XiaoNan
    Sun YanFeng
    Liu Lei
    [J]. NCM 2008 : 4TH INTERNATIONAL CONFERENCE ON NETWORKED COMPUTING AND ADVANCED INFORMATION MANAGEMENT, VOL 1, PROCEEDINGS, 2008, : 446 - 449