Discovering Communities with Self-adaptive k Clustering in Microblog Data

被引:3
|
作者
Huang, Ting [1 ]
Peng, Dunlu [1 ]
Cao, Lidong [1 ]
机构
[1] Shanghai Univ Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 201800, Peoples R China
关键词
microblogging; clustering; adaptive k; community recognition; social network;
D O I
10.1109/CGC.2012.92
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, microblogging has been a popular social network service whose population has incredibly increased in past few years. Many business companies regard microblogging service as an indispensable medium to directly obtain timely opinions from customers and potential customers. A community in social network refers to a crowd of people having similar interests or paying their attention on same things. User community recognition in microblogging social network service is very important for identifying hot topics or users' interests which are very helpful for companies to improve their marketing strategies. However, the massive non-structural tweet data brings tremendous challenge for efficiently mining the valuable communities hidden in it. Tweet data is characterized as containing massive information, being involved in large fields, short-length and non-structure. This makes tweets quite different from the conventional text documents. In order to analyze the data more effectively, in this paper, we propose a set of techniques to preprocess tweets, such as word identification, categories matching and data standardization. An unsupervised learning method has been presented to automatically cluster microblog users into different communities. In the method, an optimized CLARANS algorithm has been developed according to the characteristics of microblog data. During the process of clustering, the interactive relationship between tweets is also exploited to improve the clustering quality. In addition, a self-adaptive k strategy is employed to make the proposed approach more applicable. In order to investigate the performance of our approach from different aspects, we conducted a series of experiments with the microblog data collected from SINA Weibo.
引用
收藏
页码:383 / 390
页数:8
相关论文
共 50 条
  • [21] BIONIC SELF-ADAPTIVE DATA COMPRESSION
    POWELL, BK
    [J]. JOURNAL OF SPACECRAFT AND ROCKETS, 1972, 9 (01) : 48 - &
  • [22] Self-adaptive and dynamic clustering for online anomaly detection
    Lee, Seungmin
    Kim, Gisung
    Kim, Sehun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 14891 - 14898
  • [23] A study on an improved algorithm of self-adaptive clustering network
    Wu, Xiaojun
    Wang, Shitong
    Zheng, Yujie
    Yu, Dongjun
    Su, Dongxue
    Yang, Jingyu
    Ni, Xiuqing
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 804 - 804
  • [24] Self-Adaptive Multiprototype-Based Competitive Learning Approach: A k-Means-Type Algorithm for Imbalanced Data Clustering
    Lu, Yang
    Cheung, Yiu-Ming
    Tang, Yuan Yan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (03) : 1598 - 1612
  • [25] Microblog summarization using self-adaptive multi-objective binary differential evolution
    Naveen Saini
    Sriparna Saha
    Pushpak Bhattacharyya
    [J]. Applied Intelligence, 2022, 52 : 1686 - 1702
  • [26] Discovering communities in linked data by multi-view clustering
    Drost, I
    Bickel, S
    Scheffer, T
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 342 - +
  • [27] Microblog summarization using self-adaptive multi-objective binary differential evolution
    Saini, Naveen
    Saha, Sriparna
    Bhattacharyya, Pushpak
    [J]. APPLIED INTELLIGENCE, 2022, 52 (02) : 1686 - 1702
  • [28] A density-based competitive data stream clustering network with self-adaptive distance metric
    Xu, Baile
    Shen, Furao
    Zhao, Jinxi
    [J]. NEURAL NETWORKS, 2019, 110 : 141 - 158
  • [29] ESA-Stream: Efficient Self-Adaptive Online Data Stream Clustering (Extended Abstract)
    Li, Yanni
    Li, Hui
    Wang, Zhi
    Liu, Bing
    Cui, Jiangtao
    Fei, Hang
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2329 - +
  • [30] Self-Adaptive Clustering of Dynamic Multi-Graph Learning
    Bo Zhou
    Yangding Li
    Xincheng Huang
    Jiaye Li
    [J]. Neural Processing Letters, 2022, 54 : 2533 - 2548