Social Media Analysis using Optimized K-Means Clustering

被引:0
|
作者
Alsayat, Ahmed [1 ]
El-Sayed, Hoda [1 ]
机构
[1] Bowie State Univ, Dept Comp Sci, Bowie, MD 20715 USA
关键词
K-Means; Genetic Algorithm; Clustering; Social Media Analysis; DataMining;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The increasing influence of social media and enormous participation of users creates new opportunities to study human social behavior along with the capability to analyze large amount of data streams. One of the interesting problems is to distinguish between different kinds of users, for example users who are leaders and introduce new issues and discussions on social media. Furthermore, positive or negative attitudes can also be inferred from those discussions. Such problems require a formal interpretation of social media logs and unit of information that can spread from person to person through the social network. Once the social media data such as user messages are parsed and network relationships are identified, data mining techniques can be applied to group different types of communities. However, the appropriate granularity of user communities and their behavior is hardly captured by existing methods. In this paper, we present a framework for the novel task of detecting communities by clustering messages from large streams of social data. Our framework uses K-Means clustering algorithm along with Genetic algorithm and Optimized Cluster Distance (OCD) method to cluster data. The goal of our proposed framework is twofold that is to overcome the problem of general K-Means for choosing best initial centroids using Genetic algorithm, as well as to maximize the distance between clusters by pairwise clustering using OCD to get an accurate clusters. We used various cluster validation metrics to evaluate the performance of our algorithm. The analysis shows that the proposed method gives better clustering results and provides a novel use-case of grouping user communities based on their activities. Our approach is optimized and scalable for real-time clustering of social media data.
引用
收藏
页码:61 / 66
页数:6
相关论文
共 50 条
  • [31] K-means clustering using entropy minimization
    Okafor, A
    Pardalos, PM
    THEORY AND ALGORITHMS FOR COOPERATIVE SYSTEMS, 2004, 4 : 339 - 351
  • [32] Application of ant K-means on clustering analysis
    Kuo, RJ
    Wang, HS
    Hu, TL
    Chou, SH
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2005, 50 (10-12) : 1709 - 1724
  • [33] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    ALGORITHMS, 2018, 11 (10):
  • [34] Improved K-means Clustering Algorithm Based on the Optimized Initial Centriods
    Wang, Shunye
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 450 - 453
  • [35] Analysis Clustering of Electricity Usage Profile Using K-Means Algorithm
    Amri, Yasirli
    Fadhilah, Amanda Lailatul
    Fatmawati
    Setiani, Novi
    Rani, Septia
    INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY FOR SUSTAINABLE DEVELOPMENT (ICET4SD) 2015, 2016, 105
  • [36] NMR metabolic analysis of samples using fuzzy K-means clustering
    Cuperlovic-Culf, Miroslava
    Belacel, Nabil
    Cuif, Adrian S.
    Chute, Ian C.
    Ouellette, Rodney J.
    Burton, Ian W.
    Karakach, Tobias K.
    Walter, John A.
    MAGNETIC RESONANCE IN CHEMISTRY, 2009, 47 : S96 - S104
  • [37] Data Analysis of Educational Evaluation Using K-Means Clustering Method
    Liu, Rui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [38] One optimized choosing method of K-means document clustering center
    Suo, Hongguang
    Nie, Kunming
    Sun, Xin
    Wang, Yuwei
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 490 - 495
  • [39] Optimized K-Means Clustering Algorithm based on Artificial Fish Swarm
    Yu, HaiTao
    Cheng, Xiaoxu
    Jia, Meijuan
    Jiang, Qingfeng
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 1783 - 1787
  • [40] A K-means Optimized Clustering Algorithm Based on Improved Genetic Algorithm
    Pu, Qiu-Mei
    Wu, Qiong
    Li, Qian
    Lecture Notes in Electrical Engineering, 2022, 801 LNEE : 133 - 140