Feature Selection-Based Clustering on Micro-blogging Data

被引:6
|
作者
Dutta, Soumi [1 ,2 ]
Ghatak, Sujata [1 ]
Das, Asit Kumar [2 ]
Gupta, Manan [1 ]
Dasgupta, Sayantika [1 ]
机构
[1] Inst Engn & Management, Kolkata 700091, India
[2] Indian Inst Engn Sci & Technol Shibpur, Howrah 711103, India
关键词
Clustering; Feature selection; Micro-blogs;
D O I
10.1007/978-981-10-8055-5_78
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing popularity of micro-blogging phenomena opens up a flexible platform for the public as communication media for the public. For any trending/non-trending topic, thousands of post are posted daily in micro-blogs. During any important event, such as natural calamity and election, and sports event, such as IPL and World Cup, a huge number of messages (micro-blogs) are posted. Due to fast and huge exchange of messages causes information overload, hence clustering or grouping similar messages is an effective way to reduce that. Less content and noisy nature of messages are challenging factor in micro-blog data clustering. Incremental huge data is another challenge to clustering. So, in this work, a novel clustering approach is proposed for micro-blogs combining feature selection technique. The proposed approach has been applied to several experimental dataset, and it is compared with several existing clustering techniques which results in better outcome than other methods.
引用
收藏
页码:885 / 895
页数:11
相关论文
共 50 条
  • [21] Topology-Based Recommendation of Users in Micro-Blogging Communities
    Marcelo G. Armentano
    Daniela Godoy
    Analía Amandi
    Journal of Computer Science and Technology, 2012, 27 : 624 - 634
  • [22] Spammer Detection Based on Hidden Markov Model in Micro-Blogging
    Dang, Qi
    Gao, Few
    Zhou, Yadong
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 407 - 412
  • [23] Predicting Active Users' Personality Based on Micro-Blogging Behaviors
    Li, Lin
    Li, Ang
    Hao, Bibo
    Guan, Zengda
    Zhu, Tingshao
    PLOS ONE, 2014, 9 (01):
  • [24] Followee recommendation based on text analysis of micro-blogging activity
    Armentano, M. G.
    Godoy, D.
    Amandi, A. A.
    INFORMATION SYSTEMS, 2013, 38 (08) : 1116 - 1127
  • [25] Detection of Financial Opportunities in Micro-Blogging Data With a Stacked Classification System
    De Arriba-Perez, Francisco
    Garcia-Mendez, Silvia
    Regueiro-Janeiro, Jose Angel
    Gonzalez-Castano, Francisco J.
    IEEE ACCESS, 2020, 8 : 215679 - 215690
  • [26] A Framework for Peer-to-Peer Micro-blogging
    Asthana, H.
    Cox, Ingemar J.
    2013 33RD IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW 2013), 2013, : 184 - 189
  • [27] Unsupervised topic discovery in micro-blogging networks
    Vicient, Carlos
    Moreno, Antonio
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (17-18) : 6472 - 6485
  • [28] The Impact of Longstanding Messages In Micro-Blogging Classification
    Costa, Joana
    Silva, Catarina
    Antunes, Mario
    Ribeiro, Bernardete
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [29] Scalable Micro-blogging in Mobile Social Communities
    Narmawala, Zunnun
    Srivastava, Sanjay
    2015 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2015, : 66 - 69
  • [30] Modeling User's Temporal Dynamic Profile in Micro-blogging using Clustering Method
    Li, Runsheng
    He, Wei
    Wu, Zhenyu
    Hu, Jun
    Liu, Yu
    2012 6TH INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION SCIENCE, SERVICE SCIENCE AND DATA MINING (ISSDM2012), 2012, : 808 - 812