A Novel Rough Set Based Clustering Approach for Streaming Data

Authors
Yogita [1]
Toshniwal, Durga [1]
Affiliation
[1] Indian Inst Technol, Roorkee, Uttar Pradesh, India
Keywords
Clustering; Streaming data; Cluster approximation; Rough set
DOI
10.1007/978-81-322-1602-5_131
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering is an important data mining task. Clustering streaming data is challenging because the data cannot be scanned multiple times and new concepts may keep evolving over time. The inherent uncertainty of real-world data streams further magnifies the challenge. Rough set theory is a soft computing technique that can be used to deal with the uncertainty involved in cluster analysis. In this paper, we propose a novel rough set based clustering method for streaming data. It describes a cluster as a pair consisting of a lower approximation and an upper approximation. The lower approximation comprises the data objects that can be assigned to the cluster with certainty, whereas the upper approximation contains the elements of the lower approximation together with those data objects whose membership among the various clusters is not crisp. Uncertainty in assigning a data object to a cluster is captured by allowing upper approximations to overlap. The proposed method therefore generates soft clusters. In view of the challenges of streaming data, the proposed method is incremental and adaptive to evolving concepts. Experimental results on synthetic and real-world data sets show that the proposed approach outperforms the Leader clustering algorithm in terms of classification accuracy. The proposed method generates more natural clusters than k-means clustering and is robust to outliers. Its performance is also analyzed in terms of the correctness and accuracy of rough clustering.
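The lower and upper approximations described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm, which this record does not specify. It assumes a single-pass, leader-style distance threshold and borrows a distance-ratio test of the kind used in rough k-means to decide when an assignment is uncertain. The names `RoughCluster`, `assign`, `radius`, and `ratio` are hypothetical.

```python
import math
from dataclasses import dataclass, field


@dataclass
class RoughCluster:
    """A cluster held as a (lower, upper) pair of object-id sets (illustrative only)."""
    leader: tuple                              # first point that opened the cluster
    lower: set = field(default_factory=set)    # ids assigned with certainty
    upper: set = field(default_factory=set)    # lower ids plus uncertain ids


def assign(clusters, obj_id, point, radius=1.2, ratio=1.5):
    """Process one streamed object in a single pass (assumed rule, not the paper's)."""
    dists = [math.dist(point, c.leader) for c in clusters]
    near = [i for i, d in enumerate(dists) if d <= radius]
    if not near:
        # No existing cluster is close enough: open a new one,
        # which is how this sketch adapts to a newly appearing concept.
        clusters.append(RoughCluster(tuple(point), {obj_id}, {obj_id}))
        return
    best = min(near, key=lambda i: dists[i])
    # Other nearby clusters that are almost as close make the assignment uncertain.
    rivals = [i for i in near if i != best and dists[i] <= ratio * dists[best]]
    if rivals:
        # Uncertain object: it enters several upper approximations, no lower one.
        for i in [best] + rivals:
            clusters[i].upper.add(obj_id)
    else:
        # Certain object: it enters both approximations of a single cluster.
        clusters[best].lower.add(obj_id)
        clusters[best].upper.add(obj_id)


clusters = []
stream = [("a", (0.0, 0.0)), ("b", (2.0, 0.0)), ("c", (0.1, 0.0)), ("d", (1.0, 0.0))]
for obj_id, point in stream:
    assign(clusters, obj_id, point)
```

In this toy stream, object `d` is equidistant from both leaders, so it lands only in the two upper approximations; the overlap between them is what represents the uncertainty. Each object is seen once and no earlier data is revisited, which is the incremental property the abstract refers to.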
Pages: 1253-1265
Page count: 13