An algorithm for high-dimensional traffic data clustering

被引:0
|
作者
Zheng, Pengjun [1 ]
McDonald, Mike [1 ]
机构
[1] Univ Southampton, Transportat Res Grp, Southampton SO17 1BJ, Hants, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional fuzzy clustering may converge to a local optimum that is significantly inferior to the global optimal partition. In this paper, a two-stage fuzzy clustering method is proposed. In the first stage, clustering is applied on the compact data that is obtained by dimensionality reduction from the full-dimensional data. The optimal partition identified from the compact data is then used as the initial partition in the second stage clustering based on full-dimensional data, thus effectively reduces the possibility of local optimum. It is found that the proposed two-stage clustering method can generally avoid local optimum without computation overhead. The proposed method has been applied to identify optimal day groups for traffic profiling using operational traffic data. The identified day groups are found to be intuitively reasonable and meaningful.
引用
收藏
页码:59 / 68
页数:10
相关论文
共 50 条
  • [31] Clustering of imbalanced high-dimensional media data
    Brodinova, Sarka
    Zaharieva, Maia
    Filzmoser, Peter
    Ortner, Thomas
    Breiteneder, Christian
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (02) : 261 - 284
  • [32] Clustering of imbalanced high-dimensional media data
    Šárka Brodinová
    Maia Zaharieva
    Peter Filzmoser
    Thomas Ortner
    Christian Breiteneder
    Advances in Data Analysis and Classification, 2018, 12 : 261 - 284
  • [33] The Role of Hubness in Clustering High-Dimensional Data
    Tomasev, Nenad
    Radovanovic, Milos
    Mladenic, Dunja
    Ivanovic, Mirjana
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 183 - 195
  • [34] An effective clustering scheme for high-dimensional data
    Xuansen He
    Fan He
    Yueping Fan
    Lingmin Jiang
    Runzong Liu
    Allam Maalla
    Multimedia Tools and Applications, 2024, 83 : 45001 - 45045
  • [35] A classification algorithm for high-dimensional data
    Roy, Asim
    INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 345 - 355
  • [36] High-dimensional shared nearest neighbor clustering algorithm
    Yin, J
    Fan, XL
    Chen, YQ
    Ren, JT
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 494 - 502
  • [37] A gravity inspired clustering algorithm for gene selection from high-dimensional microarray data
    Jayashree, P.
    Brindha, V.
    Karthik, P.
    IMAGING SCIENCE JOURNAL, 2024, 72 (04): : 421 - 435
  • [38] A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
    Song, Qinbao
    Ni, Jingjie
    Wang, Guangtao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) : 1 - 14
  • [39] HSCFC: High-dimensional streaming data clustering algorithm based on feedback control system
    Ding, Guohui
    Wang, Yankai
    Li, Chenyang
    Sun, Haohan
    Li, Cailong
    Wang, Lei
    Yin, Haijun
    Huang, Tiantian
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 146 : 156 - 165
  • [40] A Valid Clustering Algorithm for High-dimensional Large Data Sets Based on Distributed Method
    Guo Xian e
    Yan Junmei
    PROCEEDINGS OF 2009 INTERNATIONAL WORKSHOP ON INFORMATION SECURITY AND APPLICATION, 2009, : 1 - 6