A Scalable and Dynamic Self-Organizing Map for Clustering Large Volumes of Text Data

被引:0
|
作者
Matharage, Sumith [2 ]
Ganegedara, Hiran [1 ]
Alahakoon, Damminda [2 ]
机构
[1] Monash Univ, Fac IT, Cognit & Connectionist Syst Lab, Clayton, Vic, Australia
[2] Deakin Univ, Sch Informat Syst, Geelong, Vic 3217, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self Organizing Map (SOM) and Growing Self Organizing Map (GSOM) are widely used techniques for text mining. Mining large text data sets is significantly processor intensive [1]. Recently Fast Growing Self Organizing Map (FastGSOM) was proposed an improvement to the GSOM for clustering text data more efficiently[2]. For text corpuses with thousands of documents, the time requirement could still be a bottleneck with high turnaround times for the analysis process. We propose a new scalable parallel algorithm for text analysis using FastGSOM which can harness the power of parallel and distributed computing for efficient analysis of large scale text datasets. We demonstrate that the proposed algorithm has similar or better accuracy compared to GSOM and is several orders more efficient when operating in parallel.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] An Enhancing Dynamic Self-Organizing Map for Data Clustering
    Wang, Ting
    Yu, Xinghuo
    Alahakoon, Damminda
    Fei, Shumin
    [J]. 2013 10TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2013, : 1324 - 1329
  • [2] A dynamic linkage method for text data based on self-organizing map
    Hijikata, Y
    Takeuchi, H
    Yoshida, T
    Nishida, S
    [J]. RO-MAN '97 SENDAI: 6TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN COMMUNICATION, PROCEEDINGS, 1997, : 420 - 425
  • [3] ConSOM: A conceptional self-organizing map model for text clustering
    Liu, Yuanchao
    Wang, Xiaolong
    Wu, Chong
    [J]. NEUROCOMPUTING, 2008, 71 (4-6) : 857 - 862
  • [4] Self-Organizing Map for Multi-view Text Clustering
    Fraj, Maha
    Ben Hajkacem, Mohamed Aymen
    Essoussi, Nadia
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2020), 2020, 12393 : 396 - 408
  • [5] Clustering of the self-organizing map
    Vesanto, J
    Alhoniemi, E
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03): : 586 - 600
  • [6] Self-organizing map clustering analysis for molecular data
    Wang, Lin
    Jiang, Minghu
    Lu, Yinghua
    Noe, Frank
    Smith, Jeremy C.
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1250 - 1255
  • [7] BAYESIAN SELF-ORGANIZING MAP FOR DATA CLASSIFICATION AND CLUSTERING
    Guo, Xiaolian
    Wang, Haiying
    Glass, David H.
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2013, 11 (05)
  • [8] An Improved Self-Organizing Map for Bugs Data Clustering
    Ahmed, Attika
    Ghazali, Rozaida
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS (I2CACIS), 2016, : 135 - 140
  • [9] Clustering of the Self-Organizing Time Map
    Sarlin, Peter
    Yao, Zhiyuan
    [J]. NEUROCOMPUTING, 2013, 121 : 317 - 327
  • [10] Gravitational Clustering of the Self-Organizing Map
    Ilc, Nejc
    Dobnikar, Andrej
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT II, 2011, 6594 : 11 - 20