A Scalable and Dynamic Self-Organizing Map for Clustering Large Volumes of Text Data

Authors
Matharage, Sumith [2]
Ganegedara, Hiran [1]
Alahakoon, Damminda [2]
Affiliations
[1] Monash Univ, Fac IT, Cognit & Connectionist Syst Lab, Clayton, Vic, Australia
[2] Deakin Univ, Sch Informat Syst, Geelong, Vic 3217, Australia
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Self Organizing Map (SOM) and the Growing Self Organizing Map (GSOM) are widely used techniques for text mining. Mining large text data sets is significantly processor intensive [1]. Recently, the Fast Growing Self Organizing Map (FastGSOM) was proposed as an improvement to the GSOM for clustering text data more efficiently [2]. For text corpora with thousands of documents, the time requirement could still be a bottleneck, with high turnaround times for the analysis process. We propose a new scalable parallel algorithm for text analysis using FastGSOM, which can harness the power of parallel and distributed computing for efficient analysis of large-scale text datasets. We demonstrate that the proposed algorithm has similar or better accuracy compared to GSOM and is several orders more efficient when operating in parallel.
Pages: 8