Using swarm intelligence for XML clustering

Cited by: 0
Authors
Wang, Tong [1]
Liu, Daxin [1]
Lin, Xuanzuo [2]
Sun, Xiaohua [3]
Affiliations
[1] Harbin Engn Univ, Harbin 150001, Peoples R China
[2] Northeast Agr Univ, Harbin 150036, Peoples R China
[3] Harbin Univ Sci & Technol, Harbin 150080, Peoples R China
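Keywords
clustering; swarm intelligence; data mining; XML
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract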
Mining large sets of XML documents makes the documents easier to query and manage. This paper proposes a novel XML document clustering method based on swarm intelligence. The approach first extracts path sequences from the documents and then transforms the documents into vectors in a high-dimensional Euclidean space. Finally, the CSX clustering method is applied to the resulting vectors with high performance. The advantage of the approach is that swarm intelligence can help the search escape local optima of the search space. The data sets are obtained from DBLP, and the experimental results show that the proposed technique outperforms the standard C-means method in cluster compactness and accuracy.
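The first two steps of the abstract (path extraction and vectorization) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say how path sequences are defined or weighted, so the sketch assumes root-to-leaf tag paths and raw occurrence counts. The function names `extract_paths` and `vectorize` are hypothetical, and the CSX clustering step is not shown because the abstract does not describe it.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def extract_paths(xml_text):
    """Count the root-to-leaf tag paths in one XML document (assumed path definition)."""
    root = ET.fromstring(xml_text)
    paths = Counter()

    def walk(node, prefix):
        path = prefix + "/" + node.tag
        children = list(node)
        if not children:
            # A leaf element closes one complete path.
            paths[path] += 1
        for child in children:
            walk(child, path)

    walk(root, "")
    return paths


def vectorize(docs):
    """Map each document to a count vector over the shared path vocabulary."""
    counts = [extract_paths(d) for d in docs]
    vocab = sorted(set().union(*counts))
    vectors = [[float(c[p]) for p in vocab] for c in counts]
    return vocab, vectors


docs = [
    "<article><author/><title/></article>",
    "<book><author/><author/></book>",
]
vocab, vectors = vectorize(docs)
```

Each document becomes one point in a Euclidean space with one dimension per distinct path, which is the kind of input a centroid-based method such as C-means or a swarm-based optimizer can then cluster.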
Pages: 6000 / +
Page count: 2