WTCA: A Web Text Clustering Algorithm Based on DFSSM

被引:0
|
作者
Zheng, Yu [1 ]
Rong, Qian [2 ]
机构
[1] Northeast Forestry Univ, Coll Sci, Harbin 150040, Peoples R China
[2] Beijing Elect Sci & Technol Inst, Dept Comp Sci, Beijing 100070, Peoples R China
关键词
Web text mining; Clustering analysis; SOM; Richly structured datasets;
D O I
10.1109/CHICC.2008.4605816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A key challenge of data mining is to tackling the problem of mining richly structured datasets such as Web pages. In this paper, we propose a Web text clustering algorithm (WTCA) based on DFSSM, which is our original work. The algorithm includes the training stage of SOM and the clustering stage. It can distinguish the most meaningful features from the Concept Space without the evaluation function. We applied the algorithm to the Chinese Modem Long-distance Education Network, and compared our work with some. popular clustering algorithms. The, experimental results show that the average accuracy of WTCA is better than that of the other three algorithms.
引用
收藏
页码:811 / +
页数:3
相关论文
共 50 条
  • [1] DFSSM Based Web Text Clustering Algorithm
    Qian, Rong
    Zhang, Kejun
    Zhao, Xiaorong
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 703 - 707
  • [2] A New Web Text Clustering Algorithm Based on DFSSM
    Yang, Bingru
    Song, Zefeng
    Wang, Yinglong
    Song, Wei
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 27 - 32
  • [3] Fuzzy Set Based Clustering Algorithm of Web Text
    Wan, Hongxin
    Peng, Yun
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 19 - +
  • [4] Massive Data Mining Algorithm for Web Text Based on Clustering Algorithm
    Luo, Nan-Chao
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (02) : 362 - 365
  • [5] Fuzzy Set Based Web Opinion Text Clustering Algorithm
    Wan, Hongxin
    Peng, Yun
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 2604 - 2607
  • [6] An Algorithm of Web Text Clustering Analysis Based on Fuzzy Set
    Peng, Yun
    Ding, Shu-liang
    [J]. ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 109 - 113
  • [7] Graph based AHC Algorithm for Text Clustering
    Jo, Taeho
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 309 - 314
  • [8] Text clustering algorithm based on lexical graph
    Sha, Yun
    Zhang, Guoying
    Jiang, Huina
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 277 - 281
  • [9] Link-Based Clustering Algorithm for Clustering Web Documents
    Ashokkumar, P.
    Don, S.
    [J]. JOURNAL OF TESTING AND EVALUATION, 2019, 47 (06) : 4096 - 4107
  • [10] CAS based clustering algorithm for Web users
    Miao Wan
    Lixiang Li
    Jinghua Xiao
    Yixian Yang
    Cong Wang
    Xiaolei Guo
    [J]. Nonlinear Dynamics, 2010, 61 : 347 - 361