Web page classification based on a simplified swarm optimization

被引:23
|
作者
Lee, Ji-Hyun [1 ]
Yeh, Wei-Chang [2 ]
Chuang, Mei-Chi [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Grad Sch Culture Technol, Taejon 305701, South Korea
[2] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Integrat & Collaborat Lab, Hsinchu 300, Taiwan
关键词
Web page classification; Simplified swarm optimization; Taguchi method; GENETIC ALGORITHM;
D O I
10.1016/j.amc.2015.07.120
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Owing to the incredible increase in the amount of information on the World Wide Web, there is a strong need for an efficient web page classification to retrieve useful information quickly. In this paper, we propose a novel simplified swarm optimization (SSO) to learn the best weights for every feature in the training dataset and adopt the best weights to classify the new web pages in the testing dataset. Moreover, the parameter settings play an important role in the update mechanism of the SSO so that we utilize a Taguchi method to determine the parameter settings. In order to demonstrate the effectiveness of the algorithm, we compare its performance with that of the well-known genetic algorithm (GA), Bayesian classifier, and K-nearest neighbor (KNN) classifiers according to four datasets. The experimental results indicate that the SSO yields better performance than the other three approaches. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:13 / 24
页数:12
相关论文
共 50 条
  • [1] Web Page Classification Using Firefly Optimization
    Sarac, Esra
    Ozel, Selma Ayse
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [2] An Ant Colony Optimization Based Feature Selection for Web Page Classification
    Sarac, Esra
    Ozel, Selma Ayse
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [3] Web page classification based on SVM
    Xue, Weimin
    Bao, Hong
    Xue, Weimin
    Huang, Weitong
    Lu, Yuchang
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 6111 - +
  • [4] Web Page Classification Based on Social Annotations
    Shen, J.
    Xu, F. Y.
    Bi, L.
    Wei, L. H.
    He, K.
    Zhu, Y.
    ITESS: 2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES, PT 1, 2008, : 1115 - 1121
  • [5] An approach to Web page classification based on granules
    Duan, Qiguo
    Miao, Duoqian
    Wang, Ruizhi
    Chen, Min
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 279 - 282
  • [6] Feature optimization and hybrid classification for malicious web page detection
    Deng, Weiping
    Peng, Yan
    Yang, Fan
    Song, Jun
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (16):
  • [7] Malicious Web Page Detection Based on Feature Classification
    Phakoontod, Chanachai
    Limthanmaphon, Benchaphon
    2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 66 - 71
  • [8] A Web Page Classification Algorithm Based On Link Information
    Xu, Zhaohui
    Yan, Fuliang
    Qin, Jie
    Zhu, Haifeng
    2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 82 - 86
  • [9] A Tool for Link-Based Web Page Classification
    Hernandez, Inma
    Rivero, Carlos R.
    Ruiz, David
    Corchuelo, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7023 : 443 - 452
  • [10] Artificial Immune System Based Web Page Classification
    Onan, Aytug
    SOFTWARE ENGINEERING IN INTELLIGENT SYSTEMS (CSOC2015), VOL 3, 2015, 349 : 189 - 199