Web page classification based on a simplified swarm optimization

被引:23
|
作者
Lee, Ji-Hyun [1 ]
Yeh, Wei-Chang [2 ]
Chuang, Mei-Chi [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Grad Sch Culture Technol, Taejon 305701, South Korea
[2] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Integrat & Collaborat Lab, Hsinchu 300, Taiwan
关键词
Web page classification; Simplified swarm optimization; Taguchi method; GENETIC ALGORITHM;
D O I
10.1016/j.amc.2015.07.120
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Owing to the incredible increase in the amount of information on the World Wide Web, there is a strong need for an efficient web page classification to retrieve useful information quickly. In this paper, we propose a novel simplified swarm optimization (SSO) to learn the best weights for every feature in the training dataset and adopt the best weights to classify the new web pages in the testing dataset. Moreover, the parameter settings play an important role in the update mechanism of the SSO so that we utilize a Taguchi method to determine the parameter settings. In order to demonstrate the effectiveness of the algorithm, we compare its performance with that of the well-known genetic algorithm (GA), Bayesian classifier, and K-nearest neighbor (KNN) classifiers according to four datasets. The experimental results indicate that the SSO yields better performance than the other three approaches. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:13 / 24
页数:12
相关论文
共 50 条
  • [41] Entity-Based Classification of Web Page in Search Engine
    Liu, Yicen
    Liu, Mingrong
    Xiang, Liang
    Yang, Qing
    Digital Libraries: Universal and Ubiquitous Access to Information, Proceedings, 2008, 5362 : 410 - 411
  • [42] Research on Web Page Classification Method Based on Query Log
    叶飞跃
    马祎星
    Journal of Shanghai Jiaotong University(Science), 2018, 23 (03) : 404 - 410
  • [43] Web Page Segmentation with Structured Prediction and its Application in Web Page Classification
    Bing, Lidong
    Guo, Rui
    Lam, Wai
    Niu, Zheng-Yu
    Wang, Haifeng
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 767 - 776
  • [44] Enhancing Web Page Classification Models
    Elsalmy, Fayrouz
    Ismail, Rasha
    AbdelMoez, Walid
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 742 - 750
  • [45] Mixture Models for Web Page Classification
    Bai JingHua
    Zhang XiaoXian
    Li ZhiXin
    Li XiaoPing
    INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 499 - 505
  • [46] Ensemble approach for web page classification
    Gupta, Amit
    Bhatia, Rajesh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 25219 - 25240
  • [47] Heterogeneous learner for web page classification
    Yu, HJ
    Chang, KCC
    Han, JW
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 538 - 545
  • [48] Towards Effective Web Page Classification
    Gu, Min
    Zhu, Feng
    Guo, Qing
    Gu, Yanhui
    Zhou, Junsheng
    Qu, Weiguang
    2016 INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC), 2016, : 126 - 127
  • [49] Ensemble approach for web page classification
    Amit Gupta
    Rajesh Bhatia
    Multimedia Tools and Applications, 2021, 80 : 25219 - 25240
  • [50] Web Page Classification with Social Annotations
    Zubiaga, Arkaitz
    Martinez, Raquel
    Fresno, Victor
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (43): : 225 - 233