Web page classification based on a simplified swarm optimization

被引:23
|
作者
Lee, Ji-Hyun [1 ]
Yeh, Wei-Chang [2 ]
Chuang, Mei-Chi [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Grad Sch Culture Technol, Taejon 305701, South Korea
[2] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Integrat & Collaborat Lab, Hsinchu 300, Taiwan
关键词
Web page classification; Simplified swarm optimization; Taguchi method; GENETIC ALGORITHM;
D O I
10.1016/j.amc.2015.07.120
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Owing to the incredible increase in the amount of information on the World Wide Web, there is a strong need for an efficient web page classification to retrieve useful information quickly. In this paper, we propose a novel simplified swarm optimization (SSO) to learn the best weights for every feature in the training dataset and adopt the best weights to classify the new web pages in the testing dataset. Moreover, the parameter settings play an important role in the update mechanism of the SSO so that we utilize a Taguchi method to determine the parameter settings. In order to demonstrate the effectiveness of the algorithm, we compare its performance with that of the well-known genetic algorithm (GA), Bayesian classifier, and K-nearest neighbor (KNN) classifiers according to four datasets. The experimental results indicate that the SSO yields better performance than the other three approaches. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:13 / 24
页数:12
相关论文
共 50 条
  • [31] Deep Learning Based Classification of Visual Behavior on Web Page
    Zhang, Meng-jie
    Lv, Sheng-fu
    Li, Mi
    INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING (ICEECE 2015), 2015, : 266 - 270
  • [32] Rough set based ensemble classifier for web page classification
    Saha, Suman
    Murthy, C. A.
    Pal, Sankar K.
    FUNDAMENTA INFORMATICAE, 2007, 76 (1-2) : 171 - 187
  • [33] Web Page Classification based on Context to the Content Extraction of Articles
    Patel, Ankit Dilip
    Pandya, Vimal N.
    2017 2ND INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2017, : 539 - 541
  • [34] Feature Selection for Data Classification in the Semiconductor Industry by a Hybrid of Simplified Swarm Optimization
    Yeh, Wei-Chang
    Chu, Chia-Li
    ELECTRONICS, 2024, 13 (12)
  • [35] Chinese Web Page Classification Based on Vector Space Model
    Wei, Li
    Zhang, Ling
    Li, Huamei
    Chen, Xiaozhou
    ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1801 - 1804
  • [36] A Method of Web Page Classification Based on Feature Dimension Reduction
    Ren, Xun-yi
    Zhang, Dan
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM 2016), 2016, : 252 - 256
  • [37] Research on Web Page Classification Method Based on Query Log
    Ye F.
    Ma Y.
    Journal of Shanghai Jiaotong University (Science), 2018, 23 (3) : 404 - 410
  • [38] Neural networks for web page classification based on augmented PCA
    Selamat, A
    Omatu, S
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1792 - 1797
  • [39] Web Page Classification based on Schema.org Collection
    Krutil, Jonas
    Kudelka, Milos
    Snasel, Vaclav
    2012 FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ASPECTS OF SOCIAL NETWORKS (CASON), 2012, : 356 - 360
  • [40] Knowledge Based Deep Inception Model for Web Page Classification
    Gupta, Amit
    Bhatia, Rajesh
    JOURNAL OF WEB ENGINEERING, 2021, 20 (07): : 2131 - 2167