A Web Page Classification Method Based on TCP/IP Header Features

被引:0
|
作者
Huang, Di [1 ]
Zhang, Xin-Yi [1 ]
Tang, Qi-Wei [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
web page classification; packet header feature; instance-based learning;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification has wide applications. Due to various types of web pages and vast amounts of network traffic, it is difficult to classify web pages by deeply inspecting the content of each packet. This paper presents a learning-based classification method according to TCP/IP header features. First, we propose an approach to select features and improve the Relief algorithm, which can pick features with robustness. Then we raise a labeling strategy to assign each feature with a label when training the classifier. Last, we put forward a learning-based classification method which takes labels and multi-layer semantics into consideration. The experiment results show that the proposed strategy can improve the processing speed and the accuracy of classification.
引用
收藏
页码:61 / 64
页数:4
相关论文
共 50 条
  • [1] Web Page Element Classification Based on Visual Features
    Burget, Radek
    Rudolfova, Ivana
    [J]. 2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009, : 67 - 72
  • [2] Web Page Classification: Features and Algorithms
    Qi, Xiaoguang
    Davison, Brian D.
    [J]. ACM COMPUTING SURVEYS, 2009, 41 (02)
  • [3] TCP/IP Header Classification for Detecting Spoofed DDoS Attack in Cloud Environment
    Osanaiye, Opeyemi. A.
    Dlodlo, Mqhele
    [J]. IEEE EUROCON 2015 - INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL (EUROCON), 2015, : 219 - 224
  • [4] Web Page Classification Method Based on Semantics and Structure
    Li, Huaxin
    Zhang, Zhaoxin
    Xu, Yongdong
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 238 - 243
  • [5] Web page classification based on heterogeneous features and a combination of multiple classifiers
    Deng, Li
    Du, Xin
    Shen, Ji-zhong
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (07) : 995 - 1004
  • [6] Web page classification based on heterogeneous features and a combination of multiple classifiers
    Li Deng
    Xin Du
    Ji-zhong Shen
    [J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 995 - 1004
  • [7] Research on Web Page Classification Method Based on Query Log
    Ye F.
    Ma Y.
    [J]. Journal of Shanghai Jiaotong University (Science), 2018, 23 (3) : 404 - 410
  • [8] A Method of Web Page Classification Based on Feature Dimension Reduction
    Ren, Xun-yi
    Zhang, Dan
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM 2016), 2016, : 252 - 256
  • [9] Research on Web Page Classification Method Based on Query Log
    叶飞跃
    马祎星
    [J]. Journal of Shanghai Jiaotong University(Science), 2018, 23 (03) : 404 - 410
  • [10] A Novel TCP/IP Header Hijacking Attack on SDN
    Mohammadi, Ali Akbar
    Hussain, Rasheed
    Oracevic, Alma
    Kazmi, Syed Muhammad Ahsan Raza
    Hussain, Fatima
    Aloqaily, Moayad
    Son, Junggab
    [J]. INFOCOM WKSHPS 2022 - IEEE Conference on Computer Communications Workshops, 2022,