Study on Web-page classification algorithm based on rough set theory

被引:0
|
作者
Yin, Shiqun [1 ]
Wang, Fang [1 ]
Xie, Zhong [1 ]
Qiu, Yuhui [1 ]
机构
[1] Southwest Univ, Fac Comp & Informat Sci, Chongqing 400715, Peoples R China
关键词
rough set; classification rule; feature selection; Web-page; vector space model;
D O I
10.1109/ISIP.2008.118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The large number of Web-page documents is comprise high dimensional huge text database with the development of Internet technology. But it is only a very small portion with the relevant users. The Web-page should be assigned to a category structure through the Web-page classification technology. it is not only convenient for customers to browse Web-page, but also easier to make Web-page seek through restriction search scope. Mining in high dimensional data is extraordinarily difficult because of the curse of dimensionality. We must adopt feature select to solve these problems. A algorithm is given in this paper to reduce the Web-page feature term and extract classification rule at last used attribute reduction on rough set theory. Experimental results show that this method has been greatly reduced feature vector space dimension and gotten easy-to-understand classification rules, and its accuracy is higher and the speed of classification is faster than based on the classification of vector comparison.
引用
收藏
页码:202 / 206
页数:5
相关论文
共 50 条
  • [21] A web page classification algorithm based on feature selection
    Zhou, Hongfang
    Guo, Jie
    Wang, Xinyi
    Duan, Wencong
    Wang, Peng
    Cao, Wenquan
    [J]. Journal of Information and Computational Science, 2015, 12 (04): : 1549 - 1556
  • [22] A Comparative Study to Enhance the Performance of Web-Page Recommendation System
    Naseer, Mehwish
    Zhang, Wu
    Zhu, Wenhao
    [J]. 2019 22ND IEEE INTERNATIONAL MULTI TOPIC CONFERENCE (INMIC), 2019, : 114 - 119
  • [23] A new classification algorithm based on rough set and entropy
    Yang, J
    Wang, H
    Hu, WG
    Hu, ZH
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 364 - 367
  • [24] Rough set based hybrid algorithm for text classification
    Miao, Duoqian
    Duan, Qiguo
    Zhang, Hongyu
    Jiao, Na
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (05) : 9168 - 9174
  • [25] Improved CBA Classification Algorithm Based on Rough Set
    Tan, Zheng
    Wang, Hanhu
    Chen, Mei
    Zhang, Xiaoping
    [J]. NDT: 2009 FIRST INTERNATIONAL CONFERENCE ON NETWORKED DIGITAL TECHNOLOGIES, 2009, : 43 - +
  • [26] Classification of Volcanic Rocks based on Rough Set Theory
    Shaaban, Shaaban M.
    Tawfik, Sameh Z.
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (02) : 5501 - 5504
  • [27] An email classification model based on rough set theory
    Zhao, WQ
    Zhang, ZL
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ACTIVE MEDIA TECHNOLOGY (AMT 2005), 2005, : 403 - 408
  • [28] Prediction of Customer Classification Based on Rough Set Theory
    Li, Ju
    Wang, Xing
    Xu, Shan
    [J]. 2010 SYMPOSIUM ON SECURITY DETECTION AND INFORMATION PROCESSING, 2010, 7 : 366 - 370
  • [29] Classification of Cyber Attacks based on Rough Set Theory
    Amin, Adnan
    Anwar, Sajid
    Adnan, Awais
    Khan, Muhammad Aamir
    Iqbal, Zafar
    [J]. 2015 FIRST INTERNATIONAL CONFERENCE ON ANTI-CYBERCRIME (ICACC), 2015, : 141 - 146
  • [30] Rough set based classification of real world Web services
    Hala S. Own
    Hamdi Yahyaoui
    [J]. Information Systems Frontiers, 2015, 17 : 1301 - 1311