Web Page Classification Using RNN

被引:15
|
作者
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
机构
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
关键词
web page classification; classification; categorization; deep learning; RNN; transfer learning;
D O I
10.1016/j.procs.2019.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification is an information retrieval application that provides useful information that can be a basis for many different application domains. In this study, a deep learning-based system has been developed for the classification of web pages. The meta tag information contained in the web page is used to classify a web page. The meta tags used are title, description and keywords. RNN based deep learning architecture was used during the tests. Transfer learning is the name given to the approach to building a machine learning model with the use of pre-trained parameters to solve a problem. The effect of using transfer learning on the system has also been examined. According to the results obtained, success rate of web page classification system is approximately 85%. It is not observed that transfer learning has significant contribution to the success rates. However, the use of transfer learning has reduced the consumed system resources. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:62 / 72
页数:11
相关论文
共 50 条
  • [41] Chinese web-page classification study
    Huang, Weitong
    Lu-Xiong Xu
    Duan, Junfeng
    Lu, Yuchang
    2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 2141 - +
  • [42] A Chinese Web Page Automatic Classification System
    Huang, Rongyou
    Zhao, Xinjian
    WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 61 - +
  • [43] Improvement of Feature Extraction in Web Page Classification
    Jiao Lijuan
    Feng Liping
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 363 - 365
  • [44] An approach to Web page classification based on granules
    Duan, Qiguo
    Miao, Duoqian
    Wang, Ruizhi
    Chen, Min
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 279 - 282
  • [45] Data Mining Techniques for Web Page Classification
    Fiol-Roig, Gabriel
    Miro-Julia, Margaret
    Herraiz, Eduardo
    HIGHLIGHTS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2011, 89 : 61 - 68
  • [46] A hybrid neural network for web page classification
    Cao, YK
    Li, YF
    Yu, ZZ
    DIGITAL LIBRARIES: INTERNATIONAL COLLABORATION AND CROSS-FERTILIZATION, PROCEEDINGS, 2004, 3334 : 641 - 641
  • [47] Study on WEB Page Fusion Classification Model
    Zhang, Xiao-dan
    Zhu, Li-jun
    SMART MATERIALS AND INTELLIGENT SYSTEMS, PTS 1 AND 2, 2011, 143-144 : 944 - +
  • [48] Automatic classification of academic web page types
    Kenekayoro, Patrick
    Buckley, Kevan
    Thelwall, Mike
    SCIENTOMETRICS, 2014, 101 (02) : 1015 - 1026
  • [49] Incremental document clustering for web page classification
    Wong, WC
    Fu, AWC
    ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 101 - 110
  • [50] Large-Scale Web Page Classification
    Marath, Sathi T.
    Shepherd, Michael
    Milios, Evangelos
    Duffy, Jack
    2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 1813 - 1822