Web Page Classification Using RNN

被引:15
|
作者
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
机构
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
关键词
web page classification; classification; categorization; deep learning; RNN; transfer learning;
D O I
10.1016/j.procs.2019.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification is an information retrieval application that provides useful information that can be a basis for many different application domains. In this study, a deep learning-based system has been developed for the classification of web pages. The meta tag information contained in the web page is used to classify a web page. The meta tags used are title, description and keywords. RNN based deep learning architecture was used during the tests. Transfer learning is the name given to the approach to building a machine learning model with the use of pre-trained parameters to solve a problem. The effect of using transfer learning on the system has also been examined. According to the results obtained, success rate of web page classification system is approximately 85%. It is not observed that transfer learning has significant contribution to the success rates. However, the use of transfer learning has reduced the consumed system resources. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:62 / 72
页数:11
相关论文
共 50 条
  • [31] Using Machine Learning for Web Page Classification in Search Engine Optimization
    Matosevic, Goran
    Dobsa, Jasminka
    Mladenic, Dunja
    FUTURE INTERNET, 2021, 13 (01): : 1 - 20
  • [32] News Web Page Classification Using Url Content and Structure Attributes
    Dwivedi, Sanjay K.
    Arya, Chandrakala
    PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 317 - 322
  • [33] Web page classification using an ensemble of support vector machine classifiers
    Zhong S.
    Zou D.
    Journal of Networks, 2011, 6 (11) : 1625 - 1630
  • [34] Classification of web link information and implementation of dynamic web page using Link Map System
    Mun, Yilhyeong
    Lee, Minkyung
    Cho, Dongsub
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 506 - 511
  • [35] An improved SVM web page classification algorithm
    Ren, Xun-yi
    Shi, Chen
    Zhang, Dan
    Wang, Wen-si
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [36] Ant Colony Algorithm for Web Page Classification
    Moayed, Majid Javid
    Sabery, A. Hamid
    Khanteymoory, Alireza
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 1598 - +
  • [37] A novel approach for effective web page classification
    Mangai, J. Alamelu
    Kumar, V. Santhosh
    Balamurugan, S. Appavu
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2013, 5 (03) : 233 - 245
  • [38] Automatic classification of academic web page types
    Patrick Kenekayoro
    Kevan Buckley
    Mike Thelwall
    Scientometrics, 2014, 101 : 1015 - 1026
  • [39] Web Page Classification Based on Social Annotations
    Shen, J.
    Xu, F. Y.
    Bi, L.
    Wei, L. H.
    He, K.
    Zhu, Y.
    ITESS: 2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES, PT 1, 2008, : 1115 - 1121
  • [40] Web page classification: A soft computing approach
    Ribeiro, A
    Fresno, V
    Garcia-Alegre, MC
    Guinea, D
    ADVANCES IN WEB INTELLIGENCE, 2003, 2663 : 103 - 112