Web Page Classification Using RNN

被引:15
|
作者
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
机构
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
关键词
web page classification; classification; categorization; deep learning; RNN; transfer learning;
D O I
10.1016/j.procs.2019.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification is an information retrieval application that provides useful information that can be a basis for many different application domains. In this study, a deep learning-based system has been developed for the classification of web pages. The meta tag information contained in the web page is used to classify a web page. The meta tags used are title, description and keywords. RNN based deep learning architecture was used during the tests. Transfer learning is the name given to the approach to building a machine learning model with the use of pre-trained parameters to solve a problem. The effect of using transfer learning on the system has also been examined. According to the results obtained, success rate of web page classification system is approximately 85%. It is not observed that transfer learning has significant contribution to the success rates. However, the use of transfer learning has reduced the consumed system resources. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:62 / 72
页数:11
相关论文
共 50 条
  • [1] Web Page Classification Using Firefly Optimization
    Sarac, Esra
    Ozel, Selma Ayse
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [2] Dynamic Web Page Generation using Classification of Web Link Information
    Mun, Yilhyeong
    Cho, Dongsub
    PROCEEDINGS OF 2008 INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTING AND COMPUTATIONAL SCIENCES: ADVANCES IN APPLIED COMPUTING AND COMPUTATIONAL SCIENCES, 2008, : 96 - 101
  • [3] Automatic Web Page Classification Using Various Features
    Wen, Hao
    Fang, Liping
    Guan, Ling
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 368 - +
  • [4] Web Page Classification Using Image Analysis Features
    de Boer, Viktor
    van Someren, Maarten W.
    Lupascu, Tiberiu
    WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2011, 75 : 272 - +
  • [5] Web Page Classification Using WSD and YAGO and Ontology
    Modi, Sangita S.
    Jagtap, Sudhir B.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), 2018, : 887 - 891
  • [6] Web page downloading and classification
    Tran, LQ
    Moon, CW
    Le, DX
    Thoma, GR
    FOURTEENTH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2001, : 321 - 326
  • [7] Web Page Genre Classification
    Chen, Guangyu
    Choi, Ben
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 2353 - 2357
  • [8] Automatic Web Page Classification
    Materna, Jiri
    RASLAN 2008: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING: SECOND WORKSHOP, 2008, : 84 - 93
  • [9] Web page genre classification
    Computer Science, Louisiana Tech University, LA 71272, United States
    Proc ACM Symp Appl Computing, (2353-2357):
  • [10] On Chinese web page classification
    Liang, JZ
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 634 - 639