Classification of Web Site by Naive-Bayes and Convolutional Neural Networks

被引:0
|
作者
Liu, Xueyan [1 ]
Uda, Ryuya [1 ]
机构
[1] Tokyo Univ Technol, 1404-1 Katakuramachi, Hachioji, Tokyo, Japan
关键词
Web Site Structure; HyperText Markup Language; Comparative Analysis; Self-Organizing Maps; Naive-Bayes; Convolutional Neural Network; Classification;
D O I
10.1145/3164541.3164581
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An approach for automatic classification and evaluation method for the structures of public World Wide Web sites by Naive-Bayes and Two-layer Convolutional Neural Networks is proposed in this paper. The proposed method is also worthy for analyzing contents and hypertext structures for commercial, education and nonprofit organizations. The aim of this proposal is to be available to use Internet safely and conveniently for users who do not have expert knowledge. In this paper, we explain the method for creating and evaluating models, and define the most relevant attributes to this process. We also implemented the method as a system for classifying web sites. The introduced software tool supports the automated collection of parameters of web sites, and it assures the necessary critical mass of empirical data. With the pre-processed information, statistical clustering (SOM and K-Means), text classification (Naive-Bayes), and Two-layer Convolutional Neural Networks are evaluated in this paper.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Text Classification on Mahout with Naive-Bayes Machine Learning Algorithm
    Salur, Mehmet Umut
    Tokat, Sezai
    Aydilek, Ibrahim Berkan
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [2] 改进的Naive-Bayes方法
    张晓引
    岳丽华
    中国科学技术大学学报, 1999, (01) : 104 - 110
  • [3] Incremental discretization for Naive-Bayes classifier
    Lu, Jingli
    Yang, Ying
    Webb, Geoffrey I.
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 223 - 238
  • [4] When Naive Bayes Nearest Neighbors Meet Convolutional Neural Networks
    Kuzborskij, Ilja
    Carlucci, Fabio Maria
    Caputo, Barbara
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2100 - 2109
  • [5] Acceleration of Naive-Bayes Algorithm on Multicore Processor for Massive Text Classification
    Zhou, Lijun
    Yu, Zhiyi
    Lin, Jie
    Zhu, Shikai
    Shi, Weijing
    Zhou, Haijie
    Song, Kunpeng
    Zeng, Xiaoyang
    2014 14TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS (ISIC), 2014, : 344 - 347
  • [6] Convolutional Neural Networks for Web Documents Classification
    Artene, Codrut-Georgian
    Tibeica, Marius Nicolae
    Vecliuc, Dumitru Daniel
    Leon, Florin
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 289 - 302
  • [7] On why discretization works for naive-Bayes classifiers
    Yang, Y
    Webb, GI
    AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 440 - 452
  • [8] Combining Hiearachical Clustering and Naive-Bayes Nearest-Neighbor For Image Classification
    Fu, Chen
    Jia, Shijie
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE (LEMCS 2015), 2015, 117 : 31 - 34
  • [9] Optimized Naive-Bayes and Decision Tree Approaches for fMRI Smoking Cessation Classification
    Tahmassebi, Amirhessam
    Gandomi, Amir H.
    Schulte, Mieke H. J.
    Goudriaan, Anna E.
    Foo, Simon Y.
    Meyer-Baese, Anke
    COMPLEXITY, 2018,
  • [10] Latent Dirichlet conditional naive-Bayes models
    Banerjee, Arindam
    Shan, Hanhuai
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 421 - 426