A structural and content-based analysis for Web filtering

Cited by: 20
Authors
Lee, PY [1]
Hui, SC
Fong, ACM
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 2263, Singapore
[2] Massey Univ, Inst Informat & Math Sci, Auckland, New Zealand
Keywords
Web sites; filters; classification; neural networks; content analysis
DOI
10.1108/10662240310458350
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
With the proliferation of objectionable materials (e.g. pornography, violence, drugs, etc.) available on the WWW, there is an urgent need for effective countermeasures to protect children and other unsuspecting users from exposure to such materials. Using pornographic Web pages as a case study, this paper presents a thorough analysis of the distinguishing features of such Web pages. The objective of the study is to gain knowledge on the structure and characteristics of typical pornographic Web pages so that effective Web filtering techniques can be developed to filter them automatically. In this paper, we first survey the existing techniques for Web content filtering. A study on the characteristics of pornographic Web pages is then presented. The implementation of a Web content filtering system that combines the use of an artificial neural network and the knowledge gained in the analysis of pornographic Web pages is also given.
Pages: 27-37
Number of pages: 11
Related papers
50 in total
  • [1] Using Visual Content-based Analysis with Textual and Structural Analysis for Improving Web Filtering
    Hammami, Mohamed
    Chen, Liming
    Chahir, Youssef
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2005, 1 (04) : 241 - 254
  • [2] WebGuard: A Web filtering engine combining textual, structural, and visual content-based analysis
    Hammami, M
    Chahir, Y
    Chen, LM
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (02) : 272 - 284
  • [3] WebAngels filter: A violent Web filtering engine using textual and structural content-based analysis
    Guermazi, Radhouane
    Hammami, Mohamed
    Hamadou, Abdelmajid Ben
    [J]. ADVANCES IN DATA MINING, PROCEEDINGS: MEDICAL APPLICATIONS, E-COMMERCE, MARKETING, AND THEORETICAL ASPECTS, 2008, 5077 : 268 - +
  • [4] Content-based text classifiers for pornographic web filtering
    Polpinij, Jantima
    Chotthanom, Anirut
    Sibunruang, Chumsak
    Chamchong, Rapeepom
    Puangpronpitag, Somnuk
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 1481 - +
  • [5] Content-based filtering of Web documents: the MaX system and the EUFORBIA project
    Elisa Bertino
    Elena Ferrari
    Andrea Perego
    [J]. International Journal of Information Security, 2003, 2 (1) : 45 - 58
  • [6] Content-Based Spam Filtering
    Almeida, Tiago A.
    Yamakami, Akebo
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [8] Content-Based Security for the Web
    Afanasyev, Alexander
    Halderman, J. Alex
    Ruoti, Scott
    Seamons, Kent
    Yu, Yingdi
    Zappala, Daniel
    Zhang, Lixia
    [J]. PROCEEDINGS OF THE 2016 NEW SECURITY PARADIGMS WORKSHOP (NSPW'16), 2016, : 49 - 60
  • [9] Combining Collaborative Filtering and Semantic Content-based Approaches to Recommend Web Services
    Lecue, Freddy
    [J]. 2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 200 - 205
  • [10] WCM: A web content-based method of stakeholder analysis
    Raum, Susanne
    Rawlings-Sanaei, Felicity
    [J]. METHODSX, 2022, 9