Detecting Visually Similar Web Pages: Application to Phishing Detection

被引:58
|
作者
Chen, Teh-Chung [1 ]
Dick, Scott [1 ]
Miller, James [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2M7, Canada
关键词
Security; Human Factors; Algorithmic complexity theory; Gestalt theory; Web page similarity; anti-phishing technologies; IMAGE; CLASSIFICATION; DISTANCE;
D O I
10.1145/1754393.1754394
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel approach for detecting visual similarity between two Web pages. The proposed approach applies Gestalt theory and considers a Web page as a single indivisible entity. The concept of supersignals, as a realization of Gestalt principles, supports our contention that Web pages must be treated as indivisible entities. We objectify, and directly compare, these indivisible supersignals using algorithmic complexity theory. We illustrate our approach by applying it to the problem of detecting phishing scams. Via a large-scale, real-world case study, we demonstrate that 1) our approach effectively detects similar Web pages; and 2) it accuractely distinguishes legitimate and phishing pages.
引用
收藏
页数:38
相关论文
共 50 条
  • [31] WebDigest: Layout-preserving visually enhanced web pages
    Maeda, J
    Fukuda, K
    Takagi, H
    Asakawa, C
    2003 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2003, : 418 - 421
  • [32] WebGen system -: Visually impaired users create web pages
    Bartek, Ludek
    Plhak, Jaromir
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS, 2008, 5105 : 466 - 473
  • [33] An intrusion detection system for detecting phishing attacks
    Pamunuwa, Hasika
    Wijesekera, Duminda
    Farkas, Csilla
    SECURE DATA MANAGEMENT, PROCEEDINGS, 2007, 4721 : 181 - +
  • [34] Visually Summarizing Web Pages Through Internal and External Images
    Jiao, Binxing
    Yang, Linjun
    Xu, Jizheng
    Tian, Qi
    Wu, Feng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (06) : 1673 - 1683
  • [35] AN INVESTIGATION OF CLUSTERING ALGORITHMS IN THE IDENTIFICATION OF SIMILAR WEB PAGES
    De Lucia, Andrea
    Risi, Michele
    Scanniello, Giuseppe
    Tortora, Genoveffa
    JOURNAL OF WEB ENGINEERING, 2009, 8 (04): : 346 - 370
  • [36] Towards Detecting Phishing Web Contents for Secure Internet Surfing
    Sadi, Muhammad Sheikh
    Khan, Md. Mizanur Rahman
    Islam, Md. Merazul
    Srijon, Shuvradeb Barman
    Mia, Md. Mahmudul Haque
    2012 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2012, : 237 - 241
  • [37] A Model of Detecting Phishing Websites Based on PHA and Web Noise
    Cui, Jing-shi
    Wang, Zi-jian
    Wang, Bai-ling
    Wang, Wei
    Xin, Guo-dong
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, MACHINERY AND ENERGY ENGINEERING (MSMEE 2017), 2017, 123 : 1084 - 1090
  • [38] Learning to Detect Phishing Web Pages Using Lexical and String Complexity Analysis
    Patil, Dharmaraj
    Pattewar, Tareek
    Pardeshi, Shailendra
    Punjabi, Vipul
    Wagh, Rajnikant
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 10 (01)
  • [39] A survey and classification of web phishing detection schemes
    Varshney, Gaurav
    Misra, Manoj
    Atrey, Pradeep K.
    SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (18) : 6266 - 6284
  • [40] A hybrid approach for phishing web site detection
    Dadkhah, Mehdi
    Shamshirband, Shahaboddin
    Wahab, Ainuddin Wahid Abdul
    ELECTRONIC LIBRARY, 2016, 34 (06): : 927 - 944