Detecting Visually Similar Web Pages: Application to Phishing Detection

被引:58
|
作者
Chen, Teh-Chung [1 ]
Dick, Scott [1 ]
Miller, James [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2M7, Canada
关键词
Security; Human Factors; Algorithmic complexity theory; Gestalt theory; Web page similarity; anti-phishing technologies; IMAGE; CLASSIFICATION; DISTANCE;
D O I
10.1145/1754393.1754394
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel approach for detecting visual similarity between two Web pages. The proposed approach applies Gestalt theory and considers a Web page as a single indivisible entity. The concept of supersignals, as a realization of Gestalt principles, supports our contention that Web pages must be treated as indivisible entities. We objectify, and directly compare, these indivisible supersignals using algorithmic complexity theory. We illustrate our approach by applying it to the problem of detecting phishing scams. Via a large-scale, real-world case study, we demonstrate that 1) our approach effectively detects similar Web pages; and 2) it accuractely distinguishes legitimate and phishing pages.
引用
收藏
页数:38
相关论文
共 50 条
  • [41] Anomaly based web phishing page detection
    Pan, Ying
    Ding, Xuhua
    22ND ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, PROCEEDINGS, 2006, : 381 - +
  • [42] Web Phishing Detection Based on Graph Mining
    Zou Futai
    Gang Yuxiang
    Pei Bei
    Pan Li
    Li Linsen
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1061 - 1066
  • [43] Detecting and partitioning data objects in complex Web pages
    Ye, SR
    Chua, TS
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 669 - 672
  • [44] Detecting Off-Topic Pages in Web Archives
    AlNoamany, Yasmin
    Weigle, Michele C.
    Nelson, Michael L.
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2015, 9316 : 225 - 237
  • [45] Detection of the Innovative Logotypes on the Web Pages
    Mironczuk, Marcin
    Perelkiewicz, Michal
    Protasiewicz, Jaroslaw
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 : 104 - 115
  • [46] Detection and Logging Changes in Web Pages
    Beglerovic, Vildana
    Pirija, Lejla
    Prazina, Irfan
    Okanovic, Vensada
    2022 21ST INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2022,
  • [47] Comparing clustering algorithms for the identification of similar pages in web applications
    De Lucia, Andrea
    Risi, Michele
    Scanniello, Giuseppe
    Tortora, Genoveffa
    WEB ENGINEERING, PROCEEDINGS, 2007, 4607 : 415 - +
  • [48] SPWalk: Similar Property Oriented Feature Learning for Phishing Detection
    Liu, Xiuwen
    Fu, Jianming
    IEEE ACCESS, 2020, 8 : 87031 - 87045
  • [49] Application of Feature Engineering for Phishing Detection
    Zhang, Wei
    Ren, Huan
    Jiang, Qingshan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (04): : 1062 - 1070
  • [50] Detecting Phishing Web sites: A Heuristic URL-Based Approach
    Luong Anh Tuan Nguyen
    Ba Lam To
    Huu Khuong Nguyen
    Minh Hoang Nguyen
    2013 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2013, : 597 - 602