A graph-theoretic approach for the detection of phishing webpages

被引:30
|
作者
Tan, Choon Lin [1 ]
Chiew, Kang Leng [1 ]
Yong, Kelvin S. C. [2 ]
Sze, San Nah [1 ]
Abdullah, Johari [1 ]
Sebastian, Yakub [3 ]
机构
[1] Univ Malaysia Sarawak, Fac Comp Sci & Informat Technol, Kota Samarahan 94300, Sarawak, Malaysia
[2] Swinburne Univ Technol, Fac Engn Comp & Sci, Sarawak Campus,Jalan Simpang Tiga, Sarawak 93350, Malaysia
[3] Charles Darwin Univ, Coll Engn IT & Environm, Ellengowan Dr, Casuarina, NT 0810, Australia
关键词
Phishing detection; Hyperlinks; Web graph; Graph features; Machine learning; FEATURE-SELECTION; WEBSITES;
D O I
10.1016/j.cose.2020.101793
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the years, various technical means have been developed to protect Internet users from phishing attacks. To enrich the anti-phishing efforts, we capitalise on concepts from graph theories, and propose a set of novel graph features to improve the phishing detection accuracy. The initial phase of the proposed technique involved the extraction of hyperlinks in the webpage under scrutiny and fetching the corresponding neighbourhood webpages. During this process, the page linking data were collected, and used to construct a web graph which models the overall hyperlink and network structure of the webpage. From the web graph, graph measures were computed and extracted as graph features to derive a classifier for detecting phishing webpages. Experimental results show that the proposed graph features achieve an improved overall accuracy of 97.8% when C4.5 was utilised as classifier, outperforming the existing conventional features derived from the same data samples. Unlike conventional features, the proposed graph features leverage inherent phishing patterns that are only visible at a higher level of abstraction, thus making it robust and difficult to be evaded by direct manipulations on the webpage contents. Our proposed graph-based technique also shows promising results when benchmarked against a prominent phishing detection technique. Hence, the proposed technique is an important contribution to the existing anti-phishing research towards improving the detection performance. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A graph-theoretic approach for inparalog detection
    Tremblay-Savard, Olivier
    Swenson, Krister M.
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [2] A graph-theoretic approach for inparalog detection
    Olivier Tremblay-Savard
    Krister M Swenson
    [J]. BMC Bioinformatics, 13
  • [3] Corruption and its detection: a graph-theoretic approach
    Mukwembi, Thebeth Rufaro
    Mukwembi, Simon
    [J]. COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2017, 23 (02) : 293 - 300
  • [4] Corruption and its detection: a graph-theoretic approach
    Thebeth Rufaro Mukwembi
    Simon Mukwembi
    [J]. Computational and Mathematical Organization Theory, 2017, 23 : 293 - 300
  • [5] A graph-theoretic approach to steganography
    Hetzl, S
    Mutzel, P
    [J]. COMMUNICATIONS AND MULTIMEDIA SECURITY, 2005, 3677 : 119 - 128
  • [6] A graph-theoretic approach to multitasking
    Alon, Noga
    Reichman, Daniel
    Shinkar, Igor
    Wagner, Tal
    Musslick, Sebastian
    Cohen, Jonathan D.
    Griffiths, Thomas L.
    Dey, Biswadip
    Ozcimder, Kayhan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [7] Graph-theoretic approach to Bell experiments with low detection efficiency
    Xu, Zhen-Peng
    Steinberg, Jonathan
    Singh, Jaskaran
    Lopez-Tarrida, Antonio J.
    Portillo, Jose R.
    Cabello, Adan
    [J]. QUANTUM, 2023, 7
  • [8] GRAPH-THEORETIC APPROACH TO METABOLIC PATHWAYS
    GOLDSTEIN, BN
    SELIVANOV, VA
    [J]. BIOMEDICA BIOCHIMICA ACTA, 1990, 49 (8-9) : 645 - 650
  • [9] MULTIVARIABLE CONTROL A GRAPH-THEORETIC APPROACH
    REINSCHKE, KJ
    [J]. LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1988, 108 : 1 - +
  • [10] Graph-Theoretic Approach to Quantum Correlations
    Cabello, Adan
    Severini, Simone
    Winter, Andreas
    [J]. PHYSICAL REVIEW LETTERS, 2014, 112 (04)