Development of anti-phishing browser based on random forest and rule of extraction framework

被引:15
|
作者
Gowda, Mohith H. R. [1 ]
Adithya, M., V [2 ]
Prasad, Gunesh S. [3 ]
Vinay, S. [4 ]
机构
[1] PES Coll Engn, Comp Sci & Engn, 4011 Vasuda Krupa,3rd Cross, Mandya 571401, Karnataka, India
[2] PES Coll Engn, Comp Sci & Engn, 1932,1st Main Rd,Near Vinayaka Auto Stand, Mandya 571401, Karnataka, India
[3] PES Coll Engn, Comp Sci & Engn, 20-19,5th Cross,Near Shakthi Nagar Pk, Mysore 570019, Karnataka, India
[4] PES Coll Engn, Informat Sci & Engn, PES Engn Coll Rd,PES Coll Campus, Mandya 571401, Karnataka, India
关键词
Phishing attack; Machine learning; Intelligent browser engine; Rule of extraction algorithm; Browser architecture; EFFICIENT;
D O I
10.1186/s42400-020-00059-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Phishing is a technique under Social Engineering attacks which is most widely used to get user sensitive information, such as login credentials and credit and debit card information, etc. It is carried out by a person masquerading as an authentic individual. To protect web users from these attacks, various anti-phishing techniques are developed, but they fail to protect the user from these attacks in various ways. In this paper, we propose a novel technique to identify phishing websites effortlessly on the client side by proposing a novel browser architecture. In this system, we use the rule of extraction framework to extract the properties or features of a website using the URL only. This list consists of 30 different properties of a URL, which will later be used by the Random Forest Classification machine learning model to detect the authenticity of the website. A dataset consisting of 11,055 tuples is used to train the model. These processes are carried out on the client-side with the help of a redesigned browser architecture. Today Researches have come up with machine learning frameworks to detect phishing sites, but they are not in a state to be used by individuals having no technical knowledge. To make sure that these tools are accessible to every individual, we have improvised and introduced detection methods into the browser architecture named as 'Embedded Phishing Detection Browser' (EPDB), which is a novel method to preserve the existing user experience while improving the security. The newly designed browser architecture introduces a special segment to perform phishing detection operations in real-time. We have prototyped this technique to ensure maximum security, better accuracy of 99.36% in the identification of phishing websites in real-time.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Towards a contingency approach with whitelist- and blacklist-based anti-phishing applications: what do usability tests indicate?
    Li, Linfeng
    Berki, Eleni
    Helenius, Marko
    Ovaska, Saila
    BEHAVIOUR & INFORMATION TECHNOLOGY, 2014, 33 (11) : 1136 - 1147
  • [42] A Random Forest Classification Algorithm Based on Dichotomy Rule Fusion
    Xiao, Yueyue
    Huang, Wei
    Wang, Jinsong
    PROCEEDINGS OF 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2020), 2020, : 182 - 185
  • [43] Tips, Tricks, and Training: Supporting Anti-Phishing Awareness among Mid-Career Office Workers Based on Employees' Current Practices
    Tally, Anne C.
    Abbott, Jacob
    Bochner, Ashley
    Das, Sanchari
    Nippert-Eng, Christena
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
  • [44] Phishing Website Detection Based on Deep Convolutional Neural Network and Random Forest Ensemble Learning
    Yang, Rundong
    Zheng, Kangfeng
    Wu, Bin
    Wu, Chunhua
    Wang, Xiujuan
    SENSORS, 2021, 21 (24)
  • [45] Odinson: A Fast Rule-based Information Extraction Framework
    Valenzuela-Escarcega, Marco A.
    Hahn-Powell, Gus
    Bell, Dane
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2183 - 2191
  • [46] Random clustering forest for extended belief rule-based system
    Chen, Nan-Nan
    Gong, Xiao-Ting
    Wang, Ying-Ming
    Zhang, Chun-Yang
    Fu, Yang-Geng
    SOFT COMPUTING, 2021, 25 (06) : 4609 - 4619
  • [47] Patch Forest: A Hybrid Framework of Random Forest and Patch-based Segmentation
    Xie, Zhongliu
    Gillies, Duncan
    MEDICAL IMAGING 2016: IMAGE PROCESSING, 2016, 9784
  • [48] RULE EXTRACTION FROM RANDOM FOREST FOR INTRA-DAY TRADING USING CROBEX DATA
    Vlah Jeric, Silvija
    PROCEEDINGS OF FEB ZAGREB 11TH INTERNATIONAL ODYSSEY CONFERENCE ON ECONOMICS AND BUSINESS, 2020, 2 (01): : 411 - 419
  • [49] Research on Machine Learning Framework Based on Random Forest Algorithm
    Ren, Qiong
    Cheng, Hui
    Han, Hai
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS I, 2017, 1820
  • [50] A MULTIVARIATE RANDOM FOREST BASED FRAMEWORK FOR DRUG SENSITIVITY PREDICTION
    Wan, Qian
    Pal, Ranadip
    2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013), 2013, : 53 - 53