Development of anti-phishing browser based on random forest and rule of extraction framework

被引：15

作者：

Gowda, Mohith H. R. ^{[1
]}

Adithya, M., V ^{[2
]}

Prasad, Gunesh S. ^{[3
]}

Vinay, S. ^{[4
]}

机构：

[1] PES Coll Engn, Comp Sci & Engn, 4011 Vasuda Krupa,3rd Cross, Mandya 571401, Karnataka, India

[2] PES Coll Engn, Comp Sci & Engn, 1932,1st Main Rd,Near Vinayaka Auto Stand, Mandya 571401, Karnataka, India

[3] PES Coll Engn, Comp Sci & Engn, 20-19,5th Cross,Near Shakthi Nagar Pk, Mysore 570019, Karnataka, India

[4] PES Coll Engn, Informat Sci & Engn, PES Engn Coll Rd,PES Coll Campus, Mandya 571401, Karnataka, India

来源：

CYBERSECURITY | 2020年 / 3卷 / 01期

关键词：

Phishing attack; Machine learning; Intelligent browser engine; Rule of extraction algorithm; Browser architecture; EFFICIENT;

D O I：

10.1186/s42400-020-00059-1

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Phishing is a technique under Social Engineering attacks which is most widely used to get user sensitive information, such as login credentials and credit and debit card information, etc. It is carried out by a person masquerading as an authentic individual. To protect web users from these attacks, various anti-phishing techniques are developed, but they fail to protect the user from these attacks in various ways. In this paper, we propose a novel technique to identify phishing websites effortlessly on the client side by proposing a novel browser architecture. In this system, we use the rule of extraction framework to extract the properties or features of a website using the URL only. This list consists of 30 different properties of a URL, which will later be used by the Random Forest Classification machine learning model to detect the authenticity of the website. A dataset consisting of 11,055 tuples is used to train the model. These processes are carried out on the client-side with the help of a redesigned browser architecture. Today Researches have come up with machine learning frameworks to detect phishing sites, but they are not in a state to be used by individuals having no technical knowledge. To make sure that these tools are accessible to every individual, we have improvised and introduced detection methods into the browser architecture named as 'Embedded Phishing Detection Browser' (EPDB), which is a novel method to preserve the existing user experience while improving the security. The newly designed browser architecture introduces a special segment to perform phishing detection operations in real-time. We have prototyped this technique to ensure maximum security, better accuracy of 99.36% in the identification of phishing websites in real-time.

引用

页数：14

共 50 条

[41] Towards a contingency approach with whitelist- and blacklist-based anti-phishing applications: what do usability tests indicate?
Li, Linfeng
Berki, Eleni
Helenius, Marko
Ovaska, Saila
BEHAVIOUR & INFORMATION TECHNOLOGY, 2014, 33 (11) : 1136 - 1147
[42] A Random Forest Classification Algorithm Based on Dichotomy Rule Fusion
Xiao, Yueyue
Huang, Wei
Wang, Jinsong
PROCEEDINGS OF 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2020), 2020, : 182 - 185
[43] Tips, Tricks, and Training: Supporting Anti-Phishing Awareness among Mid-Career Office Workers Based on Employees' Current Practices
Tally, Anne C.
Abbott, Jacob
Bochner, Ashley
Das, Sanchari
Nippert-Eng, Christena
PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
[44] Phishing Website Detection Based on Deep Convolutional Neural Network and Random Forest Ensemble Learning
Yang, Rundong
Zheng, Kangfeng
Wu, Bin
Wu, Chunhua
Wang, Xiujuan
SENSORS, 2021, 21 (24)
[45] Odinson: A Fast Rule-based Information Extraction Framework
Valenzuela-Escarcega, Marco A.
Hahn-Powell, Gus
Bell, Dane
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2183 - 2191
[46] Random clustering forest for extended belief rule-based system
Chen, Nan-Nan
Gong, Xiao-Ting
Wang, Ying-Ming
Zhang, Chun-Yang
Fu, Yang-Geng
SOFT COMPUTING, 2021, 25 (06) : 4609 - 4619
[47] Patch Forest: A Hybrid Framework of Random Forest and Patch-based Segmentation
Xie, Zhongliu
Gillies, Duncan
MEDICAL IMAGING 2016: IMAGE PROCESSING, 2016, 9784
[48] RULE EXTRACTION FROM RANDOM FOREST FOR INTRA-DAY TRADING USING CROBEX DATA
Vlah Jeric, Silvija
PROCEEDINGS OF FEB ZAGREB 11TH INTERNATIONAL ODYSSEY CONFERENCE ON ECONOMICS AND BUSINESS, 2020, 2 (01): : 411 - 419
[49] Research on Machine Learning Framework Based on Random Forest Algorithm
Ren, Qiong
Cheng, Hui
Han, Hai
ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS I, 2017, 1820
[50] A MULTIVARIATE RANDOM FOREST BASED FRAMEWORK FOR DRUG SENSITIVITY PREDICTION
Wan, Qian
Pal, Ranadip
2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013), 2013, : 53 - 53

← 1 2 3 4 5 →