EmailProfiler: Spearphishing Filtering with Header and Stylometric Features of Emails

被引:40
|
作者
Duman, Sevtap [1 ]
Cakmakci, Kubra Kalkan [2 ]
Egele, Manuel [3 ]
Robertson, William [1 ]
Kirda, Engin [1 ]
机构
[1] Northeastern Univ, Coll Comp & Informat Sci, Boston, MA 02115 USA
[2] Bogazici Univ, Dept Engn, Istanbul, Turkey
[3] Boston Univ, Elect & Comp Engn, Boston, MA 02215 USA
关键词
EMOTION-RECOGNITION; COMPLEX EMOTIONS; CHILDREN; AUTISM; MIND;
D O I
10.1109/COMPSAC.2016.105
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Spearphishing is a prominent targeted attack vector in today's Internet. By impersonating trusted email senders through carefully crafted messages and spoofed metadata, adversaries can trick victims into launching attachments containing malicious code or into clicking on malicious links that grant attackers a foothold into otherwise well-protected networks. Spearphishing is effective because it is fundamentally difficult for users to distinguish legitimate emails from spearphishing emails without additional defensive mechanisms. However, such mechanisms, such as cryptographic signatures, have found limited use in practice due to their perceived difficulty of use for normal users. In this paper, we present a novel automated approach to defending users against spearphishing attacks. The approach first builds probabilistic models of both email metadata and stylometric features of email content. Then, subsequent emails are compared to these models to detect characteristic indicators of spearphishing attacks. Several instantiations of this approach are possible, including performing model learning and evaluation solely on the receiving side, or senders publishing models that can be checked remotely by the receiver. Our evaluation of a real data set drawn from 20 email users demonstrates that the approach effectively discriminates spearphishing attacks from legitimate email while providing significant ease-of-use benefits over traditional defenses.
引用
收藏
页码:408 / 416
页数:9
相关论文
共 50 条
  • [21] Role of Machine Learning in Authorship Attribution with Select Stylometric Features
    Gupta, Sumit
    Patra, Tapas Kumar
    Chaudhuri, Chitrita
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 920 - 932
  • [22] A small set of stylometric features differentiates Latin prose and verse
    Chaudhuri, Pramit
    Dasgupta, Tathagata
    Dexter, Joseph P.
    Iyer, Krithika
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2019, 34 (04) : 716 - 729
  • [23] Categorizing Emails Using Machine Learning with Textual Features
    Zhang, Haoran
    Rangrej, Jagadish
    Rais, Saad
    Hillmer, Michael
    Rudzicz, Frank
    Malikov, Kamil
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 3 - 15
  • [24] Intrinsic Plagiarism Detection System Using Stylometric Features and DBSCAN
    Saini, Anu
    Sri, Manepalli Ratna
    Thakur, Mansi
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 13 - 18
  • [25] Determining Window Size from Plagiarism Corpus for Stylometric Features
    Suchomel, Simon
    Brandejs, Michal
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 : 293 - 299
  • [26] Investigating Iranian University Students' Emails for Pragmatic Features
    Motallebzadeh, Khalil
    Mohsenzadeh, Hoda
    Sobhani, Atefe
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ELT, 2014, 98 : 1263 - 1272
  • [27] Effect of PE File Header Features on Accuracy
    Al-Khshali, Hasan H.
    Ilyas, Muhammad
    Ucan, Osman N.
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1115 - 1120
  • [28] Classifying Spam Emails using Text and Readability Features
    Shams, Rushdi
    Mercer, Robert E.
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 657 - 666
  • [29] A method to Measure the Efficiency of Phishing Emails Detection Features
    Al-Daeef, Melad Mohamed
    Basir, Nurlida
    Saudi, Madihah Mohd
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA), 2014,
  • [30] Authorship Attribution on Bengali Literature using Stylometric Features and Neural Network
    Islam, Md. Ashikul
    Kabir, Md. Minhazul
    Islam, Md. Saiful
    Tasnim, Ayesha
    2018 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT), 2018, : 360 - 363