Detecting Malicious Exploit Kits using Tree-based Similarity Searches

被引:20
|
作者
Taylor, Teryl [1 ]
Hu, Xin [2 ]
Wang, Ting [3 ]
Jang, Jiyong [2 ]
Stoecklin, Marc Ph. [2 ]
Monrose, Fabian [1 ]
Sailer, Reiner [2 ]
机构
[1] Univ North Carolina Chapel Hill, Chapel Hill, NC 27514 USA
[2] IBM TJ Watson Res Ctr, Armonk, NY 10504 USA
[3] Lehigh Univ, Bethlehem, PA USA
关键词
D O I
10.1145/2857705.2857718
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unfortunately, the computers we use for everyday activities can be infiltrated while simply browsing innocuous sites that, unbeknownst to the website owner, may be laden with malicious advertisements. So-called malvertising, redirects browsers to web-based exploit kits that are designed to find vulnerabilities in the browser and subsequently download malicious payloads. We propose a new approach for detecting such malfeasance by leveraging the inherent structural patterns in HTTP traffic to classify exploit kit instances. Our key insight is that an exploit kit leads the browser to download payloads using multiple requests from malicious servers. We capture these interactions in a "tree-like" form, and using a scalable index of malware samples, model the detection process as a sub-tree similarity search problem. The approach is evaluated on 3800 hours of real-world traffic including over 4 billion flows and reduces false positive rates by four orders of magnitude over current state-of-the-art techniques with comparable true positive rates. We show that our approach can operate in near real-time, and is able to handle peak traffic levels on a large enterprise network-identifying 28 new exploit kit instances during our analysis period.
引用
收藏
页码:255 / 266
页数:12
相关论文
共 50 条
  • [1] Application of tree-based searches to Matching Pursuit
    Cotter, SF
    Rao, BD
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 3933 - 3936
  • [2] Application of tree-based searches to Matching Pursuit
    Cotter, SF
    Rao, B
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4045 - 4045
  • [3] Tree-based trajectory planning to exploit atmospheric energy
    Langelaan, Jack W.
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 2328 - 2333
  • [4] On Tree-Based Methods for Similarity Learning
    Clemencon, Stephan
    Vogel, Robin
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, 2019, 11943 : 676 - 688
  • [5] Detecting distant homologs using phylogenetic tree-based HMMs
    Qian, B
    Goldstein, RA
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 52 (03): : 446 - 453
  • [6] LSTM RNN: detecting exploit kits using redirection chain sequences
    Jonah Burgess
    Philip O’Kane
    Sakir Sezer
    Domhnall Carlin
    Cybersecurity, 4
  • [7] LSTM RNN: detecting exploit kits using redirection chain sequences
    Burgess, Jonah
    O'Kane, Philip
    Sezer, Sakir
    Carlin, Domhnall
    CYBERSECURITY, 2021, 4 (01)
  • [8] Maintaining and Analyzing Production Process Definitions Using a Tree-Based Similarity Measure
    Stumptner, Reinhard
    Lettner, Christian
    Freudenthaler, Bernhard
    Pichler, Josef
    Kirchmayr, Wilhelm
    Draxler, Ewald
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2015, 2015, 9343 : 366 - 380
  • [9] A tree-based similarity for evaluating concept proximities in an ontology
    Blanchard, Emmanuel
    Kuntz, Pascale
    Harzallah, Mounira
    Briand, Henri
    DATA SCIENCE AND CLASSIFICATION, 2006, : 3 - +
  • [10] Impact of the Initialization in Tree-Based Fast Similarity Search Techniques
    Serrano, Aureo
    Mico, Luisa
    Oncina, Jose
    SIMILARITY-BASED PATTERN RECOGNITION, 2011, 7005 : 163 - 176