Hierarchical Management of Large-Scale Malware Data

被引:0
|
作者
Kellogg, Lee [1 ]
Ruttenberg, Brian [1 ]
O'Connor, Alison [1 ]
Howard, Michael [1 ]
Pfeffer, Avi [1 ]
机构
[1] Charles River Analyt, 625 Mt Auburn St, Cambridge, MA 02138 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the pace of generation of new malware accelerates, clustering and classifying newly discovered malware requires new approaches to data management. We describe our Big Data approach to managing malware to support effective and efficient malware analysis on large and rapidly evolving sets of malware. The key element of our approach is a hierarchical organization of the malware, which organizes malware into families, maintains a rich description of the relationships between malware, and facilitates efficient online analysis of new malware as they are discovered. Using clustering evaluation metrics, we show that our system discovers malware families comparable to those produced by traditional hierarchical clustering algorithms, while scaling much better with the size of the data set. We also show the flexibility of our system as it relates to substituting various data representations, methods of comparing malware binaries, clustering algorithms, and other factors. Our approach will enable malware analysts and investigators to quickly understand and quantify changes in the global malware ecosystem.
引用
下载
收藏
页码:666 / 674
页数:9
相关论文
共 50 条
  • [21] Benchmarking large-scale data management for Internet of Things
    Hendawi, Abdeltawab
    Gupta, Jayant
    Liu, Jiayi
    Teredesai, Ankur
    Ramakrishnan, Naveen
    Shah, Mohak
    El-Sappagh, Shaker
    Kwak, Kyung-Sup
    Ali, Mohamed
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (12): : 8207 - 8230
  • [22] NetSearch: Googling Large-scale Network Management Data
    Qiu, Tongqing
    Ge, Zihui
    Pei, Dan
    Wang, Jia
    Xu, Jun
    2014 IFIP NETWORKING CONFERENCE, 2014,
  • [23] MANAGEMENT OF THE DATA ASSOCIATED WITH LARGE-SCALE SEQUENCING AND MAPPING
    FICKETT, JW
    CINKOSKY, MJ
    BURKS, C
    GOAD, WB
    MISHRA, SK
    TUNG, CS
    BIOPHYSICAL JOURNAL, 1987, 51 (02) : A440 - A440
  • [24] A hierarchical approach to removal of unwanted variation for large-scale metabolomics data
    Kim, Taiyun
    Tang, Owen
    Vernon, Stephen T.
    Kott, Katharine A.
    Koay, Yen Chin
    Park, John
    James, David E.
    Grieve, Stuart M.
    Speed, Terence P.
    Yang, Pengyi
    Figtree, Gemma A.
    O'Sullivan, John F.
    Yang, Jean Yee Hwa
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [25] A hierarchical approach to removal of unwanted variation for large-scale metabolomics data
    Taiyun Kim
    Owen Tang
    Stephen T. Vernon
    Katharine A. Kott
    Yen Chin Koay
    John Park
    David E. James
    Stuart M. Grieve
    Terence P. Speed
    Pengyi Yang
    Gemma A. Figtree
    John F. O’Sullivan
    Jean Yee Hwa Yang
    Nature Communications, 12
  • [26] Hierarchical control for large-scale systems
    School of Automation and Info. Eng., Xi'an University of Technology, Xi'an 710048, China
    Journal of Systems Engineering and Electronics, 2001, 12 (04) : 41 - 45
  • [27] SHIP: Scalable Hierarchical Power Control for Large-Scale Data Centers
    Wang, Xiaorui
    Chen, Ming
    Lefurgy, Charles
    Keller, Tom W.
    18TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2009, : 91 - +
  • [28] Hierarchical Classification for Large-Scale Learning
    Wang, Boshi
    Barbu, Adrian
    ELECTRONICS, 2023, 12 (22)
  • [29] Hierarchical Control for Large-Scale Systems
    钱富才
    李琦
    刘丁
    Journal of Systems Engineering and Electronics, 2001, (04) : 41 - 45
  • [30] Geographically distributed data management to support large-scale data analysis
    Emara, Tamer Z.
    Trinh, Thanh
    Huang, Joshua Zhexue
    SCIENTIFIC REPORTS, 2023, 13 (01)