Malware Family Discovery Using Reversible Jump MCMC Sampling of Regimes

被引:2
|
作者
Bolton, Alexander D. [1 ]
Heard, Nicholas A. [1 ,2 ]
机构
[1] Imperial Coll London, Dept Math, South Kensington Campus, London SW7 2AZ, England
[2] Univ Bristol, Heilbronn Inst Math Res, Bristol, Avon, England
基金
英国工程与自然科学研究理事会;
关键词
Change point analysis; Dynamic instruction trace; Regime-switching; Reversible jump Markov chain Monte Carlo; CHAIN MONTE-CARLO; MARKOV-CHAIN; MODEL;
D O I
10.1080/01621459.2018.1423984
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Malware is computer software that has either been designed or modified with malicious intent. Hundreds of thousands of new malware threats appear on the internet each day. This is made possible through reuse of known exploits in computer systems that have not been fully eradicated; existing pieces of malware can be trivially modified and combined to create new malware, which is unknown to anti-virus programs. Finding new software with similarities to known malware is therefore an important goal in cyber-security. A dynamic instruction trace of a piece of software is the sequence of machine language instructions it generates when executed. Statistical analysis of a dynamic instruction trace can help reverse engineers infer the purpose and origin of the software that generated it. Instruction traces have been successfully modeled as simple Markov chains, but empirically there are change points in the structure of the traces, with recurring regimes of transition patterns. Here, reversible jump Markov chain Monte Carlo for change point detection is extended to incorporate regime-switching, allowing regimes to be inferred from malware instruction traces. A similarity measure for malware programs based on regime matching is then used to infer the originating families, leading to compelling performance results.
引用
收藏
页码:1490 / 1502
页数:13
相关论文
共 50 条
  • [1] Annealed Importance Sampling Reversible Jump MCMC Algorithms
    Karagiannis, Georgios
    Andrieu, Christophe
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2013, 22 (03) : 623 - 648
  • [2] Segmentation of color images via reversible jump MCMC sampling
    Kato, Zoltan
    [J]. IMAGE AND VISION COMPUTING, 2008, 26 (03) : 361 - 371
  • [3] Changepoint detection using reversible jump MCMC methods
    Suparman, S
    Doisy, M
    Tourneret, JY
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1569 - 1572
  • [4] Generic reversible jump MCMC using graphical models
    David J. Lunn
    Nicky Best
    John C. Whittaker
    [J]. Statistics and Computing, 2009, 19
  • [5] Generic reversible jump MCMC using graphical models
    Lunn, David J.
    Best, Nicky
    Whittaker, John C.
    [J]. STATISTICS AND COMPUTING, 2009, 19 (04) : 395 - 408
  • [6] Bayesian analysis of the patterns of biological susceptibility via reversible jump MCMC sampling
    Liu, Rui-Yin
    Tao, Jian
    Shi, Ning-Zhong
    He, Xuming
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (03) : 1498 - 1508
  • [7] A bayesian approach to map QTLs using reversible jump MCMC
    da Silva, Joseane Padilha
    Leandro, Roseli Aparecida
    [J]. CIENCIA E AGROTECNOLOGIA, 2009, 33 (04): : 1061 - 1070
  • [8] Adaptive Proposal Construction for Reversible Jump MCMC
    Ehlers, Ricardo S.
    Brooks, Stephen P.
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2008, 35 (04) : 677 - 690
  • [9] Bayesian Volterra system identification using reversible jump MCMC algorithm
    Karakus, O.
    Kuruoglu, E. E.
    Altinkaya, M. A.
    [J]. SIGNAL PROCESSING, 2017, 141 : 125 - 136
  • [10] Convergence assessment for reversible jump MCMC simulations
    Brooks, SP
    Giudici, P
    [J]. BAYESIAN STATISTICS 6, 1999, : 733 - 742