Multi-resolution similarity hashing

Cited by: 46
Authors
Roussev, Vassil [1]
Richard, Golden G., III [1]
Marziale, Lodovico [1]
Affiliations
[1] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
Keywords
hashing; similarity hashing; digital forensics; multi-resolution hash; file correlation; data correlation
DOI
10.1016/j.diin.2007.06.011
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Large-scale digital forensic investigations present at least two fundamental challenges. The first one is accommodating the computational needs of a large amount of data to be processed. The second one is extracting useful information from the raw data in an automated fashion. Both of these problems could result in long processing times that can seriously hamper an investigation. In this paper, we discuss a new approach to one of the basic operations that is invariably applied to raw data: hashing. The essential idea is to produce an efficient and scalable hashing scheme that can be used to supplement the traditional cryptographic hashing during the initial pass over the raw evidence. The goal is to retain enough information to allow binary data to be queried for similarity at various levels of granularity without any further pre-processing/indexing. The specific solution we propose, called a multi-resolution similarity hash (or MRS hash), is a generalization of recent work in the area. Its main advantages are robust performance (raw speed comparable to a high-grade block-level crypto hash), scalability (the ability to compare targets that vary in size by orders of magnitude), and space efficiency (typically below 0.5% of the size of the target). (C) 2007 DFRWS. Published by Elsevier Ltd. All rights reserved.
Pages: S105-S113
Number of pages: 9