Distributed Kd-Trees for Retrieval from Very Large Image Collections

被引:34
|
作者
Aly, Mohamed [1 ]
Munich, Mario [2 ]
Perona, Pietro [1 ]
机构
[1] CALTECH, Computat Vis Grp, Pasadena, CA 91125 USA
[2] Evolut Robot, Pasadena, CA 91106 USA
关键词
D O I
10.5244/C.25.40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributed Kd-Trees is a method for building image retrieval systems that can handle hundreds of millions of images. It is based on dividing the Kd-Tree into a "root subtree" that resides on a root machine, and several "leaf subtrees", each residing on a leaf machine. The root machine handles incoming queries and farms out feature matching to an appropriate small subset of the leaf machines. Our implementation employs the MapReduce architecture to efficiently build and distribute the Kd-Tree for millions of images. It can run on thousands of machines, and provides orders of magnitude more throughput than the state-of-the-art, with better recognition performance. We show experiments with up to 100 million images running on 2048 machines, with run time of a fraction of a second for each query image.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Randomly Projected KD-Trees with Distance Metric Learning for Image Retrieval
    Wu, Pengcheng
    Hoi, Steven C. H.
    Duc Dung Nguyen
    He, Ying
    ADVANCES IN MULTIMEDIA MODELING, PT II, 2011, 6524 : 371 - 382
  • [2] Medical image collection indexing: Shape-based retrieval using KD-trees
    Robinson, GP
    Tagare, HD
    Duncan, JS
    Jaffe, CC
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 1996, 20 (04) : 209 - 217
  • [3] Medical image collection indexing: Shape-based retrieval using KD-trees
    Department of Diagnostic Radiology, Yale University, New Haven, CT 06520, United States
    不详
    不详
    COMPUT. MED. IMAGING GRAPH., 4 (209-217):
  • [4] Optimised KD-trees for fast image descriptor matching
    Silpa-Anan, Chanop
    Hartley, Richard
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2308 - +
  • [5] Kd-trees and the real disclosure risks of large statistical databases
    Herranz, Javier
    Nin, Jordi
    Sole, Marc
    INFORMATION FUSION, 2012, 13 (04) : 260 - 273
  • [6] Integrating improved inner distance method with Kd-trees for fast shape retrieval
    Shen, Jin-Ke
    Zhang, Feng-Min
    Wang, Hai-Hui
    Wu, Yun-Tao
    Journal of Harbin Institute of Technology (New Series), 2010, 17 (SUPPL. 2) : 101 - 104
  • [7] Very fast EM-based mixture model clustering using multiresolution kd-trees
    Moore, AW
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 543 - 549
  • [8] Image Retrieval for Online Browsing in Large Image Collections
    Mikulik, Andrej
    Chum, Ondrej
    Matas, Jiri
    SIMILARITY SEARCH AND APPLICATIONS (SISAP), 2013, 8199 : 3 - 15
  • [9] Reverse Annotation Based Retrieval from Large Document Image Collections
    Sankar, Pramod K.
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 921 - 921
  • [10] The implementation of distributed retrieval system for a large number of collections
    Kim, Kwang-Young
    Kim, Jin-Suk
    Lee, Seok-Hyong
    Lee, Min-Ho
    Choi, Sung-Pil
    Choi, Yun-Soo
    Jin, Du-Seok
    Cho, Min-Hee
    Jeong, Chang-Hoo
    Kang, Nam-Gyu
    Choe, Ho-Seop
    Seo, Jerry
    Yoon, Hwa-Mook
    EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2007, : 244 - 248