Practical scalable image analysis and indexing using Hadoop

被引:7
|
作者
Hare, Jonathon S. [1 ]
Samangooei, Sina [1 ]
Lewis, Paul H. [1 ]
机构
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
MapReduce; Hadoop; Bag of visual words; Image retrieval; SCALE;
D O I
10.1007/s11042-012-1256-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ability to handle very large amounts of image data is important for image analysis, indexing and retrieval applications. Sadly, in the literature, scalability aspects are often ignored or glanced over, especially with respect to the intricacies of actual implementation details. In this paper we present a case-study showing how a standard bag-of-visual-words image indexing pipeline can be scaled across a distributed cluster of machines. In order to achieve scalability, we investigate the optimal combination of hybridisations of the MapReduce distributed computational framework which allows the components of the analysis and indexing pipeline to be effectively mapped and run on modern server hardware. We then demonstrate the scalability of the approach practically with a set of image analysis and indexing tools built on top of the Apache Hadoop MapReduce framework. The tools used for our experiments are freely available as open-source software, and the paper fully describes the nuances of their implementation.
引用
收藏
页码:1215 / 1248
页数:34
相关论文
共 50 条
  • [31] Image indexing using moments and wavelets
    Mandal, MK
    Aboulnasr, T
    Panchanathan, S
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1996, 42 (03) : 557 - 565
  • [32] Distributed Image Processing Using Hadoop and HIPI
    Arsh, Swapnil
    Bhatt, Abhishek
    Kumar, Praveen
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2673 - 2676
  • [33] An Efficient Data Indexing Approach on Hadoop Using Java']Java Persistence API
    Yang Lai
    Shi ZhongZhi
    INTELLIGENT INFORMATION PROCESSING V, 2010, 340 : 213 - 224
  • [34] Biospark: scalable analysis of large numerical datasets from biological simulations and experiments using Hadoop and Spark
    Klein, Max
    Sharma, Rati
    Bohrer, Chris H.
    Avelis, Cameron M.
    Roberts, Elijah
    BIOINFORMATICS, 2017, 33 (02) : 303 - 305
  • [35] Multimedia content analysis and indexing: Evaluation of a distributed and scalable architecture
    Mandviwala, HA
    Blackwell, S
    Weikart, C
    Van Thong, J
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 137 - 145
  • [36] Scalable indexing of HD Video
    Morand, C.
    Benois-Pineau, J.
    Domenger, J-Ph.
    2008 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2008, : 401 - 408
  • [37] Scalable indexing for perceptual data
    Qamra, Arun
    Chang, Edward Y.
    MULTIMEDIA CONTENT ANALYSIS AND MINING, PROCEEDINGS, 2007, 4577 : 24 - +
  • [38] Text-based Image Indexing and Retrieval using Formal Concept Analysis
    Ahmad, Imran Shafiq
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2008, 2 (03): : 150 - 170
  • [39] Image Indexing and Retrieval Using GSOM Algorithm
    Gabryel, Marcin
    Grycuk, Rafal
    Korytkowski, Marcin
    Holotyak, Taras
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2015, 9119 : 706 - 714
  • [40] Image indexing using classified vector quantization
    Wei, Hai
    Shen, Lan-Sun
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2001, 29 (07): : 933 - 936