Accelerating Large-scale Image Retrieval on Heterogeneous Architectures with Spark

被引:3
|
作者
Wang, Hanli [1 ]
Xiao, Bo
Wang, Lei
Wu, Jun
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai, Peoples R China
关键词
Heterogeneous Computing; Spark; Image Retrieval; Graphics Processing Units;
D O I
10.1145/2733373.2806392
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Apache Spark is a general-purpose cluster computing system for big data processing and has drawn much attention recently from several fields, such as pattern recognition, machine learning and so on. Unlike MapReduce, Spark is especially suitable for iterative and interactive computations. With the computing power of Spark, a utility library, referred to as IRlib, is proposed in this work to accelerate large-scale image retrieval applications by jointly harnessing the power of GPU. Similar to the built-in machine learning library of Spark, namely MLlib, IRlib fits into the Spark APIs and benefits from the powerful functionalities of Spark. The main contributions of IRlib lie in two-folds. First, IRlib provides a uniform set of APIs for the programming of image retrieval applications. Second, the computational performance of Spark equipped with multiple GPUs is dramatically boosted by developing high performance modules for common image retrieval related algorithms. Comparative experiments concerning large-scale image retrieval are carried out to demonstrate the significant performance improvement achieved by IRlib as compared with single CPU thread implementation as well as Spark without GPUs employed.
引用
收藏
页码:1023 / 1026
页数:4
相关论文
共 50 条
  • [1] Accelerating Large-Scale Genomic Analysis with Spark
    Li, Xueqi
    Tan, Guangming
    Zhang, Chunming
    Li, Xu
    Zhang, Zhonghai
    Sun, Ninghui
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 747 - 751
  • [2] Large-Scale Image Retrieval with Elasticsearch
    Amato, Giuseppe
    Bolettieri, Paolo
    Carrara, Fabio
    Falchi, Fabrizio
    Gennaro, Claudio
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 925 - 928
  • [3] Accelerating Relevance Vector Machine for Large-Scale Data on Spark
    Liu, Fang
    Zhong, Hao
    Li, Si-Han
    [J]. 4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [4] A Reconfigurable Simulator for Large-scale Heterogeneous Multicore Architectures
    Meng, Jiayuan
    Skadron, Kevin
    [J]. IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2011), 2011, : 119 - 120
  • [5] Region Division for Large-scale Image Retrieval
    Rao, Yunbo
    Liu, Wei
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (10) : 5197 - 5218
  • [6] Similarity caching in large-scale image retrieval
    Falchi, Fabrizio
    Lucchese, Claudio
    Orlando, Salvatore
    Perego, Raffaele
    Rabitti, Fausto
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (05) : 803 - 818
  • [7] Weak Attributes for Large-Scale Image Retrieval
    Yu, Felix X.
    Ji, Rongrong
    Tsai, Ming-Hen
    Ye, Guangnan
    Chang, Shih-Fu
    [J]. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2949 - 2956
  • [8] Manhattan Hashing for Large-Scale Image Retrieval
    Kong, Weihao
    Li, Wu-Jun
    Guo, Minyi
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 45 - 54
  • [9] Deep Hashing for Large-scale Image Retrieval
    Li Mengting
    Liu Jun
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10940 - 10944
  • [10] CHCF: A Cloud-Based Heterogeneous Computing Framework for Large-Scale Image Retrieval
    Wang, Hanli
    Xiao, Bo
    Wang, Lei
    Zhu, Fengkuangtian
    Jiang, Yu-Gang
    Wu, Jun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (12) : 1900 - 1913