A fast and scalable similarity search in high-dimensional image datasets

被引:0
|
作者
Hanyf, Youssef [1 ]
Silkan, Hassan [1 ]
机构
[1] Chouaib Doukkali Univ, Fac Sci, Comp Sci Dept, El Jadida, Morocco
关键词
similarity search; high-dimensional images datasets; D-index; image datasets indexing; scalable datasets; content-based retrieval; metric spaces; data structure; ALGORITHM;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Owing to the development of image data production and use, the quantity of image datasets has exponentially increased in the last decade. Consequently, the similarity searching cost in image datasets becomes a severe problem which affects the efficiency of similarity search engines in this data type. In this paper, we address the problem of reducing the similarity search cost in large, high-dimensional and scalable image datasets; we propose an improvement of the D-index method to reduce the searching cost and to deal efficiently with scalable datasets. The proposed improvement is based on two propositions; first, we propose criteria and algorithms to choose effective separation values which can reduce the searching cost. Second, we propose an algorithm for updating the structure in case of scalable datasets to resist the impact of objects' insertion on the searching cost. The experiments show that the proposed D-index version has proved a good searching performance in comparison with the classical D-index and a significant resistance to the dataset scalability against the original D-index.
引用
收藏
页码:95 / 104
页数:10
相关论文
共 50 条
  • [41] Fast and scalable learning of sparse changes in high-dimensional graphical model structure
    Wang, Beilun
    Zhang, Jiaqi
    Xu, Haoqing
    Tao, Te
    [J]. NEUROCOMPUTING, 2022, 514 : 39 - 57
  • [42] Fast and Scalable Spike and Slab Variable Selection in High-Dimensional Gaussian Processes
    Dance, Hugh
    Paige, Brooks
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [43] High-dimensional similarity search using data-sensitive space partitioning
    Kulkarni, Sachin
    Orlandic, Ratko
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, 4080 : 738 - 750
  • [44] Bandwidth Efficient Near-Storage Accelerator for High-Dimensional Similarity Search
    Sun, Gongjin
    Jun, Sang-Woo
    [J]. 2020 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2020), 2020, : 129 - 138
  • [45] An efficient high-dimensional index structure using cell signatures for similarity search
    Chang, JW
    Song, KT
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2001, 2118 : 26 - 33
  • [46] A Hierarchical Bitmap Indexing Method for Similarity Search in High-Dimensional Multimedia Databases
    Nang, Jongho
    Park, Joohyoun
    Yang, Jihoon
    Kim, Saejoon
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2010, 26 (02) : 393 - 407
  • [47] Scalable high-dimensional indexing with Hadoop
    Shestakov, Denis
    Moise, Diana
    Gudmundsson, Gylfi
    Amsaleg, Laurent
    [J]. 2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013), 2013, : 207 - 212
  • [48] LGTM: A Fast and Accurate kNN Search Algorithm in High-Dimensional Spaces
    Arai, Yusuke
    Amagata, Daichi
    Fujita, Sumio
    Hara, Takahiro
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT II, 2021, 12924 : 220 - 231
  • [49] Progressive high-dimensional similarity join
    Tok, Wee Hyong
    Bressan, Stephane
    Lee, Mong-Li
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, 4653 : 233 - +
  • [50] MLR-Index: An Index Structure for Fast and Scalable Similarity Search in High Dimensions
    Malik, Rahul
    Kim, Sangkyum
    Jin, Xin
    Ramachandran, Chandrasekar
    Han, Jiawei
    Gupta, Indranil
    Nahrstedt, Klara
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2009, 5566 : 167 - 184