A fast and scalable similarity search in high-dimensional image datasets

被引:0
|
作者
Hanyf, Youssef [1 ]
Silkan, Hassan [1 ]
机构
[1] Chouaib Doukkali Univ, Fac Sci, Comp Sci Dept, El Jadida, Morocco
关键词
similarity search; high-dimensional images datasets; D-index; image datasets indexing; scalable datasets; content-based retrieval; metric spaces; data structure; ALGORITHM;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Owing to the development of image data production and use, the quantity of image datasets has exponentially increased in the last decade. Consequently, the similarity searching cost in image datasets becomes a severe problem which affects the efficiency of similarity search engines in this data type. In this paper, we address the problem of reducing the similarity search cost in large, high-dimensional and scalable image datasets; we propose an improvement of the D-index method to reduce the searching cost and to deal efficiently with scalable datasets. The proposed improvement is based on two propositions; first, we propose criteria and algorithms to choose effective separation values which can reduce the searching cost. Second, we propose an algorithm for updating the structure in case of scalable datasets to resist the impact of objects' insertion on the searching cost. The experiments show that the proposed D-index version has proved a good searching performance in comparison with the classical D-index and a significant resistance to the dataset scalability against the original D-index.
引用
收藏
页码:95 / 104
页数:10
相关论文
共 50 条
  • [1] High-Dimensional Similarity Search for Scalable Data Science
    Echihabi, Karima
    Zoumpatianos, Kostas
    Palpanas, Themis
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2369 - 2372
  • [2] Fast similarity search for high-dimensional dataset
    Wang, Quan
    You, Suya
    [J]. ISM 2006: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2006, : 799 - +
  • [3] EncSIM: An Encrypted Similarity Search Service for Distributed High-dimensional Datasets
    Liu, Xiaoning
    Yuan, Xingliang
    Wang, Cong
    [J]. 2017 IEEE/ACM 25TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2017,
  • [4] Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data
    Bashyam, K. G. Renga
    Vadhiyar, Sathish
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 294 - 302
  • [5] Fast approximate similarity search in extremely high-dimensional data sets
    Houle, ME
    Sakuma, J
    [J]. ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 619 - 630
  • [6] The GC-tree: A high-dimensional index structure for similarity search in image databases
    Cha, GH
    Chung, CW
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (02) : 235 - 247
  • [7] Clustering for approximate similarity search in high-dimensional spaces
    Li, C
    Chang, E
    Garcia-Molina, H
    Wiederhold, G
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (04) : 792 - 808
  • [8] Memory Vectors for Similarity Search in High-Dimensional Spaces
    Iscen, Ahmet
    Furon, Teddy
    Gripon, Vincent
    Rabbat, Michael
    Jegou, Herve
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2018, 4 (01) : 65 - 77
  • [9] What's Wrong with High-Dimensional Similarity Search?
    Blott, Stephen
    Weber, Roger
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 3 - 3
  • [10] An adaptive index structure for high-dimensional similarity search
    Wu, P
    Manjunath, BS
    Chandrasekaran, S
    [J]. ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 71 - 77