Big Data Retrieval Using Locality-Sensitive Hashing with Document-Based NoSQL Database

Cited by: 3
Authors
Gayathiri, N. R. [1]
Natarajan, A. M. [2]
Affiliations
[1] Bannari Amman Inst Technol, Dept Artificial Intelligence & Data Sci, Sathyamangalam 638401, India
[2] KPR Inst Engn & Technol, Dept Comp Sci & Engn, Coimbatore 641407, Tamil Nadu, India
Keywords
LSH; buckets; hyperplanes; query; document; MongoDB
DOI
10.1080/03772063.2021.1912654
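Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Subject classification codes
0808; 0809
Abstract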
A locality-sensitive hashing (LSH) method for document-based NoSQL databases is proposed to improve arbitrary-read performance over previous methodologies. The proposed hash index improves efficiency by reducing the amount of data accessed for search queries, which it does by creating buckets based on hyperplanes. LSH hashes the input data so that similar items map to the same bucket with high probability. The method attempts to decrease the number of candidate data objects matched while also reducing the number of missed nearest neighbors. The data space is divided by randomly chosen hyperplanes to decrease the number of candidate objects. Values near the boundaries (adjacent to either side of a hyperplane) are also considered. The length of a bucket's label string equals the number of hyperplanes used. The effect of LSH on bucket-size balancing, together with an analysis of non-indexed, hash-indexed, and global-indexed datasets on MongoDB, demonstrates the superiority of the presented hash index.
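The bucketing idea in the abstract can be illustrated with a minimal random-hyperplane sketch. This is not the authors' implementation: the function names, the `margin` threshold, the Gaussian sampling of hyperplanes, and the document layout (`_id`, `vec`) are all assumptions made for illustration. It shows only the two properties the abstract states: a bucket label is a bit string with one character per hyperplane, and points lying close to a hyperplane are also looked up in the bucket on the other side.

```python
import random
from collections import defaultdict


def make_hyperplanes(num_planes, dim, seed=0):
    """Draw random hyperplanes through the origin (Gaussian normals; an assumption)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_planes)]


def projections(vec, planes):
    """Dot product of vec with each hyperplane normal."""
    return [sum(p * v for p, v in zip(plane, vec)) for plane in planes]


def bucket_label(vec, planes):
    """One bit per hyperplane, so len(label) == number of hyperplanes."""
    return "".join("1" if d >= 0 else "0" for d in projections(vec, planes))


def probe_labels(vec, planes, margin):
    """Home bucket plus neighbouring buckets for hyperplanes the point is close to.

    `margin` is a hypothetical threshold on the raw (unnormalised) projection.
    """
    dots = projections(vec, planes)
    home = "".join("1" if d >= 0 else "0" for d in dots)
    labels = [home]
    for i, d in enumerate(dots):
        if abs(d) < margin:
            flipped = "0" if home[i] == "1" else "1"
            labels.append(home[:i] + flipped + home[i + 1:])
    return labels


def build_index(docs, planes):
    """Map bucket label -> list of document ids (an in-memory stand-in for a DB index)."""
    index = defaultdict(list)
    for doc in docs:
        index[bucket_label(doc["vec"], planes)].append(doc["_id"])
    return index


if __name__ == "__main__":
    planes = make_hyperplanes(num_planes=4, dim=3, seed=1)
    docs = [{"_id": i, "vec": [i - 2.0, 1.0, 0.5 * i]} for i in range(5)]
    index = build_index(docs, planes)
    query = [0.1, 1.0, 0.4]
    candidates = [doc_id
                  for label in probe_labels(query, planes, margin=0.2)
                  for doc_id in index.get(label, [])]
    print(bucket_label(query, planes), candidates)
```

In a document store such as MongoDB, the label could plausibly be saved as an ordinary field on each document and indexed, so that a query touches only the documents in the probed buckets; how the paper actually stores and indexes the label is not stated in the abstract.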
Pages: 969-978
Page count: 10