Big Data Retrieval Using Locality-Sensitive Hashing with Document-Based NoSQL Database

被引:3
|
作者
Gayathiri, N. R. [1 ]
Natarajan, A. M. [2 ]
机构
[1] Bannari Amman Inst Technol, Dept Artificial Intelligence & Data Sci, Sathyamangalam 638401, India
[2] KPR Inst Engn & Technol, Dept Comp Sci & Engn, Coimbatore 641407, Tamil Nadu, India
关键词
LSH; buckets; hyperplanes; query; document; MongoDB;
D O I
10.1080/03772063.2021.1912654
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A locality-sensitive hashing (LSH) method in the document-based NoSQL database is proposed for enhancing the ability of arbitrary reads over the previous methodologies. The proposed hash index improves efficiency by reducing the amount of accessing data for search queries by creating buckets based on hyperplanes. The LSH hashes the input data where similar items with high probability maps to the same bucket. They attempt to decrease the volume of candidate data objects matched when reducing the missed nearest neighbors. The data space is divided with randomly chosen hyperplanes to decrease the volume of candidate objects. The values which are nearer to the boundaries (adjacent to the two sides of the hyperplane) are considered. The bucket label's string length is equivalent to the amount of used hyperplanes. The effect of LSH for bucket size balancing and analysis of the non-indexed, hash index, and global-indexed dataset on MongoDB depicts the pre-eminence of the presented hash index.
引用
收藏
页码:969 / 978
页数:10
相关论文
共 50 条
  • [41] Efficient Locality-Sensitive Hashing Over High-Dimensional Data Streams
    Yang, Chengcheng
    Deng, Dong
    Shang, Shuo
    Shao, Ling
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1994 - 1997
  • [42] Efficient locality-sensitive hashing over high-dimensional streaming data
    Hao Wang
    Chengcheng Yang
    Xiangliang Zhang
    Xin Gao
    [J]. Neural Computing and Applications, 2023, 35 : 3753 - 3766
  • [43] Efficient Outlier Detection in Hyperedge Streams Using MinHash and Locality-Sensitive Hashing
    Ranshous, Stephen
    Chaudhary, Mandar
    Samatova, Nagiza F.
    [J]. COMPLEX NETWORKS & THEIR APPLICATIONS VI, 2018, 689 : 105 - 116
  • [44] Hardware acceleration of k-mer clustering using locality-sensitive hashing
    Soto, Javier E.
    Krohmer, Thomas
    Hernandez, Cecilia
    Figueroa, Miguel
    [J]. 2019 22ND EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2019, : 659 - 662
  • [45] Privacy-preserving Distributed Service Recommendation based on Locality-Sensitive Hashing
    Qi, Lianyong
    Xiang, Haolong
    Dou, Wanchun
    Yang, Chi
    Qin, Yongrui
    Zhang, Xuyun
    [J]. 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2017), 2017, : 49 - 56
  • [46] A Locality-Sensitive Hashing-Based Jamming Detection System for IoT Networks
    Ganeshkumar, P.
    Albalawi, Talal
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 5943 - 5959
  • [47] A Graph Classification Method Based on Support Vector Machines and Locality-Sensitive Hashing
    Gonzalez-Lima, Maria D.
    Ludena, Carenne C.
    Otazo-Sanchez, Gibran G.
    [J]. IEEE ACCESS, 2024, 12 : 15791 - 15799
  • [48] Amplified locality-sensitive hashing-based recommender systems with privacy protection
    Chi, Xiaoxiao
    Yan, Chao
    Wang, Hao
    Rafique, Wajid
    Qi, Lianyong
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (14):
  • [49] A Projection-based Locality-Sensitive Hashing Technique for Reducing False Negatives
    Lee, Keon Myung
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1341 - 1346
  • [50] Improving Kernel Locality-Sensitive Hashing Using Pre-Images and Bounds
    Bodo, Zalan
    Csato, Lehel
    [J]. 2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,