A Locality Sensitive Hashing Technique for Categorical Data

被引:4
|
作者
Lee, Kyung Mi [1 ]
Lee, Keon Myung [1 ]
机构
[1] Chungbuk Natl Univ, Dept Comp Sci, Chonju 361763, Chungbuk, South Korea
关键词
data analysis; categorical data; locality sensitive hashing; similar pair identification;
D O I
10.4028/www.scientific.net/AMM.241-244.3159
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The measured data may contain various types of attributes such as continuous, categorical, and set-valued attributes. Several locality-sensitive hashing techniques, which enable to find similar pairs of data in a fast and approximate way, have been developed for data with either numeric or set-valued attributes. This paper introduces a new locality sensitive-hashing technique applicable to data with categorical attributes.
引用
收藏
页码:3159 / 3164
页数:6
相关论文
共 50 条
  • [1] DASH: Data Aware Locality Sensitive Hashing
    Tan, Zongyuan
    Wang, Hongya
    Du, Ming
    Zhang, Jie
    WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 85 - 100
  • [2] A Novel Cluster Prediction Approach Based on Locality-Sensitive Hashing for Fuzzy Clustering of Categorical Data
    Toan Nguyen Mau
    Inoguchi, Yasushi
    Van-Nam Huynh
    IEEE ACCESS, 2022, 10 : 34196 - 34206
  • [3] ON THE DISTORTION OF LOCALITY SENSITIVE HASHING
    Chierichetti, Flavio
    Kumar, Ravi
    Panconesi, Alessandro
    Terolli, Erisa
    SIAM JOURNAL ON COMPUTING, 2019, 48 (02) : 350 - 372
  • [4] Catching the Flow with Locality Sensitive Hashing in Programmable Data Planes
    Cao, Zuowei
    Chen, Xiao
    Sheng, Yiqiang
    Nil, Hong
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 216 - 220
  • [5] A Fast Word Retrieval Technique Based on Kernelized Locality Sensitive Hashing
    Mondal, Tanmoy
    Ragot, Nicolas
    Ramel, Jean-Yves
    Pal, Umapada
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1195 - 1199
  • [6] Similar Pair Identification using Locality-Sensitive Hashing Technique
    Lee, Kyung Mi
    Lee, Keon Myung
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 2117 - 2119
  • [7] Lower bounds on locality sensitive hashing
    Motwani, Rajeev
    Naor, Assaf
    Panigrahy, Rina
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2007, 21 (04) : 930 - 935
  • [8] Refining Codes for Locality Sensitive Hashing
    Liu, Huawen
    Zhou, Wenhua
    Wu, Zongda
    Zhang, Shichao
    Li, Gang
    Li, Xuelong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (03) : 1274 - 1284
  • [9] Kernelized Locality-Sensitive Hashing
    Kulis, Brian
    Grauman, Kristen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) : 1092 - 1104
  • [10] Locality Sensitive Hashing Using GMM
    Schmieder, Fabian
    Yang, Bin
    PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 569 - 581