A Locality Sensitive Hashing Technique for Categorical Data

被引:4
|
作者
Lee, Kyung Mi [1 ]
Lee, Keon Myung [1 ]
机构
[1] Chungbuk Natl Univ, Dept Comp Sci, Chonju 361763, Chungbuk, South Korea
关键词
data analysis; categorical data; locality sensitive hashing; similar pair identification;
D O I
10.4028/www.scientific.net/AMM.241-244.3159
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The measured data may contain various types of attributes such as continuous, categorical, and set-valued attributes. Several locality-sensitive hashing techniques, which enable to find similar pairs of data in a fast and approximate way, have been developed for data with either numeric or set-valued attributes. This paper introduces a new locality sensitive-hashing technique applicable to data with categorical attributes.
引用
收藏
页码:3159 / 3164
页数:6
相关论文
共 50 条
  • [1] DASH: Data Aware Locality Sensitive Hashing
    Tan, Zongyuan
    Wang, Hongya
    Du, Ming
    Zhang, Jie
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 85 - 100
  • [2] A Novel Cluster Prediction Approach Based on Locality-Sensitive Hashing for Fuzzy Clustering of Categorical Data
    Toan Nguyen Mau
    Inoguchi, Yasushi
    Van-Nam Huynh
    [J]. IEEE ACCESS, 2022, 10 : 34196 - 34206
  • [3] ON THE DISTORTION OF LOCALITY SENSITIVE HASHING
    Chierichetti, Flavio
    Kumar, Ravi
    Panconesi, Alessandro
    Terolli, Erisa
    [J]. SIAM JOURNAL ON COMPUTING, 2019, 48 (02) : 350 - 372
  • [4] Catching the Flow with Locality Sensitive Hashing in Programmable Data Planes
    Cao, Zuowei
    Chen, Xiao
    Sheng, Yiqiang
    Nil, Hong
    [J]. PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 216 - 220
  • [5] A Fast Word Retrieval Technique Based on Kernelized Locality Sensitive Hashing
    Mondal, Tanmoy
    Ragot, Nicolas
    Ramel, Jean-Yves
    Pal, Umapada
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1195 - 1199
  • [6] Similar Pair Identification using Locality-Sensitive Hashing Technique
    Lee, Kyung Mi
    Lee, Keon Myung
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 2117 - 2119
  • [7] Lower bounds on locality sensitive hashing
    Motwani, Rajeev
    Naor, Assaf
    Panigrahy, Rina
    [J]. SIAM JOURNAL ON DISCRETE MATHEMATICS, 2007, 21 (04) : 930 - 935
  • [8] Locality Sensitive Hashing Using GMM
    Schmieder, Fabian
    Yang, Bin
    [J]. PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 569 - 581
  • [9] Kernelized Locality-Sensitive Hashing
    Kulis, Brian
    Grauman, Kristen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) : 1092 - 1104
  • [10] Refining Codes for Locality Sensitive Hashing
    Liu, Huawen
    Zhou, Wenhua
    Wu, Zongda
    Zhang, Shichao
    Li, Gang
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (03) : 1274 - 1284