A Locality Sensitive Hashing Technique for Categorical Data

Cited by: 4
Authors
Lee, Kyung Mi [1]
Lee, Keon Myung [1]
Affiliation
[1] Chungbuk Natl Univ, Dept Comp Sci, Chonju 361763, Chungbuk, South Korea
Keywords
data analysis; categorical data; locality sensitive hashing; similar pair identification
DOI
10.4028/www.scientific.net/AMM.241-244.3159
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Measured data may contain various types of attributes, such as continuous, categorical, and set-valued attributes. Several locality-sensitive hashing techniques, which make it possible to find similar pairs of data quickly and approximately, have been developed for data with either numeric or set-valued attributes. This paper introduces a new locality-sensitive hashing technique applicable to data with categorical attributes.
Pages: 3159-3164
Page count: 6