FAST METRIC EMBEDDING INTO THE HAMMING CUBE

被引:0
|
作者
Dirksen, Sjoerd [1 ]
Mendelson, Shahar [2 ,3 ]
Stollenwerk, Alexander [4 ]
机构
[1] Univ Utrecht, Math Inst, Utrecht, Netherlands
[2] Univ Warwick, Dept Stat, Coventry, England
[3] Australian Natl Univ, Ctr Math & Applicat, Canberra, Australia
[4] UCLouvain, ICTEAM Inst, Louvain La Neuve, Belgium
关键词
dimension reduction; Johnson-Lindenstrauss embeddings; Hamming cube; circulant matrices; Gaussian width; JOHNSON-LINDENSTRAUSS TRANSFORM; DIMENSION REDUCTION; LEMMA;
D O I
10.1137/22M1520220
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of embedding a subset of Rn into a low -dimensional Hamming cube in an almost isometric way. We construct a simple, data -oblivious, and computationally efficient map that achieves this task with high probability; we first apply a specific structured random matrix, which we call the double circulant matrix; using that a matrix requires linear storage and matrix -vector multiplication that can be performed in near -linear time. We then binarize each vector by comparing each of its entries to a random threshold, selected uniformly at random from a well-chosen interval. We estimate the number of bits required for this encoding scheme in terms of two natural geometric complexity parameters of the set: its Euclidean covering numbers and its localized Gaussian complexity. The estimate we derive turns out to be the best that one can hope for, up to logarithmic terms. The key to the proof is a phenomenon of independent interest: we show that the double circulant matrix mimics the behavior of the Gaussian matrix in two important ways. First, it maps an arbitrary set in Rn into a set of well -spread vectors. Second, it yields a fast nearisometric embedding of any finite subset of \ell2n into \ell1m. This embedding achieves the same dimension reduction as the Gaussian matrix in near -linear time, under an optimal condition ---up to logarithmic factors ---on the number of points to be embedded. This improves a well-known construction due to Ailon and Chazelle.
引用
收藏
页码:315 / 345
页数:31
相关论文
共 50 条
  • [41] Low Distortion Embedding of the Hamming Space into a Sphere with Quadrance Metric and k-means Clustering of Nominal-continuous Data
    Denisiuk, Aleksander
    Grabowski, Michal
    FUNDAMENTA INFORMATICAE, 2017, 153 (03) : 221 - 233
  • [42] Metric complements to subspaces in the Boolean cube
    Oblaukhov A.K.
    Oblaukhov, A.K. (oblaukhov@gmail.com), 2016, Izdatel'stvo Nauka (10) : 397 - 403
  • [43] New bounds of permutation codes under Hamming metric and Kendall's τ-metric
    Wang, Xin
    Zhang, Yiwei
    Yang, Yiting
    Ge, Gennian
    DESIGNS CODES AND CRYPTOGRAPHY, 2017, 85 (03) : 533 - 545
  • [44] Embedding of eigenfunctions of the Johnson graph into eigenfunctions of the Hamming graph
    Vorob'ev K.V.
    Vorob'ev, K. V. (konstantin.vorobev@gmail.com), 1600, Izdatel'stvo Nauka (08): : 136 - 142
  • [45] Clustering, Hamming Embedding, Generalized LSH and the Max Norm
    Neyshabur, Behnam
    Makarychev, Yury
    Srebro, Nathan
    ALGORITHMIC LEARNING THEORY (ALT 2014), 2014, 8776 : 306 - 320
  • [46] Deep Hamming Embedding Based Hashing for Image Retrieval
    Lin J.
    Liu H.
    Zheng Z.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (06): : 542 - 550
  • [47] Bi-Lipschitz bijection between the Boolean cube and the Hamming ball
    Benjamini, Itai
    Cohen, Gil
    Shinkar, Igor
    ISRAEL JOURNAL OF MATHEMATICS, 2016, 212 (02) : 677 - 703
  • [48] Hamming and simplex codes for the sum-rank metric
    Umberto Martínez-Peñas
    Designs, Codes and Cryptography, 2020, 88 : 1521 - 1539
  • [49] Parametric local multiview hamming distance metric learning
    Zhai, Deming
    Liu, Xianming
    Chang, Hong
    Zhen, Yi
    Chen, Xilin
    Guo, Maozu
    Gao, Wen
    PATTERN RECOGNITION, 2018, 75 : 250 - 262
  • [50] Bi-Lipschitz bijection between the Boolean cube and the Hamming ball
    Itai Benjamini
    Gil Cohen
    Igor Shinkar
    Israel Journal of Mathematics, 2016, 212 : 677 - 703