ADAGIO: Fast Data-aware Near-Isometric Linear Embeddings

Cited by: 0
Authors
Blasiok, Jaroslaw [1]
Tsourakakis, Charalampos E. [1]
Affiliations
[1] Harvard Univ, SEAS, Cambridge, MA 02138 USA
Keywords
JOHNSON-LINDENSTRAUSS; ALGORITHMS; NEIGHBOR;
DOI
10.1109/ICDM.2016.127
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many important applications, including signal reconstruction, parameter estimation, and signal processing in a compressed domain, rely on a low-dimensional representation of the dataset that preserves all pairwise distances between the data points and leverages the inherent geometric structure that is typically present. Recently Hegde, Sankaranarayanan, Yin and Baraniuk [19] proposed the first data-aware near-isometric linear embedding which achieves the best of both worlds. However, their method NuMax does not scale to large-scale datasets. Our main contribution is a simple, data-aware, near-isometric linear dimensionality reduction method which significantly outperforms a state-of-the-art method [19] with respect to scalability while achieving high quality near-isometries. Furthermore, our method comes with strong worst-case theoretical guarantees that allow us to guarantee the quality of the obtained near-isometry. We verify experimentally the efficiency of our method on numerous real-world datasets, where we find that our method (<10 secs) is more than 3,000x faster than the state-of-the-art method [19] (>9 hours) on medium scale datasets with 60,000 datapoints in 784 dimensions. Finally, we use our method as a preprocessing step to increase the computational efficiency of a classification application and for speeding up approximate nearest neighbor queries.
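To make the notion of a near-isometric linear embedding concrete, the sketch below measures how much a linear map P distorts squared pairwise distances. It assumes one common formalization: P is a near-isometry with constant delta if (1 - delta)||x - y||^2 <= ||P(x - y)||^2 <= (1 + delta)||x - y||^2 for all data points x, y. The projection used here is a plain data-oblivious Gaussian random projection in the Johnson-Lindenstrauss style, chosen only as a baseline; it is not the ADAGIO method and not NuMax. The function names, the synthetic data, and the sizes n, d, k are illustrative assumptions.

```python
import numpy as np


def pairwise_sq_dists(X):
    """Squared Euclidean distances between all rows of X (n x d)."""
    sq = np.sum(X * X, axis=1)
    D = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.maximum(D, 0.0)  # clip tiny negatives from round-off


def isometry_distortion(X, P):
    """Largest relative change in squared pairwise distance under x -> P x.

    Assumes the rows of X are distinct, so no original distance is zero.
    """
    D_orig = pairwise_sq_dists(X)
    D_emb = pairwise_sq_dists(X @ P.T)
    iu = np.triu_indices(X.shape[0], k=1)  # each unordered pair once
    ratio = D_emb[iu] / D_orig[iu]
    return float(np.max(np.abs(ratio - 1.0)))


# Synthetic stand-in data (an assumption, not one of the paper's datasets).
rng = np.random.default_rng(0)
n, d, k = 200, 784, 300
X = rng.standard_normal((n, d))

# Baseline: Gaussian random projection scaled to preserve norms in expectation.
P = rng.standard_normal((k, d)) / np.sqrt(k)

delta = isometry_distortion(X, P)
print(f"empirical distortion for k={k}: {delta:.3f}")
```

A data-aware method in the sense of the abstract would choose P from the dataset itself, aiming for a smaller target dimension k at the same distortion; the same `isometry_distortion` check could be applied to any such P.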
Pages: 31 - 40
Number of pages: 10
Related papers
10 in total
  • [1] NEAR-ISOMETRIC LINEAR EMBEDDINGS OF MANIFOLDS
    Hegde, Chinmay
    Sankaranarayanan, Aswin C.
    Baraniuk, Richard G.
    2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 728 - 731
  • [2] NuMax: A Convex Approach for Learning Near-Isometric Linear Embeddings
    Hegde, Chinmay
    Sankaranarayanan, Aswin C.
    Yin, Wotao
    Baraniuk, Richard G.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (22) : 6109 - 6121
  • [3] FEATURE EXTRACTION USING NEAR-ISOMETRIC LINEAR EMBEDDINGS FOR HYPERSPECTRAL IMAGERY CLASSIFICATION
    Sun, Weiwei
    Zhang, Liangpei
    Du, Bo
    2016 8TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2016,
  • [4] Near-Isometric Properties of Kronecker-Structured Random Tensor Embeddings
    Jiang, Qijia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Linear-Time Verification of Data-Aware Dynamic Systems with Arithmetic
    Felli, Paolo
    Montali, Marco
    Winkler, Sarah
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5642 - 5650
  • [6] Fast Synthetic Data-Aware Log Generation for Temporal Declarative Models
    Bergami, Giacomo
    PROCEEDINGS OF THE 6TH ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS AND NETWORK DATA ANALYTICS, GRADES-NDA 2023, 2023,
  • [7] A Sparse and Low-Rank Near-Isometric Linear Embedding Method for Feature Extraction in Hyperspectral Imagery Classification
    Sun, Weiwei
    Yang, Gang
    Du, Bo
    Zhang, Lefei
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (07) : 4032 - 4046
  • [8] Data-Aware Resource Allocation of Linear Pipeline Applications in a Distributed Environment
    Stavrinides, Georgios L.
    Karatza, Helen D.
    2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 121 - 126
  • [9] Linear-Time Verification of Data-Aware Processes Modulo Theories via Covers and Automata
    Gianola, Alessandro
    Montali, Marco
    Winkler, Sarah
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9, 2024, : 10525 - 10534
  • [10] SRAM Cell with Data-Aware Power-Gating Write-Assist for Near-Threshold Operation
    Oh, Tae Woo
    Jung, Seong-Ook
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,