A Scalable Unsupervised Feature Merging Approach to Efficient Dimensionality Reduction of High-dimensional Visual Data

被引:6
|
作者
Liu, Lingqiao [1 ]
Wang, Lei [2 ]
机构
[1] Australian Natl Univ, CECS, Canberra, ACT 0200, Australia
[2] Univ Wollongong, Sch Comp Sci & Software Engn, Wollongong, NSW 2522, Australia
关键词
D O I
10.1109/ICCV.2013.374
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To achieve a good trade-off between recognition accuracy and computational efficiency, it is often needed to reduce high-dimensional visual data to medium-dimensional ones. For this task, even applying a simple full-matrix-based linear projection causes significant computation and memory use. When the number of visual data is large, how to efficiently learn such a projection could even become a problem. The recent feature merging approach offers an efficient way to reduce the dimensionality, which only requires a single scan of features to perform reduction. However, existing merging algorithms do not scale well with high-dimensional data, especially in the unsupervised case. To address this problem, we formulate unsupervised feature merging as a PCA problem imposed with a special structure constraint. By exploiting its connection with k-means, we transform this constrained PCA problem into a feature clustering problem. Moreover, we employ the hashing technique to improve its scalability. These produce a scalable feature merging algorithm for our dimensionality reduction task. In addition, we develop an extension of this method by leveraging the neighborhood structure in the data to further improve dimensionality reduction performance. In further, we explore the incorporation of bipolar merging - a variant of merging function which allows the subtraction operation - into our algorithms. Through three applications in visual recognition, we demonstrate that our methods can not only achieve good dimensionality reduction performance with little computational cost but also help to create more powerful representation at both image level and local feature level.
引用
收藏
页码:3008 / 3015
页数:8
相关论文
共 50 条
  • [31] A sparse grid based method for generative dimensionality reduction of high-dimensional data
    Bohn, Bastian
    Garcke, Jochen
    Griebel, Michael
    JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 : 1 - 17
  • [32] SeekAView: An Intelligent Dimensionality Reduction Strategy for Navigating High-Dimensional Data Spaces
    Krause, Josua
    Dasgupta, Aritra
    Fekete, Jean-Daniel
    Bertini, Enrico
    2016 IEEE 6TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2016, : 11 - 19
  • [33] Effective Data Dimensionality Reduction Workflow for High-Dimensional Gene Expression Datasets
    Das, Utsha
    Srizon, Azmain Yakin
    Hasan, Md Al Mehedi
    Rahman, Julia
    Ben Islam, Md Khaled
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 182 - 185
  • [34] Recent Dimensionality Reduction Techniques for High-Dimensional COVID-19 Data
    Dallas, Ioannis L.
    Vrahatis, Aristidis G.
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, CIBB 2021, 2022, 13483 : 227 - 241
  • [35] Distance-preserving projection of high-dimensional data for nonlinear dimensionality reduction
    Yang, L
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1243 - 1246
  • [36] Comparing and Exploring High-Dimensional Data with Dimensionality Reduction Algorithms and Matrix Visualizations
    Cutura, Rene
    Aupetit, Michael
    Fekete, Jean-Daniel
    Sedlmair, Michael
    PROCEEDINGS OF THE WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES AVI 2020, 2020,
  • [37] Scalable High-Dimensional Multivariate Linear Regression for Feature-Distributed Data
    Huang, Shuo-Chieh
    Tsay, Ruey S.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [38] Semi-supervised dimensionality reduction for analyzing high-dimensional data with constraints
    Yan, Su
    Bouaziz, Sofien
    Lee, Dongwon
    Barlow, Jesse
    NEUROCOMPUTING, 2012, 76 (01) : 114 - 124
  • [39] A Light Causal Feature Selection Approach to High-Dimensional Data
    Ling, Zhaolong
    Li, Ying
    Zhang, Yiwen
    Yu, Kui
    Zhou, Peng
    Li, Bo
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7639 - 7650
  • [40] Multistage feature selection approach for high-dimensional cancer data
    Alkuhlani, Alhasan
    Nassef, Mohammad
    Farag, Ibrahim
    SOFT COMPUTING, 2017, 21 (22) : 6895 - 6906