SRDA: An efficient algorithm for large-scale discriminant analysis

被引:327
|
作者
Cai, Deng
He, Xiaofei
Han, Jiawei
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Yahoo, Burbank, CA 91504 USA
基金
美国国家科学基金会;
关键词
linear discriminant analysis; spectral regression; dimensionality reduction;
D O I
10.1109/TKDE.2007.190669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linear Discriminant Analysis (LDA) has been a popular method for extracting features that preserves class separability. The projection functions of LDA are commonly obtained by maximizing the between-class covariance and simultaneously minimizing the within-class covariance. It has been widely used in many fields of information processing, such as machine learning, data mining, information retrieval, and pattern recognition. However, the computation of LDA involves dense matrices eigendecomposition, which can be computationally expensive in both time and memory. Specifically, LDA has O(mnt + t(3) )time complexity and requires O(mn + mt + nt) memory, where m is the number of samples, n is the number of features, and t = min(m,n). When both m and n are large, it is infeasible to apply LDA. In this paper, we propose a novel algorithm for discriminant analysis, called Spectral Regression Discriminant Analysis (SRDA). By using spectral graph analysis, SRDA casts discriminant analysis into a regression framework that facilitates both efficient computation and the use of regularization techniques. Specifically, SRDA only needs to solve a set of regularized least squares problems, and there is no eigenvector computation involved, which is a huge save of both time and memory. Our theoretical analysis shows that SRDA can be computed with O(ms) time and O(ms) memory, where s(<= n) n is the average number of nonzero features in each sample. Extensive experimental results on four real-world data sets demonstrate the effectiveness and efficiency of our algorithm.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [41] An Efficient Distributed Algorithm for Resource Allocation in Large-Scale Coupled Systems
    Niu, Di
    Li, Baochun
    2013 PROCEEDINGS IEEE INFOCOM, 2013, : 1501 - 1509
  • [42] An efficient dynamic load-balancing algorithm in a large-scale cluster
    Zhang, BY
    Mo, ZY
    Yang, GW
    Zheng, WM
    DISTRIBUTED AND PARALLEL COMPUTING, 2005, 3719 : 174 - 183
  • [43] DSolving: a novel and efficient intelligent algorithm for large-scale sliding puzzles
    Wang, GuiPing
    Li, Ren
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (04) : 809 - 822
  • [44] DSS: A Scalable and Efficient Stratified Sampling Algorithm for Large-Scale Datasets
    Li, Minne
    Li, Dongsheng
    Shen, Siqi
    Zhang, Zhaoning
    Lu, Xicheng
    NETWORK AND PARALLEL COMPUTING, 2016, 9966 : 133 - 146
  • [45] Efficient implementation of an active set algorithm for large-scale portfolio selection
    Stein, Michael
    Branke, Juergen
    Schmeck, Hartmut
    COMPUTERS & OPERATIONS RESEARCH, 2008, 35 (12) : 3945 - 3961
  • [46] RBFxSOM: An Efficient Algorithm for Large-Scale Multi-System Learning
    Ohkubo, Takashi
    Tokunaga, Kazuhiro
    Furukawa, Tetsuo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (07) : 1388 - 1396
  • [47] EFFICIENT ALGORITHM FOR DETERMINING CONTROLLABILITY AND OBSERVABILITY OF LARGE-SCALE COMPOSITE SYSTEMS
    SINHA, NK
    ELSHERIEF, H
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1978, 9 (11) : 1295 - 1302
  • [48] A breakdown-free block conjugate gradient method for large-scale discriminant analysis
    Shi, Wenya
    Chen, Zhixiang
    AIMS MATHEMATICS, 2024, 9 (07): : 18777 - 18795
  • [49] Theoretical analysis of the DAMAS algorithm and efficient implementation of the covariance matrix fitting method for large-scale problems
    Chardon, Gilles
    Picheral, Jose
    Ollivier, Francois
    JOURNAL OF SOUND AND VIBRATION, 2021, 508
  • [50] Parallel sensitivity analysis for efficient large-scale dynamic optimization
    Hartwich, Arndt
    Stockmann, Klaus
    Terboven, Christian
    Feuerriegel, Stefan
    Marquardt, Wolfgang
    OPTIMIZATION AND ENGINEERING, 2011, 12 (04) : 489 - 508