Fast Unsupervised Projection for Large-Scale Data

被引:14
|
作者
Wang, Jingyu [1 ]
Wang, Lin [2 ]
Nie, Feiping [3 ]
Li, Xuelong [3 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Dimensionality reduction (DR); orthogonality; representative points; subspace projection; unsupervised learning; LINEAR DISCRIMINANT-ANALYSIS; DIMENSIONALITY REDUCTION;
D O I
10.1109/TNNLS.2021.3053840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction (DR) technique has been frequently used to alleviate information redundancy and reduce computational complexity. Traditional DR methods generally are inability to deal with nonlinear data and have high computational complexity. To cope with the problems, we propose a fast unsupervised projection (FUP) method. The simplified graph of FUP is constructed by samples and representative points, where the number of the representative points selected through iterative optimization is less than that of samples. By generating the presented graph, it is proved that large-scale data can be projected faster in numerous scenarios. Thereafter, the orthogonality FUP (OFUP) method is proposed to ensure the orthogonality of projection matrix. Specifically, the OFUP method is proved to be equivalent to PCA upon certain parameter setting. Experimental results on benchmark data sets show the effectiveness in retaining the essential information.
引用
收藏
页码:3634 / 3644
页数:11
相关论文
共 50 条
  • [1] Fast Plagiarism Detection in Large-Scale Data
    Szmit, Radoslaw
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: TOWARDS EFFICIENT SOLUTIONS FOR DATA ANALYSIS AND KNOWLEDGE REPRESENTATION, 2017, 716 : 329 - 343
  • [2] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Pandarachalil, Rafeeque
    Sendhilkumar, Selvaraju
    Mahalakshmi, G. S.
    [J]. COGNITIVE COMPUTATION, 2015, 7 (02) : 254 - 262
  • [3] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Rafeeque Pandarachalil
    Selvaraju Sendhilkumar
    G. S. Mahalakshmi
    [J]. Cognitive Computation, 2015, 7 : 254 - 262
  • [4] A Fast Retrieval Algorithm for Large-Scale XML Data
    Tanioka, Hiroki
    [J]. FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 129 - 137
  • [5] Large-Scale Unsupervised Semantic Segmentation
    Gao, Shanghua
    Li, Zhong-Yu
    Yang, Ming-Hsuan
    Cheng, Ming-Ming
    Han, Junwei
    Torr, Philip
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7457 - 7476
  • [6] Large-Scale Unsupervised Object Discovery
    Vo, Huy V.
    Sizikova, Elena
    Schmid, Cordelia
    Perez, Patrick
    Ponce, Jean
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Probabilistic unsupervised classification for large-scale analysis of spectral imaging data
    Paradis, Emmanuel
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 107
  • [8] Fast Semisupervised Learning With Bipartite Graph for Large-Scale Data
    He, Fang
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    Jia, Weimin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 626 - 638
  • [9] Dynamic and fast processing of queries on large-scale RDF data
    Yuan, Pingpeng
    Xie, Changfeng
    Jin, Hai
    Liu, Ling
    Yang, Guang
    Shi, Xuanhua
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 311 - 334
  • [10] Fast Chinese calligraphic character recognition with large-scale data
    Gao Pengcheng
    Wu Jiangqin
    Lin Yuan
    Xia Yang
    Mao Tianjiao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (17) : 7221 - 7238