An Active Learning Based LDA Algorithm for Large-Scale Data Classification

被引:0
|
作者
Yu X. [1 ]
Zhou Y.-P. [1 ]
Ren C.-N. [1 ]
机构
[1] School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao
来源
Yu, Xu (yuxu0532@163.com) | 1600年 / Science and Engineering Research Support Society卷 / 09期
基金
中国国家自然科学基金;
关键词
Active learning; Large scale data set; Linear Discriminant Analysis; The MNIST data set;
D O I
10.14257/ijdta.2016.9.11.03
中图分类号
学科分类号
摘要
As traditional Linear Discriminant Analysis algorithm runs slowly in large data set, this paper proposed a fast LDA algorithm based on active learning. In the proposed algorithm, the original training set is divided into three parts, i.e. initial training set, correction set and testing set. Secondly, LDA algorithm is running on the initial training set, and the projection vector can be obtained. Thirdly, we select from correction set the samples whose projection is farthest from the mean vector, add them into the initial training set and compute the projection vector again. Repeat this step until the classification precision attains the expected target or the correction set is empty. The simulation experiments on the UCI data set and the MNIST data set show that the proposed algorithm running fast on large data set, and has a good classification precision. © 2016 SERSC.
引用
收藏
页码:29 / 36
页数:7
相关论文
共 50 条
  • [41] Learning Fused Representations for Large-Scale Multimodal Classification
    Nawaz, Shah
    Calefati, Alessandro
    Janjua, Muhammad Kamran
    Anwaar, Muhammad Umer
    Gallo, Ignazio
    IEEE SENSORS LETTERS, 2019, 3 (01)
  • [42] Quick extreme learning machine for large-scale classification
    Albtoush, Audi
    Fernandez-Delgado, Manuel
    Cernadas, Eva
    Barro, Senen
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (08): : 5923 - 5938
  • [43] Continuous Learning for Large-scale Personalized Domain Classification
    Li, Han
    Lee, Jihwan
    Mudgal, Sidharth
    Sarikaya, Ruhi
    Kim, Young-Bum
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3784 - 3794
  • [44] Deep Learning-Based Large-Scale Automatic Satellite Crosswalk Classification
    Berriel, Rodrigo F.
    Lopes, Andre Teixeira
    de Souza, Alberto F.
    Oliveira-Santos, Thiago
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (09) : 1513 - 1517
  • [45] Linear Regression-Based Efficient SVM Learning for Large-Scale Classification
    Wu, Jianxin
    Yang, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2357 - 2369
  • [46] Learning From Large-Scale Noisy Web Data With Ubiquitous Reweighting for Image Classification
    Li, Jia
    Song, Yafei
    Zhu, Jianfeng
    Cheng, Lele
    Su, Ying
    Ye, Lin
    Yuan, Pengcheng
    Han, Shumin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1808 - 1814
  • [47] ALLIE: Active Learning on Large-scale Imbalanced Graphs
    Cui, Limeng
    Tang, Xianfeng
    Katariya, Sumeet
    Rao, Nikhil
    Agrawal, Pallav
    Subbian, Karthik
    Lee, Dongwon
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 690 - 698
  • [48] An Active Learning Method Based on Mistake Sampling for Large Scale Imbalanced Classification
    Guo, Jia
    Wan, Xin
    Lin, Hao
    Li, Peng
    Liu, Guannan
    He, Yueying
    2017 14TH INTERNATIONAL CONFERENCE ON SERVICES SYSTEMS AND SERVICES MANAGEMENT (ICSSSM), 2017,
  • [49] SU-SAMPLING BASED ACTIVE LEARNING FOR LARGE-SCALE HISTOPATHOLOGY IMAGE
    Shen, Yiqing
    Ke, Jing
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 116 - 120
  • [50] Large-scale microarray data based feature selection for improved molecular classification
    Lu, Liangqun
    Daigle, Bernie J., Jr.
    BMC BIOINFORMATICS, 2017, 18