An Active Learning Based LDA Algorithm for Large-Scale Data Classification

被引:0
|
作者
Yu X. [1 ]
Zhou Y.-P. [1 ]
Ren C.-N. [1 ]
机构
[1] School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao
来源
Yu, Xu (yuxu0532@163.com) | 1600年 / Science and Engineering Research Support Society卷 / 09期
基金
中国国家自然科学基金;
关键词
Active learning; Large scale data set; Linear Discriminant Analysis; The MNIST data set;
D O I
10.14257/ijdta.2016.9.11.03
中图分类号
学科分类号
摘要
As traditional Linear Discriminant Analysis algorithm runs slowly in large data set, this paper proposed a fast LDA algorithm based on active learning. In the proposed algorithm, the original training set is divided into three parts, i.e. initial training set, correction set and testing set. Secondly, LDA algorithm is running on the initial training set, and the projection vector can be obtained. Thirdly, we select from correction set the samples whose projection is farthest from the mean vector, add them into the initial training set and compute the projection vector again. Repeat this step until the classification precision attains the expected target or the correction set is empty. The simulation experiments on the UCI data set and the MNIST data set show that the proposed algorithm running fast on large data set, and has a good classification precision. © 2016 SERSC.
引用
收藏
页码:29 / 36
页数:7
相关论文
共 50 条
  • [31] Large-scale network intrusion detection based on distributed learning algorithm
    Daxin Tian
    Yanheng Liu
    Yang Xiang
    International Journal of Information Security, 2009, 8 : 25 - 35
  • [32] An online conjugate gradient algorithm for large-scale data analysis in machine learning
    Xue, Wei
    Wan, Pengcheng
    Li, Qiao
    Zhong, Ping
    Yu, Gaohang
    Tao, Tao
    AIMS MATHEMATICS, 2021, 6 (02): : 1515 - 1537
  • [33] A fast algorithm for learning a ranking function from large-scale data sets
    Raykar, Vikas C.
    Duraiswami, Ramani
    Krishnapuram, Balaji
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (07) : 1158 - 1170
  • [34] Momentum Online LDA for Large-scale Datasets
    Ouyang, Jihong
    Lu, You
    Li, Ximing
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 1075 - 1076
  • [35] CLUSTERING LARGE-SCALE DATA BASED ON MODIFIED AFFINITY PROPAGATION ALGORITHM
    Serdah, Ahmed M.
    Ashour, Wesam M.
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (01) : 23 - 33
  • [36] Fuzzy clustering algorithm based on multiple medoids for large-scale data
    Chen A.-G.
    Wang S.-T.
    Kongzhi yu Juece/Control and Decision, 2016, 31 (12): : 2122 - 2130
  • [37] A QUICK AND FEATURE BASED VISUALIZATION ALGORITHM FOR LARGE-SCALE FLOW DATA
    Zhong Liang
    Chi Tian He
    Zhang Xin
    2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 2592 - 2595
  • [38] Quick extreme learning machine for large-scale classification
    Audi Albtoush
    Manuel Fernández-Delgado
    Eva Cernadas
    Senén Barro
    Neural Computing and Applications, 2022, 34 : 5923 - 5938
  • [39] Large-scale machine learning for metagenomics sequence classification
    Vervier, Kevin
    Mahe, Pierre
    Tournoud, Maud
    Veyrieras, Jean-Baptiste
    Vert, Jean-Philippe
    BIOINFORMATICS, 2016, 32 (07) : 1023 - 1032
  • [40] Good Practice in Large-Scale Learning for Image Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 507 - 520