An Active Learning Based LDA Algorithm for Large-Scale Data Classification

被引:0
|
作者
Yu X. [1 ]
Zhou Y.-P. [1 ]
Ren C.-N. [1 ]
机构
[1] School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao
来源
Yu, Xu (yuxu0532@163.com) | 1600年 / Science and Engineering Research Support Society卷 / 09期
基金
中国国家自然科学基金;
关键词
Active learning; Large scale data set; Linear Discriminant Analysis; The MNIST data set;
D O I
10.14257/ijdta.2016.9.11.03
中图分类号
学科分类号
摘要
As traditional Linear Discriminant Analysis algorithm runs slowly in large data set, this paper proposed a fast LDA algorithm based on active learning. In the proposed algorithm, the original training set is divided into three parts, i.e. initial training set, correction set and testing set. Secondly, LDA algorithm is running on the initial training set, and the projection vector can be obtained. Thirdly, we select from correction set the samples whose projection is farthest from the mean vector, add them into the initial training set and compute the projection vector again. Repeat this step until the classification precision attains the expected target or the correction set is empty. The simulation experiments on the UCI data set and the MNIST data set show that the proposed algorithm running fast on large data set, and has a good classification precision. © 2016 SERSC.
引用
收藏
页码:29 / 36
页数:7
相关论文
共 50 条
  • [21] Large-scale Landsat image classification based on deep learning methods
    Zhao, Xuemei
    Gao, Lianru
    Chen, Zhengchao
    Zhang, Bing
    Liao, Wenzhi
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2019, 8
  • [22] Extreme Learning Machine for large-scale graph classification based on MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Yuan, Ye
    Wang, Guoren
    Chen, Lei
    NEUROCOMPUTING, 2017, 261 : 106 - 114
  • [23] Distributed learning strategy based on chips for classification with large-scale dataset
    Yang, Bo
    Su, Xiaohong
    Wang, Yadong
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (05) : 899 - 920
  • [24] Active Learning for Large-Scale Entity Resolution
    Qian, Kun
    Popa, Lucian
    Sen, Prithviraj
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1379 - 1388
  • [25] Active disks for large-scale data processing
    Riedel, E
    Faloutsos, C
    Gibson, GA
    Nagle, D
    COMPUTER, 2001, 34 (06) : 68 - +
  • [26] A classification based framework for quantitative description of large-scale microarray data
    Sangurdekar, Dipen P.
    Srienc, Friedrich
    Khodursky, Arkady B.
    GENOME BIOLOGY, 2006, 7 (04)
  • [27] A classification based framework for quantitative description of large-scale microarray data
    Dipen P Sangurdekar
    Friedrich Srienc
    Arkady B Khodursky
    Genome Biology, 7
  • [28] A Reinforcement Learning Based Large-Scale Refinery Production Scheduling Algorithm
    Chen, Yuandong
    Ding, Jinliang
    Chen, Qingda
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) : 6041 - 6055
  • [29] Large-scale network intrusion detection algorithm based on distributed learning
    College of Computer Science and Technology, Jilin University, Changchun 130012, China
    不详
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (04): : 993 - 1003
  • [30] Large-scale network intrusion detection based on distributed learning algorithm
    Tian, Daxin
    Liu, Yanheng
    Xiang, Yang
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2009, 8 (01) : 25 - 35