An Active Learning Based LDA Algorithm for Large-Scale Data Classification

被引:0
|
作者
Yu X. [1 ]
Zhou Y.-P. [1 ]
Ren C.-N. [1 ]
机构
[1] School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao
来源
Yu, Xu (yuxu0532@163.com) | 1600年 / Science and Engineering Research Support Society卷 / 09期
基金
中国国家自然科学基金;
关键词
Active learning; Large scale data set; Linear Discriminant Analysis; The MNIST data set;
D O I
10.14257/ijdta.2016.9.11.03
中图分类号
学科分类号
摘要
As traditional Linear Discriminant Analysis algorithm runs slowly in large data set, this paper proposed a fast LDA algorithm based on active learning. In the proposed algorithm, the original training set is divided into three parts, i.e. initial training set, correction set and testing set. Secondly, LDA algorithm is running on the initial training set, and the projection vector can be obtained. Thirdly, we select from correction set the samples whose projection is farthest from the mean vector, add them into the initial training set and compute the projection vector again. Repeat this step until the classification precision attains the expected target or the correction set is empty. The simulation experiments on the UCI data set and the MNIST data set show that the proposed algorithm running fast on large data set, and has a good classification precision. © 2016 SERSC.
引用
收藏
页码:29 / 36
页数:7
相关论文
共 50 条
  • [1] Large-Scale Image Classification Using Active Learning
    Alajlan, Naif
    Pasolli, Edoardo
    Melgani, Farid
    Franzoso, Andrea
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (01) : 259 - 263
  • [2] Large-scale data classification method based on machine learning model
    Department of Electrical Engineering, Dalian Institute of Science and Technology, Dalian, China
    Int. J. Database Theory Appl., 2 (71-80):
  • [3] A Deep Multiview Active Learning for Large-Scale Image Classification
    Yao, Tuozhong
    Wang, Wenfeng
    Gu, Yuhong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [4] Kernel Logistic Regression Algorithm for Large-Scale Data Classification
    Elbashir, Murtada
    Wang, Jianxin
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (05) : 465 - 472
  • [5] A Fast Distributed Classification Algorithm for Large-scale Imbalanced Data
    Wang, Huihui
    Gao, Yang
    Shi, Yinghuan
    Wang, Hao
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1251 - 1256
  • [6] EFFICIENT CLASSIFICATION FOR LARGE-SCALE PROBLEMS BY MULTIPLE LDA SUBSPACES
    Uray, Martina
    Roth, Peter M.
    Bischof, Horst
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2009, : 299 - 306
  • [7] A large-scale lychee image parallel classification algorithm based on spark and deep learning
    Xiao, Yiming
    Wang, Jianhua
    Xiong, Hongyi
    Xiao, Fangjun
    Huang, Renhuan
    Hong, Licong
    Wu, Bofei
    Zhou, Jinfeng
    Long, Yongbin
    Lan, Yubin
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 230
  • [8] Deep learning based data augmentation for large-scale mineral image recognition and classification
    Liu, Yang
    Wang, Xueyi
    Zhang, Zelin
    Deng, Fang
    MINERALS ENGINEERING, 2023, 204
  • [9] Hierarchical Classification for Large-Scale Learning
    Wang, Boshi
    Barbu, Adrian
    ELECTRONICS, 2023, 12 (22)
  • [10] Large-scale data classification based on the integrated fusion of fuzzy learning and graph neural network
    Snasel, Vaclav
    Stepnicka, Martin
    Ojha, Varun
    Suganthan, Ponnuthurai Nagaratnam
    Gao, Ruobin
    Kong, Lingping
    INFORMATION FUSION, 2024, 102