A Serial Sample Selection Framework for Active Learning

Cited by: 0
Authors
Li, Chengchao [1]
Zhao, Pengpeng [1]
Wu, Jian [1]
Xu, Haihui [1]
Cui, Zhiming [1]
Affiliation
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Keywords
Data Mining; Active Learning; Sampling Strategy; Uncertainty; Representativeness;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Active learning is a machine learning and data mining technique that selects the most informative samples for labeling and uses them as training data. It aims to obtain a high-performance classifier while labeling as few samples as possible from a large pool of unlabeled data, so the sampling strategy is the core issue. Existing approaches either ignore the information in unlabeled data and are therefore prone to querying outliers or noisy samples, or evaluate large numbers of non-informative samples and incur significant computational cost. To address these problems, this paper proposes a serial active learning framework. It first measures the uncertainty of the unlabeled samples and selects the most uncertain sample set. From this set, it then derives the most representative sample set based on the mutual information criterion. Finally, the framework selects the most informative sample from the most representative sample set using an expected error reduction strategy. Experimental results on multiple datasets show that the approach outperforms random sampling and a state-of-the-art adaptive active learning method.
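The three serial stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the sketch assumes prediction entropy as the uncertainty measure, substitutes a simple mean-similarity density score for the mutual information criterion, and uses a toy nearest-centroid classifier for the expected error reduction step. All function names, parameters (`k_uncertain`, `k_representative`), and data below are hypothetical.

```python
import math

def centroids(X, y):
    """Toy classifier (assumption): one mean vector per class."""
    groups = {}
    for x, label in zip(X, y):
        groups.setdefault(label, []).append(x)
    return {c: [sum(col) / len(pts) for col in zip(*pts)]
            for c, pts in groups.items()}

def predict_proba(model, x):
    """Softmax over negative squared distances to each class centroid."""
    scores = {c: -sum((a - b) ** 2 for a, b in zip(x, m))
              for c, m in model.items()}
    top = max(scores.values())
    exp = {c: math.exp(s - top) for c, s in scores.items()}
    total = sum(exp.values())
    return {c: e / total for c, e in exp.items()}

def entropy(p):
    """Shannon entropy of a class-probability dict."""
    return -sum(v * math.log(v) for v in p.values() if v > 0)

def similarity(a, b):
    """Gaussian similarity between two points."""
    return math.exp(-sum((u - v) ** 2 for u, v in zip(a, b)))

def select_query(X_lab, y_lab, X_pool, k_uncertain=4, k_representative=2):
    """Return the index in X_pool of the sample to label next."""
    model = centroids(X_lab, y_lab)

    # Stage 1: keep the k_uncertain pool samples with the highest entropy.
    uncertain = sorted(
        range(len(X_pool)),
        key=lambda i: entropy(predict_proba(model, X_pool[i])),
        reverse=True)[:k_uncertain]

    # Stage 2: keep the most representative of those. Mean similarity to
    # the pool is a stand-in here, NOT the paper's mutual information.
    def density(i):
        return sum(similarity(X_pool[i], x) for x in X_pool) / len(X_pool)
    representative = sorted(uncertain, key=density,
                            reverse=True)[:k_representative]

    # Stage 3: expected error reduction. For each candidate, retrain under
    # every possible label, weight by the current label probability, and
    # score the total prediction entropy left on the pool.
    def expected_error(i):
        risk = 0.0
        for label, p in predict_proba(model, X_pool[i]).items():
            retrained = centroids(X_lab + [X_pool[i]], y_lab + [label])
            risk += p * sum(entropy(predict_proba(retrained, x))
                            for x in X_pool)
        return risk
    return min(representative, key=expected_error)

# Hypothetical data: two labeled points, and a pool with a cluster near
# the decision boundary plus one equally uncertain outlier at index 3.
X_lab, y_lab = [[0.0, 0.0], [4.0, 0.0]], [0, 1]
X_pool = [[2.0, 0.0], [2.1, 0.1], [1.9, -0.1],
          [2.0, 5.0], [0.1, 0.0], [3.9, 0.0]]
query = select_query(X_lab, y_lab, X_pool)
print("query index:", query)
```

Because the expensive retraining in stage 3 runs only on the few candidates that survive stages 1 and 2, the serial ordering keeps the cost low, and the representativeness filter is what discards the isolated outlier even though it is as uncertain as the boundary cluster.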
Pages: 435-446
Page count: 12
Related papers
50 in total
  • [1] A serial sample selection framework for active learning
    Li, Chengchao
    Zhao, Pengpeng
    Wu, Jian
    Xu, Haihui
    Cui, Zhiming
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8933 : 435 - 446
  • [2] Learning to Sample: an Active Learning Framework
    Shao, Jingyu
    Wang, Qing
    Liu, Fangbing
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 538 - 547
  • [3] ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy
    Pan, Longfei
    Wang, Xiaojun
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 533 - 538
  • [4] Active learning sample selection - based on multicriteria
    He, Zhonghai
    Shen, Kun
    Zhang, Xiaofang
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2023, 31 (06) : 289 - 297
  • [5] A NEW METHOD FOR SAMPLE SELECTION IN ACTIVE LEARNING
    Chen, Wei
    Liu, Gang
    Guo, Jun
    Guo, Yu-Jing
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 2270 - 2274
  • [6] An improved sample selection framework for learning with noisy labels
    Zhang, Qian
    Zhu, Yi
    Yang, Ming
    Jin, Ge
    Zhu, Yingwen
    Lu, Yanjun
    Zou, Yu
    Chen, Qiu
    PLOS ONE, 2024, 19 (12):
  • [7] UNSUPERVISED SAMPLE SELECTION FOR ACTIVE LEARNING WITH QUADRATIC PROGRAMMING
    Wang, Yunbin
    Song, Na
    Wang, Shiping
    Journal of Applied and Numerical Optimization, 2024, 6 (03) : 339 - 350
  • [8] Sample Selection based Active Learning for Imbalanced Data
    Chairi, Ikram
    Alaoui, Souad
    Lyhyaoui, Abdelouahid
    10TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS SITIS 2014, 2014, : 645 - 651
  • [9] Active partial label learning based on adaptive sample selection
    Yan Li
    Chang Liu
    Suyun Zhao
    Qiang Hua
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 1603 - 1617
  • [10] Active partial label learning based on adaptive sample selection
    Li, Yan
    Liu, Chang
    Zhao, Suyun
    Hua, Qiang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (06) : 1603 - 1617