Optimised probabilistic active learning (OPAL): For fast, non-myopic, cost-sensitive active classification

Authors
Georg Krempl
Daniel Kottke
Vincent Lemaire
Affiliations
[1] University Magdeburg, KMD Lab
[2] Orange Labs
Source
Machine Learning | 2015 / Volume 100, Issue 2-3
Keywords
Active learning; Non-myopic; Cost-sensitive; Unequal misclassification costs; Misclassification loss; Imbalanced data; Uncertainty sampling; Error reduction
DOI
Not available
Abstract
In contrast to ever-increasing volumes of automatically generated data, human annotation capacities remain limited. Thus, fast active learning approaches that allow the efficient allocation of annotation efforts gain in importance. Furthermore, cost-sensitive applications such as fraud detection pose the additional challenge of differing misclassification costs between classes. Unfortunately, the few existing cost-sensitive active learning approaches rely on time-consuming steps, such as performing self-labelling or tedious evaluations over samples. We propose a fast, non-myopic, and cost-sensitive probabilistic active learning approach for binary classification. Our approach computes the expected reduction in misclassification loss in a labelling candidate’s neighbourhood. We derive and use a closed-form solution for this expectation, which considers the possible values of the true posterior of the positive class at the candidate’s position, its possible label realisations, and the given labelling budget. The resulting myopic algorithm runs in the same linear asymptotic time as uncertainty sampling, while its non-myopic counterpart requires an additional factor of O(m · log m) in the budget size. The experimental evaluation on several synthetic and real-world data sets shows competitive or better classification performance and runtime, compared to several uncertainty sampling- and error-reduction-based active learning strategies, both in cost-sensitive and cost-insensitive settings.
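The expectation described in the abstract can be illustrated with a small sketch. It is not the paper's closed-form solution; it is a minimal illustration under stated assumptions: the candidate's neighbourhood is summarised by a label count `n` and an observed positive fraction `p_hat`, the true posterior follows a Beta distribution with a uniform prior, the classifier thresholds the observed fraction at `c_fp / (c_fp + c_fn)`, and `m` further labels are drawn from the same neighbourhood. All function and parameter names are hypothetical.

```python
import math


def log_beta(x, y):
    """Logarithm of the Beta function B(x, y)."""
    return math.lgamma(x) + math.lgamma(y) - math.lgamma(x + y)


def beta_binomial_pmf(k, m, a, b):
    """Probability of k positives among m labels when p ~ Beta(a, b)."""
    return math.comb(m, k) * math.exp(log_beta(a + k, b + m - k) - log_beta(a, b))


def decision_loss(p_hat, p_true, c_fp, c_fn):
    """Cost-weighted loss of the decision implied by p_hat, evaluated at p_true."""
    threshold = c_fp / (c_fp + c_fn)   # assumed cost-sensitive decision threshold
    if p_hat > threshold:              # predict positive: pay for false positives
        return c_fp * (1.0 - p_true)
    return c_fn * p_true               # predict negative: pay for false negatives


def expected_loss_reduction(n, p_hat, m, c_fp=1.0, c_fn=1.0):
    """Expected loss reduction from acquiring m labels (requires n + m > 0)."""
    # Assumed Beta posterior over the true positive-class probability.
    a = n * p_hat + 1.0
    b = n * (1.0 - p_hat) + 1.0
    # The loss is linear in p, so plugging in the posterior mean is exact.
    loss_now = decision_loss(p_hat, a / (a + b), c_fp, c_fn)
    loss_after = 0.0
    for k in range(m + 1):             # possible label realisations
        p_hat_new = (n * p_hat + k) / (n + m)   # updated observed fraction
        p_mean_new = (a + k) / (a + b + m)      # posterior mean of p given k
        weight = beta_binomial_pmf(k, m, a, b)
        loss_after += weight * decision_loss(p_hat_new, p_mean_new, c_fp, c_fn)
    return loss_now - loss_after


print(expected_loss_reduction(n=0, p_hat=0.5, m=1))
print(expected_loss_reduction(n=100, p_hat=0.9, m=1))
```

In this sketch an unlabelled neighbourhood yields a positive expected reduction, while a neighbourhood whose decision cannot be flipped by the additional labels yields a reduction of essentially zero. Because the sketch thresholds the observed fraction rather than the posterior mean, the value is not guaranteed to be non-negative when costs are unequal.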
Pages: 449-476
Number of pages: 27