Cost-Sensitive Active Learning for Incomplete Data

Cited by: 7
Authors
Wang, Min [1]
Yang, Chunyu [1]
Zhao, Fei [1]
Min, Fan [2]
Wang, Xizhao [3]
Affiliations
[1] Southwest Petr Univ, Coll Elect Engn & Informat, Chengdu 610500, Peoples R China
[2] Southwest Petr Univ, Sch Comp Sci, Chengdu 610500, Peoples R China
[3] Shenzhen Univ, Inst Big Data, Shenzhen 518060, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Costs; Data models; Heuristic algorithms; Semisupervised learning; Labeling; Training; Task analysis; Active learning; cost sensitive; incomplete data; unified evaluation and dynamic selection; MISSING VALUES; CLASSIFICATION; REGRESSION; ENSEMBLE;
DOI
10.1109/TSMC.2022.3182122
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Practical data often suffer from missing attribute values and lack of class labels. A reasonable machine learning scenario involves obtaining certain values and labels at cost on request. In this article, we propose the cost-sensitive active learning through unified evaluation and dynamic selection (CALS) algorithm to handle the learning task in this new scenario. For data representation, we consider misclassification cost, label query cost, and attribute query cost. For the cost/benefit estimation, we design a unified assessment of attribute values and labels with softmax regression. For the selection of attribute value and label, we propose an optimal acquisition scheme with permutation and greedy strategies. We perform experiments with synthetic, benchmark, and domain datasets. The results of the significance test verify the effectiveness of CALS and its superiority over cost-sensitive active learning and missing data imputation algorithms.
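The cost trade-off described in the abstract can be illustrated with a minimal sketch. It is not the CALS algorithm: the record gives no implementation details, so the function names, the cost-matrix layout, the benefit estimates, and the budget-based ratio greedy rule below are all assumptions chosen for illustration. It shows softmax class probabilities, the expected misclassification cost they imply, and a greedy pick among candidate label and attribute-value queries whose estimated benefit exceeds their query cost.

```python
import math


def softmax(scores):
    """Turn raw class scores into probabilities (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def expected_misclassification_cost(probs, cost_matrix):
    """Lowest expected cost over all possible predictions.

    Assumed layout: cost_matrix[i][j] is the cost of predicting class j
    when the true class is i.
    """
    n = len(probs)
    return min(
        sum(probs[i] * cost_matrix[i][j] for i in range(n))
        for j in range(n)
    )


def greedy_select(candidates, budget):
    """Hypothetical greedy acquisition rule (not taken from the paper).

    candidates: list of (name, query_cost, estimated_benefit), where the
    benefit is an estimated reduction in misclassification cost.
    Queries are visited in decreasing benefit/cost order; a query is
    bought only if it pays for itself and still fits in the budget.
    """
    chosen, spent = [], 0.0
    ranked = sorted(candidates, key=lambda c: c[2] / c[1], reverse=True)
    for name, cost, benefit in ranked:
        if benefit > cost and spent + cost <= budget:
            chosen.append(name)
            spent += cost
    return chosen, spent


if __name__ == "__main__":
    probs = softmax([0.0, 0.0])
    cost_matrix = [[0, 10], [2, 0]]
    print(expected_misclassification_cost(probs, cost_matrix))
    queries = [
        ("label:x1", 2.0, 5.0),
        ("attr:x2.a3", 1.0, 4.0),
        ("attr:x3.a1", 1.0, 0.5),
    ]
    print(greedy_select(queries, budget=3.0))
```

In this toy setup a label query and an attribute-value query are scored on the same scale (estimated cost reduction minus query cost), which is the general idea behind a unified evaluation; how the paper actually estimates benefits and orders acquisitions is not stated in this record.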
Pages: 405-416
Page count: 12
Related papers
50 in total
  • [1] Active Cost-Sensitive Learning
    Margineantu, Dragos D.
    [J]. 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1622 - 1623
  • [2] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [3] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [4] Active learning for cost-sensitive classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daumé III, Hal
    Langford, John
    [J]. Journal of Machine Learning Research, 2019, 20
  • [5] Learning cost-sensitive active classifiers
    Greiner, R
    Grove, AJ
    Roth, D
    [J]. ARTIFICIAL INTELLIGENCE, 2002, 139 (02) : 137 - 174
  • [6] Cost-sensitive classification with time constraint on incomplete data
    Lee, Yong-Shiuan
    Wu, Chia-Chi
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2024, 17 (03)
  • [7] A Cost-sensitive Active Learning for Imbalance Data with Uncertainty and Diversity Combination
    Dong, Huailong
    Zhu, Bowen
    Zhang, Jing
    [J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2020, : 218 - 224
  • [8] Cost-Sensitive Active Visual Category Learning
    Vijayanarasimhan, Sudheendra
    Grauman, Kristen
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 91 (01) : 24 - 44
  • [9] Cost-Sensitive Active Visual Category Learning
    Sudheendra Vijayanarasimhan
    Kristen Grauman
    [J]. International Journal of Computer Vision, 2011, 91 : 24 - 44
  • [10] Cost-sensitive learning for imbalanced data streams
    Loezer, Lucas
    Enembreck, Fabricio
    Barddal, Jean Paul
    Britto Jr, Alceu de Souza
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 498 - 504