Active learning for ordinal classification on incomplete data

被引:1
|
作者
He, Deniu [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing, Peoples R China
关键词
Active learning; Incomplete data; Ordinal classification; Imputation uncertainty; MULTIPLE IMPUTATION; REGRESSION; DENSITY; MIXTURE;
D O I
10.3233/IDA-226664
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing active learning algorithms typically assume that the data provided are complete. Nonetheless, data with missing values are common in real-world applications, and active learning on incomplete data is less studied. This paper studies the problem of active learning for ordinal classification on incomplete data. Although cutting-edge imputation methods can be used to impute the missing values before commencing active learning, inaccurately imputed instances are unavoidable and may degrade the ordinal classifier's performance once labeled. Therefore, the crucial question in this work is how to reduce the negative impact of imprecisely filled instances on active learning. First, to avoid selecting filled instances with high imputation imprecision, we propose penalizing the query selection with a novel imputation uncertainty measure that combines a feature-level imputation uncertainty and a knowledge-level imputation uncertainty. Second, to mitigate the adverse influence of potentially labeled imprecisely imputed instances, we suggest using a diversity-based uncertainty sampling strategy to select query instances in specified candidate instance regions. Extensive experiments on nine public ordinal classification datasets with varying value missing rates show that the proposed approach outperforms several baseline methods.
引用
收藏
页码:613 / 634
页数:22
相关论文
共 50 条
  • [21] Active Learning for Imbalanced Ordinal Regression
    Ge, Jiaming
    Chen, Haiyan
    Zhang, Dongfang
    Hou, Xiaye
    Yuan, Ligang
    IEEE ACCESS, 2020, 8 (08): : 180608 - 180617
  • [22] Active Learning Classification of Drifted Streaming Data
    Wozniak, Michal
    Ksieniewicz, Pawel
    Cyganek, Boguslaw
    Kasprzak, Andrzej
    Walkowiak, Krzysztof
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 1724 - 1733
  • [23] Genetic Programming with Interval Functions and Ensemble Learning for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Xue, Bing
    Andreae, Peter
    AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11320 : 577 - 589
  • [24] Incomplete data decomposition for classification
    Latkowski, R
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2002, 2475 : 413 - 420
  • [25] Pattern classification for incomplete data
    Gabrys, Bogdan
    International Conference on Knowledge-Based Intelligent Electronic Systems, Proceedings, KES, 2000, 1 : 454 - 457
  • [26] Classification of Incomplete Data by Observation
    Lorrentz, Pierre
    ENGINEERING LETTERS, 2010, 18 (04)
  • [27] Pattern classification for incomplete data
    Gabrys, B
    KES'2000: FOURTH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, VOLS 1 AND 2, PROCEEDINGS, 2000, : 454 - 457
  • [28] HANDLING OF INCOMPLETE DATA IN CLASSIFICATION
    CHAN, L
    BIOMETRICS, 1972, 28 (04) : 1162 - 1162
  • [29] Classification of incomplete data by observation
    Lorrentz, Pierre
    Engineering Letters, 2011, 18 (04)
  • [30] Threshold model for incomplete ordinal data from repeated measurements
    Chan, W
    Tang, ML
    AMERICAN STATISTICAL ASSOCIATION - 1996 PROCEEDINGS OF THE SOCIAL STATISTICS SECTION, 1996, : 123 - 128