Multiple instance classification: Bag noise filtering for negative instance noise cleaning

Cited by: 5
Authors
Luengo, Julian [1]
Sanchez-Tarrago, Danel [2]
Prati, Ronaldo C. [3]
Herrera, Francisco [1]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18071, Spain
[2] Cent Univ Marta Abreu Las Villas, Dept Comp Sci, Santa Clara, Cuba
[3] Fed Univ ABC UFABC, Ctr Math Comp Sci & Cognit CMCC, Santo Andre, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP);
Keywords
Multiple instance classification; Data preprocessing; Noisy data; Instance noise; Noise filtering; STATISTICAL COMPARISONS; PERFORMANCE; CLASSIFIERS;
DOI
10.1016/j.ins.2021.07.076
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Data in the real world is far from being perfect. The appearance of noise is a common issue that arises from the limitations of data acquisition mechanisms and human knowledge. In classification, label noise will hinder the performance of almost all classifiers, inducing a bias in the built model. While label noise has recently attracted researchers' attention in standard classification, it has only recently begun to be studied in multiple instance classification. In this work, we propose the use of filtering algorithms for multiple instance classification that are able to reduce the impact of negative instances within the bags. To do so, we decompose the bags to form a standard classification problem that can be efficiently treated by a specialized noise filter. This decomposition is tackled in different ways, with the aim of exploiting the knowledge offered by the examples from opposite bags. The bags are then rebuilt without the instances identified as noise. In our experiments, we show that by applying our approach we can diminish the impact of noise and even obtain better results at the 0% noise level for several classifiers. Our proposal offers a promising way of dealing with noise in the bags of multiple instance datasets and of further improving the classification rate of the built models. (c) 2021 Published by Elsevier Inc.
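The decompose, filter, and rebuild steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' method: the abstract does not name the noise filter or the decomposition variants, so the sketch assumes a simple nearest-neighbour vote (in the spirit of an edited-nearest-neighbour filter) in which each instance inherits its bag's label and is compared only with instances from other bags. The function names (decompose, knn_flags, filter_bags), the parameter k, and the toy data are all hypothetical.

```python
import math

def decompose(bags, labels):
    """Flatten bags into (bag_index, instance_index, vector, inherited_label) rows."""
    return [(b, i, x, labels[b])
            for b, bag in enumerate(bags)
            for i, x in enumerate(bag)]

def knn_flags(rows, k=3):
    """Flag rows whose k nearest neighbours from OTHER bags mostly carry a different label.

    Assumption: a plain majority vote stands in for the unspecified noise filter.
    """
    flagged = set()
    for b, i, x, y in rows:
        others = [(math.dist(x, x2), y2) for b2, _, x2, y2 in rows if b2 != b]
        others.sort(key=lambda t: t[0])
        nearest = others[:k]
        disagree = sum(1 for _, y2 in nearest if y2 != y)
        if nearest and disagree > len(nearest) / 2:
            flagged.add((b, i))
    return flagged

def filter_bags(bags, labels, k=3):
    """Rebuild the bags, dropping flagged instances from positive bags only."""
    flagged = knn_flags(decompose(bags, labels), k)
    cleaned = []
    for b, bag in enumerate(bags):
        if labels[b] != 1:
            cleaned.append(list(bag))  # negative bags are left untouched
            continue
        kept = [x for i, x in enumerate(bag) if (b, i) not in flagged]
        cleaned.append(kept if kept else list(bag))  # never leave a bag empty
    return cleaned

# Toy data (hypothetical): each positive bag hides one negative-looking instance.
bags = [
    [(5.0, 5.0), (5.2, 4.9), (0.1, 0.0)],  # positive bag
    [(4.9, 5.1), (5.1, 5.0), (0.0, 0.2)],  # positive bag
    [(0.0, 0.0), (0.2, 0.1), (0.1, 0.2)],  # negative bag
    [(0.1, 0.1), (0.0, 0.1), (0.2, 0.0)],  # negative bag
]
labels = [1, 1, 0, 0]
cleaned = filter_bags(bags, labels)
```

In this toy case the instance near the origin in each positive bag is surrounded by instances from negative bags, so it is dropped when the bags are rebuilt. Restricting the vote to instances from other bags is one possible reading of "exploiting the knowledge offered by the examples from opposite bags"; the paper may define its decompositions differently.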
Pages: 388 - 400
Number of pages: 13
Related papers
50 in total
  • [31] Multiple Instance Learning with Bag-Level Randomized Trees
    Komarek, Tomas
    Somol, Petr
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I, 2019, 11051 : 259 - 272
  • [32] A Bag Oversampling Approach for Class Imbalance in Multiple Instance Learning
    Mera, Carlos
    Arrieta, Jose
    Orozco-Alzate, Mauricio
    Branch, John
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2015, 2015, 9423 : 724 - 731
  • [33] Multiple Instance Learning via Bag Space Construction and ELM
    Wen, Chao
    Zhou, Mingquan
    Li, Zhan
    [J]. 2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [34] Instance difficulty-based noise correction for crowdsourcing
    Hu, Yufei
    Jiang, Liangxiao
    Li, Chaoqun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [35] A Parametrical Model for Instance-Dependent Label Noise
    Yang, Shuo
    Wu, Songhua
    Yang, Erkun
    Han, Bo
    Liu, Yang
    Xu, Min
    Niu, Gang
    Liu, Tongliang
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14055 - 14068
  • [36] Multiple instance classification via quadratic programming
    Kucukasci, Emel Seyma
    Baydogan, Mustafa Gokce
    Taskin, Z. Caner
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2022, 83 (04) : 639 - 670
  • [37] Sparse multiple instance learning as document classification
    Yan, Shengye
    Zhu, Xiaodong
    Liu, Guoqing
    Wu, Jianxin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) : 4553 - 4570
  • [38] Sparse multiple instance learning as document classification
    Shengye Yan
    Xiaodong Zhu
    Guoqing Liu
    Jianxin Wu
    [J]. Multimedia Tools and Applications, 2017, 76 : 4553 - 4570
  • [39] Multiple instance classification via quadratic programming
    Emel Şeyma Küçükaşcı
    Mustafa Gökçe Baydoğan
    Z. Caner Taşkın
    [J]. Journal of Global Optimization, 2022, 83 : 639 - 670
  • [40] Multiple-Instance feature extraction at the bag and instance levels using the maximum trace-difference criterion
    Chai, Jing
    Chen, Bo
    Liu, Fan
    Chen, Zehua
    Ding, Xinghao
    [J]. INFORMATION SCIENCES, 2017, 385 : 353 - 377