Sparse multiple instance learning as document classification

Authors
Shengye Yan
Xiaodong Zhu
Guoqing Liu
Jianxin Wu
Affiliations
[1] NUIST, B-DAT, CICAEET, School of Information and Control
[2] Minieye, Youjia Innovation LLC
[3] National Key Laboratory for Novel Software Technology, Nanjing University
Keywords
Sparse multiple instance learning; Low witness rate; Structural representation; Document classification
DOI
Not available
Abstract
This work focuses on multiple instance learning (MIL) with sparse positive bags, which we call sparse MIL. A structural representation is presented that encodes both instances and bags. This representation leads to a non-i.i.d. MIL algorithm, miStruct, which uses a structural similarity to compare bags. Furthermore, MIL with this representation is shown to be equivalent to a document classification problem. Document classification faces the same difficulty: only a few paragraphs or words are useful in revealing the category of a document. The miDoc method is then proposed, built on the TF-IDF representation, which has excellent empirical performance in document classification. The proposed methods achieve significantly higher accuracy and AUC (area under the ROC curve) than the state of the art on a large number of sparse MIL problems, and the document classification analogy explains their efficacy on such problems.
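The bag-as-document analogy can be illustrated with a minimal sketch. It is not the authors' miStruct or miDoc implementation, whose details the abstract does not give. It assumes that each instance is quantized to its nearest entry in a fixed codebook (a hypothetical stand-in for the structural representation), that each bag is then treated as a document of such "words", and that a smoothed TF-IDF weighting is applied. The names `quantize`, `bags_to_tfidf`, `codebook`, and `bags` are illustrative only.

```python
import math
from collections import Counter


def quantize(instance, codebook):
    """Return the index of the nearest codeword (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((x - c) ** 2 for x, c in zip(instance, codebook[k])),
    )


def bags_to_tfidf(bags, codebook):
    """Map each bag (a list of instance vectors) to a TF-IDF vector over codewords."""
    n_words = len(codebook)
    n_bags = len(bags)
    # Each bag becomes a "document": a multiset of codeword indices.
    docs = [Counter(quantize(inst, codebook) for inst in bag) for bag in bags]
    # Document frequency: the number of bags in which each codeword occurs.
    df = [sum(1 for doc in docs if k in doc) for k in range(n_words)]
    # Smoothed IDF (an assumption of this sketch): rare codewords get larger weights.
    idf = [math.log((1 + n_bags) / (1 + df[k])) + 1.0 for k in range(n_words)]
    vectors = []
    for doc, bag in zip(docs, bags):
        total = max(len(bag), 1)  # guard against empty bags
        vectors.append([doc[k] / total * idf[k] for k in range(n_words)])
    return vectors


# Toy data (hypothetical): codeword 2 plays the role of the rare positive concept.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
bags = [
    [(0.1, 0.0), (0.9, 1.1), (0.0, 0.2)],  # negative bag: only common codewords
    [(0.2, 0.1), (1.0, 0.9), (4.8, 5.1)],  # sparse positive bag: one rare instance
]
vectors = bags_to_tfidf(bags, codebook)
```

In this toy example the single rare instance in the second bag receives the largest weight in that bag's vector, because its codeword appears in fewer bags. This is the intuition behind using TF-IDF when only a few instances reveal the bag label. The resulting vectors could then be passed to any standard classifier.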
Pages: 4553-4570
Page count: 17
Source
Yan, Shengye; Zhu, Xiaodong; Liu, Guoqing; Wu, Jianxin. Sparse multiple instance learning as document classification [J]. Multimedia Tools and Applications, 2017, 76(3): 4553-4570.