Simultaneous instance pooling and bag representation selection approach for multiple-instance learning (MIL) using vision transformer

被引:0
|
作者
Muhammad Waqas
Muhammad Atif Tahir
Muhammad Danish Author
Sumaya Al-Maadeed
Ahmed Bouridane
Jia Wu
机构
[1] National University of Computer Emerging Science (FAST-NUCES),FAST School of Computing
[2] The University of Texas MD Anderson Cancer Center,Department of Imaging Physics
[3] United Arab Emirates University,College of information technology
[4] Qatar University,Department of Computer Science and Engineering
[5] University of Sharjah,Cybersecurity and Data Analytics Research Center
来源
关键词
Multiple-instance learning (MIL); Vision transformers; Attention-based pooling; Bag representation selection;
D O I
暂无
中图分类号
学科分类号
摘要
In multiple-instance learning (MIL), the existing bag encoding and attention-based pooling approaches assume that the instances in the bag have no relationship among them. This assumption is unsuited, as the instances in the bags are rarely independent in diverse MIL applications. In contrast, the instance relationship assumption-based techniques incorporate the instance relationship information in the classification process. However, in MIL, the bag composition process is complicated, and it may be possible that instances in one bag are related and instances in another bag are not. In present MIL algorithms, this relationship assumption is not explicitly modeled. The learning algorithm is trained based on one of two relationship assumptions (whether instances in all bags have a relationship or not). Hence, it is essential to model the assumption of instance relationships in the bag classification process. This paper proposes a robust approach that generates vector representation for the bag for both assumptions and the representation selection process to determine whether to consider the instances related or unrelated in the bag classification process. This process helps to determine the essential bag representation vector for every individual bag. The proposed method utilizes attention pooling and vision transformer approaches to generate bag representation vectors. Later, the representation selection subnetwork determines the vector representation essential for bag classification in an end-to-end trainable manner. The generalization abilities of the proposed framework are demonstrated through extensive experiments on several benchmark datasets. The experiments demonstrate that the proposed approach outperforms other state-of-the-art MIL approaches in bag classification.
引用
收藏
页码:6659 / 6680
页数:21
相关论文
共 50 条
  • [21] An Instance Selection Approach to Multiple Instance Learning
    Fu, Zhouyu
    Robles-Kelly, Antonio
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 911 - +
  • [22] Drug activity prediction using multiple-instance learning via joint instance and feature selection
    Zhendong Zhao
    Gang Fu
    Sheng Liu
    Khaled M Elokely
    Robert J Doerksen
    Yixin Chen
    Dawn E Wilkins
    [J]. BMC Bioinformatics, 14
  • [23] Drug activity prediction using multiple-instance learning via joint instance and feature selection
    Zhao, Zhendong
    Fu, Gang
    Liu, Sheng
    Elokely, Khaled M.
    Doerksen, Robert J.
    Chen, Yixin
    Wilkins, Dawn E.
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [24] Multiple-instance learning via multiple-point concept based instance selection
    Liming Yuan
    Guangping Xu
    Lu Zhao
    Xianbin Wen
    Haixia Xu
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 2113 - 2126
  • [25] Multiple-instance learning via multiple-point concept based instance selection
    Yuan, Liming
    Xu, Guangping
    Zhao, Lu
    Wen, Xianbin
    Xu, Haixia
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2113 - 2126
  • [26] Multiple instance learning based on positive instance selection and bag structure construction
    Li, Zhan
    Geng, Guo-Hua
    Feng, Jun
    Peng, Jin-ye
    Wen, Chao
    Liang, Jun-li
    [J]. PATTERN RECOGNITION LETTERS, 2014, 40 : 19 - 26
  • [27] Improving Representation of the Positive Class in Imbalanced Multiple-Instance Learning
    Mera, Carlos
    Orozco-Alzate, Mauricio
    Branch, John
    [J]. IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 266 - 273
  • [28] Multiple-Instance feature extraction at the bag and instance levels using the maximum trace-difference criterion
    Chai, Jing
    Chen, Bo
    Liu, Fan
    Chen, Zehua
    Ding, Xinghao
    [J]. INFORMATION SCIENCES, 2017, 385 : 353 - 377
  • [29] MIL-VT: Multiple Instance Learning Enhanced Vision Transformer for Fundus Image Classification
    Yu, Shuang
    Ma, Kai
    Bi, Qi
    Bian, Cheng
    Ning, Munan
    He, Nanjun
    Li, Yuexiang
    Liu, Hanruo
    Zheng, Yefeng
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 45 - 54
  • [30] MIL-SKDE: Multiple-instance learning with supervised kernel density estimation
    Du, Ruo
    Wu, Qiang
    He, Xiangjian
    Yang, Jie
    [J]. SIGNAL PROCESSING, 2013, 93 (06) : 1471 - 1484