Simultaneous instance pooling and bag representation selection approach for multiple-instance learning (MIL) using vision transformer

被引：0

作者：

Muhammad Waqas

Muhammad Atif Tahir

Muhammad Danish Author

Sumaya Al-Maadeed

Ahmed Bouridane

Jia Wu

机构：

[1] National University of Computer Emerging Science (FAST-NUCES),FAST School of Computing

[2] The University of Texas MD Anderson Cancer Center,Department of Imaging Physics

[3] United Arab Emirates University,College of information technology

[4] Qatar University,Department of Computer Science and Engineering

[5] University of Sharjah,Cybersecurity and Data Analytics Research Center

来源：

Neural Computing and Applications | 2024年 / 36卷

关键词：

Multiple-instance learning (MIL); Vision transformers; Attention-based pooling; Bag representation selection;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In multiple-instance learning (MIL), the existing bag encoding and attention-based pooling approaches assume that the instances in the bag have no relationship among them. This assumption is unsuited, as the instances in the bags are rarely independent in diverse MIL applications. In contrast, the instance relationship assumption-based techniques incorporate the instance relationship information in the classification process. However, in MIL, the bag composition process is complicated, and it may be possible that instances in one bag are related and instances in another bag are not. In present MIL algorithms, this relationship assumption is not explicitly modeled. The learning algorithm is trained based on one of two relationship assumptions (whether instances in all bags have a relationship or not). Hence, it is essential to model the assumption of instance relationships in the bag classification process. This paper proposes a robust approach that generates vector representation for the bag for both assumptions and the representation selection process to determine whether to consider the instances related or unrelated in the bag classification process. This process helps to determine the essential bag representation vector for every individual bag. The proposed method utilizes attention pooling and vision transformer approaches to generate bag representation vectors. Later, the representation selection subnetwork determines the vector representation essential for bag classification in an end-to-end trainable manner. The generalization abilities of the proposed framework are demonstrated through extensive experiments on several benchmark datasets. The experiments demonstrate that the proposed approach outperforms other state-of-the-art MIL approaches in bag classification.

引用

页码：6659 / 6680

页数：21

共 50 条

[21] An Instance Selection Approach to Multiple Instance Learning
Fu, Zhouyu
Robles-Kelly, Antonio
[J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 911 - +
[22] Drug activity prediction using multiple-instance learning via joint instance and feature selection
Zhendong Zhao
Gang Fu
Sheng Liu
Khaled M Elokely
Robert J Doerksen
Yixin Chen
Dawn E Wilkins
[J]. BMC Bioinformatics, 14
[23] Drug activity prediction using multiple-instance learning via joint instance and feature selection
Zhao, Zhendong
Fu, Gang
Liu, Sheng
Elokely, Khaled M.
Doerksen, Robert J.
Chen, Yixin
Wilkins, Dawn E.
[J]. BMC BIOINFORMATICS, 2013, 14
[24] Multiple-instance learning via multiple-point concept based instance selection
Liming Yuan
Guangping Xu
Lu Zhao
Xianbin Wen
Haixia Xu
[J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 2113 - 2126
[25] Multiple-instance learning via multiple-point concept based instance selection
Yuan, Liming
Xu, Guangping
Zhao, Lu
Wen, Xianbin
Xu, Haixia
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2113 - 2126
[26] Multiple instance learning based on positive instance selection and bag structure construction
Li, Zhan
Geng, Guo-Hua
Feng, Jun
Peng, Jin-ye
Wen, Chao
Liang, Jun-li
[J]. PATTERN RECOGNITION LETTERS, 2014, 40 : 19 - 26
[27] Improving Representation of the Positive Class in Imbalanced Multiple-Instance Learning
Mera, Carlos
Orozco-Alzate, Mauricio
Branch, John
[J]. IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 266 - 273
[28] Multiple-Instance feature extraction at the bag and instance levels using the maximum trace-difference criterion
Chai, Jing
Chen, Bo
Liu, Fan
Chen, Zehua
Ding, Xinghao
[J]. INFORMATION SCIENCES, 2017, 385 : 353 - 377
[29] MIL-VT: Multiple Instance Learning Enhanced Vision Transformer for Fundus Image Classification
Yu, Shuang
Ma, Kai
Bi, Qi
Bian, Cheng
Ning, Munan
He, Nanjun
Li, Yuexiang
Liu, Hanruo
Zheng, Yefeng
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 45 - 54
[30] MIL-SKDE: Multiple-instance learning with supervised kernel density estimation
Du, Ruo
Wu, Qiang
He, Xiangjian
Yang, Jie
[J]. SIGNAL PROCESSING, 2013, 93 (06) : 1471 - 1484

← 1 2 3 4 5 →