Multi-instance clustering with applications to multi-instance prediction

被引：107

作者：

Zhang, Min-Ling ^{[1
,2
]}

Zhou, Zhi-Hua ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210093, Peoples R China

[2] Hohai Univ, Coll Comp & Informat Engn, Nanjing 210098, Peoples R China

来源：

APPLIED INTELLIGENCE | 2009年 / 31卷 / 01期

基金：

国家高技术研究发展计划(863计划); 美国国家科学基金会;

关键词：

Machine learning; Multi-instance learning; Clustering; Representation transformation; NEURAL-NETWORKS;

D O I：

10.1007/s10489-007-0111-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the setting of multi-instance learning, each object is represented by a bag composed of multiple instances instead of by a single instance in a traditional learning setting. Previous works in this area only concern multi-instance prediction problems where each bag is associated with a binary (classification) or real-valued (regression) label. However, unsupervised multi-instance learning where bags are without labels has not been studied. In this paper, the problem of unsupervised multi-instance learning is addressed where a multi-instance clustering algorithm named Bamic is proposed. Briefly, by regarding bags as atomic data items and using some form of distance metric to measure distances between bags, Bamic adapts the popular k -Medoids algorithm to partition the unlabeled training bags into k disjoint groups of bags. Furthermore, based on the clustering results, a novel multi-instance prediction algorithm named Bartmip is developed. Firstly, each bag is re-represented by a k-dimensional feature vector, where the value of the i-th feature is set to be the distance between the bag and the medoid of the i-th group. After that, bags are transformed into feature vectors so that common supervised learners are used to learn from the transformed feature vectors each associated with the original bag's label. Extensive experiments show that Bamic could effectively discover the underlying structure of the data set and Bartmip works quite well on various kinds of multi-instance prediction problems.

引用

页码：47 / 68

页数：22

共 50 条

[1] Multi-instance clustering with applications to multi-instance prediction
Min-Ling Zhang
Zhi-Hua Zhou
Applied Intelligence, 2009, 31 : 47 - 68
[2] Constrained instance clustering in multi-instance multi-label learning
Pei, Yuanli
Fern, Xiaoli Z.
PATTERN RECOGNITION LETTERS, 2014, 37 : 107 - 114
[3] Multi-Instance Learning for Bankruptcy Prediction
Kotsiantis, Sotiris
Kanellopoulos, Dimitris
THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 1007 - +
[4] SIMULTANEOUS INSTANCE ANNOTATION AND CLUSTERING IN MULTI-INSTANCE MULTI-LABEL LEARNING
Pham, Anh T.
Raich, Raviv
Fern, Xiaoli Z.
2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
[5] Multi-Instance Learning with Key Instance Shift
Zhang, Ya-Lin
Zhou, Zhi-Hua
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3441 - 3447
[6] EFFICIENT INSTANCE ANNOTATION IN MULTI-INSTANCE LEARNING
Pham, Anh T.
Raich, Raviv
Fern, Xiaoli Z.
2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 137 - 140
[7] Fragmentary Multi-Instance Classification
Wu, Jie
Zhuge, Wenzhang
Liu, Xinwang
Liu, Li
Hou, Chenping
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 5156 - 5169
[8] Marginalized Multi-Instance Kernels
Kwok, James T.
Cheung, Pak-Ming
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 901 - 906
[9] Multi-Instance Dimensionality Reduction
Sun, Yu-Yin
Ng, Michael K.
Zhou, Zhi-Hua
PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 587 - 592
[10] Fuzzy Multi-Instance Classifiers
Vluymans, Sarah
Sanchez Tarrago, Danel
Saeys, Yvan
Cornelis, Chris
Herrera, Francisco
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2016, 24 (06) : 1395 - 1409

← 1 2 3 4 5 →