Multi-instance clustering with applications to multi-instance prediction

被引:107
|
作者
Zhang, Min-Ling [1 ,2 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[2] Hohai Univ, Coll Comp & Informat Engn, Nanjing 210098, Peoples R China
基金
国家高技术研究发展计划(863计划); 美国国家科学基金会;
关键词
Machine learning; Multi-instance learning; Clustering; Representation transformation; NEURAL-NETWORKS;
D O I
10.1007/s10489-007-0111-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the setting of multi-instance learning, each object is represented by a bag composed of multiple instances instead of by a single instance in a traditional learning setting. Previous works in this area only concern multi-instance prediction problems where each bag is associated with a binary (classification) or real-valued (regression) label. However, unsupervised multi-instance learning where bags are without labels has not been studied. In this paper, the problem of unsupervised multi-instance learning is addressed where a multi-instance clustering algorithm named Bamic is proposed. Briefly, by regarding bags as atomic data items and using some form of distance metric to measure distances between bags, Bamic adapts the popular k -Medoids algorithm to partition the unlabeled training bags into k disjoint groups of bags. Furthermore, based on the clustering results, a novel multi-instance prediction algorithm named Bartmip is developed. Firstly, each bag is re-represented by a k-dimensional feature vector, where the value of the i-th feature is set to be the distance between the bag and the medoid of the i-th group. After that, bags are transformed into feature vectors so that common supervised learners are used to learn from the transformed feature vectors each associated with the original bag's label. Extensive experiments show that Bamic could effectively discover the underlying structure of the data set and Bartmip works quite well on various kinds of multi-instance prediction problems.
引用
收藏
页码:47 / 68
页数:22
相关论文
共 50 条
  • [1] Multi-instance clustering with applications to multi-instance prediction
    Min-Ling Zhang
    Zhi-Hua Zhou
    Applied Intelligence, 2009, 31 : 47 - 68
  • [2] Constrained instance clustering in multi-instance multi-label learning
    Pei, Yuanli
    Fern, Xiaoli Z.
    PATTERN RECOGNITION LETTERS, 2014, 37 : 107 - 114
  • [3] Multi-Instance Learning for Bankruptcy Prediction
    Kotsiantis, Sotiris
    Kanellopoulos, Dimitris
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 1007 - +
  • [4] SIMULTANEOUS INSTANCE ANNOTATION AND CLUSTERING IN MULTI-INSTANCE MULTI-LABEL LEARNING
    Pham, Anh T.
    Raich, Raviv
    Fern, Xiaoli Z.
    2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [5] Multi-Instance Learning with Key Instance Shift
    Zhang, Ya-Lin
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3441 - 3447
  • [6] EFFICIENT INSTANCE ANNOTATION IN MULTI-INSTANCE LEARNING
    Pham, Anh T.
    Raich, Raviv
    Fern, Xiaoli Z.
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 137 - 140
  • [7] Fragmentary Multi-Instance Classification
    Wu, Jie
    Zhuge, Wenzhang
    Liu, Xinwang
    Liu, Li
    Hou, Chenping
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 5156 - 5169
  • [8] Marginalized Multi-Instance Kernels
    Kwok, James T.
    Cheung, Pak-Ming
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 901 - 906
  • [9] Multi-Instance Dimensionality Reduction
    Sun, Yu-Yin
    Ng, Michael K.
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 587 - 592
  • [10] Fuzzy Multi-Instance Classifiers
    Vluymans, Sarah
    Sanchez Tarrago, Danel
    Saeys, Yvan
    Cornelis, Chris
    Herrera, Francisco
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2016, 24 (06) : 1395 - 1409