Theory and Algorithms for Shapelet-Based Multiple-Instance Learning

被引:1
|
作者
Suehiro, Daiki [1 ,2 ]
Hatan, Kohei [2 ,3 ]
Takimoto, Eiji [4 ]
Yamamoto, Shuji [5 ,6 ]
Bannai, Kenichi [5 ,6 ]
Takeda, Akiko [7 ,8 ]
机构
[1] Kyushu Univ, Fac Informat Sci & Elect Engn, Dept Adv Informat Technol, Nishi Ku, Fukuoka 8190395, Japan
[2] RIKEN, Ctr Adv Intelligence Project, Nishi Ku, Fukuoka 8190395, Japan
[3] Kyushu Univ, Fac Arts & Sci, Nishi Ku, Fukuoka 8190395, Japan
[4] Kyushu Univ, Fac Informat Sci & Elect Engn, Dept Informat, Nishi Ku, Fukuoka 8190395, Japan
[5] Keio Univ, Dept Math, Minatokita Ku, Yokohama, Kanagawa 2238522, Japan
[6] RIKEN, Ctr Adv Intelligence Project, Minatokita Ku, Yokohama, Kanagawa 2238522, Japan
[7] Univ Tokyo, Dept Creat Informat, Bunkyo Ku, Tokyo 1138656, Japan
[8] RIKEN, Ctr Adv Intelligence Project, Bunkyo Ku, Tokyo 1138656, Japan
关键词
TIME-SERIES; CLASSIFICATION;
D O I
10.1162/neco_a_01297
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new formulation of multiple-instance learning (MIL), in which a unit of data consists of a set of instances called a bag. The goal is to find a good classifier of bags based on the similarity with a "shapelet" (or pattern), where the similarity of a bag with a shapelet is the maximum similarity of instances in the bag. In previous work, some of the training instances have been chosen as shapelets with no theoretical justification. In our formulation, we use all possible, and thus infinitely many, shapelets, resulting in a richer class of classifiers. We show that the formulation is tractable, that is, it can be reduced through linear programming boosting (LPBoost) to difference of convex (DC) programs of finite (actually polynomial) size. Our theoretical result also gives justification to the heuristics of some previous work. The time complexity of the proposed algorithm highly depends on the size of the set of all instances in the training sample. To apply to the data containing a large number of instances, we also propose a heuristic option of the algorithm without the loss of the theoretical guarantee. Our empirical study demonstrates that our algorithm uniformly works for shapelet learning tasks on time-series classification and various MIL tasks with comparable accuracy to the existing methods. Moreover, we show that the proposed heuristics allow us to achieve the result in reasonable computational time.
引用
收藏
页码:1580 / 1613
页数:34
相关论文
共 50 条
  • [1] An Iterative Instance Selection Based Framework for Multiple-Instance Learning
    Yuan, Liming
    Wen, Xianbin
    Zhao, Lu
    Xu, Haixia
    [J]. 2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 772 - 779
  • [2] Multiple-instance learning algorithms for computer-aided detection
    Dundar, M. Murat
    Fung, Glenn
    Krishnapuram, Balaji
    Rao, R. Bharat
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2008, 55 (03) : 1015 - 1021
  • [3] Compact Multiple-Instance Learning
    Chai, Jing
    Liu, Weiwei
    Tsang, Ivor W.
    Shen, Xiaobo
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2007 - 2010
  • [4] ON GENERALIZED MULTIPLE-INSTANCE LEARNING
    Scott, Stephen
    Zhang, Jun
    Brown, Joshua
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2005, 5 (01) : 21 - 35
  • [5] A framework for multiple-instance learning
    Maron, O
    Lozano-Perez, T
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 570 - 576
  • [6] On multiple-instance learning of halfspaces
    Diochnos, D. I.
    Sloan, R. H.
    Turan, Gy
    [J]. INFORMATION PROCESSING LETTERS, 2012, 112 (23) : 933 - 936
  • [7] UNSUPERVISED MULTIPLE-INSTANCE LEARNING FOR INSTANCE SEARCH
    Wang, Zhenzhen
    Yuan, Junsong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [8] MULTIPLE-INSTANCE LEARNING WITH PAIRWISE INSTANCE SIMILARITY
    Yuan, Liming
    Liu, Jiafeng
    Tang, Xianglong
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (03) : 567 - 577
  • [9] Salient Instance Selection for Multiple-Instance Learning
    Yuan, Liming
    Liu, Songbo
    Huang, Qingcheng
    Liu, Jiafeng
    Tang, Xianglong
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 58 - 67
  • [10] ENSEMBLE-BASED INSTANCE RELEVANCE ESTIMATION IN MULTIPLE-INSTANCE LEARNING
    Waqas, Muhammd
    Tahir, Muhammad Atif
    Qureshi, Rizwan
    [J]. PROCEEDINGS OF THE 2021 9TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP), 2021,