Multi-instance learning by maximizing the area under receiver operating characteristic curve

被引:0
|
作者
I. Edhem Sakarya
O. Erhun Kundakcioglu
机构
[1] Eindhoven University of Technology,Department of Industrial Engineering and Innovation Sciences
[2] Ozyegin University,Department of Industrial Engineering
来源
关键词
Multi-instance learning; Mixed integer linear programming; Area under curve;
D O I
暂无
中图分类号
学科分类号
摘要
The purpose of this study is to solve the multi-instance classification problem by maximizing the area under the Receiver Operating Characteristic (ROC) curve obtained for witness instances. We derive a mixed integer linear programming model that chooses witnesses and produces the best possible ROC curve using a linear ranking function for multi-instance classification. The formulation is solved using a commercial mathematical optimization solver as well as a fast metaheuristic approach. When the data is not linearly separable, we illustrate how new features can be generated to tackle the problem. We present a comprehensive computational study to compare our methods against the state-of-the-art approaches in the literature. Our study reveals the success of an optimal linear ranking function through cross validation for several benchmark instances.
引用
收藏
页码:351 / 375
页数:24
相关论文
共 50 条
  • [31] On the optimism correction of the area under the receiver operating characteristic curve in logistic prediction models
    Iparragirre, Amaia
    Barrio, Irantzu
    Xose Rodriguez-Alvarez, Maria
    SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2019, 43 (01) : 145 - 162
  • [32] Scalable Multi-Instance Learning
    Wei, Xiu-Shen
    Wu, Jianxin
    Zhou, Zhi-Hua
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 1037 - 1042
  • [33] Multi-Instance Learning with Key Instance Shift
    Zhang, Ya-Lin
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3441 - 3447
  • [34] EFFICIENT INSTANCE ANNOTATION IN MULTI-INSTANCE LEARNING
    Pham, Anh T.
    Raich, Raviv
    Fern, Xiaoli Z.
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 137 - 140
  • [35] Receiver operating characteristic curve analysis in diagnostic accuracy studies: A guide to interpreting the area under the curve value
    Corbacioglu, Seref Kerem
    Aksel, Goekhan
    TURKISH JOURNAL OF EMERGENCY MEDICINE, 2023, 23 (04): : 195 - 198
  • [36] Confidence Intervals for the Area Under the Receiver Operating Characteristic Curve in the Presence of Ignorable Missing Data
    Choi, Hunyong
    Matthews, Gregory J.
    Hare, Ofer
    INTERNATIONAL STATISTICAL REVIEW, 2019, 87 (01) : 152 - 177
  • [37] Statistical inference for the area under the receiver operating characteristic curve in the presence of random measurement error
    Schisterman, EF
    Faraggi, D
    Reiser, B
    Trevisan, M
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2001, 154 (02) : 174 - 179
  • [38] Interval estimation for the area under the receiver operating characteristic curve when data are subject to error
    Li, Yanhong
    Koval, John J.
    Donner, Allan
    Zou, G. Y.
    STATISTICS IN MEDICINE, 2010, 29 (24) : 2521 - 2531
  • [39] Optimal two-phase sampling for estimating the area under the receiver operating characteristic curve
    Wu, Yougui
    STATISTICS IN MEDICINE, 2021, 40 (04) : 1059 - 1071
  • [40] The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
    Parodi, Stefano
    Verda, Damiano
    Bagnasco, Francesca
    Muselli, Marco
    EPIDEMIOLOGY AND HEALTH, 2022, 44