Nonlinear separation of data via mixed 0-1 Integer and Linear Programming

被引:5
|
作者
Kim, Kwangsoo [1 ]
Ryoo, Hong Seo [1 ]
机构
[1] Korea Univ, Div Informat Management Engn, Seoul 136713, South Korea
基金
新加坡国家研究基金会;
关键词
supervised learning; binary classification; mixed 0-1 integer and linear program; global optimization;
D O I
10.1016/j.amc.2007.03.067
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper presents a new mathematical programming-based learning methodology for separation of two types of data. Specifically, we develop a new l(1)-norm error distance metric and use it to develop a Mixed 0-1 Integer and Linear Programming ( MILP) model that optimizes the interplay of user-provided discriminant functions, including kernel functions for support vector machines, to implement a nonlinear, nonconvex and/or disjoint decision boundary for the best separation of data at hand. With the concurrent optimization of discriminant functions, the MILP-based learning can be used for finding the optimal and least complex classification rule for noise-free data and for implementing a most robust classification rule for real-life data with noise. With extensive experiments on separation of two dimensional artificial datasets that are clean and noisy, we graphically illustrate the aforementioned advantages of the new MILP-based learning methodology. With experiments on real-life benchmark datasets from the UC Irvine Repository of machine learning databases, in comparison with the multisurface method and the support vector machines, we demonstrate the advantage of using and concurrently optimizing more than a single discriminant function for a robust separation of real-life data, hence the utility of the proposed methodology in supervised learning. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:183 / 196
页数:14
相关论文
共 50 条