Online Nonlinear Classification for High-Dimensional Data

被引:0
|
作者
Vanli, N. Denizcan [1 ]
Ozkan, Huseyin [1 ]
Delibalta, Ibrahim [2 ,3 ]
Kozat, Suleyman S. [1 ]
机构
[1] Bilkent Univ, Dept Elect & Elect Engn, Ankara, Turkey
[2] AVEA Commun Serv Inc, AveaLabs, Istanbul, Turkey
[3] Koc Univ, Grad Sch Social Sci & Humanities, Istanbul, Turkey
关键词
Online classification; randomized algorithms; nonlinear classification; high-dimensional data;
D O I
10.1109/BigDataCongress.2015.109
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study online binary classification problem under the empirical zero-one loss function. We introduce a novel randomized classification algorithm based on highly dynamic hierarchical models that partition the feature space. Our approach jointly and sequentially learns the partitioning of the feature space, the optimal classifier among all doubly exponential number of classifiers defined by the tree, and the individual region classifiers in order to directly minimize the cumulative loss. Although we adapt the entire hierarchical model to minimize a global loss function, the computational complexity of the introduced algorithm scales linearly with the dimensionality of the feature space and the depth of the tree. Furthermore, our algorithm can be applied to any streaming data without requiring a training phase or prior information, hence processes data on-the-fly and then discards it, which makes the introduced algorithm significantly appealing for applications involving "big data". We evaluate the performance of the introduced algorithm over different real data sets.
引用
收藏
页码:685 / 688
页数:4
相关论文
共 50 条
  • [1] High-dimensional Data Stream Classification via Sparse Online Learning
    Wang, Dayong
    Wu, Pengcheng
    Zhao, Peilin
    Wu, Yue
    Miao, Chunyan
    Hoi, Steven C. H.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 1007 - 1012
  • [2] Penalized Gaussian Process Regression and Classification for High-Dimensional Nonlinear Data
    Yi, G.
    Shi, J. Q.
    Choi, T.
    [J]. BIOMETRICS, 2011, 67 (04) : 1285 - 1294
  • [3] A classification algorithm for high-dimensional data
    Roy, Asim
    [J]. INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 345 - 355
  • [5] Enhanced algorithm for high-dimensional data classification
    Wang, Xiaoming
    Wang, Shitong
    [J]. APPLIED SOFT COMPUTING, 2016, 40 : 1 - 9
  • [6] A training algorithm for classification of high-dimensional data
    Vieira, A
    Barradas, N
    [J]. NEUROCOMPUTING, 2003, 50 : 461 - 472
  • [7] A Compressive Classification Framework for High-Dimensional Data
    Tabassum, Muhammad Naveed
    Ollila, Esa
    [J]. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2020, 1 : 177 - 186
  • [8] Ensemble Method for Classification of High-Dimensional Data
    Piao, Yongjun
    Park, Hyun Woo
    Jin, Cheng Hao
    Ryu, Keun Ho
    [J]. 2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 245 - +
  • [9] INNOVATED INTERACTION SCREENING FOR HIGH-DIMENSIONAL NONLINEAR CLASSIFICATION
    Fan, Yingying
    Kong, Yinfei
    Li, Daoji
    Zheng, Zemin
    [J]. ANNALS OF STATISTICS, 2015, 43 (03): : 1243 - 1272
  • [10] Online Markov Blanket Learning for High-Dimensional Data
    Zhaolong Ling
    Bo Li
    Yiwen Zhang
    Ying Li
    Haifeng Ling
    [J]. Applied Intelligence, 2023, 53 : 5977 - 5997