Sample complexity for computational classification problems

被引:2
|
作者
Ryabko, Daniil [1 ]
机构
[1] IDSIA, CH-6928 Lugano, Switzerland
关键词
pattern recognition; classification; sample complexity; Kolmogorov complexity; computability analysis;
D O I
10.1007/s00453-007-0037-z
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In a statistical setting of the classification (pattern recognition) problem the number of examples required to approximate an unknown labelling function is linear in the VC dimension of the target learning class. In this work we consider the question of whether such bounds exist if we restrict our attention to computable classification methods, assuming that the unknown labelling function is also computable. We find that in this case the number of examples required for a computable method to approximate the labelling function not only is not linear, but grows faster (in the VC dimension of the class) than any computable function. No time or space constraints are put on the predictors or target functions; the only resource we consider is the training examples. The task of classification is considered in conjunction with another learning problem - data compression. An impossibility result for the task of data compression allows us to estimate the sample complexity for pattern recognition.
引用
下载
收藏
页码:69 / 77
页数:9
相关论文
共 50 条