Classification with small samples of high-dimensional data is important in many application areas. Quantile classifiers are distance-based classifiers that require a single parameter, regardless of the dimension, and classify observations according to a sum of weighted componentwise distances of the components of an observation to the within-class quantiles. An optimal percentage for the quantiles can be chosen by minimizing the misclassification error in the training sample. It is shown that this choice is consistent for the classification rule with the asymptotically optimal quantile and that under some assumptions, as the number of variables goes to infinity, the probability of correct classification converges to unity. The effect of skewness of the distributions of the predictor variables is discussed. The optimal quantile classifier gives low misclassification rates in a comprehensive simulation study and in a real-data application.
机构:
Univ N Carolina, Dept Biostat, Chapel Hill, NC 27515 USAUniv N Carolina, Dept Biostat, Chapel Hill, NC 27515 USA
Pritchard, David A.
Liu, Yufeng
论文数: 0引用数: 0
h-index: 0
机构:
Univ N Carolina, Dept Biostat, Chapel Hill, NC 27515 USA
Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
Univ N Carolina, Lineberger Comprehens Canc Ctr, Carolina Ctr Genome Sci, Dept Genet, Chapel Hill, NC 27515 USAUniv N Carolina, Dept Biostat, Chapel Hill, NC 27515 USA
机构:
Michigan State Univ, Dept Math, E Lansing, MI 48824 USA
Michigan State Univ, Dept Stat & Probabil, E Lansing, MI 48824 USASwiss Fed Inst Technol, Dept Math, RiskLab, CH-8092 Zurich, Switzerland
机构:
Department of Statistics, Cochin University of Science and Technology, CochinDepartment of Statistics, Cochin University of Science and Technology, Cochin
Krishnan A.S.
Sankaran P.G.
论文数: 0引用数: 0
h-index: 0
机构:
Department of Statistics, Cochin University of Science and Technology, CochinDepartment of Statistics, Cochin University of Science and Technology, Cochin