Upper and Lower Tight Error Bounds for Feature Omission with an Extension to Context Reduction

被引:0
|
作者
Schlueter, Ralf [1 ]
Beck, Eugen [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Human Language Technol & Pattern Recognit, Ahornstr 55, D-52056 Aachen, Germany
基金
欧洲研究理事会;
关键词
Error bound; Bayes error; feature selection; language model; perplexity; context reduction; pattern classification; sequence classification; LANGUAGE; RECOGNITION;
D O I
10.1109/TPAMI.2017.2788434
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, fundamental analytic results in the form of error bounds are presented that quantify the effect of feature omission and selection for pattern classification in general, as well as the effect of context reduction in string classification, like automatic speech recognition, printed/handwritten character recognition, or statistical machine translation. A general simulation framework is introduced that supports discovery and proof of error bounds, which lead to the error bounds presented here. Initially derived tight lower and upper bounds for feature omission are generalized to feature selection, followed by another extension to context reduction of string class priors (aka language models) in string classification. For string classification, the quantitative effect of string class prior context reduction on symbol-level Bayes error is presented. The tightness of the original feature omission bounds seems lost in this case, as further simulations indicate. However, combining both feature omission andcontext reduction, the tightness of the bounds is retained. A central result of this work is the proof of the existence, and the amount of a statistical threshold w.r.t. the introduction of additional features in general pattern classification, or the increase of context in string classification beyond which a decrease in Bayes error is guaranteed.
引用
收藏
页码:502 / 514
页数:13
相关论文
共 50 条
  • [21] Minimum-error quantum state discrimination: Extremely tight upper and lower bounds with application in digital quantum communication
    Yaghoobianzadeh, Amir Mohammad
    Salehi, Jawad A.
    [J]. PHYSICAL REVIEW A, 2024, 110 (04)
  • [22] Partial Convex Recolorings of Trees and Galled Networks: Tight Upper and Lower Bounds
    Moran, Shlomo
    Snir, Sagi
    Sung, Wing-Kin
    [J]. ACM TRANSACTIONS ON ALGORITHMS, 2011, 7 (04)
  • [23] TIGHT UPPER AND LOWER BOUNDS ON THE PATH-LENGTH OF BINARY-TREES
    DESANTIS, A
    PERSIANO, G
    [J]. SIAM JOURNAL ON COMPUTING, 1994, 23 (01) : 12 - 23
  • [24] On Numerical Solution of the Markov Renewal Equation: Tight Upper and Lower Kernel Bounds
    D. A. Elkins
    M. A. Wortman
    [J]. Methodology And Computing In Applied Probability, 2001, 3 (3) : 239 - 253
  • [25] Optimal lower and upper bounds for the geometric convex combination of the error function
    Yong-Min Li
    Wei-Feng Xia
    Yu-Ming Chu
    Xiao-Hui Zhang
    [J]. Journal of Inequalities and Applications, 2015
  • [26] LOWER AND UPPER BOUNDS ON OPTIMAL FILTERING ERROR OF CERTAIN DIFFUSION PROCESSES
    ZAKAI, M
    ZIV, J
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1972, 18 (03) : 325 - +
  • [27] Upper and lower error bounds for plate-bending finite elements
    Juhani Pitkäranta
    Manil Suri
    [J]. Numerische Mathematik, 2000, 84 : 611 - 648
  • [28] Saddlepoint Approximations of Lower and Upper Bounds to the Error Probability in Channel Coding
    Font-Segura, Josep
    Vazquez-Vilar, Gonzalo
    Martinez, Alfonso
    Guillen i Fabregas, Albert
    Lancho, Alejandro
    [J]. 2018 52ND ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2018,
  • [29] Upper and lower error bounds for plate-bending finite elements
    Pitkäranta, J
    Suri, M
    [J]. NUMERISCHE MATHEMATIK, 2000, 84 (04) : 611 - 648
  • [30] Optimal lower and upper bounds for the geometric convex combination of the error function
    Li, Yong-Min
    Xia, Wei-Feng
    Chu, Yu-Ming
    Zhang, Xiao-Hui
    [J]. JOURNAL OF INEQUALITIES AND APPLICATIONS, 2015, : 1 - 8