Efficient OCR using simple features and decision trees with backtracking

被引:0
|
作者
Abuhaiba, Ibrahim S. I. [1 ]
机构
[1] Islam Univ Gaza, Dept Elect & Comp Engn, Gaza, Israel
来源
关键词
OCR; normalization; projections; geometrical features; decision tree learning;
D O I
暂无
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, it is shown that it is adequate to use simple and easy-to-compute features such as those we call sliced horizontal and vertical projections to solve efficiently the OCR problem for machine-printed documents. Recognition is achieved using a decision tree supported with backtracking, smoothing, row and column cropping, and other additions to increase the success rate. Symbols from Times New Roman typeface are used to train our system. Activating backtracking, smoothing, and cropping achieved more than 98% success rate for a recognition time below 30 ms per character. The recognition algorithm was exposed to a hard test by polluting the original dataset with additional artificial noise and could maintain a high success rate and low error rate for highly polluted images, which is a result of backtracking, smoothing, and row and column cropping. Results indicate that we can depend on simple features and hints to reliably recognize characters. The error rate can be decreased by increasing the size of the training dataset. The recognition time can be reduced by using some programming optimization techniques and more powerful computers.
引用
收藏
页码:223 / 243
页数:21
相关论文
共 50 条
  • [31] Brain Tumor Identification Using Gaussian Mixture Model Features and Decision Trees Classifier
    Chaddad, Ahmad
    Zinn, Pascal O.
    Colen, Rivka R.
    2014 48TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2014,
  • [32] An efficient approach for knowledge discovery in decision trees using inter quartile range transform
    Battula, Bhanu Prakash
    Rama Krishna, K.V.S.S.
    Kim, Tai-Hoon
    International Journal of Control and Automation, 2015, 8 (07): : 325 - 334
  • [33] ANALYSIS OF THE ARGUMENTATIONS OF DECISION MAKERS USING DECISION TREES
    GALLHOFER, IN
    SARIS, WE
    QUALITY & QUANTITY, 1979, 13 (05) : 411 - 430
  • [34] Using Decision Trees in Economizer Repair Decision Making
    Sun, Yong
    Ma, Lin
    Robinson, Warwick
    Fidge, Colin
    2010 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE, 2010, : 139 - +
  • [36] Accurate Robust and Efficient Error Estimation for Decision Trees
    Fan, Lixin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [37] Simple and efficient fully-functional succinct trees
    Cordova, Joshimar
    Navarro, Gonzalo
    THEORETICAL COMPUTER SCIENCE, 2016, 656 : 135 - 145
  • [38] Efficient Non-greedy Optimization of Decision Trees
    Norouzi, Mohammad
    Collins, Maxwell D.
    Johnson, Matthew
    Fleet, David J.
    Kohli, Pushmeet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [39] An Efficient Extension to Mixture Techniques for Prediction and Decision Trees
    Fernando C. Pereira
    Yoram Singer
    Machine Learning, 1999, 36 : 183 - 199
  • [40] Efficient Quantum Agnostic Improper Learning of Decision Trees
    Chatterjee, Sagnik
    Sapv, Tharrmashastha
    Bera, Debajyoti
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238