Feature Representations for Scene Text Character Recognition: A Comparative Study

Cited by: 38
Authors
Yi, Chucai [1]
Yang, Xiaodong [2]
Tian, Yingli [1,2]
Affiliations
[1] CUNY, Grad Ctr, Dept Comp Sci, New York, NY 10016 USA
[2] CUNY, City Coll, Dept Elect Engn, New York, NY 10016 USA
Keywords
scene text character recognition; performance evaluation; text feature representation; feature descriptors; Global HOG; dictionary of visual words; coding-pooling
DOI
10.1109/ICDAR.2013.185
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recognizing text characters in natural scene images is a challenging problem because of background interference and the variety of character patterns. Scene Text Character (STC) recognition, which generally includes feature representation to model character structure and multi-class classification to predict the label and score of each character class, plays a significant role in word-level text recognition. The contribution of this paper is a complete performance evaluation of image-based STC recognition, comparing different sampling methods, feature descriptors, dictionary sizes, coding and pooling schemes, and SVM kernels. We systematically analyze the impact of each option in feature representation and classification. The evaluation results on two datasets, CHARS74K and ICDAR2003, demonstrate that the Histogram of Oriented Gradients (HOG) descriptor, soft-assignment coding, max pooling, and a Chi-Square Support Vector Machine (SVM) obtain the best performance among local sampling based feature representations. To improve STC recognition, we apply a global sampling feature representation: we generate Global HOG (GHOG) by computing the HOG descriptor from global sampling. GHOG enables better character structure modeling and obtains better performance than local sampling based feature representations. GHOG also outperforms existing methods on the two benchmark datasets.
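The abstract names the best-performing local pipeline (soft-assignment coding, max pooling, Chi-Square kernel) without giving formulas. The following is a minimal sketch of how those three steps commonly fit together, not the authors' implementation. The function names, the smoothing parameter `beta`, the kernel width `gamma`, the exponential form of the chi-square kernel, and the random stand-in descriptors are all assumptions made for illustration.

```python
import numpy as np

def soft_assign(descriptors, dictionary, beta=1.0):
    """Soft-assignment coding: weight every visual word for each descriptor.

    descriptors: (N, D) local descriptors (e.g. HOG vectors).
    dictionary:  (K, D) visual words.
    beta:        assumed smoothing parameter, not taken from the paper.
    """
    # Squared Euclidean distance from each descriptor to each visual word.
    diff = descriptors[:, None, :] - dictionary[None, :, :]
    d2 = np.sum(diff ** 2, axis=2)
    # Shift by the row minimum for numerical stability, then normalize rows.
    w = np.exp(-beta * (d2 - d2.min(axis=1, keepdims=True)))
    return w / w.sum(axis=1, keepdims=True)

def max_pool(codes):
    """Max pooling: keep the strongest response per visual word."""
    return codes.max(axis=0)

def chi2_kernel(x, y, gamma=1.0, eps=1e-12):
    """One common exponential chi-square kernel for non-negative vectors."""
    dist = np.sum((x - y) ** 2 / (x + y + eps))
    return np.exp(-gamma * dist)

# Random stand-ins for real descriptors and a learned dictionary (hypothetical sizes).
rng = np.random.default_rng(0)
descriptors = rng.random((20, 36))
dictionary = rng.random((8, 36))

codes = soft_assign(descriptors, dictionary)
feature = max_pool(codes)
other = max_pool(soft_assign(descriptors[:10], dictionary))
similarity = chi2_kernel(feature, other)
```

In a full system the pooled vector of each character image would be passed to an SVM with the kernel supplied as a precomputed Gram matrix; that step is omitted here.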
Pages: 907-911
Number of pages: 5
Related papers
50 in total
  • [1] Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning
    Coates, Adam
    Carpenter, Blake
    Case, Carl
    Satheesh, Sanjeev
    Suresh, Bipin
    Wang, Tao
    Wu, David J.
    Ng, Andrew Y.
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011: 440-445
  • [2] Feature Pooling in Scene Character Recognition: A Comprehensive Study
    Zhang, Zhong
    Wang, Hong
    Liu, Shuang
    Shao, Yunxue
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463: 2150-2157
  • [3] Optical Character Recognition for Scene Text Detection, Mining and Recognition
    Nathiya, N.
    Pradeepa, K.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013: 662-665
  • [4] A Feature Learning Method for Scene Text Recognition
    Ho Vu Duong
    Quoc Ngoc Ly
    [J]. 2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2012: 176-180
  • [5] A NEW PERSPECTIVE FOR FLEXIBLE FEATURE GATHERING IN SCENE TEXT RECOGNITION VIA CHARACTER ANCHOR POOLING
    Long, Shangbang
    Guan, Yushuo
    Bian, Kaigui
    Yao, Cong
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 2458-2462
  • [6] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
    Shang, Mingyu
    Gao, Jie
    Sun, Jun
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020
  • [7] A comparative study of Gabor feature and gradient feature for handwritten Chinese character recognition
    Ding, Kai
    Liu, Zhibin
    Jin, Lianwen
    Zhu, Xinghua
    [J]. 2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007: 1182-1186
  • [8] Synthetically Supervised Feature Learning for Scene Text Recognition
    Liu, Yang
    Wang, Zhaowen
    Jin, Hailin
    Wassell, Ian
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209: 449-465
  • [9] Random Projected Convolutional Feature for Scene Text Recognition
    Wu, Rui
    Yang, Shuli
    Leng, Dawei
    Luo, Zhenbo
    Wang, Yunhong
    [J]. PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016: 132-137
  • [10] Attention Guided Feature Encoding for Scene Text Recognition
    Hassan, Ehtesham
    Lekshmi, V. L.
    [J]. JOURNAL OF IMAGING, 2022, 8(10)