Multi-oriented touching text character segmentation in graphical documents using dynamic programming

Cited by: 32
Authors
Roy, Partha Pratim [1]
Pal, Umapada
Llados, Josep [1]
Delalandre, Mathieu [2]
Affiliations
[1] Autonomous Univ Barcelona, CVC, E-08193 Barcelona, Spain
[2] Univ Tours, LI Lab, F-37041 Tours, France
Keywords
Touching character segmentation; Multi-oriented character recognition; Dynamic programming; Handwritten numeral strings; Recognition
DOI
10.1016/j.patcog.2011.09.026
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The touching character segmentation problem becomes complex when touching strings are multi-oriented. Moreover, in graphical documents, characters within a single touching string sometimes have different orientations, which makes segmentation more challenging. In this paper, we present a scheme for segmenting English multi-oriented touching strings into individual characters. When two or more characters touch, they generate a large cavity region in the background portion. Using convex hull information, we first use this background information to find initial segmentation points that split a touching string into possible primitives (a primitive consists of a single character or part of a character). Next, the primitives are merged to obtain the optimum segmentation. A dynamic programming algorithm is applied for this purpose, using the total likelihood of characters as the objective function. An SVM classifier is used to estimate the likelihood of a character. To handle multi-oriented touching strings, the features used in the SVM are invariant to character orientation. Experiments were performed on different databases of real and synthetic touching characters, and the results show that the method is efficient in segmenting touching characters of arbitrary orientations and sizes. (C) 2011 Elsevier Ltd. All rights reserved.
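The primitive-merging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `best_segmentation`, the `score` callback (standing in for the SVM character likelihood), and the `max_merge` limit on how many consecutive primitives may form one character are all assumptions made for illustration. The sketch assumes the primitives are ordered along the string and that the objective is the plain sum of per-character scores.

```python
from typing import Callable, List, Tuple


def best_segmentation(
    n: int,
    score: Callable[[int, int], float],
    max_merge: int = 4,
) -> Tuple[float, List[Tuple[int, int]]]:
    """Group n ordered primitives into characters with maximum total score.

    score(i, j) is a hypothetical likelihood that primitives i..j-1,
    merged together, form one character (e.g. a classifier confidence).
    max_merge is an assumed cap on primitives per character.
    """
    neg_inf = float("-inf")
    best = [neg_inf] * (n + 1)  # best[j]: best total score for primitives 0..j-1
    back = [0] * (n + 1)        # back[j]: start index of the last group in that solution
    best[0] = 0.0

    for j in range(1, n + 1):
        for i in range(max(0, j - max_merge), j):
            if best[i] == neg_inf:
                continue
            candidate = best[i] + score(i, j)
            if candidate > best[j]:
                best[j] = candidate
                back[j] = i

    # Walk the back-pointers to recover the chosen groups, left to right.
    groups: List[Tuple[int, int]] = []
    j = n
    while j > 0:
        i = back[j]
        groups.append((i, j))
        j = i
    groups.reverse()
    return best[n], groups


if __name__ == "__main__":
    # Toy scores (made up): primitives 0 and 1 together look like one character.
    toy = {(0, 2): 0.9, (2, 3): 0.8, (3, 4): 0.7}
    total, groups = best_segmentation(4, lambda i, j: toy.get((i, j), 0.1))
    print(groups)
```

With the toy scores, the first two primitives are merged and the remaining two are kept as separate characters. A plain sum of likelihoods tends to favor more groups, so a practical objective might normalize or use log-likelihoods; the abstract does not specify this, so the sketch leaves the scoring function to the caller.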
Pages: 1972-1983
Number of pages: 12