A Novel Text Line Segmentation Method Based on Contour Curve Tracking for Tibetan Historical Documents

被引:14
|
作者
Zhou, Fengming [1 ]
Wang, Weilan [1 ]
Lin, Qiang [1 ]
机构
[1] Northwest Minzu Univ, Coll Math & Comp Sci, Lanzhou, Gansu, Peoples R China
基金
美国国家科学基金会;
关键词
Tibetan historical document; text line segmentation; barycentre coordinates; connected component; contour curve; FREESTYLE HANDWRITTEN DOCUMENTS;
D O I
10.1142/S0218001418540253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we proposed a novel method for text line segmentation of Tibetan historical document image with uchen script based on contour tracking. Our method is mainly to segment the text lines from the image documents using the contour curve of the text lines, which consists of three parts: First, we calculate the barycentre coordinates of the connected components for the text regions, and then the barycentre of each text line is connected in order, so that the main part of each text line is connected and a new connected component is formed; then the contour curve of the connected component is obtained using the contour tracing algorithm; Second, the contour curve and the barycentre gravity are used to assign key elements (such as the syllable point, the upper vowel, the lower vowel, and the broken strokes and so on) of the text lines, and next the candidate text lines are obtained based on these connected components; Finally, the contour tracking algorithm is used to calculate the contour curve of the candidate text lines and segment the text lines. We evaluated our text line segmentation method on the 200 document image data sets. Experimental results show that the proposed method based on contour curve tracing can accurately segment the text lines of image documents and achieve the encouraging results.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] A Text-Line Segmentation Method for Historical Tibetan Documents Based on Baseline Detection
    Li, Yanxing
    Ma, Longlong
    Duan, Lijuan
    Wu, Jian
    [J]. COMPUTER VISION, PT I, 2017, 771 : 356 - 367
  • [2] Research on Text Line Segmentation of Historical Tibetan Documents Based on the Connected Component Analysis
    Wang, Yiqun
    Wang, Weilan
    Li, Zhenjiang
    Han, Yuehui
    Wang, Xiaojuan
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 74 - 87
  • [3] Text Line Segmentation of Tibetan Historical Documents Based on Text Core Regions Combined with Expansion Growth
    Li Jincheng
    Wang Xiaojuan
    Wang Weilan
    Lin Qiang
    Hu Pengfei
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (02)
  • [4] A novel method of text line segmentation for historical document image of the uchen Tibetan
    Li, Zhenjiang
    Wang, Weilan
    Chen, Yang
    Hao, Yusheng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 61 : 23 - 32
  • [5] Text line segmentation of historical documents: a survey
    Laurence Likforman-Sulem
    Abderrazak Zahour
    Bruno Taconet
    [J]. International Journal of Document Analysis and Recognition (IJDAR), 2007, 9 : 123 - 138
  • [6] Text line segmentation of historical documents: a survey
    Likforman-Sulem, Laurence
    Zahour, Abderrazak
    Taconet, Bruno
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) : 123 - 138
  • [7] Text Line segmentation of historical Arabic documents
    Zahour, Abderrazak
    Likforman-Sulem, Laurence
    Boussalaa, Wafa
    Taconet, Bruno
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 138 - +
  • [8] Text Line Segmentation in Images of Handwritten Historical Documents
    Sanchez, A.
    Suarez, P. D.
    Melloz, C. A. B.
    Oliveira, A. L. I.
    Alves, V. M. O.
    [J]. 2008 FIRST INTERNATIONAL WORKSHOPS ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2008, : 232 - +
  • [9] A Tracking Approach for Text Line Segmentation in Handwritten Documents
    Setitra, Insaf
    Hadjadj, Zineb
    Meziane, Abdelkrim
    [J]. ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 193 - 198
  • [10] Touching text line segmentation combined local baseline and connected component for Uchen Tibetan historical documents
    Hu, Pengfei
    Wang, Weilan
    Li, Qiaoqiao
    Wang, Tiejun
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (06)