Graph Model Optimization based Historical Chinese Character Segmentation Method

被引:6
|
作者
Ji, Jingning [1 ]
Peng, Liangrui [1 ]
Li, Bohan [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
historical Chinese document; character segmentation; graph model;
D O I
10.1109/DAS.2014.57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Historical Chinese document recognition technology is important for digital library. However, historical Chinese character segmentation remains a difficult problem due to the complex structure of Chinese characters and various writing styles. This paper presents a novel method for historical Chinese character segmentation based on graph model. After a preliminary over-segmentation stage, the system applies a merging process. The candidate segmentation positions are denoted by the nodes of a graph, and the merging process is regarded as selecting an optimal path of the graph. The weight of edge in the graph is calculated by the cost function which considers geometric features and recognition confidence. Experimental results show that the proposed method is effective with a detection rate of 94.6% and an accuracy rate of 96.1% on a test set of practical historical Chinese document samples.
引用
收藏
页码:282 / 286
页数:5
相关论文
共 50 条
  • [1] Local Projection based Character Segmentation Method for Historical Chinese Documents
    Yang, Linjie
    Peng, Liangrui
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [2] Touching Character Segmentation Method for Chinese Historical Documents
    Sun, Xiaolu
    Peng, Liangrui
    Ding, Xiaoqing
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [3] Chinese Character Components Segmentation Method Based on Faster RCNN
    Gao, Xiang
    Yang, Fang
    Chen, Tian
    Si, Jianhui
    IEEE ACCESS, 2022, 10 : 98095 - 98103
  • [4] Segmentation Based on Shape Prior and Graph Model Optimization
    Xiao, Qinkun
    Zhang, Nan
    Gao, Song
    Li, Fei
    Gao, Yue
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 3, 2010, : 405 - 408
  • [5] A recognition-based method for segmentation of Chinese character in images and videos
    Yang, Wuyi
    Zhang, Shuwu
    Zheng, Haibo
    Zeng, Zhi
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 723 - 728
  • [6] HRRegionNet: Chinese Character Segmentation in Historical Documents with Regional Awareness
    Tang, Chia-Wei
    Liu, Chao-Lin
    Chiu, Po-Sen
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 3 - 17
  • [7] HRCenterNet: An Anchorless Approach to Chinese Character Segmentation in Historical Documents
    Tang, Chia-Wei
    Liu, Chao-Lin
    Po-Sen Chiu
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1924 - 1930
  • [8] Historical Chinese Character Recognition Method Based on Style Transfer Mapping
    Li, Bohan
    Peng, Liangrui
    Ji, Jingning
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 96 - 100
  • [9] A Component Recognition-based Chinese Character Segmentation and Structure Discrimination Method
    Bao, Yongtang
    Qi, Yue
    Yu, Bowen
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENT COMMUNICATION, 2015, 16 : 371 - 374
  • [10] Character Segmentation Method for Irregularly Arranged Text in Chinese
    Yang X.
    Niu X.
    Liang W.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (09): : 1542 - 1548