Fast Chinese calligraphic character recognition with large-scale data

被引:13
|
作者
Gao Pengcheng [1 ]
Wu Jiangqin [1 ]
Lin Yuan [1 ]
Xia Yang [1 ]
Mao Tianjiao [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310003, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Calligraphic character; Shape context; Fast recognition; SHAPE;
D O I
10.1007/s11042-014-1969-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Chinese calligraphy draws a lot of attention for its beauty and elegance. But due to the complexity of shape and styles of calligraphic characters, it is difficult for common users to recognize them. Thus it would be great if a tool is provided to help users to recognize the unknown calligraphic characters. The well-known OCR (Optical Character Recognition) technology can hardly help people to recognize the unknown characters because of their deformation and complexity. In CADAL, a Calligraphic Character Dictionary (CalliCD) which contains character images labeled with semantic meaning has been constructed and provided to common users to use online. With the help of CalliCD, user can learn more about the unknown calligraphic character by performing similarity based searching. But as with the growth of CalliCD, it takes intolerable time to do the similarity based one-to-one searching. Strategies that can handle large scale data are needed. In this paper, a fast recognition schema based on retrieval is proposed. In addition, a novel shape descriptor, called GIST-SC, is proposed to represent calligraphic character image for efficient and effective retrieval. The schema works in three steps. Firstly approximate nearest neighbors of the character image to be recognized are found quickly. Secondly, one-to-one fine matching between approximate nearest neighbors and the character image to be recognized is performed. Finally the recognition based on semantic probability is given. Our experiments show that the GIST-SC descriptor and the recognition schema are efficient and effective for Chinese calligraphic character recognition with CalliCD.
引用
收藏
页码:7221 / 7238
页数:18
相关论文
共 50 条
  • [1] Fast Chinese calligraphic character recognition with large-scale data
    Gao Pengcheng
    Wu Jiangqin
    Lin Yuan
    Xia Yang
    Mao Tianjiao
    [J]. Multimedia Tools and Applications, 2015, 74 : 7221 - 7238
  • [2] Fast age-based Chinese Calligraphic Character Retrieval on Large Scale Data
    Gao Pengcheng
    Wu Jiangqin
    Lin Yuan
    Xia Yang
    Mao Tianjiao
    Wei Baogang
    [J]. 2014 IEEE/ACM JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2014, : 211 - 218
  • [3] LSH-Based Large Scale Chinese Calligraphic Character Recognition
    Lin, Yuan
    Wu, Jiangqin
    Gao, Pengcheng
    Xia, Yang
    Mao, Tianjiao
    [J]. JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, : 323 - 329
  • [4] Fast Searching Chinese Calligraphic Information Based on Character Recognition
    Yang, Lijie
    Ma, Jie
    Xu, Tianchen
    Pang, Zhequn
    [J]. PROCEEDINGS OF 4TH IEEE INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION 2018 ( IEEE ICASI 2018 ), 2018, : 358 - 361
  • [5] EMBEDDED LARGE-SCALE HANDWRITTEN CHINESE CHARACTER RECOGNITION
    Chherawala, Youssouf
    Dolfing, Hans J. G. A.
    Dixon, Ryan S.
    Bellegarda, Jerome R.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8169 - 8173
  • [6] Large-scale continual learning for ancient Chinese character recognition
    Xu, Yue
    Zhang, Xu-Yao
    Zhang, Zhaoxiang
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2024, 150
  • [7] Large-scale Optical Character Recognition of Pre-modern Chinese Texts
    Sturgeon, Donald
    [J]. INTERNATIONAL JOURNAL OF BUDDHIST THOUGHT & CULTURE, 2018, 28 (02): : 11 - 44
  • [8] LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition
    Wu, Xiangping
    Chen, Qingcai
    Xiao, Yulun
    Li, Wei
    Liu, Xin
    Hu, Baotian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3427 - 3440
  • [9] Skeleton-Based Recognition of Chinese Calligraphic Character Image
    Yu, Kai
    Wu, Jiangqin
    Zhuang, Yueting
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 228 - 237
  • [10] Chinese character handwriting: A large-scale behavioral study and a database
    Ruiming Wang
    Shuting Huang
    Yacong Zhou
    Zhenguang G. Cai
    [J]. Behavior Research Methods, 2020, 52 : 82 - 96