Offline handwritten arabic character segmentation with probabilistic model

被引:0
|
作者
Xiu, PP [1 ]
Peng, LR [1 ]
Ding, XQ [1 ]
Wang, H [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research on offline handwritten Arabic character recognition has received more and more attention in recent years, because of the increasing needs of Arabic document digitization. The variation in Arabic handwriting brings great difficulty in character segmentation and recognition, eg., the sub-parts (diacritics) of the Arabic character may shift away from the main part. In this paper, a new probabilistic segmentation model is proposed. First, a contour-based over-segmentation method is conducted, cutting the word image into graphemes. The graphemes are sorted into 3 queues, which are character main parts, sub-parts (diacritics) above or below main parts respectively. The confidence for each character is calculated by the probabilistic model, taking into account both of the recognizer output and the geometric confidence besides with logical constraint. Then, the global optimization is conducted to find optimal cutting path, taking weighted average of character confidences as objective function. Experiments on handwritten Arabic documents with various writing styles show the proposed method is effective.
引用
收藏
页码:402 / 412
页数:11
相关论文
共 50 条
  • [41] Offline handwritten character detection using image components
    Basavaraj, L.
    Samuel, R. D. Sudhaker
    [J]. ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 461 - 465
  • [42] Offline handwritten Chinese character recognition by radical decomposition
    Shi, Daming
    Damper, Robert I.
    Gunn, Steve R.
    [J]. ACM Transactions on Asian Language Information Processing, 2003, 2 (01): : 27 - 48
  • [43] Polar Transformation System for Offline Handwritten Character Recognition
    Wang, Xianjing
    Sajjanhar, Awl
    [J]. SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING 2011, 2011, 368 : 15 - 24
  • [44] Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey
    Dey, Raghunath
    Balabantaray, Rakesh Chandra
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (06)
  • [45] A Comprehensive Survey of Handwritten Character Segmentation
    Vyas, Mayur
    Verma, Karun
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 1462 - 1465
  • [46] A CUSTOMIZABLE FUZZY SYSTEM FOR OFFLINE HANDWRITTEN CHARACTER RECOGNITION
    Batuwita, Rukshan
    Palade, Vasile
    Bandara, Dharmapriya C.
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2011, 20 (03) : 425 - 455
  • [47] Application of bidirectional probabilistic character language model in handwritten words recognition
    Sas, Jerzy
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS, 2006, 4224 : 679 - 687
  • [48] Handwritten Character Segmentation for Kannada Scripts
    Naveena, C.
    Aradhya, V. N. Manjunath
    [J]. PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 144 - 149
  • [49] Character segmentation in handwritten words - An overview
    Lu, Y
    Shridhar, M
    [J]. PATTERN RECOGNITION, 1996, 29 (01) : 77 - 96
  • [50] Character Segmentation in Malayalam Handwritten Documents
    Shanjana, C.
    James, Ajay
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING AND TECHNOLOGY RESEARCH (ICAETR), 2014,