Offline handwritten arabic character segmentation with probabilistic model

被引:0
|
作者
Xiu, PP [1 ]
Peng, LR [1 ]
Ding, XQ [1 ]
Wang, H [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The research on offline handwritten Arabic character recognition has received more and more attention in recent years, because of the increasing needs of Arabic document digitization. The variation in Arabic handwriting brings great difficulty in character segmentation and recognition, eg., the sub-parts (diacritics) of the Arabic character may shift away from the main part. In this paper, a new probabilistic segmentation model is proposed. First, a contour-based over-segmentation method is conducted, cutting the word image into graphemes. The graphemes are sorted into 3 queues, which are character main parts, sub-parts (diacritics) above or below main parts respectively. The confidence for each character is calculated by the probabilistic model, taking into account both of the recognizer output and the geometric confidence besides with logical constraint. Then, the global optimization is conducted to find optimal cutting path, taking weighted average of character confidences as objective function. Experiments on handwritten Arabic documents with various writing styles show the proposed method is effective.
引用
收藏
页码:402 / 412
页数:11
相关论文
共 50 条
  • [1] Segmentation of Offline Handwritten Arabic Text
    Ghaleb, Hashem
    Nagabhushan, P.
    Pal, Umapada
    [J]. 2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 41 - 45
  • [2] An Enhanced Technique for Offline Arabic Handwritten Words Segmentation
    Abdeen, Roqyiah M.
    Afifi, Ahmed
    El-Sisi, Ashraf B.
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 663 - 681
  • [3] New segmentation algorithm for individual offline handwritten character segmentation
    Batuwita, KBMR
    Bandara, GEMDC
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 215 - 229
  • [4] New segmentation algorithm for offline handwritten connected character segmentation
    Jayarathna, U. K. S.
    Bandara, G. E. M. D. C.
    [J]. 2006 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2, 2006, : 540 - +
  • [5] An improved segmentation algorithm for individual offline handwritten character segmentation
    Batuwita, K. B. M. R.
    Bandara, G. E. M. D. C.
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, : 982 - +
  • [6] An artificial immune system for offline isolated handwritten arabic character recognition
    Chaouki Boufenar
    Mohamed Batouche
    Marc Schoenauer
    [J]. Evolving Systems, 2018, 9 : 25 - 41
  • [7] An artificial immune system for offline isolated handwritten arabic character recognition
    Boufenar, Chaouki
    Batouche, Mohamed
    Schoenauer, Marc
    [J]. EVOLVING SYSTEMS, 2018, 9 (01) : 25 - 41
  • [8] Segmentation-driven offline handwritten Chinese and Arabic script recognition
    Ding, Xiaoqing
    Liu, Hailong
    [J]. ARABIC AND CHINESE HANDWRITING RECOGNITION, 2008, 4768 : 196 - 217
  • [9] Segmentation Accuracy for Offline Arabic Handwritten Recognition Based on Bounding Box Algorithm
    Humied, Ismail. A.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (09): : 98 - 109
  • [10] Off-line handwritten arabic character segmentation algorithm: ACSA
    Sari, T
    Souici, L
    Sellami, M
    [J]. EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 452 - 457