Off line Arabic character recognition - A review

被引:119
|
作者
Khorsheed, MS [1 ]
机构
[1] Univ Cambridge, Comp Lab, Cambridge CB2 3QG, England
关键词
Arabic OCR; feature extraction; Fourier Transform; hidden Markov models; horizontal projection; Hough Transform; neural networks; off-line recognition; preprocessing segmentation; vertical projection;
D O I
10.1007/s100440200004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Off-line recognition requires transferring the text under consideration into an image file. This represents the only available solution to bring the printed materials to the electronic media. However, the transferring process causes the system to lose the temporal information of that text. Other complexities that an off-line recognition system has to deal with are the lower resolution of the document and the poor binarisation, which can contribute to readability when essential features of the characters are deleted or obscured. Recognising Arabic script presents two additional challenges: orthography is cursive and letter shape is context sensitive. Certain character combinations form new ligature shapes, which are often font-dependent. Some ligatures involve vertical stacking of characters. Since not all letters connect, word boundary location becomes an interesting problem, as spacing may separate not only words, but also certain characters within a word. Various techniques have been implemented to achieve high recognition rates. These techniques have tackled different aspects of the recognition system. This review is organised into five major sections, covering a general overview, Arabic writing characteristics, Arabic text recognition system, Arabic OCR software and conclusions.
引用
收藏
页码:31 / 45
页数:15
相关论文
共 50 条
  • [1] Off-Line Arabic Character Recognition – A Review
    M. S. Khorsheed
    [J]. Pattern Analysis & Applications, 2002, 5 : 31 - 45
  • [2] Off line Arabic character recognition - A survey
    Amin, A
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 596 - 599
  • [3] OFF-LINE ARABIC CHARACTER-RECOGNITION
    GORAINE, H
    USHER, M
    ALEMAMI, S
    [J]. COMPUTER, 1992, 25 (07) : 71 - 74
  • [4] Off-line Arabic character recognition: The state of the art
    Amin, A
    [J]. PATTERN RECOGNITION, 1998, 31 (05) : 517 - 530
  • [5] Investigation on deep learning for off-line handwritten Arabic character recognition
    Boufenar, Chaouki
    Kerboua, Adlen
    Batouche, Mohamed
    [J]. COGNITIVE SYSTEMS RESEARCH, 2018, 50 : 180 - 195
  • [6] Arabic Optical Character Recognition: A Review
    Alghyaline, Salah
    [J]. CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 135 (03): : 1825 - 1861
  • [7] Arabic optical character recognition software: A review
    Alkhateeb F.
    Abu Doush I.
    Albsoul A.
    [J]. Pattern Recognition and Image Analysis, 2017, 27 (4) : 763 - 776
  • [8] Off Line Arabic Handwritten Character Using Neural Network
    Shamsan, Ehab A.
    Khalifa, Othman O.
    Hassan, Aisha
    Hamdan, H. G. Muhammad
    [J]. 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SMART INSTRUMENTATION, MEASUREMENT AND APPLICATION (ICSIMA 2017), 2017,
  • [9] Off-line arabic signature recognition and verification
    Ismail, MA
    Gad, S
    [J]. PATTERN RECOGNITION, 2000, 33 (10) : 1727 - 1740
  • [10] Arabic character recognition
    Dershowitz, Nachum
    Rosenberg, Andrey
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8001 : 584 - 602