A Comparative Study between Methods of Arabic Baseline Detection

Cited by: 14
Authors
AL-Shatnawi, Atallah [1]
Omar, Khairuddin [1]
Affiliation
[1] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Dept Syst Sci & Management, Bangi, Malaysia
Keywords
OCR; Handwritten; Offline; Arabic; Preprocessing; Baseline; Horizontal Projection; Skeleton; Contour; Principal Component Analysis; CHARACTER-RECOGNITION;
DOI
10.1109/ICEEI.2009.5254814
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Preprocessing is the most important stage in an Arabic OCR system; it directly affects the reliability and efficiency of the segmentation and feature extraction stages. Arabic is written cursively, and each of its characters takes between two and four shapes. An Arabic word typically consists of two or more characters connected along an imaginary line called the baseline. Detecting the baseline is one of the main tasks in preprocessing for an Arabic OCR system. The baseline can be used for both skew normalization and character segmentation. This paper lists and clarifies the challenges facing Arabic baseline detection methods and provides a brief comparison between those methods. The comparison is based on the nature of written Arabic, the diacritics (such as dots and zigzags), the word slope, and the subwords found.
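As a rough illustration of one of the method families named in the keywords (horizontal projection), the sketch below estimates a baseline as the image row with the most foreground pixels. It is not taken from the paper: the function names, the toy image, and the assumption of a skew-free binary image (1 for ink, 0 for background) are all hypothetical choices made for illustration only.

```python
from typing import List


def horizontal_projection(image: List[List[int]]) -> List[int]:
    """Count foreground (1) pixels in each row of a binary image."""
    return [sum(row) for row in image]


def estimate_baseline_row(image: List[List[int]]) -> int:
    """Return the index of the row with the most foreground pixels.

    Assumption (not from the paper): the word is not skewed, so the
    connecting strokes pile up in a single row of the projection profile.
    """
    profile = horizontal_projection(image)
    if not profile or max(profile) == 0:
        raise ValueError("image has no foreground pixels")
    return profile.index(max(profile))


# Hypothetical toy "word": vertical strokes joined by one horizontal run,
# with a few pixels below the line standing in for descenders or dots.
toy_word = [
    [0, 0, 1, 0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 0],
    [0, 0, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0, 0, 0, 0],
]

baseline = estimate_baseline_row(toy_word)
print("projection profile:", horizontal_projection(toy_word))
print("estimated baseline row:", baseline)
```

A peak-picking rule like this is only meaningful when the text line is close to horizontal; sloped words, short subwords, and dense diacritics are among the cases the abstract lists as challenges for baseline detection.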
Pages: 73-77
Page count: 5