A novel comprehensive database for offline Persian handwriting recognition

被引:11
|
作者
Sadri, Javad [1 ,2 ]
Yeganehzad, Mohammad Reza [2 ]
Saghi, Javad [2 ]
机构
[1] Concordia Univ, Fac Engn & Comp Sci, Dept Comp Sci & Software Engn, Montreal, PQ H3G 1M8, Canada
[2] Univ Birjand, Fac Elect & Comp Engn, Dept Comp Engn, POB 615-97175, Bidand, Iran
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Persian handwriting recognition; Persian offline recognition; Persian handwriting database; Check recognition; Numeral string; Digit recognition; Persian date; Persian alphabet; Unconstrained Persian handwriting recognition; DIGIT RECOGNITION; ONLINE; SEGMENTATION;
D O I
10.1016/j.patcog.2016.03.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Developing a standard database for offline handwriting recognition is an essential task. This paper offers a novel comprehensive database for conducting research on offline Persian handwriting recognition. Seven pages of forms were designed and completed by 500 native Persian writers, who were equally balanced in terms of gender and randomly selected from all over Iran. Then, the completed forms were scanned at a resolution of 300 DPI. Through several intensive processing steps, a huge number of isolated digits, numeral strings, touching digits, dates, words, names, alphabetical letters, free texts, arithmetic, and especial symbols from all these forms were extracted and organized as a standard database. All samples in this database were assigned with detailed ground truth and stored in three color formats: true color, gray level, and binary. Also, all subsets of this database were randomly partitioned into training, validation, and testing sets. We hope this comprehensive database will extend research in the pattern recognition community. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:378 / 393
页数:16
相关论文
共 50 条
  • [21] A Spanish dataset for reproducible benchmarked offline handwriting recognition
    Espana-Boquera, Salvador
    Jose Castro-Bleda, Maria
    LANGUAGE RESOURCES AND EVALUATION, 2022, 56 (03) : 1009 - 1022
  • [22] Offline Arabic Handwriting Recognition System based on HMM
    Xiang, Dong
    Yan, Huahua
    Chen, Xianqiao
    Cheng, Yanfen
    PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 526 - 529
  • [23] Combining Online and Offline Systems for Arabic Handwriting Recognition
    Azeem, Sherif Abdel
    Ahmed, Hany
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3725 - 3728
  • [24] Offline Chinese handwriting recognition: An assessment of current technology
    Srihari S.N.
    Yang X.
    Ball G.R.
    Front. Comput. Sci. China, 2007, 2 (137-155): : 137 - 155
  • [25] Offline Arabic Handwriting Recognition Using BLSTMs Combination
    Jemni, Sana Khamekhem
    Kessentini, Yousri
    Kanoun, Slim
    Ogier, Jean-Marc
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 31 - 36
  • [27] Recognition of Persian online handwriting using elastic fuzzy pattern recognition
    Halavati, Ramin
    Shouraki, Saeed Bagheri
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (03) : 491 - 513
  • [28] White-Space Models for Offline Arabic Handwriting Recognition
    Dreuw, Philippe
    Jonas, Stephan
    Ney, Hermann
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2656 - 2659
  • [29] Hybrid modeling of an OffLine Arabic Handwriting Recognition System AHRS
    Meddeb, Ons
    Maraoui, Mohsen
    Aljawarneh, Shadi
    2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2016,
  • [30] Offline Handwriting Recognition on Devanagari using a new Benchmark Dataset
    Dutta, Kartik
    Krishnan, Praveen
    Mathew, Minesh
    Jawahar, C. V.
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 25 - 30