An image database of handwritten Bangla words with automatic benchmarking facilities for character segmentation algorithms

被引:0
|
作者
Samir Malakar
Ram Sarkar
Subhadip Basu
Mahantapas Kundu
Mita Nasipuri
机构
[1] Asutosh College,Department of Computer Science
[2] Jadavpur University,Department of Computer Science and Engineering
来源
关键词
Character segmentation; Handwritten word; Bangla script; Image database; Word recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Recognition of unconstrained handwritten word images is an interesting research problem which gets more challenging when lexicon-free words are considered. Prerequisite for developing a lexicon-free handwritten word recognition technique is the segmentation of a word image into its constituent character set. Therefore, a competent character segmentation technique is required to design a comprehensive word recognition module. However, the literature study reveals that there is no standard word image database with ground truth information. As a result, most character segmentation algorithms found in the literature rely on self-made databases with manual evaluation. To fill the research need, in the present scope of the work, a comprehensive database consisting of handwritten Bangla word images is prepared primarily for evaluating any character segmentation algorithms. Additionally, the present work also provides two types of ground truth images related to segmented character shapes of the word images. Besides, an evaluation tool is developed for assessing the performance of any character segmentation algorithm on the developed benchmark database. The benchmark result, as found here, is 0.9212 (F-score) which outperforms some state-of-the-art methods.
引用
收藏
页码:449 / 468
页数:19
相关论文
共 50 条
  • [1] An image database of handwritten Bangla words with automatic benchmarking facilities for character segmentation algorithms
    Malakar, Samir
    Sarkar, Ram
    Basu, Subhadip
    Kundu, Mahantapas
    Nasipuri, Mita
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (01): : 449 - 468
  • [2] An Approach for Character Segmentation of Handwritten Bangla and Devanagari Script
    Bhattad, Anmol J.
    Chaudhuri, Bidyut B.
    [J]. 2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 676 - 680
  • [3] A novel segmentation technique for online handwritten Bangla words
    Sen, Shibaprasad
    Chowdhury, Shubham
    Mitra, Mridul
    Schwenker, Friedhelm
    Sarkar, Ram
    Roy, Kaushik
    [J]. PATTERN RECOGNITION LETTERS, 2020, 139 : 26 - 33
  • [4] Character segmentation in handwritten words - An overview
    Lu, Y
    Shridhar, M
    [J]. PATTERN RECOGNITION, 1996, 29 (01) : 77 - 96
  • [5] An image database for benchmarking of automatic face detection and recognition algorithms
    Loui, AC
    Judice, CN
    Liu, S
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 146 - 150
  • [6] Performance Evaluation of Different Algorithms for Handwritten Isolated Bangla Character Recognition
    Meerza, Syed Irfan Ali
    Islam, Moinul
    Uzzal, Md. Mohiuddin
    [J]. 2019 1ST INTERNATIONAL CONFERENCE ON ROBOTICS, ELECTRICAL AND SIGNAL PROCESSING TECHNIQUES (ICREST), 2019, : 412 - 416
  • [7] Benchmarking Image Segmentation Algorithms
    Estrada, Francisco J.
    Jepson, Allan D.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 85 (02) : 167 - 181
  • [8] An Efficient Line Segmentation Approach for Handwritten Bangla Document Image
    Mullick, K.
    Banerjee, S.
    Bhattacharya, U.
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 130 - +
  • [9] Benchmarking Image Segmentation Algorithms
    Francisco J. Estrada
    Allan D. Jepson
    [J]. International Journal of Computer Vision, 2009, 85 : 167 - 181
  • [10] A benchmark image database of isolated Bangla handwritten compound characters
    Nibaran Das
    Kallol Acharya
    Ram Sarkar
    Subhadip Basu
    Mahantapas Kundu
    Mita Nasipuri
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 413 - 431