Text Baseline Recognition Using a Recurrent Convolutional Neural Network

Authors
Woedlinger, Matthias [1]
Sablatnig, Robert [1]
Affiliation
[1] TU Wien, Comp Vis Lab, Vienna, Austria
Keywords
Applications of deep learning to document analysis; Document understanding; Historical document analysis; Segmentation
DOI
10.1109/ICPR48806.2021.9412624
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The detection of baselines of text is a necessary preprocessing step for many modern methods of automatic handwriting recognition. In this work, we present a two-stage system for the automatic detection of text baselines of handwritten text. In a first step, we perform pixel-wise segmentation on the document image to classify pixels as baselines, start points, end points and background. This segmentation is then used to extract the start points of lines. Starting from these points we extract the baseline using a recurrent convolutional neural network that directly outputs the baseline coordinates. This method allows the direct extraction of baseline coordinates as the output of a neural network without the use of any post-processing steps. We evaluate the model on the cBAD dataset from the ICDAR 2019 competition on baseline detection.
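The step in which the segmentation is used to extract the start points of lines can be illustrated with a small sketch. The abstract does not say how start points are derived from the segmentation, so the approach below (taking the centroid of each connected blob of start-point pixels) is an assumption, as are the class indices, the function name `extract_start_points`, and the `min_pixels` parameter. It is not the authors' implementation.

```python
from collections import deque

# Hypothetical class indices: the abstract names the four classes
# but does not specify how they are encoded.
BACKGROUND, BASELINE, START, END = 0, 1, 2, 3


def extract_start_points(class_map, min_pixels=1):
    """Return one (row, col) centroid per 4-connected blob of START pixels.

    class_map is a 2D list of per-pixel class indices, i.e. the argmax of a
    pixel-wise segmentation output. Blobs smaller than min_pixels are dropped.
    """
    height = len(class_map)
    width = len(class_map[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    points = []
    for r in range(height):
        for c in range(width):
            if class_map[r][c] != START or seen[r][c]:
                continue
            # Breadth-first flood fill collects one blob of START pixels.
            queue = deque([(r, c)])
            seen[r][c] = True
            blob = []
            while queue:
                y, x = queue.popleft()
                blob.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and not seen[ny][nx] and class_map[ny][nx] == START:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(blob) >= min_pixels:
                cy = sum(p[0] for p in blob) / len(blob)
                cx = sum(p[1] for p in blob) / len(blob)
                points.append((cy, cx))
    return points


# Toy class map with two text lines, each beginning with a START blob.
toy = [
    [0, 0, 0, 0, 0, 0],
    [2, 2, 1, 1, 3, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 2, 1, 1, 1, 3],
]
print(extract_start_points(toy))
```

In the system described in the abstract, each such point would then seed the recurrent convolutional network that outputs the baseline coordinates; that second stage is not sketched here.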
Pages: 4673-4679
Page count: 7