Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks

被引:2
|
作者
Fizaine, Florian Come [1 ,2 ]
Bard, Patrick [1 ]
Paindavoine, Michel [1 ]
Robin, Cecile [2 ,3 ]
Bouye, Edouard [2 ]
Lefevre, Raphael [4 ]
Vinter, Annie [1 ]
机构
[1] Univ Bourgogne, LEAD CNRS, F-21000 Dijon, France
[2] Arch Dept Cote dOr, F-21000 Dijon, France
[3] Inst Natl Patrimoine, F-75002 Paris, France
[4] Soc Natl Chemins Fer Francais, F-93200 St Denis, France
关键词
deep learning; line segmentation; instance segmentation; Mask-RCNN; U-Net; historical document analysis; DOCUMENTS;
D O I
10.3390/jimaging10030065
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Text line segmentation is a necessary preliminary step before most text transcription algorithms are applied. The leading deep learning networks used in this context (ARU-Net, dhSegment, and Doc-UFCN) are based on the U-Net architecture. They are efficient, but fall under the same concept, requiring a post-processing step to perform instance (e.g., text line) segmentation. In the present work, we test the advantages of Mask-RCNN, which is designed to perform instance segmentation directly. This work is the first to directly compare Mask-RCNN- and U-Net-based networks on text segmentation of historical documents, showing the superiority of the former over the latter. Three studies were conducted, one comparing these networks on different historical databases, another comparing Mask-RCNN with Doc-UFCN on a private historical database, and a third comparing the handwritten text recognition (HTR) performance of the tested networks. The results showed that Mask-RCNN outperformed ARU-Net, dhSegment, and Doc-UFCN using relevant line segmentation metrics, that performance evaluation should not focus on the raw masks generated by the networks, that a light mask processing is an efficient and simple solution to improve evaluation, and that Mask-RCNN leads to better HTR performance.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] EnRDeA U-Net Deep Learning of Semantic Segmentation on Intricate Noise Roads
    Yu, Xiaodong
    Kuan, Ta-Wen
    Tseng, Shih-Pang
    Chen, Ying
    Chen, Shuo
    Wang, Jhing-Fa
    Gu, Yuhang
    Chen, Tuoli
    ENTROPY, 2023, 25 (07)
  • [32] Development of Deep Learning with RDA U-Net Network for Bladder Cancer Segmentation
    Lee, Ming-Chan
    Wang, Shao-Yu
    Pan, Cheng-Tang
    Chien, Ming-Yi
    Li, Wei-Ming
    Xu, Jin-Hao
    Luo, Chi-Hung
    Shiue, Yow-Ling
    CANCERS, 2023, 15 (04)
  • [33] New U-Net for Image Deblurring Using Deep Learning
    Jeong W.
    Kim S.
    Lee C.
    Transactions of the Korean Institute of Electrical Engineers, 2023, 72 (07): : 843 - 848
  • [34] Brain Tumour Segmentation Using U-net Based Adversarial Networks
    Teki, Satyanarayana Murthy
    Varma, Mohan Krishna
    Yadav, Anjana K.
    TRAITEMENT DU SIGNAL, 2019, 36 (04) : 353 - 359
  • [35] DBT Masses Automatic Segmentation Using U-Net Neural Networks
    Lai, Xiaobo
    Yang, Weiji
    Li, Ruipeng
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2020, 2020
  • [36] Automated Cardiac Chamber Size and Cardiac Physiology Measurement in Water Fleas by U-Net and Mask RCNN Convolutional Networks
    Saputra, Ferry
    Farhan, Ali
    Suryanto, Michael Edbert
    Kurnia, Kevin Adi
    Chen, Kelvin H-C
    Vasquez, Ross D.
    Roldan, Marri Jmelou M.
    Huang, Jong-Chin
    Lin, Yih-Kai
    Hsiao, Chung-Der
    ANIMALS, 2022, 12 (13):
  • [37] Hippocampus segmentation in MR brain images using learned fuzzy mask and U-Net
    Sadeghi, Alireza
    Khutanlou, Hassan
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [38] Segmentation of dry bean (Phaseolus vulgaris L.) leaf disease images with U-Net and classification using deep learning algorithms
    Ramazan Kursun
    Kubilay Kurtulus Bastas
    Murat Koklu
    European Food Research and Technology, 2023, 249 : 2543 - 2558
  • [39] An Efficient and Optimal Deep Learning Architecture using Custom U-Net and Mask R-CNN Models for Kidney Tumor Semantic Segmentation
    Parvathi, Sitanaboina S. L.
    Jonnadula, Harikiran
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 314 - 320
  • [40] Segmentation of dry bean (Phaseolus vulgaris L.) leaf disease images with U-Net and classification using deep learning algorithms
    Kursun, Ramazan
    Bastas, Kubilay Kurtulus
    Koklu, Murat
    EUROPEAN FOOD RESEARCH AND TECHNOLOGY, 2023, 249 (10) : 2543 - 2558