Exploiting State-of-the-Art Deep Learning Methods for Document Image Analysis

被引:7
|
作者
Pondenkandath, Vinaychandran [1 ,2 ]
Seuret, Mathias [2 ]
Ingold, Rolf [2 ]
Afzal, Muhammad Zeshan [1 ]
Liwicki, Marcus [1 ,2 ]
机构
[1] TU Kaiserslautern, MindGarage, Kaiserslautern, Germany
[2] Univ Fribourg, DIVA, Fribourg, Switzerland
基金
瑞士国家科学基金会;
关键词
RECOGNITION;
D O I
10.1109/ICDAR.2017.325
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides details of our (partially awardwinning) methods submitted to four competitions of ICDAR 2017. In particular, they are designed to (i) classify scripts, (ii) perform pixel-based labeling for layout analysis, (iii) identify writers, and (iv) recognize font size and types. The methods build on the current state-of-the-art in Deep Learning and have been adapted to the specific needs of the individual tasks. All methods are variants of Convolutional Neural Network (CNN) with specialized architectures, initialization, and other tricks which have been introduced in the field of deep learning within the last few years.
引用
收藏
页码:30 / 35
页数:6
相关论文
共 50 条
  • [21] Deep learning and the electrocardiogram: review of the current state-of-the-art
    Somani, Sulaiman
    Russak, Adam J.
    Richter, Felix
    Zhao, Shan
    Vaid, Akhil
    Chaudhry, Fayzan
    De Freitas, Jessica K.
    Naik, Nidhi
    Miotto, Riccardo
    Nadkarni, Girish N.
    Narula, Jagat
    Argulian, Edgar
    Glicksberg, Benjamin S.
    EUROPACE, 2021, 23 (08): : 1179 - 1191
  • [22] Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning
    Cummins, Nicholas
    Baird, Alice
    Schuller, Bjoern W.
    METHODS, 2018, 151 : 41 - 54
  • [23] TSC prediction and dynamic control of BOF steelmaking with state-of-the-art machine learning and deep learning methods
    Xie, Tian-yi
    Zhang, Cai-dong
    Zhou, Quan-lin
    Tian, Zhi-qiang
    Liu, Shuai
    Guo, Han-jie
    JOURNAL OF IRON AND STEEL RESEARCH INTERNATIONAL, 2024, 31 (01) : 174 - 194
  • [24] Deep Learning Applications for COVID-19 Analysis: A State-of-the-Art Survey
    Li, Wenqian
    Deng, Xing
    Shao, Haijian
    Wang, Xia
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2021, 129 (01): : 65 - 98
  • [25] TSC prediction and dynamic control of BOF steelmaking with state-of-the-art machine learning and deep learning methods
    Tian-yi Xie
    Cai-dong Zhang
    Quan-lin Zhou
    Zhi-qiang Tian
    Shuai Liu
    Han-jie Guo
    Journal of Iron and Steel Research International, 2024, 31 : 174 - 194
  • [26] Lightweight image super-resolution based on deep learning: State-of-the-art and future directions
    Gendy, Garas
    He, Guanghui
    Sabor, Nabil
    INFORMATION FUSION, 2023, 94 : 284 - 310
  • [27] Deep machine learning provides state-of-the-art performance in image-based plant phenotyping
    Pound, Michael P.
    Atkinson, Jonathan A.
    Townsend, Alexandra J.
    Wilson, Michael H.
    Griffiths, Marcus
    Jackson, Aaron S.
    Bulat, Adrian
    Tzimiropoulos, Georgios
    Wells, Darren M.
    Murchie, Erik H.
    Pridmore, Tony P.
    French, Andrew P.
    GIGASCIENCE, 2017, 6 (10):
  • [28] Artificial Intelligence and Deep Learning for Advancing PET Image Reconstruction: State-of-the-Art and Future Directions
    Hellwig, Dirk
    Hellwig, Nils Constantin
    Boehner, Steven
    Fuchs, Timo
    Fischer, Regina
    Schmidt, Daniel
    NUKLEARMEDIZIN-NUCLEAR MEDICINE, 2023, 62 (06): : 334 - 342
  • [29] A Roadmap to Deep Learning: A State-of-the-Art Step Towards Machine Learning
    Garg, Dweepna
    Goel, Parth
    Kandaswamy, Gokulnath
    Ganatra, Amit
    Kotecha, Ketan
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 160 - 170
  • [30] Deep Learning-Based Monocular Depth Estimation Methods-A State-of-the-Art Review
    Khan, Faisal
    Salahuddin, Saqib
    Javidnia, Hossein
    SENSORS, 2020, 20 (08)