Combining Convolutional Neural Networks and LSTMs for Segmentation-Free OCR

被引:12
|
作者
Rawls, Stephen [1 ]
Cao, Huaigu [1 ]
Kumar, Senthil [1 ]
Natarajan, Prem [1 ]
机构
[1] Univ Southern Calif, Informat Sci Inst, Los Angeles, CA 90007 USA
关键词
D O I
10.1109/ICDAR.2017.34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel end-to-end trainable OCR system combining a CNN for feature extraction with 1-D LSTMs for sequence modeling. We present results on English and Arabic handwriting data, and on English machine print data, showing state-of-the-art performance. We believe that our method is simpler than existing 2D LSTM models, and will make it easier to use techniques borrowed from CNN research in computer vision to improve OCR performance.
引用
收藏
页码:155 / 160
页数:6
相关论文
共 50 条
  • [21] Combining Convolutional Neural Networks for Fungi Classification
    Prommakhot, Anuruk
    Srinonchat, Jakkree
    [J]. IEEE ACCESS, 2024, 12 : 58021 - 58030
  • [22] Combining belief networks and neural networks for scene segmentation
    Feng, XJ
    Williams, CKI
    Felderhof, SN
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 467 - 483
  • [23] Combining Fully Convolutional and Recurrent Neural Networks for 3D Biomedical Image Segmentation
    Chen, Jianxu
    Yang, Lin
    Zhang, Yizhe
    Alber, Mark
    Chen, Danny Z.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [24] Combining neural networks and belief networks for image segmentation
    Williams, CKI
    Feng, XJ
    [J]. NEURAL NETWORKS FOR SIGNAL PROCESSING VIII, 1998, : 393 - 401
  • [25] Intelligent Localization of Transformer Internal Degradations Combining Deep Convolutional Neural Networks and Image Segmentation
    Duan, Jiajun
    He, Yigang
    Du, Bolun
    Ghandour, Ruaa M. Rashad
    Wu, Wenjie
    Zhang, Hui
    [J]. IEEE ACCESS, 2019, 7 : 62705 - 62720
  • [26] Segmentation-Free Ocular Detection and Recognition
    Rodriguez, Andres
    Panza, Jeffrey
    Kumar, B. V. K. Vijaya
    [J]. SENSING TECHNOLOGIES FOR GLOBAL HEALTH, MILITARY MEDICINE, DISASTER RESPONSE, AND ENVIRONMENTAL MONITORING AND BIOMETRIC TECHNOLOGY FOR HUMAN IDENTIFICATION VIII, 2011, 8029
  • [27] Convolutional Neural Networks for SAR Image Segmentation
    Malmgren-Hansen, David
    Nobel-Jorgensen, Morten
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 231 - 236
  • [28] Egyptian Hieroglyphs Segmentation with Convolutional Neural Networks
    Guidi, Tommaso
    Python, Lorenzo
    Forasassi, Matteo
    Cucci, Costanza
    Franci, Massimiliano
    Argenti, Fabrizio
    Barucci, Andrea
    [J]. ALGORITHMS, 2023, 16 (02)
  • [29] Convolutional neural networks for brain tumour segmentation
    Bhandari, Abhishta
    Koppen, Jarrad
    Agzarian, Marc
    [J]. INSIGHTS INTO IMAGING, 2020, 11 (01)
  • [30] Group Convolutional Neural Networks for DWI Segmentation
    Liu, Renfei
    Lauze, Francois
    Bekkers, Erik
    Erleben, Kenny
    Darkner, Sune
    [J]. GEOMETRIC DEEP LEARNING IN MEDICAL IMAGE ANALYSIS, VOL 194, 2022, 194 : 96 - 106