Accurate, data-efficient, unconstrained text recognition with convolutional neural networks

Cited by: 58
Authors
Yousef, Mohamed [1]
Hussain, Khaled F. [1]
Mohammed, Usama S. [2]
Affiliations
[1] Assiut Univ, Fac Comp & Informat, Comp Sci Dept, Asyut 71515, Egypt
[2] Assiut Univ, Elect Engn Dept, Fac Engn, Asyut 71515, Egypt
Keywords
Text recognition; Optical character recognition; Handwriting recognition; CAPTCHA solving; License plate recognition; Convolutional neural network; Deep learning; Scene text; LSTM
DOI
10.1016/j.patcog.2020.107482
Chinese Library Classification (CLC) Number
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Unconstrained text recognition is an important computer vision task, featuring a wide variety of sub-tasks, each with its own set of challenges. One of the biggest promises of deep neural networks has been the convergence and automation of feature extractors from raw input signals, allowing for the highest possible performance with minimal domain knowledge. To this end, we propose a data-efficient, end-to-end neural network model for generic, unconstrained text recognition. In our proposed architecture we strive for simplicity and efficiency without sacrificing recognition accuracy. The architecture is a fully convolutional network without any recurrent connections, trained with the CTC loss function. It therefore operates on arbitrary input sizes and produces strings of arbitrary length in a very efficient and parallelizable manner. We show the generality and superiority of our proposed text recognition architecture by achieving state-of-the-art results on seven public benchmark datasets covering a wide spectrum of text recognition tasks, namely: Handwriting Recognition, CAPTCHA Recognition, OCR, License Plate Recognition, and Scene Text Recognition. Our proposed architecture won the ICFHR2018 Competition on Automated Text Recognition on a READ Dataset. © 2020 Published by Elsevier Ltd.
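As a rough illustration of why a CTC-trained network can emit strings of arbitrary length, the sketch below shows greedy (best-path) CTC decoding in plain Python. It is not the authors' implementation: the function name `ctc_greedy_decode`, the blank index of 0, and the toy per-frame scores are all assumptions made for this example. The idea is that the network produces one score vector per horizontal position of the input, and the decoder takes the best label per frame, merges consecutive repeats, and drops blanks.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank symbol (a convention chosen for this sketch)


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]],
    alphabet: str,
    blank: int = BLANK,
) -> str:
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks.

    frame_scores[t][k] is the score of label k at frame t; label k > 0
    maps to alphabet[k - 1], and label `blank` is the CTC blank.
    """
    # Pick the highest-scoring label independently for every frame.
    best_path: List[int] = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]

    decoded: List[str] = []
    previous = blank
    for label in best_path:
        # Emit a character only when it is not blank and differs from the
        # previous frame's label, so runs like "a a a" collapse to "a".
        if label != blank and label != previous:
            decoded.append(alphabet[label - 1])
        previous = label
    return "".join(decoded)


# Toy scores for a two-letter alphabet; columns are [blank, 'a', 'b'].
scores = [
    [0.1, 0.8, 0.1],    # 'a'
    [0.1, 0.7, 0.2],    # 'a' (repeat, merged)
    [0.9, 0.05, 0.05],  # blank
    [0.1, 0.2, 0.7],    # 'b'
    [0.1, 0.1, 0.8],    # 'b' (repeat, merged)
]
print(ctc_greedy_decode(scores, "ab"))
```

Because the decoder only walks over however many frames it is given, a wider input image (more frames) needs no change to the code, and a blank between two identical labels is what allows doubled letters to survive the merge step.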
Pages: 12