Read-Write-Learn: Self-Learning for Handwriting Recognition

被引:0
|
作者
Boteanu, Adrian [1 ]
Cheng, Du [1 ]
Kadioglu, Serdar [1 ]
机构
[1] Fidel Investments, Boston, MA 02210 USA
关键词
handwriting recognition; handwriting generation; self-learning;
D O I
10.1145/3573128.3609343
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwriting recognition relies on supervised data for training. Annotations typically include both the written text and the author's identity to facilitate the recognition of a particular style. A large annotation set is required for robust recognition, which is not always available in historical texts and low-annotation languages. To mitigate this challenge, we propose the Read-Write-Learn framework. In this setting, we augment the training process of handwriting recognition with a language model and a handwriting generator. Specifically, in the first reading step, we employ a language model to identify text that is likely detected correctly by the recognition model. Then, in the writing step, we generate more training data in the same writing style. Finally, in the learning step, we use the newly generated data in the same writing style to finetune the recognition model. Our Read-Write-Learn framework allows the recognition model to incrementally converge on the new style. Our experiments on historical handwritten documents demonstrate the benefits of the approach, and we present several examples to showcase improved recognition.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Neuroimaging correlates of handwriting quality as children learn to read and write
    Gimenez, Paul
    Bugescu, Nicolle
    Black, Jessica M.
    Hancock, Roeland
    Pugh, Kenneth
    Nagamine, Masanori
    Kutner, Emily
    Mazaika, Paul
    Hendren, Robert
    McCandliss, Bruce D.
    Hoeft, Fumiko
    [J]. FRONTIERS IN HUMAN NEUROSCIENCE, 2014, 8
  • [2] Learn to read and write: app for the literacy learning
    Gomez-Diaz, Raquel
    Garcia-Rodriguez, Araceli
    Antonio Cordon-Garcia, Jose
    [J]. EDUCATION IN THE KNOWLEDGE SOCIETY, 2015, 16 (04): : 118 - 137
  • [3] Learning-to-learn efficiently with self-learning
    Kunde, Shruti
    Choudhry, Sharod Roy
    Pandit, Amey
    Singhal, Rekha
    [J]. PROCEEDINGS OF THE 6TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2022, 2022,
  • [4] Learning to read in order to write, learning to write in order to be read
    Dhondt, JL
    [J]. ANNALES DE BIOLOGIE CLINIQUE, 2001, 59 (04) : 381 - 381
  • [5] THEY CAN ALL LEARN TO READ AND WRITE
    CUNNINGHAM, PM
    [J]. EDUCATIONAL LEADERSHIP, 1986, 43 (05) : 82 - 83
  • [6] LEARNING TO READ AND WRITE
    Galifret-Granjon, N.
    [J]. ANNEE PSYCHOLOGIQUE, 1952, 52 (02): : 443 - 456
  • [7] LEARNING TO READ AND WRITE
    BRACEY, GW
    [J]. PHI DELTA KAPPAN, 1989, 70 (07) : 559 - 564
  • [8] SELF-LEARNING COMPUTER PROGRAM FOR CELL RECOGNITION
    BARTELS, PH
    BAHR, GF
    BELLAMY, JC
    BIBBO, M
    RICHARDS, DL
    WIED, GL
    [J]. ACTA CYTOLOGICA, 1970, 14 (08) : 486 - &
  • [9] Towards Self-Learning Optical Music Recognition
    Pacha, Alexander
    Eidenberger, Horst
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 795 - 800
  • [10] Teaching to read and write to learn in Primary Education
    Martinez, Isabel
    Martin, Elena
    Mateos, Mar
    [J]. CULTURA Y EDUCACION, 2011, 23 (03): : 399 - 414