CFOR: Character-First Open-Set Text Recognition via Context-Free Learning

被引:0
|
作者
Liu, Chang [1 ,2 ]
Yang, Chun [1 ]
Fang, Zhiyu [1 ]
Qin, Hai-Bo [1 ]
Yin, Xu-Cheng [1 ]
机构
[1] University of Science and Technology Beijing, School of Computer and Communication Engineering, Beijing,100083, China
[2] Luleå Tekniska Universitet, ML-Group, Luleå,971 87, Sweden
关键词
Adversarial machine learning;
D O I
10.1109/TIP.2024.3480711
中图分类号
学科分类号
摘要
The open-set text recognition task is a generalized form of the (close-set) text recognition task, where the model is further challenged to spot and incrementally recognize novel characters not covered by the training data. Novel characters also indicate that the language model of the training set is biased from the 'real-world'. In this work, we alleviate the confounding effect of such biases by learning from individual character representations isolated from their context. Specifically, we propose a Character-First Open-Set Text Recognition framework that cotrains the feature extractor with two context-free learning tasks. First, a Context Isolation Learning task is proposed to wipe the context for each character from the input image, utilizing a character mask learned in a weak supervision manner. Second, the framework adopts an Individual Character Learning task, which is a single-character classification task with synthetic samples. After training on English and simplified Chinese data, our framework can adapt to recognize unseen characters in Japanese, Korean, Greek, and other scripts without retraining, and can reliably spot unseen characters in Japanese with an F1-score over 64%. The framework also shows 91.5% line accuracy on IIIT5k and a speed of over 69 FPS single-batched, making it a feasible universal lightweight OCR solution that works well for both open-set and close-set use cases. © 1992-2012 IEEE.
引用
收藏
页码:6497 / 6507
相关论文
共 50 条
  • [1] Open-Set Text Recognition via Character-Context Decoupling
    Liu, Chang
    Yang, Chun
    Yin, Xu-Cheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4513 - 4522
  • [2] Towards open-set text recognition via label-to-prototype learning
    Liu, Chang
    Yang, Chun
    Qin, Hai-Bo
    Zhu, Xiaobin
    Liu, Cheng-Lin
    Yin, Xu-Cheng
    PATTERN RECOGNITION, 2023, 134
  • [3] Deep Active Learning via Open-Set Recognition
    Mandivarapu, Jaya Krishna
    Camp, Blake
    Estrada, Rolando
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [4] Open-set Text Recognition via Part-based Similarity
    Liu, Chang
    Yang, Chun
    Yin, Xu-Cheng
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (10): : 1977 - 1987
  • [5] Learning Placeholders for Open-Set Recognition
    Zhou, Da-Wei
    Ye, Han-Jia
    Zhan, De-Chuan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4399 - 4408
  • [6] OPEN-SET RECOGNITION VIA AUGMENTATION-BASED SIMILARITY LEARNING
    Esmaeilpour, Sepideh
    Shu, Lei
    Liu, Bing
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [7] Open-set Recognition with Supervised Contrastive Learning
    Kodama, Yuto
    Wang, Yinan
    Kawakami, Rei
    Naemura, Takeshi
    PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [8] Learning Network Architecture for Open-Set Recognition
    Zhang, Xuelin
    Cheng, Xuelian
    Zhang, Donghao
    Bonnington, Paul
    Ge, Zongyuan
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3362 - 3370
  • [9] OpenGAN: Open-Set Recognition via Open Data Generation
    Kong, Shu
    Ramanan, Deva
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 793 - 802
  • [10] Open-set iris recognition based on deep learning
    Sun, Jie
    Zhao, Shipeng
    Miao, Sheng
    Wang, Xuan
    Yu, Yanan
    IET IMAGE PROCESSING, 2022, 16 (09) : 2361 - 2372