CFOR: Character-First Open-Set Text Recognition via Context-Free Learning

被引:0
|
作者
Liu, Chang [1 ,2 ]
Yang, Chun [1 ]
Fang, Zhiyu [1 ]
Qin, Hai-Bo [1 ]
Yin, Xu-Cheng [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Lulea Tekn Univ, ML Grp, S-97187 Lulea, Sweden
基金
中国国家自然科学基金;
关键词
Zero-shot learning; anomaly detection; text recognition; NETWORK;
D O I
10.1109/TIP.2024.3480711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The open-set text recognition task is a generalized form of the (close-set) text recognition task, where the model is further challenged to spot and incrementally recognize novel characters not covered by the training data. Novel characters also indicate that the language model of the training set is biased from the "real-world". In this work, we alleviate the confounding effect of such biases by learning from individual character representations isolated from their context. Specifically, we propose a Character-First Open-Set Text Recognition framework that cotrains the feature extractor with two context-free learning tasks. First, a Context Isolation Learning task is proposed to wipe the context for each character from the input image, utilizing a character mask learned in a weak supervision manner. Second, the framework adopts an Individual Character Learning task, which is a single-character classification task with synthetic samples. After training on English and simplified Chinese data, our framework can adapt to recognize unseen characters in Japanese, Korean, Greek, and other scripts without retraining, and can reliably spot unseen characters in Japanese with an F1-score over 64%. The framework also shows 91.5% line accuracy on IIIT5k and a speed of over 69 FPS single-batched, making it a feasible universal lightweight OCR solution that works well for both open-set and close-set use cases.
引用
收藏
页码:6497 / 6507
页数:11
相关论文
共 50 条
  • [31] Open-Set Radar Emitter Recognition via Deep Metric Autoencoder
    Yang, Chen
    Liu, Huiling
    Yang, Shuyuan
    Feng, Zhixi
    Tang, Xiaogang
    Zhang, Feng
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 18281 - 18291
  • [32] SeeTek: Very Large-Scale Open-set Logo Recognition with Text-Aware Metric Learning
    Li, Chenge
    Fehervari, Istvan
    Zhao, Xiaonan
    Macedo, Ives
    Appalaraju, Srikar
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 587 - 596
  • [33] RECOGNITION TIME OF LANGUAGES GENERATED BY CONTEXT-FREE GRAMMARS WITH CONTROL SET
    ITO, H
    INAGAKI, Y
    FUKUMURA, T
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1972, 55 (06): : 142 - 149
  • [34] ORALI: Open-set recognition and active learning for unknown lithology identification
    Zhu, Xinyi
    Zhang, Hongbing
    Ren, Quan
    Rui, Jianwen
    Zhang, Lingyuan
    Zhang, Dailu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [35] Deep metric learning for open-set human action recognition in videos
    Gutoski, Matheus
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1207 - 1220
  • [36] Deep metric learning for open-set human action recognition in videos
    Matheus Gutoski
    André Eugênio Lazzaretti
    Heitor Silvério Lopes
    Neural Computing and Applications, 2021, 33 : 1207 - 1220
  • [37] Open-Set Patient Activity Recognition With Radar Sensors and Deep Learning
    Bhavanasi, Geethika
    Werthen-Brabants, Lorin
    Dhaene, Tom
    Couckuyt, Ivo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] Open-Set Patient Activity Recognition With Radar Sensors and Deep Learning
    Bhavanasi, Geethika
    Werthen-Brabants, Lorin
    Dhaene, Tom
    Couckuyt, Ivo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [39] Open-set long-tailed recognition via orthogonal prototype learning and false rejection correction
    Deng, Binquan
    Kamel, Aouaidjia
    Zhang, Chongsheng
    NEURAL NETWORKS, 2025, 181
  • [40] Towards open-set touchless palmprint recognition via weight-based meta metric learning
    Shao, Huikai
    Zhong, Dexing
    PATTERN RECOGNITION, 2022, 121