CFOR: Character-First Open-Set Text Recognition via Context-Free Learning

Authors
Liu, Chang [1, 2]
Yang, Chun [1 ]
Fang, Zhiyu [1 ]
Qin, Hai-Bo [1 ]
Yin, Xu-Cheng [1 ]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Lulea Tekn Univ, ML Grp, S-97187 Lulea, Sweden
Funding
National Natural Science Foundation of China
Keywords
Zero-shot learning; anomaly detection; text recognition; NETWORK
DOI
10.1109/TIP.2024.3480711
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The open-set text recognition task is a generalized form of the (close-set) text recognition task, where the model is further challenged to spot and incrementally recognize novel characters not covered by the training data. Novel characters also indicate that the language model of the training set is biased from the "real-world". In this work, we alleviate the confounding effect of such biases by learning from individual character representations isolated from their context. Specifically, we propose a Character-First Open-Set Text Recognition framework that cotrains the feature extractor with two context-free learning tasks. First, a Context Isolation Learning task is proposed to wipe the context for each character from the input image, utilizing a character mask learned in a weak supervision manner. Second, the framework adopts an Individual Character Learning task, which is a single-character classification task with synthetic samples. After training on English and simplified Chinese data, our framework can adapt to recognize unseen characters in Japanese, Korean, Greek, and other scripts without retraining, and can reliably spot unseen characters in Japanese with an F1-score over 64%. The framework also shows 91.5% line accuracy on IIIT5k and a speed of over 69 FPS single-batched, making it a feasible universal lightweight OCR solution that works well for both open-set and close-set use cases.
Pages: 6497-6507
Page count: 11