Investigations on Self-supervised Learning for Script-, Font-type, and Location Classification on Historical Documents

被引:2
|
作者
Zenk, Johan [1 ]
Kordon, Florian [1 ]
Mayr, Martin [1 ]
Seuret, Mathias [1 ]
Christlein, Vincent [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Bavaria, Germany
来源
PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023 | 2023年
关键词
self-supervised learning; document analysis; classification;
D O I
10.1145/3604951.3605519
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of automated classification of historical documents, we investigate three contemporary self-supervised learning (SSL) techniques (SimSiam, Dino, and VICReg) for the pre-training of three different document analysis tasks, namely script-type, font-type, and location classification. Our study draws samples from multiple datasets that contain images of manuscripts, prints, charters, and letters. The representations derived via pre-text training are taken as inputs for k-NN classification and a parametric linear classifier. The latter is placed atop the pre-trained backbones to enable fine-tuning of the entire network to further improve the classification by exploiting task-specific label data. The network's final performance is assessed via independent test sets obtained from the ICDAR2021 Competition on Historical Document Classification. We empirically show that representations learned with SSL are significantly better suited for subsequent document classification than features generated by commonly used transfer learning on ImageNet.
引用
收藏
页码:97 / 102
页数:6
相关论文
共 50 条
  • [21] Self-supervised learning for efficient seismic facies classification
    Chikhaoui, Khalil
    Alfarraj, Motaz
    GEOPHYSICS, 2024, 89 (05) : IM61 - IM76
  • [22] Embedding Global Contrastive and Local Location in Self-Supervised Learning
    Zhao, Wenyi
    Li, Chongyi
    Zhang, Weidong
    Yang, Lu
    Zhuang, Peixian
    Li, Lingqiao
    Fan, Kefeng
    Yang, Huihua
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2275 - 2289
  • [23] scPretrain: multi-task self-supervised learning for cell-type classification
    Zhang, Ruiyi
    Luo, Yunan
    Ma, Jianzhu
    Zhang, Ming
    Wang, Sheng
    BIOINFORMATICS, 2022, 38 (06) : 1607 - 1614
  • [24] Comparison of Different Supervised and Self-supervised Learning Techniques in Skin Disease Classification
    Cino, Loris
    Mazzeo, Pier Luigi
    Distante, Cosimo
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 77 - 88
  • [25] Semi-supervised Time Series Classification Model with Self-supervised Learning
    Xi, Liang
    Yun, Zichao
    Liu, Han
    Wang, Ruidong
    Huang, Xunhua
    Fan, Haoyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [26] Nearest Neighboring Self-Supervised Learning for Hyperspectral Image Classification
    Qin, Yao
    Ye, Yuanxin
    Zhao, Yue
    Wu, Junzheng
    Zhang, Han
    Cheng, Kenan
    Li, Kun
    REMOTE SENSING, 2023, 15 (06)
  • [27] LFM Signal Sources Classification Based on Self-Supervised Learning
    Yang, Tianqi
    Mi, Siya
    PROGRESS IN ELECTROMAGNETICS RESEARCH LETTERS, 2023, 112 : 103 - 110
  • [28] Classification-Based Self-Supervised Learning for Anomaly Detection
    Li, Honghu
    Zhu, Yuesheng
    He, Ying
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [29] LARGE-CONTEXT CONVERSATIONAL REPRESENTATION LEARNING: SELF-SUPERVISED LEARNING FOR CONVERSATIONAL DOCUMENTS
    Masumura, Ryo
    Makishima, Naoki
    Ihori, Mana
    Takashima, Akihiko
    Tanaka, Tomohiro
    Orihashi, Shota
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 1012 - 1019
  • [30] Conditional Self-Supervised Learning for Few-Shot Classification
    An, Yuexuan
    Xue, Hui
    Zhao, Xingyu
    Zhang, Lu
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2140 - 2146