Investigations on Self-supervised Learning for Script-, Font-type, and Location Classification on Historical Documents

被引:2
|
作者
Zenk, Johan [1 ]
Kordon, Florian [1 ]
Mayr, Martin [1 ]
Seuret, Mathias [1 ]
Christlein, Vincent [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Bavaria, Germany
来源
PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023 | 2023年
关键词
self-supervised learning; document analysis; classification;
D O I
10.1145/3604951.3605519
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of automated classification of historical documents, we investigate three contemporary self-supervised learning (SSL) techniques (SimSiam, Dino, and VICReg) for the pre-training of three different document analysis tasks, namely script-type, font-type, and location classification. Our study draws samples from multiple datasets that contain images of manuscripts, prints, charters, and letters. The representations derived via pre-text training are taken as inputs for k-NN classification and a parametric linear classifier. The latter is placed atop the pre-trained backbones to enable fine-tuning of the entire network to further improve the classification by exploiting task-specific label data. The network's final performance is assessed via independent test sets obtained from the ICDAR2021 Competition on Historical Document Classification. We empirically show that representations learned with SSL are significantly better suited for subsequent document classification than features generated by commonly used transfer learning on ImageNet.
引用
收藏
页码:97 / 102
页数:6
相关论文
共 50 条
  • [41] SELF-SUPERVISED LEARNING FOR FEW-SHOT IMAGE CLASSIFICATION
    Chen, Da
    Chen, Yuefeng
    Li, Yuhong
    Mao, Feng
    He, Yuan
    Xue, Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1745 - 1749
  • [42] Self-supervised Visual Feature Learning and Classification Framework: Based on Contrastive Learning
    Wang, Zhibo
    Yan, Shen
    Zhang, Xiaoyu
    Lobo, Niels Da Vitoria
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 719 - 725
  • [43] Self-supervised learning and semi-supervised learning for multi-sequence medical image classification
    Wang, Yueyue
    Song, Danjun
    Wang, Wentao
    Rao, Shengxiang
    Wang, Xiaoying
    Wang, Manning
    NEUROCOMPUTING, 2022, 513 : 383 - 394
  • [44] Pyramid-based self-supervised learning for histopathological image classification
    Wang, Junjie
    Quan, Hao
    Wang, Chengguang
    Yang, Genke
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [45] DEEP SELF-SUPERVISED PIXEL-LEVEL LEARNING FOR HYPERSPECTRAL CLASSIFICATION
    Gonzalez-Santiago, Jonathan
    Schenkel, Fabian
    Gross, Wolfgang
    Middelmann, Wolfgang
    2022 12TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2022,
  • [46] Self-Supervised Feature Learning With CRF Embedding for Hyperspectral Image Classification
    Wang, Yuebin
    Mei, Jie
    Zhang, Liqiang
    Zhang, Bing
    Zhu, Panpan
    Li, Yang
    Li, Xingang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (05): : 2628 - 2642
  • [47] Classification of Plant Leaf Disease Recognition Based on Self-Supervised Learning
    Wang, Yuzhi
    Yin, Yunzhen
    Li, Yaoyu
    Qu, Tengteng
    Guo, Zhaodong
    Peng, Mingkang
    Jia, Shujie
    Wang, Qiang
    Zhang, Wuping
    Li, Fuzhong
    AGRONOMY-BASEL, 2024, 14 (03):
  • [48] Few-Shot Hyperspectral Image Classification With Self-Supervised Learning
    Li, Zhaokui
    Guo, Hui
    Chen, Yushi
    Liu, Cuiwei
    Du, Qian
    Fang, Zhuoqun
    Wang, Yan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [49] Parkinson's Disease Classification with Self-supervised Learning and Attention Mechanism
    Zhang, Yuchen
    Lei, Haijun
    Huang, Zhongwei
    Zhao, Menglu
    Li, Zhen
    Liu, Chuan-Ming
    Lei, Baiying
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4601 - 4607
  • [50] Self-Supervised Adversarial Learning for Domain Adaptation of Pavement Distress Classification
    Wu, Yanwen
    Hong, Mingjian
    Li, Ao
    Huang, Sheng
    Liu, Huijun
    Ge, Yongxin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1966 - 1977