Self-supervised Representation Learning on Document Images

被引:5
|
作者
Cosma, Adrian [1 ,2 ]
Ghidoveanu, Mihai [1 ,3 ]
Panaitescu-Liess, Michael [1 ,3 ]
Popescu, Marius [1 ,3 ]
机构
[1] Sparktech Software, Bucharest, Romania
[2] Univ Politehn Bucuresti, Bucharest, Romania
[3] Univ Bucharest, Fac Math & Comp Sci, Bucharest, Romania
来源
DOCUMENT ANALYSIS SYSTEMS | 2020年 / 12116卷
关键词
Self-supervision; Pre-training; Transfer learning; Document images; Convolutional neural networks; CLASSIFICATION; SIMILARITY;
D O I
10.1007/978-3-030-57058-3_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work analyses the impact of self-supervised pre-training on document images in the context of document image classification. While previous approaches explore the effect of self-supervision on natural images, we show that patch-based pre-training performs poorly on document images because of their different structural properties and poor intra-sample semantic information. We propose two context-aware alternatives to improve performance on the Tobacco-3482 image classification task. We also propose a novel method for self-supervision, which makes use of the inherent multi-modality of documents (image and text), which performs better than other popular self-supervised methods, including supervised ImageNet pre-training, on document image classification scenarios with a limited amount of data.
引用
收藏
页码:103 / 117
页数:15
相关论文
共 50 条
  • [1] SelfDoc: Self-Supervised Document Representation Learning
    Li, Peizhao
    Gu, Jiuxiang
    Kuen, Jason
    Morariu, Vlad, I
    Zhao, Handong
    Jain, Rajiv
    Manjunatha, Varun
    Liu, Hongfu
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5648 - 5656
  • [2] Self-supervised Representation Learning for Astronomical Images
    Hayat, Md Abul
    Stein, George
    Harrington, Peter
    Lukic, Zarija
    Mustafa, Mustafa
    [J]. ASTROPHYSICAL JOURNAL LETTERS, 2021, 911 (02)
  • [3] Self-Supervised Representation Learning for Document Image Classification
    Siddiqui, Shoaib Ahmed
    Dengel, Andreas
    Ahmed, Sheraz
    [J]. IEEE ACCESS, 2021, 9 : 164358 - 164367
  • [4] Self-supervised Visual Representation Learning for Histopathological Images
    Yang, Pengshuai
    Hong, Zhiwei
    Yin, Xiaoxu
    Zhu, Chengzhan
    Jiang, Rui
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 47 - 57
  • [5] Self-supervised Segment Contrastive Learning for Medical Document Representation
    Abro, Waheed Ahmed
    Kteich, Hanane
    Bouraoui, Zied
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PT I, AIME 2024, 2024, 14844 : 312 - 321
  • [6] HistoSSL: Self-Supervised Representation Learning for Classifying Histopathology Images
    Jin, Xu
    Huang, Teng
    Wen, Ke
    Chi, Mengxian
    An, Hong
    [J]. MATHEMATICS, 2023, 11 (01)
  • [7] DocMAE: Document Image Rectification via Self-supervised Representation Learning
    Liu, Shaokai
    Feng, Hao
    Zhou, Wengang
    Li, Houqiang
    Liu, Cong
    Wu, Feng
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1613 - 1618
  • [8] Whitening for Self-Supervised Representation Learning
    Ermolov, Aleksandr
    Siarohin, Aliaksandr
    Sangineto, Enver
    Sebe, Nicu
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] Self-Supervised Representation Learning for CAD
    Jones, Benjamin T.
    Hu, Michael
    Kodnongbua, Milin
    Kim, Vladimir G.
    Schulz, Adriana
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21327 - 21336
  • [10] Self-supervised Multi-task Representation Learning for Sequential Medical Images
    Dong, Nanqing
    Kampffmeyer, Michael
    Voiculescu, Irina
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 779 - 794