In-domain versus out-of-domain transfer learning for document layout analysis

被引:0
|
作者
De Nardin, Axel [1 ]
Zottin, Silvia [1 ]
Piciarelli, Claudio [1 ]
Foresti, Gian Luca [1 ]
Colombi, Emanuela [2 ]
机构
[1] Univ Udine, Dept Math Informat & Phys, Via Sci,206, I-33100 Udine, UD, Italy
[2] Univ Udine, Dept Humanities & Cultural Heritage, Vicolo Florio,2, I-33100 Udine, UD, Italy
关键词
Document analysis; Layout segmentation; Semantic segmentation; Transfer learning;
D O I
10.1007/s10032-024-00497-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data availability is a big concern in the field of document analysis, especially when working on tasks that require a high degree of precision when it comes to the definition of the ground truths on which to train deep learning models. A notable example is represented by the task of document layout analysis in handwritten documents, which requires pixel-precise segmentation maps to highlight the different layout components of each document page. These segmentation maps are typically very time-consuming and require a high degree of domain knowledge to be defined, as they are intrinsically characterized by the content of the text. For this reason in the present work, we explore the effects of different initialization strategies for deep learning models employed for this type of task by relying on both in-domain and cross-domain datasets for their pre-training. To test the employed models we use two publicly available datasets with heterogeneous characteristics both regarding their structure as well as the languages of the contained documents. We show how a combination of cross-domain and in-domain transfer learning approaches leads to the best overall performance of the models, as well as speeding up their convergence process.
引用
下载
收藏
页数:15
相关论文
共 50 条
  • [21] A Green Pipeline for Out-of-Domain Public Sentiment Analysis
    Xie, Ming
    Jiang, Jing
    Shen, Tao
    Wang, Yang
    Gerrard, Leah
    Clarke, Allison
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT I, 2022, 13087 : 190 - 202
  • [22] KNN-Contrastive Learning for Out-of-Domain Intent Classification
    Zhou, Yunhua
    Liu, Peiju
    Qiu, Xipeng
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5129 - 5141
  • [23] In-Domain Transfer Learning Strategy for Tumor Detection on Brain MRI
    Terzi, Duygu Sinanc
    Azginoglu, Nuh
    DIAGNOSTICS, 2023, 13 (12)
  • [24] Improving Unsupervised Out-of-domain Detection through Pseudo Labeling and Learning
    Lee, Byounghan
    Kim, Jaesik
    Park, Junekyu
    Sohn, Kyung-Ah
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1031 - 1041
  • [25] Using Representation Learning and Out-of-domain Data for a Paralinguistic Speech Task
    Milde, Benjamin
    Biemann, Chris
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 904 - 908
  • [26] Optimizing Upstream Representations for Out-of-Domain Detection with Supervised Contrastive Learning
    Wang, Bo
    Mine, Tsunenori
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2585 - 2595
  • [27] Learning from noisy out-of-domain corpus using dataless classification
    Jin, Yiping
    Wanvarie, Dittaya
    Le, Phu T., V
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 39 - 69
  • [28] Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning
    Zeng, Zhiyuan
    He, Keqing
    Yan, Yuanmeng
    Liu, Zijun
    Wu, Yanan
    Xu, Hong
    Jiang, Huixing
    Xu, Weiran
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 870 - 878
  • [29] Using out-of-domain data to improve on-domain language models
    Iyer, R
    Ostendorf, M
    Gish, H
    IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (08) : 221 - 223
  • [30] Certifying Out-of-Domain Generalization for Blackbox Functions
    Weber, Maurice
    Li, Linyi
    Wang, Boxin
    Zhao, Zhikuan
    Li, Bo
    Zhang, Ce
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,