In-domain versus out-of-domain transfer learning for document layout analysis

被引:0
|
作者
De Nardin, Axel [1 ]
Zottin, Silvia [1 ]
Piciarelli, Claudio [1 ]
Foresti, Gian Luca [1 ]
Colombi, Emanuela [2 ]
机构
[1] Univ Udine, Dept Math Informat & Phys, Via Sci,206, I-33100 Udine, UD, Italy
[2] Univ Udine, Dept Humanities & Cultural Heritage, Vicolo Florio,2, I-33100 Udine, UD, Italy
关键词
Document analysis; Layout segmentation; Semantic segmentation; Transfer learning;
D O I
10.1007/s10032-024-00497-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data availability is a big concern in the field of document analysis, especially when working on tasks that require a high degree of precision when it comes to the definition of the ground truths on which to train deep learning models. A notable example is represented by the task of document layout analysis in handwritten documents, which requires pixel-precise segmentation maps to highlight the different layout components of each document page. These segmentation maps are typically very time-consuming and require a high degree of domain knowledge to be defined, as they are intrinsically characterized by the content of the text. For this reason in the present work, we explore the effects of different initialization strategies for deep learning models employed for this type of task by relying on both in-domain and cross-domain datasets for their pre-training. To test the employed models we use two publicly available datasets with heterogeneous characteristics both regarding their structure as well as the languages of the contained documents. We show how a combination of cross-domain and in-domain transfer learning approaches leads to the best overall performance of the models, as well as speeding up their convergence process.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Cross-domain document layout analysis using document style guide
    Wu, Xingjiao
    Xiao, Luwei
    Du, Xiangcheng
    Zheng, Yingbin
    Li, Xin
    Ma, Tianlong
    Jin, Cheng
    He, Liang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [42] Pretraining boosts out-of-domain robustness for pose estimation
    Mathis, Alexander
    Biasi, Thomas
    Schneider, Steffen
    Yuksekgonul, Mert
    Rogers, Byron
    Bethge, Matthias
    Mathis, Mackenzie W.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1858 - 1867
  • [43] An Out-of-Domain Test Suite for Dependency Parsing of German
    Seeker, Wolfgang
    Kuhn, Jonas
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4066 - 4073
  • [44] Detecting Annotation Scheme Variation in Out-of-Domain Treebanks
    Versley, Yannick
    Steen, Julius
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2354 - 2360
  • [45] Can You Label Less by Using Out-of-Domain Data? Active & Transfer Learning with Few-shot Instructions
    Kocielnik, Rafal
    Kangaslahti, Sara
    Prabhumoye, Shrimai
    Hari, Meena
    Alvarez, R. Michael
    Anandkumar, Anima
    TRANSFER LEARNING FOR NATURAL LANGUAGE PROCESSING WORKSHOP, VOL 203, 2022, 203 : 22 - 32
  • [46] Optimal transport-based transfer learning for smart manufacturing: Tool wear prediction using out-of-domain data
    Xie, Rui
    Wu, Dazhong
    MANUFACTURING LETTERS, 2021, 29 (29) : 104 - 107
  • [47] Domain adaptive learning for document layout analysis and object detection using classifier alignment mechanism
    Mishra, Prerna
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 116
  • [48] The predictability of the effectiveness of chains of classifiers in the out-of-domain detection
    Cofta, Piotr
    Engineering Applications of Artificial Intelligence, 2025, 139
  • [49] On the Effects of Transformer Size on In- and Out-of-Domain Calibration
    Dan, Soham
    Roth, Dan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2096 - 2101
  • [50] A domain-knowledge based reconstruction framework for out-of-domain news title classification
    Yuan, Shi
    Liu, Ningning
    Sun, Bo
    Zhao, Chen
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237