In-domain versus out-of-domain transfer learning for document layout analysis

被引：0

作者：

De Nardin, Axel ^{[1
]}

Zottin, Silvia ^{[1
]}

Piciarelli, Claudio ^{[1
]}

Foresti, Gian Luca ^{[1
]}

Colombi, Emanuela ^{[2
]}

机构：

[1] Univ Udine, Dept Math Informat & Phys, Via Sci,206, I-33100 Udine, UD, Italy

[2] Univ Udine, Dept Humanities & Cultural Heritage, Vicolo Florio,2, I-33100 Udine, UD, Italy

来源：

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION | 2024年

关键词：

Document analysis; Layout segmentation; Semantic segmentation; Transfer learning;

D O I：

10.1007/s10032-024-00497-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data availability is a big concern in the field of document analysis, especially when working on tasks that require a high degree of precision when it comes to the definition of the ground truths on which to train deep learning models. A notable example is represented by the task of document layout analysis in handwritten documents, which requires pixel-precise segmentation maps to highlight the different layout components of each document page. These segmentation maps are typically very time-consuming and require a high degree of domain knowledge to be defined, as they are intrinsically characterized by the content of the text. For this reason in the present work, we explore the effects of different initialization strategies for deep learning models employed for this type of task by relying on both in-domain and cross-domain datasets for their pre-training. To test the employed models we use two publicly available datasets with heterogeneous characteristics both regarding their structure as well as the languages of the contained documents. We show how a combination of cross-domain and in-domain transfer learning approaches leads to the best overall performance of the models, as well as speeding up their convergence process.

引用

下载

页数：15

共 50 条

[21] A Green Pipeline for Out-of-Domain Public Sentiment Analysis
Xie, Ming
Jiang, Jing
Shen, Tao
Wang, Yang
Gerrard, Leah
Clarke, Allison
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT I, 2022, 13087 : 190 - 202
[22] KNN-Contrastive Learning for Out-of-Domain Intent Classification
Zhou, Yunhua
Liu, Peiju
Qiu, Xipeng
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5129 - 5141
[23] In-Domain Transfer Learning Strategy for Tumor Detection on Brain MRI
Terzi, Duygu Sinanc
Azginoglu, Nuh
DIAGNOSTICS, 2023, 13 (12)
[24] Improving Unsupervised Out-of-domain Detection through Pseudo Labeling and Learning
Lee, Byounghan
Kim, Jaesik
Park, Junekyu
Sohn, Kyung-Ah
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1031 - 1041
[25] Using Representation Learning and Out-of-domain Data for a Paralinguistic Speech Task
Milde, Benjamin
Biemann, Chris
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 904 - 908
[26] Optimizing Upstream Representations for Out-of-Domain Detection with Supervised Contrastive Learning
Wang, Bo
Mine, Tsunenori
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2585 - 2595
[27] Learning from noisy out-of-domain corpus using dataless classification
Jin, Yiping
Wanvarie, Dittaya
Le, Phu T., V
NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 39 - 69
[28] Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning
Zeng, Zhiyuan
He, Keqing
Yan, Yuanmeng
Liu, Zijun
Wu, Yanan
Xu, Hong
Jiang, Huixing
Xu, Weiran
ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 870 - 878
[29] Using out-of-domain data to improve on-domain language models
Iyer, R
Ostendorf, M
Gish, H
IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (08) : 221 - 223
[30] Certifying Out-of-Domain Generalization for Blackbox Functions
Weber, Maurice
Li, Linyi
Wang, Boxin
Zhao, Zhikuan
Li, Bo
Zhang, Ce
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,

← 1 2 3 4 5 →