In-domain versus out-of-domain transfer learning for document layout analysis

被引:0
|
作者
De Nardin, Axel [1 ]
Zottin, Silvia [1 ]
Piciarelli, Claudio [1 ]
Foresti, Gian Luca [1 ]
Colombi, Emanuela [2 ]
机构
[1] Univ Udine, Dept Math Informat & Phys, Via Sci,206, I-33100 Udine, UD, Italy
[2] Univ Udine, Dept Humanities & Cultural Heritage, Vicolo Florio,2, I-33100 Udine, UD, Italy
关键词
Document analysis; Layout segmentation; Semantic segmentation; Transfer learning;
D O I
10.1007/s10032-024-00497-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data availability is a big concern in the field of document analysis, especially when working on tasks that require a high degree of precision when it comes to the definition of the ground truths on which to train deep learning models. A notable example is represented by the task of document layout analysis in handwritten documents, which requires pixel-precise segmentation maps to highlight the different layout components of each document page. These segmentation maps are typically very time-consuming and require a high degree of domain knowledge to be defined, as they are intrinsically characterized by the content of the text. For this reason in the present work, we explore the effects of different initialization strategies for deep learning models employed for this type of task by relying on both in-domain and cross-domain datasets for their pre-training. To test the employed models we use two publicly available datasets with heterogeneous characteristics both regarding their structure as well as the languages of the contained documents. We show how a combination of cross-domain and in-domain transfer learning approaches leads to the best overall performance of the models, as well as speeding up their convergence process.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] In-domain versus out-of-domain transfer learning in plankton image classification
    Andrea Maracani
    Vito Paolo Pastore
    Lorenzo Natale
    Lorenzo Rosasco
    Francesca Odone
    [J]. Scientific Reports, 13
  • [2] In-domain versus out-of-domain transfer learning in plankton image classification
    Maracani, Andrea
    Pastore, Vito Paolo
    Natale, Lorenzo
    Rosasco, Lorenzo
    Odone, Francesca
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [3] In-Domain versus Out-of-Domain training for Text-Dependent JFA
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Ouellet, Pierre
    Kockmann, Marcel
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1332 - 1336
  • [4] GAN-BASED OUT-OF-DOMAIN DETECTION USING BOTH IN-DOMAIN AND OUT-OF-DOMAIN SAMPLES
    Liang, Chaojie
    Huang, Peijie
    Lai, Wenbin
    Ruan, Ziheng
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7663 - 7667
  • [5] Towards Textual Out-of-Domain Detection Without In-Domain Labels
    Jin, Di
    Gao, Shuyang
    Kim, Seokhwan
    Liu, Yang
    Hakkani-Tur, Dilek
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1386 - 1395
  • [6] Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study
    Despotovic, Vladimir
    Kim, Sang-Yoon
    Hau, Ann-Christin
    Kakoichankava, Aliaksandra
    Klamminger, Gilbert Georg
    Borgmann, Felix Bruno Kleine
    Frauenknecht, Katrin B. M.
    Mittelbronn, Michel
    Nazarov, Petr, V
    [J]. HELIYON, 2024, 10 (05)
  • [7] Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech
    Christensen, H.
    Aniol, M. B.
    Bell, P.
    Green, P.
    Hain, T.
    King, S.
    Swietojanski, P.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3609 - 3612
  • [8] Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems
    Ryu, Seonghan
    Kim, Seokhwan
    Choi, Junhwi
    Yu, Hwanjo
    Lee, Gary Geunbae
    [J]. PATTERN RECOGNITION LETTERS, 2017, 88 : 26 - 32
  • [9] IN-DOMAIN AND OUT-OF-DOMAIN DATA AUGMENTATION TO IMPROVE CHILDREN'S SPEAKER VERIFICATION SYSTEM IN LIMITED DATA SCENARIO
    Shahnawazuddin, S.
    Ahmad, Waquar
    Adiga, Nagaraj
    Kumar, Avinash
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7554 - 7558
  • [10] The GENEREG Corpus for Gene Expression Regulation Events-An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability
    Buyko, Ekaterina
    Beisswanger, Elena
    Hahn, Udo
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2662 - 2666