In-domain versus out-of-domain transfer learning in plankton image classification

被引:0
|
作者
Andrea Maracani
Vito Paolo Pastore
Lorenzo Natale
Lorenzo Rosasco
Francesca Odone
机构
[1] Istituto Italiano di Tecnologia,
[2] MaLGa-DIBRIS,undefined
[3] Università degli studi di Genova,undefined
[4] CBMM,undefined
[5] Massachusetts Institute of Technology,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Plankton microorganisms play a huge role in the aquatic food web. Recently, it has been proposed to use plankton as a biosensor, since they can react to even minimal perturbations of the aquatic environment with specific physiological changes, which may lead to alterations in morphology and behavior. Nowadays, the development of high-resolution in-situ automatic acquisition systems allows the research community to obtain a large amount of plankton image data. Fundamental examples are the ZooScan and Woods Hole Oceanographic Institution (WHOI) datasets, comprising up to millions of plankton images. However, obtaining unbiased annotations is expensive both in terms of time and resources, and in-situ acquired datasets generally suffer from severe imbalance, with only a few images available for several species. Transfer learning is a popular solution to these challenges, with ImageNet1K being the most-used source dataset for pre-training. On the other hand, datasets like the ZooScan and the WHOI may represent a valuable opportunity to compare out-of-domain and large-scale plankton in-domain source datasets, in terms of performance for the task at hand.In this paper, we design three transfer learning pipelines for plankton image classification, with the aim of comparing in-domain and out-of-domain transfer learning on three popular benchmark plankton datasets. The general framework consists in fine-tuning a pre-trained model on a plankton target dataset. In the first pipeline, the model is pre-trained from scratch on a large-scale plankton dataset, in the second, it is pre-trained on large-scale natural image datasets (ImageNet1K or ImageNet22K), while in the third, a two-stage fine-tuning is implemented (ImageNet →\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rightarrow $$\end{document} large-scale plankton dataset →\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rightarrow $$\end{document} target plankton dataset). Our results show that an out-of-domain ImageNet22K pre-training outperforms the plankton in-domain ones, with an average boost in test accuracy of around 6%. In the next part of this work, we adopt three ImageNet22k pre-trained Vision Transformers and one ConvNeXt, obtaining results on par (or slightly superior) with the state-of-the-art, corresponding to the usage of CNN models ensembles, with a single model. Finally, we design and test an ensemble of our Vision Transformers and the ConvNeXt, outperforming the state-of-the-art existing works on plankton image classification on the three target datasets. To support scientific community contribution and further research, our implemented code is open-source and available at https://github.com/Malga-Vision/plankton_transfer.
引用
收藏
相关论文
共 50 条
  • [31] Out-of-domain utterance detection using classification confidences of multiple topics
    Lane, Ian
    Kawahara, Tatsuya
    Matsui, Tomoko
    Nakamura, Satoshi
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 150 - 161
  • [32] In-Domain Transfer Learning Strategy for Tumor Detection on Brain MRI
    Terzi, Duygu Sinanc
    Azginoglu, Nuh
    [J]. DIAGNOSTICS, 2023, 13 (12)
  • [33] Improving Unsupervised Out-of-domain Detection through Pseudo Labeling and Learning
    Lee, Byounghan
    Kim, Jaesik
    Park, Junekyu
    Sohn, Kyung-Ah
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1031 - 1041
  • [34] Dyadic Transfer Learning for Cross-Domain Image Classification
    Wang, Hua
    Nie, Feiping
    Huang, Heng
    Ding, Chris
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 551 - 556
  • [35] Using Representation Learning and Out-of-domain Data for a Paralinguistic Speech Task
    Milde, Benjamin
    Biemann, Chris
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 904 - 908
  • [36] Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning
    Zeng, Zhiyuan
    He, Keqing
    Yan, Yuanmeng
    Liu, Zijun
    Wu, Yanan
    Xu, Hong
    Jiang, Huixing
    Xu, Weiran
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 870 - 878
  • [37] Optimizing Upstream Representations for Out-of-Domain Detection with Supervised Contrastive Learning
    Wang, Bo
    Mine, Tsunenori
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2585 - 2595
  • [38] Using out-of-domain data to improve on-domain language models
    Iyer, R
    Ostendorf, M
    Gish, H
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1997, 4 (08) : 221 - 223
  • [39] Certifying Out-of-Domain Generalization for Blackbox Functions
    Weber, Maurice
    Li, Linyi
    Wang, Boxin
    Zhao, Zhikuan
    Li, Bo
    Zhang, Ce
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [40] Out-of-domain detection based on confidence measures from multiple topic classification
    Lane, IR
    Kawahara, T
    Matsui, T
    Nakamura, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 757 - 760