Domain adaptation in small-scale and heterogeneous biological datasets

Cited by: 2
Authors
Orouji, Seyedmehdi [1]
Liu, Martin C. [2, 3]
Korem, Tal [3, 4, 5]
Peters, Megan A. K. [1, 5, 6]
Affiliations
[1] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92697 USA
[2] Columbia Univ Irving Med Ctr, Dept Biomed Informat, New York, NY USA
[3] Columbia Univ Irving Med Ctr, Dept Syst Biol, Program Math Genom, New York, NY 10032 USA
[4] Columbia Univ Irving Med Ctr, Dept Obstet & Gynecol, New York, NY 10032 USA
[5] CIFAR, CIFAR Azrieli Global Scholars program, Toronto, ON, Canada
[6] CIFAR, Program Brain Mind & Consciousness, Toronto, ON, Canada
Source
SCIENCE ADVANCES | 2024 / Vol. 10 / Issue 51
Keywords
CLASSIFICATION; IDENTIFICATION; VISUALIZATION; INFERENCE; SOFTWARE; NETWORK; MIXTURE; IMPACT; KERNEL;
DOI
10.1126/sciadv.adp6040
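Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code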
07 ; 0710 ; 09 ;
Abstract
Machine-learning models are key to modern biology, yet models trained on one dataset are often not generalizable to other datasets from different cohorts or laboratories due to both technical and biological differences. Domain adaptation, a type of transfer learning, alleviates this problem by aligning different datasets so that models can be applied across them. However, most state-of-the-art domain adaptation methods were designed for large-scale data such as images, whereas biological datasets are smaller and have more features, and these are also complex and heterogeneous. This Review discusses domain adaptation methods in the context of such biological data to inform biologists and guide future domain adaptation research. We describe the benefits and challenges of domain adaptation in biological research and critically explore some of its objectives, strengths, and weaknesses. We argue for the incorporation of domain adaptation techniques to the computational biologist's toolkit, with further development of customized approaches.
Pages: 14