A Transfer Learning Approach for Integrating Biological Data Across Platforms

被引:0
|
作者
Achanta, Hema K. [1 ]
Misganaw, Burook [1 ]
Vidyasagar, M. [1 ]
机构
[1] Univ Texas Dallas, Erik Jonsson Sch Engn & Comp Sci, Richardson, TX 75080 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transfer learning refers to situations where a classifier is trained on one set of data and tested on another set of data that may have an entirely different probability distribution. Biological data derived from diverse platforms, and possibly using diverse technologies, is a natural candidate for applying transfer learning methodologies. In this paper, we adapt the l(1)-norm SVM to fit into the paradigm of Transfer Learning, by using the importance weighting approach. Our aim is to integrate biological data from diverse platforms. To validate our approach, we applied the proposed algorithm to the problem of classifying breast cancer tumors as EstrogenReceptor-positive (ER-positive) or Estrogen-Receptor-negative (ER-negative), which is the first step in personalizing therapy to the patient. The standard approach used in Biology is to convert data to Z-scores, that is, to subtract the mean and divide by the standard deviation. The algorithm proposed here shows better performance than using Z-scores to account for platform variations.
引用
收藏
页码:6695 / 6697
页数:3
相关论文
共 50 条
  • [1] Integrating Biological Data Across Multiple Platforms Using Importance-Weighted Transfer Learning and Applications to Breast Cancer Data Sets
    Achanta, Hema K.
    Misganaw, Burook
    Vidyasagar, M.
    [J]. 2017 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA 2017), 2017, : 955 - 960
  • [2] Systems morphogenetics: Integrating biological data across space, time, and function
    Cheng, KC
    Gittlen, J
    [J]. FASEB JOURNAL, 2005, 19 (04): : A787 - A787
  • [3] Inference of differentiation trajectories by transfer learning across biological processes
    Jumde, Gaurav
    Spanjaard, Bastiaan
    Junker, Jan Philipp
    [J]. CELL SYSTEMS, 2024, 15 (01) : 75 - 82.e5
  • [4] Decomposing Cell Identity for Transfer Learning across Cellular Measurements, Platforms, Tissues, and Species
    Stein-O'Brien, Genevieve L.
    Clark, Brian S.
    Sherman, Thomas
    Zibetti, Cristina
    Hu, Qiwen
    Sealfon, Rachel
    Liu, Sheng
    Qian, Jiang
    Colantuoni, Carlo
    Blackshaw, Seth
    Goff, Loyal A.
    Fertig, Elana J.
    [J]. CELL SYSTEMS, 2019, 8 (05) : 395 - +
  • [5] Integrating biomarkers across omic platforms: an approach to improve stratification of patients with indolent and aggressive prostate cancer
    Murphy, Keefe
    Murphy, Brendan T.
    Boyce, Susie
    Flynn, Louise
    Gilgunn, Sarah
    O'Rourke, Colm J.
    Rooney, Cathy
    Stockmann, Henning
    Walsh, Anna L.
    Finn, Stephen
    O'Kennedy, Richard J.
    O'Leary, John
    Pennington, Stephen R.
    Perry, Antoinette S.
    Rudd, Pauline M.
    Saldova, Radka
    Sheils, Orla
    Shields, Denis C.
    Watson, R. William
    [J]. MOLECULAR ONCOLOGY, 2018, 12 (09) : 1513 - 1525
  • [6] A PROBABILISTIC APPROACH TO DETERMINING BIOLOGICAL STRUCTURE - INTEGRATING UNCERTAIN DATA SOURCES
    ALTMAN, RB
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1995, 42 (06) : 593 - 616
  • [7] Mobility data disaggregation: a transfer learning approach
    Katranji, Mehdi
    Thuillier, Etienne
    Kraiem, Sami
    Moalic, Laurent
    Selem, Fouad Hadj
    [J]. 2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2016, : 1672 - 1677
  • [8] A new, improved and generalizable approach for the analysis of biological data generated by -omic platforms
    Pleasants, A. B.
    Wake, G. C.
    Shorten, P. R.
    Hassell-Sweatman, C. Z. W.
    McLean, C. A.
    Holbrook, J. D.
    Gluckman, P. D.
    Sheppard, A. M.
    [J]. JOURNAL OF DEVELOPMENTAL ORIGINS OF HEALTH AND DISEASE, 2015, 6 (01) : 17 - 26
  • [9] A data-driven framework to new product demand prediction: Integrating product differentiation and transfer learning approach
    Afrin, Kahkashan
    Nepal, Bimal
    Monplaisir, Leslie
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 246 - 257
  • [10] ginmappeR: an unified approach for integrating gene and protein identifiers across biological sequence databases
    Sola, Fernando
    Ayala, Daniel
    Pulido, Marina
    Ayala, Rafael
    Lopez-Cerero, Lorena
    Hernandez, Inma
    Ruiz, David
    [J]. BIOINFORMATICS ADVANCES, 2024, 4 (01):