Scalable embedding of multiple perspectives for indefinite life-science data analysis

被引:1
|
作者
Munch, Maximilian [1 ]
Heilig, Simon [2 ]
Vath, Philipp [2 ]
Schleif, Frank-Michael [2 ]
机构
[1] Univ Groningen, Bernoulli Inst Math Comp Sci & Artificial Intelli, Groningen, Netherlands
[2] Univ Appl Sci Wurzburg Schweinfurt, Dept Comp Sci & Business Informat Syst, Wurzburg, Germany
关键词
Indefinite learning; complex-valued embedding; life science data; multi-perspective embedding; multimodal data;
D O I
10.1109/SSCI50451.2021.9659914
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Life science data analysis frequently encounters particular challenges that cannot be solved with classical techniques from data analytics or machine learning domains. The complex inherent structure of the data and especially the encoding in non-standard ways, e.g., as genome- or protein-sequences, graph structure or histograms, often limit the development of appropriate classification models. To address these limitations, the application of domain-specific expert similarity measures has gained a lot of attention in the past. However, the use of such expert measures suffers from two major drawbacks: (a) there is not one outstanding similarity measure that guarantees success in all application scenarios, and (b) such similarity functions often lead to indefinite data that cannot be processed by classical machine learning methods. In order to tackle both of these limitations, this paper presents a method to embed indefinite life science data with various similarity measures at the same time into a complex-valued vector space. We test our approach on various life science data sets and evaluate the performance against other competitive methods to show its efficiency.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] The burden of multiple sclerosis 2015: Methods of data collection, assessment and analysis of costs, quality of life and symptoms
    Kobelt, Gisela
    Eriksson, Jennifer
    Phillips, Glenn
    Berg, Jenny
    [J]. MULTIPLE SCLEROSIS JOURNAL, 2017, 23 : 4 - 16
  • [42] SLiCE: An open building data model for scalable high-definition life cycle engineering, dynamic impact assessment, and systematic hotspot analysis
    Roeck, Martin
    Passer, Alexander
    Allacker, Karen
    [J]. SUSTAINABLE PRODUCTION AND CONSUMPTION, 2024, 45 : 450 - 463
  • [43] Missing not at random in end of life care studies: multiple imputation and sensitivity analysis on data from the ACTION study
    Carreras, Giulia
    Miccinesi, Guido
    Wilcock, Andrew
    Preston, Nancy
    Nieboer, Daan
    Deliens, Luc
    Groenvold, Mogensm
    Lunder, Urska
    van der Heide, Agnes
    Baccini, Michela
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2021, 21 (01)
  • [44] Missing not at random in end of life care studies: multiple imputation and sensitivity analysis on data from the ACTION study
    Giulia Carreras
    Guido Miccinesi
    Andrew Wilcock
    Nancy Preston
    Daan Nieboer
    Luc Deliens
    Mogensm Groenvold
    Urska Lunder
    Agnes van der Heide
    Michela Baccini
    [J]. BMC Medical Research Methodology, 21
  • [45] Using Data Science to Characterize Patterns in Cardiovascular Clinical Data: Topological Analysis of Baseline Characteristics in Patients With Relapsed/Refractory Multiple Myeloma Enrolled in Carfilzomib Trials
    Quach, Hang
    Ludwig, Heinz
    Chari, Ajai
    Richter, Joshua
    Goldrick, Amanda
    Abbasi, Siddique
    Gnacadja, Gilles
    Mikhael, Joseph
    [J]. CLINICAL LYMPHOMA MYELOMA & LEUKEMIA, 2022, 22 : S407 - S408
  • [46] Free light chain analysis: Is it useful to monitor patients undergoing haematological stem cell therapy for multiple myeloma? An analysis of "real life" data
    Willenbacher, E.
    Gastl, G.
    Nachbaur, D.
    Willenbacher, W.
    [J]. BONE MARROW TRANSPLANTATION, 2008, 41 : S184 - S184
  • [47] Determining the Power Rate of Change of 353 Plant Inverters Time-series Data Across Multiple Climate Zones, Using a Month-by-Month Data Science Analysis
    Curran, Alan J.
    Hu, Yang
    Haddadian, Rojiar
    Braid, Jennifer L.
    Meakin, David
    Peshek, Timothy J.
    French, Roger H.
    [J]. 2017 IEEE 44TH PHOTOVOLTAIC SPECIALIST CONFERENCE (PVSC), 2017, : 1927 - 1932
  • [48] Linked Patient-Reported Outcomes Data From Patients With Multiple Sclerosis Recruited on an Open Internet Platform to Health Care Claims Databases Identifies a Representative Population for Real-Life Data Analysis in Multiple Sclerosis
    Risson, Valery
    Ghodge, Bhaskar
    Bonzani, Ian C.
    Korn, Jonathan R.
    Medin, Jennie
    Saraykar, Tanmay
    Sengupta, Souvik
    Saini, Deepanshu
    Olson, Melvin
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (09)
  • [49] Impact of Visceral Obesity on Clinical Outcome and Quality of Life for Patients with Multiple Myeloma: A Secondary Data Analysis of STaMINA (BMT CTN 0702) Trial
    Malek, Ehsan
    Kort, Jeries
    Metheny, Leland
    Fu, Pingfu
    Li, Gen
    Hari, Parameswaran
    Efebera, Yvonne
    Callander, Natalie S.
    Qazilbash, Muzaffar H.
    Giralt, Sergio
    Krishnan, Amrita
    Stadtmauer, Edward A.
    Lazarus, Hillard M.
    [J]. TRANSPLANTATION AND CELLULAR THERAPY, 2024, 30 (07):
  • [50] Predictors of six-month change in health-related quality of life in people with multiple sclerosis: A secondary data analysis of a randomized controlled trial
    Patt, Nadine
    Kupjetz, Marie
    Schlagheck, Marit Lea
    Hersche, Ruth
    Joisten, Niklas
    Kool, Jan
    Gonzenbach, Roman
    Nigg, Claudio R.
    Zimmer, Philipp
    Bansi, Jens
    [J]. MULTIPLE SCLEROSIS AND RELATED DISORDERS, 2024, 90