Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry

被引:13
|
作者
Cao, Han [1 ]
Meyer-Lindenberg, Andreas [1 ]
Schwarz, Emanuel [1 ]
机构
[1] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Psychiat & Psychotherapy, D-68159 Mannheim, Germany
关键词
multi-task learning; machine learning; biomarker discovery; psychiatry; GENE-EXPRESSION; MEGA-ANALYSIS; SCHIZOPHRENIA; PROFILES; DISEASES; FUTURE;
D O I
10.3390/ijms19113387
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The requirement of innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Machine Learning Strategies for Analyzing Road Traffic Accident
    Gupta, Sumit
    Kumar, Awadhesh
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT I, 2024, 14531 : 394 - 405
  • [22] Evaluation of risk analysis process in medical big data using Machine Learning
    Rajeshkumar, K.
    Dhanasekaran, S.
    Vasudevan, V.
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [23] Machine learning and big data based clinical EBM evidence acquisition and evaluation
    Jin, Meng
    Jiang, Jingsi
    Zhang, Kai
    Bao, Xiaoyuan
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 1275 - 1278
  • [24] Evaluation of echosounder data preparation strategies for modern machine learning models
    Ordonez, Alba
    Utseth, Ingrid
    Brautaset, Olav
    Korneliussen, Rolf
    Handegard, Nils Olav
    FISHERIES RESEARCH, 2022, 254
  • [25] SHM data anomaly classification using machine learning strategies: A comparative study
    Chou, Jau-Yu
    Fu, Yuguang
    Huang, Shieh-Kung
    Chang, Chia-Ming
    SMART STRUCTURES AND SYSTEMS, 2022, 29 (01) : 77 - 91
  • [26] Machine learning on big data for future computing
    Jeong, Young-Sik
    Hassan, Houcine
    Sangaiah, Arun Kumar
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (06): : 2925 - 2929
  • [27] Machine Learning Challenges in Big Data Era
    Veganzones-Bodon, Miguel
    DYNA, 2019, 94 (05): : 478 - 479
  • [28] Machine learning on big data for future computing
    Young-Sik Jeong
    Houcine Hassan
    Arun Kumar Sangaiah
    The Journal of Supercomputing, 2019, 75 : 2925 - 2929
  • [29] Machine Learning Meets Big Spatial Data
    Sabek, Ibrahim
    Mokbel, Mohamed F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 1982 - 1985
  • [30] Machine Learning for Astronomical Big Data Processing
    Xu, Long
    Yan, Yihua
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,