Comparative Evaluation of Machine Learning Strategies for Analyzing Big Data in Psychiatry

Cited by: 13
Authors
Cao, Han [1]
Meyer-Lindenberg, Andreas [1]
Schwarz, Emanuel [1]
Affiliations
[1] Heidelberg Univ, Med Fac Mannheim, Cent Inst Mental Hlth, Dept Psychiat & Psychotherapy, D-68159 Mannheim, Germany
Keywords
multi-task learning; machine learning; biomarker discovery; psychiatry; GENE-EXPRESSION; MEGA-ANALYSIS; SCHIZOPHRENIA; PROFILES; DISEASES; FUTURE;
DOI
10.3390/ijms19113387
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Innovative big data analytics has become a critical success factor for research in biological psychiatry. Integrative analyses across distributed data resources are considered essential for untangling the biological complexity of mental illnesses. However, little is known about algorithm properties for such integrative machine learning. Here, we performed a comparative analysis of eight machine learning algorithms for identification of reproducible biological fingerprints across data sources, using five transcriptome-wide expression datasets of schizophrenia patients and controls as a use case. We found that multi-task learning (MTL) with network structure (MTL_NET) showed superior accuracy compared to other MTL formulations as well as single task learning, and tied performance with support vector machines (SVM). Compared to SVM, MTL_NET showed significant benefits regarding the variability of accuracy estimates, as well as its robustness to cross-dataset and sampling variability. These results support the utility of this algorithm as a flexible tool for integrative machine learning in psychiatry.
Pages: 15