Predictive Learning with Sparse Heterogeneous Data

Cited by: 0
Authors
Cherkassky, Vladimir [1]
Cai, Feng [1]
Liang, Lichen [2]
Affiliations
[1] Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN 55455 USA
[2] Massachusetts Gen Hosp, Boston, MA 02114 USA
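Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes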
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many applications of machine learning involve sparse and heterogeneous data. For example, estimating predictive (diagnostic) models from patients' data in clinical studies requires effective integration of genetic, clinical, and demographic data. Typically, all heterogeneous inputs are encoded and mapped onto a single feature vector, which is then used to estimate (train) a predictive model. This approach, known as standard inductive learning, is used in most application studies. More recently, several new learning methodologies have emerged. In particular, when training data can be naturally separated into several groups (that is, structured), learning (estimation) for each group can be viewed as a separate task, leading to the Multi-Task Learning framework. Similarly, a setting where the training data is structured but the objective is to estimate a single predictive model (for all groups) leads to Learning with Structured Data and the SVM+ methodology recently proposed by Vapnik. This paper demonstrates the advantages and limitations of these new approaches for modeling heterogeneous data (relative to standard inductive SVM) via empirical comparisons on several publicly available medical data sets.
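The encoding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the record fields (`age`, `sex`, `genotype`, `site`), the one-hot encoding choice, and the helper names `build_vocab` and `encode` are all hypothetical, chosen only to show how heterogeneous inputs might be mapped onto a single feature vector and how the same vectors could be partitioned by group for the structured settings.

```python
from collections import defaultdict

# Hypothetical patient records mixing numeric and categorical fields.
records = [
    {"age": 54.0, "sex": "F", "genotype": "AA", "site": "clinic_a", "label": 1},
    {"age": 61.0, "sex": "M", "genotype": "AG", "site": "clinic_a", "label": 0},
    {"age": 47.0, "sex": "F", "genotype": "GG", "site": "clinic_b", "label": 0},
]

NUMERIC = ["age"]                  # copied into the vector as floats
CATEGORICAL = ["sex", "genotype"]  # one-hot encoded (an assumed choice)


def build_vocab(rows):
    """Collect the sorted set of observed values for each categorical field."""
    return {f: sorted({r[f] for r in rows}) for f in CATEGORICAL}


def encode(row, vocab):
    """Map one heterogeneous record onto a single flat feature vector."""
    vec = [float(row[f]) for f in NUMERIC]
    for f in CATEGORICAL:
        vec.extend(1.0 if row[f] == v else 0.0 for v in vocab[f])
    return vec


vocab = build_vocab(records)

# Standard inductive setting: one pooled training set, group ignored.
X = [encode(r, vocab) for r in records]
y = [r["label"] for r in records]

# Structured setting: the same vectors, partitioned by a group variable
# (here the hypothetical "site" field).
groups = defaultdict(list)
for r, x in zip(records, X):
    groups[r["site"]].append((x, r["label"]))
```

In the standard inductive setting, `X` and `y` would be passed to a single learner such as an SVM. In the Multi-Task Learning setting, each entry of `groups` would be treated as a separate task, while in the SVM+ setting the group membership would inform the estimation of one model for all groups.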
Pages: 3155 / +
Page count: 2