Multi-instance multi-label distance metric learning for genome-wide protein function prediction

Cited by: 14
Authors
Xu, Yonghui [1]
Min, Huaqing [2]
Song, Hengjie [2]
Wu, Qingyao [2]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Protein function prediction; Genome wide; Distance metric learning; Machine learning; Multi-instance multi-label learning;
DOI
10.1016/j.compbiolchem.2016.02.011
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Multi-instance multi-label (MIML) learning has proven effective for genome-wide protein function prediction, where each training example is associated with multiple instances as well as multiple class labels. To find an appropriate MIML learning method for this problem, many studies have optimized objective functions in which the dissimilarity between instances is measured using the Euclidean distance. In many real applications, however, the Euclidean distance may fail to capture the intrinsic similarity/dissimilarity in the feature space and the label space. Unlike previous approaches, in this paper we propose a multi-instance multi-label distance metric learning framework (MIMLDML) for genome-wide protein function prediction. Specifically, we learn a Mahalanobis distance to preserve and utilize the intrinsic geometric information of both the feature space and the label space for MIML learning. In addition, we try to deal with sparsely labeled data by giving weight to the labeled data. Extensive experiments on seven real-world organisms covering the biological three-domain system (i.e., archaea, bacteria, and eukaryote; Woese et al., 1990) show that the MIMLDML algorithm is superior to most state-of-the-art MIML learning algorithms. © 2016 Elsevier Ltd. All rights reserved.
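The contrast the abstract draws between the Euclidean distance and a learned Mahalanobis distance can be illustrated with a minimal sketch. It is not the MIMLDML algorithm: the matrix M below is made up rather than learned, and the function names `mahalanobis` and `bag_distance`, as well as the choice of averaging pairwise instance distances to compare two bags, are assumptions for illustration only.

```python
import math

def mahalanobis(x, y, M):
    """Return sqrt((x - y)^T M (x - y)) for a symmetric PSD matrix M."""
    d = [a - b for a, b in zip(x, y)]
    n = len(d)
    # Matrix-vector product M d, then the inner product d . (M d).
    Md = [sum(M[i][j] * d[j] for j in range(n)) for i in range(n)]
    return math.sqrt(sum(d[i] * Md[i] for i in range(n)))

def bag_distance(bag_a, bag_b, M):
    """Hypothetical bag-level distance (an assumption, not from the paper):
    the mean of all pairwise instance distances between two bags."""
    dists = [mahalanobis(a, b, M) for a in bag_a for b in bag_b]
    return sum(dists) / len(dists)

x, y = [1.0, 2.0], [4.0, 6.0]

# With M equal to the identity, the metric reduces to the Euclidean distance.
identity = [[1.0, 0.0], [0.0, 1.0]]
euclid = mahalanobis(x, y, identity)

# A learned metric would supply some PSD matrix M; this one is invented
# and simply stretches the first feature while shrinking the second.
M = [[4.0, 0.0], [0.0, 0.25]]
scaled = mahalanobis(x, y, M)

# Each "bag" is a list of instance vectors (one training example in MIML).
bag_d = bag_distance([[0.0, 0.0]], [[3.0, 4.0]], identity)
```

Under these assumptions, changing M changes which feature differences count as large, which is the degree of freedom a metric learning method would tune against the label information.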
Pages: 30-40
Number of pages: 11