The Framework of Protein Function Prediction Based on Boolean Matrix Decomposition

被引:0
|
作者
Liu, Lin [1 ]
Tang, Lin [2 ]
Tang, Mingjing [3 ]
Zhou, Wei [4 ]
机构
[1] School of Information, Yunnan Normal University, Kunming,650500, China
[2] Key Laboratory of Educational Informatization for Nationalities, Yunnan Normal University, Ministry of Education, Kunming,650500, China
[3] President Office, Yunnan Normal University, Kunming,650500, China
[4] National Pilot School of Software, Yunnan University, Kunming,650091, China
基金
中国国家自然科学基金;
关键词
Clustering algorithms - Boolean functions - Classification (of information) - Forecasting;
D O I
10.7544/issn1000-1239.2019.20180274
中图分类号
学科分类号
摘要
Protein is the most essential and versatile macromolecule of living cells, and thus the research on protein functions is of great significance in decoding the secret of life. Previous researches have suggested that prediction of protein function is essentially a multi-label classification problem. Nonetheless, the large number of protein functional annotation labels brings the huge challenge to various kinds of multi-label classifiers applied to protein function prediction. To achieve more accuracy prediction of protein function by multi-label classifiers, we consider the characteristics of high correlation between protein functional labels, and propose a framework of protein function prediction based on Boolean matrix decomposition (PFP-BMD). Meanwhile, considering the problem of hardly satisfying exact decomposition and column in condition simultaneously of current Boolean matrix decomposition algorithms, an exact Boolean matrix decomposition algorithm based on label clusters is proposed, which realizes the hierarchical extended clustering of labels by the label-associated matrix. What's more, we prove its ability of optimal Boolean matrix decomposition based on related deductions. The experimental results show that this exact Boolean matrix decomposition algorithm possesses considerable advantage in reducing the computational complexity in comparison with existing algorithms. In addition, the application of the proposed algorithm in PFP-BMD can effectively improve the accuracy of protein function prediction, and more importantly, reducing and restoring dimensions in the functional label space of proteins using this algorithm lays the foundation of a more efficient classification of various multi-label classifiers. © 2019, Science Press. All right reserved.
引用
收藏
页码:1020 / 1033
相关论文
共 50 条
  • [1] XOR-based Boolean Matrix Decomposition
    Wicker, Jorg
    Hua, Yan Cathy
    Rebello, Rayner
    Pfahringer, Bernhard
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 638 - 647
  • [2] Extended Boolean Matrix Decomposition
    Lu, Haibing
    Vaidya, Jaideep
    Atluri, Vijayalakshmi
    Hong, Yuan
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 317 - +
  • [3] Boolean function system decomposition
    Bokr, J
    [J]. AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2000, 34 (02) : 30 - 35
  • [5] FastStep: Scalable Boolean Matrix Decomposition
    Araujo, Miguel
    Ribeiro, Pedro
    Faloutsos, Christos
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 461 - 473
  • [6] QBF-Based Boolean Function Bi-Decomposition
    Chen, Huan
    Janota, Mikolas
    Marques-Silva, Joao
    [J]. DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 816 - 819
  • [7] Protein Function Prediction Based on Multiple Networks Collaborative Matrix Factorization
    [J]. Wang, Jun (kingjun@swu.edu.cn), 2017, Science Press (54):
  • [8] Boolean Matrix Decomposition by Formal Concept Sampling
    Osicka, Petr
    Trnecka, Martin
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2243 - 2246
  • [9] Characteristic matrix of covering and its application to Boolean matrix decomposition
    Wang, Shiping
    Zhu, William
    Zhu, Qingxin
    Min, Fan
    [J]. INFORMATION SCIENCES, 2014, 263 : 186 - 197
  • [10] Data Mining Framework for Protein Function Prediction
    Rahman, Shuzlina Abdul
    Hussein, Zeti Azura Mohamed
    Abu Bakar, Azuraliza
    [J]. INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 1009 - +