Identifying Hosts of Families of Viruses: A Machine Learning Approach

被引:3
|
作者
Raj, Anil [1 ]
Dewar, Michael
Palacios, Gustavo [2 ]
Rabadan, Raul [3 ]
Wiggins, Christopher H. [1 ]
机构
[1] Columbia Univ, Dept Appl Phys & Appl Math, Ctr Computat Biol & Bioinformat, New York, NY 10027 USA
[2] Columbia Univ, Ctr Infect & Immun, Mailman Sch Publ Hlth, New York, NY USA
[3] Columbia Univ, Coll Phys & Surg, Dept Biomed Informat, Ctr Computat Biol & Bioinformat, New York, NY USA
来源
PLOS ONE | 2011年 / 6卷 / 12期
基金
美国国家卫生研究院;
关键词
RNA-BINDING; 2C;
D O I
10.1371/journal.pone.0027631
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying emerging viral pathogens and characterizing their transmission is essential to developing effective public health measures in response to an epidemic. Phylogenetics, though currently the most popular tool used to characterize the likely host of a virus, can be ambiguous when studying species very distant to known species and when there is very little reliable sequence information available in the early stages of the outbreak of disease. Motivated by an existing framework for representing biological sequence information, we learn sparse, tree-structured models, built from decision rules based on subsequences, to predict viral hosts from protein sequence data using popular discriminative machine learning tools. Furthermore, the predictive motifs robustly selected by the learning algorithm are found to show strong host-specificity and occur in highly conserved regions of the viral proteome.
引用
收藏
页数:8
相关论文
共 50 条
  • [2] Identifying novel oncogenes: A machine learning approach
    Ambuj Kumar
    Vidya Rajendran
    Rao Sethumadhavan
    Rituraj Purohit
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2013, 5 : 241 - 246
  • [3] Identifying Zeolite Frameworks with a Machine Learning Approach
    Yang, Shujiang
    Lach-hab, Mohammed
    Vaisman, Iosif I.
    Blaisten-Barojas, Estela
    [J]. JOURNAL OF PHYSICAL CHEMISTRY C, 2009, 113 (52): : 21721 - 21725
  • [4] Identifying Novel Oncogenes: A Machine Learning Approach
    Kumar, Ambuj
    Rajendran, Vidya
    Sethumadhavan, Rao
    Purohit, Rituraj
    [J]. INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2013, 5 (04) : 241 - 246
  • [5] Identifying Cislunar Orbital Families via Machine Learning on Light Curves
    Badura, Gregory P.
    Deblasio, Dan
    Najera, Aryzbe
    Chavez-Lopez, Ana C.
    Velez-Reyes, Miguel
    Un, Nathan
    Shimane, Yuri
    Ho, Koki
    [J]. JOURNAL OF THE ASTRONAUTICAL SCIENCES, 2024, 71 (03):
  • [6] A Machine Learning Approach to Identifying Changes in Suicidal Language
    Pestian, John
    Santel, Daniel
    Sorter, Michael
    Bayram, Ulya
    Connolly, Brian
    Glauser, Tracy
    DelBello, Melissa
    Tamang, Suzanne
    Cohen, Kevin
    [J]. SUICIDE AND LIFE-THREATENING BEHAVIOR, 2020, 50 (05) : 939 - 947
  • [7] A machine learning approach to identifying different types of uncertainty
    Saltzman, Bennett
    Yung, Julieta
    [J]. ECONOMICS LETTERS, 2018, 171 : 58 - 62
  • [8] Identifying and classifying social groups: A machine learning approach
    Roffilli, Matteo
    Lomi, Alessandro
    [J]. DATA SCIENCE AND CLASSIFICATION, 2006, : 149 - +
  • [9] Identifying Politically Connected Firms: A Machine Learning Approach
    Titl, Vitezslav
    Mazrekaj, Deni
    Schiltz, Fritz
    [J]. OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 2024, 86 (01) : 137 - 155
  • [10] Identifying Disease - Treatment Relations using Machine Learning Approach
    Keerrthega, M. C.
    Thenmozhi, D.
    [J]. FOURTH INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTER SCIENCE & ENGINEERING (ICRTCSE 2016), 2016, 87 : 306 - 315