Predicting host tropism of influenza A virus proteins using random forest

被引:46
|
作者
Eng, Christine L. P. [1 ]
Tong, Joo Chuan [1 ,2 ]
Tan, Tin Wee [1 ]
机构
[1] Natl Univ Singapore, Yong Loo Lin Sch Med, Dept Biochem, Singapore 117599, Singapore
[2] ASTAR, Inst High Performance Comp, Singapore 138632, Singapore
关键词
AMINO-ACID; RECEPTOR-BINDING; MOLECULAR-DYNAMICS; HUMAN INFECTION; HEMAGGLUTININ; ORIGIN; RECOGNITION; DETERMINANTS; GANGLIOSIDES; POLYMERASE;
D O I
10.1186/1755-8794-7-S3-S1
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Majority of influenza A viruses reside and circulate among animal populations, seldom infecting humans due to host range restriction. Yet when some avian strains do acquire the ability to overcome species barrier, they might become adapted to humans, replicating efficiently and causing diseases, leading to potential pandemic. With the huge influenza A virus reservoir in wild birds, it is a cause for concern when a new influenza strain emerges with the ability to cross host species barrier, as shown in light of the recent H7N9 outbreak in China. Several influenza proteins have been shown to be major determinants in host tropism. Further understanding and determining host tropism would be important in identifying zoonotic influenza virus strains capable of crossing species barrier and infecting humans. Results: In this study, computational models for 11 influenza proteins have been constructed using the machine learning algorithm random forest for prediction of host tropism. The prediction models were trained on influenza protein sequences isolated from both avian and human samples, which were transformed into amino acid physicochemical properties feature vectors. The results were highly accurate prediction models (ACC>96.57; AUC>0.980; MCC>0.916) capable of determining host tropism of individual influenza proteins. In addition, features from all 11 proteins were used to construct a combined model to predict host tropism of influenza virus strains. This would help assess a novel influenza strain's host range capability. Conclusions: From the prediction models constructed, all achieved high prediction performance, indicating clear distinctions in both avian and human proteins. When used together as a host tropism prediction system, zoonotic strains could potentially be identified based on different protein prediction results. Understanding and predicting host tropism of influenza proteins lay an important foundation for future work in constructing computation models capable of directly predicting interspecies transmission of influenza viruses.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Predicting host tropism of influenza A virus proteins using random forest
    Christine LP Eng
    Joo Chuan Tong
    Tin Wee Tan
    [J]. BMC Medical Genomics, 7
  • [2] Predicting Zoonotic Risk of Influenza A Viruses from Host Tropism Protein Signature Using Random Forest
    Eng, Christine L. P.
    Tong, Joo Chuan
    Tan, Tin Wee
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (06)
  • [3] A Study on Host Tropism Determinants of Influenza Virus Using Machine Learning
    Kwon, Eunmi
    Cho, Myeongji
    Kim, Hayeon
    Son, Hyeon S.
    [J]. CURRENT BIOINFORMATICS, 2020, 15 (02) : 121 - 134
  • [4] Computational identification of physicochemical signatures for host tropism of influenza A virus
    Yin, Rui
    Zhou, Xinrui
    Zheng, Jie
    Kwoh, Chee Keong
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2018, 16 (06)
  • [5] A random forest model for predicting exosomal proteins using evolutionary information and motifs
    Arora, Akanksha
    Patiyal, Sumeet
    Sharma, Neelam
    Devi, Naorem Leimarembi
    Kaur, Dashleen
    Raghava, Gajendra P. S.
    [J]. PROTEOMICS, 2024, 24 (06)
  • [6] Avian Influenza Virus Tropism in Humans
    AbuBakar, Umarqayum
    Amrani, Lina
    Kamarulzaman, Farah Ayuni
    Karsani, Saiful Anuar
    Hassandarvish, Pouya
    Khairat, Jasmine Elanie
    [J]. VIRUSES-BASEL, 2023, 15 (04):
  • [7] Viral tropism and the pathogenesis of influenza in the mammalian host
    Mansfield, Keith G.
    [J]. AMERICAN JOURNAL OF PATHOLOGY, 2007, 171 (04): : 1089 - 1092
  • [8] A Random Forest Model for Predicting Allosteric and Functional Sites on Proteins
    Chen, Ava S-Y.
    Westwood, Nicholas J.
    Brear, Paul
    Rogers, Graeme W.
    Mavridis, Lazaros
    Mitchell, John B. O.
    [J]. MOLECULAR INFORMATICS, 2016, 35 (3-4) : 125 - 135
  • [9] Predicting student dropouts using random forest
    Devi, Kapila
    Ratnoo, Saroj
    [J]. JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2022, 25 (07) : 1579 - 1590
  • [10] Differential use of importin-α isoforms governs cell tropism and host adaptation of influenza virus
    Gülsah Gabriel
    Karin Klingel
    Anna Otte
    Swantje Thiele
    Ben Hudjetz
    Gökhan Arman-Kalcek
    Martina Sauter
    Tatiana Shmidt
    Franziska Rother
    Sigrid Baumgarte
    Björn Keiner
    Enno Hartmann
    Michael Bader
    George G. Brownlee
    Ervin Fodor
    Hans-Dieter Klenk
    [J]. Nature Communications, 2