PlasmidHunter: accurate and fast prediction of plasmid sequences using gene content profile and machine learning

Cited by: 2
Authors
Tian, Renmao [1]
Zhou, Jizhong [2]
Imanian, Behzad [1, 3]
Affiliations
[1] IIT, Inst Food Safety & Hlth, 6502 S Archer Rd, Bedford Pk, IL 60501 USA
[2] Univ Oklahoma, Inst Environm Genom, Dept Microbiol & Plant Biol, 101 David Boren Blvd, Norman, OK 73019 USA
[3] IIT, Food Sci & Nutr Dept, 10 West 35th St, Chicago, IL 60616 USA
Keywords
artificial intelligence (AI); machine learning (ML); plasmid prediction; genomic sequencing; RESISTANCE
DOI
10.1093/bib/bbae322
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Plasmids are extrachromosomal DNA molecules found in microorganisms. They often carry beneficial genes that help bacteria adapt to harsh conditions. Plasmids are also important tools in genetic engineering, gene therapy, and drug production. However, it can be difficult to distinguish plasmid sequences from chromosomal sequences in genomic and metagenomic data. Here, we present PlasmidHunter, a new tool that uses machine learning to predict plasmid sequences based on gene content profiles. In benchmark tests on both simulated contigs and real metagenomic plasmidome data, PlasmidHunter achieves high accuracy (up to 97.6%) and high speed, outperforming existing tools.
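
The idea of classifying a sequence from its gene content profile can be illustrated with a toy sketch. This is not PlasmidHunter's implementation: the abstract does not state which classifier or gene database the tool uses, so a Bernoulli naive Bayes model is swapped in here purely for illustration, and the gene-family vocabulary, the training contigs, and the function names `profile`, `train`, and `predict` are all hypothetical.

```python
import math

# Hypothetical gene-family vocabulary (an assumption for this sketch).
# A real profile would come from annotating each contig's predicted genes.
GENE_FAMILIES = ["repA", "traG", "mobA", "parA", "rpoB", "gyrA", "dnaA", "recA"]


def profile(genes):
    """Encode a contig as a presence/absence vector over GENE_FAMILIES."""
    present = set(genes)
    return [1 if g in present else 0 for g in GENE_FAMILIES]


def train(examples):
    """Fit a Bernoulli naive Bayes model with Laplace smoothing.

    examples: list of (gene_list, label) pairs.
    Returns {label: (log_prior, per-gene presence probabilities)}.
    """
    totals = {}    # label -> number of training contigs
    presence = {}  # label -> per-gene presence counts
    for genes, label in examples:
        totals[label] = totals.get(label, 0) + 1
        sums = presence.setdefault(label, [0] * len(GENE_FAMILIES))
        for i, bit in enumerate(profile(genes)):
            sums[i] += bit
    n_all = sum(totals.values())
    model = {}
    for label, n in totals.items():
        probs = [(c + 1) / (n + 2) for c in presence[label]]
        model[label] = (math.log(n / n_all), probs)
    return model


def predict(model, genes):
    """Return the label with the highest log-posterior score."""
    vec = profile(genes)

    def score(label):
        log_prior, probs = model[label]
        return log_prior + sum(
            math.log(p if bit else 1.0 - p) for bit, p in zip(vec, probs)
        )

    return max(model, key=score)


# Made-up training contigs, for illustration only.
TRAINING = [
    (["repA", "traG", "mobA"], "plasmid"),
    (["repA", "parA", "mobA"], "plasmid"),
    (["traG", "parA"], "plasmid"),
    (["rpoB", "gyrA", "dnaA"], "chromosome"),
    (["dnaA", "recA", "rpoB"], "chromosome"),
    (["gyrA", "recA", "parA"], "chromosome"),
]
model = train(TRAINING)
```

Calling `predict(model, ["repA", "mobA"])` on this toy model scores a contig carrying replication and mobilization genes against both classes and returns the more likely label. A real tool would use a far larger gene vocabulary and training set.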
Pages: 9