An intelligent computational model for prediction of promoters and their strength via natural language processing

被引:15
|
作者
Tahir, Muhammad [1 ,2 ]
Hayat, Maqsood [1 ]
Gul, Sarah [4 ]
Chong, Kil To [2 ,3 ]
机构
[1] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, KP, Pakistan
[2] Chonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[3] Chonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
[4] Int Islamic Univ, Dept Biol Sci, FBAS, Islamabad, Pakistan
基金
新加坡国家研究基金会;
关键词
Promoters; Convolution neural network (CNN); Natural language processing; DNA; word2vec; SEQUENCE-BASED PREDICTOR; RECOMBINATION SPOTS; ENSEMBLE CLASSIFIER; PROTEIN TYPES; IDENTIFICATION; SITES; FEATURES; SPACE; DISCRIMINATION; TRINUCLEOTIDE;
D O I
10.1016/j.chemolab.2020.104034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In DNA, a promoter is an essential part of genes that controls the transcription of specific genes in a particular tissue or cells. The combination of RNA polymerase and a number of various proteins named "sigma-factors" can define the transcription start site (TSS) by inducing RNA holoenzyme. Further, Promoter is categorized into strong and weak promoters on the basis of promoter strength. Owing to exponential increase of RNA/DNA and protein samples in the post-genomic era, developing a simple and efficient sequential-based intelligent computational model for the discrimination of promoters is a challenging job. An intelligent computational model namely: 2L-iPSW(word2vec) was introduced for discrimination of promoters and their strength, in this regard. Machine learning and Deep learning algorithms in conjunction with natural language processing method i.e., "word2vec" are used. The proposed computational model 2L-iPSW(word2vec) achieved 91.42% of accuracy for 1st layer contains promoters and non-promoters which is 8.29% higher than the existing model, whereas 82.42% of accuracy for 2nd layer identifies strong promoter and weak promoter which is 11.22% advanced than the present model. Proposed 2L-iPSW(word2vec) model obtained efficient success rates than the present models in terms of all assessment metrics. It is thus greatly observed that the 2L-iPSW(word2vec) model will lead a useful tool for academic research on promoter identification.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Computational lexicon: The central structure in natural language processing systems
    Shamsfard, M.
    Abdollahzadeh, Barforoush, A.
    Amirkabir (Journal of Science and Technology), 2001, 12 (48): : 449 - 462
  • [42] Analyzing learner language: towards a flexible natural language processing architecture for intelligent language tutors
    Amaral, Luiz
    Meurers, Detmar
    Ziai, Ramon
    COMPUTER ASSISTED LANGUAGE LEARNING, 2011, 24 (01) : 1 - 16
  • [43] Graph-based intelligent accident hazard ontology using natural language processing for tracking, prediction, and learning
    Hong, Eunbin
    Lee, Seungyeon
    Kim, Hayoung
    Park, Jeongeun
    Seo, Myoung Bae
    Yi, June-Seong
    AUTOMATION IN CONSTRUCTION, 2024, 168
  • [44] Cockpit-Llama: Driver Intent Prediction in Intelligent Cockpit via Large Language Model
    Chen, Yi
    Li, Chengzhe
    Yuan, Qirui
    Li, Jinyu
    Fan, Yuze
    Ge, Xiaojun
    Li, Yun
    Gao, Fei
    Zhao, Rui
    SENSORS, 2025, 25 (01)
  • [45] A MULTIPROCESSING MODEL OF NATURAL-LANGUAGE PROCESSING
    DEY, P
    HAYASHI, Y
    THEORETICAL LINGUISTICS, 1990, 16 (01) : 11 - 23
  • [46] Language Without Words: A Pointillist Model for Natural Language Processing
    Song, Peiyou
    Shu, Anhei
    Phipps, David
    Tiwari, Mohit
    Wallach, Dan S.
    Crandall, Jedidiah R.
    Luger, George F.
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 11 - 15
  • [47] A Model of Language Processing as Hierarchic Sequential Prediction
    van Schijndel, Marten
    Exley, Andy
    Schuler, William
    TOPICS IN COGNITIVE SCIENCE, 2013, 5 (03) : 522 - 540
  • [48] Supporting Collaborative Modeling via Natural Language Processing
    Aydemir, Fatma Basak
    Dalpiaz, Fabiano
    CONCEPTUAL MODELING, ER 2020, 2020, 12400 : 223 - 238
  • [49] INTELLIGENT NATURAL-LANGUAGE INTERFACE FOR A SIGNAL-PROCESSING SYSTEM
    MORRIS, DT
    ASUMU, DE
    IEE PROCEEDINGS-E COMPUTERS AND DIGITAL TECHNIQUES, 1990, 137 (05): : 371 - 379
  • [50] Research on Natural Language Recognition Processing System in Computer Intelligent Graphics
    Wang, Shengyao
    PROCEEDINGS OF 2023 INTERNATIONAL CONFERENCE ON AI AND METAVERSE IN SUPPLY CHAIN MANAGEMENT, AIMSCM 2023, 2023,