Efficient and scalable motif discovery using graph-based search

Cited by: 0
Authors
Sinha, Arnit U. [1]
Bhatnagar, Raj [1]
Affiliations
[1] Univ Cincinnati, Dept ECECS, Cincinnati, OH 45221 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Identifying short repeated patterns (motifs) in genomic sequences is central to many problems in bioinformatics. The promoter regions of genes are an important target when searching for such motifs, which there correspond to transcription factor binding sites. We present a new algorithm, Mottice, for detecting potential binding sites present in a given set of genomic sequences. The algorithm performs an informed search by organizing the input patterns and their variants in a graph, a strategy that leads efficiently to the desired solutions. The background is modeled as a Markov process, and a composite score function is used. We demonstrate the performance of the algorithm by testing it on real-life data sets of yeast and human promoter sequences. Comparing it with several popular algorithms, we found that the other algorithms work well on lower organisms such as yeast, but only a couple of them work well on human data. We show that our algorithm scales linearly with the size of the input dataset. We also compare its computational efficiency with that of other algorithms and show that it runs faster across different datasets and motif sizes.
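The abstract does not state the order of the Markov background model or the form of the composite score. As a minimal sketch only, assuming a first-order model with add-one smoothing and a uniform start distribution (all assumptions made here for illustration, not details of Mottice), a background model can be estimated from a set of sequences and used to assign a log-probability to a candidate pattern. The function names `train_markov_background` and `background_log_prob` are hypothetical.

```python
import math

ALPHABET = "ACGT"


def train_markov_background(sequences, pseudocount=1.0):
    """Estimate first-order transition probabilities P(b | a).

    Assumption: first-order model with additive smoothing; the abstract
    does not say which order or smoothing Mottice uses.
    """
    # Start every transition count at the pseudocount so no probability is zero.
    counts = {a: {b: pseudocount for b in ALPHABET} for a in ALPHABET}
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            # Skip transitions involving characters outside A/C/G/T (e.g. N).
            if a in counts and b in counts[a]:
                counts[a][b] += 1
    probs = {}
    for a in ALPHABET:
        total = sum(counts[a].values())
        probs[a] = {b: counts[a][b] / total for b in ALPHABET}
    return probs


def background_log_prob(kmer, probs):
    """Log-probability of a k-mer under the background model.

    Assumption: the first base is drawn from a uniform distribution.
    """
    logp = math.log(1.0 / len(ALPHABET))
    for a, b in zip(kmer, kmer[1:]):
        logp += math.log(probs[a][b])
    return logp


if __name__ == "__main__":
    # Toy input, not data from the paper.
    background = train_markov_background(["ACACACAC", "ACGTACGT"])
    for candidate in ("ACAC", "GGGG"):
        print(candidate, background_log_prob(candidate, background))
```

A pattern that is unlikely under the background yet occurs often in the input sequences is a more interesting motif candidate; how Mottice combines such a term with other terms in its composite score is not described in this record.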
Pages: 197 / +
Page count: 2