Discovering consensus patterns in biological databases

被引:0
|
作者
ElTabakh, Mohamed Y. [1 ]
Aref, Walid G. [1 ]
Ouzzani, Mourad [2 ]
Ali, Mohamed H. [1 ]
机构
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47906 USA
[2] Purdue Univ, Cyber Ctr, W Lafayette, IN 47906 USA
来源
关键词
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Consensus patterns, like motifs and tandem repeats, are highly conserved patterns with very few substitutions where no gaps are allowed. In this paper, we present a progressive hierarchical clustering technique for discovering consensus patterns in biological databases over a certain length range. This technique can discover consensus patterns with various requirements by applying a post-processing phase. The progressive nature of the hierarchical clustering algorithm makes it scalable and efficient. Experiments to discover motifs and tandem repeats on real biological databases show significant performance gain over non-progressive clustering techniques.
引用
收藏
页码:170 / +
页数:3
相关论文
共 50 条
  • [1] Discovering frequent structured patterns from string databases: An application to biological sequences
    Palopoli, L
    Terracina, G
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2002, 2534 : 34 - 46
  • [2] Discovering relational patterns across multiple databases
    Zhu, Xingquan
    Wu, Xindong
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 701 - +
  • [3] On discovering "potentially useful" patterns from databases
    Xie, Ying
    Johnsten, Tom
    Raghavan, Vijay V.
    Ramachandran, K.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 494 - +
  • [4] Discovering Partial Periodic Spatial Patterns in Spatiotemporal Databases
    Kiran, R. Uday
    Saideep, C.
    Zettsu, Koji
    Toyoda, Masashi
    Kitsuregawa, Masaru
    Reddy, P. Krishna
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 233 - 238
  • [5] An Efficient Approach to Discovering Sequential Patterns in Large Databases
    Yen, Show-Jane
    Cho, Chung-Wen
    [J]. LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 685 - 690
  • [6] Discovering periodic cluster patterns in event sequence databases
    Guisheng Chen
    Zhanshan Li
    [J]. Applied Intelligence, 2022, 52 : 15387 - 15404
  • [7] Discovering probabilistically weighted sequential patterns in uncertain databases
    Islam, Md Sahidul
    Kar, Pankaj Chandra
    Samiullah, Md
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    [J]. APPLIED INTELLIGENCE, 2023, 53 (06) : 6525 - 6553
  • [8] Discovering Periodic-Frequent Patterns in Transactional Databases
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    Lee, Young-Hoo
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 242 - 253
  • [9] Discovering Transitional Patterns and Their Significant Milestones in Transaction Databases
    Wan, Qian
    An, Aijun
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (12) : 1692 - 1707
  • [10] Discovering probabilistically weighted sequential patterns in uncertain databases
    Md Sahidul Islam
    Pankaj Chandra Kar
    Md Samiullah
    Chowdhury Farhan Ahmed
    Carson Kai-Sang Leung
    [J]. Applied Intelligence, 2023, 53 : 6525 - 6553