RScan: fast searching structural similarities for structured RNAs in large databases

Cited by: 2

Authors:

Xue, Chenghai [1]

Liu, Guo-Ping

Affiliations:

[1] Tsinghua Univ, Dept Automat, TNLIST, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China

[2] Tsinghua Univ, Dept Automat, TNLIST, Bioinformat Div, Beijing 100084, Peoples R China

[3] Chinese Acad Sci, LCSIS, Inst Automat, Beijing 100080, Peoples R China

[4] Univ Glamorgan, Dept Engn, Pontypridd CF37 1DL, M Glam, Wales

Source:

BMC GENOMICS | 2007 / Volume 8 / Issue 1

Funding:

Wellcome Trust (UK);

Keywords:

DOI:

10.1186/1471-2164-8-257

Chinese Library Classification (CLC) number:

Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];

Discipline classification code:

071005 ; 0836 ; 090102 ; 100705 ;

Abstract:

Background: Many RNAs have evolutionarily conserved secondary structures rather than conserved primary sequences. A growing number of methods focus on structural alignment to find conserved secondary structures and common structural motifs in pairwise or multiple sequences. Searching large genomic databases quickly for structures similar to a structured RNA sequence remains challenging, because existing methods are too slow at that scale. Results: RScan, an implementation of a fast structural alignment algorithm, is proposed for this task. RScan was developed by leveraging the advantages of both hashing algorithms and local alignment algorithms. In our experiments, searching the randomized A. pernix genome for a tRNA and an rRNA took on average only 256 seconds and 832 seconds, respectively, with RScan, compared with 3,178 seconds and 8,951 seconds, respectively, with the existing method RSEARCH. Notably, RScan can handle large database queries: searching human chromosome 21 for structures similar to a microRNA precursor took less than 4 minutes. Conclusion: These results indicate that RScan is a preferable choice for real-life searches for structural similarities to structured RNAs in large databases.
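
The timings quoted in the abstract can be turned into approximate speedup factors. The short sketch below only does that arithmetic; the names `timings` and `speedup` are illustrative and not part of RScan or RSEARCH, and the numbers are the average search times reported above.

```python
# Average search times in seconds, copied from the abstract
# (randomized A. pernix genome). Variable names are illustrative only.
timings = {
    "tRNA": {"RScan": 256, "RSEARCH": 3178},
    "rRNA": {"RScan": 832, "RSEARCH": 8951},
}


def speedup(baseline_seconds, new_seconds):
    """Return how many times faster the new method is than the baseline."""
    return baseline_seconds / new_seconds


# Speedup of RScan relative to RSEARCH for each query type.
speedups = {
    query: speedup(t["RSEARCH"], t["RScan"]) for query, t in timings.items()
}

for query, factor in speedups.items():
    print(f"{query}: RScan is about {factor:.1f}x faster than RSEARCH")
```

On these figures RScan is roughly 12 times faster for the tRNA query and roughly 11 times faster for the rRNA query.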


Pages: 11

