Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping

被引:11
|
作者
Raimondi, Daniele [1 ,2 ,3 ,4 ]
Orlando, Gabriele [1 ,2 ,4 ]
Moreau, Yves [3 ,5 ]
Vranken, Wim F. [1 ,2 ]
机构
[1] ULB VUB, Interuniv Inst Bioinformat Brussels, B-1050 Brussels, Belgium
[2] Vrije Univ Brussel, Struct Biol Brussels, B-1050 Brussels, Belgium
[3] Katholieke Univ Leuven, ESAT STADIUS, B-3001 Leuven, Belgium
[4] Univ Libre Bruxelles, Machine Learning Grp, B-1050 Brussels, Belgium
[5] Imec, B-3001 Leuven, Belgium
关键词
CONTACT PREDICTION; SEQUENCE; KERNELS; IDENTIFICATION; PROFILES; SEARCH;
D O I
10.1093/bioinformatics/bty309
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Evolutionary information is crucial for the annotation of proteins in bioinformatics. The amount of retrieved homologs often correlates with the quality of predicted protein annotations related to structure or function. With a growing amount of sequences available, fast and reliable methods for homology detection are essential, as they have a direct impact on predicted protein annotations. Results: We developed a discriminative, alignment-free algorithm for homology detection with quasi-linear complexity, enabling theoretically much faster homology searches. To reach this goal, we convert the protein sequence into numeric biophysical representations. These are shrunk to a fixed length using a novel vector quantization method which uses a Discrete Cosine Transform compression. We then compute, for each compressed representation, similarity scores between proteins with the Dynamic Time Warping algorithm and we feed them into a Random Forest. The WARP performances are comparable with state of the art methods.
引用
收藏
页码:3118 / 3125
页数:8
相关论文
共 50 条
  • [41] Dynamic time warping in phoneme modeling for fast pronunciation error detection
    Miodonska, Zuzanna
    Bugdol, Marcin D.
    Krecichwost, Michal
    COMPUTERS IN BIOLOGY AND MEDICINE, 2016, 69 : 277 - 285
  • [42] A fast algorithm for n-D discrete cosine transform
    王智顺
    李文化
    何振亚
    Science in China(Series E:Technological Sciences), 1998, (01) : 45 - 54
  • [44] Fast recursive algorithm for the discrete cosine transform with arbitrary length
    School of Physics and Electrical Information Science, Ningxia University, Yinchuan 750021, China
    Dianzi Yu Xinxi Xuebao, 2007, 2 (418-420):
  • [45] Fast Two-Dimensional Smoothing with Discrete Cosine Transform
    Lyubin, Pavel
    Shchetinin, Eugeny
    DISTRIBUTED COMPUTER AND COMMUNICATION NETWORKS, DCCN 2016, 2016, 678 : 646 - 656
  • [46] A fast algorithm for n-D discrete cosine transform
    Wang, ZS
    Li, WH
    He, ZY
    SCIENCE IN CHINA SERIES E-TECHNOLOGICAL SCIENCES, 1998, 41 (01): : 45 - 54
  • [47] On fast algorithms for computing the inverse modified discrete cosine transform
    Fan, YH
    Madisetti, VK
    Mersereau, RM
    IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (03) : 61 - 64
  • [48] A Fast Modified Signed Discrete Cosine Transform For Image Compression
    Haweel, Reem T.
    El-Kilani, Wail S.
    Ramadan, Hassan H.
    2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2014, : 56 - 61
  • [49] Fast polynomial multiplication and convolutions related to the discrete cosine transform
    Baszenski, G
    Tasche, M
    LINEAR ALGEBRA AND ITS APPLICATIONS, 1997, 252 : 1 - 25
  • [50] FAST ALGORITHMS FOR THE 2-D DISCRETE COSINE TRANSFORM
    KAMANGAR, FA
    RAO, KR
    IEEE TRANSACTIONS ON COMPUTERS, 1982, 31 (09) : 899 - 906