Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping

被引:10
|
作者
Raimondi, Daniele [1 ,2 ,3 ,4 ]
Orlando, Gabriele [1 ,2 ,4 ]
Moreau, Yves [3 ,5 ]
Vranken, Wim F. [1 ,2 ]
机构
[1] ULB VUB, Interuniv Inst Bioinformat Brussels, B-1050 Brussels, Belgium
[2] Vrije Univ Brussel, Struct Biol Brussels, B-1050 Brussels, Belgium
[3] Katholieke Univ Leuven, ESAT STADIUS, B-3001 Leuven, Belgium
[4] Univ Libre Bruxelles, Machine Learning Grp, B-1050 Brussels, Belgium
[5] Imec, B-3001 Leuven, Belgium
关键词
CONTACT PREDICTION; SEQUENCE; KERNELS; IDENTIFICATION; PROFILES; SEARCH;
D O I
10.1093/bioinformatics/bty309
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Evolutionary information is crucial for the annotation of proteins in bioinformatics. The amount of retrieved homologs often correlates with the quality of predicted protein annotations related to structure or function. With a growing amount of sequences available, fast and reliable methods for homology detection are essential, as they have a direct impact on predicted protein annotations. Results: We developed a discriminative, alignment-free algorithm for homology detection with quasi-linear complexity, enabling theoretically much faster homology searches. To reach this goal, we convert the protein sequence into numeric biophysical representations. These are shrunk to a fixed length using a novel vector quantization method which uses a Discrete Cosine Transform compression. We then compute, for each compressed representation, similarity scores between proteins with the Dynamic Time Warping algorithm and we feed them into a Random Forest. The WARP performances are comparable with state of the art methods.
引用
收藏
页码:3118 / 3125
页数:8
相关论文
共 50 条
  • [1] Ultra fast warping window optimization for Dynamic Time Warping
    Tan, Chang Wei
    Herrmann, Matthieu
    Webb, Geoffrey, I
    [J]. 2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 589 - 598
  • [2] A FAST DISCRETE COSINE TRANSFORM
    SINILNIKOV, AM
    [J]. IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOELEKTRONIKA, 1989, 32 (07): : 52 - 55
  • [3] cuDTW plus plus : Ultra-Fast Dynamic Time Warping on CUDA-Enabled GPUs
    Schmidt, Bertil
    Hundt, Christian
    [J]. EURO-PAR 2020: PARALLEL PROCESSING, 2020, 12247 : 597 - 612
  • [4] FAST COMPUTATION OF THE DISCRETE COSINE TRANSFORM AND THE DISCRETE HARTLEY TRANSFORM
    MALVAR, HS
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (10): : 1484 - 1485
  • [5] Improved dynamic time warping based on the discrete wavelet transform
    Barbon, Sylvio, Jr.
    Guido, Rodrigo Capobianco
    Chen, Shi-Huang
    Vieira, Lucimar Sasso
    Sanchez, Fabricio Lopes
    [J]. ISM WORKSHOPS 2007: NINTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA - WORKSHOPS, PROCEEDINGS, 2007, : 256 - +
  • [6] FAST DISCRETE COSINE TRANSFORM PRUNING
    SKODRAS, AN
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (07) : 1833 - 1837
  • [7] PRUNING THE FAST DISCRETE COSINE TRANSFORM
    WANG, ZD
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1991, 39 (05) : 640 - 644
  • [8] FAST ALGORITHMS FOR THE DISCRETE COSINE TRANSFORM
    FEIG, E
    WINOGRAD, S
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (09) : 2174 - 2193
  • [9] FAST COMPUTATION OF THE DISCRETE COSINE TRANSFORM AND THE DISCRETE HARTLEY TRANSFORM - COMMENT
    NAGESHA, V
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (03): : 439 - 440
  • [10] Fast recursive algorithms for short-time discrete cosine transform
    Kober, V
    Cristobal, G
    [J]. ELECTRONICS LETTERS, 1999, 35 (15) : 1236 - 1238