- Title
Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping.
- Authors
Raimondi, Daniele; Orlando, Gabriele; Moreau, Yves; Vranken, Wim F
- Abstract
Motivation: Evolutionary information is crucial for the annotation of proteins in bioinformatics. The number of retrieved homologs often correlates with the quality of predicted protein annotations related to structure or function. With a growing number of sequences available, fast and reliable methods for homology detection are essential, as they have a direct impact on predicted protein annotations. Results: We developed a discriminative, alignment-free algorithm for homology detection with quasi-linear complexity, enabling theoretically much faster homology searches. To reach this goal, we convert the protein sequence into numeric biophysical representations. These are shrunk to a fixed length using a novel vector quantization method based on Discrete Cosine Transform compression. We then compute, for each compressed representation, similarity scores between proteins with the Dynamic Time Warping algorithm and feed them into a Random Forest. The performance of WARP is comparable to that of state-of-the-art methods. Availability and implementation: The method is available at http://ibsquare.be/warp. Supplementary information: Supplementary data are available at Bioinformatics online.
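The pipeline summarized in the abstract (numeric encoding, fixed-length Discrete Cosine Transform compression, Dynamic Time Warping scoring) can be illustrated with a minimal sketch. This is not the WARP implementation. The Kyte-Doolittle hydropathy scale, the function names `encode`, `dct_compress` and `dtw_distance`, the compressed length of 8 and the absolute-difference cost are all assumptions made for illustration. The final Random Forest step is omitted.

```python
import math

# Assumption: Kyte-Doolittle hydropathy stands in for the "biophysical
# representations" mentioned in the abstract; the paper may use other properties.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode(sequence):
    """Map each residue to its hydropathy value (unknown residues become 0.0)."""
    return [HYDROPATHY.get(aa, 0.0) for aa in sequence.upper()]


def dct_compress(signal, size):
    """Resample a signal to `size` points through a truncated DCT.

    The first `size` DCT-II coefficients are kept and inverted on a grid of
    `size` points. This naive version costs O(len(signal) * size); an
    FFT-based DCT would be needed for quasi-linear behaviour.
    """
    n = len(signal)
    if n == 0 or size <= 0:
        raise ValueError("signal and size must be non-empty and positive")
    keep = min(size, n)
    # Forward DCT-II, low-frequency coefficients only.
    coeffs = [
        sum(x * math.cos(math.pi / n * (i + 0.5) * k) for i, x in enumerate(signal))
        for k in range(keep)
    ]
    # Inverse transform on the shorter grid, scaled by 1/n to preserve amplitude.
    return [
        (coeffs[0] + 2.0 * sum(
            coeffs[k] * math.cos(math.pi / size * (j + 0.5) * k)
            for k in range(1, keep)
        )) / n
        for j in range(size)
    ]


def dtw_distance(a, b):
    """Classic dynamic-programming DTW with an absolute-difference local cost."""
    inf = float("inf")
    prev = [0.0] + [inf] * len(b)
    for x in a:
        curr = [inf] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            curr[j] = abs(x - y) + min(prev[j], prev[j - 1], curr[j - 1])
        prev = curr
    return prev[len(b)]


if __name__ == "__main__":
    # Two made-up sequences of different lengths, for illustration only.
    first = dct_compress(encode("MKTAYIAKQRQISFVKSHFSRQ"), 8)
    second = dct_compress(encode("MKTAYIAKQRQISFVK"), 8)
    print(dtw_distance(first, second))
```

Because every protein is reduced to the same small number of points, the quadratic cost of Dynamic Time Warping becomes a constant per pair, which is consistent with the complexity claim in the abstract. In the published method, scores of this kind are computed per representation and passed to a Random Forest classifier.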
- Subjects
HOMOLOGY (Biochemistry); DISCRETE cosine transforms; AMINO acid sequence; VECTOR quantization; BIOINFORMATICS
- Publication
Bioinformatics, 2018, Vol 34, Issue 18, p3118
- ISSN
1367-4803
- Publication type
Article
- DOI
10.1093/bioinformatics/bty309