We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
ParDRe: faster parallel duplicated reads removal tool for sequencing studies.
- Authors
González-Domínguez, Jorge; Schmidt, Bertil
- Abstract
Current next generation sequencing technologies often generate duplicated or nearduplicated reads that (depending on the application scenario) do not provide any interesting biological information but can increase memory requirements and computational time of downstream analysis. In this work we present ParDRe, a de novo parallel tool to remove duplicated and nearduplicated reads through the clustering of Single-End or Paired-End sequences from fasta or fastq files. It uses a novel bitwise approach to compare the suffixes of DNA strings and employs hybrid MPI/multithreading to reduce runtime on multicore systems. We show that ParDRe is up to 27.29 times faster than Fulcrum (a representative state-of-the-art tool) on a platform with two 8-core Sandy-Bridge processors.
- Subjects
NUCLEOTIDE sequencing; NUCLEOTIDE sequence; PARALLEL processing; THREADS (Computer programs); MULTICORE processors
- Publication
Bioinformatics, 2016, Vol 32, Issue 10, p1562
- ISSN
1367-4803
- Publication type
Article
- DOI
10.1093/bioinformatics/btw038