- Title
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.
- Authors
Yin, Changchuan
- Abstract
To apply digital signal processing (DSP) methods to DNA sequences, the sequences must first be mapped into numerical sequences. Effective numerical mappings therefore play a key role in the performance of DSP-based methods such as exon prediction. Despite the numerous mappings of symbolic DNA sequences to numerical series, existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC), in which the numerical values are optimized by simulated annealing to maximize the 3-periodicity signal-to-noise ratio (SNR). The optimized GCC representation is then applied to exon and intron prediction using a Short-Time Fourier Transform (STFT) approach. The results show that the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein-coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
- Subjects
NUCLEOTIDE sequence; GENETIC code; EXONS (Genetics); INTRONS; NUMERICAL analysis
- Publication
Journal of Bioinformatics & Computational Biology, 2015, Vol 13, Issue 2, p-1
- ISSN
0219-7200
- Publication type
Article
- DOI
10.1142/S0219720015500043
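
The abstract compares against the 4D binary representation and measures a 3-periodicity SNR. The sketch below illustrates only that baseline idea, not the GCC mapping, whose numerical values are not given in this record. The function names, the SNR definition (power at frequency index N/3 divided by the mean power over the non-zero frequencies), the rectangular window, and the default window length are all assumptions made for illustration.

```python
import numpy as np

def binary_indicators(seq):
    """4D binary mapping: one 0/1 indicator sequence per nucleotide."""
    seq = seq.upper()
    return {b: np.array([1.0 if c == b else 0.0 for c in seq]) for b in "ACGT"}

def power_spectrum(seq):
    """Sum of squared DFT magnitudes over the four indicator sequences."""
    return sum(np.abs(np.fft.fft(u)) ** 2 for u in binary_indicators(seq).values())

def snr_period3(seq):
    """Assumed SNR: power at k = N/3 over the mean power for k = 1..N-1."""
    n = len(seq) - len(seq) % 3          # trim so N/3 is an integer index
    ps = power_spectrum(seq[:n])
    return ps[n // 3] / ps[1:].mean()

def sliding_period3_power(seq, window=351, step=3):
    """Short-time view: period-3 power in each rectangular window.

    `window` should be a multiple of 3 so that window // 3 is the
    period-3 frequency index.
    """
    starts = range(0, len(seq) - window + 1, step)
    return np.array([power_spectrum(seq[i:i + window])[window // 3] for i in starts])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    random_seq = "".join(rng.choice(list("ACGT"), size=900))
    periodic_seq = "ATG" * 300           # artificial, perfectly 3-periodic
    print("random   SNR:", snr_period3(random_seq))
    print("periodic SNR:", snr_period3(periodic_seq))
```

A perfectly 3-periodic toy sequence gives a large SNR, while a random sequence typically gives a value near 1. Real exon prediction would threshold the sliding-window output along a genomic sequence, which this sketch does not attempt.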