- Title
Human genomes as email attachments.
- Authors
Scott Christley; Yiming Lu; Chen Li; Xiaohui Xie
- Abstract
Summary: The amount of genomic sequence data being generated and made available through public databases continues to increase at an ever-expanding rate. Downloading, copying, sharing and manipulating these large datasets are becoming difficult and time consuming for researchers. We need to consider using advanced compression techniques as part of a standard data format for genomic data. The inherent structure of genome data allows for more efficient lossless compression than can be obtained through the use of generic compression programs. We apply a series of techniques to James Watson's genome that in combination reduce it to a mere 4 MB, small enough to be sent as an email attachment. Availability: Our algorithms are implemented in C and are freely available from http://www.ics.uci.edu/~xhx/project/DNAzip. Contact: chenli@ics.uci.edu; xhx@ics.uci.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
- Publication
Bioinformatics, 2009, Vol 25, Issue 2, p274
- ISSN
1367-4803
- Publication type
Academic Journal
- DOI
10.1093/bioinformatics/btn582