We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
EnGenIUS — ENVIRONMENTAL GENOME INFORMATIONAL UTILITY SYSTEM.
- Authors
KAPLAREVIC, MIHAILO; MURRAY, ALISON E.; CARY, STEPHEN C.; GAO, GUANG R.
- Abstract
Short-insert shotgun sequencing approaches have been applied in recent years to environmental genomic libraries. In the case of complex multispecies microbial communities, there can be many sequence reads that are not incorporated into assemblies, and thus need to be annotated and accessible as single reads. Most existing annotation systems and genome databases accommodate assembled genomes containing contiguous gene-encoding sequences. Thus, a solution is required that can work effectively with environmental genomic annotation information to facilitate data analysis. The Environmental Genome Informational Utility System (EnGenIUS) is a comprehensive environmental genome (metagenome) research toolset that was specifically designed to accommodate the needs of large (> 250 K sequence reads) environmental genome sequencing efforts. The core EnGenIUS modules consist of a set of UNIX scripts and PHP programs used for data preprocessing, an annotation pipeline with accompanying analysis tools, two entity relational databases, and a graphical user interface. The annotation pipeline has a modular structure and can be customized to best fit input data set properties. The integrated entity relational databases store raw data and annotation analysis results. Access to the underlying databases and services is facilitated through a web-based graphical user interface. Users have the ability to browse, upload, download, and analyze preprocessed data, based on diverse search criteria. The EnGenIUS toolset was successfully tested using the Alvinella pompejana epibiont environmental genome data set, which comprises more than 300 K sequence reads. A fully browsable EnGenIUS portal is available at (access code: "guest"). The scope of this paper covers the implementation details and technical aspects of the EnGenIUS toolset.
- Subjects
GENOMES; GENE libraries; DATABASES; BIOINFORMATICS; COMPUTATIONAL biology
- Publication
Journal of Bioinformatics & Computational Biology, 2008, Vol 6, Issue 6, p1193
- ISSN
0219-7200
- Publication type
Article
- DOI
10.1142/S0219720008003850