- Title
Accurate large-scale phylogeny-aware alignment using BAli-Phy.
- Authors
Gupta, Maya; Zaharias, Paul; Warnow, Tandy
- Abstract
Motivation: BAli-Phy, a popular Bayesian method that co-estimates multiple sequence alignments and phylogenetic trees, is a rigorous statistical method, but due to its computational requirements, it has generally been limited to relatively small datasets (at most about 100 sequences). Here, we repurpose BAli-Phy as a 'phylogeny-aware' alignment method: we estimate the phylogeny from the input of unaligned sequences, and then use that as a fixed tree within BAli-Phy. Results: We show that this approach achieves high accuracy, greatly superior to Prank, the current most popular phylogeny-aware alignment method, and is even more accurate than MAFFT, one of the top performing alignment methods in common use. Furthermore, this approach can be used to align very large datasets (up to 1000 sequences in this study). Availability and implementation: See https://doi.org/10.13012/B2IDB-7863273%5fV1 for datasets used in this study. Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects
SEQUENCE alignment; PARSIMONIOUS models; PHYLOGENY; PRACTICAL jokes; BIOINFORMATICS
- Publication
Bioinformatics, 2021, Vol 37, Issue 24, p4677
- ISSN
1367-4803
- Publication type
Article
- DOI
10.1093/bioinformatics/btab555