- Title
Optimized splitting of mixed-species RNA sequencing data.
- Authors
Song, Xuan; Gao, Hai Yun; Herrup, Karl; Hart, Ronald P.
- Abstract
Gene expression studies using xenograft transplants or co-culture systems, usually with mixed human and mouse cells, have proven valuable for uncovering cellular dynamics during development or in disease models. However, the mRNA sequence similarities among species present a challenge for accurate transcript quantification. To identify optimal strategies for analyzing mixed-species RNA sequencing data, we evaluate both alignment-dependent and alignment-independent methods. Alignment of reads to a pooled reference index is effective, particularly if optimal alignments are used to classify sequencing reads by species; the classified reads are then re-aligned to the individual genomes, generating >97% accuracy across a range of species ratios. Alignment-independent methods, such as convolutional neural networks, which extract the conserved sequence patterns of the two species, classify RNA sequencing reads with over 85% accuracy. Importantly, both methods perform well with different ratios of human and mouse reads. While non-alignment strategies successfully partitioned reads by species, the more traditional approach of mixed-genome alignment followed by optimized separation of reads proved more successful, with lower error rates.
- Subjects
RNA sequencing; CONVOLUTIONAL neural networks; ERROR rates; GENE expression
- Publication
Journal of Bioinformatics & Computational Biology, 2022, Vol 20, Issue 2, p1
- ISSN
0219-7200
- Publication type
Article
- DOI
10.1142/S0219720022500019
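
The pooled-reference strategy described in the abstract (align every read to a combined human and mouse index, then assign each read to the species of its best alignment) can be illustrated with a minimal sketch. This is not the authors' implementation: the tuple layout of the alignment records, the `hs_`/`mm_` contig prefixes, and the function names are all hypothetical, and a real pipeline would read scores from aligner output and then re-align each species' reads to its own genome.

```python
from collections import defaultdict

# Hypothetical convention: contigs in the pooled index carry a species prefix.
PREFIXES = (("hs_", "human"), ("mm_", "mouse"))

def species_of(contig):
    """Map a pooled-index contig name to its species via its prefix."""
    for prefix, species in PREFIXES:
        if contig.startswith(prefix):
            return species
    raise ValueError(f"unrecognized contig name: {contig}")

def classify_reads(alignments):
    """Assign each read to the species of its highest-scoring alignment.

    alignments: iterable of (read_id, contig, score) tuples (assumed layout).
    Reads whose best scores tie across species are labeled "ambiguous".
    """
    best = defaultdict(dict)  # read_id -> {species: best score seen}
    for read_id, contig, score in alignments:
        species = species_of(contig)
        if score > best[read_id].get(species, float("-inf")):
            best[read_id][species] = score

    calls = {}
    for read_id, scores in best.items():
        top = max(scores.values())
        winners = [sp for sp, s in scores.items() if s == top]
        calls[read_id] = winners[0] if len(winners) == 1 else "ambiguous"
    return calls

# Toy input: read1 aligns better to human, read2 only to mouse, read3 ties.
alignments = [
    ("read1", "hs_chr1", 98), ("read1", "mm_chr4", 90),
    ("read2", "mm_chr2", 100),
    ("read3", "hs_chr7", 95), ("read3", "mm_chr5", 95),
]
calls = classify_reads(alignments)
```

Labeling ties as ambiguous rather than forcing a call is a design choice of this sketch; how the paper handles such reads is not stated in the abstract.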