Knowledge Resource Center for Ecological Environment in Arid Area
DOI | 10.1093/bioinformatics/btt127 |
BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences | |
Bao, Ergude1; Jiang, Tao1; Girke, Thomas2 | |
通讯作者 | Girke, Thomas |
来源期刊 | BIOINFORMATICS
![]() |
ISSN | 1367-4803 |
EISSN | 1460-2059 |
出版年 | 2013 |
卷号 | 29期号:10页码:1250-1259 |
英文摘要 | Motivation: De novo transcriptome assemblies of RNA-Seq data are important for genomics applications of unsequenced organisms. Owing to the complexity and often incomplete representation of transcripts in sequencing libraries, the assembly of high-quality transcriptomes can be challenging. However, with the rapidly growing number of sequenced genomes, it is now feasible to improve RNA-Seq assemblies by guiding them with genomic sequences. Results: This study introduces BRANCH, an algorithm designed for improving de novo transcriptome assemblies by using genomic information that can be partial or complete genome sequences from the same or a related organism. Its input includes assembled RNA reads (transfrags), genomic sequences (e.g. contigs) and the RNA reads themselves. It uses a customized version of BLAT to align the transfrags and RNA reads to the genomic sequences. After identifying exons from the alignments, it defines a directed acyclic graph and maps the transfrags to paths on the graph. It then joins and extends the transfrags by applying an algorithm that solves a combinatorial optimization problem, called the Minimum weight Minimum Path Cover with given Paths. In performance tests on real data from Caenorhabditis elegans and Saccharomyces cerevisiae, assisted by genomic contigs from the same species, BRANCH improved the sensitivity and precision of transfrags generated by Velvet/Oases or Trinity by 5.1-56.7% and 0.3-10.5%, respectively. These improvements added 3.8-74.1% complete transcripts and 8.3-3.8% proteins to the initial assembly. Similar improvements were achieved when guiding the BRANCH processing of a transcriptome assembly from a more complex organism (mouse) with genomic sequences from a related species (rat). |
类型 | Article |
语种 | 英语 |
国家 | USA |
收录类别 | SCI-E |
WOS记录号 | WOS:000319428200002 |
WOS关键词 | SPLICE JUNCTIONS ; TRANSCRIPTOME ; ALIGNMENT ; ALGORITHMS ; REVEALS ; IDBA |
WOS类目 | Biochemical Research Methods ; Biotechnology & Applied Microbiology ; Computer Science, Interdisciplinary Applications ; Mathematical & Computational Biology ; Statistics & Probability |
WOS研究方向 | Biochemistry & Molecular Biology ; Biotechnology & Applied Microbiology ; Computer Science ; Mathematical & Computational Biology ; Mathematics |
资源类型 | 期刊论文 |
条目标识符 | http://119.78.100.177/qdio/handle/2XILL650/176144 |
作者单位 | 1.Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA; 2.Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA |
推荐引用方式 GB/T 7714 | Bao, Ergude,Jiang, Tao,Girke, Thomas. BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences[J],2013,29(10):1250-1259. |
APA | Bao, Ergude,Jiang, Tao,&Girke, Thomas.(2013).BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences.BIOINFORMATICS,29(10),1250-1259. |
MLA | Bao, Ergude,et al."BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences".BIOINFORMATICS 29.10(2013):1250-1259. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。