Arid
DOI10.1186/1471-2164-15-S5-S6
A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data
Sze, Sing-Hoi1,2; Tarone, Aaron M.3
通讯作者Sze, Sing-Hoi
来源期刊BMC GENOMICS
ISSN1471-2164
出版年2014
卷号15
英文摘要

Background: The recent advance of high-throughput sequencing makes it feasible to study entire transcriptomes through the application of de novo sequence assembly algorithms. While a popular strategy is to first construct an intermediate de Bruijn graph structure to represent the transcriptome, an additional step is needed to construct predicted transcripts from the graph.


Results: Since the de Bruijn graph contains all branching possibilities, we develop a memory-efficient algorithm to recover alternative splicing information and library-specific expression information directly from the graph without prior genomic knowledge. We implement the algorithm as a postprocessing module of the Velvet assembler. We validate our algorithm by simulating the transcriptome assembly of Drosophila using its known genome, and by performing Drosophila transcriptome assembly using publicly available RNA-Seq libraries. Under a range of conditions, our algorithm recovers sequences and alternative splicing junctions with higher specificity than Oases or Trans-ABySS.


Conclusions: Since our postprocessing algorithm does not consume as much memory as Velvet and is less memory-intensive than Oases, it allows biologists to assemble large libraries with limited computational resources. Our algorithm has been applied to perform transcriptome assembly of the non-model blow fly Lucilia sericata that was reported in a previous article, which shows that the assembly is of high quality and it facilitates comparison of the Lucilia sericata transcriptome to Drosophila and two mosquitoes, prediction and experimental validation of alternative splicing, investigation of differential expression among various developmental stages, and identification of transposable elements.


类型Article
语种英语
国家USA
收录类别SCI-E
WOS记录号WOS:000345682800006
WOS类目Biotechnology & Applied Microbiology ; Genetics & Heredity
WOS研究方向Biotechnology & Applied Microbiology ; Genetics & Heredity
资源类型期刊论文
条目标识符http://119.78.100.177/qdio/handle/2XILL650/181165
作者单位1.Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA;
2.Texas A&M Univ, Dept Biochem & Biophys, College Stn, TX 77843 USA;
3.Texas A&M Univ, Dept Entomol, College Stn, TX 77843 USA
推荐引用方式
GB/T 7714
Sze, Sing-Hoi,Tarone, Aaron M.. A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data[J],2014,15.
APA Sze, Sing-Hoi,&Tarone, Aaron M..(2014).A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data.BMC GENOMICS,15.
MLA Sze, Sing-Hoi,et al."A memory-efficient algorithm to obtain splicing graphs and de novo expression estimates from de Bruijn graphs of RNA-Seq data".BMC GENOMICS 15(2014).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Sze, Sing-Hoi]的文章
[Tarone, Aaron M.]的文章
百度学术
百度学术中相似的文章
[Sze, Sing-Hoi]的文章
[Tarone, Aaron M.]的文章
必应学术
必应学术中相似的文章
[Sze, Sing-Hoi]的文章
[Tarone, Aaron M.]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。