Arid
DOI10.1007/978-3-319-56154-7_44
Obtaining the Most Accurate de novo Transcriptomes for Non-model Organisms: The Case of Castanea sativa
Espigares, Marina1; Seoane, Pedro1; Bautista, Rocio2; Quintana, Julia3; Gomez, Luis3,4; Gonzalo Claros, M.1
通讯作者Gonzalo Claros, M.
会议名称5th International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO)
会议日期APR 26-28, 2017
会议地点Granada, SPAIN
英文摘要

Gene expression analyses of non-model organisms must start with the construction of a high accurate de novo transcriptome as a reference. The best way to determine the suitability of any de novo transcriptome assembling is its comparison with other well-known "reference" transcriptomes. In this study, we took six complete plant transcriptomes (Arabidopsis thaliana, Vitis vinifera, Zea mays, Populus trichocarpa, Triticum aestivum and Oryza sativa) and compared all of them using a series of metrics system for a principal component analysis, resulting that A. thaliana and P. trichocarpa were the best references. This has been automated using AutoFlow. A primary assembly of short reads from Illumina Platform (50 nt, single reads) and long reads from Roche-454 technology from Castanea sativa was performed individually using k-mers from 25 to 35 and different assemblers (Oases v2, SOAPdenovoTrans, RAY, MIRA4 and MINIMUS). The resulting contigs were then reconciled with the aim of obtaining the best transcriptome. Oases and SOAP were used for the assembling of short reads, MIRA and MINIMUS for the assembling of long reads or the reconciliations, and RAY, that can compute de novo transcript assembling from heterogeneous (long and short reads) next-generation sequencing data, was included to avoid the reconciliation step. A total of 90 different assemblies were generated in a single run of the pipeline. A hierarchical clustering on the PCA components (HCPC) was implemented to automatically identify the best assembling strategies based on the shortest distance in HCPC to the two plant reference transcriptomes is selected. In this approach, reconciliation of Roche/ 454 long reads with Illumina contigs produce more complete and accurate gene reconstructions than other combinations. Surprisingly, reconstructions based only on Illumina and the ones creates with RAY seem to be less accurate. For this specific study, the most complete and accurate transcriptome corresponds to the Illumina contigs obtained with SOAPdenovoTrans and reassembled with 454 long reads using MIRA4. This is only a one example of a transcriptome building. Many other assembling can be performed just changing parameters, k-mers, sequencing technology, assemblers, reference organisms, etc. The pipeline in AutoFlow is easily customizable for those purposes.


英文关键词Transcriptome Assembling Workflow PCA Non-model organism
来源出版物BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II
ISSN0302-9743
EISSN1611-3349
出版年2017
卷号10209
页码489-499
ISBN978-3-319-56153-0
EISBN978-3-319-56154-7
出版者SPRINGER INTERNATIONAL PUBLISHING AG
类型Proceedings Paper
语种英语
国家Spain
收录类别CPCI-S
WOS记录号WOS:000426120500044
WOS关键词ASSEMBLER
WOS类目Engineering, Biomedical ; Mathematical & Computational Biology ; Medical Informatics
WOS研究方向Engineering ; Mathematical & Computational Biology ; Medical Informatics
资源类型会议论文
条目标识符http://119.78.100.177/qdio/handle/2XILL650/306445
作者单位1.Univ Malaga, Dept Biol Mol & Bioquim, Malaga, Spain;
2.Univ Malaga, Plataforma Andaluza Bioinformat, Malaga, Spain;
3.Univ Politecn Madrid, ETSI Forestal Montes & Medio Nat, Dept Sistemas & Recursos Nat, E-28040 Madrid, Spain;
4.Univ Politecn Madrid, CBGP UPM INIA, Campus Montegancedo, Madrid 28223, Spain
推荐引用方式
GB/T 7714
Espigares, Marina,Seoane, Pedro,Bautista, Rocio,et al. Obtaining the Most Accurate de novo Transcriptomes for Non-model Organisms: The Case of Castanea sativa[C]:SPRINGER INTERNATIONAL PUBLISHING AG,2017:489-499.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Espigares, Marina]的文章
[Seoane, Pedro]的文章
[Bautista, Rocio]的文章
百度学术
百度学术中相似的文章
[Espigares, Marina]的文章
[Seoane, Pedro]的文章
[Bautista, Rocio]的文章
必应学术
必应学术中相似的文章
[Espigares, Marina]的文章
[Seoane, Pedro]的文章
[Bautista, Rocio]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。