Arid
DOI10.3390/genes9030123
Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets
Springer, Mark S.1; Gatesy, John2,3
通讯作者Springer, Mark S. ; Gatesy, John
来源期刊GENES
ISSN2073-4425
出版年2018
卷号9期号:3
英文摘要

The Summary coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset-the ’recombination ratchet’-is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d’etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent methods are powerful enough to infer the correct species tree for difficult phylogenetic problems in the anomaly zone, where concatenation is expected to fail because of ILS, then there should be a decreasing probability of inferring the correct species tree using longer loci with many intralocus recombination breakpoints (i.e., increased levels of concatenation).


英文关键词coalescence genes phylogenomics protein-coding sequences recombination breakpoints recombination ratchet
类型Review
语种英语
国家USA
收录类别SCI-E
WOS记录号WOS:000428508800006
WOS关键词ESTIMATING SPECIES TREES ; LAND PLANT ORIGINS ; MOLECULAR SYSTEMATICS ; RECOMBINANT SEQUENCES ; MAMMAL PHYLOGENY ; MOSAIC STRUCTURE ; DNA-SEQUENCES ; SISTER GROUP ; MODEL ; CHARACTER
WOS类目Genetics & Heredity
WOS研究方向Genetics & Heredity
资源类型期刊论文
条目标识符http://119.78.100.177/qdio/handle/2XILL650/209522
作者单位1.Univ Calif Riverside, Dept Evolut Ecol & Organismal Biol, Riverside, CA 92521 USA;
2.Amer Museum Nat Hist, Div Vertebrate Zool, New York, NY 10024 USA;
3.Amer Museum Nat Hist, Sackler Inst Comparat Genom, New York, NY 10024 USA
推荐引用方式
GB/T 7714
Springer, Mark S.,Gatesy, John. Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets[J],2018,9(3).
APA Springer, Mark S.,&Gatesy, John.(2018).Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets.GENES,9(3).
MLA Springer, Mark S.,et al."Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets".GENES 9.3(2018).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Springer, Mark S.]的文章
[Gatesy, John]的文章
百度学术
百度学术中相似的文章
[Springer, Mark S.]的文章
[Gatesy, John]的文章
必应学术
必应学术中相似的文章
[Springer, Mark S.]的文章
[Gatesy, John]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。