MBE Advance Access originally published online on March 2, 2005
Molecular Biology and Evolution 2005 22(5):1337-1344; doi:10.1093/molbev/msi121
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
More Genes or More Taxa? The Relative Contribution of Gene Number and Taxon Number to Phylogenetic Accuracy
Howard Hughes Medical Institute and Laboratory of Molecular Biology, University of WisconsinMadison
E-mail: sbcarrol{at}wisc.edu.
The relative contribution of taxon number and gene number to accuracy in phylogenetic inference is a major issue in phylogenetics and of central importance to the choice of experimental strategies for the successful reconstruction of a broad sketch of the tree of life. Maximization of the number of taxa sampled is the strategy favored by most phylogeneticists, although its necessity remains the subject of debate. Vast increases in gene number are now possible due to advances in genomics, but large numbers of genes will be available for only modest numbers of taxa, raising the question of whether such genome-scale phylogenies will be robust to the addition of taxa. To examine the relative benefit of increasing taxon number or gene number to phylogenetic accuracy, we have developed an assay that utilizes the symmetric difference tree distance as a measure of phylogenetic accuracy. We have applied this assay to a genome-scale data matrix containing 106 genes from 14 yeast species. Our results show that increasing taxon number correlates with a slight decrease in phylogenetic accuracy. In contrast, increasing gene number has a significant positive effect on phylogenetic accuracy. Analyses of an additional taxon-rich data matrix from the same yeast clade show that taxon number does not have a significant effect on phylogenetic accuracy. The positive effect of gene number and the lack of effect of taxon number on phylogenetic accuracy are also corroborated by analyses of two data matrices from mammals and angiosperm plants, respectively. We conclude that, for typical data sets, the number of genes utilized may be a more important determinant of phylogenetic accuracy than taxon number.
Key Words: phylogenetics taxon number gene number phylogenetic accuracy tree of life genomics
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. B. Prasad, M. W. Allard, NISC Comparative Sequencing Program, and E. D. Green Confirming the Phylogeny of Mammals by Use of Large Comparative Sequence Data Sets Mol. Biol. Evol., September 1, 2008; 25(9): 1795 - 1808. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Peterson Phylogenetic analysis of Aspergillus species using DNA sequences from four loci Mycologia, March 1, 2008; 100(2): 205 - 226. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Salipante, J. M. Thompson, and M. S. Horwitz Phylogenetic Fate Mapping: Theoretical and Experimental Studies Applied to the Development of Mouse Fibroblasts Genetics, February 1, 2008; 178(2): 967 - 977. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Chandra and D. R. Huff Salmacisia, a new genus of Tilletiales: reclassification of Tilletia buchloeana causing induced hermaphroditism in buffalograss. Mycologia, January 1, 2008; 100(1): 81 - 93. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. M. Hallstrom, M. Kullberg, M. A. Nilsson, and A. Janke Phylogenomic Data Analyses Provide Evidence that Xenarthra and Afrotheria Are Sister Groups Mol. Biol. Evol., September 1, 2007; 24(9): 2059 - 2068. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Huttley, M. J. Wakefield, and S. Easteal Rates of Genome Evolution and Branching Order from Whole Genome Analysis Mol. Biol. Evol., August 1, 2007; 24(8): 1722 - 1730. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. B. Rogozin, Y. I. Wolf, L. Carmel, and E. V. Koonin Ecdysozoan Clade Rejected by Genome-Wide Analysis of Rare Amino Acid Replacements Mol. Biol. Evol., April 1, 2007; 24(4): 1080 - 1090. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Baurain, H. Brinkmann, and H. Philippe Lack of Resolution in the Animal Phylogeny: Closely Spaced Cladogeneses or Undetected Systematic Errors? Mol. Biol. Evol., January 1, 2007; 24(1): 6 - 9. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Blanquart and N. Lartillot A Bayesian Compound Stochastic Process for Modeling Nonstationary and Nonhomogeneous Sequence Evolution Mol. Biol. Evol., November 1, 2006; 23(11): 2058 - 2071. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Bateman, J. Hilton, and P. J. Rudall Morphological and molecular phylogenetic context of the angiosperms: contrasting the 'top-down' and 'bottom-up' approaches used to infer the likely characteristics of the first flowers J. Exp. Bot., October 1, 2006; 57(13): 3471 - 3503. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Kellogg Progress and challenges in studies of the evolution of development J. Exp. Bot., October 1, 2006; 57(13): 3505 - 3516. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Chiu, E. K. Lee, M. G. Egan, I. N. Sarkar, G. M. Coruzzi, and R. DeSalle OrthologID: automation of genome-scale ortholog identification within a parsimony framework Bioinformatics, March 15, 2006; 22(6): 699 - 707. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Rokas, D. Kruger, and S. B. Carroll Animal Evolution and the Molecular Signature of Radiations Compressed in Time Science, December 23, 2005; 310(5756): 1933 - 1938. [Abstract] [Full Text] [PDF] |
||||





