Phylogenomics: Genome-Scale Phylogenetics

Overview

Phylogenomics extends traditional phylogenetics by analyzing data from hundreds or thousands of genes across the genome simultaneously. By leveraging genome-scale information, phylogenomics overcomes the limitations of single-gene phylogenies, which often suffer from insufficient signal, lineage-specific evolutionary rate variation, and stochastic error. The increased data volume dramatically improves statistical power, enabling resolution of even the most challenging and rapid evolutionary radiations. Phylogenomics also exposes incongruences between gene trees and species trees caused by incomplete lineage sorting, gene duplication, and horizontal gene transfer.

Key Concepts

A central challenge is gene tree–species tree reconciliation. Because individual genes can have evolutionary histories that differ from the species tree, phylogenomic methods must account for processes such as incomplete lineage sorting (modeled by the multispecies coalescent) and gene duplication and loss. Concatenation approaches align all genes into a supermatrix, while coalescent-based methods analyze each gene independently and then summarize the results. Orthology assignment — distinguishing orthologs from paralogs — is a critical preprocessing step that relies on accurate genome annotation.

Applications

Phylogenomics has resolved long-standing debates in deep metazoan phylogeny, plant evolution, and microbial systematics. It is essential for studying adaptive evolution through the identification of positively selected genes across lineages. The field depends on next-generation sequencing to generate the required genome-wide data, building on classical DNA sequencing approaches. Phylogenomic analyses of bacterial genomes have reshaped our understanding of bacterial genetics, revealing extensive horizontal gene transfer and the dynamic nature of prokaryotic genomes.