Population genetics is the branch of genetics that studies the distribution of genetic variation within and between populations and the forces that change allele frequencies over time. The field provides the mathematical foundation for understanding evolution, conservation biology, and the genetic basis of human disease.
Allele and Genotype Frequencies
The fundamental quantities in population genetics are allele frequencies (the proportion of each allele at a locus in a population) and genotype frequencies (the proportion of each genotype). For a diploid locus with two alleles A and a, if p represents the frequency of A and q represents the frequency of a, then p + q = 1 when only two alleles exist. Genotype frequencies are the proportions of AA, Aa, and aa individuals in the population. These frequencies are related but not identical: genotype frequencies can be derived from allele frequencies only under specific assumptions, and one of the central goals of population genetics is to understand when and why observed genotype frequencies deviate from expectations.
The Hardy-Weinberg Principle
The Hardy-Weinberg principle states that allele and genotype frequencies in a population remain constant from generation to generation in the absence of evolutionary forces. For a locus with two alleles, the expected genotype frequencies under Hardy-Weinberg equilibrium are p² for AA, 2pq for Aa, and q² for aa. This equilibrium is reached after one generation of random mating, regardless of the starting genotype frequencies. The principle serves as a null model for population genetics: when observed genotype frequencies differ significantly from Hardy-Weinberg expectations, it indicates that one or more evolutionary forces are operating. The Hardy-Weinberg principle applies under five assumptions: an infinitely large population, random mating, no mutation, no gene flow (migration), and no natural selection. Real populations never meet all assumptions, making the principle a baseline against which evolutionary change is measured.
Natural Selection
Natural selection changes allele frequencies when genotypes differ in fitness, which is the ability to survive and reproduce. The fitness of a genotype is expressed relative to the fittest genotype, with selection coefficients measuring the reduction in fitness. Directional selection favors one allele over another, causing its frequency to increase over time. For a dominant advantageous allele, selection acts efficiently because the allele is expressed in both homozygotes and heterozygotes. For a recessive advantageous allele, selection proceeds more slowly because the allele is hidden from selection in heterozygotes. Balancing selection maintains multiple alleles in the population through mechanisms such as heterozygote advantage (overdominance), where heterozygotes have higher fitness than both homozygotes, as seen with the sickle cell trait conferring malaria resistance. Frequency-dependent selection occurs when the fitness of a genotype depends on its frequency in the population, with rare genotypes sometimes having an advantage.
Genetic Drift
Genetic drift is the random fluctuation of allele frequencies due to finite population size. In small populations, allele frequencies can change dramatically by chance alone, even in the absence of selection. The magnitude of drift is inversely related to population size: in a population of size N, the variance in allele frequency change per generation is pq/2N for a diploid locus, meaning smaller populations experience larger random fluctuations. Drift leads to the loss of genetic diversity over time, with one allele eventually becoming fixed (frequency = 1) and others lost (frequency = 0). The probability that a new neutral mutation becomes fixed by drift alone equals its initial frequency, 1/2N. The effective population size (Ne) is the size of an ideal population that experiences the same rate of drift as the actual population, accounting for factors such as unequal sex ratios, variance in reproductive success, and population bottlenecks.
Gene Flow and Migration
Gene flow, also called migration, is the movement of alleles between populations. Migration introduces new alleles into a population and reduces genetic differentiation between populations. The extent of gene flow is measured by FST, a statistic that quantifies the proportion of genetic variation attributable to differences among populations, ranging from 0 (no differentiation) to 1 (complete differentiation). Low FST values indicate high gene flow, while high values indicate restricted gene flow and greater population structure. Island models and stepping-stone models describe patterns of migration between populations and predict how genetic variation is partitioned across geographic space.
Mutation
Mutation introduces new genetic variation into populations at low rates, typically 10⁻⁸ to 10⁻⁹ per base pair per generation for point mutations. Despite their rarity, mutations are the ultimate source of all genetic variation and are essential for long-term evolution. The mutation-selection balance describes the equilibrium frequency of a deleterious allele determined by the opposing forces of mutation creating new copies and selection removing them. For a recessive deleterious allele, the equilibrium frequency is approximately the square root of the mutation rate divided by the selection coefficient, while for a dominant deleterious allele, it is approximately twice the mutation rate divided by the selection coefficient.
Non-Random Mating
Non-random mating patterns affect genotype frequencies without directly changing allele frequencies. Positive assortative mating, where individuals with similar phenotypes mate preferentially, increases homozygosity. Negative assortative mating (disassortative mating) increases heterozygosity. Inbreeding, mating between relatives, increases the proportion of homozygotes genome-wide and is quantified by the inbreeding coefficient F, which measures the probability that two alleles at a locus are identical by descent. Inbreeding exposes recessive deleterious alleles in homozygotes, leading to inbreeding depression, the reduced fitness observed in offspring of related individuals.
Applications in Human Genetics
Population genetics principles are applied widely in human genetics. Genome-wide association studies (GWAS) use population-based samples to identify genetic variants associated with disease, but population stratification (allele frequency differences among ancestral groups) can produce spurious associations if not properly controlled. Admixture mapping leverages the mixed ancestry of populations such as African Americans to identify disease risk variants. Forensic genetics uses Hardy-Weinberg assumptions to calculate the probability of random matches in DNA profiling. Conservation genetics applies these principles to manage genetic diversity in endangered species, using effective population size estimates and measures of genetic differentiation to guide breeding programs and habitat management.