Population Genetics and Darwinian Medicine

Sinervo and Pogson © 1998

Selection, Mutation, Drift, and Gene Flow

Selection

Overdominance and Sickle-cell anemia

Darwinian Medicine

Mutation

Genetic drift

Genetic Bottlenecks in Cheetahs and Elephant Seals

Migration or gene flow

Interactions between Selection, Mutation, Drift, and Gene Flow

Migration and selection

Selection and mutation

How efficient is natural selection at eliminating unfavorable mutations?

How rapidly will a favorable mutation spread through a population?

Mutation and Selection in the HIV virus challenged by AZT

Selection and drift

Summary of microevolutionary processes

Selection, Mutation, Drift, and Gene Flow

This lecture is an introduction to the four forces of genetic apocalypse: 1) selection, 2) mutation, 3) genetic drift, and 4) population gene flow. Hardy-Weinberg maintains gene frequencies at an equilibrium in the absence of other forces. The four parameters change gene frequencies and typically perturb Hardy-Weinberg equilibrium. Studying parameters in tandem (push me pull you style) will develop a picture of evolutionary equilibrium between forces.

Components of this lecture are available as a simulation exercise. VERY IMPORTANT: Before running the simulation for each excercise, draw a graph of what you expect to happen (these graphs must be made for you to understand the simulations, trust me).

Selection

The equations used to simulate selection can be readily derived in using algebra very similar to that used for the H-W equilibrium. Change in gene frequency under the force of selection is given by:
(or we can calculate w of each genotype).

wbar is average fitness. d(wbar)/dq is how the derivative of wbar as a function of gene frequency or how rapidly fitness goes up or down with a unit change in gene frequency. p*q is 1/2 the frequency of heterozygotes. Heterozygotes are a large reservoir of genetic variability.

Exercise: Let us first simulate a simple case of overdominance (the heterozygote is the most fit genotype). Before you begin, draw a graph of what you expect to see for gene frequency versus time (generations). To see how selection works in the simulation let us set N=1000, p=0.1 (initial frequency of A), wAA=0.9, wAa = 1.0, and waa=0.9 (We should have both u=0, NGen=100, Npops = 8, isolated populations). What happens? Why? Now set Waa=1.0 (all else remains the same). In this case we have allele "A" recessive to "a" in fitness (or "a" is dominant, genotype "aa" has the highest fitness and its fitness is the same as "Aa", remember A does not always have to be dominant!!!!). Play with strength of selection. How rapidly does the gene frequency become fixed? How do the simulated curves compare to your earlier "intuitive" graph. Why is the trajectory of gene frequency S-shaped? That is why does change in gene frequency slow down when the allele is close to fixation shouldn't it be linear???

Overdominance and Sickle-cell anemia

Overdominance occurs when the heterozygote at a locus has a higher fitness than either homozygote.

One the best known examples of overdominance is sickle cell hemoglobin in humans.

The HbA allele is the normal allele, HbS is the sickle cell allele.

Individuals who are homozygous for the HbA allele are susceptible to malaria in West-central Africa.

Homozygotes for the HbS allele suffer from a severe anemia.

HbA | HbS heterozygotes enjoy resistance to malaria but do not suffer from anemia.

The fitness of the three genotypes is: AS > AA > SS.

The polymorphism persists despite the suffering of AA and SS homozygotes.

Darwinian Medicine

How can we explain the high frequencies of many deleterious mutations. Cystic fibrosis, tay sachs, and a number of other genetic maladys can occur at a very high frequency in a population

Mutation

In natural populations, mutation typically occurs at very low rates, values from 10-3 - 10-4 per gene per generation are fairly typical. The equilibrium gene frequency for q as derived in lecture is given by:

mA->a

qequil = __________

mA->a + mA->a

(where mA->a & mA->a are the forward mutation rates: A -> a, and backward mutation rates a -> A respectively).

Exercise: Things happen really slowly if we were to use realistic mutation rates. So, let's say we have outrageously high mutation rates (mA->a = 10-1 = 0.1 and ma->A = 10-2 = 0.01), p=0.0 (i.e., there are no A alleles around). Unfortunately, the population of field mice you were studying was in Love Canal or really close to Three Mile Island. Set wAA=wAa=waa=1 (the mutations are not deleterious in any way, they are neutral). Notice that the populations come to an equilibrium that is given by the above formula (use a calculator to verify). Now lets start decreasing both mutation rates by an order of magnitude. It just takes longer to reach equilibrium. By itself, mutation is a very weak force. Especially if we consider it relative to selection in which s1, s2 = 1 to 10-2 to very small values (e.g., 10-4 which would be comparable to mutation rates).

Genetic drift

Did you notice how all the simulation lines are wiggly? Even for a relatively large population N=1000, the gene frequencies will drift. The critical concept with genetic drift is effective population size usually termed Ne. Ne is the number of adults that contribute to the "gene pool" each generation. Many factors can alter (always reduce) Ne relative to N the actual number of individuals in a population. For example differences in mate success (some males get more copulation's) lowers Ne. This alteration in effective sex ratio has the following effect on Ne relative to observed number of males and females:

Nm*Nf

Ne = ______

Nm+Nf

Verify for yourself that Ne=N when the sex ratio is equal to 1. Then, calculate Ne for a population of 25 males and 100 females (N=125). Two of the many other factors that reduce Ne include natural selection and non-overlapping generations (offspring perhaps mate with parents, etc.).

Exercise: Change the relative fitness of all three genotypes to 1 (no selection, set both mutation rates to 0, set the initial freq. p = 0.5. Start out with Ne = 1000. Let us decrease Ne by 1/2 on each simulation cycle (e.g., Ne = 500 on the next cycle, and so on ...125, 64, 32, 16). What happens? Is genetic drift a creative force like mutation, or is it a destructive force like selection (i.e.., does it create variation or reduce it)? What inevitably happens to gene frequencies in small populations?

One of the most important effects of genetic drift is that it generates variation among populations. To be sure each population goes to fixation, but they fix for alternative alleles!! This concept will be crucial when we discuss mechanisms of speciation in an upcoming lecture.

Genetic Bottlenecks in Cheetahs and Elephant Seals

Random genetic drift can be very prevalent when populations experience "genetic bottlenecks" or "founder effects". A genetic bottleneck is defined as an extreme reduction in population size that persists over a number of generations which the population eventually recovers. During this period of reduced size, the loss of variation due to random drift will be pronounced. Two of the most extreme examples of known or suspected bottlenecks involve the northern elephant seal (Mirounga angustirostis) and the cheetah (Acinonyx jubatus).

Populations of the East-African and South African subspecies of the cheetah have been surveyed for 52 loci. A sample of 30 individuals from the EAP gave P = 0.04 and H = 0.01. From a sample of 98 individuals from the South African group, P = 0.02, and H = 0.004.

These are extremely low estimates. Even more surprising was the finding that skin grafts among unrelated individuals from the South African subspecies are not rejected. This suggests that the cheetah is monomorphic at its major incompatibility or MHC locus which is abundantly polymorphic in all other mammals.

Further evidence that the cheetah is possesses diminished levels of variation comes from observations that male cheetahs have a high incidence of abnormal spermatozoa and attempts to mate cheetahs in captivity have met with little success.

Furthermore, a recent outbreak of feline infectious peritonitis in the 1980's severely decimated many colonies of animals. It is interesting that the same virus caused only a 1% mortality rate in domestic cats. These observations support the suggestion that cheetahs have lost much of their reservoir of genetic variability and thus may be precariously poised on the brink of extinction.

Unlike the elephant seal there is no direct evidence for a bottleneck in the species' recent history. However, it is known that the cheetah used to have a much larger geographic range. Athough it is now endemic to the African subcontinent, it used to be found throughout Europe and Asia. Apparently, the species has undergone at least two severe bottlenecks resulting in the loss of much of their genetic variation.

Migration or gene flow

The mixing of gene pools from isolated populations with different gene frequencies tends to reduce heterozygosity (Wahlund effect). (e.g., adults oysters are found in estuaries however, their gametes and offspring disperse out of the estuary and contribute to the total population of oysters [all estuaries]). There are several models of gene flow including:

a) continent-island or source-sink: one-way gene flow from a continent which has a large population to a smaller "island" population;

b) island model in which migration occurs among islands (usually all islands have equal probabilities of receiving dispersing individuals and thus the migration rate or gene flow is considered to be equally distributed among all islands);

c) stepping stone model in which each island is more likely to receive migrants from adjacent islands than from islands farther away. The stepping stone model in this program is one in which islands form a ring, and each island receives migrants from adjacent islands only.

Migration homogenizes gene frequencies among populations. We can't simulate the process of homogenization with this program because we can't start out with different gene frequencies in our replicate populations. However, as you will see later, genetic drift erodes variation (loose alleles from the population). Gene flow tends to limit erosion that is caused by genetic drift because it homogenizes the populations that are drifting apart in gene frequency. The effect of gene flow is most interesting when it is considered in combination with other forces:

Interactions between Selection, Mutation, Drift, and Gene Flow

Migration and selection

Exercise: Let's simulate the process of local adaptation. Assume that the allele "a" is very beneficial on an island such that wAA=0.8, wAa = 0.8, and waa=1 (allele a is a beneficial recessive allele on the island). However, on the mainland, the genotype frequency is fixed for all AA (Only AA survive there: that means that you set the value for p=1.0 on the mainland, top right slider). Migration from the mainland to each island is a modest m = 0.1 (remember to set both u=0, Ne=1000, NGen=100, Npops = 8). Start with 8 isolated populations. Change it to source-sink model. What happens to gene frequency?

Selection and mutation

There are two interesting cases that reveal how mutation is an important force in evolution. Both cases involve selection. The first case involves mutation-selection balance in which deleterious recessives are under chronic selection. The rate at which deleterious mutations arise (forward mutation) is always generally higher than the rate at which they might mutate back to a functional gene. Presumably, chronic selection should eliminate such alleles. In this sense selection is purifying in that it tends to eliminate deleterious mutations.

How efficient is natural selection at eliminating unfavorable mutations?

Exercise: Run the simulation program with: N=1000, p=0.5 (initial frequency of A), wAA=1.0, wAa = 1.0, and waa=0.5 (We should have ua->A = 10-3, NGen=100, Ne=1000, Npops = 8, isolated populations). Why isn't the "a" allele eliminated from the population? Test the hypothesis that it is mutation that prevents rapid fixation of the "a" allele -- set the mutation rate to zero. Does allele then disappear from the population rapidly? This is an important concept and is directly related to questions asked at the end of section 1) on selection. Incidentally, the equilibrium gene frequency for mutation selection balance of a deleterious recessive allele is given by:

qequil = How does this formula compare with the simulation?

A partial answer to the above question (why isn't allele "a" eliminated?) is given by changing wAa = 0.9. This makes allele a partially dominant. The equilibrium gene frequency for a partially dominant allele is given by:

qequil = m/(s*h)

where s is given by the selection against aa genotype (e.g., relative fitness of aa is 1-s1 relative to 1 for the AA genotype), and h is given by (1-h) which corresponds to our 1-s2 (or the degree of dominance in fitness). This reflects the fact that possession of one copy of allele "a" has a small effect on fitness of Aa genotype and is thus partially dominant. What is the difference between a deleterious recessive (complete recessive) and a partially dominant one in terms of evolutionary dynamics? Play around with the degree of dominance. Again, test the hypothesis that it was mutation that prevents rapid fixation of the "a" allele -- set the mutation rate to zero. Does allele "a" disappear from the population rapidly when it has an effect in both homozygotes and heterozygotes? Compare this result with the test for mutation above.

How rapidly will a favorable mutation spread through a population?

The second relevant case of mutation-selection deals with the spread of novel beneficial alleles in populations. Just how long does it take for an allele to sweep through a population?

Exercise: Run the simulation starting with the following parameters wAA=0.5, wAa = 0.5, and waa=1.0 (allele "A" is the wild type allele relative to the mutation "a" that has twice the fitness). Set p = 1.0 (remember to set ua->A = 0 and uA->a = 0.0001 (or 10-4), NGen=1000, Npops = 8, isolated populations). We can only realistically simulate a population size of Ne=1000, not an infinite population. For this population allele "a" will only arise every 10 generations on average (why is this so?). Why does it take so long for "a" to make it into the population (hint: it has something to do with another evolutionary parameter as well as the effects discussed at the end of Section 1) ) despite it being so beneficial? It also has to do with the following simulation results. What happens if we change wAa = 0.6 (allele a now has partially dominant effect on fitness)?

Mutation and Selection in the HIV virus challenged by AZT

The WHO estimates that 23 million people were infected with the HIV virus in 1996.

The HIV virus is a retrovirus (RNA virus) responsible for indirectly producing a disease known as AIDS.

Briefly, the virus accomplishes this by infecting a key player in the vertebrate immune system called helper T cells.

Helper T cells are the critical link in our immune system.
They are responsible for the so-called "cell-mediated response" by stimulating killer T cells that recognize and kill infected T cells.
They also mediate the so-called "humoral response" by stimulating B cells to secrete antibodies that bind free virus particles allowing them to be destroyed by macrophages.

By infecting helper T cells, our body's own immune system responds and ends up destroying our T cell population over time.

The compromised immune system is what eventually results in the disease called AIDS.
AIDS is largely caused by fungal and bacterial pathogens that are normally easily handled by the immune system.
This happens over a long period of time - usually 10 years or so.

A lot of excitement was generated about 10 years ago by an AIDS drug called AZT (azidothymidine).

AZT is a base analog, it is incorporated into growing DNA strands in the place of T.
If AZT is incorporated into a growing strand, DNA replication is halted - new bases cannot be added past this point.
Thus, AZT works by poisoning the replication of the virus - it also isn't too good for the host either which is why treatment is so debilitating.
AZT is effective against the HIV virus for a period of 1-2 years and then has little or no effect.

Why does AZT loose its potency? Because of the rapid evolutionary response of the HIV virus to a very strong selective agent - AZT.

What happens is that HIV virus evolves reverse transcriptases that have a dramatically lower affinity for the base analog AZT.
These new strains of the virus have a much greater fitness than the previous strains.
Natural selection favors the propagation of AZT-resistant viruses so that over time, the population of viruses is entirely composed of this new strain and AZT ceases to become effective.

How do we know this?

because we can isolate and sequence the reverse transcriptase gene in newly-infected patients and later once AZT has ceased to be effective and find mutations in the active site of the enzyme.
natural selection favoring AZT-resistant viruses has occurred repeatedly in virtually every patient treated with AZT.

How does the HIV virus achieve such a dramatic evolutionary response?

because of its incredibly high mutation rate - some 106 times higher than humans.
because of its short generation time relative to ours - the ten year incubation period of the virus represents about 3,000 virus generations.

Coupled with the high mutation rate, we can estimate that the HIV virus can evolve over a ten year period to the same extent that humans can over a period of a million years!

Conclusion: no single drug can halt HIV infection - multiple drugs can, though.

Selection and drift

Part of the answer to the last set of questions involved drift. Even for a relatively large population (N = 1000), drift may keep allele "a" at a very low frequency despite fairly strong selection favoring allele "a".

Exercise: Let us see how Ne effects the probability of fixation of a beneficial allele. Run the simulation starting with the following parameters wAA=1.0, wAa = 0.9, and waa=0.9. The initial frequency of pA = 0.3 (remember to set uA->a = 0 and ua->A = 0, NGen=1000, Ne=1000, Npops = 8, isolated populations). How fast does allele "A" fix in the population? What happens if we start decreasing Ne? Reduce Ne to 400, 80, and 40. With Ne set to 40, run the simulation several times (get at least 8 populations X 10 replicate runs? How many times does allele "A" get fixed? How many times does allele "a" get fixed?. What does this tell you about the likelihood of a moderately beneficial allele becoming fixed? What happens if we change the degree of dominance.

Summary of microevolutionary processes.

Occurs where? Magnitude determined by Relative Effect in a small population Relative Effect in a large population

Selection within w very strong very strong

Mutation within u weak weak

Random Drift within Ne moderate weak

Gene Flow among m strong strong

Back To the Syllabus

	Occurs where?	Magnitude determined by	Relative Effect in a small population	Relative Effect in a large population
Selection	within	w	very strong	very strong
Mutation	within	u	weak	weak
Random Drift	within	Ne	moderate	weak
Gene Flow	among	m	strong	strong