Cultural Evolution

The Evolution of Cooperation

From social dilemmas to the five mechanisms that sustain cooperation

Dynamics of Living Systems — Theoretical Biology Course

A Brief History

1964

William D. Hamilton — Kin selection

Hamilton's landmark papers on inclusive fitness theory showed that natural selection can favour costly helping toward genetic relatives. The key insight: genes that code for altruism toward relatives can spread if the benefit to the recipient, weighted by relatedness, exceeds the cost — even though the act is costly to the individual actor. This resolved a longstanding puzzle of how eusocial insects evolved.

1971

Robert Trivers — Reciprocal altruism

Trivers proposed that cooperation can evolve even among non-relatives if individuals interact repeatedly and have reliable ways to remember and punish cheaters. "You scratch my back, I'll scratch yours" — but only if defectors are repaid in kind. This opened the door to studying cooperation in large, anonymous societies.

1973

Maynard Smith & Price — ESS and cooperation

With the Evolutionarily Stable Strategy framework from Population Genetics, researchers could now ask: which cooperation strategies are evolutionarily robust? This provided the mathematical language to rigorously analyse whether a cooperative strategy can resist invasion by mutant defectors.

1981

Axelrod & Hamilton — Tit-for-Tat and tournaments

Axelrod ran a celebrated computer tournament in which simple strategies competed in the iterated Prisoner's Dilemma. The winner was Tit-for-Tat: cooperate on the first move, then copy the opponent's last move. This elegant strategy outperformed more complex approaches and showed that reciprocal cooperation could be stable and robust.

1992

Nowak & May — Spatial games

By embedding games on spatial lattices and allowing only local interactions, Nowak and May showed that cooperation can evolve through a purely spatial mechanism: clusters of cooperators are protected from invasion. This demonstrated that network structure itself can promote cooperation without invoking kin or memory of past interactions.

1998

Nowak & Sigmund — Indirect reciprocity

Nowak and Sigmund showed that cooperation can be sustained through reputation: if individuals know one another's past actions (or hear about them through gossip), they can preferentially help those with good reputations. This mechanism requires only that information spreads, not that individuals directly interact repeatedly. It underlies human morality and economics.

2006

Martin Nowak — Five rules for cooperation

In a landmark Science review, Nowak unified decades of work by identifying exactly five mechanisms under which cooperation can evolve: (1) kin selection, (2) direct reciprocity, (3) indirect reciprocity, (4) network reciprocity, and (5) group selection. These cover all major pathways found in nature and theory.

2010

Gokhale & Traulsen — Multiplayer cooperation

Extension of cooperation theory beyond pairwise games to public goods and multiplayer interactions, showing that cooperation thresholds and nonlinear payoffs can fundamentally change the dynamics and stability of cooperative strategies.

The Problem of Cooperation

Cooperation is one of evolution's greatest puzzles. When two individuals interact, the fitness payoff to helping the other (paying a cost $c$ to deliver a benefit $b$, where $b > c$) is always lower than the payoff to defecting. Under individual-level selection, natural selection should favour the defector, driving cooperation to extinction. Yet cooperation is ubiquitous in nature: humans cooperate with kin and strangers, cells work together in multicellular bodies, bacteria coordinate virulence, and animals engage in reciprocal altruism and mutualism. How is this possible?

The canonical framework for studying this paradox is the Prisoner's Dilemma, introduced in Population Genetics. Recall the payoff matrix with Cooperate ($C$) and Defect ($D$):

	C	D
C	$b - c$	$-c$
D	$b$	$0$

Prisoner's Dilemma payoff matrix

Defection is a dominant strategy: $b > b-c$ and $0 > -c$, so $D$ always yields a higher payoff. Yet the social optimum is mutual cooperation ($b-c$ each), which beats mutual defection ($0$ each). This tension between individual incentives and collective welfare is the core of the cooperation problem.

The tragedy of the commons (Hardin 1968) is a more general formulation: when individuals harvest from a shared resource, each pays the full cost of restraint but the benefit is shared. This leads to overexploitation even when everyone would be better off if all exercised restraint. Cooperation requires solving a social dilemma.

Definition — Cooperation

Cooperation is behaviour in which an individual pays a cost $c$ to deliver a benefit $b$ to another individual, where $b > c > 0$. The actor reduces its own fitness; the recipient gains. Cooperation is individually costly but collectively beneficial.

The central question of this chapter is: what evolutionary mechanisms can sustain cooperation despite the temptation to defect? The answer comes from five key mechanisms, each of which changes the fitness consequences of helping. We now trace the intellectual history that revealed them.

Kin Selection and Hamilton's Rule

The simplest solution to the cooperation puzzle is that helpers direct their costly behaviour toward relatives. An allele that codes for helping siblings or cousins can spread even if it is personally costly, because the relatives share copies of the same allele. The beneficiaries carry copies of the "helping" allele; when it increases their fitness, the allele itself reproduces.

Hamilton formalised this insight with inclusive fitness theory. An individual's fitness includes not only their own reproduction but also the reproduction of genetic relatives, weighted by the probability that they share the genes in question. The classic formulation is

Hamilton's Rule

A costly helping behaviour will be favoured by natural selection if and only if $$r b > c$$ where $r$ is the relatedness between actor and recipient, $b$ is the benefit to the recipient, and $c$ is the cost to the actor. Relatedness $r$ is the probability that a random allele in the recipient is identical by descent (shared ancestry) to an allele in the actor.

For example, in diploid organisms: $r = 1/2$ for full siblings (they share, on average, 50% of genes), $r = 1/4$ for half-siblings or aunts/uncles, and $r = 0$ for unrelated individuals. Hamilton's rule explains why alarm calls evolve (costly to the caller but saves relatives), why parents invest heavily in offspring, and why eusocial insects sacrifice reproduction to care for the queen's brood.

However, Hamilton's rule has limits. Nowak, Tarnita, and Wilson (2010) challenged the explanatory power of inclusive fitness, arguing that it is mathematically equivalent to standard population genetics but sometimes provides less insight. Their critique sparked vigorous debate, but Hamilton's rule remains a central conceptual tool: it clearly shows how genetic relatedness can overcome the tragedy of the commons.

Direct Reciprocity

Trivers (1971) proposed an alternative: cooperation can evolve among non-relatives if individuals interact repeatedly with one another and can identify and punish cheaters. If the probability of a future interaction is high enough, the incentive to cooperate today (building a good reputation with a partner) can outweigh the temptation to defect.

Consider the iterated Prisoner's Dilemma: two players repeat the game for many rounds. At each round they choose $C$ or $D$, receiving the same payoffs as in the one-shot game, but the game continues with probability $w$ (the "shadow of the future"). Over many rounds, the total payoff accumulates, and strategies that damage long-term relationships do worse.

Axelrod (1984) famously ran a computer tournament where experts submitted strategies. The winner was Tit-for-Tat (TFT): cooperate on round 1, then copy the opponent's previous move. TFT is robust because it is nice (never defects first), retaliatory (punishes defection with one defection), forgiving (returns to cooperation if the opponent does), and clear (easy to understand and predict). Other robust strategies include Win-Stay Lose-Shift (WSLS, Nowak & Sigmund 1993): repeat your move if you did well, switch if you did poorly.

For a given payoff matrix and continuation probability $w$, cooperation is evolutionarily stable if the expected future gain from cooperation exceeds the current temptation to defect:

Condition for Direct Reciprocity

Cooperators can resist invasion by defectors if the probability of future interaction is sufficiently high: $$w > \frac{c}{b}$$ where $c$ is the cost of helping and $b$ is the benefit received (in the iterated game context). The higher the ratio of cost to benefit, the more future interaction is needed to sustain cooperation.

A limitation: direct reciprocity requires that individuals recognize partners, remember past interactions, and interact repeatedly. In large, fluid populations where individuals rarely meet the same partner twice, this mechanism is weaker.

Indirect Reciprocity and Reputation

Nowak and Sigmund (1998, 2005) identified a third mechanism: indirect reciprocity. Even if you never meet a partner twice, cooperation can evolve if information about who helped whom is publicly available. The logic is: "I help you, and someone (possibly someone else) helps me."

In a population where helping reputation is tracked and observed, individuals with a history of helping receive help from others in return — not from their direct partners, but from the community. This requires three elements: (1) visibility (people observe or hear about helping acts), (2) memory (reputation is remembered), and (3) discrimination (helpers preferentially help those with good reputations — high "image score").

The standing strategy of Nowak and Sigmund works as follows: help if the recipient has a good standing (previously helped others), and stay in good standing by helping those with good standing. Defectors lose their standing and receive no future help.

Condition for Indirect Reciprocity

Cooperation via reputation can be stable if the probability of knowing someone's reputation is sufficiently high: $$q > \frac{c}{b}$$ where $q$ is the probability that an individual knows the recipient's history of helping before deciding whether to help. Similar to direct reciprocity, but here the mechanism is reputation, not repeated play.

Indirect reciprocity is fundamental to human morality and economics. Reputation systems (from village gossip to credit scores to online reviews) let large, anonymous populations sustain cooperation. It explains why we care about appearing generous, honest, and fair — our reputation is worth more than any single transaction.

Group Selection and Multilevel Selection

A fourth mechanism operates at a higher level of organization. If populations are structured into groups, and groups differ in their frequency of cooperators, then groups with more cooperators may produce more offspring (colonize more widely, outcompete other groups). This group selection can favour cooperation even when, within groups, defection pays more than cooperation.

Group selection has a controversial history in evolutionary biology, but modern formulations via the Price equation and multilevel selection (e.g., Traulsen & Nowak 2006) provide rigorous foundations. The Price equation decomposes evolutionary change into between-group and within-group components:

$$\bar{w}\,\Delta \bar{z} = \underbrace{\text{Cov}(w_g,\, z_g)}_{\text{between-group}} + \underbrace{\mathbb{E}[w_g\,\Delta z_g]}_{\text{within-group}}$$

where $z_g$ is the mean cooperator frequency in group $g$, $w_g$ is group fitness, and $\bar{w}$ is population mean fitness. The covariance term captures between-group selection (favours cooperation if cooperative groups grow faster); the expectation term captures within-group selection (favours defection within each group). In finite populations, both can be significant.

For cooperation to evolve under group selection, the benefit of being in a cooperative group must exceed the cost of altruism:

Condition for Group Selection

In a population with $m$ groups of average size $n$, cooperation can be favoured if $$\frac{b}{c} > 1 + \frac{n}{m}$$ Larger groups or fewer groups make cooperation harder (more within-group selection relative to between-group).

Group selection is particularly relevant in structured populations, microbial communities, and human societies with strong group identity.

Network Reciprocity

A fifth and elegant mechanism: cooperation can evolve on networks (Games on Networks). If individuals are arranged on a graph and play games only with their immediate neighbours, clusters of cooperators are protected from invasion by defectors. A cooperator surrounded by other cooperators does well, while defectors surrounded by cooperators (and thus unable to exploit them further) do poorly.

Ohtsuki et al. (2006) showed that in regular networks where each individual has $k$ neighbours (e.g., $k=4$ on a square lattice), cooperation can be stable if:

Condition for Network Reciprocity

Cooperators can resist invasion on a network with average degree $k$ if $$\frac{b}{c} > k$$ The higher the degree (more neighbours), the harder it is for cooperation to survive. On sparse networks, cooperation is easier to maintain.

Network reciprocity is grounded in spatial structure: you interact with your neighbours, and your offspring inherit your location (or take a nearby one). Over time, cooperators cluster, and clusters are more productive than dispersed strategies. This has been observed in bacterial cooperation, ant colonies, and human communities.

Explore: Five Mechanisms for Cooperation

Five Mechanisms for the Evolution of Cooperation

Select a mechanism to explore. Adjust the parameters ($b$ = benefit, $c$ = cost, plus a mechanism-specific parameter) and watch how cooperator frequency evolves. The left panel shows trajectories from different starting frequencies; the right panel shows the phase portrait ($\dot{x}$ vs $x$) with stable equilibria highlighted.

Benefit $b$: 3.0

Cost $c$: 1.0

Relatedness $r$: 0.5

Cooperator frequency $x(t)$ over time

Phase portrait: $\dot{x}$ vs $x$

Multiplayer Cooperation

The mechanisms above assume pairwise interactions. But in nature, groups often work together: hunting packs, foraging flocks, cells in tissues. The public goods game extends cooperation theory to multiplayer settings. $n$ individuals can contribute to a common pool; each unit contributed is multiplied by a factor $r > 1$, then divided equally among all $n$ players. Individual incentive is to free-ride (not contribute), but if everyone contributes, everyone does better than if nobody does.

In the standard linear public goods game, a cooperator in a group with $k$ other cooperators earns $\pi_C(k) = r\,c\,(k+1)/n - c$ and a defector earns $\pi_D(k) = r\,c\,k/n$. The fitness difference $\pi_C - \pi_D = c(r/n - 1)$ is constant—cooperation is either always favoured ($r > n$) or never. Gokhale and Traulsen (2010) showed that allowing general payoff sequences $a_k$ and $b_k$ (where the marginal benefit of an additional cooperator can vary with group composition) produces nonlinear multiplayer games with up to $d-1$ internal equilibria. Biologically, nonlinearity models synergy (benefit accelerates with more cooperators), diminishing returns (benefit saturates), or threshold effects (a minimum number of cooperators is needed before cooperation pays). These mechanisms can produce bistability and sudden regime shifts—dynamics absent from the linear game. See Gene Regulatory Networks for a full treatment and interactive visualisation.

Exercises

Conceptual Questions

Explain Nowak's Five Rules for cooperation: kin selection, direct reciprocity, indirect reciprocity, group selection, and network reciprocity. Give a biological example for each.
In the Prisoner's Dilemma, why does Defect dominate Cooperate in a single game, yet cooperation can evolve when the game is repeated? What makes Tit-for-Tat successful?
What is the difference between direct reciprocity and indirect reciprocity? Why is indirect reciprocity harder to evolve and more vulnerable to cheaters?
Explain how spatial structure (population on a network) can promote cooperation even among unrelated individuals. Why does localized interaction matter?
What is the tragedy of the commons and how do public goods games model it? What mechanisms (punishment, reward, spatial structure) can prevent commons collapse?

Computer Problems

Prisoner's Dilemma and Tit-for-Tat. Implement a round-robin tournament where different strategies (Cooperate, Defect, Tit-for-Tat, Generous Tit-for-Tat) play 100 rounds of the Prisoner's Dilemma with payoff $R = 3$ (mutual cooperation), $T = 5$ (temptation), $S = 0$ (sucker), $P = 1$ (mutual defection). Compute cumulative payoffs and show why Tit-for-Tat dominates despite never being the highest-scoring single move.
Evolution of Cooperation by Reciprocity. Implement a population where individuals use either Cooperate or Defect, and payoff depends on opponent's previous move (direct reciprocity). Use replicator dynamics or Moran process to show that cooperation frequency increases from a low starting point. Vary shadow of the future (discount factor) and show cooperation threshold.
Indirect Reciprocity with Reputation. Model individuals who track others' reputation (cooperators vs defectors) and prefer to help those with high reputation. Implement a public goods game where helping is conditional on reputation. Show how reputation-based cooperation emerges and how it breaks down if defectors can fake reputation.
Network Reciprocity and Evolutionary Clustering. Implement a lattice or network where individuals are nodes and play Prisoner's Dilemma with neighbours. Use birth-death Moran process: reproduce proportional to payoff, offspring replace random neighbour. Show how spatial clustering of cooperators can protect them from defectors and allow cooperation to invade.
Voluntary Public Goods and Snowdrift Game. Implement a public goods game where contribution is voluntary ($c_i \in [0,1]$). Use continuous strategy space and continuous replicator dynamics. Compare outcomes with fixed-contribution public goods and show why partial contribution can be an ESS when both full contribution and free-riding are unstable.

References

Hamilton, W. D. (1964). The genetical evolution of social behaviour. Journal of Theoretical Biology, 7(1), 1–16.
Trivers, R. L. (1971). The evolution of reciprocal altruism. Quarterly Review of Biology, 46(1), 35–57.
Hardin, G. (1968). The tragedy of the commons. Science, 162(3859), 1243–1248.
Maynard Smith, J., & Price, G. R. (1973). The logic of animal conflict. Nature, 246(5427), 15–18.
Axelrod, R., & Hamilton, W. D. (1981). The evolution of cooperation. Science, 211(4489), 1390–1396.
Axelrod, R. (1984). The Evolution of Cooperation. Basic Books.
Nowak, M. A., & Sigmund, K. (1993). A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game. Nature, 364(6432), 56–58.
Nowak, M. A., & May, R. M. (1992). Evolutionary games and spatial chaos. Nature, 359(6398), 826–829.
Nowak, M. A., & Sigmund, K. (1998). Evolution of indirect reciprocity by image scoring. Nature, 393(6685), 573–577.
Nowak, M. A., & Sigmund, K. (2005). Evolution of indirect reciprocity. Nature, 437(7063), 1291–1298.
Nowak, M. A. (2006). Five rules for the evolution of cooperation. Science, 314(5805), 1560–1563.
Ohtsuki, H., Hauert, C., Lieberman, E., & Nowak, M. A. (2006). A simple rule for the evolution of cooperation on graphs and social networks. Nature, 441(7092), 502–505.
Traulsen, A. & Nowak, M. A. (2006). Evolution of cooperation by multilevel selection. Proc. Natl. Acad. Sci. USA, 103(29), 10952–10955.
Gokhale, C. S. & Traulsen, A. (2010). Evolutionary games in the multiverse. Proc. Natl. Acad. Sci. USA, 107(12), 5500–5504.
Nowak, M. A., Tarnita, C. E., & Wilson, E. O. (2010). The evolution of eusociality. Nature, 466(7310), 1057–1062.