← All summaries

David Reich – Why the Bronze Age was an inflection point in human evolution

Dwarkesh Podcast · Dwarkesh Patel — David Reich · May 8, 2026 · Original

Most important take away

Contrary to the long-standing view that human natural selection has been quiescent over the last several hundred thousand years, a new study from David Reich’s lab using ~16,000 ancient genomes shows that the genome is “vibrating” with directional selection — and the Bronze Age (roughly 5,000–2,000 years ago) was an unexpectedly intense inflection point. Selection pressure during this period exceeded even that of the initial transition to farming, reshaping immune, metabolic, pigmentation, and even cognitive/behavioral traits at speeds equivalent to roughly one standard deviation of modern variation per ~10,000 years.

Summary

Key themes

  • Selection has been rampant, not quiescent. Reich’s lab used a new methodology (predicting allele state from a genetic relatedness matrix and asking whether constant directional selection improves the prediction) on ~16,000 ancient European/Middle-Eastern genomes plus ~6,000 modern ones. They found ~479 positions under selection at 99% confidence and roughly 3,800 at 50% confidence — far more than the dozens previously detected. Independent validation came from sliding their selection statistic and seeing a clean, monotonic enrichment for variants associated with traits in GWAS data (UK Biobank, ~500,000 people).
  • The Bronze Age is the inflection point. Across pigmentation, lactase persistence, FADS1 (plant-fat metabolism), TYK2 (TB risk), hemochromatosis, ABO blood type, body-fat/BMI/diabetes risk, and polygenic predictors of years-of-schooling/IQ, selection peaks roughly 5,000–2,000 years ago. The naive expectation that the big shock was the Neolithic farming transition (~11,000 years ago) is wrong — the genomic response was strongest later, when population density, urbanization, and animal-disease exposure intensified.
  • Selection sometimes reverses. TYK2 rose in frequency, then crashed in the last 3,000 years, plausibly because endemic tuberculosis flipped its fitness sign.
  • Immune traits dominate the strong signals; behavioral/cognitive traits are still selected but underpinned by many small-effect variants and harder to detect at significance.
  • Polygenic predictors of “years of schooling” / cognitive performance moved by ~1 SD of modern variation over the last 10,000 years, with the strongest movement 4,000–2,000 years ago and almost no measurable selection in the last 2,000 years. Validated via cross-population transferability: variants whose effect on schooling was measured in Chinese populations show a 5–6 SD correlation with their selection trajectory in Europe — strong evidence the signal is real, not a Eurocentric GWAS artifact.
  • The “schooling/IQ” polygenic score is correlated with age at first birth, BMI, walking pace, and household wealth — likely tagging an “executive function / deferred gratification / r vs. K reproductive strategy” trait whose direction of selection flips with environment (Iceland over the last century shows a 0.1 SD decrease).
  • Body-fat reduction since the Neolithic supports the “thrifty gene” hypothesis — Europeans appear better protected against type-2 diabetes than groups with shorter agricultural history (African Americans, Native Americans).
  • Population size matters less than people think. Once populations exceed ~1 million, selection is no longer mutation-limited; the time span (3,000+ years) is what makes the Bronze Age signal detectable, not the population reaching 50 million.
  • Two big open mysteries. (1) Why did farming wait 40,000+ years after the cognitive toolkit was in place, only to appear independently across the world after the climate stabilized 12,000 years ago? (2) The standard model of Neanderthal/Denisovan/modern-human relationships looks like Ptolemaic epicycles. Reich is exploring an alternative: Neanderthals may be a Middle-Stone-Age modern-human population that expanded into Europe, got 95% genetically swamped by local archaics, but retained mtDNA, Y-chromosome, and toolkit — analogous to how Yamnaya genetic ancestry survives at only 5–20% in India yet carries Indo-European language and culture.

Actionable insights

  • Use the data, not your priors. Reich repeatedly says he came in with strong intuitions (no Neanderthal admixture, quiescent selection, hunter-gatherers should select hardest for intelligence) and the data overturned all of them. Spend years trying to make surprising results go away — if they survive, trust them.
  • Industrialize the pipeline. The lab’s edge is not exotic methods but having driven ancient-DNA sample throughput to >5,000 individuals/year via robotization and “in-solution enrichment” (washing low-yield samples over synthesized DNA baits to grab ~1M informative positions). The lesson generalizes: dominate a field by industrializing data production.
  • Combine independent datasets to validate. The cleverest move in the paper is using an external GWAS corpus (UK Biobank traits) to calibrate which selection signals are real, sidestepping traditional p-value cutoffs.
  • For research/learning workflows: Dwarkesh credits Cursor for letting him run multiple LLMs in parallel against the same paper, have one model critique another, and auto-generate flashcards. He calls this a better research interface than any chatbot for technical lit reviews.

Career advice

No explicit career advice. Implicit lessons from Reich’s career: (1) own a hard, expensive capability (industrial-scale ancient-DNA generation) that becomes the bottleneck other researchers depend on; (2) be willing to publish results that overturn your own prior published consensus; (3) train postdocs to convert into long-term staff scientists who can drive multi-year methodological innovation (Ali Akbari spent 7 years on this work).

Stocks / investments mentioned

No stocks or investments are discussed. The episode’s sponsor reads cover three companies — none are presented as investment ideas:

  • Cursor (cursor.co/dwarkesh) — coding/research tool used by the host for multi-LLM workflows. Privately held.
  • Jane Street (janestreet.com/dwarkesh) — recruiting pitch; describes their internal “Hivebox” compute-auction currency and a $6B “Hivebox stimulus” tied to a new compute deal. Privately held; not investable.
  • Roussou / “fast-tokens” (roussou.ai/dwarkesh) — open-source Rust tokenizer claiming up to 40% faster time-to-first-token on agentic workloads. Privately held.

No actionable investment ideas; no public tickers mentioned.

Chapter Summaries

  1. Why frequency changes matter (intro). Reich explains why detecting selection requires very large sample sizes: a single genome tells you a lot about history (you have ~tens of thousands of ancestors represented) but only one or two data points per allele frequency.
  2. The mainstream view was wrong. Prior consensus held selection had been quiescent because Europeans and East Asians show almost no 100%-different alleles after 40–50,000 years apart. Reich’s new analysis partitions frequency change: 98% is migration/drift, but the remaining 2% is filled with directional selection at hundreds to thousands of sites.
  3. Methodological sketch (deferred to end of episode). A new statistical method using a genetic relatedness matrix lets them ask whether constant directional selection improves prediction of allele states across 22,000 individuals.
  4. Trait enrichments. Strong signals are massively concentrated in immune traits (~4–5x enrichment) and metabolic traits; behavioral/cognitive enrichments are weak only because they’re underpinned by many small-effect variants.
  5. The Bronze Age inflection. Selection intensified 5,000–2,000 years ago across pigmentation, FADS1, TYK2 (TB), hemochromatosis, ABO, and cognitive polygenic scores. Lactase persistence makes obvious sense (dairying); FADS1 and the others are more surprising.
  6. The African-American admixture comparison. Reich’s 2014 study of ~30,000 African Americans found no detectable selection over ~5 generations of slavery — not because environments weren’t extreme, but because compound interest needs millennia to register.
  7. Cognitive performance polygenic score. Selection moved this score ~1 SD over 10,000 years, peaked 4,000–2,000 years ago, and is essentially zero in the last 2,000. European hunter-gatherers sit ~3 SD below the modern mean on the genetic predictor — though Reich emphasizes this is a noisy proxy correlated with executive function, age at first birth, BMI, etc.
  8. Cross-population validation. Variants whose schooling-effect was measured in China correlate 5–6 SD with their selection trajectory in Europe — proves the signal isn’t a GWAS artifact.
  9. Sponsor reads: Cursor, Jane Street (Hivebox internal compute economy), Roussou (Rust tokenizer).
  10. Why was intelligence not maxed out earlier? Reich speculates the relevant trait is really an r/K reproductive-strategy toggle (few-kids-high-investment vs. many-kids-low-investment) that flips with environmental conditions; schizophrenia/bipolar polygenic risk may similarly tag spectrum traits (creativity, visions) advantageous in some contexts.
  11. Latent variation. The ancestral ~10,000-person bottleneck contained enough latent polygenic variation to support all subsequent trait diversification; only a few traits (lactase, sickle-cell protection) needed novel mutations.
  12. Population size and detection. Once populations exceed ~1M, mutation supply isn’t limiting; what matters is time horizon, not headcount.
  13. The 50,000-year “cognitive revolution” mystery. Malik 2016 found no fixed differences between modern humans and humans 50,000 years ago — the cognitive/symbolic revolution may be cultural or polygenic with no smoking-gun sweep.
  14. Why farming waited. Genetic capability was in place 50,000+ years ago, yet farming only emerges ~12,000 years ago, then independently in many disconnected regions. Climate scientists tell Reich the Holocene’s stability is genuinely unique on a 2-million-year timescale.
  15. Big open question: Neanderthal/Denisovan relationships. Whole-genome data clusters Neanderthals with Denisovans, but mtDNA, Y-chromosome, and toolkits cluster Neanderthals with modern humans. This strains belief: a 5% admixture event somehow drove mtDNA and Y-chromosome to fixation.
  16. Whiteboard digression: a Ptolemy/Copernicus alternative. Reich sketches a model where modern humans invented Middle Stone Age technology, expanded in two directions, mixing 95% with European archaics (becoming Neanderthals) and 20% with deeper African archaics (becoming us). One revolutionary event explains the shared toolkit, mtDNA, Y-chromosome, and the timing of admixture pulses.
  17. Methodological deep-dive (moved to end). ~14x more data than prior studies; in-solution enrichment to grab ~1M informative positions from samples that are <1% human DNA; new GRM-based statistic; calibration via UK Biobank trait enrichment instead of traditional significance thresholds.
  18. Reich on being repeatedly wrong. He recounts how he initially disbelieved the Neanderthal-into-non-Africans signal (spent years trying to make it go away) and similarly disbelieved the new selection results. His career has been a sequence of priors overturned by data.