
Dylan Patel - The Infinite Demand for Tokens, Claude Mythos, and Supply Constraints

Invest Like the Best · Patrick O'Shaughnessy with Dylan Patel · April 23, 2026

Most important takeaway

Demand for frontier AI tokens is effectively unbounded and growing exponentially: SemiAnalysis itself went from tens of thousands of dollars to a $7M annual run rate on Claude in under a year (~25% of salary spend). Supply across memory, logic, optics, and fab equipment cannot catch up before 2027-2028, so frontier model access will concentrate among those with capital and lab relationships. Operators who don’t aggressively use tokens to generate and capture outsized value risk being commoditized or locked into a “permanent underclass.”

Chapter Summaries

  • SemiAnalysis’s token explosion: Firm spend went from ~$10K to a $7M run rate in a year, driven by Claude Code usage even among non-engineers. One person rebuilt work that previously required full teams (a chip reverse-engineering lab, a full US power grid map, a 2,000-task economist benchmark).
  • Commoditization pressure: If you don’t adopt aggressively, competitors using AI will undercut you. Investment firms will buy data rather than build it, but edge goes to those who move fastest.
  • Token demand is unbounded: Anthropic grew from $9B to ~$40B revenue with near-flat compute; margins floored at 72%. Willingness to pay for the newest frontier model is extreme; older models become irrelevant despite massive cost declines.
  • Mythos and lab concentration: Anthropic’s “Mythos” is the biggest capability jump in two years; Anthropic is selectively releasing it (e.g., to cyber customers only). Expect frontier models to be distributed to fewer customers at higher prices.
  • Implementation is now cheap; ideas are the constraint: Economic reordering where picking the right idea, selling it, and attracting capital matter more than execution skill.
  • Robotics is next: VLAs are too data-inefficient, but software singularity will enable few-shot pre-trained robot models in 6-18 months, further expanding token demand.
  • OpenAI vs. Anthropic: Anthropic is compute-constrained; OpenAI has raised massively for compute (Oracle, CoreWeave, SoftBank, Microsoft, Trainium). Even Tier-2 and Tier-3 labs will sell out of tokens.
  • Supply bottlenecks: H100 prices rising, GPU useful life extending to 7-8 years. Memory (DRAM especially) will double or triple in price; real capacity doesn’t arrive until late 2027/2028. TSMC capex heading toward $100B by 2028. CPUs sold out due to RL environments and deployed-app serving. Copper foil, glass fibers, lasers, optics all tight.
  • Public perception risk: Patel predicts large-scale anti-AI protests within months; lab CEOs need to stop doing interviews and stop hyping future capabilities.

Summary

Actionable insights

  • If you have capital, get an enterprise Anthropic contract and pay per token rather than by subscription: this eases rate limits and secures access to the newest models. Relationships with lab reps determine rate-limit increases.
  • Always use the newest frontier model. Mythos costs more per token but uses fewer tokens per task, so it is cheaper on net than Opus 4.6 for most work. Older-tier models are irrelevant for value creation even as they get 100-1000x cheaper.
  • Three-part mandate to avoid the “permanent underclass”: (1) use more tokens, (2) generate economic value from them, (3) capture that value. The lazy path is working one hour instead of eight; the winning path is working eight hours and producing 8x output.
  • Pick ideas, don’t execute. Implementation cost has collapsed. The scarce skill is choosing which ideas justify the token spend, then selling the result and attracting capital.
  • Expect model-access concentration. Imagine a Ken Griffin-style deal where one firm prepays $10M for first access to each new model. Relationship capital with labs becomes a moat.
  • Robotics window: 6-18 months. Few-shot pre-trained robot models are coming; niche rental/service robot models will proliferate.
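
The claim above that a pricier model can be cheaper on net comes down to simple arithmetic: cost per task is price per token times tokens consumed. A minimal sketch in Python, assuming made-up prices and token counts (the episode summary quotes no per-token figures, so every number below is a hypothetical placeholder):

```python
def cost_per_task(price_per_million_tokens: float, tokens_per_task: int) -> float:
    """Dollar cost of one task: per-token price times tokens consumed."""
    return price_per_million_tokens * tokens_per_task / 1_000_000


def break_even_token_ratio(old_price: float, new_price: float) -> float:
    """Fraction of the old model's token count at which the new model costs the same."""
    return old_price / new_price


# Hypothetical inputs for illustration only; not figures from the episode.
OLD_PRICE, NEW_PRICE = 20.0, 50.0          # dollars per million tokens
OLD_TOKENS, NEW_TOKENS = 500_000, 150_000  # tokens needed for the same task

older = cost_per_task(OLD_PRICE, OLD_TOKENS)
newer = cost_per_task(NEW_PRICE, NEW_TOKENS)
ratio = break_even_token_ratio(OLD_PRICE, NEW_PRICE)

print(f"older model: ${older:.2f} per task")
print(f"newer model: ${newer:.2f} per task")
print(f"newer model wins if it uses under {ratio:.0%} of the older model's tokens")
```

With these placeholder inputs the newer model is 2.5x the price per token but needs only 30% of the tokens, which is under the 40% break-even, so it comes out cheaper per task. Whether that holds for Mythos versus Opus 4.6 depends on real prices and token counts, which are not given here.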

Company-specific information

  • Anthropic: ~$40B revenue run rate, 72%+ gross margins (earlier leak showed 30-something percent, now expanded dramatically). Compute-constrained. Holding back Mythos from general release; only select cyber customers have it. Opus 4.7 just launched.
  • OpenAI: Aggressively buying compute from Oracle, CoreWeave, SoftBank, Microsoft, and now Amazon Trainium. Taking a more gradual scaling approach. Perceived as “behind” right now but will catch up and serve the next tier of demand at 50% margins.
  • SemiAnalysis (Patel’s firm): ~$25M salary spend, $7M/year on Claude Code, growing fast. Built an energy data business (US grid mapping, power plant and transmission line data) in weeks, competing with incumbents that took a decade. Sells data to hedge funds including Citadel and Shaw.
  • NVIDIA: ~75% gross margins holding; making large prepayments upstream.
  • TSMC: $56-57B capex in 2026, could hit $100B by 2028; raising prices only single digits despite being fully sold out.
  • Memory (DRAM/NAND): Low double-digit percent capacity growth per year; true incremental supply doesn’t arrive until late 2027/2028. DRAM prices expected to double or triple again.
  • ASML, Lam Research, Applied Materials, MKSI: Equipment suppliers upstream of TSMC, set to see amplified demand as its capex ramp whips through the supply chain.
  • FPGAs: 120 per next-gen AI rack — an underappreciated demand vector.
  • CPUs: Sold out, needed for RL environments and deployed-app serving.
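
Several of the figures above can be cross-checked with back-of-envelope arithmetic. The sketch below is only a sanity check on numbers already quoted in this summary; the $56.5B midpoint for TSMC's 2026 capex and the two-year compounding window are assumptions made for illustration.

```python
def share_of(part: float, whole: float) -> float:
    """Return part as a fraction of whole."""
    return part / whole


def implied_annual_growth(start: float, end: float, years: int) -> float:
    """Compound annual growth rate needed to get from start to end."""
    return (end / start) ** (1 / years) - 1


# Figures quoted in the summary (dollar amounts in millions or billions as noted).
claude_share = share_of(7.0, 25.0)  # $7M Claude spend vs ~$25M salary spend

# Assumption: midpoint of the quoted $56-57B for 2026, compounding over 2 years to 2028.
tsmc_growth = implied_annual_growth(56.5, 100.0, 2)

anthropic_multiple = 40.0 / 9.0  # $9B to ~$40B revenue on near-flat compute

print(f"Claude spend as share of salaries: {claude_share:.0%}")
print(f"TSMC capex growth implied per year: {tsmc_growth:.0%}")
print(f"Anthropic revenue multiple: {anthropic_multiple:.1f}x")
```

On these inputs the Claude spend works out to 28% of salaries (in line with the roughly 25% cited), TSMC capex would need to grow about 33% per year to reach $100B by 2028, and Anthropic's revenue rose about 4.4x, which on near-flat compute implies a similar rise in revenue per unit of compute.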

Career advice

  • Be a hustler: work the full day and 8x your output with AI rather than coasting.
  • Information businesses that don’t aggressively adopt AI will be commoditized by those that do, including their own customers.
  • Value creation matters more than value capture, but focus on all three steps: using more tokens, creating value from them, and capturing that value commercially.
  • Choose ideas and direction — that’s the new scarce skill. Selling and capital formation around AI-generated output become critical.