20VC: Box's Aaron Levie on Predictions for the Next Wave of AI: Will Foundation Models Be Commoditised | How the Business Model of SaaS Changes Forever | Startups vs Incumbents: Who Wins | App vs Infrastructure Layer: Where is the Value?
Most important take away
We are in a rare once-a-decade architecture shift where AI creates a narrow window (likely 2-5 years) for new platform-scale companies to emerge, but unlike prior waves this one favors incumbents almost as much as startups because they already own the data and workflows. The winners will not be those who build chat interfaces on top of existing software, but those who build AI agents that actually do the work (autopilots, not copilots) and target workflows too operational for a horizontal ChatGPT to subsume.
Summary
Actionable insights and tech patterns from the conversation:
Career and strategic advice
- If you are inside one of these AI-era companies, treat this period as pure survival and execution mode; do not plan to coast. The window is short and won’t repeat for another decade.
- Don’t build anything that one new OpenAI training run or a horizontal chat interface could subsume. Pick the boring, operational workflows that a human needs to run an end-to-end business process. That is where defensible value lives.
- For incumbents: drop your inferior in-house model the moment a better one exists. The instinct to “own the tech” because it’s emotionally yours is the classic innovator’s dilemma trap. Levie predicts the market would actually reward Duolingo if it announced GPT-4o was powering its translations.
- For founders: focus on cash flow earlier than feels necessary. Owning your destiny via cash flow forces better strategy and execution and is the metric you’ll eventually be judged on.
- Delegation lesson: pick the 1-2 areas where you stay deeply hands-on (Levie keeps AI, end-user experience, product strategy) and genuinely delegate the rest. You can return to founder-mode micromanagement when you’re a $2T company.
Tech patterns to watch
- Foundation models are commoditizing fast. Expect maybe 1-3 independent foundation model companies at scale plus the hyperscalers; there is not room for 50. Niche/industry/domain plays (e.g., audio where copyright is messy) may survive.
- Build a model-agnostic abstraction layer so customers can route different tasks to different models (Box’s approach: GPT-4 for legal Q&A, Gemini for metadata extraction, etc.). The personalities of models will stay differentiated enough that this routing matters.
- Context window is the single most important capability metric right now: ~500x improvement in 18 months (4K to 2M tokens). Nothing in tech history grows 500x per 18 months. Moore’s law is not dead but is being supplemented by GPU performance gains.
- Agents are the actual paradigm shift, not chat. Chat just moved the GUI to a command line. Agents move software from “tool I use to do work” to “labor I farm work out to.” Expect AI versions of every job function: AI SDR, AI QA engineer, AI support agent, AI marketer. 10-50 category winners will emerge in a 5-year window, similar to the SaaS boom.
- AI is the death knell for on-prem holdouts. Companies whose data isn’t in cloud-ready form will be locked out of AI value entirely. Levie is seeing former cloud holdouts finally migrate because of AI.
- RPA vendors are well-positioned, not threatened. They already sell automation and consumption-based pricing. The total market will get 10-100x larger because AI removes the complexity that artificially capped the legacy RPA market.
- Org structures will mostly survive intact. The change is an AI labor layer that slots into the existing org chart. Most companies will reinvest productivity gains into more headcount in that function (since there is always more work than people), not slash headcount. The “one-person billion-dollar company” will exist but be the exception.
- Business model is still being figured out. Seat-based SaaS is under pressure. Watch for consumption-based pricing on agent output units (tickets resolved, leads generated, emails sent, contracts reviewed). Pure value-based pricing only works for Palantir-style bespoke implementations.
- Services spend will lead infrastructure spend for the next ~5 years (Accenture’s $2.4B AI revenue already exceeds OpenAI’s $2B). Change management and implementation is the long pole. Eventually infrastructure spend overtakes services once systems are in production.
- Second-order opportunity: tooling to manage AI labor (a “Workday for AI” — identity, guardrails, orchestration of agents from many providers).
- Apple is well-positioned, not behind. AI usage today is mostly happening on Apple devices people paid Apple for. Whether Siri’s engine is OpenAI or in-house is irrelevant to users. The phone becomes the universal task automation engine.
- Democratization angle: AI agents lower the barrier to starting a company globally, similar to what Shopify/Stripe/AWS did. Expect a boom of companies from regions that previously lacked the labor infrastructure (e.g., SDR teams) to start a business.
Regulation
- Less worried than a year ago. The “pause AI for 6 months” moment was the scariest because if government had picked up on tech-community alarmism, progress would have frozen. Current regulation is more surgical (copyright/IP, national security) which is reasonable.
Chapter Summaries
- Living through the AI architecture shift: Once-a-decade window (PC, web, mobile/cloud, now AI). This one is more competitive than past waves because incumbents have the data and workflow advantage. Survival-mode execution is mandatory.
- Foundation models vs application layer: Foundation model layer will consolidate to a handful of players (hyperscalers plus 1-3 independents). The application layer is where most new juggernauts emerge.
- Multi-model architecture at Box: Box built a layer connecting customer data to any model. Different models will remain differentiated by personality and task fit (legal vs metadata extraction).
- On-prem holdouts are finally moving: AI is the final reason to migrate. Cloud holdouts are now talking to Box for the first time.
- Rate of model improvement: 500x context window growth in 18 months. GPU performance gains supplement Moore’s law.
- Agents are the real shift: Chat was a UX paradigm change; agents are a labor paradigm change. RPA gave a preview but was too frail. AI agents = autopilots, not copilots.
- Org structure impact: Org charts mostly survive. AI labor slots in alongside human labor. Most companies reinvest efficiency gains into more headcount, not fewer.
- Experimental vs production spend: Both happening simultaneously. CFOs see mixed AI line items, sometimes bundled in SaaS spend, sometimes as standalone consumption.
- Pricing and business models: Seats under pressure. Consumption-based on agent output units is the likely winner. Bespoke value pricing only works for Palantir-class deals.
- OpenAI as platform threat: OpenAI has told you what they will become (universal assistant + API for all modalities). Don’t build anything they can subsume with one training run.
- Incumbents must drop inferior in-house tech: Duolingo should adopt GPT-4o if it is better. Markets would reward, not punish, this.
- Galvanizing a 2,700-person company: All-hands clarity on the stakes. The risk is being on the wrong side of the bridge when the AI transition completes.
- What’s lost at scale: Speed of pivot. At a startup the whole team is in the room; at scale you must perfectly time when to expose a working thing to the rest of the company. Gemini’s launch issues were partly a manifestation of this.
- Regulation: Less worried than 12 months ago. Surgical regs on IP/copyright/national security are reasonable.
- RPA vendors: Well-positioned. Market will expand 10-100x as AI removes complexity barriers.
- Services lead infrastructure: Accenture’s AI revenue already exceeds OpenAI’s. Implementation is the long pole.
- Democratization of company creation: AI agents lower the barrier to starting a business globally.
- Job displacement and dangerous AI: Existing legal frameworks cover most cases. Less worried about runaway self-replicating AI in the near term.
- Quick-fire: Worst at delegation. Bullish on Zuck because of compute + engineers + users + data + motivation. Would add Jensen Huang to the board for supply-chain visibility. Biggest regret: didn’t focus on cash flow earlier.
- Bullish on Apple: AI doesn’t threaten their business and supercharges device usage. Partnering with OpenAI is not weakness, it’s supply chain.
- Box’s 5-year vision: $2B revenue, position Box as the “digital memory” of the enterprise that AI can tap into for decisions, automation, and onboarding.