The compute bottleneck has three floors, and the SaaS apocalypse just had its first wind-down
Anthropic raised $45B and is still compute-short. OpenAI missed both targets. Medallia got handed back to creditors with $5.1B of equity wiped. Three independent voices in this issue identified the same supply-side wall — but at different layers (memory bandwidth → power → capital cost). Paul Tudor Jones put a 252%-of-GDP number on the equity bubble while Jon Gray put a $300B-from-one-firm number on the data-centre wave. And Adam Foroughi's AppLovin (84% EBITDA margins, $10M EBITDA per employee) is the operator-side proof that the lean AI-native reset works — Kalshi and Baseten echo it from prediction-market and inference-cloud benches.
The Threads
The compute bottleneck has three floors
Last week Dylan Patel put a number on the demand wall — Anthropic’s gross-margin floor at 72%. This week three independent voices put numbers on the supply wall, at three different layers of the stack.
Floor 1 — capital cost. Tuhin Srivastava (Baseten, No Priors) is running the inference cloud at mid-90s utilisation across 90 clusters in 18 clouds with a daily 4pm capacity-allocation meeting. The number that matters: GB200 access now requires 3-5 year contracts with 20-30% TCV prepay. Cost of capital just became the binding constraint on inference capacity. Baseten grew 30x in 12 months and is on track for >$1B in 2026; 95% of served tokens are now custom (post-trained) models, almost no one runs vanilla open-source weights at scale. Top-30 customers have never churned. 400% NDR. The H100, 4.5 years post-launch, is still appreciating in the secondary market. ‘Inference is the last market — even if there’s AGI, all that’s left is inference.’
Floor 2 — power and grid components. Chamath on All-In sharpened the framing his peers missed: ‘Everything in this market is power-constrained. The reason these folks miss a number has nothing to do with demand. It is 100% due to the supply of power.’ Notes that 40% of announced gigawatts will be cancelled because of grid red tape, transformer/turbine supply chain delays. Hyperscalers are extracting equity from labs to grant capacity — Anthropic’s $45B from Amazon is the direct example. Jon Gray (Blackstone) confirms it from the buyer-of-the-assets seat: Blackstone alone is signing 6 GW of data-centre leases in 2026 = ~$100B of data-centre capex + ~$200B of hyperscaler chips = $300B from one firm = ‘almost the size of Finland or Portugal’. 8 of Blackstone’s 10 best-performing Q1 investments were in data centres, LNG, battery storage. The aggregate hyperscaler 2026 CapEx headline number from earnings week was $725B (Amazon $200B / MS $190B / Google $190B / Meta $145B); Amazon FCF imploded -97% QoQ.
Floor 3 — memory bandwidth. Reiner Pope (Maddox, ex-Google TPU) gave the technical proof on Dwarkesh that this is the deepest floor. The roofline analysis is brutal: optimal inference batch ≈ 300 × sparsity (~2-3k tokens), the train-schedule analogy (batches depart every ~20ms = HBM drain time), MoE forces single-rack residency because scale-out fabric is 8x slower than NVLink. API pricing leaks the architecture: Gemini’s 50% jump at 200k context is the empirical inflection point where memory time crosses compute time. Output tokens cost 5x input because decode is memory-bandwidth-bound (one token at a time, fetching the whole KV cache for each step). Per Patel cited mid-episode: ~50% of 2026 hyperscaler CapEx is going on memory. Models are ~100x overtrained vs Chinchilla because inference token volume across a model’s 2-month life exceeds training token volume — the entire ‘sum of human knowledge’ in tokens gets re-emitted by every served model. The 200k context-length ceiling has held for two years and there’s no clear path off it without HBM scaling materially or attention becoming fundamentally sparser. Direct ceiling on the ‘long context replaces continual learning’ / agent-as-employee thesis until the wall moves.
So the same headline — ‘AI is compute-bound’ — actually unfolds into three nested constraints: HBM bandwidth caps context (Pope) → power caps the data-centre buildout (Chamath, Gray) → cost of capital caps who can buy long-dated capacity (Tuhin). Anthropic’s $45B raise [forecast: 2026-05-03-001] doesn’t fix any of them on a 12-month view. Where I’d put numbers on this:
- At least one major lab strikes a new equity-for-compute deal with a hyperscaler structurally similar to Anthropic-Amazon within 9 months [forecast: 2026-05-03-001] — confidence 0.7. Once the precedent exists, the asymmetric leverage is the hyperscalers’.
- The 200k context-length ceiling holds for at least 12 more months for any production model from the top-4 labs (excluding research-only/research-preview releases beyond 1M tokens that aren’t priced or rate-capacitated for production) [forecast: 2026-05-03-002] — confidence 0.6. Hardware roadmaps don’t bend in 12 months.
- Aggregate 2026 hyperscaler CapEx prints at ≥$650B (vs the $725B forward guide) [forecast: 2026-05-03-003] — confidence 0.7. Some delays, but the direction is committed.
PTJ vs Gray: the same IPO number, opposite read
The same data point shows up as a bear case from Paul Tudor Jones on Invest Like the Best and a bull case from Jon Gray on Q2 Market Views. Both are right about the number; they disagree on which way it cuts.
PTJ’s bear case (the bubble math): US stock market cap is now 252% of GDP. That’s the highest ratio in history. 1929 peak: 65%. 1987 peak: 85-90%. 2000 peak: 170%. Mean reversion to the 25-30 yr trailing PE = ~30-35% S&P decline. Apply that to 252%-of-GDP equity wealth and you get a ~89% of GDP reverse wealth effect, with cap gains (10% of US tax revenue) going to zero — budget deficit blows up, bond market ‘gets smoked’, self-reinforcing. The IPO number: 2026 contemplated IPO supply ≈ 5-6% of market cap vs ~2-3%/yr the market has been net-retiring via buybacks for a decade. And buybacks themselves are collapsing because hyperscaler CapEx is eating free cash flow (Amazon FCF -97% QoQ, MS/Google/Meta -12/-12/-8%). ‘Cascade of selling’ analogous to 2001-2002 post-IPO unlocks. Tech is ‘dogged’ because that’s where the IPO funding gets sourced from.
Gray’s bull case: ‘Year of the IPO. Two of the largest tech companies in the world will go public — that helps receptivity. We’ve got 9 companies on file globally.’ Same supply event, framed as evidence that the public-markets risk window is open. PTJ’s frame: this is the cascade catalyst. Gray’s frame: the demand exists for the supply. Both can be true if you separate the cohorts — newly-listed AI-infra and energy/data-centre names absorb capital eagerly, the broader index re-rates lower as buyback support evaporates and LP capital rotates into the IPOs.
PTJ’s other notes worth keeping: PE allocation in institutional portfolios went 7% (2008) → 16% (2026), real estate and infrastructure also up. The illiquidity stack going into a drawdown is structurally worse than the GFC entry conditions. Buying S&P at PE 22 historically produces negative 10-year forward returns. Gray’s tone counter-counter: ‘stay calm, stay positive, never give up’ — and the operating data is good (Q1 PE portfolio +10% revenue growth).
The synthesis on this prediction — pick your battle:
- By end of 2026, US equity wealth declines by ≥10% peak-to-trough at some point in the year [forecast: 2026-05-03-004] — confidence 0.55. Doesn’t require the full mean-reversion crash, just the IPO-supply / buyback-collapse mechanic biting through the index.
- PTJ’s specific long-yen / short-USD trade outperforms USD by ≥10% within 6 months [forecast: 2026-05-03-005] — confidence 0.5. High-conviction setup but FX is hard. Tracking the dollar-yen catalyst on the new Japanese PM.
- The Year-of-the-IPO holds — at least 8 of the 9 Blackstone-on-file companies actually price by end of 2026 [forecast: 2026-05-03-006] — confidence 0.6. They wouldn’t be on file if they weren’t ready, and the underwriting calendar is real.
The SaaS apocalypse had its first wind-down — and its operator-side proof
Last week’s Issue 02 prediction — ‘at least one major SaaS incumbent does an acquihire or product-line wind-down explicitly attributed to LLM-native displacement within nine months’ [forecast: 2026-04-26-009] — got a partial validation event this week. Thoma Bravo handed Medallia back to creditors. $5.1B of equity wiped. A pre-AI low-growth SaaS taking on $2B+ debt, then unable to service it. The honest read: this is partial validation, not clean. The wind-down is primarily a capital-stack failure (LBO debt + low growth = uncorrectable), not an explicit LLM-displacement story. But it’s the first PE wind-down of the cycle and it materially shifts the base rate.
The strongest framing of the surrounding logic came from Lemkin on 20VC — the three-bucket SaaS framework (which I think becomes the durable mental model coming out of this issue):
- Melting iceberg — eroding terminal value, leveraged → effectively dead. (Medallia.)
- System of record — sticky but no agent activity → bounded cash flow, deep-value play. (Workday, Atlassian-without-agents.)
- Agent-using — increasing returns from AI traffic → growth re-acceleration possible. (Stripe, Cloudflare, Twilio.)
The operator-side validation came from a totally different show. Adam Foroughi (AppLovin, 20VC) gave the cleanest operating-leverage proof I’ve seen on the podcast: 84% EBITDA margins, ~$10M EBITDA per employee in the 400-person core, near-triple-digit revenue growth — and they cut 40-50% of headcount in that growth year because the roles were going to be automated. Eliminated CMO, COO, CRO, CHRO, Chief People Officer. 80-90% of code is AI-generated (vs Databricks’ 50% disclosed the day prior). Stack: mostly Claude Code, some Codex, less Cursor than before. Foroughi’s frame on the SaaS apocalypse: ‘when you get into an unpredictable outcome in the future, it’s very easy to sell businesses. The SaaS apocalypse is not done.’ Companies don’t wipe out — embedded software is sticky — but growth dies, terminal value gets discounted, SBC % blows out, downward spiral.
Jon Gray gave the credit-investor’s matching frame: ‘not all software is created equal. Deeply embedded systems of record — ripping them out will be quite difficult.’ And — important for credit cycle pricing — Blackstone’s PE loans typically carry ~60% equity cushion, so even in equity-wipeout scenarios, senior debt is well-protected. The investable corollary: bucket-1 equity is the trade to short; bucket-1 senior debt is largely fine.
What this gets us:
- At least 2 more PE wind-downs of pre-AI low-growth SaaS by year-end 2026 (in addition to Medallia) [forecast: 2026-05-03-007] — confidence 0.65. The capital-stack mechanics are now in motion.
- Lemkin’s three-bucket framework becomes a standard analyst framing (cited in at least 3 sell-side reports) within 6 months [forecast: 2026-05-03-008] — confidence 0.55. Useful frameworks travel.
- The Issue 02 prediction [forecast: 2026-04-26-009] gets logged as partially validated — Medallia happened but not for an explicit LLM-displacement reason. Tracking whether the next wind-down has a cleaner causal story.
Lean ops as the new operating model — three independent proofs
Three companies in this issue independently confirmed the same operating pattern. They’re in completely different verticals.
AppLovin: 400 people in the core, $10M EBITDA per employee, no CMO/COO/CRO/CHRO. Engineers double as PMs. ‘A players won’t exist in bulk if you have a bunch of B’s, C’s, and D’s around them. The only way to fix a bloated culture is to fire 99% and rebuild from the ground up.’
Kalshi: 120 people. No managerial layer. Co-founder Luana ‘knows what 80-85% of the org is doing’ on Slack in any 48-hour window. People self-organise to a dynamically-listed top-N problems. Slope > intercept on hires. Trade-off explicitly accepted: ‘we take on more organisational chaos to avoid bureaucracy.’
Baseten: Was very flat until 12-18 months ago — Sarah Wang told Tuhin he ‘just needed leaders’ and pushed back his engineering instinct that all overhead is bad. Hero culture explicitly banned. First-principles + kind + low-ego + can-handle-no-manager = the explicit hiring rubric. Pager-culture as infrastructure DNA — co-founder Amir’s 7-year-old asks ‘is that a P0?’ when his pager goes off.
Plus a fourth, philosophically: Chamath’s short on the top 0.01% — three traits: work ethic + stamina (‘isn’t God-given, it’s a level of desire’), repetition + focus over thousands of hours, and honesty as the foundation of taste — and taste as the foundation of success. The Eric Brandon-AOL advice he keeps returning to: ‘be the most successful 22-year-old possible’ — don’t compare yourself to people in different contexts. And the warning shot: ‘I have worked with infinite Harvard/Stanford pipeline graduates who showed up with zero resilience.’
The pattern is sharp enough to call it: the operator-side reset that produces the AppLovin financial profile (84% EBITDA margins, $10M EBITDA per employee) is methodically replicable. The ingredients are layered-management elimination, AI-native engineer-as-PM, hero-culture ban, slope-over-intercept hiring, ruthless A-player concentration, and an explicit token-spend-tied-to-revenue-KPI discipline rather than token-leaderboards. The pattern is now visible across martech (AppLovin), prediction-market exchange (Kalshi), and inference cloud infrastructure (Baseten) — and will produce a wave of comparable financial profiles in 2026-27.
- At least 2 publicly-traded software companies post EBITDA-per-employee >$5M in 2026 disclosures (vs the historical ‘best in class’ SaaS metric of $0.5-1M) [forecast: 2026-05-03-009] — confidence 0.55. AppLovin is the tip of an emerging cohort.
The AI-safety regulator gap and the cyber upgrade cycle
The most uncomfortable disclosure in this issue came from Paul Tudor Jones. He attended a closed conference about 18 months ago with one modeler from each of the top-4 labs (~35-40 people in the room). When PTJ asked how AI safety gets resolved, the consensus answer from the modelers themselves was: ‘I think we’ll finally do something about it when 50 or 100 million people die in an accident.’ Buffett sent PTJ a personal note after his CNBC segment: ‘I agree with you 100%, but the genie’s out of the bottle.’ PTJ’s policy ask: AI watermarking, made a felony to violate. He was deepfake-targeted twice this year already. The Atomic Energy Commission analogy: ‘18 months after Hiroshima we had the AEC. Three years into AI — what are you talking about? There is no regulation.’
The investable form of the same risk landed on All-In as the AI cyber upgrade cycle. Sacks framing: GPT-5.5 Cyber matches Mythos and is commercially shipping (Anthropic’s Mythos still gated). All frontier models will hit Mythos-grade in ~6 months. Chinese models (DeepSeek-4) at 80-85% of frontier already. Chamath: the best CSO he knows can ‘essentially manipulate every model.’ Beneficiaries flagged: CrowdStrike, Palo Alto Networks, Wiz — the white-hats get the tools first, find dormant bugs, harden infrastructure. Adam Foroughi on the same risk: ‘these models are built to audit code and expose vulnerabilities — short-term we’ll see more breaches because shipping outpaces audits, long-term a lot more buttoned up.’ Tuhin added the second dimension: ‘security crunched and operationally crunched onto people who can run these data centres’ — only 12 ‘good’ clouds and 3-4 in the gold tier despite the apparent supply.
- Top-3 cybersecurity vendors (CrowdStrike, Palo Alto, Wiz) post average revenue growth ≥30% YoY in their next two reported quarters, attributed in earnings calls to AI-cyber-upgrade demand [forecast: 2026-05-03-010] — confidence 0.55. The revenue is real but the attribution may lag the dollars.
- No federal AI safety regulation passes US Congress within 12 months despite the modeler-acknowledged tail risk [forecast: 2026-05-03-011] — confidence 0.75. Buffett’s read (‘genie’s out of the bottle’) is depressing but accurate. The actual regulator action will come from CARB-style state-level frameworks first.
Two short notes worth keeping
Kalshi and the load-bearing 2024 ruling. Tarek Mansour walked through the 6-year regulatory war and the October 2024 lawsuit win against the CFTC. That ruling is now the legal foundation under every prediction-market product launching in 2026. The structural argument why prediction markets are not gambling — the casino’s KPI is customer losses (forces algorithms to promote losses) vs the exchange’s KPI is transaction-fee volume (forces neutrality and trust) — is the policy/PR backbone the entire sector will use. Real institutional use cases now: Florida Keys hurricane hedging (insurance carriers have exited), Biden-era student-loan forgiveness hedging, S&P holders buying Republican/Democrat contracts to hedge election impact rather than selling the underlying. And the validation that sticks: a Federal Reserve research paper now cites Kalshi-style markets as ‘the best gauge we have on the economy.’
Steve Hilton and the California GOP primary. Trump-endorsed, leading the polls. Tax plan: 0% state income tax under $100k, 7.5% flat above. Also disclosed: CalDOGE-style audits estimate ~$425B of CA waste/fraud over 5 years (~20% of the state budget). California imports 80% of its oil (top supplier Iraq) despite significant in-state reserves; gas $7-8/gal. Hilton’s tech-relevant warning: ‘the proposed billionaire’s tax would be a complete disaster for the tech ecosystem.’ Worth tracking because the path to victory is not as long-shot as the consensus view suggests — needs ~5.9M votes; Trump got 6.1M in CA in 2024 with no campaign spend.
- Hilton wins the California Republican gubernatorial primary on June 2 [forecast: 2026-05-03-012] — confidence 0.7. Trump endorsement + leading polls + crowded D field (top-two primary).
Eleven episodes, 10.7 hours. Analytic hand-off complete. The compute thesis tightened: it’s three floors deep, not one. The SaaS apocalypse moved from prediction to first-event. And the lean-ops pattern crystallised across three independent verticals. Next week: watching the prediction ledger, especially the $300B Blackstone data-centre prints, the next PE wind-down, and whether the Hilton primary delivers.
This Week's Episodes
- The Twenty Minute VCAnthropic Raises $45B but Falls Short on Compute & Thoma Bravo Hand Back Medallia
Lemkin/Rory/Harry on the four-headline week: Anthropic takes another $45B from hyperscalers and is STILL compute-short; OpenAI misses both revenue and user-growth numbers; China blocks Manus's $2B acquisition; Thoma Bravo hands Medallia to creditors with $5.1B of equity wiped. Lemkin's flip — back to Team Sam because his agents prefer OpenAI's API. Three-bucket framework for the SaaS apocalypse.
Read episode summary → - The Twenty Minute VCAppLovin CEO: Why Founders Shouldn't Angel Invest & Why the Best Don't Need Mentorship
Adam Foroughi on AppLovin's 92%-down to ~$150B (~80x off the bottom). 84% EBITDA margins, $10M EBITDA per employee in the 400-person core. Cut 40-50% of headcount in a triple-digit growth year. 80-90% of code AI-generated, mostly Claude Code. Cash flow minus SBC as the only honest metric. The SaaS apocalypse 'is not done.'
Read episode summary → - All-In PodcastOpenAI Misses Targets, Codex vs Claude, Elon vs Sam Trial, Big Hyperscaler Beats
Same week as the 20VC episode but Sacks's contrarian: GPT-5.5 + Spud + Codex are winning back coders while Opus 4.7 is allegedly compute-rationed. Chamath sharpens the bottleneck: power, not compute. 40% of announced GW will be cancelled. Hyperscaler CapEx hits $725B for 2026, Amazon FCF -97% QoQ. AI cyber the next CrowdStrike-scale wave.
Read episode summary → - All-In PodcastCA Governor Candidate Steve Hilton on Why California is Destroying Itself
Hilton (ex-Cameron senior adviser, Trump-endorsed) leading the GOP primary. Tax plan: 0% under $100k, 7.5% flat above. CalDOGE estimates $425B of waste/fraud over 5 years. CA imports 80% of oil from Iraq, gas $7-8/gal. Billionaire's tax called 'a complete disaster for the tech ecosystem.' Path to victory: 5.9M votes; Trump got 6.1M in CA in 2024.
Read episode summary → - BlackstoneHow Blackstone Is Thinking About IPOs, Hard Assets & AI Investing | Q2 2026 Market Views
Blackstone alone signing 6 GW of data-centre leases in 2026 = ~$300B = 'almost the size of Finland or Portugal' from one firm. Q1 PE portfolio +10% revenue growth. Year of the IPO (9 BX companies on file). 'Not all software equal — deeply embedded systems of record will hold up.' Watchwords: stay calm, stay positive, never give up.
Read episode summary → - BlackstoneJon Gray on Private Credit, AI Infrastructure & an All-Weather Firm | Q1 2026 Results
Distributable earnings +25%, ~$70B raised including near-record private-credit institutional quarter. 8 of 10 best Q1 investments in data centres / LNG / battery storage. Portfolio LLM spend up 15x YoY. B-CRED 60% premium over leveraged loan market over 9.5 years. $38B of properties sold at premium to marks since rate hikes.
Read episode summary → - ChamathWhat I Learned From Being Around The Top 0.01%
Three traits: work ethic + stamina (Kevin Hart, Elon), repetition + focus (Draymond Green), honesty as the foundation of taste (Ackman, Loeb). The Eric Brandon AOL advice — 'be the most successful 22-year-old possible.' Calls out Harvard/Stanford pipeline graduates with 'zero resilience and no sense they accomplished anything themselves.'
Read episode summary → - Dwarkesh PodcastHow GPT-5, Claude, and Gemini are actually trained and served — Reiner Pope
Blackboard lecture from Reiner Pope (CEO Maddox, ex-Google TPU). Roofline analysis explains every observable AI pricing fact: optimal batch ~300 × sparsity, Gemini's 50% jump at 200k context, output tokens 5x input. Memory bandwidth — not compute, not even just power — is the deepest bottleneck. ~50% of 2026 hyperscaler CapEx going on memory. Models are ~100x overtrained vs Chinchilla.
Read episode summary → - Invest Like the BestLegendary Trader Paul Tudor Jones on AI Risk, Bubbles and Buffett
PTJ at 50 years in markets. US stock market cap = 252% of GDP (vs 1929 peak 65%, 2000 peak 170%). 2026 contemplated IPO supply ≈ 5-6% of market cap vs 2%/yr buyback retire. AI safety: closed-conference modeler consensus = 'we'll do something about it when 50-100M people die.' Buffett confirmed agreement. Trade idea: long yen — Japan has $4.5T unhedged USD position.
Read episode summary → - No PriorsBaseten CEO Tuhin Srivastava on Custom Models, and Building the Inference Cloud
Baseten 30x in 12 months, >$1B trajectory in 2026. 95% of tokens served are custom (post-trained) models. Mid-90s utilisation across 90 clusters / 18 clouds. GB200 access now = 3-5 year contracts + 20-30% TCV prepay. H100 still appreciating 4.5 years post-launch. Chinese government effectively subsidising US enterprise via open-source. 400% NDR. 'Inference is the last market — even if there's AGI, all that's left is inference.'
Read episode summary → - UncappedKalshi CEO Tarek Mansour on The Case for Prediction Markets | Ep. 48
Tarek Mansour on the 6-year regulatory war. Sued the CFTC and won October 2024 — that ruling is the legal load-bearing wall under every prediction-market product launching in 2026. Structural pitch: prediction markets are NOT gambling because the business model is transaction fees, not customer losses. Fed paper now cites Kalshi-style markets as 'the best gauge we have on the economy.' 120-person no-managerial-layer org.
Read episode summary →