Chapter 11 · The weights debate · Updated June 2026

Who controls the weights controls the future.

The central ideological and commercial debate is over access to weights: should the trained weights of frontier models stay on the lab's own servers, reachable only through an API, or should they be freely downloadable to run on private hardware?

Structural Comparison
| Feature | Closed API (OpenAI, Anthropic, Gemini) | Open Weights (DeepSeek, Llama, Qwen) |
|---|---|---|
| Weights Accessibility | Proprietary. Model weights reside entirely on the lab's secure infrastructure; users lease cognitive access via API queries. | Downloadable weights under open-source (Apache 2.0) or modified commercial licenses (e.g., Llama 4 Community License, DeepSeek MIT). |
| Safeguards & Biosecurity | Centralized server-side moderation, input/output classifiers, and prompt shields. Access can be instantly revoked under ASL-3 triggers. | Downstream safety is the responsibility of the deployer. Once released, weights cannot be recalled and safety filters can be fine-tuned out. |
| Inference Economics | Premium token-based pricing margins. GPT-5.4 priced at $2.50 / $15.00 (input / output) per 1M tokens. Subject to hyperscaler price floors. | Highly economical. DeepSeek V3.2 priced at $0.28 / $0.42 (input / output) per 1M tokens, while Flash APIs operate near-zero. 10-100× cheaper inference. |
| Licensing Restrictions | Subject to lab terms of service. No commercial reuse limitations other than standard API usage agreements. | DeepSeek/Qwen are MIT/Apache 2.0. Llama 4 Community License restricts commercial use above 700M monthly active users. |
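
The price gap in the Inference Economics row can be made concrete with a little arithmetic. The sketch below is illustrative only: it uses the per-1M-token prices quoted above, while the workload size (50M input tokens, 10M output tokens) and the helper name `workload_cost` are assumptions made for the example, not figures from any provider.

```python
# Prices are (input, output) in USD per 1M tokens, as quoted in the table above.
PRICES = {
    "GPT-5.4": (2.50, 15.00),
    "DeepSeek V3.2": (0.28, 0.42),
}

def workload_cost(input_tokens, output_tokens, price_in, price_out):
    """Return the USD cost of a workload given per-1M-token prices."""
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000

# Hypothetical monthly workload (an assumption for illustration).
INPUT_TOKENS = 50_000_000
OUTPUT_TOKENS = 10_000_000

closed_cost = workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, *PRICES["GPT-5.4"])
open_cost = workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, *PRICES["DeepSeek V3.2"])
ratio = closed_cost / open_cost

print(f"Closed API:   ${closed_cost:,.2f}")
print(f"Open weights: ${open_cost:,.2f}")
print(f"Ratio:        {ratio:.1f}x")
```

For this mix the closed API costs about 15 times as much. The ratio depends on the input/output split: on input tokens alone the gap is roughly 9×, on output tokens alone roughly 36×, so output-heavy workloads see the larger savings.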
Model Capabilities (June 2026)
| Category | Closed API | Open Weights |
|---|---|---|
| Reasoning & Coding Flagships | Claude Fable 5 / GPT-5.4: SWE-bench Pro: 69.2% (Opus 4.8) / 57.7% (GPT-5.4) · GPQA: 85-94% | DeepSeek V4 Pro / Qwen3.7-Max: SWE-bench Pro: 55.4% (V4 Pro) / ~54% (Qwen3.7-Max) · GPQA: 92.4% |
| API Pricing (per 1M tokens) | Claude Fable 5: Input: $10.00 / Output: $50.00 | DeepSeek V3.2: Input: $0.28 / Output: $0.42 |
| Maximum Context Windows | Gemini 2.5 Pro: 2,000,000 tokens (lossless needle-in-a-haystack) | Llama 4 Scout: 10,000,000 tokens (Sparse Attention / open weights) |
Corporate Shifts

Meta's Closed Pivot: Muse Spark

Meta AI remains a key distributor of open weights through Llama 4 Scout (109B) and Llama 4 Maverick (400B), but the company has changed course. In April 2026, Meta launched **Muse Spark**, its first closed-weights reasoning model, trained with thought-compression RL.

This shift reflects the pressure of Meta's $65B-$100B capex cycles. To secure a return on that investment, Meta now keeps its frontier reasoning systems behind proprietary APIs while using Llama weights to commoditize hardware and maintain developer mindshare.

Frontier Regulation

EU AI Act & California's SB 1047 Veto

The regulatory landscape has solidified. The **EU AI Act's** obligations for General-Purpose AI (GPAI) models began to apply in August 2025. The GPAI provisions (Articles 53 and 55) mandate technical documentation of training data and processes, including post-training preference data collection (SFT, RLHF, DPO). Models above the systemic-risk threshold (>10^25 FLOPs of training compute, Article 51) face additional evaluation and reporting duties, enforceable from August 2026.

In the United States, California's controversial **SB 1047** (which would have covered models trained with more than 10^26 FLOPs at a cost above $100M, mandating developer-controlled kill-switches and audits) was **vetoed by Governor Newsom on September 29, 2024**. Newsom criticized the bill for not being informed by an empirical analysis of capability trajectories. The veto left California a highly permissive jurisdiction for open weights.
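
To give a sense of scale for the compute thresholds quoted above, here is a minimal sketch that estimates training compute and compares it with both figures. It assumes the common rule of thumb of about 6 FLOPs per parameter per training token for a dense transformer, which is an approximation and not part of either legal text. The model size and token count are hypothetical, and the function names are illustrative.

```python
# Compute thresholds as quoted in the text above.
EU_SYSTEMIC_RISK_FLOPS = 1e25   # EU AI Act systemic-risk threshold
SB1047_COMPUTE_FLOPS = 1e26     # compute threshold in the vetoed SB 1047

def estimate_training_flops(n_params, n_tokens):
    """Rough estimate: ~6 FLOPs per parameter per token (assumed rule of thumb)."""
    return 6.0 * n_params * n_tokens

def thresholds_exceeded(flops):
    """Report which of the two compute thresholds a training run exceeds."""
    return {
        "eu_systemic_risk": flops > EU_SYSTEMIC_RISK_FLOPS,
        "sb1047_compute": flops > SB1047_COMPUTE_FLOPS,
    }

# Hypothetical dense model: 400B parameters trained on 15T tokens.
flops = estimate_training_flops(400e9, 15e12)
result = thresholds_exceeded(flops)

print(f"Estimated training compute: {flops:.2e} FLOPs")
print(result)
```

Under these assumptions the run lands at roughly 3.6 × 10^25 FLOPs, above the EU figure but below the SB 1047 compute figure. For mixture-of-experts models the estimate would use active rather than total parameters, so the result would be lower.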