Skip to content
Dashboard

AI Gateway production index

Link to headingAnthropic leads in spend; Google leads in volume

Stacked bar chart of monthly spend share by lab at Oct 2025, Jan 2026, and Apr 2026. Anthropic's pink dominates throughout, OpenAI's teal jumps in April. By Apr 2026, Anthropic 61%, Google 21%, OpenAI 12%, with smaller labs splitting the rest.Stacked bar chart of monthly spend share by lab at Oct 2025, Jan 2026, and Apr 2026. Anthropic's pink dominates throughout, OpenAI's teal jumps in April. By Apr 2026, Anthropic 61%, Google 21%, OpenAI 12%, with smaller labs splitting the rest.
Anthropic leads on spend across the window, with OpenAI's share tripling in April.
Stacked bar chart of token volume share by lab at Oct 2025, Jan 2026, Apr 2026. Anthropic's pink share falls, Google's blue grows. By Apr 2026, Google 38%, Anthropic 26%, OpenAI 13%, xAI 10%, with MiniMax, Moonshot AI, Other splitting the rest.Stacked bar chart of token volume share by lab at Oct 2025, Jan 2026, Apr 2026. Anthropic's pink share falls, Google's blue grows. By Apr 2026, Google 38%, Anthropic 26%, OpenAI 13%, xAI 10%, with MiniMax, Moonshot AI, Other splitting the rest.
Google held a clear lead in token volume in April.

Link to headingSpend follows the cost of being wrong

Paired bars (April 2026) of % tokens / % market cost per use case. Personal Assistants 40.0/19.6. Coding Agents 20.4/21.8. App Generation 11.2/7.0. Education 5.5/6.8. Back Office 15.0/5.8. Sales 3.4/2.7. Recruiting 2.4/0.8. Other 22.4/15.0.Paired bars (April 2026) of % tokens / % market cost per use case. Personal Assistants 40.0/19.6. Coding Agents 20.4/21.8. App Generation 11.2/7.0. Education 5.5/6.8. Back Office 15.0/5.8. Sales 3.4/2.7. Recruiting 2.4/0.8. Other 22.4/15.0.
Volume-heavy workloads run cheap per token, while cost-heavy workloads run expensive.
Paired horizontal bars for April 2026 of % tokens (pink) and % market cost (blue) by B2B classification. B2B 29.7% tokens, 40.7% cost. B2C 62.6% tokens, 43.2% cost. Unknown 7.7% tokens, 16.1% cost.Paired horizontal bars for April 2026 of % tokens (pink) and % market cost (blue) by B2B classification. B2B 29.7% tokens, 40.7% cost. B2C 62.6% tokens, 43.2% cost. Unknown 7.7% tokens, 16.1% cost.
B2C drives volume while B2B drives spend.

Link to headingNo single provider wins across use cases

Stacked bars of market cost share by lab within each use case (April 2026). Back Office 87% Anthropic. Building 55% Anthropic, 6% OpenAI, 31% other. Outreach 36% Anthropic, 28% OpenAI. Consumer 26% Anthropic, 18% OpenAI, 15% Google, 35% other.Stacked bars of market cost share by lab within each use case (April 2026). Back Office 87% Anthropic. Building 55% Anthropic, 6% OpenAI, 31% other. Outreach 36% Anthropic, 28% OpenAI. Consumer 26% Anthropic, 18% OpenAI, 15% Google, 35% other.
Anthropic carries cost share through three of the four categories.
Stacked bars of token share by lab within each use case (April 2026). Back Office 71% Anthropic, 11% Google. Building 33% Anthropic, 20% xAI, 10% MiniMax. Outreach 22% OpenAI, 18% xAI, 17% Anthropic. Consumer 28% Google, 15% OpenAI, 7% Anthropic.Stacked bars of token share by lab within each use case (April 2026). Back Office 71% Anthropic, 11% Google. Building 33% Anthropic, 20% xAI, 10% MiniMax. Outreach 22% OpenAI, 18% xAI, 17% Anthropic. Consumer 28% Google, 15% OpenAI, 7% Anthropic.
Token share spreads more evenly across labs than cost share does.

Link to headingApps are becoming more agentic

Line chart Oct 2025 to Apr 2026, two lines. Pink (tool-call % of tokens) rises from 31.6% to 58.9% with a sharp jump after Jan. Blue (tool-call % of requests) rises from 11.4% to 22.2% more gradually. Gap between the two widens.Line chart Oct 2025 to Apr 2026, two lines. Pink (tool-call % of tokens) rises from 31.6% to 58.9% with a sharp jump after Jan. Blue (tool-call % of requests) rises from 11.4% to 22.2% more gradually. Gap between the two widens.
Tool-using requests carry far more tokens than their share of requests would suggest.

Link to headingLeaderboards rank one model, but production teams use 35+ at scale

Vertical bars of avg distinct models per team (April 2026) by monthly request bucket. <100=0, 100-1K=1, 1K-10K=3, 10K-100K=5, 100K-1M=8, 1M-10M=18, 10M+=35. "Regular use" means a model received 100+ requests from the team in April.Vertical bars of avg distinct models per team (April 2026) by monthly request bucket. <100=0, 100-1K=1, 1K-10K=3, 10K-100K=5, 100K-1M=8, 1M-10M=18, 10M+=35. "Regular use" means a model received 100+ requests from the team in April.
Teams at 10M+ requests average 35 models, up from 18 in the next bucket down.

Link to headingNew models are adopted rapidly

Stacked bars of Claude Sonnet family token share at Oct 2025, Jan 2026, Apr 2026. Versions 3.7 (pink), 4 (dark blue), 4.5 (teal), 4.6 (light blue). Oct splits across 3.7, 4, 4.5. Jan mostly 4.5. By Apr, 4.6 dominates with predecessors at small slivers.Stacked bars of Claude Sonnet family token share at Oct 2025, Jan 2026, Apr 2026. Versions 3.7 (pink), 4 (dark blue), 4.5 (teal), 4.6 (light blue). Oct splits across 3.7, 4, 4.5. Jan mostly 4.5. By Apr, 4.6 dominates with predecessors at small slivers.
Sonnet 4.6 absorbed most of the Sonnet family's traffic within its first full month.
Stacked bars of Claude Opus family token share at Oct 2025, Jan 2026, Apr 2026. Versions 4 (pink), 4.1 (dark blue), 4.5 (teal), 4.6 (light blue), 4.7 (purple). Oct mostly 4.1. Jan mostly 4.5. By Apr, 4.6 dominates with 4.7 near a quarter.Stacked bars of Claude Opus family token share at Oct 2025, Jan 2026, Apr 2026. Versions 4 (pink), 4.1 (dark blue), 4.5 (teal), 4.6 (light blue), 4.7 (purple). Oct mostly 4.1. Jan mostly 4.5. By Apr, 4.6 dominates with 4.7 near a quarter.
Opus 4.7 is taking share from Opus 4.6 on the same curve.

Link to headingProvider outages have a hidden cost

Horizontal bars of AI Gateway fallback rescue share through April 2026, by metric. Of all requests, 3.5% rescued by fallback. Of all tokens, 5.1% rescued. Of all market cost, 4.9% rescued. Remainder succeeded on first try.Horizontal bars of AI Gateway fallback rescue share through April 2026, by metric. Of all requests, 3.5% rescued by fallback. Of all tokens, 5.1% rescued. Of all market cost, 4.9% rescued. Remainder succeeded on first try.
The cost-weighted rescue rate runs higher than the request-weighted rate.

Link to headingConclusion: Build for workload, not the lab

Link to headingAbout this data