
AI Gateway (Cloudflare AI Gateway)

Cloudflare's proxy in front of LLM providers. Every AI call b/cited makes (embeddings, brief generation, AEO citation runs) routes through it. It adds caching, retries, observability, and a unified billing surface across OpenAI, Anthropic, and Perplexity.

Also known as: Cloudflare AI Gateway, AI proxy

AI Gateway is a Cloudflare service that sits between your application and LLM providers (OpenAI, Anthropic, Perplexity, Google Gemini, others). Your code calls the gateway URL instead of the provider's URL directly; the gateway forwards the request, caches eligible responses, logs every call, and surfaces analytics across providers.

The change is one URL swap. The OpenAI client points at:

https://gateway.ai.cloudflare.com/v1/{account_id}/{gateway_slug}/openai

instead of https://api.openai.com/v1. Everything else stays identical.
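As a minimal sketch of that swap, the helper below builds the gateway base URL from the template above. The function name `gateway_base_url` and the placeholder account ID and gateway slug are hypothetical, chosen only for illustration. The commented-out client line assumes the official `openai` Python package, which accepts a `base_url` argument.

```python
from urllib.parse import quote

GATEWAY_ROOT = "https://gateway.ai.cloudflare.com/v1"


def gateway_base_url(account_id: str, gateway_slug: str, provider: str = "openai") -> str:
    """Build the gateway base URL for one provider (hypothetical helper)."""
    parts = {"account_id": account_id, "gateway_slug": gateway_slug, "provider": provider}
    for name, value in parts.items():
        if not value:
            raise ValueError(f"{name} must be non-empty")
    # Percent-encode each path segment so a stray "/" cannot change the path.
    return "/".join([GATEWAY_ROOT] + [quote(v, safe="") for v in parts.values()])


# Placeholder values; substitute your own account ID and gateway slug.
base_url = gateway_base_url("ACCOUNT_ID", "my-gateway")
print(base_url)

# With the openai package installed, the swap would look like this:
#   from openai import OpenAI
#   client = OpenAI(api_key="...", base_url=base_url)
```

Keeping the URL construction in one place means the rest of the application can stay unaware of whether calls go through the gateway or directly to the provider.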

What the gateway adds:

Why it matters

b/cited makes a lot of provider calls per ingest:

At scale, cached embeddings alone save more than the gateway costs. The cross-provider analytics, visible at dash.cloudflare.com/?to=/ai/aigateway, are how we know the cost-per-AEO-run breakdown.
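Why caching saves money can be illustrated with a toy model. The sketch below is not Cloudflare's implementation: `ToyCachingProxy`, `fake_embedding_provider`, and the model name are all hypothetical, and the cache key (a hash of the request body) is an assumption made for illustration. It only shows that repeated identical requests never reach the billed provider.

```python
import hashlib
import json
from typing import Callable


class ToyCachingProxy:
    """Toy stand-in for a caching proxy; not how AI Gateway is implemented."""

    def __init__(self, provider: Callable[[dict], dict]) -> None:
        self._provider = provider
        self._cache: dict[str, dict] = {}
        self.provider_calls = 0  # requests that reached the (billed) provider
        self.cache_hits = 0      # requests answered from the cache

    def request(self, payload: dict) -> dict:
        # Key on the canonical JSON of the request body, so identical
        # requests map to the same cache entry.
        body = json.dumps(payload, sort_keys=True).encode("utf-8")
        key = hashlib.sha256(body).hexdigest()
        if key in self._cache:
            self.cache_hits += 1
            return self._cache[key]
        self.provider_calls += 1
        response = self._provider(payload)
        self._cache[key] = response
        return response


def fake_embedding_provider(payload: dict) -> dict:
    # Hypothetical provider that returns a deterministic dummy vector.
    text = payload["input"]
    return {"embedding": [float(len(text)), float(sum(map(ord, text)) % 97)]}


proxy = ToyCachingProxy(fake_embedding_provider)
for text in ["pricing page", "pricing page", "docs home", "pricing page"]:
    proxy.request({"model": "example-embedding-model", "input": text})

print(proxy.provider_calls, proxy.cache_hits)  # prints "2 2"
```

Four requests produce two provider calls here because two of the inputs are repeats. The real saving depends on how often an ingest re-embeds unchanged text.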

What b/cited does with it
