Transparent pricing

Pricing

Per-request gateway pricing. No per-seat tax. No markup on model tokens. You pay for routing — each model provider invoices you directly under your existing agreement.

Starter

$299 /mo

$249/mo billed annually

500,000 requests / month

Up to 3 endpoint types
3 routing policies
7-day audit log retention
OpenAI-compatible API
Community support
14-day free trial

Start Free Trial

Most Popular

Team

$899 /mo

$749/mo billed annually

5,000,000 requests / month

Unlimited endpoint types
Unlimited routing policies
90-day audit log retention
Tenant isolation
Data class enforcement
Slack support
Private GPU support (up to 4 nodes)

Get Early Access

Enterprise

Custom

—

Unlimited requests

Everything in Team
Unlimited private GPU nodes
PrivateLink + VPC peering
1-year audit log retention
SAML SSO
Custom SLA
Dedicated onboarding engineer

Talk to the Team

All prices USD. Token costs not included — invoiced directly by your model providers.

Starter and Team tiers are self-serve and include monthly request quotas; Enterprise is contact-only and includes private GPU options. We do not charge per-token markup — only per-request routing.

FAQ

Common questions

How is a "request" counted?

Are model token costs included?

Can I bring my own Llama deployment?

Is there a free tier?

Stop managing three auth flows and three billing lines.

Start Free Trial Talk to the Team

Pricing

Pricing tiers

Common questions

Stop managing three auth flows and three billing lines.