AI Routing Engine

req_8h72k → GPT-4o 12ms

req_8h73k → claude-3.5 18ms

req_8h74k → gemini-2.0 8ms

Uptime

99.4%

Live Metrics

4,218 req / min

31ms avg latency

0.04% error rate

Budget Controls

Engineering $840 / $1k

Product $320 / $500

Research $210 / $500

3 active policies · 0 violations

50+ AI Providers

OpenAI Anthropic Gemini Mistral DeepSeek Bedrock Azure AI Cohere Llama Grok Perplexity + more

api.explane.ai/v1

Simple Pricing

Starter

$0/month

✓ 1M requests / month

✓ 10 providers

✓ Basic analytics

✓ Community support

Quick Start

// npm i @explane/sdk
import { Explane } from '@explane/sdk'

const ai = new Explane({
  apiKey: "ex_..."
})

await ai.route({
  model: "auto",
  messages
})

Master your AI traffic,
cost & compliance.

Explane sits between your application and every AI provider — intelligent routing, complete observability, and cost governance in one unified control plane.

Book Demo Take a tour

explane — live request log

847 req/min

ms endpoint provider selected latency cost status

0POST/v1/chat/completionsclaude-3-5-sonnet11ms$0.0024200

1POST/v1/embeddingstext-embedding-3-small4ms$0.0001200

2POST/v1/chat/completionsgemini-1.5-flash [budget]9ms$0.0004routed

4POST/v1/chat/completionsgpt-4o-mini [fallback]14ms$0.0006retry

5POST/v1/chat/completionsclaude-3-haiku6ms$0.0003200

6POST/v1/images/generationsdall-e-3320ms$0.0400200

7POST/v1/chat/completionsmistral-large-latest18ms$0.0018200

Integrations

OpenAI

Anthropic

Gemini

DeepSeek

Mistral

Cohere

Bedrock

Azure AI

Llama

Perplexity

Reasoning

Vision AI

Voice AI

Agents

Realtime

Streaming

Embeddings

Fine-Tuning

RAG

Tool Calling

Function Calling

Structured Output

Computer Use

Batch API

Image Generation

Multimodal

Web Search

Explane AI Gateway

api.explane.ai/v1

One stable integration layer for every AI provider, model, and capability.

Always Stable

How Explane Works

One Request.
Hundreds of Decisions.

Every AI request is evaluated across cost, latency, quality, availability, and routing policies before the optimal model is selected — in under 2ms.

Incoming Request

Customer Support Request

Received · just now

Cost Analysis

$0.00048/1k tokens

✓

Latency Analysis

p99: 178ms avg

✓

Quality Analysis

94.2 / 100 score

✓

Decision
Engine

Explane

Availability Check

99.99% uptime

✓

Routing Policy

cost-optimize · matched

✓

Fallback Policy

gpt-4o-mini · ready

✓

Evaluating Request...

Cost Analysis

Latency Analysis

Quality Analysis

Availability Check

Routing Policy

Fallback Policy

✓

Optimal Model Selected

Claude 3.5 Sonnet

claude-3-5-sonnet-20241022

Why this model

Highest conversational accuracy for support
Meets latency target under 200ms p99
Quality score exceeds your threshold
Available across all required regions

Observability

See every AI request, end-to-end

Full distributed traces, token-level breakdowns, and latency percentiles for every request across every provider — with zero instrumentation required.

Per-request traces show prompt tokens, completion tokens, and provider latency
Real-time error rate and P95/P99 dashboards by model, team, and endpoint
Export to Datadog, Grafana, or any OpenTelemetry-compatible backend
Full audit log with immutable request history for compliance teams

Request trace · req_01jx9k2m...

Total latency

0ms

Tokens

Cost

$0.000

Span breakdown

0ms

explane-router · 1ms

1ms

anthropic-api · 139ms

2ms

token-count · 2ms

4ms

audit-log · 0ms

Cost Governance

Keep AI costs predictable and controlled

AI costs can spiral without structure. Explane gives every team a budget, every model a cap, and alerts before spending surprises anyone.

Set monthly budgets per team, application, or environment
Automatically route to cheaper models when a budget approaches its limit
Cost alerts via Slack, email, or webhook at thresholds you define
Detailed attribution — know exactly which feature or team drove spend

Cost dashboard · June 2025

Month to date

−0%

vs last month

Spend by provider

Anthropic

74%

$1,367

OpenAI

18%

$332

Google Gemini

$148

Monthly budget$3,000

61.6% of $3,000 budget used this month

Enterprise-ready

Security and compliance built in

Explane is designed for production from day one — with the security controls and audit capabilities enterprise teams require.

Compliance

SOC 2 Type II

Certified annually. Security, availability, and confidentiality controls independently audited and verified.

Security

Zero data retention

Request and response payloads are never stored. Stateless traffic with encrypted credential vaulting at rest.

Access

Role-based access control

Granular permissions per team, workspace, and API key. SSO via SAML 2.0 and OIDC out of the box.

Audit

Immutable audit logs

Every request, routing decision, and config change — logged and tamper-evident for compliance review.

Deployment

Private deployment

Deploy Explane inside your own VPC or on-premises for complete network isolation. Available on Enterprise plans.

Reliability

99.99% SLA

Backed by a contractual uptime guarantee. Multi-region active-active with automatic failover across providers.

Master your AI traffic,
cost & compliance.

Master your AI traffic,
cost & compliance.

Master your AI traffic,
cost & compliance.

Everything needed to operate
AI in production

One Request.
Hundreds of Decisions.

See every AI request, end-to-end

Keep AI costs predictable and controlled

Security and compliance built in

Route. Monitor. Govern.
Start today.

Master your AI traffic,cost & compliance.

Master your AI traffic,cost & compliance.

Master your AI traffic,cost & compliance.

Everything needed to operateAI in production

One Request.Hundreds of Decisions.

See every AI request, end-to-end

Keep AI costs predictable and controlled

Security and compliance built in

Route. Monitor. Govern.Start today.

Master your AI traffic,
cost & compliance.

Master your AI traffic,
cost & compliance.

Master your AI traffic,
cost & compliance.

Everything needed to operate
AI in production

One Request.
Hundreds of Decisions.

Route. Monitor. Govern.
Start today.