One API.
Predictable cost.

Forge routes your AI requests across open-source and commercial models. Drop it in, pay a flat rate, and stop worrying about token bills.

Get Started | View Pricing

Drop-in replacement

# Just change the base URL. That's it.
curl https://forge-api.lanaai.io/v1/chat/completions \
  -H "Authorization: Bearer $FORGE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{"role": "user", "content": "Hello"}]
  }'

How it works

Intelligent routing, invisible complexity.

Send your request

Use the same OpenAI or Anthropic SDK you already have. Just point it at Forge.

We route it

Forge picks the best backend — self-hosted open-source or commercial provider — based on cost and quality.

Predictable bill

Flat monthly rate. No per-token surprises. We absorb the cost variance so you don't have to.

Data sovereignty

Your data stays yours.

Forge offers sovereign mode, a per-request or per-organization setting that locks all inference to self-hosted models on LANA-controlled infrastructure. With sovereign mode on, no prompt data is sent to third-party providers.

Sovereign inference

All inference on self-hosted models. No data sent to third parties. Available on Pro and above.

Zero-retention mode

No request content stored. Only billing counters retained. Nothing to subpoena, nothing to breach.

Audit log egress

Forward compliance events to your own webhook in real time. Your storage, your record.

Compliance audit trail

Every request logged with routing decision, provider, and sovereign enforcement status.
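As a rough illustration of the audit log egress idea, here is a minimal sketch of a webhook receiver that appends each incoming event to a local JSON Lines file. It assumes events arrive as JSON in an HTTP POST body. The file name, the port, and the `provider` and `sovereign` field names used when trying it out are hypothetical. Any signature or authentication check Forge may require is not shown.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from pathlib import Path

LOG_PATH = Path("forge_audit.jsonl")  # hypothetical local store, one event per line


def append_event(raw: bytes, log_path: Path = LOG_PATH) -> dict:
    """Parse one JSON event and append it to a JSON Lines file."""
    event = json.loads(raw)  # raises ValueError on malformed input
    with log_path.open("a", encoding="utf-8") as f:
        f.write(json.dumps(event, sort_keys=True) + "\n")
    return event


class AuditHandler(BaseHTTPRequestHandler):
    """Accepts POSTed events; the payload shape is an assumption."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        try:
            append_event(self.rfile.read(length))
        except ValueError:
            self.send_response(400)  # body was not valid JSON
        else:
            self.send_response(204)  # stored, nothing to return
        self.end_headers()


def serve(port: int = 8080) -> None:
    """Run the receiver until interrupted."""
    HTTPServer(("", port), AuditHandler).serve_forever()
```

Calling `serve()` starts the receiver. A production version would also want TLS and request verification in front of it.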

Sovereign mode

# Keep all data on LANA infrastructure
curl https://forge-api.lanaai.io/v1/chat/completions \
  -H "Authorization: Bearer $FORGE_API_KEY" \
  -H "X-Sovereign: true" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{
      "role": "user",
      "content": "Summarize this contract"
    }]
  }'

# Response guaranteed from self-hosted model.
# No data sent to OpenAI, Anthropic, or any
# third-party provider.
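
The same per-request switch can be sketched in Python with only the standard library. The URL, the `X-Sovereign` header, and the request body come from the curl example above. The helper names `build_chat_request` and `send` are illustrative, and the OpenAI-style response shape read in `send` is an assumption.

```python
import json
import urllib.request

FORGE_URL = "https://forge-api.lanaai.io/v1/chat/completions"


def build_chat_request(prompt: str, api_key: str,
                       sovereign: bool = False) -> urllib.request.Request:
    """Build (but do not send) a chat completion request for Forge."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    if sovereign:
        # Same header as the curl example: keep this request on self-hosted models.
        headers["X-Sovereign"] = "true"
    body = json.dumps({
        "model": "auto",
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(FORGE_URL, data=body, headers=headers, method="POST")


def send(req: urllib.request.Request) -> str:
    """Send the request and return the reply text (assumes an OpenAI-style response)."""
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

For example, `send(build_chat_request("Summarize this contract", api_key, sovereign=True))` mirrors the curl call. Leaving `sovereign` at its default omits the header.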

Learn more about data sovereignty →

Built for developers

Drop-in compatible.

OpenAI-compatible

/v1/chat/completions, /v1/embeddings, /v1/models

Anthropic-compatible

/v1/messages with full streaming SSE support

Smart model aliases

"auto", "fast", "reasoning" — let Forge pick the best model

Python

# Works with the OpenAI SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://forge-api.lanaai.io/v1",
    api_key="your-forge-key",
)

response = client.chat.completions.create(
    model="auto",
    messages=[
        {"role": "user",
         "content": "Summarize this contract"}
    ]
)
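
For the Anthropic-compatible /v1/messages endpoint with streaming, a client has to pull text out of the SSE event stream. The sketch below assumes Forge emits the same `content_block_delta` / `text_delta` event shape as Anthropic's Messages API, which this page does not confirm. It also simplifies SSE by treating each `data:` line as one complete JSON payload.

```python
import json
from typing import Iterable, Iterator


def iter_text_deltas(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from an Anthropic-style SSE stream, line by line."""
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip "event:" lines, comments, and blank separators
        payload = line[len("data:"):].strip()
        if not payload:
            continue
        event = json.loads(payload)
        if event.get("type") == "content_block_delta":
            delta = event.get("delta", {})
            if delta.get("type") == "text_delta":
                yield delta.get("text", "")


# Hand-written sample stream, not captured from Forge.
sample = [
    "event: content_block_delta",
    'data: {"type": "content_block_delta", "index": 0, "delta": {"type": "text_delta", "text": "Hel"}}',
    "",
    "event: content_block_delta",
    'data: {"type": "content_block_delta", "index": 0, "delta": {"type": "text_delta", "text": "lo"}}',
    "",
    "event: message_stop",
    'data: {"type": "message_stop"}',
]
print("".join(iter_text_deltas(sample)))  # prints "Hello"
```

In practice the official SDKs handle this parsing. The sketch is only meant to show what travels over the wire.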

Read the docs →

Simple pricing

Flat rate. No surprises.

Starter

$20 first month, then $49/mo

5M tokens

Pro (Popular)

$149/month

30M tokens

Scale

$499/month

150M tokens

Enterprise

Custom

500M+ tokens
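
To compare the plans on a common footing, the listed prices can be turned into an effective cost per million tokens. This short calculation assumes each token allotment is monthly and fully used, and it takes Starter at its regular $49 price rather than the first-month rate. Enterprise is left out because its price is custom.

```python
# Listed plan prices (USD per month) and token allotments from the pricing above.
PLANS = {
    "Starter": (49, 5_000_000),    # regular price after the $20 first month
    "Pro": (149, 30_000_000),
    "Scale": (499, 150_000_000),
}


def usd_per_million(price_usd: float, tokens: int) -> float:
    """Effective price per one million tokens if the whole allotment is used."""
    return price_usd / (tokens / 1_000_000)


for name, (price, tokens) in PLANS.items():
    print(f"{name}: ${usd_per_million(price, tokens):.2f} per 1M tokens")
# Starter: $9.80 per 1M tokens
# Pro: $4.97 per 1M tokens
# Scale: $3.33 per 1M tokens
```

Using less than the full allotment raises the effective rate, since the monthly price stays the same.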

Compare plans in detail →

Ready?

Stop overpaying for AI.

Get your API key and start routing requests in under 5 minutes.

Get Started Free
