Now in Beta

One API.
Predictable cost.

Forge routes your AI requests across open-source and commercial models. Drop in, pay flat-rate, stop worrying about token bills.

Drop-in replacement
# Just change the base URL. That's it.
curl https://forge.lanaai.io/v1/chat/completions \
  -H "Authorization: Bearer $FORGE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{"role": "user", "content": "Hello"}]
  }'

How it works

Intelligent routing, invisible complexity.

Send your request

Use the same OpenAI or Anthropic SDK you already have. Just point it at Forge.

We route it

Forge picks the best backend — self-hosted open-source or commercial provider — based on cost and quality.

Predictable bill

Flat monthly rate. No per-token surprises. We absorb the cost variance so you don't have to.

Built for developers

Drop-in compatible.

OpenAI-compatible

/v1/chat/completions, /v1/embeddings, /v1/models

Anthropic-compatible

/v1/messages with full streaming SSE support

Smart model aliases

"auto", "fast", "reasoning" — let Forge pick the best model

Python
# Works with the OpenAI SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://forge.lanaai.io/v1",
    api_key="your-forge-key",
)

response = client.chat.completions.create(
    model="auto",
    messages=[
        {"role": "user",
         "content": "Summarize this contract"}
    ]
)

Simple pricing

Flat rate. No surprises.

Starter

$20
$49

first month, then $49/mo

5M tokens

Popular

Pro

$149

/month

30M tokens

Scale

$499

/month

150M tokens

Enterprise

Custom

 

500M+ tokens

Ready?

Stop overpaying for AI.

Get your API key and start routing requests in under 5 minutes.

Get Started Free