One API.
Predictable cost.
Forge routes your AI requests across open-source and commercial models. Drop in, pay flat-rate, stop worrying about token bills.
# Just change the base URL. That's it. curl https://forge.lanaai.io/v1/chat/completions \ -H "Authorization: Bearer $FORGE_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "auto", "messages": [{"role": "user", "content": "Hello"}] }'
How it works
Intelligent routing, invisible complexity.
Send your request
Use the same OpenAI or Anthropic SDK you already have. Just point it at Forge.
We route it
Forge picks the best backend — self-hosted open-source or commercial provider — based on cost and quality.
Predictable bill
Flat monthly rate. No per-token surprises. We absorb the cost variance so you don't have to.
Built for developers
Drop-in compatible.
OpenAI-compatible
/v1/chat/completions, /v1/embeddings, /v1/models
Anthropic-compatible
/v1/messages with full streaming SSE support
Smart model aliases
"auto", "fast", "reasoning" — let Forge pick the best model
# Works with the OpenAI SDK from openai import OpenAI client = OpenAI( base_url="https://forge.lanaai.io/v1", api_key="your-forge-key", ) response = client.chat.completions.create( model="auto", messages=[ {"role": "user", "content": "Summarize this contract"} ] )
Simple pricing
Flat rate. No surprises.
Starter
first month, then $49/mo
5M tokens
Pro
/month
30M tokens
Scale
/month
150M tokens
Enterprise
500M+ tokens
Ready?
Stop overpaying for AI.
Get your API key and start routing requests in under 5 minutes.
Get Started Free