One API.
All Models. Smart Routing.
China's top LLMs unified under a single endpoint. Write once, access all.
No more juggling API keys. No more rewriting integration code. FlintAPI aggregates DeepSeek, Qwen, Kimi, GLM, and MiniMax, and routes your requests to the optimal model automatically.
Why FlintAPI
Aggregate today. Route intelligently tomorrow.
Phase 1: Model Aggregation
All major Chinese LLMs in one API endpoint: DeepSeek V4, Qwen3.5, Kimi K2, GLM-5, and MiniMax M2. Switch models by changing one parameter. No library changes, no code rewrites.
- OpenAI-compatible: drop-in replacement
- Competitive pricing: aggregated volume = lower cost
- No vendor lock-in: switch models instantly
- Unified billing: one account, all models
Smart Routing
Call /v1/smart/chat/completions or set model="flint-smart" for auto-routing, or pick any model directly. FlintAPI analyzes your task (coding, translation, reasoning, creative writing) and automatically dispatches it to the optimal LLM. Better results, lower cost.
- Intent-aware routing: match task to model strength
- Cost-optimized: use cheap models for simple tasks
- Latency-aware: fall back to faster models under load
- Quality-optimized: route hard problems to the strongest model
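The two ways of invoking smart routing described above can be sketched as a request builder. This is a minimal sketch, not FlintAPI's documented client: the base URL is the one shown in the integration example on this page, while the `Bearer` authorization header, the helper name `build_chat_request`, and sending a `model` field to the smart endpoint are assumptions based on the usual OpenAI-compatible convention. The function only builds the request; it does not send it.

```python
import json
import urllib.request

# Base URL as shown in the integration example on this page.
BASE_URL = "https://www.flintapi.ai/v1"


def build_chat_request(prompt, api_key, model="flint-smart", use_smart_endpoint=False):
    """Build (but do not send) a chat completion request.

    Hypothetical helper. Assumes OpenAI-style Bearer auth and that the
    smart endpoint accepts the same JSON body as /chat/completions.
    """
    path = "/smart/chat/completions" if use_smart_endpoint else "/chat/completions"
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


# Option 1: regular endpoint with model="flint-smart"
by_model = build_chat_request("Summarize this log file", "tk-your-api-key")

# Option 2: dedicated smart-routing endpoint
by_endpoint = build_chat_request(
    "Summarize this log file", "tk-your-api-key", use_smart_endpoint=True
)
```

Passing either request to `urllib.request.urlopen` would send it. Which model the router then selects is decided server-side and cannot be inferred from the request.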
All Major Chinese LLMs. One API.
No fragmentation. No multi-vendor complexity.
30 Seconds to Integrate
Fully OpenAI SDK compatible. Python, JavaScript, Go, cURL: pick your weapon.
```python
from openai import OpenAI

# One base_url. All models.
client = OpenAI(
    base_url="https://www.flintapi.ai/v1",
    api_key="tk-your-api-key"
)

# DeepSeek for code
client.chat.completions.create(
    model="deepseek-v4-pro",
    messages=[{"role": "user", "content": "Write a Rust HTTP server"}]
)

# Qwen for translation: same client, different model param
client.chat.completions.create(
    model="qwen3.5-plus",
    messages=[{"role": "user", "content": "Translate to French: Hello World"}]
)
```
Change the model parameter to switch between any supported LLM. No SDK changes needed.
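Because only the model parameter changes, the same prompt can be sent to several models through one client. The sketch below assumes an OpenAI-compatible client object (for example the `OpenAI` client from the official SDK) whose responses expose `choices[0].message.content`; the helper names `ask` and `compare_models` are hypothetical and not part of any SDK.

```python
def ask(client, model, prompt):
    """Send one user prompt to the given model and return the reply text."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content


def compare_models(client, models, prompt):
    """Run the same prompt against each model; returns {model_id: reply}."""
    return {model: ask(client, model, prompt) for model in models}
```

With a configured client, `compare_models(client, ["deepseek-v4-pro", "qwen3.5-plus"], "Explain TCP slow start")` would return one reply per model ID, which makes side-by-side quality checks straightforward before committing to a default.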
Simple, Transparent Pricing
Pay-as-you-go. No monthly minimums. Free credits on signup.
| Model | Context | Input /M tok | Output /M tok |
|---|---|---|---|
| DeepSeek V4 Pro | 128K | - | - |
| DeepSeek V4 Flash | 128K | - | - |
| Qwen3.5-Plus | 128K | - | - |
| Kimi K2.5 | 128K | - | - |
| GLM-5.1 | 32K | - | - |
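Per-million-token pricing translates to a request cost as follows. This is a sketch of the arithmetic only: the table above lists no prices, so the values in the usage line are placeholders chosen for illustration, not FlintAPI rates, and the function name `estimate_cost` is hypothetical.

```python
def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Cost of one request given prices quoted per million tokens."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost


# Placeholder prices (NOT real rates): 1.0 per M input tokens, 2.0 per M output tokens.
example_cost = estimate_cost(
    input_tokens=2_000_000,
    output_tokens=500_000,
    input_price_per_m=1.0,
    output_price_per_m=2.0,
)
```

Input and output tokens are billed separately, so output-heavy workloads such as long-form generation are more sensitive to the output rate than to the input rate.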
Ready to unify your AI stack?
One API key. Five model families. Zero lock-in. Start building in 30 seconds.