All modelsOpen in Console
Get started free
Kimi K2.5
moonshotaimoonshotai/kimi-k2.5
Moonshot Kimi K2.5 long-context model. Admin-gated by default.
Context
200K
Max output
66K
Tools
STANDARD
Reasoning
Supported
Pricing
| Lane | Per 1M tokens |
|---|---|
| Input | $0.60 |
| Output | $3.00 |
| Cache read | $0.10 |
| Cache write (5m) | $0.75 |
| Cache write (1h) | $1.20 |
Billed per token. No minimums, no per-request fees. Caching applies on supported providers; misses fall back to standard input/output rates.
Reasoning effort
offon · default
Routed providers
- openrouter
Requests are automatically routed to the highest-priority healthy upstream. Failures fall back to the next route transparently.
Quickstart
from openai import OpenAI
client = OpenAI(
base_url="https://api.vecbase.com/v1",
api_key="sk-vbc-...",
)
response = client.chat.completions.create(
model="moonshotai/kimi-k2.5",
messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)Ready to ship?
Create a project, mint an API key, and call this model from any OpenAI-compatible client.