Kimi K2.5

moonshotai

moonshotai/kimi-k2.5

Open in Console

Moonshot Kimi K2.5 long-context model. Admin-gated by default.

Context

200K

Max output

66K

Tools

STANDARD

Reasoning

Supported

Pricing

Lane	Per 1M tokens
Input	$0.60
Output	$3.00
Cache read	$0.10
Cache write (5m)	$0.75
Cache write (1h)	$1.20

Billed per token. No minimums, no per-request fees. Caching applies on supported providers; misses fall back to standard input/output rates.

Reasoning effort

offon · default

Routed providers

openrouter

Requests are automatically routed to the highest-priority healthy upstream. Failures fall back to the next route transparently.

Quickstart

from openai import OpenAI

client = OpenAI(
    base_url="https://api.vecbase.com/v1",
    api_key="sk-vbc-...",
)

response = client.chat.completions.create(
    model="moonshotai/kimi-k2.5",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)

Ready to ship?

Create a project, mint an API key, and call this model from any OpenAI-compatible client.

Get started free