Models & pricing
The Clipia AI Gateway model catalog (GET /v1/models) and prices in rubles per 1M tokens — Claude, GPT, Gemini, DeepSeek, Grok.
Clipia AI Gateway provides access to flagship language models under one contract. GET /v1/models returns the catalog in an OpenAI-compatible format; the prices below are in rubles per 1M tokens, billed separately for input (prompt) and output (the model's response).
Catalog (GET /v1/models)
/v1/modelsmodels = client.models.list()
for m in models.data:
print(m.id)const models = await client.models.list();
for (const m of models.data) console.log(m.id);curl https://api.clipia.ai/v1/models \
-H "Authorization: Bearer $CLIPIA_API_KEY"The response is a data array in OpenAI-compatible form. Each item carries an id (the string for the model field), context length, supported parameters and per-token rates:
{
"object": "list",
"data": [
{
"id": "claude-opus-4-8",
"object": "model",
"name": "Claude Opus 4.8",
"created": 1782200000,
"owned_by": "clipia",
"context_length": 200000,
"architecture": {
"modality": "text->text",
"input_modalities": ["text", "image"],
"output_modalities": ["text"],
"tokenizer": "Claude"
},
"pricing": { "prompt": "0.0004", "completion": "0.002" },
"supported_parameters": [
"tools", "tool_choice", "response_format",
"temperature", "top_p", "max_tokens", "stop", "seed"
]
}
]
}In pricing, values are per single token (prompt / response); the human-readable per-1M-token prices are in the table below. Clients use supported_parameters to detect which features a model supports (tools, response_format, etc.).
Pricing
Billing is per token, separately for input and output. Charges are taken in credits from the account balance; the prices below are shown in rubles for clarity (1M = 1,000,000 tokens).
| Model | model | Input, ₽/1M | Output, ₽/1M |
|---|---|---|---|
| Claude Opus 4.8 | claude-opus-4-8 | 400 | 2,000 |
| GPT-5.5 | gpt-5.5 | 400 | 2,400 |
| Claude Sonnet 4.6 | claude-sonnet-4-6 | 240 | 1,200 |
| Gemini 3.1 Pro | gemini-3.1-pro | 160 | 960 |
| Grok 4.3 | grok-4.3 | 130 | 260 |
| DeepSeek V4 Pro | deepseek-v4-pro | 45.3 | 90.6 |
| Gemini 2.5 Flash | gemini-2.5-flash | 31.2 | 260 |
| DeepSeek V4 Flash | deepseek-v4-flash | 9.3 | 18.6 |
| GPT-5 nano | gpt-5.4-nano | 5.1 | 41.7 |
Billed in credits
Billing is in credits: every response carries usage.cost (the request cost in credits), and the exact cost of an already-completed request is returned by GET /v1/generation. The ruble prices are the public retail rate; top up your balance in the clipia.ai account area.
Verify against the live catalog
The model list and rates are updated periodically. Always take identifiers (model) and current prices from the live GET /v1/models — that is the source of truth. The values in the table are accurate as of publication.
How to choose a model
- Flagships — hard tasks.
claude-opus-4-8andgpt-5.5for reasoning, code, agents and long context, where quality matters more than price. - Quality/price balance.
claude-sonnet-4-6,gemini-3.1-pro,grok-4.3— solid all-rounders for most product scenarios. - Volume and low cost.
deepseek-v4-pro,gemini-2.5-flash,deepseek-v4-flash,gpt-5.4-nano— for classification, data extraction, bulk processing and draft answers where a low per-token cost matters.
Start with a mid-tier model, measure quality on your own data, and move up to a flagship or down to a cheaper option as needed — you only change the model string.
Chat Completions
POST /v1/chat/completions — the OpenAI-compatible chat endpoint of Clipia AI Gateway. Non-streaming and streaming responses, parameters, tool calling and structured outputs.
LangChain
Connect Clipia AI Gateway to LangChain via ChatOpenAI with the Clipia base_url — chat, tools and tool agents on Claude, GPT and Gemini.