Simple, transparent pricing
Pay only for what you use, scale without limits.
Pricing
Simple, transparent pricing
Pay only for what you use. No hidden fees.
MonthlyAnnualSave 20%
Most Popular
Per-model pricing
Prices per 1 million tokens. Input and output priced separately.
| Model | Input | Output | Context |
|---|---|---|---|
| Llama 3.3 70B | $0.20 | $0.60 | 128K |
| Llama 3.3 8B | $0.05 | $0.10 | 128K |
| Qwen 3 32B | $0.10 | $0.30 | 128K |
| Qwen 3 8B | $0.04 | $0.08 | 128K |
| Mistral Large 2 | $0.30 | $0.90 | 128K |
| Mistral 7B | $0.04 | $0.08 | 32K |
| DeepSeek V3 | $0.15 | $0.45 | 128K |
RAG storage & limits
| Resource | Developer | Enterprise |
|---|---|---|
| Vector storage | 10 GB / KB | Unlimited |
| Document storage | 50 GB | Unlimited |
| Knowledge bases | 10 | Unlimited |
Embedding models
| Model | Price |
|---|---|
| BGE Large EN v1.5 | $0.01 / 1M tokens |
| E5 Large v2 | $0.01 / 1M tokens |
| Cohere Embed v3 | $0.10 / 1M tokens |
Reranking models
| Model | Price |
|---|---|
| BGE Reranker Large | $0.02 / 1K queries |
| Cohere Rerank v3 | $0.10 / 1K queries |
