PromptQL Pricing | Pay only for the AI you use

Start free — no credit card

$50 in free credits, plus $20 for every teammate who joins.

Every new project starts with $50 in free credits, and you get $20 more for each collaborator you bring in. On an open-weight model — where the same work costs a fraction of the OLUs — that's enough for thousands of simple tasks before you pay anything.

Starter

Start free with credits. Unlock more with collaborators.

$0.20 per OLU *

No minimums. Pay as you go.

Your $0.20 per OLU is all-in — it includes LLM token and hosting costs. No separate model or infrastructure bill.

Enterprise

For enterprise-grade security, dedicated infra and embedded support.

Contact Sales

Annual and monthly options available.

Everything in Starter, plus:

✓Advanced security and fine-grained permissions

✓Data-access audit trails — see what data was used in which AI thread, by which user

✓Dedicated VPC or bring your own cloud (BYOC)

✓Private networking and VPC peering

✓Single Sign-On (SSO)

✓Bring your own LLMs

✓Forward-deployed engineering support

* OLUs consumed per task depend on query and domain complexity. Typical usage: <2 OLUs for simple data tasks, ~10 OLUs for complex reports, and ~40 OLUs for deep investigations.

Model → OLU multiplier

Your model choice changes your mileage.

Frontier models like Claude Opus do the most per token but cost the most OLUs; lean, open-weight models (DeepSeek, Kimi K2, GLM) do the same work for a fraction of the OLUs. Switch to different models for different threads — or even mid-thread.

Model	Type	OLU multiplier (vs Opus 4.6)	Input tokens / OLU	Output tokens / OLU	Cached tokens / OLU	Cache creation tokens / OLU	Notes
Open-weight models · served via Fireworks
DeepSeek V4 Flash (Official API) DeepSeek (official)	Open-weight	0.018x (57x cheaper)	952,400	595,200	59.5M	1.2M	First-party DeepSeek API pricing.
DeepSeek V4 Flash (Fireworks) Fireworks	Open-weight	0.024x (41x cheaper)	952,400	595,200	6.0M	1.2M	Served via Fireworks. Low-latency/volume workhorse.
Llama 3.3 70B Fireworks	Open-weight	0.036x (28x cheaper)	493,800	617,300	6.2M	617,300	Meta; served via Fireworks.
DeepSeek V4 Pro (Official API) DeepSeek (official)	Open-weight	0.053x (19x cheaper)	306,500	191,600	46.0M	383,100	First-party DeepSeek API pricing.
Qwen 3.7 Plus Fireworks	Open-weight	0.071x (14x cheaper)	333,300	104,200	4.2M	416,700	Alibaba, vision-capable; served via Fireworks.
Mistral Large 3 Fireworks	Open-weight	0.077x (13x cheaper)	333,300	83,300	4.2M	416,700	Mistral AI; served via Fireworks.
Kimi K2.7 Fireworks	Open-weight	0.17x (5.8x cheaper)	140,400	41,700	1.8M	175,400	Moonshot AI 1T-param MoE; served via Fireworks.
GLM-5.2 Fireworks	Open-weight	0.23x (4.3x cheaper)	95,200	37,900	1.2M	119,000	Z.ai (Zhipu), MIT-licensed; served via Fireworks.
DeepSeek V4 Pro (Fireworks) Fireworks	Open-weight	0.25x (4x cheaper)	76,600	47,900	1.1M	95,800	Served via Fireworks (what PromptQL uses for open-weight).
Proprietary models
Claude Haiku 4.5 Anthropic	Proprietary	0.2x (5x cheaper)	133,300	33,300	1.7M	133,300	Fast tier.
GPT-5 OpenAI	Proprietary	0.3x (3.4x cheaper)	106,700	16,700	1.3M	133,300	GPT-5 base.
GPT-5.1 OpenAI	Proprietary	0.3x (3.4x cheaper)	106,700	16,700	1.3M	133,300	Same rate as GPT-5.
Gemini 3.1 Pro Google	Proprietary	0.42x (2.4x cheaper)	66,700	13,900	833,300	83,300	Google flagship Pro.
GPT-5.2 OpenAI	Proprietary	0.42x (2.4x cheaper)	76,200	11,900	952,400	95,200	Flagship step-up.
GPT-5.4 OpenAI	Proprietary	0.52x (1.9x cheaper)	53,300	11,100	666,700	66,700	Reasoning/coding/agentic gains.
Claude Sonnet 4.5 Anthropic	Proprietary	0.6x (1.7x cheaper)	44,400	11,100	555,600	44,400	Balanced tier. (Sonnet 4.6 is blocklisted in PromptQL.)
Claude Opus 4.6 Anthropic	Proprietary	1.0x (baseline)	26,700	6,700	333,300	26,700	Anchor (1.0x).
Claude Opus 4.8 Anthropic	Proprietary	1.0x (baseline)	26,700	6,700	333,300	26,700	Current GA Opus; PromptQL Powerful tier. Same rate as 4.6/4.7.
GPT-5.5 OpenAI	Proprietary	1.0x (baseline)	26,700	5,600	333,300	33,300	Latest flagship. >272K-context billed 2x in / 1.5x out.
Claude Fable 5 Anthropic	Proprietary	2x (2x cost)	13,300	3,300	166,700	13,300	New frontier flagship (Jun 2026); most capable, highest cost.
GPT-5 Pro OpenAI	Proprietary	7.22x (7.2x cost)	8,900	1,400	11,100	11,100	Pro reasoning. No prompt-cache discount.
GPT-5.2 Pro OpenAI	Proprietary	10.11x (10.1x cost)	6,300	992	7,900	7,900	Pro reasoning. No prompt-cache discount.
GPT-5.4 Pro OpenAI	Proprietary	13.53x (13.5x cost)	4,400	926	5,600	5,600	Pro reasoning. No prompt-cache discount.
GPT-5.5 Pro OpenAI	Proprietary	13.53x (13.5x cost)	4,400	926	5,600	5,600	Top GPT pro tier. No prompt-cache discount.

Anchor: Claude Opus 4.6 = 1.0×. Lower multiplier = cheaper, i.e. more tokens of work per OLU (per $0.20). PromptQL serves open-weight models via Fireworks (DeepSeek is shown for both its first-party Official API and the Fireworks-hosted rate PromptQL bills on). PLG projects can use every model the LLM proxy makes available, and the lineup expands over time — pick your model per thread. The GPT *-pro tiers run higher mainly because they have no prompt-cache discount. Token figures are representative; your exact mileage depends on the task.

Frequently asked questions

Yes. You always pay $0.20 per OLU, but the model you pick changes how many OLUs a given task consumes. Open-weight models like DeepSeek, Kimi K2, and GLM do the same work for a fraction of the OLUs — as little as ~1/10th, and often 10×–40× cheaper than a frontier model like Claude Opus.
See the “Your model choice changes your mileage” table above for a representative model → OLU multiplier comparison. You can switch models per thread, and available models expand over time.

An OLU (Operational Language Unit) is the unit PromptQL uses to measure and bill for AI work. OLUs represent the AI effort required to plan and execute a query or investigation — reflecting both query complexity and business domain complexity. More complex tasks consume more OLUs; simpler tasks consume fewer. You can see OLU consumption on each individual thread, as well as in your usage dashboard, giving you full transparency into how it’s measured and billed.

OLU usage depends on the complexity of your query and business domain. 1 OLU = $0.20.

Typical usage:

< 2 OLUs for simple data tasks (<$0.40)
~10 OLUs for complex reports (~$2)
~40 OLUs for deep investigations (~$8)

You can see OLU consumption in real time as a thread runs, including a per-step breakdown, as well as a full summary in your usage dashboard.

When the free credits on your project are used up, you can add a payment method and switch to prepaid billing with optional auto top-up. You control your spending limits and top-up thresholds.

Every new account starts with generous free credits, and no credit card is required to get started.
Credits and billing are managed at the project level:

$50 when you create your project.
$20 each time you onboard a new collaborator.

Credits only apply to your first created project and each collaborator’s first joined project.
PromptQL is an AI analyst built for teams. Its value compounds with collaboration. We encourage you to collaborate, explore, and evaluate together from day one.

PromptQL's Fair Usage Policy (FUP) covers the underlying infrastructure costs required to run your workloads, beyond the core AI work billed through OLUs. This includes compute, storage, data transfer, etc. The FUP acts as a guardrail to ensure reliable platform performance for all users. If infrastructure usage exceeds typical limits, admins are notified and overage charges may apply. Core PromptQL AI consumption is already priced into OLUs and is not part of the Fair Usage Policy.

Yes. You can configure spending limits and alerts in your billing dashboard. When you reach your limit, queries will pause until you increase the cap or the next billing cycle begins. Enterprise plans offer additional budget controls per team and project.