Start free — no credit card

$50 in free credits, plus $20 for every teammate who joins.

Every new project starts with $50 in free credits, and you get $20 more for each collaborator you bring in. On an open-weight model — where the same work costs a fraction of the OLUs — that's enough for thousands of simple tasks before you pay anything.

Starter

Start free with credits. Unlock more with collaborators.

$0.20 per OLU *

No minimums. Pay as you go.

Your $0.20 per OLU is all-in — it includes LLM token and hosting costs. No separate model or infrastructure bill.

Enterprise

For enterprise-grade security, dedicated infra and embedded support.

Contact Sales

Annual and monthly options available.

Everything in Starter, plus:

Advanced security and fine-grained permissions

Data-access audit trails — see what data was used in which AI thread, by which user

Dedicated VPC or bring your own cloud (BYOC)

Private networking and VPC peering

Single Sign-On (SSO)

Bring your own LLMs

Forward-deployed engineering support

* OLUs consumed per task depend on query and domain complexity. Typical usage: <2 OLUs for simple data tasks, ~10 OLUs for complex reports, and ~40 OLUs for deep investigations.

Model → OLU multiplier

Your model choice changes your mileage.

Frontier models like Claude Opus do the most per token but cost the most OLUs; lean, open-weight models (DeepSeek, Kimi K2, GLM) do the same work for a fraction of the OLUs. Switch to different models for different threads — or even mid-thread.

Model

Type

OLU multiplier (vs Opus 4.6)

Input tokens / OLU

Output tokens / OLU

Cached tokens / OLU

Cache creation tokens / OLU

Notes

Open-weight models · served via Fireworks

DeepSeek V4 Flash (Official API)
DeepSeek (official)
Open-weight

0.018x (57x cheaper)

952,400

595,200

59.5M

1.2M

First-party DeepSeek API pricing.
DeepSeek V4 Flash (Fireworks)
Fireworks
Open-weight

0.024x (41x cheaper)

952,400

595,200

6.0M

1.2M

Served via Fireworks. Low-latency/volume workhorse.
Llama 3.3 70B
Fireworks
Open-weight

0.036x (28x cheaper)

493,800

617,300

6.2M

617,300

Meta; served via Fireworks.
DeepSeek V4 Pro (Official API)
DeepSeek (official)
Open-weight

0.053x (19x cheaper)

306,500

191,600

46.0M

383,100

First-party DeepSeek API pricing.
Qwen 3.7 Plus
Fireworks
Open-weight

0.071x (14x cheaper)

333,300

104,200

4.2M

416,700

Alibaba, vision-capable; served via Fireworks.
Mistral Large 3
Fireworks
Open-weight

0.077x (13x cheaper)

333,300

83,300

4.2M

416,700

Mistral AI; served via Fireworks.
Kimi K2.7
Fireworks
Open-weight

0.17x (5.8x cheaper)

140,400

41,700

1.8M

175,400

Moonshot AI 1T-param MoE; served via Fireworks.
GLM-5.2
Fireworks
Open-weight

0.23x (4.3x cheaper)

95,200

37,900

1.2M

119,000

Z.ai (Zhipu), MIT-licensed; served via Fireworks.
DeepSeek V4 Pro (Fireworks)
Fireworks
Open-weight

0.25x (4x cheaper)

76,600

47,900

1.1M

95,800

Served via Fireworks (what PromptQL uses for open-weight).

Proprietary models

Claude Haiku 4.5
Anthropic
Proprietary

0.2x (5x cheaper)

133,300

33,300

1.7M

133,300

Fast tier.
GPT-5
OpenAI
Proprietary

0.3x (3.4x cheaper)

106,700

16,700

1.3M

133,300

GPT-5 base.
GPT-5.1
OpenAI
Proprietary

0.3x (3.4x cheaper)

106,700

16,700

1.3M

133,300

Same rate as GPT-5.
Gemini 3.1 Pro
Google
Proprietary

0.42x (2.4x cheaper)

66,700

13,900

833,300

83,300

Google flagship Pro.
GPT-5.2
OpenAI
Proprietary

0.42x (2.4x cheaper)

76,200

11,900

952,400

95,200

Flagship step-up.
GPT-5.4
OpenAI
Proprietary

0.52x (1.9x cheaper)

53,300

11,100

666,700

66,700

Reasoning/coding/agentic gains.
Claude Sonnet 4.5
Anthropic
Proprietary

0.6x (1.7x cheaper)

44,400

11,100

555,600

44,400

Balanced tier. (Sonnet 4.6 is blocklisted in PromptQL.)
Claude Opus 4.6
Anthropic
Proprietary

1.0x (baseline)

26,700

6,700

333,300

26,700

Anchor (1.0x).
Claude Opus 4.8
Anthropic
Proprietary

1.0x (baseline)

26,700

6,700

333,300

26,700

Current GA Opus; PromptQL Powerful tier. Same rate as 4.6/4.7.
GPT-5.5
OpenAI
Proprietary

1.0x (baseline)

26,700

5,600

333,300

33,300

Latest flagship. >272K-context billed 2x in / 1.5x out.
Claude Fable 5
Anthropic
Proprietary

2x (2x cost)

13,300

3,300

166,700

13,300

New frontier flagship (Jun 2026); most capable, highest cost.
GPT-5 Pro
OpenAI
Proprietary

7.22x (7.2x cost)

8,900

1,400

11,100

11,100

Pro reasoning. No prompt-cache discount.
GPT-5.2 Pro
OpenAI
Proprietary

10.11x (10.1x cost)

6,300

992

7,900

7,900

Pro reasoning. No prompt-cache discount.
GPT-5.4 Pro
OpenAI
Proprietary

13.53x (13.5x cost)

4,400

926

5,600

5,600

Pro reasoning. No prompt-cache discount.
GPT-5.5 Pro
OpenAI
Proprietary

13.53x (13.5x cost)

4,400

926

5,600

5,600

Top GPT pro tier. No prompt-cache discount.

Anchor: Claude Opus 4.6 = 1.0×. Lower multiplier = cheaper, i.e. more tokens of work per OLU (per $0.20). PromptQL serves open-weight models via Fireworks (DeepSeek is shown for both its first-party Official API and the Fireworks-hosted rate PromptQL bills on). PLG projects can use every model the LLM proxy makes available, and the lineup expands over time — pick your model per thread. The GPT *-pro tiers run higher mainly because they have no prompt-cache discount. Token figures are representative; your exact mileage depends on the task.

Frequently asked questions

Yes. You always pay $0.20 per OLU, but the model you pick changes how many OLUs a given task consumes. Open-weight models like DeepSeek, Kimi K2, and GLM do the same work for a fraction of the OLUs — as little as ~1/10th, and often 10×–40× cheaper than a frontier model like Claude Opus.
See the “Your model choice changes your mileage” table above for a representative model → OLU multiplier comparison. You can switch models per thread, and available models expand over time.
An OLU (Operational Language Unit) is the unit PromptQL uses to measure and bill for AI work. OLUs represent the AI effort required to plan and execute a query or investigation — reflecting both query complexity and business domain complexity. More complex tasks consume more OLUs; simpler tasks consume fewer. You can see OLU consumption on each individual thread, as well as in your usage dashboard, giving you full transparency into how it’s measured and billed.
OLU usage depends on the complexity of your query and business domain. 1 OLU = $0.20.
Typical usage:
  • < 2 OLUs for simple data tasks (<$0.40)
  • ~10 OLUs for complex reports (~$2)
  • ~40 OLUs for deep investigations (~$8)
You can see OLU consumption in real time as a thread runs, including a per-step breakdown, as well as a full summary in your usage dashboard.
When the free credits on your project are used up, you can add a payment method and switch to prepaid billing with optional auto top-up. You control your spending limits and top-up thresholds.
Every new account starts with generous free credits, and no credit card is required to get started.
Credits and billing are managed at the project level:
  • $50 when you create your project.
  • $20 each time you onboard a new collaborator.
Credits only apply to your first created project and each collaborator’s first joined project.
PromptQL is an AI analyst built for teams. Its value compounds with collaboration. We encourage you to collaborate, explore, and evaluate together from day one.
PromptQL's Fair Usage Policy (FUP) covers the underlying infrastructure costs required to run your workloads, beyond the core AI work billed through OLUs. This includes compute, storage, data transfer, etc. The FUP acts as a guardrail to ensure reliable platform performance for all users. If infrastructure usage exceeds typical limits, admins are notified and overage charges may apply. Core PromptQL AI consumption is already priced into OLUs and is not part of the Fair Usage Policy.
Yes. You can configure spending limits and alerts in your billing dashboard. When you reach your limit, queries will pause until you increase the cap or the next billing cycle begins. Enterprise plans offer additional budget controls per team and project.