Set a hard cost cap¶
A cost cap is the simplest way to make sure an agent can't blow past your budget. It works at two levels:
- Per-workflow — set on a workflow, halts the run when cumulative cost exceeds the cap.
- Per-call — projected cost from the SDK, rejects any single call that would exceed the cap.
Per-workflow¶
In the dashboard, open the workflow and set the budget. Or via the HTTP API:
curl -X PATCH https://api.nullrun.io/api/v1/orgs/$ORG_ID/workflows/$WORKFLOW_ID \
-H "X-API-Key: $NULLRUN_API_KEY" \
-H "X-Signature: $(compute_hmac)" \
-H "X-Signature-Timestamp: $(date +%s)" \
-H "Content-Type: application/json" \
-d '{"budget_cents": 500}'
Auth uses
X-API-Keyplus an HMAC-SHA256 signature overtimestamp:api_key:body_hash(see the HTTP API reference) and the SDK'sNULLRUN_SECRET_KEY. Bearer session tokens are for dashboard / admin endpoints only.
Then in the SDK:
import nullrun
from nullrun import init, protect
init(api_key="nr_live_...")
with nullrun.workflow("my-workflow"):
@protect
def run(): ...
Cumulative cost > 500¢ → NullRunBlockedException on the next gate call
(see Errors for the full exception hierarchy).
Per-call¶
The SDK does not project per-call cost on its own — the per-call
cap is enforced by the workspace policy on the gateway. When the
policy carries a max_per_call_cents limit, the gate rejects any
single call whose projected cost would exceed the cap before the
model is invoked (see Budgets for the
reservation flow).
See also¶
- Budgets — reservation lifecycle and the
pre-flight
/checkend-to-end - Errors
- Examples → cost cap demo