SLA & Rate Limits
99.9% monthly uptime commitment on paid tiers. 99.9% trailing 90-day actual. p95 memory-search latency under 180ms globally. Credits automatic when we miss.
Uptime
Latency
Live latency distribution at /status. Self-hosted deployments depend on your hardware — see the benchmarks page for reference numbers.
Rate limits by tier
Limits are enforced at the API gateway. Exceeding a limit returns 429 Too Many Requests with a Retry-After header. Short bursts (up to 5x sustained rate for 10 seconds) are allowed.
| Tier | Price | Uptime SLA | Recalls / month | Recalls / day (avg) | Burst / minute | Concurrent |
|---|---|---|---|---|---|---|
| Free | $0 | Best-effort, no SLA | 1,000 / mo | ≈33 / day | 20 / min | 2 |
| Solo | $25 / mo | 99.9% | 100,000 / mo | ≈3,300 / day | 200 / min | 10 |
| Team | $99 / mo | 99.9% | 1,000,000 / mo | ≈33,300 / day | 1,000 / min | 50 |
| Enterprise | Custom | 99.9% (custom available) | Custom volume | Custom | Dedicated capacity | Dedicated capacity |
Concurrent = parallel HTTP connections. Rate limits are token-bucket; bursts up to 2× sustained limit allowed. Enterprise also includes a BAA on request and dedicated support. Dream Engine cycles, background consolidation jobs, and integration syncs count toward monthly recalls at discounted rates — full schedule in the API docs. Authoritative tier numbers reconcile with /pricing-developer; tier data last reviewed 2026-04-26.
Fair use policy
Tier limits apply per account, not per API key. The following are considered abuse and may result in throttling, account suspension, or tier re-classification:
If you're building something that stresses these limits for a legitimate reason, email dev@remlabs.ai before you hit production and we'll size you on enterprise.
Questions?
Enterprise customers get financially-backed uptime and latency SLAs with dedicated capacity, custom metrics, and a signed agreement.