SLA & Rate Limits
Paid tiers carry a 99.9% monthly uptime commitment; measured uptime over the trailing 90 days is 99.97%. p95 memory-search latency is under 180 ms globally. Credits are issued automatically when we miss.
Uptime
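To make the 99.9% figure concrete, the sketch below converts an uptime percentage into a downtime budget. It assumes a 30-day month and that every minute counts equally; how downtime is actually measured for credit purposes is not stated here, and `downtime_budget_minutes` is a hypothetical helper name.

```python
def downtime_budget_minutes(uptime_percent, days=30):
    """Minutes of downtime permitted over `days` at the given uptime percentage.

    Assumption: a flat calendar window with no excluded maintenance periods.
    """
    total_minutes = days * 24 * 60
    return total_minutes * (100.0 - uptime_percent) / 100.0
```

Under these assumptions, `downtime_budget_minutes(99.9)` comes to about 43.2 minutes per 30-day month, and `downtime_budget_minutes(99.97, days=90)` to about 38.9 minutes over 90 days.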
Latency
The live latency distribution is published at /status. Latency on self-hosted deployments depends on your hardware; see the benchmarks page for reference numbers.
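If you want to compare your own measurements against the 180 ms p95 figure, one common definition is the nearest-rank percentile, sketched below. The method used on the status page is not stated, so treat this as an illustration only; `nearest_rank_percentile` is a hypothetical helper.

```python
def nearest_rank_percentile(samples_ms, percentile=95):
    """Nearest-rank percentile of latency samples in milliseconds.

    `percentile` is an integer from 1 to 100.
    """
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    # ceil(percentile * n / 100), computed with integers to avoid float rounding.
    rank = (percentile * len(ordered) + 99) // 100
    return ordered[max(rank, 1) - 1]
```

With only a handful of samples the p95 is simply the slowest request, so collect at least a few hundred measurements before drawing conclusions.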
Rate limits by tier
Limits are enforced at the API gateway. A request that exceeds a limit returns 429 Too Many Requests with a Retry-After header. Short bursts of up to 5x the sustained rate are allowed for up to 10 seconds.
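A client can honor the 429 response by waiting for the Retry-After interval before retrying. The sketch below is one way to do that, not an official client. It assumes `send` is a caller-supplied function (hypothetical) returning `(status_code, headers_dict)` with the header stored under the exact key `"Retry-After"`. HTTP allows Retry-After to be either a number of seconds or an HTTP date, so both are handled; which form this API sends is not stated.

```python
import time
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def parse_retry_after(value, now=None):
    """Seconds to wait for a Retry-After value, or None if it cannot be parsed."""
    if value is None:
        return None
    value = value.strip()
    if value.isdigit():  # delta-seconds form, e.g. "30"
        return float(value)
    try:
        when = parsedate_to_datetime(value)  # HTTP-date form
    except (TypeError, ValueError):
        return None
    if when.tzinfo is None:
        when = when.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (when - now).total_seconds())


def call_with_retry(send, max_attempts=5, fallback_delay=1.0, sleep=time.sleep):
    """Call send() until it returns a non-429 status or attempts run out."""
    for attempt in range(max_attempts):
        status, headers = send()
        if status != 429:
            return status, headers
        if attempt == max_attempts - 1:
            break  # out of attempts; hand the 429 back to the caller
        delay = parse_retry_after(headers.get("Retry-After"))
        if delay is None:
            # Assumption: exponential backoff when the header is missing or unparsable.
            delay = fallback_delay * (2 ** attempt)
        sleep(delay)
    return status, headers
```

Passing `sleep` as a parameter keeps the function easy to test and lets callers substitute an async-friendly or jittered wait.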
| Tier | Price | Daily requests | Burst / minute | Concurrent |
|---|---|---|---|---|
| Free | $0 | 100 / day | 20 / min | 2 |
| Pro | $20 / mo | 10,000 / day | 200 / min | 10 |
| Team | $99 / mo | 100,000 / day | 1,000 / min | 50 |
| Enterprise | Custom | Negotiated (no cap) | Dedicated capacity | Dedicated capacity |
Dream Engine cycles, background consolidation jobs, and integration syncs count toward daily requests at discounted rates — full schedule in the API docs.
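To avoid hitting 429 responses in the first place, a client can throttle itself to the per-minute figure for its tier. The sketch below uses a rolling 60-second window on the client side. This is an assumption-laden illustration: the gateway's own algorithm is not described here, the sketch ignores the burst allowance and the daily and concurrency limits, and `SlidingWindowLimiter` is a hypothetical name.

```python
import time
from collections import deque

# Per-minute figures copied from the tier table; Enterprise is negotiated.
PER_MINUTE = {"free": 20, "pro": 200, "team": 1000}


class SlidingWindowLimiter:
    """Allow at most `limit` requests in any rolling window of `window_seconds`."""

    def __init__(self, limit, window_seconds=60.0, clock=time.monotonic):
        self.limit = limit
        self.window = window_seconds
        self.clock = clock
        self.sent = deque()  # timestamps of requests still inside the window

    def try_acquire(self):
        """Return True and record the request if it fits, else False."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            self.sent.append(now)
            return True
        return False
```

A typical use is `limiter = SlidingWindowLimiter(PER_MINUTE["pro"])`, then calling `limiter.try_acquire()` before each request and waiting briefly when it returns False. Because limits apply per account, a single limiter would need to be shared across every process using that account for this to be accurate.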
Fair use policy
Tier limits apply per account, not per API key. The following are considered abuse and may result in throttling, account suspension, or tier re-classification:
If you're building something that stresses these limits for a legitimate reason, email legal@remlabs.ai before you go to production and we'll size you for an Enterprise plan.
Questions?
Enterprise customers get financially-backed uptime and latency SLAs with dedicated capacity, custom metrics, and a signed agreement.