Replenit - Turn Every Customer Into Loyal Repeat Buyer
docs

Rate Limits

By Marta Szymanska
February 3, 2026

Rate Limits

Rate limiting rules, headers, and operational guidance for the Replenit Ingestion API.

Overview

Replenit applies rate limits to protect platform stability and ensure fair usage across tenants.

Rate limits are enforced per API key, across all tenants the key has access to.

All ingestion endpoints are subject to the same rate-limiting rules.

 

Default Rate Limits

Limit Type Value
Requests per minute 100
Requests per hour 5,000

Limits apply to:

  • Customers API
  • Orders API
  • Products API
  • DELETE endpoints

 

How Rate Limits Are Calculated

  • Each HTTP request counts toward the rate limit
  • Batch size does not affect request count
  • Requests across different tenants using the same API key share the same quota

 

Rate Limit Headers

Responses may include the following headers:

X-RateLimit-Limit: 100
X-RateLimit-Remaining: 95
X-RateLimit-Reset: 1700000000
Retry-After: 30

 

Header definitions:

Header Description
X-RateLimit-Limit Maximum requests allowed in the window
X-RateLimit-Remaining Requests remaining in the current window
X-RateLimit-Reset UNIX timestamp when the window resets
Retry-After Seconds to wait before retrying

 

429 Too Many Requests

When the rate limit is exceeded, the API returns HTTP 429.

Example response:

{
  “success”: false,
  “message”: “Rate limit exceeded.”,
  “data”: {}
}

 

Retry Strategy

Only retry requests that fail with:

  • 429 Too Many Requests
  • 500 Internal Server Error

Recommended approach:

  • Apply exponential backoff
  • Respect the Retry-After header when present
  • Add jitter to avoid synchronized retries

Example backoff sequence (seconds):

1 → 2 → 4 → 8 → 16

 

Batch Usage Recommendations

Batching reduces request volume and rate-limit pressure.

Recommended batch sizes:

Entity Batch Size
Customers 100–500
Orders 50–200
Products 50–100

All endpoints expect arrays, even for single records.

 

Multi-Tenant Considerations

When an API key is authorized across multiple tenants:

  • Rate limits are shared across all tenants
  • High-volume ingestion for one tenant affects available capacity for others

For parallel multi-tenant ingestion:

  • Use batching aggressively
  • Serialize requests where possible
  • Monitor remaining quota via headers

 

Handling Sustained High Volume

For sustained ingestion workloads:

  • Prefer fewer, larger batches
  • Avoid per-record requests
  • Spread ingestion over time
  • Monitor 429 responses and backoff behavior

If higher limits are required, contact your Replenit Customer Success Manager (CSM).

 

Common Rate-Limit Pitfalls

  • Sending single-record requests in loops
  • Retrying 400-series errors
  • Ignoring Retry-After
  • Running parallel ingestion jobs with the same API key

 

Observability

Rate-limit behavior can be observed via response headers.

Ingestion success and volume can be monitored in:

Replenit Dashboard → Data and Health

Access depends on tenant permissions.

 

Related Documentation

Error Responses
Replenit Ingestion API | Error Responses

Best Practices Guide
Replenit Ingestion API | Best Practices

Customers API
Replenit Ingestion API | Customers

Orders API
Replenit Ingestion API | Orders

Products API
Replenit Ingestion API | Products