Rate Limits

AssistantRouter implements rate limiting to ensure fair usage and protect the API from abuse.

Default Limits

Metric	Hobby	Pro	Team	Enterprise
Requests/minute	10	60	300	Unlimited
Assistants	2	10	Unlimited	Unlimited

Use GET /v1/limits to retrieve your current workspace limits.

Feature Availability

Feature	Hobby	Pro	Team	Enterprise
Remove branding	-	Yes	Yes	Yes
Custom rate limits	-	-	-	Yes

Model Access

All tiers have access to all available models. Use GET /v1/models to see the full list.

Rate Limit Headers

Every API response includes headers indicating your current rate limit status:

X-RateLimit-Limit: 60
X-RateLimit-Remaining: 55
X-RateLimit-Reset: 1704067260

Header	Description
`X-RateLimit-Limit`	Maximum requests allowed in the current window
`X-RateLimit-Remaining`	Remaining requests in the current window
`X-RateLimit-Reset`	Unix timestamp when the window resets
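As a rough sketch of how a client might read these headers, the helper below turns them into a typed object. The names `HeaderSource`, `parseRateLimitHeaders`, and `msUntilReset` are hypothetical and not part of any AssistantRouter SDK. It also assumes `X-RateLimit-Reset` is expressed in seconds, which is how the sample value above reads.

```typescript
// Minimal shape shared by the Fetch API's Headers object and simple test doubles.
interface HeaderSource {
  get(name: string): string | null;
}

interface RateLimitStatus {
  limit: number;
  remaining: number;
  resetAt: Date;
}

// Hypothetical helper: returns null if any header is missing or not numeric.
function parseRateLimitHeaders(headers: HeaderSource): RateLimitStatus | null {
  const raw = [
    headers.get("X-RateLimit-Limit"),
    headers.get("X-RateLimit-Remaining"),
    headers.get("X-RateLimit-Reset"),
  ];
  const values = raw.map((v) => (v === null || v.trim() === "" ? NaN : Number(v)));
  if (values.some((n) => !Number.isFinite(n))) return null;
  const [limit, remaining, resetSeconds] = values;
  // Assumption: the reset timestamp is in seconds, so convert to milliseconds.
  return { limit, remaining, resetAt: new Date(resetSeconds * 1000) };
}

// Milliseconds left in the current window, never negative.
function msUntilReset(status: RateLimitStatus, nowMs: number = Date.now()): number {
  return Math.max(0, status.resetAt.getTime() - nowMs);
}
```

A response from `fetch` can be passed in directly as `parseRateLimitHeaders(response.headers)`, since the Fetch `Headers` object exposes a compatible `get` method.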

Handling Rate Limits

When you exceed a rate limit, the API returns a 429 Too Many Requests error:

{
  "error": {
    "type": "rate_limit_exceeded",
    "message": "Too many requests. Please try again later.",
    "retry_after_seconds": 30
  }
}
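One way to handle this body on the client is to validate its shape before trusting `retry_after_seconds`. The sketch below assumes the response body has already been parsed as JSON. The names `RateLimitErrorBody`, `isRateLimitErrorBody`, and `retryDelayMs` are illustrative only.

```typescript
// Shape of the 429 body shown above.
interface RateLimitErrorBody {
  error: {
    type: "rate_limit_exceeded";
    message: string;
    retry_after_seconds: number;
  };
}

// Type guard: true only when the body matches the documented 429 shape.
function isRateLimitErrorBody(body: unknown): body is RateLimitErrorBody {
  if (typeof body !== "object" || body === null) return false;
  const err = (body as { error?: unknown }).error;
  if (typeof err !== "object" || err === null) return false;
  const e = err as Record<string, unknown>;
  return (
    e["type"] === "rate_limit_exceeded" &&
    typeof e["message"] === "string" &&
    typeof e["retry_after_seconds"] === "number"
  );
}

// Returns the suggested wait in milliseconds, or null when the response
// is not a rate limit error in the expected shape.
function retryDelayMs(status: number, body: unknown): number | null {
  if (status !== 429 || !isRateLimitErrorBody(body)) return null;
  return body.error.retry_after_seconds * 1000;
}
```

Returning `null` for anything that is not a well-formed 429 keeps other errors from being retried by accident.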

Recommended Approach

Implement exponential backoff when you receive a rate limit error:

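async function makeRequestWithRetry<T>(fn: () => Promise<T>, maxRetries = 3): Promise<T> {
  for (let i = 0; i < maxRetries; i++) {
    try {
      return await fn();
    } catch (error: any) {
      // Retry only on rate limit errors, and only while attempts remain.
      if (error?.type === 'rate_limit_exceeded' && i < maxRetries - 1) {
        const delay = Math.pow(2, i) * 1000; // 1s, then 2s with the default of 3 attempts
        await new Promise(resolve => setTimeout(resolve, delay));
        continue;
      }
      throw error;
    }
  }
  // Only reachable when maxRetries <= 0, in which case fn was never called.
  throw new Error('maxRetries must be at least 1');
}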
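The delay calculation can also be separated from the retry loop so that it is capped, randomized, and aware of the server's `retry_after_seconds` hint. The following is a minimal sketch using the "full jitter" variant of exponential backoff. This variant is a substitution for the fixed doubling above, not something the API requires. `backoffDelayMs`, `BackoffOptions`, and the 30-second cap are assumptions made for illustration.

```typescript
interface BackoffOptions {
  baseMs?: number;            // first-attempt ceiling, default 1000
  maxMs?: number;             // upper bound on the exponential ceiling, default 30000
  retryAfterSeconds?: number; // server hint from the 429 body, if present
  random?: () => number;      // injectable for deterministic tests
}

// Hypothetical helper: delay before retry number `attempt` (0-based).
function backoffDelayMs(attempt: number, options: BackoffOptions = {}): number {
  const baseMs = options.baseMs ?? 1000;
  const maxMs = options.maxMs ?? 30000;
  const random = options.random ?? Math.random;

  // The exponential ceiling doubles per attempt and is capped at maxMs.
  const ceiling = Math.min(maxMs, baseMs * Math.pow(2, attempt));
  // Full jitter: pick a uniformly random delay below the ceiling.
  const jittered = Math.floor(random() * ceiling);
  // Never retry sooner than the server asked us to.
  const serverFloorMs = (options.retryAfterSeconds ?? 0) * 1000;
  return Math.max(jittered, serverFloorMs);
}
```

Jitter spreads retries from many clients over time instead of having them all return at the same instant, and the server hint acts as a lower bound on the wait.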

Check Your Limits

Use the /v1/limits endpoint to check your current limits:

curl https://api.assistantrouter.com/v1/limits \
  -H "Authorization: Bearer $API_KEY"

Response:

{
  "data": {
    "tier": "pro",
    "rateLimitRpm": 60,
    "nerfingRules": []
  }
}
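As an illustration of consuming this response, the sketch below validates the fields shown above and derives a minimum spacing between requests from `rateLimitRpm`. The element type of `nerfingRules` is not documented here, so it is left as `unknown[]`. `parseLimits` and `minIntervalMs` are hypothetical names.

```typescript
interface WorkspaceLimits {
  tier: string;
  rateLimitRpm: number;
  nerfingRules: unknown[]; // element shape is not specified in this section
}

// Parses the raw JSON text of a /v1/limits response and checks its shape.
function parseLimits(json: string): WorkspaceLimits {
  const parsed: unknown = JSON.parse(json);
  const data = (parsed as { data?: Record<string, unknown> } | null)?.data;
  const tier = data?.["tier"];
  const rpm = data?.["rateLimitRpm"];
  const rules = data?.["nerfingRules"];
  if (typeof tier !== "string" || typeof rpm !== "number" || !Array.isArray(rules)) {
    throw new Error("Unexpected /v1/limits response shape");
  }
  return { tier, rateLimitRpm: rpm, nerfingRules: rules };
}

// Smallest gap between requests that stays within the per-minute limit.
function minIntervalMs(rateLimitRpm: number): number {
  if (rateLimitRpm <= 0) throw new Error("rateLimitRpm must be positive");
  return Math.ceil(60000 / rateLimitRpm);
}
```

For the Pro response above, `minIntervalMs(60)` works out to 1000 ms, or one request per second if requests are spread evenly across the window.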

Best Practices

Monitor headers - Check X-RateLimit-Remaining on each response and slow down as it approaches zero
Implement backoff - Use exponential backoff when rate limited
Batch requests - Combine multiple operations when possible
Cache responses - Reduce duplicate requests with caching
Check limits endpoint - Use /v1/limits to confirm your workspace's current limits
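For the caching suggestion, a small in-memory cache with a time-to-live is often enough to avoid repeating identical requests inside one rate limit window. This is only a sketch: `TtlCache` is a hypothetical class, the TTL value is up to the caller, and nothing here is specific to AssistantRouter.

```typescript
// Minimal in-memory cache whose entries expire after ttlMs milliseconds.
class TtlCache<V> {
  private readonly entries = new Map<string, { value: V; expiresAt: number }>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  // `now` is injectable so that expiry can be tested without real waiting.
  constructor(ttlMs: number, now: () => number = () => Date.now()) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  // Returns the cached value, or undefined if absent or expired.
  get(key: string): V | undefined {
    const entry = this.entries.get(key);
    if (entry === undefined) return undefined;
    if (entry.expiresAt <= this.now()) {
      this.entries.delete(key);
      return undefined;
    }
    return entry.value;
  }

  set(key: string, value: V): void {
    this.entries.set(key, { value, expiresAt: this.now() + this.ttlMs });
  }
}
```

A typical use is to key the cache by request URL, check `get` before calling the API, and call `set` with the parsed response afterwards.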

Increasing Limits

Need higher limits?

Pro upgrade: Upgrade to Pro in the dashboard
Team upgrade: Upgrade to Team for unlimited assistants and higher rate limits
Enterprise: Contact sales for custom limits
