Rate Limits

Understand API rate limits and best practices for optimizing your usage.

Overview

Rate limits prevent abuse and ensure fair usage for all customers. Twig AI enforces limits on:

  • API requests per minute

  • API requests per day

  • Tokens processed per day

  • Concurrent requests

Rate Limit Tiers

Plan
Requests/Minute
Requests/Day
Tokens/Day
Concurrent

Free

20

1,000

100,000

2

Pro

100

10,000

1,000,000

10

Enterprise

1,000+

Custom

Custom

50+

Rate Limit Headers

Every API response includes rate limit information:

X-RateLimit-Limit: 100
X-RateLimit-Remaining: 95
X-RateLimit-Reset: 1640995200
X-RateLimit-Used: 5

Header Meanings:

  • Limit: Total requests allowed in window

  • Remaining: Requests left in current window

  • Reset: Unix timestamp when limit resets

  • Used: Requests consumed in window

Rate Limit Exceeded

When you exceed limits:

Response:

Status Code: 429 Too Many Requests

Handling Rate Limits

Exponential Backoff

Respect Retry-After Header

Queue-Based Processing

Optimization Strategies

1. Caching

Cache responses for common queries:

2. Batching

Batch multiple requests:

3. Request Prioritization

4. Distributed Rate Limiting

For multi-server deployments:

Monitoring Usage

Track Consumption

Set Alerts

Configure alerts for high usage:

Upgrading Plans

Need higher limits?

  1. View current usage: Settings → Usage

  2. Compare with limits

  3. Upgrade plan: Settings → Billing

  4. New limits apply immediately

Enterprise Custom Limits:

Best Practices

1. Stay Within Limits

✅ Monitor usage regularly ✅ Implement retry logic ✅ Cache when possible ✅ Use exponential backoff ❌ Don't spam API

2. Optimize Requests

✅ Batch related operations ✅ Use streaming for long responses ✅ Cache frequent queries ✅ Filter unnecessary requests ❌ Don't poll excessively

3. Plan for Growth

✅ Monitor usage trends ✅ Upgrade proactively ✅ Implement queue systems ✅ Use distributed rate limiting ❌ Don't wait until hitting limits

Troubleshooting

Frequently Hitting Limits

Solutions:

  1. Upgrade plan

  2. Implement caching

  3. Optimize request patterns

  4. Use batch processing

  5. Contact support for custom limits

Unexpected Rate Limit Errors

Check:

  1. Are you within stated limits?

  2. Multiple servers sharing key?

  3. Retry logic causing loops?

  4. Background jobs consuming quota?

Next Steps

Last updated