Rate limits

Learn about Airwallex API rate limits, how they work, and best practices for handling them in your integration.

API rate limits are controls that protect Airwallex systems by limiting the number of requests allowed over a specified period. They ensure fair access to Airwallex APIs for all users and maintain predictable, stable service performance.

These limits apply to all public APIs listed in the API Reference.

Limit types

Airwallex enforces two types of limits.

Rate limit

Rate limits control how many requests can be made over a rolling time period. This prevents excessive bursts of traffic that could degrade service performance.

For example, a rate limit of 100 requests per second means you can make up to 100 API calls within any one-second window. If you exceed this threshold, additional requests are rejected with an HTTP 429 status code until the rate drops below the limit.

Concurrency limit

A concurrency limit controls how many active requests (those still awaiting a response) are allowed at any given time. This prevents a single account from monopolizing server resources with long-running or slow requests.

For example, a concurrency limit of 50 means you can have up to 50 simultaneous API requests awaiting responses. Once this limit is reached, additional requests are rejected until existing requests complete.
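
One way to stay under this limit is to cap in-flight requests on the client side. The sketch below is a minimal illustration, assuming the Python `requests` library; the base URL and token handling are placeholders, and 50 simply mirrors the production global concurrency limit described later on this page.

```python
# Minimal sketch: cap the number of requests awaiting a response at 50.
import threading

import requests

MAX_IN_FLIGHT = 50
in_flight = threading.BoundedSemaphore(MAX_IN_FLIGHT)

def get_transfer(transfer_id: str, access_token: str) -> dict:
    """Retrieve one Transfer while holding a slot in the in-flight pool."""
    with in_flight:  # blocks if 50 requests are already awaiting responses
        resp = requests.get(
            f"https://api.airwallex.com/api/v1/transfers/{transfer_id}",
            headers={"Authorization": f"Bearer {access_token}"},
            timeout=30,
        )
        resp.raise_for_status()
        return resp.json()
```

Worker threads that call a helper like this simply queue up whenever 50 responses are still outstanding, so the account never exceeds its concurrency limit.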

Limit levels

Limits are applied at multiple levels to provide granular control and protection.

Global level

The global level represents the aggregate limit across all endpoints for a given account or organization. This is the total number of requests or concurrent connections allowed regardless of which specific endpoints are being called.

Endpoint level

The endpoint level applies to an individual API operation across all instances of that resource type.

For example:

  • POST /api/v1/pa/payment_intents/{id}/confirm: Confirm a Payment Intent
  • GET /api/v1/deposits: Get list of Deposits
  • POST /api/v1/transfers/create: Create a new Transfer

Each of these endpoints has its own limit that counts separately from other endpoints, though they all contribute to the global limit.

Resource level

The resource level applies to operations on a single, specific resource.

For example:

  • POST /api/v1/pa/payment_intents/int_hkpdp4tjth9mt2both9/confirm: Confirm a specific Payment Intent
  • GET /api/v1/transfers/tr_wn7uEO-YPxmPr2w5JNKb1A: Retrieve a specific Transfer

Certain API resources have specific rate and concurrency limits to ensure performance or data consistency. Refer to the API Reference for details about specific resources.

Production and sandbox limits

Airwallex applies different limits based on the environment.

| Environment | Rate limit (Global) | Rate limit (Endpoint) | Concurrency limit (Global) |
| ----------- | ------------------- | --------------------- | -------------------------- |
| Production  | 100 per second      | 20 per second         | 50                         |
| Sandbox     | 20 per second       | 10 per second         | 10                         |

If you require higher limits for your production use case or a temporary uplift for an upcoming flash sale, contact Airwallex support to discuss your needs.

Error response

When a request exceeds a limit listed above, Airwallex returns an error response with an HTTP 429 (Too Many Requests) status code.

```json
{
  "code": "too_many_requests",
  "message": "Rate Limit Exceeded.",
  "trace_id": "65550cb69fbe78b760be650a64d00269",
  "details": {}
}
```
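
As a brief illustration (assuming the Python `requests` library), you can detect this response and capture the `trace_id`, which is useful if you later need to contact Airwallex support about the rejected request:

```python
# Minimal sketch: detect a rate-limit rejection and log its trace_id.
import requests

resp = requests.get(
    "https://api.airwallex.com/api/v1/deposits",
    headers={"Authorization": "Bearer <access_token>"},
    timeout=30,
)

if resp.status_code == 429:
    body = resp.json()
    # `code` is "too_many_requests"; keep `trace_id` for support enquiries.
    print(f"Rate limited: {body['message']} (trace_id={body['trace_id']})")
```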

Handle limiting

To handle limiting effectively, implement a retry mechanism that responds to 429 status codes. Use exponential backoff to progressively increase wait times between retry attempts, reducing request volume when limits are reached. Add random jitter to your backoff intervals to prevent multiple clients from synchronizing their retries and overwhelming the API.

For example, after receiving a 429 response:

  1. Wait 1 second before the first retry.
  2. Wait 2 seconds before the second retry.
  3. Wait 4 seconds before the third retry.
  4. Continue doubling the wait time up to a maximum threshold (for example, 16 seconds).
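
The sketch below implements this schedule as a retry helper with exponential backoff and full jitter. It assumes the Python `requests` library; the function name and parameters are illustrative, not part of the Airwallex API.

```python
# Sketch: retry on HTTP 429 with exponential backoff and full jitter.
import random
import time

import requests

def request_with_backoff(method: str, url: str, *, max_retries: int = 5,
                         base_delay: float = 1.0, max_delay: float = 16.0,
                         **kwargs) -> requests.Response:
    for attempt in range(max_retries + 1):
        resp = requests.request(method, url, **kwargs)
        if resp.status_code != 429:
            return resp
        if attempt == max_retries:
            break
        # Double the delay each attempt (1s, 2s, 4s, ...), cap it, then
        # randomize so concurrent clients don't retry in lockstep.
        delay = min(base_delay * (2 ** attempt), max_delay)
        time.sleep(random.uniform(0, delay))
    return resp
```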

For scheduled batch tasks or high-volume operations, implement a client-side throttling algorithm to stay within your account or endpoint limits.
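
One common throttling approach is a token bucket sized below your limits. The following is a minimal sketch; the 20-requests-per-second figure mirrors the production endpoint limit above and is only an example, not a required client setting.

```python
# Sketch: client-side token-bucket throttle for batch jobs.
import threading
import time

class TokenBucket:
    def __init__(self, rate_per_second: float, burst: int):
        self.rate = rate_per_second
        self.capacity = burst
        self.tokens = float(burst)
        self.updated = time.monotonic()
        self.lock = threading.Lock()

    def acquire(self) -> None:
        """Block until a token is available, then consume it."""
        while True:
            with self.lock:
                now = time.monotonic()
                self.tokens = min(self.capacity,
                                  self.tokens + (now - self.updated) * self.rate)
                self.updated = now
                if self.tokens >= 1:
                    self.tokens -= 1
                    return
            time.sleep(1.0 / self.rate)

throttle = TokenBucket(rate_per_second=20, burst=20)

# Before every API call in the batch job:
# throttle.acquire()
# send_request(...)
```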

Load testing

Airwallex discourages conducting load tests in the sandbox environment. The sandbox enforces lower rate and concurrency limits, which means your load tests are likely to encounter limits that wouldn't occur in production. Additionally, the sandbox isn't a perfect representation of production because it uses various mock mechanisms, resulting in different latency profiles that don't reflect real-world performance.

For more realistic load testing, build your integration with the ability to mock Airwallex APIs. Your mocks should simulate realistic latency based on measurements from actual production API calls. This approach allows you to test your system's behavior under load without hitting sandbox limits or generating misleading performance data.
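
As a rough illustration, a mock can stand in for an Airwallex endpoint and sleep for a sampled latency before returning a canned response. The function name, latency distribution, and response fields below are placeholders; replace the timings with measurements from your own production traffic.

```python
# Sketch: mock endpoint handler with simulated, measured latency.
import random
import time

def mock_create_transfer(payload: dict) -> dict:
    # Simulate observed latency, e.g. a median around 250 ms with a long tail;
    # tune the distribution parameters to your own production measurements.
    time.sleep(random.lognormvariate(-1.4, 0.4))  # seconds
    return {"id": "tr_mock_000000000000", "status": "NEW", **payload}
```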
