An in-memory rate limiter is often good enough in a single long-lived process.
It stops being trustworthy once the same endpoint is served by many ephemeral instances.
That is why serverless deployments usually need rate limiting backed by shared state.
Why Local Counters Fail
If each function instance keeps its own request counts, the limit applies per instance, not per client identity.
That means a distributed deployment can accidentally multiply the effective rate limit just by scaling out.
This is not a bug in the library. It is the natural result of local state in a distributed system.
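The multiplication effect is easy to see with plain in-process counters. This is an illustrative sketch, not real serverless code: the instance count, the limit, and the client id are all made up for the demonstration.

```python
# Each "instance" keeps its own local counter, so the per-client limit
# multiplies with scale-out.

LIMIT = 5  # intended requests per window, per client

def make_instance():
    counts = {}  # local, per-instance state
    def handle(client_id):
        counts[client_id] = counts.get(client_id, 0) + 1
        return counts[client_id] <= LIMIT  # True = request allowed
    return handle

instances = [make_instance() for _ in range(3)]

# A load balancer spreads one client's traffic across all instances.
allowed = sum(instances[i % 3]("client-a") for i in range(30))
print(allowed)  # 15 allowed: 3 instances x limit of 5, not the intended 5
```

Each instance honestly enforces its own limit of 5, yet the client gets 15 requests through, and the number grows with every instance added.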
Why Redis Fits
Redis is useful here because it gives you:
- fast shared state
- atomic operations
- expiration primitives
That is enough to implement fixed-window, sliding-window, or token-bucket approaches, depending on the fairness and cost model you need.
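A fixed-window limiter, for example, needs only INCR and EXPIRE. The sketch below is hedged: `FakeRedis` is a tiny in-memory stand-in with just those two operations, where real code would use a client library such as redis-py; the key format, limit, and window length are assumptions for illustration.

```python
import time

class FakeRedis:
    """In-memory stand-in for a Redis client, supporting only INCR and
    EXPIRE. Real code would use redis-py or similar against shared Redis."""
    def __init__(self):
        self.store = {}  # key -> (value, expiry timestamp or None)

    def incr(self, key):
        value, exp = self.store.get(key, (0, None))
        if exp is not None and exp <= time.time():
            value, exp = 0, None  # key has expired; start fresh
        value += 1
        self.store[key] = (value, exp)
        return value

    def expire(self, key, seconds):
        if key in self.store:
            value, _ = self.store[key]
            self.store[key] = (value, time.time() + seconds)

def allow(r, client_id, limit=10, window=60):
    # Fixed window: one counter per client per window, expired by Redis.
    key = f"ratelimit:{client_id}:{int(time.time() // window)}"
    count = r.incr(key)
    if count == 1:
        r.expire(key, window)  # first hit in the window sets the TTL
    return count <= limit

r = FakeRedis()
results = [allow(r, "client-a", limit=3) for _ in range(5)]
print(results)  # [True, True, True, False, False]
```

Because the counter lives in shared state and expires on its own, every instance sees the same count and the limit stops multiplying with scale-out.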
Use the Simplest Algorithm That Matches the Need
You do not always need a sliding-window log.
- fixed window is simple and often sufficient, though it allows bursts at window boundaries
- sliding window is fairer but costs more memory and more Redis work per request
- token bucket is a good fit when controlled burst capacity is acceptable
The correct choice depends on product behavior, not on which algorithm sounds most advanced.
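To make the token-bucket option concrete, here is a minimal sketch with an injected clock so the behavior is deterministic. The capacity and refill rate are illustrative, and the state is kept local to keep the algorithm visible; in a distributed deployment it would live in Redis.

```python
from dataclasses import dataclass

@dataclass
class TokenBucket:
    capacity: float     # maximum burst size, in tokens
    refill_rate: float  # tokens added per second
    tokens: float       # current token count
    last: float         # timestamp of the previous call

    def allow(self, now):
        # Refill based on elapsed time, capped at capacity.
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1  # spend one token for this request
            return True
        return False

b = TokenBucket(capacity=2, refill_rate=1.0, tokens=2, last=0.0)
seq = [b.allow(t) for t in (0.0, 0.1, 0.2, 1.5)]
print(seq)  # [True, True, False, True]
```

The first two requests spend the stored burst capacity, the third is rejected because the bucket has not refilled yet, and the fourth passes after enough time has elapsed. That burst-then-throttle shape is exactly the product behavior you are choosing when you pick this algorithm.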
Keep the Decision Atomic
If evaluating and updating the limit takes more than one Redis command, keep the sequence atomic with either:
- a Lua script, which Redis executes as a single unit
- a transaction pattern (or a single command such as INCR) that truly fits the algorithm

Under concurrency, a non-atomic check-then-update lets two requests each see one remaining slot and both pass.
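The failure mode is easy to force by hand. In this sketch a plain dict stands in for Redis, the interleaving is written out explicitly, and the single-step increment only models the semantics of Redis INCR (an in-process `+=` is not itself atomic across real threads).

```python
store = {"count": 4}
LIMIT = 5

# Non-atomic check-then-increment, interleaved across two requests:
a_seen = store["count"]       # request A reads 4
b_seen = store["count"]       # request B also reads 4
store["count"] = a_seen + 1   # A writes 5 and is allowed
store["count"] = b_seen + 1   # B writes 5 and is ALSO allowed
non_atomic_allowed = (a_seen < LIMIT) + (b_seen < LIMIT)
print(non_atomic_allowed)  # 2 allowed, but only 1 slot remained

# Atomic increment, modeling what Redis INCR or a Lua script gives you:
store["count"] = 4
def atomic_incr():
    store["count"] += 1       # one indivisible step on the Redis side
    return store["count"]

atomic_allowed = sum(atomic_incr() <= LIMIT for _ in range(2))
print(atomic_allowed)  # only 1 of the 2 requests is allowed
```

With the atomic version, each request observes the count it just produced, so the decision and the update can never be separated by another request.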
The Practical Rule
For distributed and serverless APIs:
- store rate-limit state centrally
- pick an algorithm that matches the user experience you want
- keep the update path atomic
That is more important than which framework middleware you started with.
Further Reading