Design An API Rate Limiter - System Design

Requirements

Functional Requirements
- Apply Rate Limiting Policy per chosen Entity
- To be able to change the Policy
- enforce limits
Non-Functional Requirements
- Scalability
- Availability
- Low Latency

This ensures:

Typical data:

Characteristics:

Admin-facing endpoint (e.g. /admin/configure)
Defines:
- rate limits (e.g. N requests/min)
- burst capacity
- algorithm type (fixed window, token bucket, etc.)

Core responsibility: apply rate limiting policy
Designed to support multiple policies (pluggable engine):
- not fixed to one approach
- can switch between strategies (e.g. fixed window, token bucket, etc.)
Policy is:
- configurable
- not hardcoded
- applied based on configuration (from admin API)
You explicitly chose:
- not to go deep into specific algorithms right now
- keep it flexible and abstract
Key idea:
- limiter acts as an execution engine
- takes policy + request → produces allow/deny decision
IP Based enforcement for abusive requests

Purpose:
- store rate limiting state (e.g. counters)
You identified it as:
- key-value based
- simple structure (no complex schema)
Workload characteristics:
- write-heavy
- frequent updates per request
Requirement:
- should have good write performance
Conclusion you made:
- a simple, high-performance key-value store is sufficient