> For the complete documentation index, see [llms.txt](https://faisalaffan.gitbook.io/design-system/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://faisalaffan.gitbook.io/design-system/02-core/rate-limiter.md). # Rate Limiter Pluggable algorithm interface with three implementations (sliding window, token bucket, fixed window) and an optional storage interface backed by an in-memory store. Exposed as a Gin middleware via `pkg/kit/middleware`. ## Architecture ```mermaid %%{init: {"theme": "base", "themeVariables": {"background": "#ffffff"}}}%% flowchart TB subgraph "HTTP Layer" REQ["Request"] MW["RateLimitPerSecond middleware
(pkg/kit/middleware)"] end subgraph "Algorithm (rate-limiter/algorithm)" IFACE["Algorithm interface
Allow(key, limit, window) bool"] SW["SlidingWindow
(default)"] TB["TokenBucket"] FW["FixedWindow"] end subgraph "Storage (rate-limiter/storage)" SIFACE["Store interface
Increment / Count / Reset"] MEM["MemoryStore"] end REQ --> MW MW --> IFACE IFACE --> SW IFACE --> TB IFACE --> FW SW -.->|optional| SIFACE TB -.->|optional| SIFACE FW -.->|optional| SIFACE SIFACE --> MEM subgraph legend["Algorithm Implementations"] D1["SlidingWindow: records per-timestamp in-slice"] D2["TokenBucket: refill rate + token debt"] D3["FixedWindow: wall-clock window alignment"] end ``` ## Algorithm Interface ```go type Algorithm interface { Allow(key string, limit int, window time.Duration) bool } ``` All algorithms track state per-key (typically the client IP). `Allow` returns `true` if the request is within the limit and `false` if rate-limited. ### Sliding Window (default) Tracks a slice of nanosecond timestamps per key. Old entries outside the window are pruned on each call. Provides the most accurate window boundaries. ```go func (sw *SlidingWindow) Allow(key string, limit int, window time.Duration) bool { now := time.Now().UnixNano() cutoff := now - window.Nanoseconds() w, ok := sw.windows[key] if !ok { sw.windows[key] = &slidingEntry{timestamps: []int64{now}} return true } var valid []int64 for _, ts := range w.timestamps { if ts > cutoff { valid = append(valid, ts) } } if len(valid) < limit { valid = append(valid, now) w.timestamps = valid return true } w.timestamps = valid return false } ``` ### Token Bucket Each key has a token pool that refills continuously at `rate = limit / window.Seconds()`. Bursts up to `limit` are allowed. ```go func (tb *TokenBucket) Allow(key string, limit int, window time.Duration) bool { b, ok := tb.buckets[key] if !ok { b = &bucket{tokens: float64(limit) - 1, lastRefill: time.Now()} tb.buckets[key] = b return true } rate := float64(limit) / window.Seconds() elapsed := time.Since(b.lastRefill).Seconds() b.tokens += elapsed * rate if b.tokens > float64(limit) { b.tokens = float64(limit) } b.lastRefill = time.Now() if b.tokens >= 1 { b.tokens-- return true } return false } ``` ### Fixed Window Aligns windows to wall-clock boundaries (e.g., 0:00--1:00, 1:00--2:00). Simpler but allows double traffic at boundaries. ```go func (fw *FixedWindow) Allow(key string, limit int, window time.Duration) bool { now := time.Now().UnixNano() windowNano := window.Nanoseconds() currentWindow := now / windowNano * windowNano w, ok := fw.windows[key] if !ok || w.window != currentWindow { fw.windows[key] = &fixedEntry{count: 1, window: currentWindow} return true } if w.count < limit { w.count++ return true } return false } ``` ## Storage Interface Optional persistence layer for external rate-limit state (e.g., Redis). ```go type Store interface { Increment(key string, window time.Duration) (int, error) Count(key string, window time.Duration) (int, error) Reset(key string) error } ``` The default `MemoryStore` uses TTL-based expiration per entry. ## Gin Middleware Bridged through `pkg/kit/middleware` as `RateLimitPerSecond`: ```go srv.GET("/limited", kitmw.RateLimitPerSecond(algo, 5), func(c *gin.Context) { kit.OK(c, gin.H{"message": "within rate limit"}) }) ``` Key extracted from `c.ClientIP()`. Returns 429 Too Many Requests when exceeded. ## Endpoints | Method | Path | Description | | ------ | -------------- | ------------------------------------- | | `GET` | `/limited` | Rate-limited route (default: 5 req/s) | | `GET` | `/unlimited` | No rate limit applied | | `POST` | `/admin/reset` | Reset rate limit for a specific IP | ## Technical Decisions * **Sliding window as default**: Best accuracy for rate limiting. Fixed window has boundary bias; token bucket allows short bursts. Sliding window provides the most predictable behavior for most use cases. * **Nanosecond precision**: `time.Now().UnixNano()` avoids collision artifacts that millisecond precision can cause under high concurrency within a single second. * **Per-key maps with mutexes**: In-process maps keep the implementation self-contained with zero external dependencies. Contention is negligible for typical per-IP granularity (one goroutine per request). * **Algorithm interface as the seam**: New algorithms (e.g., sliding window counter from Redis) can be added by implementing `Allow(key, limit, window)`. No middleware or handler code changes. * **Storage interface for production use**: The algorithm state is ephemeral. Production deployments swap `MemoryStore` for Redis using the `Store` interface, making rate limits survive restarts and scale across instances. ## Source Code [View on GitHub](https://github.com/faisalaffan/faisalaffan-design-system/blob/dev/services/rate-limiter/main.go) --- # Agent Instructions This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com. ## Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter: ``` GET https://faisalaffan.gitbook.io/design-system/02-core/rate-limiter.md?ask=&goal= ``` `ask` is the immediate question: it should be specific, self-contained, and written in natural language. `goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.