HIGH | Security | medium | API Design

Implement Rate Limiting

Apply rate limiting to all public-facing API endpoints. Without rate limits, a single attacker can overwhelm your server, exhaust your database connections, or brute-force authentication — taking down the service for all users.

Why This Matters

Without rate limiting, your API has no defense against volume-based attacks. A single script can send thousands of requests per second, exhausting database connections, consuming server CPU, and blocking legitimate users. Brute-force attacks against login endpoints can try millions of passwords. Scraping bots can download your entire dataset. Rate limiting is the minimum viable protection against these attacks.

Related Rules

Validate All Request Input (Security, CRITICAL)
Use Proper HTTP Status Codes (Quality, MEDIUM)
Version Your API (Architecture, MEDIUM)
Use Resource-Oriented (Noun) URLs (Architecture, MEDIUM)

Why this matters

Without rate limiting, your API endpoints accept unlimited requests from any source. This creates multiple attack vectors:

Denial of Service: a single attacker sends enough requests to exhaust your server's CPU, memory, or database connections, making the service unavailable for all users
Brute-force attacks: an attacker tries thousands of password combinations against your login endpoint
Data scraping: a bot downloads your entire dataset by paginating through your API at maximum speed
Cost exhaustion: if you use usage-based pricing (AI APIs, SMS, email), an attacker can run up your bill

Rate limiting doesn't prevent all abuse, but it puts a ceiling on the damage any single actor can cause. It is a fundamental security control that every public API needs.
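To make the "ceiling" concrete, here is a rough back-of-the-envelope sketch. It assumes a limit of 5 requests per minute per client (the strict end of what this rule suggests for authentication endpoints) and a hypothetical list of 1,000,000 candidate passwords. The function names are illustrative only.

```typescript
// Upper bound on attempts one client can make in a day under a
// fixed per-minute limit.
function maxAttemptsPerDay(limitPerMinute: number): number {
  const minutesPerDay = 24 * 60;
  return limitPerMinute * minutesPerDay;
}

// Days a single client would need to work through a candidate list
// at that rate.
function daysToExhaust(candidates: number, limitPerMinute: number): number {
  return candidates / maxAttemptsPerDay(limitPerMinute);
}

console.log(maxAttemptsPerDay(5));        // 7200
console.log(daysToExhaust(1000000, 5));   // roughly 138.9
```

The bound applies per rate-limit key. An attacker spreading requests across many IP addresses multiplies it, which is one reason rate limiting alone does not prevent all abuse.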

The rule

Apply rate limiting to every public-facing endpoint. Use different limits for different endpoint types: authentication endpoints should have strict limits (5-10 requests per minute), data APIs should have moderate limits (100-1000 per minute), and read-heavy endpoints can be more generous.
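One way to express per-endpoint-type limits is a small lookup table consulted by the limiter. The sketch below is a minimal in-memory fixed-window counter, not a production implementation: the tier names (`auth`, `data`, `read`), the specific numbers, and the `TieredLimiter` class are all assumptions made for illustration, and the state lives in a single process.

```typescript
// Hypothetical tiers; the numbers are illustrative picks from the ranges above.
type Tier = "auth" | "data" | "read";

interface TierLimit {
  max: number;      // requests allowed per window
  windowMs: number; // window length in milliseconds
}

const ONE_MINUTE_MS = 60 * 1000;

const LIMITS: Record<Tier, TierLimit> = {
  auth: { max: 5, windowMs: ONE_MINUTE_MS },
  data: { max: 100, windowMs: ONE_MINUTE_MS },
  read: { max: 1000, windowMs: ONE_MINUTE_MS },
};

interface WindowState {
  windowStart: number; // ms timestamp when the current window began
  count: number;       // requests seen in the current window
}

// In-memory fixed-window counter keyed by tier and client.
// State is per-process, so this only works for a single server instance.
class TieredLimiter {
  private windows = new Map<string, WindowState>();

  check(tier: Tier, clientId: string, now: number = Date.now()) {
    const { max, windowMs } = LIMITS[tier];
    const key = `${tier}:${clientId}`;
    let state = this.windows.get(key);

    // Start a new window on first sight or once the old one has expired.
    if (!state || now - state.windowStart >= windowMs) {
      state = { windowStart: now, count: 0 };
      this.windows.set(key, state);
    }

    state.count += 1;
    return {
      success: state.count <= max,
      limit: max,
      remaining: Math.max(0, max - state.count),
      reset: state.windowStart + windowMs, // ms timestamp when the window ends
    };
  }
}
```

A route handler would call something like `limiter.check("auth", clientId)` before doing any work and reject the request when `success` is false. With more than one server instance or a serverless deployment, the counters need to live in a shared store such as Redis.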

Bad example

// BAD: no rate limiting — unlimited login attempts
export async function POST(request: Request) {
  const { email, password } = await request.json();
 
  const user = await db.user.findUnique({ where: { email } });
  if (!user || !await bcrypt.compare(password, user.passwordHash)) {
    return Response.json({ error: "Invalid credentials" }, { status: 401 });
  }
 
  const token = await createSession(user.id);
  return Response.json({ token });
}

Good example

import { Ratelimit } from "@upstash/ratelimit";
import { Redis } from "@upstash/redis";
 
const ratelimit = new Ratelimit({
  redis: Redis.fromEnv(),
  limiter: Ratelimit.slidingWindow(5, "1 m"), // 5 requests per minute
  analytics: true,
});
 
export async function POST(request: Request) {
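  // Rate limit by client IP. x-forwarded-for may hold a comma-separated
  // proxy chain, so use only the first entry.
  const forwardedFor = request.headers.get("x-forwarded-for");
  const ip = forwardedFor?.split(",")[0]?.trim() || "127.0.0.1";
  const { success, limit, remaining, reset } = await ratelimit.limit(ip);
 
  if (!success) {
    // reset is a Unix timestamp in milliseconds; Retry-After expects seconds
    const retryAfter = Math.max(0, Math.ceil((reset - Date.now()) / 1000));
    return Response.json(
      { error: "Too many requests. Try again later." },
      {
        status: 429,
        headers: {
          "Retry-After": retryAfter.toString(),
          "X-RateLimit-Limit": limit.toString(),
          "X-RateLimit-Remaining": remaining.toString(),
          "X-RateLimit-Reset": reset.toString(),
        },
      }
    );
  }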
 
  const { email, password } = await request.json();
 
  const user = await db.user.findUnique({ where: { email } });
  if (!user || !await bcrypt.compare(password, user.passwordHash)) {
    return Response.json({ error: "Invalid credentials" }, { status: 401 });
  }
 
  const token = await createSession(user.id);
  return Response.json({ token });
}

How to detect

Check if your API routes have rate limiting middleware:

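grep -rniE "rate.?limit" --include="*.ts" app/api/

If there are no results, your API routes likely have no application-level rate limiting. Before concluding that, also check shared middleware outside app/api/ and any limits enforced by a gateway, CDN, or reverse proxy.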

Remediation

Choose a rate limiting library: Upstash Ratelimit (serverless), express-rate-limit (Express), or build your own with Redis
Define rate limits per endpoint type, for example strict for auth (5/min), moderate for writes (30/min), generous for reads (200/min), then tune the numbers to your real traffic
Identify the client: use IP address, API key, or authenticated user ID
Return 429 status code with Retry-After and X-RateLimit-* headers
Monitor rate limit hits to detect attacks and adjust limits
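The client-identification and 429-header steps above can be sketched as two small helpers. This is a minimal sketch under stated assumptions: the `x-api-key` header name, the `userId` parameter, and both function names are hypothetical, and `resetMs` is assumed to be a Unix timestamp in milliseconds.

```typescript
// Minimal structural type so the helper works with the Fetch API `Headers`
// object or any test double that has a `get` method.
interface HeaderReader {
  get(name: string): string | null;
}

// Prefer a stable identity (user ID, then API key) over the IP address.
// `userId` stands for whatever your auth layer resolves (hypothetical).
function clientKey(headers: HeaderReader, userId?: string): string {
  if (userId) return `user:${userId}`;

  const apiKey = headers.get("x-api-key"); // header name is an assumption
  if (apiKey) return `key:${apiKey}`;

  // x-forwarded-for can be a comma-separated chain; take the first entry.
  // Only trust this header when your own proxy or platform sets it.
  const forwarded = headers.get("x-forwarded-for");
  const ip = forwarded?.split(",")[0]?.trim();
  return `ip:${ip || "unknown"}`;
}

// Build the headers for a 429 response. `nowMs` is injectable so the
// function is deterministic in tests.
function rateLimitHeaders(
  limit: number,
  remaining: number,
  resetMs: number,
  nowMs: number = Date.now()
): Record<string, string> {
  // Retry-After is expressed in whole seconds.
  const retryAfterSeconds = Math.max(0, Math.ceil((resetMs - nowMs) / 1000));
  return {
    "Retry-After": String(retryAfterSeconds),
    "X-RateLimit-Limit": String(limit),
    "X-RateLimit-Remaining": String(Math.max(0, remaining)),
    "X-RateLimit-Reset": String(resetMs),
  };
}
```

In a handler, `clientKey(request.headers, session?.userId)` would supply the key passed to the limiter, and `rateLimitHeaders(limit, remaining, reset)` would supply the `headers` option of the 429 response. Keying on user ID or API key where available avoids penalizing many users who share one IP address.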
