Latency Simulator

Interactive p99, p95, p50, and average latency calculator. Generate realistic latency distributions, visualize percentiles, and understand the critical difference between average and tail latency in distributed systems.

Latency Distribution Generator

Configure realistic latency parameters and generate random samples. Adjust the sliders to model different system behaviors — from predictable APIs to highly-variable services.

Distribution Model

Default for most real-world latencies — right-skewed, non-negative, multiplicative variance.

Quick Presets

Sets parameters to match real-world system behaviors. Click Generate to create a new random sample.

Base Latency (ms)

The typical response time of a normal (non-tail) request. Start with 100 (typical API). Try 30 for cache hit, 500 for slow DB.

Variability

Spread of the distribution — higher = more variance in response times. 0.3 = very consistent (CDN). 0.8 = typical API. 1.5 = highly variable (GC-heavy, external calls).

Tail Probability (%)

Percentage of requests entering the high-latency tail due to retries, cache misses, GC pauses, cold starts, or downstream slowness. 0% = no tail. 2% = typical. 5%+ = problematic.

Tail Multiplier (×)

Multiplier applied to tail-latency requests relative to normal requests. 2-3× = mild slowdown (e.g. cache miss). 5× = typical slow path. 10×+ = timeout-retry scenario.

Sample Count

Generate a sample to see latency metrics.