Latency Simulator
Interactive p99, p95, p50, and average latency calculator. Generate realistic latency distributions, visualize percentiles, and understand the critical difference between average and tail latency in distributed systems.
Distribution Model
Default for most real-world latencies — right-skewed, non-negative, multiplicative variance.
Quick Presets
Sets parameters to match real-world system behaviors. Click Generate to create a new random sample.
The typical response time of a normal (non-tail) request. Start with 100 (typical API). Try 30 for cache hit, 500 for slow DB.
Spread of the distribution — higher = more variance in response times. 0.3 = very consistent (CDN). 0.8 = typical API. 1.5 = highly variable (GC-heavy, external calls).
Percentage of requests entering the high-latency tail due to retries, cache misses, GC pauses, cold starts, or downstream slowness. 0% = no tail. 2% = typical. 5%+ = problematic.
Multiplier applied to tail-latency requests relative to normal requests. 2-3× = mild slowdown (e.g. cache miss). 5× = typical slow path. 10×+ = timeout-retry scenario.
Generate a sample to see latency metrics.