How long must you wait for a bus that arrives completely at random? Or for a radioactive atom to decay? In each case the waiting time has no memory of the past — the distribution that captures this behaviour is the Exponential distribution.
Definition
Let λ>0 be the rate parameter. A random variable X follows an Exponential distribution with rate λ, written X∼Exp(λ), if its probability density function (PDF) is
f(x):={λe−λx0x≥0,x<0.
The parameter λ represents the average number of events per unit time; its reciprocal 1/λ is the average waiting time.
Verification that f is a valid PDF
A valid PDF must be non-negative and integrate to 1. Non-negativity is immediate since λ>0 and e−λx>0. For the integral,
∫−∞∞f(x)dx=∫0∞λe−λxdx=λ⋅[−λ1e−λx]0∞=λ⋅λ1=1.
Cumulative distribution function
Integrating the PDF gives the cumulative distribution function (CDF):
F(x):=P(X≤x)={1−e−λx0x≥0,x<0.
Equivalently, the survival function is P(X>x)=e−λx for x≥0.
Mean
The mean (expected value) of X∼Exp(λ) is derived by integration by parts. Using u=x and dv=λe−λxdx, so du=dx and v=−e−λx:
E[X]=∫0∞xλe−λxdx=[−xe−λx]0∞+∫0∞e−λxdx.
The boundary term vanishes (since xe−λx→0 as x→∞), and the remaining integral gives
E[X]=∫0∞e−λxdx=λ1.
Variance
To find Var(X)=E[X2]−(E[X])2, compute E[X2] via integration by parts twice (or by differentiating the moment generating function). Applying integration by parts with u=x2 and dv=λe−λxdx:
E[X2]=∫0∞x2λe−λxdx=[−x2e−λx]0∞+∫0∞2xe−λxdx=λ2E[X]=λ22.
Therefore
Var(X)=E[X2]−(E[X])2=λ22−λ21=λ21.
The standard deviation equals the mean: σ=1/λ.
Memorylessness
The most distinctive property of the Exponential distribution is memorylessness: for all s,t≥0,
P(X>s+t∣X>s)=P(X>t).
Proof. By the definition of conditional probability and the survival function:
P(X>s+t∣X>s)=P(X>s)P(X>s+t)=e−λse−λ(s+t)=e−λt=P(X>t).□
Informally: if you have already waited s units of time with no event, the remaining waiting time has exactly the same distribution as if you had just started. The past waiting time is irrelevant.
Uniqueness
The Exponential distribution is the unique absolutely continuous distribution on [0,∞) with the memorylessness property. Any non-negative continuous random variable satisfying P(X>s+t∣X>s)=P(X>t) for all s,t≥0 must be Exp(λ) for some λ>0.
This makes the Exponential distribution the continuous-time analogue of the Geometric distribution, which is the unique discrete distribution on {0,1,2,…} with the memorylessness property.
Connection to the Poisson process
In a Poisson process with rate λ, events occur randomly in continuous time such that the number of events in any interval of length t follows a Poisson distribution with mean λt. The inter-arrival times — the waiting times between consecutive events — are independent and each distributed as Exp(λ).
This connection is fundamental: the Exponential distribution is the continuous-time building block for the Poisson process, just as the Geometric distribution is the discrete-time building block for Bernoulli trials.
Summary
- X∼Exp(λ) has PDF f(x)=λe−λx for x≥0 and rate parameter λ>0.
- CDF: F(x)=1−e−λx for x≥0.
- Mean: E[X]=1/λ.
- Variance: Var(X)=1/λ2; standard deviation equals the mean.
- Memorylessness: P(X>s+t∣X>s)=P(X>t); the Exponential is the unique continuous distribution on [0,∞) with this property.
- Inter-arrival times of a Poisson process with rate λ are independent Exp(λ) random variables.