How many times do you flip a coin before it comes up heads? That simple question — waiting for the first success in a sequence of independent trials — is exactly what the Geometric distribution answers.
Setup: repeated Bernoulli trials
Recall that a Bernoulli trial is a single experiment with two outcomes: success (probability p) and failure (probability 1−p), where 0<p≤1. Now repeat such a trial independently, over and over, until the first success appears.
Let X be the total number of trials performed (including the final successful one). Then X takes values in {1,2,3,…}: it equals 1 if the very first trial succeeds, 2 if the first fails and the second succeeds, and so on.
We write X∼Geom(p).
PMF derivation
For X=k to occur, the first k−1 trials must all be failures and the k-th must be a success. Because the trials are independent:
In words: if you have already seen m failures, the number of additional trials you need is distributed exactly the same as if you were starting fresh. The past failures give you no useful information about when the first success will arrive.
Uniqueness of memorylessness
The Geometric distribution is the only discrete distribution on {1,2,3,…} that satisfies the memorylessness property. This is the discrete analogue of the fact that the Exponential distribution is the unique memoryless continuous distribution on the positive reals.
Summary
X∼Geom(p) counts the number of independent Bernoulli(p) trials up to and including the first success.
PMF:P(X=k)=(1−p)k−1p for k=1,2,3,…
Mean:E[X]=1/p.
Variance:Var(X)=(1−p)/p2.
Memorylessness:P(X>m+n∣X>m)=P(X>n); the Geometric is the unique discrete distribution on N+ with this property.