When an experiment consists of repeating the same binary trial many times — flipping a coin, testing manufactured parts, sampling survey responses — the natural question is: how many successes occur? The Binomial distribution answers exactly this question.
Setup
Fix an integer n≥1 and a probability p∈[0,1]. Perform n independent Bernoulli(p) trials and let X count the total number of successes. Formally, let X1,X2,…,Xn be independent with each Xi∼Bernoulli(p); then
X:=X1+X2+⋯+Xn.
We write X∼Binomial(n,p), or X∼Bin(n,p).
PMF derivation
X takes values in {0,1,…,n}. To find P(X=k), count all ways the n trials can yield exactly k successes.
Combinatorial argument. Any particular sequence of k successes and n−k failures occurs with probability pk(1−p)n−k (by independence). The number of such sequences — choosing which k of the n positions are successes — is (kn). Summing over all sequences gives the probability mass function (PMF):
P(X=k)=(kn)pk(1−p)n−k,k=0,1,…,n.
This is a valid PMF because ∑k=0n(kn)pk(1−p)n−k=(p+(1−p))n=1 by the Binomial theorem.
Mean
Linearity of expectation lets us avoid any direct computation from the PMF. Using the representation X=∑i=1nXi and the fact that E[Xi]=p for each Bernoulli indicator:
E[X]=i=1∑nE[Xi]=np.
No independence is required — linearity holds unconditionally.
Variance
Independence of the Xi is needed here. Because the indicators are independent, their variances add:
Var(X)=i=1∑nVar(Xi)=n⋅p(1−p)=np(1−p).
Additive property
Theorem. If X∼Bin(m,p) and Y∼Bin(n,p) are independent, then
X+Y∼Bin(m+n,p).
Proof via MGFs. The MGF of X∼Bin(m,p) is obtained by multiplying m independent Bernoulli MGFs:
MX(t)=((1−p)+pet)m.
Similarly MY(t)=((1−p)+pet)n. Because X and Y are independent, the MGF of their sum factors:
MX+Y(t)=MX(t)⋅MY(t)=((1−p)+pet)m+n,
which is the MGF of Bin(m+n,p). Since the MGF uniquely determines the distribution, the result follows. □
Intuition. Running m independent Bernoulli trials followed by n more independent Bernoulli trials — all with the same p — is indistinguishable from running m+n trials in one go.
Summary
- X∼Bin(n,p) counts successes in n independent Bernoulli(p) trials.
- PMF: P(X=k)=(kn)pk(1−p)n−k for k=0,1,…,n.
- Mean: E[X]=np (by linearity of expectation).
- Variance: Var(X)=np(1−p) (by independence of indicators).
- MGF: M(t)=((1−p)+pet)n.
- Additive: the sum of independent Bin(m,p) and Bin(n,p) is Bin(m+n,p).