The basic rules handle arithmetic combinations of functions, but they do not tell you how to differentiate a composition like (x3+1)100 or sin(x2). The chain rule fills that gap: it expresses the derivative of f∘g in terms of the individual derivatives of f and g.
Statement
Theorem (Chain rule). Let g be differentiable at x and f be differentiable at g(x). Then h:=f∘g is differentiable at x and
h′(x)=f′(g(x))⋅g′(x).(1)
In Leibniz notation, with u=g(x) and y=f(u):
dxdy=dudy⋅dxdu.
Proof
The naive cancellation kf(g(x+k))−f(g(x))=g(x+k)−g(x)f(g(x+k))−f(g(x))⋅kg(x+k)−g(x) fails when g(x+k)=g(x) for some nonzero k (the first factor is then 0/0). The following auxiliary-function argument handles this case cleanly.
Since f is differentiable at g(x), ϕ is continuous at g(x).
For any k=0, setting u=g(x+k):
f(g(x+k))−f(g(x))=ϕ(g(x+k))⋅(g(x+k)−g(x)),
which holds whether or not g(x+k)=g(x) (both sides are 0 when equal). Dividing by k:
kf(g(x+k))−f(g(x))=ϕ(g(x+k))⋅kg(x+k)−g(x).
As k→0: the right factor converges to g′(x); since g is continuous at x, g(x+k)→g(x), so ϕ(g(x+k))→ϕ(g(x))=f′(g(x)). Therefore (f∘g)′(x)=f′(g(x))⋅g′(x). □
Examples
Power of a polynomial
((x3+1)5)′=5(x3+1)4⋅3x2=15x2(x3+1)4.
Linear substitution
((2x+1)100)′=100(2x+1)99⋅2=200(2x+1)99.
Abstract composition
If f is differentiable and g(x)=f(x2+3x), then g′(x)=f′(x2+3x)⋅(2x+3).
Summary
Chain rule: (f∘g)′(x)=f′(g(x))⋅g′(x).
The proof uses an auxiliary function ϕ to avoid division by zero when g(x+k)=g(x).