Introduction to Measure

Basis
Last updated: Tags: Measure Theory

You already know that the interval [a,b][a, b] has length bab - a. But what is the “length” of the set of all rationals in [0,1][0, 1]? Or of a set built by an adversarial construction that avoids any obvious description? Measure theory is the branch of mathematics that answers these questions rigorously — and in doing so, it becomes the foundation of modern probability, integration, and analysis.

Why naïve length is not enough

For a finite union of disjoint intervals, length is obvious: add up the lengths of the pieces. The trouble starts when you try to assign lengths to more exotic subsets of R\mathbb{R}.

The rational numbers have measure zero

The rationals Q[0,1]\mathbb{Q} \cap [0, 1] are dense in [0,1][0, 1]: between any two reals you can find a rational. Yet there are only countably many of them — as you saw in Countable Set. This tension is resolved by measure theory:

λ(Q[0,1])=0.(1)\lambda(\mathbb{Q} \cap [0,1]) = 0. \tag{1}

The intuition: arrange the rationals in a sequence q1,q2,q3,q_1, q_2, q_3, \ldots Cover qkq_k with an open interval of length ε/2k\varepsilon / 2^k. The union of all these intervals covers every rational in [0,1][0,1], yet has total length at most

k=1ε2k=ε.\sum_{k=1}^{\infty} \frac{\varepsilon}{2^k} = \varepsilon.

Since ε\varepsilon can be made arbitrarily small, the rationals occupy zero length — even though they are everywhere.

You cannot measure every subset

The real surprise is that there exist subsets of R\mathbb{R} to which you cannot consistently assign a length. The Vitali set is the canonical example. Partition [0,1][0, 1] by the equivalence relation xy    xyQx \sim y \iff x - y \in \mathbb{Q}. By the Axiom of Choice, pick exactly one representative from each class to form a set VV. If you try to assign VV a length λ(V)0\lambda(V) \geq 0, you run into a contradiction: the translates V+qV + q (for qQ[1,1]q \in \mathbb{Q} \cap [-1, 1]) are pairwise disjoint, their union covers [0,1][0, 1], and — if length were countably additive — the total length would have to equal 11 while each summand is the same number, which is impossible whether that number is 00 or positive.

The lesson: not every set can be measured. Measure theory’s first task is to identify which sets can be measured.

What measure theory builds

The fix has two parts.

First, instead of trying to assign a size to every subset, you restrict attention to a carefully chosen family of subsets called a σ-algebra — a collection that is closed under complement and countable union, making it stable under all the set operations you actually need. The Vitali set lies outside this family.

Second, you define a function μ\mu on the σ-algebra that assigns a non-negative number (possibly ++\infty) to each measurable set. The key property is countable additivity: for pairwise disjoint measurable sets E1,E2,E_1, E_2, \ldots,

μ ⁣(k=1Ek)=k=1μ(Ek).(2)\mu\!\left(\bigsqcup_{k=1}^{\infty} E_k\right) = \sum_{k=1}^{\infty} \mu(E_k). \tag{2}

Together, a σ-algebra F\mathcal{F} on a set XX and a measure μ ⁣:F[0,+]\mu \colon \mathcal{F} \to [0, +\infty] satisfying (2)(2) form a measure space (X,F,μ)(X, \mathcal{F}, \mu).

The road ahead

In the next few checkpoints you will build the Lebesgue measure on R\mathbb{R} step by step:

  1. σ-Algebra — the precise definition of the domain of a measure.
  2. Outer Measure — a preliminary “size” defined on all subsets by covering with intervals; it is monotone and sub-additive but not fully additive.
  3. Carathéodory’s Criterion — the rule that picks out exactly the sets on which the outer measure becomes a true measure.
  4. Lebesgue Measure — the outer measure from step 2 restricted to the measurable sets from step 3, giving the definitive notion of length on R\mathbb{R}.

Summary

  • The length of an interval is the starting point, but extending length to arbitrary subsets of R\mathbb{R} requires care.
  • The rationals have measure zero despite being dense: a countable set can always be covered by intervals of arbitrarily small total length, as in equation (1)(1).
  • The Vitali set shows that not every subset of R\mathbb{R} can be assigned a consistent length: the domain of a measure must be restricted.
  • The solution is a measure space (X,F,μ)(X, \mathcal{F}, \mu): a set XX, a σ-algebra F\mathcal{F} of measurable subsets, and a countably additive function μ\mu on F\mathcal{F} — see equation (2)(2).
  • The Lebesgue measure, built in the next checkpoints, is the canonical example on R\mathbb{R}.