Volumes on rings

Extensive quantities

"Measuring" in the sense of measure theory means quantify: What is asked is not "what value has ...?", "how fast/hot/bright is ...?", but "how much of ...?", "how big/heavy/numerous is ...?". So we are interested in quantities that change with the size of the underlying system: doubling the system doubles the quantity. In physics, quantities with this property are called extensive quantities. An example is mass: if we join two systems of boxes $K_{1}$ and $K_{2}$ with mass $m_{1}$ and $m_{2}$ , the resulting system $K$ has the larger mass $m_{1}+m_{2}$ . By contrast, quantities that do not change with the size of the underlying system are also called intensive. For example, temperature is an intensive quantity: pouring two liquids of one temperature together does result in the same temperature, and not the sum of the temperatures.

So whenever we quantify something in measure theory, we are measuring an extensive quantity. Other examples of such quantities are:

Example (Extensive quantities)

geometric volumes: the length of a distance, the area of a plane or surface, the volume of a geometrical object, ... even volumes of objects of non-integer dimension are conceivable (fractals).
the energy contained in a physical system
the amount of matter (how many atoms?)
the electric charge on a field
but also: the probability of events. If more possible results of a random experiment belong to an event, then the probability that the event occurs increases.

So we can understand the measure theory as a mathematical theory of measuring extensive quantities.

Volume-measuring functions

How can we grasp the measuring of such quantities mathematically? Obviously, by measuring a quantity, an assignment is described: A certain object (geometric body, event, accumulation of things) gets assigned exactly one "volume". Thus, the measuring of a quantity can be described by a function. The domain of definition of this function contains the objects to be measured. These can be understood as subsets of a huge set $\Omega$ . For example, (physical) objects. such as cubes, cuboids, spheres, cones, etc. can be seen as subsets of $\mathbb {R} ^{3}$ :

Example (Geometric objects as subsets of $\mathbb {R} ^{3}$ )

For example, cuboids can be described by sets of the form $[a_{1},b_{1}]\times [a_{2},b_{2}]\times [a_{3},b_{3}]\subseteq \mathbb {R} ^{3}$ , where $a_{i},b_{i}\in \mathbb {R}$ . It is often more clever to use half-open intervals of the form $(a_{i},b_{i}]$ instead of closed intervals: The "half-open cuboids" formed from these can be joined disjointly and without gaps.

A sphere with center at the origin and radius $r>0$ is represented by the set $\{x\in \mathbb {R} ^{3}:\|x\|\leq r\}$ .

Likewise, one can consider subsets of $\mathbb {N}$ when measuring an extensive quantity counting objects. So the domain of definition will be a set system ${\mathcal {C}}\subseteq {\mathcal {P}}(\Omega )$ of subsets of a large basic set $\Omega$ (to be determined more precisely in each case) and the function will be a function on sets. The function values correspond to the volumes assigned to these sets and are scalars in $\mathbb {R}$ . For now, let us call such a function a volume-measuring function.

Definition (Volume-measuring function)

We formalize the measurement of an extensive quantity via a function on a system ${\mathcal {C}}$ of subsets of a basic set $\Omega$ with values in $\mathbb {R}$ . Such a function

\mu :{\mathcal {P}}(\Omega )\supseteq {\mathcal {C}}\rightarrow \mathbb {R} ,

that describes the measurement of an extensive quantity, is called a volume-measuring function (on ${\mathcal {C}}$ ).

From a mathematical point of view, a volume-measuring function is nothing more than a function on sets with values in $\mathbb {R}$ . We investigate which further properties a volume-measuring function should have in order to be able to describe the measurement of an extensive quantity in a meaningful way.

Which properties shall a volume-measuring function have?

Non-negativity

It makes intuitive sense to require non-negativity of volume-measuring functions. After all, how should a negative volume be interpreted? Admittedly, there are situations in which one also allows negative numbers as dvalues of a volume-measuring function (signed measures). One example is the measurement of a total charge, which is composed of positive and negative parts. Sometimes the range of values is generalized even further, so that also complex numbers can appear as function values (complex measures) or certain linear mappings (spectral measures). However, all these cases arise as generalizations from the non-negative real situation, so we will restrict ourselves to the real case at the being. Moreover, infinity should also be allowed as a function value in certain cases: For instance, the geometric volume of $\mathbb {R} ^{n}$ should be infinite. So we require: a volume-measuring function maps to $[0,\infty ]:=\mathbb {R} _{0}^{+}\cup \{\infty \}$ .

Monotonicity

Volume-measuring functions are intended to formalize the measurement of extensive quantities. Extensive quantities are characterized by the fact that they increase with increasing size of the underlying system. For example, when determining the number of atoms in a sample of matter, one should count more atoms after adding a certain amount of the matter. This property should also be found in volume-measuring functions: enlarging a geometric object makes it gain more volume. Mathematically, this can be described by the term monotonicity of a function on sets:

Definition (Monotonic functions on sets)

A function $\mu :{\mathcal {P}}(\Omega )\supseteq {\mathcal {C}}\rightarrow \mathbb {R}$ is called monotonic, if for all sets $A,B\in {\mathcal {C}}$ with $A\subseteq B$ , we have that $\mu (A)\leq \mu (B)$ .

We will demand a volume-measuring function to be monotonic: larger sets have larger volumes.

Sub-additivity

The sets

A_{1},\dots ,A_{7}

cover the rectangle

A

.

B_{1},B_{2},B_{3}

cover

A

.

The monotonicity of the volume-measuring function guarantees that the value for a given set $A\in {\mathcal {C}}$ is always less than or equal to the value for any superset $B\in {\mathcal {C}}$ . But is this also true if the superset $B$ is a union of several sets? Intuitively, this should be the case: If a set $A\in {\mathcal {C}}$ is covered by sets $A_{1},A_{2},\dots ,A_{n}\in {\mathcal {C}}$ , then the volume-measuring function evaluated on $A$ should in no case yield a value larger than for the sum of the contents of the covering sets.

Mathematically expressed: for a volume-measuring function $\mu$ and sets $A,A_{1},\dots ,A_{n}\in {\mathcal {C}}$ it should hold that:

A\subseteq \bigcup _{i=1}^{n}A_{i}\implies \mu (A)\leq \sum _{i=1}^{n}\mu (A_{i}).

In particular, this property should be satisfied if the union of the covering sets $A_{1},\dots ,A_{n}$ itself is no longer in the domain of definition ${\mathcal {C}}$ . But this property is not yet expressed by the monotonicity!

Example

Consider the base set $\Omega =\{1,2,3,4\}$ and the set system ${\mathcal {C}}=\{\{1,2\},\{2,3,4\},\{1,2,3\}\}\subseteq {\mathcal {P}}(\Omega )$ . Let the function $\mu :{\mathcal {C}}\rightarrow [0,\infty ]$ be defined by $\mu (\{1,2\})=\mu (\{2,3,4\})=1$ and $\mu (\{1,2,3\})=3$ . Obviously, $\mu$ is monotonic. However, monotonicity no longer holds for finite covers! We have $\{1,2,3\}\subseteq (\{1,2\}\cup \{2,3,4\})$ , but $\mu (\{1,2,3\})=3>1+1=\mu (\{1,2\})+\mu (\{2,3,4\}).$

Thus, the property of functions on sets that monotonicity is preserved even in the case of finite covers has to be required additionally. We call it sub-additivity:

Definition ((finite) sub-additive function on sets)

A function $\mu :{\mathcal {P}}(\Omega )\supseteq {\mathcal {C}}\rightarrow \mathbb {R}$ is called (finitely) subadditive if for all $n\in \mathbb {N}$ and arbitrary sets $A,A_{1},\dots ,A_{n}\in {\mathcal {C}}$ it holds that:

A\subseteq \bigcup _{i=1}^{n}A_{i}\implies \mu (A)\leq \sum _{i=1}^{n}\mu (A_{i}).

If the covering of the set $A$ consists of only one set $B$ , this property corresponds exactly to monotonicity. So we can take sub-additivity as a generalization of monotonicity and replace the requirement of monotonicity of volume-measuring functions by this generalized version: measuring functions must be sub-additive.

Additivity

The properties of monotonicity or more generally of sub-additivity required so far make only an "approximate" statement: the function values of a sub-additive function on sets may be above the "real" value. As an example:

Example (Sub-additivity for outer approximations)

Let $\Omega =\{1,2\}$ and ${\mathcal {C}}=\{\{1\},\{2\},\{1,2\}\}\subseteq {\mathcal {P}}(\Omega )$ . The function $\mu :{\mathcal {C}}\rightarrow [0,\infty ]$ with $\mu (\{1\})=\mu (\{2\})=1$ and $\mu (\{1,2\})=42$ is obviously sub-additive. However, the function value of the quantity $\{1,2\}$ is only an upper bound for the exact value: To be expected would be $\mu (\{1,2\})=\mu (\{1\})+\mu (\{2\})=2$ .

Disjoint union with

n=7

Obviously, for a volume-measuring function to be "exact", this property must be additionally required: If a set $A\in {\mathcal {C}}$ is exactly covered by finitely many, pairwise disjoint sets $A_{1},\dots ,A_{n}$ , then the function value of the volume-measuring function evaluated at $A$ is said to be equal to the sum of the function values for the individual $A_{i}$ .

It is hence important whether the sets within a union are pairwise disjoint. To make explicit the pairwise disjointness within a union, we introduce a new notation:

Definition (Disjoint union)

A union of sets $A_{1},A_{2},\dots$ is called disjoint if the $A_{i}$ are pairwise disjoint, and we write

A_{1}\uplus A_{2}\quad {\text{or}}\quad \biguplus _{i=1}^{n}A_{i}

.

The above formulated condition of "exactness" of a function on sets is called (finite) additivity: for an exact covering of a set, decomposition and reassembly does not change its volume:

Definition ((finite) additive function on sets)

A function $\mu :{\mathcal {P}}(\Omega )\supseteq {\mathcal {C}}\rightarrow \mathbb {R}$ is called (finitely) additive if for all $n\in \mathbb {N}$ and all pairwise disjoint sets $A_{1},\dots ,A_{n}\in {\mathcal {C}}$ with $\biguplus _{i=1}^{n}A_{i}\in {\mathcal {C}}$ it holds that:

\mu \left(\biguplus _{i=1}^{n}A_{i}\right)=\sum _{i=1}^{n}\mu (A_{i})

.

We fix this desirable property of a volume-measuring function to be exact by demanding: volume-measuring functions must be additive.

Warning

In the definition of additivity, we imposed the restrictive condition that the union of $A_{i}$ also lies in the set system ${\mathcal {C}}$ , so that the left-hand side of the equation makes sense. This was not necessary the case for sub-additivity, because it was only a matter of covering (and not decomposing) a set.

It is important to note that due to this limiting condition, the additivity is even more dependent on the domain of definition ${\mathcal {C}}$ (i.e., a set system), than the sub-additivity. In particular, the additivity of a function on sets does not in general follow from its sub-additivity or monotonicity. A counter-example is a function on sets, whose domain of definition ${\mathcal {C}}$ does not contain disjoint sets:

Example (An additive, but non-monotone function on sets)

Let ${\mathcal {C}}=\{\{1\},\{1,2\}\}$ be a set system and $\mu$ a function on sets with $\mu (\{1\})=42$ and $\mu (\{1,2\})=1$ . Then $\mu$ is additive, but not sub-additive, not even monotonic.

However, we will see later that additivity implies sub-additivity if the domain of definition ${\mathcal {C}}$ has "a sufficiently good structure".

Last, the additivity of volume-measuring functions makes it desirable that $\mu (\emptyset )=0$ holds: Because of additivity, $\mu (\emptyset )=\mu (\emptyset \uplus \emptyset )=\mu (\emptyset )+\mu (\emptyset )$ must hold, and this condition is only satisfiable for $\mu (\emptyset )=0$ or $\mu (\emptyset )=\infty$ . If now $\mu (\emptyset )=\infty$ , then due to monotonicity $\mu (A)=\infty$ would have to hold for all sets $A\in {\mathcal {C}}$ . To exclude this pathological case, one additionally demands: For volume-measuring functions, $\mu (\emptyset )=0$ must hold.

Conclusion

Let us recap the properties that a volume-measuring function should have in order to meaningfully describe the measurement of an extensive quantity:

Such a function should map to $[0,\infty ]$ . In most cases, this makes sense. But we can still generalize to further extended ranges of values.
It should be sub-additive. We derived sub-additivity as an improved form of monotonicity that is preserved even in the presence of finite covers. In particular, any sub-additive function on sets is also monotonic. The monotonicity itself reflects the characteristic property of extensive quantities to grow with increasing size of the system under consideration.
It should be additive. Additivity corresponds to the property of a volume-measuring function to be "exact" and to be compatible with decomposing measured objects into finitely many parts and reassembling them.
It should assign the value zero to the empty set. This makes intuitive sense and also serves to exclude the case that the volume-measuring function is constantly infinite.

Rings: the domain of definition

In the considerations about additivity a difficulty has already appeared: Whether a set function is additive also depends on its domain of definition. The more sets are contained in the definition range, the more set ocmbinations we have to check for additivity, so it gets more difficult. After all, how large can a system ${\mathcal {C}}$ containing subsets of $\Omega$ be made(as the domain of definition of a volume-measuring function) without destroying its additivity? Indeed, the answer is not generally "the power set ${\mathcal {P}}(\Omega )$ " (whichh would be the largest possible set system). We have to stop before! An example where this happens is the elementary geometric volume on the $\mathbb {R} ^{n}$ :

The volume problem

The goal is to define an additive set function that describes the elementary geometric volume in $\mathbb {R} ^{p}$ . The problem of defining this function on all of ${\mathcal {P}}(\mathbb {R} ^{p})$ is also called the volmue problem. And it will turn out to be unsolvable!

Definition (The volume problem)

We are looking for a "content function" ${\mathcal {P}}(\mathbb {R} ^{n})$ defined on the power set $\mathbb {R} ^{n}$ $\mu :{\mathcal {P}}(\mathbb {R} ^{n})\rightarrow [0,\infty ]$ with the following properties:

finite additivity
motion invariance: for any motion $\beta :\mathbb {R} ^{n}\rightarrow \mathbb {R} ^{n}$ and for all $A\subseteq \mathbb {R} ^{n}$ it holds that $\mu (\beta (A))=\mu (A)$ . (A motion is an affine-linear transformation of the form $\beta (x)=Ox+b$ with $b\in \mathbb {R} ^{n}$ and orthogonal linear component $O$ , i.e., a rotation, reflection, or translation in space.)
Normalizedness: $\mu ([0,1]^{n})=1$

However, Hausdorff, Banach and Tarski were able to prove that such a measure cannot be found:

The volume problem is unsolvable for the $\mathbb {R} ^{p}$ if $p\geq 3$ . (Hausdorff, 1914)
The volume problem is solvable for the $\mathbb {R} ^{1}$ and the $\mathbb {R} ^{2}$ , but it is not uniquely solvable. (Banach, 1923)

The first case looks somewhat paradox: we would expect such a volume naturally to exist (why should subsets of the $\mathbb {R} ^{p}$ for $p\geq 3$ do not have a volume?) However, this is not the case! Building on Hausdorff's result, Banach and Tarski showed the following theorem, which is also known as reflecting the "Banach-Tarski paradox":

Theorem (Theorem of Banach and Tarski (1924))

Let $p\geq 3$ and let $A,B\subseteq \mathbb {R} ^{p}$ be bounded sets with nonempty interior. Then there are sets $C_{1},\dots ,C_{n}\subseteq \mathbb {R} ^{p}$ and motions $\beta _{1},\dots ,\beta _{n}$ such that $A$ is the disjoint union of sets $C_{1},\dots ,C_{n}$ and $B$ is the disjoint union of the sets $\beta _{1}(C_{1}),\dots ,\beta _{n}(C_{n})$ .

This theorem makes vividly clear that the volume problem on the $\mathbb {R} ^{p}$ cannot be solvable for $p\geq 3$ . The implications would be absurd: it follows from the theorem that a sphere (think of a pea or an orange) can be suitably divided into finitely many parts and assembled into a sphere the size of the sun. (Of course, this does not work in real life, simply because physical bodies are not continuous sets of points, but are composed of atoms. The quantities of such a decomposition are extremely complex and can best be illustrated as "point clouds", which in general cannot be stated explicitly).

Warning

Although the volume problem is intractable for $p\geq 3$ , not all volume-measuring functions make problems if we want to define it on the whole power set. It depends on the desired properties of the measurement function and on the basic set under consideration whether this is possible. For the volume problem, we only chose too strong requirements: it is the invariance of motion that causes difficulties for $p\geq 3$ . However, for "simpler" measurement functions, it may well be possible to define them on the whole power set. An example is the measurement function over a set $\Omega$ which is constant zero. Also, reducing the base set $\Omega$ may make our life easier.

Rings of sets

So, in general, we cannot hope to define an additive set function on the whole power set. This makes it necessary to think more carefully about the set system ${\mathcal {C}}$ , which is to serve as the domain of definition of volume-measuring functions. Mathematicians spent some considerable time on defining classes of such set systems and therefore, the literature on measure theory provides a whole zoo of such set systems: Semi-rings, rings, $\sigma$ -rings (pronounced "sigma-rings"), algebras, $\sigma$ -algebras, Dynkin systems, monotonous classes, ... You may find an overview in this article

Intuitively, the domain ${\mathcal {C}}$ of an additive (and subadditive) volume-measuring function should be stable under the following set operations: the disjoint union of finitely many sets " $\cup$ " and its counterpart, forming differences of sets " $\setminus$ ". Moreover, if one has the difference operation available, one can make arbitrary unions "artificially" disjoint: if $A$ and $B$ are (not necessarily disjoint) sets, then $A\cup B=A\uplus (B\setminus A)$ , where the two united sets on the right-hand side of the equation are again in ${\mathcal {C}}$ . So one can forget about disjointness and speak of arbitrary finite unions. Finally, to exclude trivial cases, we require ${\mathcal {C}}\neq \emptyset$ and can now define:

Definition (Ring of sets)

Let $\Omega$ be a set. A set system ${\mathcal {R}}\subseteq {\mathcal {P}}(\Omega )$ is called ring of sets (or ring for short) if:

${\mathcal {R}}\neq \emptyset$
$A,B\in {\mathcal {R}}\implies A\cup B\in {\mathcal {R}}$
$A,B\in {\mathcal {R}}\implies A\setminus B\in {\mathcal {R}}$

Hint

For every ring ${\mathcal {R}}$ we have $\emptyset \in {\mathcal {R}}$ : By condition 1 there is at least one element $A\in {\mathcal {R}}$ , and by 3 it also holds that $\emptyset =(A\setminus A)\in {\mathcal {R}}$ .
In point 2, we can use induction to extend the stability of the set system from 2 to any finite number of sets.

Examples for rings of sets

Over a base set $\Omega$ there are always the two rings ${\mathcal {R}}=\{\emptyset \}$ and ${\mathcal {R}}=\{\emptyset ,\Omega \}$ .

Example (Countable subsets)

The system ${\mathcal {R}}=\{A\subseteq \mathbb {R} \mid A{\text{ countable}}\}$ of countable subsets of $\mathbb {R}$ is a ring:

Since the empty set is countable, ${\mathcal {R}}$ is nonempty.
If $A,B\subseteq \mathbb {R}$ are two countable sets, then their union is also countable, so $A\cup B\in {\mathcal {R}}$ holds.
If $A,B\subseteq \mathbb {R}$ are countable, then because $A\setminus B\subseteq A$ their difference is also countable and $A\setminus B\in {\mathcal {R}}$ holds.

This is already an example containing a lot of sets. Especially, for finite ${\mathcal {C}}$ , the ring ${\mathcal {R}}$ can be chosen to be all of the power set ${\mathcal {P}}({\mathcal {C}})$ . For base sets with infinitely many points, such as ${\mathcal {C}}=\mathbb {N}$ , one can also easily construct rings:

Example (Finite subsets)

The set system ${\mathcal {R}}=\{A\subseteq \mathbb {N} \mid A{\text{ finite}}\}$ of finite subsets of $\mathbb {N}$ is a ring:

The empty set is finite, so $\emptyset \in {\mathcal {R}}$ is nonempty.
If $A,B\subseteq \mathbb {N}$ are finite, then so is their union, i.e., $A\cup B\in {\mathcal {R}}$ .
If $A,B\subseteq \mathbb {N}$ are finite, then because of $A\setminus B\subseteq A$ so is their difference, i..e, $A\setminus B\in {\mathcal {R}}$ .

Another important example of a ring in case ${\mathcal {C}}=\mathbb {R} ^{n}$ is the following:

Example (Ring of cuboids)

Some cuboids in

\mathbb {R} ^{2}

(in 2D, cuboids are juts rectangles)

Consider the set system

{\mathcal {Q}}=\left\{\prod _{k=1}^{n}I_{k}\mid I_{k}\subseteq \mathbb {R} {\text{ interval}}\right\}\subseteq \mathbb {R} ^{n}

of the axis-parallel cuboids in $\mathbb {R} ^{n}$ . (The intervals in the product may be open, half-open, or closed.) The ring of cuboids in $\mathbb {R} ^{n}$ is defined as the ring generated by ${\mathcal {Q}}$ , that is, the smallest ring containing ${\mathcal {Q}}$ . Its elements are so-called cuboids, i.e. finite unions of axis-parallel cuboids in $\mathbb {R} ^{n}$ . One can specify this ring explicitly: It is the intersection of all set rings over $\mathbb {R} ^{n}$ containing the cuboids ${\mathcal {Q}}$ . (Compare also the article on generated $\sigma$ -algebras.)

We have now found some properties that a volume-measuring functions should satisfy in order to meaningfully describe the measurement of an extensive quantity: non-negativity, subadditivity (especially monotonicity) and additivity. As natural domains of definition of such functions we juts defined rings.

Additivity is sufficient on rings

Above, we noticed that from the additivity of a volume-measuring function, the subadditivity does not follow in general. And not even its monotonicity - at least not if nothing else is known about the domain of definition ${\mathcal {C}}$ . But if the set system ${\mathcal {C}}$ is a ring, then the monotonicity and the subadditivity already follow from the additivity and non-negativity of the volume-measuring function. The reason is that by cut-stability of the set system, one can easily make unions "artificially" disjoint and thus exploit the property of additivity. When proving this statement, it is clever to first show the somewhat simpler property of monotonicity and then subadditivity.

Theorem (Additive, non-negative functions defined on rings of sets are monotonous)

Let ${\mathcal {R}}$ be a ring over the base set $\Omega$ and let $\mu :{\mathcal {P}}(\Omega )\supseteq {\mathcal {R}}\rightarrow [0,\infty ]$ be an additive and nonnegative function (on sets). Then $\mu$ is monotone.

Proof (Additive, non-negative functions defined on rings of sets are monotonous)

Let $A,B\in {\mathcal {R}}$ be such that $A\subseteq B$ . The idea is to determine the "more" of $B$ over $A$ : Since ${\mathcal {R}}$ is a ring, $B\setminus A\in {\mathcal {R}}$ and it due to additivity,

\mu (B)=\mu ((B\setminus A)\uplus A)=\mu (B\setminus A)+\mu (A)\geq \mu (A),

where in the last step we exploited the non-negativity of $\mu$ .

Theorem (Additive,non-negative functions defined on rings of sets are sub-additive)

Using the notion of the previous theorem, $\mu$ is subadditive.

Proof (Additive,non-negative functions defined on rings of sets are sub-additive)

Let $A,B_{1},\dots ,B_{n}\in {\mathcal {R}}$ besuch that $A\subseteq \bigcup _{i=1}^{n}B_{i}$ . We make the union of $B_{i}$ "artificially" disjoint:

\bigcup _{i=1}^{n}B_{i}=\biguplus _{i=1}^{n}B_{i}'{\text{, where }}B_{i}':=B_{i}\setminus \left(\biguplus _{k=1}^{i-1}B_{k}'\right)

Thus, each $B_{i}'$ is formed by taking out all the elements already contained in the sets $B_{1},\dots ,B_{i-1}$ . Note that the $B_{i}'$ are also in the ring ${\mathcal {R}}$ , as it is stable under difference and union.

Thus $B_{i}'\subseteq B_{i}$ also holds for all $i$ and it follows from additivity and the (already proven) monotonicity of $\mu$ :

\mu (A)\leq \mu \left(\bigcup _{i=1}^{n}B_{i}\right)=\mu \left(\biguplus _{i=1}^{n}B_{i}'\right)=\sum _{i=1}^{n}\mu (B_{i}')\leq \sum _{i=1}^{n}\mu (B_{i}).

Definition

Thus, one can characterize the monotonicity and subadditivity of a volume-measuring function defined on a ring by its non-negativity and additivity alone. So we can now summarize all the properties that a measurement function should possess in the following definition:

Definition (Volume on a ring)

Let ${\mathcal {R}}$ be a ring over the set $\Omega$ . A nonnegative function $\mu :{\mathcal {R}}\to [0,\infty ]$ is called volume function on ${\mathcal {R}}$ (or volume, for short) if:

$\mu (\emptyset )=0$ ,
$\mu$ is additive.

We call the value $\mu (A)$ of a set $A\in {\mathcal {R}}$ the volume of $A$ .

The condition $\mu (\emptyset )=0$ makes intuitive sense and also serves to exclude the pathological case $\mu \equiv \infty$ .

Hint

Sometimes, volumes are defined on other set systems, for example on so-called semi-rings or algebras.

Examples

Example

Let ${\mathcal {R}}=\{A\subseteq \mathbb {N} \mid A{\text{ finite}}\}$ be the ring of all finite subsets of $\mathbb {N}$ and let $\mu$ be the function

\mu (A):=|A|

which counts the number of elements of a set $A\in {\mathcal {R}}$ . Contents:

The empty set contains no elements, so $\mu (\emptyset )=0$ .
Let $A,B\in {\mathcal {R}}$ be disjoint. Because of disjointness, the number of elements in the union is the sum of the number of elements in $A$ and $B$ and it holds that $\mu (A\uplus B)=|A\uplus B|=|A|+|B|=\mu (A)+\mu (B)$ .

One can also consider this volume on the entire power set ${\mathcal {P}}(\mathbb {N} )$ . Then one only has to consider that the additivity holds even if one or both of the disjoint sets contain infinitely many elements.

Example

For any subset $A\in {\mathcal {P}}(\mathbb {N} )$ , we define

\mu (A):={\begin{cases}0,{\text{ if }}A{\text{ finite,}}\\\infty {\text{ else.}}\end{cases}}

This is a volume on ${\mathcal {P}}(\mathbb {N} )$ :

The empty set is finite, so $\mu (\emptyset )=0$ .
If $A$ and $B$ are disjoint, then $\mu (A\uplus B)=0=\mu (A)+\mu (B)$ if the sets are both finite. If either or both sets are infinite, additivity also holds: if w.l.o.g. $A$ is infinite, then $\mu (A\uplus B)=\infty =\infty +\mu (B)=\mu (A)+\mu (B)$ .

A particularly important example is the so-called elementary geometric volume:

Example (Geometric volume of cuboids)

We already know the ring ${\mathcal {R}}$ of cuboids, whose elements are finite unions of axis-parallel cuboids in $\mathbb {R} ^{n}$ . (An axis-parallel cuboid in $\mathbb {R} ^{n}$ is a product $I_{1}\times \dots \times I_{n}$ of intervals, each open, half-open, or closed.) One can show that any such cuboid can be written as a disjoint union of cuboids.

One can show that the function so defined is a volume on the ring of cuboid bodies ${\mathcal {R}}$ . This includes showing that $\lambda$ is well-defined, so that the value $\lambda (F)$ does not depend on the chosen decomposition of the cuboid $F$ into disjoint cuboids.

Further properties of volumes

We collect properties of volumes. In the following, let ${\mathcal {R}}$ be a ring (in particular closed under differences and finite unions) and let $\mu \colon {\mathcal {R}}\to [0,\infty ]$ be a volume.

Theorem (Subtractivity)

Volumes are subtractive: Let $A,B\in {\mathcal {R}}$ with $\mu (A)<\infty$ and $A\subseteq B$ . Then $\mu (B\setminus A)=\mu (B)-\mu (A)$ holds.

Proof (Subtractivity)

By additivity, $\mu (B)=\mu ((B\setminus A)\uplus A)=\mu (B\setminus A)+\mu (A)$ holds. Because of $\mu (A)<\infty$ we can subtract $\mu (A)$ on both sides of the equation and get the result.

Warning

One cannot do without the condition $\mu (A)<\infty$ ! If $\mu (A)=\infty$ holds, one gets the undefined expression $\mu (B)-\mu (A)=\infty -\infty$ . This cannot be sensibly defined. For example, if $\mu$ is the elementary geometric volume on $\mathbb {R}$ that assigns to each interval its length, then for $A=B=\mathbb {R}$ the value " $\infty -\infty$ " is $=\mu (B\setminus A)=0$ . But for $A=(-\infty ,0]$ and $B=\mathbb {R}$ one gets " $\infty -\infty$ " $=\mu (B\setminus A)=\infty$ .

Another property of finite volumes (i.e. $\mu (A)<\infty$ for all sets $A\in {\mathcal {R}}$ ) is the so-called inclusion-exclusion principle. It is important in probability theory (where $\mu (A)\leq 1<\infty$ ), and traces the volumes of a union of sets to a sum of volumes of intersections. Note that for a ring ${\mathcal {R}}$ and $A,B\in {\mathcal {R}}$ , we also have $A\cap B=A\setminus (A\setminus B)\in {\mathcal {R}}$ . Finite intersections of sets from ${\mathcal {R}}$ thus lie again in ${\mathcal {R}}$ .

Theorem (Inclusion-exclusion-Principle)

Let the content $\mu$ be finite and let $A_{1},A_{2},\dots ,A_{n}\in {\mathcal {R}}$ , $n\in \mathbb {N}$ . Then

\mu \left(\bigcup _{i=1}^{n}A_{i}\right)=\sum _{k=1}^{n}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}).

Proof (Inclusion-exclusion-Principle)

What does this formula even mean? We first make clear what it means for small $n$ :

For

n=2

sets

For $n=1$ we have

\mu (A_{1})=\mu (A_{1})

.

OK, this is obvious.

For $n=2$ we have

\mu (A_{1}\cup A_{2})=\mu (A_{1})+\mu (A_{2})-\mu (A_{1}\cap A_{2})

.

So the formula tells us: if we want to calculate the volume of a union, we first add the measures (=areas) of $A_{1}$ and $A_{2}$ . However, we have counted the overlap $A_{1}\cap A_{2}$ "twice" and must subtract it again.

For

n=3

sets

For $n=3$ we have

\mu (A_{1}\cup A_{2}\cup A_{3})=\mu (A_{1})+\mu (A_{2})+\mu (A_{3})-\mu (A_{1}\cap A_{2})-\mu (A_{1}\cap A_{3})-\mu (A_{2}\cap A_{3})+\mu (A_{1}\cap A_{2}\cap A_{3})

.

A slightly more complicated case: We add, first all 3 measures. Here, the "double overlap areas" $A_{1}\cap A_{2}\setminus A_{3}$ , $A_{1}\cap A_{3}\setminus A_{2}$ and $A_{2}\cap A_{3}\setminus A_{1}$ were counted twice. The "triple overlap area" $\mu (A_{1}\cap A_{2}\cap A_{3})$ even three times! Subtracting the three "double-overlaps" $mu(A_{i}\cap A_{k})$ removes the duplication in the $A_{1}\cap A_{2}\setminus A_{3}$ . But the measure of $A_{1}\cap A_{2}\cap A_{3}$ has been subtracted three times, that is, once too many. So we have to add it back at the end.

Here an induction over $n$ begins to appear: The step in $n=3$ can be generalized to higher $n$ . Our induction can then simply be started with $n=1$ .

Theorem whose validity shall be proven for the $n\in \mathbb {N}$ :

\mu \left(\bigcup _{i=1}^{n}A_{i}\right)=\sum _{k=1}^{n}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}).

1. Base case:

For $n=1$ we obviously have $\mu (A_{1})=\mu (A_{1})$ .

1. inductive step:

2a. inductive hypothesis:

\mu \left(\bigcup _{i=1}^{n-1}A_{i}\right)=\sum _{k=1}^{n-1}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n-1}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}).

2b. induction theorem:

\mu \left(\bigcup _{i=1}^{n}A_{i}\right)=\sum _{k=1}^{n}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}).

2b. proof of induction step:

In the step $n-1\to n$ , the measure $\mu (A_{n})$ must be added to the union on the left-hand side - but minus the sets that are already included. So we have

Graphical depiction of the induction step

\mu \left(\bigcup _{i=1}^{n}A_{i}\right)=\mu \left(\bigcup _{i=1}^{n-1}A_{i}\right)+\mu (A_{n})-\mu \left(A_{n}\cap \bigcup _{i=1}^{n-1}A_{i}\right).

To the first term on the right $\mu \left(\bigcup _{i=1}^{n-1}A_{i}\right)$ (corresponding to the 3 circles in the figure) we can apply the induction assumption, because the union goes only up to $n-1$ :

\mu \left(\bigcup _{i=1}^{n-1}A_{i}\right)=\sum _{k=1}^{n-1}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n-1}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}).

In the third term on the right (corresponding to the red area in the figure), it makes no difference whether we intersect $A_{n}$ with the whole union, or first intersect all sets individually and then take the union. In both cases, everything outside $A_{n}$ is "cut away" from the union:

\mu \left(A_{n}\cap \bigcup _{i=1}^{n-1}A_{i}\right)=\mu \left(\bigcup _{i=1}^{n-1}(A_{n}\cap A_{i})\right)=\sum _{k=1}^{n-1}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n-1}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}\cap A_{n}).

Substituting the first and third terms on the right yields:

\mu \left(\bigcup _{i=1}^{n}A_{i}\right)=\sum _{k=1}^{n-1}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n-1}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}})+\mu (A_{n})-\sum _{k=1}^{n-1}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n-1}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}\cap A_{n}).

What does the complicated expression on the right-hand side mean? And does it actually correspond to the induction assertion? The right-hand side consists of 3 sums, whose summands already have the desired form from the induction assertion (including the correct signs).

The first term now contains all summands in which $A_{n}$ does not occur.
The second term $A_{n}$ contains all summands in which $A_{n}$ but no other $A_{i}$ occurs.
The third term contains all summands in which $A_{n}$ and at least one other $A_{i}$ occurs.

So, in total, we have covered each summand from the induction assertion exactly once. This is because each summand contains either $A_{n}$ or not and either one of the terms $A_{1},...,A_{n-1}$ or not. So

\mu \left(\bigcup _{i=1}^{n}A_{i}\right)=\sum _{k=1}^{n}(-1)^{k-1}\sum _{1\leq i_{1}<\dots <i_{k}\leq n}\mu (A_{i_{1}}\cap \dots \cap A_{i_{k}}).

which establishes the induction assertion.

Continuity of volumes on rings →

Feedback? Do you want to join?

If you have questions concerning the content, or didn't understand something, the feel free to contact us! We would love to answer your questions! Also we are thankful for critics and/or comments! If you share our vision to explain university math in an comprehensible way, then contact us under:

E-Mail: en@serlo.org

This article is licensed under the free license CC-BY-SA 3.0. With that you can use it, modify it or share it freely, as long as you name „Serlo“ as source and put you changes under the same CC-BY-SA 3.0 oder an compatible license. On the page „Kopier uns!“ we explain you what you have to pay attention to, when using our texts, picture or videos.