Supremum and infimum – Serlo

Introduction

"Supremum" (from the Latin "supremum", meaning "the highest" or "the uppermost") sounds as if it were simply the maximum, that is, the largest element of a set. In the course of this article, however, we will see that the supremum generalizes the maximum. Let's start by remembering the following:

Every maximum is a supremum, but not every supremum is a maximum.

While the maximum has to be an element of the set under consideration, this need not hold for the supremum. We can therefore aptly describe the supremum as "the number that immediately bounds the set from above". It "bounds from above" because, like the maximum, it is greater than or equal to every number in the set. And it does so "immediately" because it is the smallest of all numbers that bound the set from above.

Similarly, the infimum is a generalization of the minimum. It is the "number that immediately bounds the set from below", i.e. the largest of all numbers that bound the set from below. We will get to know concrete examples in the coming sections.

The concept of the supremum is important for us because it gives an alternative way to describe the completeness of the real numbers. In addition, the supremum is a useful tool in proofs and in defining new concepts.

Explanation of the supremum

To explain the supremum, we will examine how to arrive at its precise definition by generalizing the maximum. Remember: the maximum of a set is its largest element. The maximum $m$ of a set $M$ has the following properties:

  • $m$ is an element of $M$.
  • For every $y \in M$ we have $y \le m$.

The second property uses "less than or equal to" rather than "less than" because $y$ may be equal to $m$. For finite, non-empty sets the maximum always exists, but this is not necessarily the case for infinite sets.

First of all, we may encounter the problem that the set under consideration is unbounded above. Take for example the set $\mathbb{R}^+ = \{x \in \mathbb{R} : x > 0\}$. This set cannot have a maximum, since for every real number there is a larger number in $\mathbb{R}^+$. There is also no number that could serve as the "immediately bounding" number from above. Asking for such a number simply does not make sense for this set.

To carry the notion of "maximum" over to infinite sets, the set must therefore be bounded above. So there must be a number $b$ that is greater than or equal to every element of the set. Note that $b$ does not have to be an element of the set.

 
But even then problems can still arise. Take for example the set $M = \{x \in \mathbb{R} : x < 1\}$. This set is bounded above, because any number greater than or equal to $1$ can be chosen as $b$.

Does the set $M$ have a maximum? Unfortunately not. For each $x \in M$ there is another number $y \in M$ with $y > x$ (for instance, $y = \frac{x+1}{2}$ lies halfway between $x$ and $1$). So $M$ cannot have a maximum, because for every number in $M$ there is at least one larger number in $M$.
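
The midpoint argument can be checked on a few values. The following is a minimal sketch, assuming the example set is $M = \{x \in \mathbb{R} : x < 1\}$; the helper names `in_M` and `larger_element` are hypothetical and chosen only for illustration. Exact fractions are used to avoid rounding.

```python
from fractions import Fraction

def in_M(x):
    """Membership test for the assumed example set M = {x : x < 1}."""
    return x < 1

def larger_element(x):
    """Return the midpoint between x and 1, which lies in M and exceeds x."""
    return (x + 1) / 2

# Start somewhere in M and repeatedly move to a larger element of M.
x = Fraction(0)
chain = [x]
for _ in range(5):
    x = larger_element(x)
    chain.append(x)

print(chain)  # a strictly increasing chain of elements of M
```

However far the chain is continued, every entry stays below $1$ and is followed by a larger one, so no entry can be the largest element of $M$.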

Thus, when we pass to infinite sets, the maximum loses one of its properties, namely that it is an element of the set[1]:

  • (no longer required) $m$ is an element of $M$.
  • For every $y \in M$ we have $y \le m$.
 
[Figure: The supremum is the "least upper bound"]

The only property that remains is that the number we are looking for is greater than or equal to every element of the set. Such a number is called an "upper bound" of the set:

Definition (Upper bound)

Let $M$ be a subset of $\mathbb{R}$. A number $b \in \mathbb{R}$ that is greater than or equal to every element of $M$ is called an upper bound of $M$. That is, $x \le b$ for all $x \in M$.

Similarly, a lower bound is a number that bounds a set from below:

Definition (Lower bound)

Let $M$ be a subset of $\mathbb{R}$. A number $a \in \mathbb{R}$ that is less than or equal to every element of $M$ is called a lower bound of $M$. That is, $a \le x$ for all $x \in M$.

Looking at the new definition, we notice two things. First, upper and lower bounds do not have to be elements of the set under consideration, because the definition does not require this. Second, the definition says nothing about whether the bounds are unique.

For example, consider the set $M = \{x \in \mathbb{R} : x < 1\}$. Here we certainly first think of $1$ as an upper bound. However, a much larger number such as $1000$ is also an upper bound and meets the requirements of the definition. Apart from the fact that $1000$ lies far above our example set, neither number is an element of the set. This example shows that there can be more than one upper bound. In fact, the situation is even more extreme: a subset of the real numbers that is bounded above always has infinitely many upper bounds. If $b$ is an upper bound of $M$, then every larger number, i.e. $b + x$ for all $x > 0$, is also an upper bound.

On closer inspection, the notions of upper and lower bound are not very precise. They provide much less than the notion of a maximum does. A maximum is always unique: a set can have at most one. That is not the case for upper bounds. Let us therefore try to improve the concept.

Consider again the set $M = \{x \in \mathbb{R} : x < 1\}$ as an example. Which number could serve as a generalization of the maximum for $M$? Intuitively the number $1$ comes to mind. But why choose this number?

We want a general notion that still works when the set is not described so explicitly. The candidates are all upper bounds of $M$, i.e. all numbers greater than or equal to $1$. Among these, our number should be optimal in the sense that it is as small as possible. This leads us to the number $1$. It is not only an upper bound of $M$, it is also the smallest upper bound of $M$: we have already seen that for each $x \in M$ there is another number $y \in M$ with $y > x$ (namely $y = \frac{x+1}{2}$). Thus no number smaller than $1$ can be an upper bound of $M$, and $1$ is what we regard as the number "immediately above" $M$.

Question: What might a set look like for which it is not intuitively "clear" what its supremum could be?

 
[Figure: The Mandelbrot set]

Let's briefly have a look at this beautiful set: the Mandelbrot set. It is obtained by feeding every point of a two-dimensional coordinate system into a certain function $f$. This function takes a point $(x, y)$ and turns it into another point $(x', y')$. The result is fed back into the function, again and again. For some starting points the coordinates obtained in this way become very large very quickly; for others they remain small. Once the coordinates have moved far enough away from the origin (have exceeded a bound $R$), they never come back and "run away". If the iteration for the starting point $(x, y)$ always remains below $R$, the point $(x, y)$ belongs to the set and is colored black. If it exceeds $R$, the point gets a color depending on how many steps it took to exceed $R$. The figure shows the resulting image.

The Mandelbrot set lies in the plane: its points have an $x$- and a $y$-coordinate, so at first it does not fit our concept of the supremum. But we can simply look at the set of all $y$-coordinates of points in the Mandelbrot set and try to find its supremum. To put it more vividly: we would like to know how far up the black points in the picture reach, and we are looking for the smallest upper bound. What exactly its value is, however, is completely unclear at first (and also at second) glance[2].
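
To make the question concrete, here is a rough numerical sketch. It assumes the standard definition of the Mandelbrot set, which the text above does not spell out: the map $f(z) = z^2 + c$ is iterated starting from $z = 0$ with escape radius $R = 2$, where $c = x + iy$ is the point being tested. The function names, the grid spacing and the iteration cap are illustrative choices. The result is only a coarse estimate of the supremum of the $y$-coordinates, not its exact value.

```python
def stays_bounded(c, max_iter=100, radius=2.0):
    """Heuristic membership test: iterate z -> z*z + c starting at z = 0.

    Returns False as soon as |z| exceeds the escape radius, and True if
    that has not happened after max_iter steps (an approximation only).
    """
    z = 0j
    for _ in range(max_iter):
        z = z * z + c
        if abs(z) > radius:
            return False
    return True

def estimate_top(step=0.01, max_iter=100):
    """Scan grid rows from y = 1.5 downwards and return the first y for
    which some grid point in that row is not seen to escape."""
    rows = int(round(1.5 / step))
    cols = int(round(2.5 / step))
    for j in range(rows, -1, -1):
        y = j * step
        for i in range(cols + 1):
            x = -2.0 + i * step
            if stays_bounded(complex(x, y), max_iter):
                return y
    return None

print(estimate_top())
```

Points just outside the set can take many steps to escape, so a finite iteration cap may overestimate the top slightly, while a coarse grid may miss thin parts of the set and underestimate it. Refining `step` and raising `max_iter` tightens the estimate but never proves a value.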

The smallest upper bound $s$ of a set $M$ is characterized by the following two properties:

  • $s$ is an upper bound of $M$: for every $x \in M$ we have $x \le s$.
  • Every upper bound $b$ of $M$ is at least as large as $s$: if $x \le b$ for all $x \in M$, then $s \le b$. Put differently: for every $y < s$ there is at least one number $x \in M$ with $x > y$.

We can use this as the definition of the supremum, since it evidently characterizes the smallest upper bound. The infimum is defined analogously as the largest lower bound. We will get to know another way of characterizing the supremum and infimum in the section "Suprema and Infima in Partial Orderings".

Definition of the Supremum and Infimum

 
[Figure: The supremum is the smallest upper bound of a set.]

The definitions of the supremum and the infimum are as follows:

Definition (Supremum)

Let $M$ be a subset of $\mathbb{R}$. The supremum $s \in \mathbb{R}$ of the set $M$ is the smallest upper bound of $M$. It is characterized by the following two properties:

  • For every $x \in M$ we have $x \le s$.
  • No number $y$ less than $s$ is an upper bound of $M$: for all $y < s$ there exists at least one number $x \in M$ with $x > y$.

Definition (Infimum)

Let $M$ be a subset of $\mathbb{R}$. The infimum $t \in \mathbb{R}$ of the set $M$ is the largest lower bound of $M$. It is characterized by the following two properties:

  • For every $x \in M$ we have $x \ge t$.
  • No number $y$ larger than $t$ is a lower bound of $M$: for all $y > t$ there exists at least one number $x \in M$ with $x < y$.

The Epsilon Definition

The second property in the definition of the supremum $s \in \mathbb{R}$ of the set $M$ reads:

"No number $y$ less than $s$ is an upper bound of $M$: for all $y < s$ there exists at least one number $x \in M$ with $x > y$."

In the mathematical literature and in textbooks, authors often write $y = s - \epsilon$ with $\epsilon > 0$. This gives another way to state the second property of the supremum formally. Namely, we can replace the second property given above by the equivalent statement:

"For all $\epsilon > 0$ there exists some $x \in M$ with $x > s - \epsilon$."

Since both statements are equivalent and differ only in wording, we are free to choose whichever variant is more convenient in a proof.
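
As an illustration of the epsilon form, the following sketch assumes the example set $M = \{x \in \mathbb{R} : x < 1\}$ with candidate supremum $s = 1$. For a given $\epsilon > 0$ it returns the witness $x = s - \epsilon/2$, which is one of many possible choices. The function name `witness` is hypothetical.

```python
from fractions import Fraction

S = Fraction(1)  # candidate supremum of the assumed set M = {x : x < 1}

def witness(eps):
    """For eps > 0, return an element x of M with x > S - eps."""
    if eps <= 0:
        raise ValueError("eps must be positive")
    return S - eps / 2

for eps in (Fraction(1), Fraction(1, 10), Fraction(1, 1000)):
    x = witness(eps)
    # x lies in M (x < 1) and exceeds the would-be smaller bound S - eps.
    print(eps, x, x < 1, x > S - eps)
```

Since a witness can be produced for every positive $\epsilon$, no number of the form $1 - \epsilon$ is an upper bound of $M$, which is the second supremum property in its epsilon form.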


Question: What is the epsilon definition of the infimum?

Answer: $t$ is the infimum of $M$ if $t$ is a lower bound of $M$ and if for every $\epsilon > 0$ there exists some $x \in M$ such that $x < t + \epsilon$ holds.

Maximum and Minimum

For the maximum and minimum we have the following well-known definitions:

Definition (Maximum)

The maximum $m$ of a set $M$ is a number with the following two properties:

  • $m \in M$.
  • For all $x \in M$ we have $x \le m$.

Definition (Minimum)

The minimum $m$ of a set $M$ is a number with the following two properties:

  • $m \in M$.
  • For all $x \in M$ we have $x \ge m$.

From these definitions it follows immediately that the maximum of a set is also its supremum. Indeed, let $m$ be the maximum of the set $M$. First, $m$ is by definition an upper bound of $M$. Second, for every $y$ with $y < m$ there exists an $x \in M$ with $x > y$, namely $x = m$. On the other hand, not every supremum is a maximum, as we saw above with the set $M = \{x \in \mathbb{R} : x < 1\}$. The number $1$ is the supremum of this set, but not its maximum! Analogous statements hold for the minimum and infimum.

Notation

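Notation | Meaning
$\sup M$ | Supremum of $M$
$\sup_{x \in M} f(x)$ | Supremum of $\{f(x) : x \in M\}$
$\inf M$ | Infimum of $M$
$\inf_{x \in M} f(x)$ | Infimum of $\{f(x) : x \in M\}$
$\max M$ | Maximum of $M$
$\min M$ | Minimum of $M$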

The Duality Principle

We have already seen in the definitions and explanations above that the notions of supremum and infimum behave analogously. The reason is that reversing the order on the real numbers, i.e. replacing $\le$ with $\ge$, turns the supremum into the infimum and the infimum into the supremum. More precisely, we can introduce a new order $\preceq$ such that $x \preceq y$ holds if and only if $x \ge y$ (this amounts to reflecting the real numbers at zero). With respect to this new order, the supremum plays the role of the infimum and vice versa. Both orders $\le$ and $\preceq$ have the same order-theoretic properties; that is, they are isomorphic to one another. Therefore the properties of the infimum and supremum carry over to the reversed order. This means that every statement we prove for suprema has a counterpart for infima and vice versa. The same holds for statements about the maximum and minimum.

Example (Duality Principle)

For all $x \in M$ we have $x \le \sup M$. Dually, for all $x \in M$ we have the inequality $x \ge \inf M$.
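
For finite, non-empty sets, where the supremum and infimum coincide with the maximum and minimum, the duality can be checked directly: reflecting the set at zero swaps the two notions. The sketch below uses a made-up sample set and Python's built-in `max` and `min`; the helper name `reflect` is hypothetical.

```python
def reflect(numbers):
    """Reflect a finite set of numbers at zero: x -> -x."""
    return {-x for x in numbers}

M = {-3, 0, 2, 7}  # an arbitrary finite sample set

# Reflection turns the largest element into the smallest one and vice versa.
print(max(M), -min(reflect(M)))  # prints 7 7
print(min(M), -max(reflect(M)))  # prints -3 -3
```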

Existence and Uniqueness

Up to this point we have always spoken of "the" supremum. This sounds as if the supremum always existed and were always unique. That raises an important question: why did we bother defining the supremum in the first place? If it could fail to exist just as the maximum can, introducing it would not solve the existence problem. Intuitively, among all upper bounds of a non-empty set that is bounded above there should be exactly one smallest upper bound, i.e. this smallest upper bound should exist and be unique. However, we have not yet proven this. Both statements are indeed true: below we prove uniqueness formally and indicate where existence comes from.

In the following theorem we will prove the uniqueness of the supremum (and infimum), i.e. that a set can have at most one supremum and one infimum.

Theorem (Uniqueness of the Supremum and Infimum)

A set can have at most one supremum and one infimum.

Proof (Uniqueness of the Supremum and Infimum)

We use the standard method for proving uniqueness: we assume that some set $M$ has two suprema $s$ and $s'$ and then show $s = s'$. Both suprema have the following properties:

  • $s$ and $s'$ are upper bounds of $M$.
  • No number less than $s$ is an upper bound of $M$, and no number less than $s'$ is an upper bound of $M$.

By the second property for $s$, and since $s'$ is an upper bound of $M$, $s'$ cannot be smaller than $s$ and must therefore satisfy $s' \ge s$. By the same argument with the roles exchanged, $s \ge s'$. Since $s \le s'$ and $s' \le s$, we conclude $s = s'$. The proof of the uniqueness of the infimum is analogous.

Using the completeness axiom one can also prove the existence of the supremum of every non-empty subset of the real numbers that is bounded above. However, we will not deal with this in this chapter. A similar statement holds for the existence of the infimum of a non-empty subset of the real numbers that is bounded below. In summary, a non-empty subset of the real numbers that is bounded above and below has both a supremum and an infimum, and each of them is unique.

Digression: Suprema and Infima in Partial Orderings

We introduced the above definitions of supremum and infimum for sets of real numbers. This is sufficient for an introductory real analysis course, since such courses mostly deal with subsets of $\mathbb{R}$. More advanced courses also consider partial orderings, which are reflexive, antisymmetric and transitive but need not be total, i.e. there may be elements $x, y$ for which neither $x \le y$ nor $y \le x$ holds. For partial orderings, the definitions given above are not sufficient to obtain a sensible notion of supremum or infimum. The main issue is that with these definitions the supremum and infimum would no longer be unique. To ensure uniqueness, we instead use the following definition:

Definition (Supremum in Partial Orderings)

In a partially ordered set $(P, \le)$, an element $s \in P$ is the supremum of a set $M \subseteq P$ if:

  • $s$ is an upper bound of $M$: for every $x \in M$ we have $x \le s$.
  • For every other upper bound $b$ of $M$ we have $s \le b$.
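
A small example may help to see how this definition behaves when the order is not total. The sketch below uses divisibility on a finite set of positive integers as the partial order ($a \le b$ meaning that $a$ divides $b$). This example and the function names `divides` and `supremum` are illustrative assumptions, not taken from the text above.

```python
def divides(a, b):
    """Partial order used in this sketch: a <= b iff a divides b."""
    return b % a == 0

def supremum(subset, universe):
    """Return the supremum of subset within universe, or None if none exists."""
    upper_bounds = [u for u in universe if all(divides(x, u) for x in subset)]
    # A supremum is an upper bound that lies below every upper bound.
    least = [s for s in upper_bounds if all(divides(s, b) for b in upper_bounds)]
    return least[0] if least else None

print(supremum({4, 6}, range(1, 61)))    # prints 12
print(supremum({2, 3}, [2, 3, 12, 18]))  # prints None
```

In the first call the supremum is the least common multiple of 4 and 6. In the second call, 12 and 18 are both upper bounds of $\{2, 3\}$, but neither divides the other, so there is no least upper bound within that universe.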

To show that this definition is a sensible generalization of the supremum to partial orderings, we must show that both definitions coincide for subsets of the real numbers:

Theorem (Equivalent Definition of the Supremum)

Let $M \subseteq \mathbb{R}$ be arbitrary. Our definition of the supremum $s$ is:

  • For every $x \in M$ we have $x \le s$.
  • No number $y$ less than $s$ is an upper bound of $M$: for all $y < s$ there exists at least one number $x \in M$ with $x > y$.

This definition is equivalent to the definition of the supremum with respect to partial orderings:

  • $s$ is an upper bound of $M$: for every $x \in M$ we have $x \le s$.
  • For every other upper bound $b$ of $M$ we have $s \le b$.

Proof (Equivalent Definition of the Supremum)

Let $M \subseteq \mathbb{R}$ be arbitrary. Since the first properties of the two pairs are identical, we only have to show that the second properties coincide, i.e. we have to show the equivalence of:

  • No number $y$ less than $s$ is an upper bound of $M$: for all $y < s$ there exists at least one number $x \in M$ with $x > y$.
  • For every other upper bound $b$ of $M$ we have $s \le b$.

We can formalize the two claims as follows:

  • $\forall y \in \mathbb{R}: \left( y < s \implies \exists x \in M: x > y \right)$
  • $\forall b \in \mathbb{R}: \left( (\forall x \in M: x \le b) \implies s \le b \right)$

We can show the equivalence of these two claims in the following way: