How to prove convergence and divergence – Serlo

In this chapter, we will explain how convergence and divergence of a sequence can be proven. Usually, this job splits into two steps: At first, one tries some brainstorming (with a pencil on a piece of paper), trying to find a way to prove convergence or divergence. Then, if one has a solution, one tries to write it down in a short and elegant way. What you can read in most math books is the result of the second step. The aim of step 2 ist to conserve your thoughts for further people (or a later version of yourself), such that the reader can understand the proof investing as little time / effort as possible. However, the thoughts which a mathematician has when trying to find a proof (in step 1) are often quite different from what is written down in step 2. The problems given in this chapter will illustrate both steps and the difference between how to get to a proof and how to write it down.

Be aware that for proving convergence or divergence, there is no "cooking recipe", which will always lead you to a working proof! There is rather a collection of tools you can always carry around with you (like a Swiss pocket knife) and which you can use for mathematical problem solving. In this chapter, we would like to provide you with a specific collection of such tools. Not every problem you encounter in an exercise class is directly solvable with one tool and sometimes, you need to creatively combine different tools and techniques in order to crack open a problem. Those exercises are designed for purpose: They are intended to train your skills in solving abstract problems by combining strategies you know in creative and uncommon ways. These problem solving skills turn out to be useful in a broad variety of jobs and even for your personal life. This is the reason, why math courses are part of the syllabus in school and a huge variety of university degree programmes! (Torturing you with technical formulas is only secondary :)).

Proving convergence

Bearbeiten
An example how to prove convergence of a sequence (video in German)

General structure of proof

Bearbeiten

Before turning to examples, we take a closer look at the structure of the proof. This way, we know what the final proof should look like. There is a mathematical definition for convergence of a sequence   to a limit  , which looks as follows:

 

I.e. after passing an index number  , all elements with index number   coming afterwards will be closer to the limit than a small amount  . The proof that this statement holds looks as follows:

 

If you explicitly write down, what the natural number   looks like, you do not need to explicitly show that   exists, because writing down an explicit expression is already a proof.

An example problem

Bearbeiten

"Does the sequence   with   converge? If yes, what is the limit? Prove all your claims about convergence/ divergence."

The solution involves the following steps:

  1. Investigate the sequence for high  , in order to find a limit  
  2. Make up a proof that   converges to   with a pencil on a piece of paper
  3. Write down the proof in a short way, structured as above

Finding a limit

Bearbeiten

How does the sequence   behave for high  ? Will it converge? In order to get more confident with the sequence, there are some actions you could try:

  • Calculate the first couple of elements: Compute the first couple of elements and what they explicitly look like. It is also useful to draw them in a diagram. Do they increase or decrease faster and faster? Are they jumping between two values? Or do they seem to approach a certain value?
  • Compute some elements with huge indices: What is  ? How about  ? A calculator or PC can be useful to find that out. Or you can make rough estimates what those   are. Do they come close to a specific real number  ? Then this may be your limit.
  • Make assertions based in intuition: The more problems you solve, the more you get an intuition what a limit might be. Polynomials   always run to infinity, as  . Exponentials   run away even faster. Sequences like   and   tend to 0. For fractions  , the higher power wins and exponential functions win against everything (if you feel confused now: we will later explain in detail what these intuitions mean).

So let us start to compute the first (let's say 10) elements of the sequence   :

   
1 0,5
2 0,666…
3 0,75
4 0,8
5 0,833…
6 0,857…
7 0,875
8 0,888…
9 0,9
10 0,909…

We could also plot them in a diagram:

 
The first 10 elements of a_n=n/(n+1)

The elements seem to be monotonously increasing. The increase is getting smaller and smaller. Perhaps, it converges. But what can be a possible limit? Maybe, we compute some elements with high index numbers to see it:

 

or even higher:

 

These numbers are suspiciously close to   . So we may assert that   is the limit of the sequence. This intuitively makes sense in the following way: For high   , there is   (  is almost the same as  ). So for the quotient, we expect

 

following these considerations, we make the assertion that   is the limit of the sequence  .

Warning

The argumentation above is not a full mathematical proof. We only assert that 1 is the limit, but we don't know it yet. Writing down some assertions is a helpful technique but does not score you any points in an exam. You always need a proof in order to consider a problem as solved.

Finding the proof steps

Bearbeiten

The proof builds around establishing an inequality of the form  . It is best to start with   and try to find greater and greater terms, until one of them is   . Throughout these estimates, we can make any assumptions on   of the form   with   being a natural number only depending on   and   (note   must not depend on   !). If   is too small, we just choose a bigger  .

Question: Why is   not allowed to depend on  ?

In the proof structure above, we see that   is defined such that   . So it depends on  , which values for   are allowed. If one makes   depending on   , we would have some circle of dependences, which may or may not be resolved. For instance, if we have   and choose  , then there is  , which does trivially hold for all  . However, a condition like   amounts to  , which can never be fulfilled for any  . So we might run into the trap of imposing a condition on  , which can never be fulfilled.

In the proof structure above, we see that the   is defined before considering an   or   . So   should not depend on  .

We can also recall the definition of convergence in order to see the dependence issue:

 

  may only depend on what has been fixed before, i.e. what is on the left side of it:   and  . By contrast, the variable   is being introduced after   and stands on the right of it. Therefore,   may neither depend on   nor on   .

Another often successful technique is to take   and get   standing alone on one side. That means, we use some equivalent reformulations of the term in order to get it in the form   with   some term depending on  . If we choose   such that  , then for any  , there is also  .   is just equivalent to   (if we only use equivalent reformulations, of course). So we have found a suitable  , such that   for all  . The following chapters will provide some examples how these techniques are applied.

Sometimes, we encounter several conditions on  , such that   can be made sure to hold, for instance  ,  ,…, . In those cases, we just take the maximum  . We can imagine   as thresholds which   has to cross in order that   approaches   up to  . After   has passed the highest of the thresholds  , all other thresholds will have been passed, too and all conditions for   are met. Usually, one makes up the conditions with   in step 1 on a piece of paper, whereas when writing down the proof in step 2, one only defines   (without giving the derivation).

Now let us return to the example. It is very useful to equivalently reformulate  :

 

We know that this term must get lower than any   for sufficiently high  . This is sometimes also called Archimedean axiom: for all   there exists an   with   . We can directly reach   by choosing   , since we instantly get  . Hence, it suffices if   meets the following condition:

 

So we found the desired bound with condition  . For writing down the proof, we choose  , with   being the fixed number from the Archimedean axiom above.

Writing down the proof

Bearbeiten

Now, we go over to step 2 and write down the proof. The final solution is quite short and concise:

Proof

Let   be arbitrary. By the Archimedean axiom, there is an   with  . We choose  . For all   there is:

 

That's it already! If we compare the proof above (step 2) with the derivation (step 1), we notice that they look entirely different. The proof is enormously short and the   and   seem to appear out of nowhere! It ma appear to you that the mathematician who wrote the proof is some kind of superhuman genius who has found some magical way to always and instantly get the right   and  . However, this is not the case. In the beginning, the author often had no idea how the proof works and performed a lot of trial and error, as well as lengthy considerations in step 1. He/ she just did not write them down, because they would just require additional space on the paper and additional time to read.

Übungsaufgabe

Bearbeiten

We recommend that you try to solve the following exercise by yourself

Exercise (Convergence of a sequence)

Prove that the sequence   with   converges. What is its limit?

How to get to the proof? (Convergence of a sequence)

We proceed as above. At first, let us find the limit:

Lösungsschritt: Finding the limit

For fractions of polynomials, it is a good idea to think about what happens for great   . In this case, the enumerator is approximately  , and the denominator  . Hence,

 

for great   . We therefore assert that   converges to   .

Now, we need to do some conceptual calculations(with pen and paper) in order to get a proof:

Lösungsschritt: Finding the proof steps

By the definition of convergence, for any given   we need to find an   , such that for all   there is:  . We first simplify the expression  :

 

Now, let us re-formulate the inequality   into the form  , i.e. a condition on  :

 

So we found a suitable condition for  ,which tells us how to choose   . If we choose   (which is possible by the Archimedean axiom) the above inequalities imply for all  :  .

Our conceptual calculations are done, here. Now we need to formulate a concise mathematical proof out of them.

Proof (Convergence of a sequence)

Let   be arbitrary. By the Archimedean axiom, there is an   with  . Let  . Then,

 

Conducting proofs for divergence

Bearbeiten

General proof structure

Bearbeiten

Divergence of a sequence occurs by definition if and only if the series is not convergent. Or in other words, divergence is the negation of convergence. That means, in the formal definition we have to switch quantifiers   and exchange  . (or   or   , respectively.) So divergence of   amounts to:

 

The structure of proof is hence:

 

Some parts of the proof can later be omitted if they are obvious. But the proof structure is always the same.

An example

Bearbeiten
 
The sequence  

Now, let us take a look at an example for a proof of divergence:

„Does the sequence   with   diverge? Proof your assertion.“

The above techniques (compute the first sequence elements, see what happens for large element numbers) can be applied to make an assertion. the first sequence elements are   and they start to grow quickly. For big  , the sequence elements become very large, as   . So   should diverge. Now, let us try to find a proof for this assertion.

How to get the proof

Bearbeiten

The proof will base on an inequality chain of the form

 

So we start with the term   and try to make it smaller and smaller, until we hit some   .   is assumed to be arbitrary and we can not put any bounds on it: The proof must be conducted for all  .

However,   and   can be chosen arbitrarily. We only need to respect that   and   where   is again some arbitrary natural number. Since   is introduced in the proof after   our   is allowed to depend on   (but not on  ). The natural number   may depend both on  , and on   . So within the estimate chain, we can set a bunch of conditions on   and   . Later, those will be collected and put together in the beginning of the proof.

So, let us start with the expression  , where   is arbitrary. We expect   to get large, so it makes sense to use that it eventually gets larger than  . In case   , we can omit the absolute which makes things a lot easier. Intuitively, it is clear that there must be an   such that   holds, since the  -terms get arbitrarily large. Mathematically, we can use the implications of the Bernoulli inequality:

„For each   and each   there is an  , such that   .“

In our case, setting   and   implies  :

 

(This will later be considered obvious and does not have to be proven, then). Now, we need to show   in order to complete the inequality chain:

 

Which means, we are done, if we can find an   with  . In this case, for any   there is an   with   , since   grows arbitrarily large. The only condition we have to observe for the proof is   .Let us just take  . For   , the two conditions   and   have to be simultaneously fulfilled. This can be done by choosing  .

Writing down the proof

Bearbeiten

Now, we have all material necessary to conduct a full proof:

Proof

Let   be arbitrary. Choose  . Let   be arbitrary. choose   such that   (This is possible by the implications of the Bernoulli inequality). Then,

 

Further proof methods for convergence and divergence

Bearbeiten

The examples above were some very detailed proofs of convergence or divergence, using the Epsilon-definition. Establishing those proofs may take a while. In practice, there are several tricks which short-cut the proof and make you directly recognize whether a sequence converges or diverges (and experienced mathematicians are frequently using them):

  • All unbounded and monotonous sequences diverge. Examples:   diverge, as well as the above  , or  , or in general   with   and many more...
  • A monotonous but bounded sequence must always converge. Examples:   is bounded from below by   and from above by  . It increases monotonously, so it must converge.
  • Let   . If for any   there is an   with   for all   (i.e. sequence elements get closer and closer together) , then   converges. This is called Cauchy-criterion and widely used in mathematics. Basically any time,   is complicated or not well-known, which is mostly the case in abstract proofs.
  • The Limit theorems tell you what happens if you add or multiply sequence, Examples:   diverge and so do   or  .
  • The Squeeze theorem proves convergence by squeezing a sequence between a smaller and a larger sequence. This avoids the Epsilon-definition and can save a lot of effort.