Cosets of a subspace – Serlo

Derivation: coset or affine subspace

Lines in $\mathbb{R}^2$

You probably already know the concept of a straight line. But how do we describe a line in $\mathbb{R}^2$ mathematically? You know from school that you can parameterise a straight line by $v + \lambda u$, where $v, u \in \mathbb{R}^2$ are two fixed vectors with $u \neq 0$ and $\lambda$ takes all values in $\mathbb{R}$. That is, the points on the straight line form the set $g = \{v + \lambda u \mid \lambda \in \mathbb{R}\}$. Geometrically, this is the (infinitely long) line running through $v$ in the direction of $u$.

 
An affine line is described by the support vector v and the direction vector u.

In general, a line does not pass through the origin $0$. In that case $g$ is not a subspace of $\mathbb{R}^2$, since by definition every subspace contains the origin. However, the line $g$ is a displaced version of the line $U = \{\lambda u \mid \lambda \in \mathbb{R}\}$ by the vector $v$. Here $U$ is a line passing through the origin. This is a subspace because it contains the origin and is closed under addition and scalar multiplication. That is, every straight line is given by the choice of a (one-dimensional) subspace $U$ and a vector $v$. This justifies the notation $g = v + U$. This notation can also be formalised:

For a subspace $U \subseteq \mathbb{R}^2$ consider a vector $v \in \mathbb{R}^2$. Let $v + U$ be defined by $v + U := \{v + u \mid u \in U\}$. Then the sets $g$ and $U$ defined above satisfy $g = v + U$.
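The description of a line as a displaced subspace can be turned into a membership test. The following is a minimal sketch, assuming a line in the plane given by a support vector and a non-zero direction vector; the function name `on_line` and the concrete vectors are hypothetical and only serve as illustration. A point lies on the line exactly when its difference to the support vector is a scalar multiple of the direction vector.

```python
from fractions import Fraction

def on_line(p, v, u):
    """Return True if p lies in v + U, where U = {t*u : t real} and u != (0, 0)."""
    # p is in v + U exactly when p - v is a scalar multiple of u,
    # i.e. when the 2x2 determinant with columns (p - v) and u vanishes.
    dx, dy = p[0] - v[0], p[1] - v[1]
    return dx * u[1] - dy * u[0] == 0

# Hypothetical support and direction vectors (exact rational arithmetic).
v = (Fraction(1), Fraction(2))
u = (Fraction(3), Fraction(1))

print(on_line((Fraction(4), Fraction(3)), v, u))  # the point v + 1*u
print(on_line((Fraction(0), Fraction(0)), v, u))  # the origin
```

With these sample vectors the origin is not on the line, so this particular line is not a subspace.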

Planes in $\mathbb{R}^3$

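Let's increase the dimension and consider $\mathbb{R}^3$. We can describe a line in analogy to the two-dimensional case as the set $\{v + \lambda u \mid \lambda \in \mathbb{R}\}$ with vectors $v \in \mathbb{R}^3$ and $u \in \mathbb{R}^3 \setminus \{0\}$. This is a line through the origin, displaced by the vector $v$. So formally again, any line is of the form $v + U$ for a vector $v \in \mathbb{R}^3$ and a one-dimensional subspace $U \subseteq \mathbb{R}^3$.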

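What about the planes in $\mathbb{R}^3$? We parameterise them by $v + \lambda_1 u_1 + \lambda_2 u_2$, where $v, u_1, u_2 \in \mathbb{R}^3$ are fixed vectors and $\lambda_1, \lambda_2$ range over all values in $\mathbb{R}$. The vectors $u_1$ and $u_2$ must not be scalar multiples of each other, otherwise we would get a line (or a single point). All points on the plane form the set $E = \{v + \lambda_1 u_1 + \lambda_2 u_2 \mid \lambda_1, \lambda_2 \in \mathbb{R}\}$. As in the case of lines, the plane $E$ is generally not a subspace, since the origin need not lie in $E$. However, the plane is a displaced version of the subspace $U = \{\lambda_1 u_1 + \lambda_2 u_2 \mid \lambda_1, \lambda_2 \in \mathbb{R}\}$ by the vector $v$. It is therefore analogously true that every plane is given by a two-dimensional subspace and a vector, i.e. that $E = v + U$.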

 
An affine plane is described by the support vector v and the direction vectors u_1 and u_2.
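The same idea gives a membership test for affine planes. The sketch below assumes integer or rational coordinates and uses the cross product of the two direction vectors as a normal vector; the helper names `cross`, `dot` and `in_plane` as well as the sample vectors are hypothetical. A point lies in the plane exactly when its difference to the support vector is perpendicular to that normal.

```python
def cross(a, b):
    """Cross product of two vectors in R^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Standard scalar product."""
    return sum(x * y for x, y in zip(a, b))

def in_plane(p, v, u1, u2):
    """Return True if p lies in v + U with U = span{u1, u2}."""
    n = cross(u1, u2)
    if n == (0, 0, 0):
        # u1 and u2 are scalar multiples of each other: no plane is spanned.
        raise ValueError("direction vectors do not span a plane")
    diff = tuple(pi - vi for pi, vi in zip(p, v))
    return dot(n, diff) == 0

# Hypothetical example: the x-y-plane lifted to height 1.
v, u1, u2 = (0, 0, 1), (1, 0, 0), (1, 1, 0)
print(in_plane((5, -2, 1), v, u1, u2))
print(in_plane((0, 0, 0), v, u1, u2))
```

In this sample the origin does not lie in the plane, which matches the remark that an affine plane is generally not a subspace.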

Lines in $(\mathbb{Z}/5\mathbb{Z})^2$

We can also look at straight lines in a more complicated space: We consider the $\mathbb{Z}/5\mathbb{Z}$-vector space $(\mathbb{Z}/5\mathbb{Z})^2$. In the article vector space we have already seen that we can think of this vector space as a regular grid of points on a torus. Now what is a "straight line" on this torus? We have seen in the previous two sections how we can describe straight lines in the vector spaces $\mathbb{R}^2$ and $\mathbb{R}^3$: There a straight line is the same as a set $\{v + \lambda u \mid \lambda \in \mathbb{R}\}$ with a support vector $v$ and a direction vector $u \neq 0$. In other words, it is the set $v + U$, where $U$ is a one-dimensional subspace. We can transfer this construction to $(\mathbb{Z}/5\mathbb{Z})^2$, that is, we can consider a straight line as $v + U$, where $U$ is a one-dimensional subspace of $(\mathbb{Z}/5\mathbb{Z})^2$. That is, $v + U$ is of the form $\{v + \lambda u \mid \lambda \in \mathbb{Z}/5\mathbb{Z}\}$ with $u \neq 0$. We can visualise this set on a torus:

 
points of an affine line in (Z/5Z)^2 on a torus

The points appear to lie on a line. If we connect neighbouring points by the shortest path, we get a closed curve that looks like a straight line on the torus.

 
Points of an affine straight line in (Z/5Z)^2 on a torus, connected by a line

Thus, displaced one-dimensional subspaces also correspond to straight lines here.

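We consider another example of a straight line in $(\mathbb{Z}/5\mathbb{Z})^2$. Consider a one-dimensional subspace $U = \{\lambda u \mid \lambda \in \mathbb{Z}/5\mathbb{Z}\}$ with $u \neq 0$. We shift this by a vector $v$. Thus we obtain the line $v + U$. Here a line consists of only five vectors. In our case $v + U = \{v, v + u, v + 2u, v + 3u, v + 4u\}$.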
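Because a line in this space has only five points, it can be listed completely. The following sketch enumerates such a line; the direction vector `(1, 2)` and the support vector `(3, 0)` are illustrative assumptions and are not taken from the text, and `affine_line` is a hypothetical helper name.

```python
P = 5  # we compute in (Z/5Z)^2, so all coordinates are taken modulo 5

def affine_line(v, u, p=P):
    """Return the set v + U with U = {t*u : t in Z/pZ}, assuming u != (0, 0)."""
    return {((v[0] + t * u[0]) % p, (v[1] + t * u[1]) % p) for t in range(p)}

u = (1, 2)  # hypothetical direction vector spanning U
v = (3, 0)  # hypothetical support vector

line = affine_line(v, u)
print(sorted(line))
print(len(line))
```

The five points wrap around modulo 5, which is what makes the line close up on the torus.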

We have characterised geometric objects (e.g. lines and planes) as displaced subspaces in various vector spaces. Let's give them a name.

Definition: coset or affine subspace

Definition (Affine subspace or coset)

Let $V$ be a $K$-vector space and $U$ a subspace of $V$, so $U \subseteq V$. Further let $v \in V$. Then, the set $v + U := \{v + u \mid u \in U\}$ is called the affine subspace or coset obtained by translating $U$ by the vector $v$.

Derivation: set of cosets of a subspace

We have defined cosets as displaced subspaces. Consider the following example, in which a subspace $U$ of $\mathbb{R}^2$ is displaced by two different vectors $v_1$ and $v_2$:

 
Different displacements of a subspace leading to the same affine subspace

In the example above, we see that different displacements of a subspace can lead to the same affine subspace. So we ask ourselves the following question:

When are two shifted subspaces $v_1 + U_1$ and $v_2 + U_2$ the same?

Let us first imagine the whole thing in $\mathbb{R}^2$, where both shifted subspaces are lines. If they are equal, they have the same slope. The slope determines the lines through the origin $U_1$ and $U_2$. It follows that $U_1$ and $U_2$ must be equal.

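Let us now consider the question for general vector spaces. So let $V$ be a vector space, $U_1, U_2 \subseteq V$ be subspaces, $v_1, v_2 \in V$ be vectors, and suppose that $v_1 + U_1 = v_2 + U_2$ as sets. We would like to first conclude (as in $\mathbb{R}^2$) that $U_1 = U_2$. To do this, it would be nice to get $U_1$ from $v_1 + U_1$. This is done by taking all vectors of $v_1 + U_1$ and subtracting $v_1$, which indeed gives us $U_1$. Hence, we can write $U_1$ as:

$$U_1 = (v_1 + U_1) - v_1 = (v_2 + U_2) - v_1 = (v_2 - v_1) + U_2$$

Since $U_1$ is a subspace, we have $0 \in U_1$. The above equation thus implies $0 \in (v_2 - v_1) + U_2$, i.e., there is a $u \in U_2$ such that $0 = (v_2 - v_1) + u$, i.e., $v_1 - v_2 = u$. In particular, $v_1 - v_2 \in U_2$.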

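More generally, for each subspace $U$ and vector $u \in U$, we have $u + U = U$. The reason is that each $x \in U$ can be written as $x = u + (x - u)$. Since $x - u \in U$, we have $x \in u + U$. Conversely, every element $u + y$ with $y \in U$ lies in $U$, because $U$ is closed under addition. Geometrically, you can also imagine the whole thing like this: If you move the subspace $U$ in a direction in which it already lies, it is mapped onto itself.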

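Back to our original question: Since $v_1 - v_2 \in U_2$ and $U_2$ is closed under additive inverses, we know that $v_2 - v_1 \in U_2$ and therefore $(v_2 - v_1) + U_2 = U_2$. So all in all we get the desired $U_1 = (v_2 - v_1) + U_2 = U_2$. On the way we have also seen that $v_1 - v_2 \in U_1$ is a necessary criterion for $v_1 + U_1 = v_2 + U_2$.

Are these criteria also sufficient? Yes: Suppose we have $v_1 + U_1$ and $v_2 + U_2$ with $U_1 = U_2 =: U$ and $v_1 - v_2 \in U$. Then $(v_1 - v_2) + U = U$ and hence, by adding $v_2$ on both sides, we have $v_1 + U = v_2 + U$.

Let us summarise: Two shifted subspaces $v_1 + U_1$ and $v_2 + U_2$ are equal exactly if the (non-shifted) subspaces are equal, i.e. $U_1 = U_2$, and the difference of the shifts lies in $U_1$, i.e., $v_1 - v_2 \in U_1$.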
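In a finite vector space this criterion can be checked exhaustively. The following sketch does so for all one-dimensional subspaces of $(\mathbb{Z}/5\mathbb{Z})^2$; the choice of this space and the helper names `span`, `shift`, `sub` and `criterion_holds` are assumptions made for illustration. It is a sanity check on one example and does not replace the argument above.

```python
from itertools import product

P = 5
V = list(product(range(P), repeat=2))  # all 25 vectors of (Z/5Z)^2

def span(u):
    """One-dimensional subspace spanned by u != (0, 0)."""
    return frozenset(((t * u[0]) % P, (t * u[1]) % P) for t in range(P))

def shift(v, U):
    """The shifted subspace v + U."""
    return frozenset(((v[0] + x) % P, (v[1] + y) % P) for x, y in U)

def sub(a, b):
    """Difference a - b in (Z/5Z)^2."""
    return ((a[0] - b[0]) % P, (a[1] - b[1]) % P)

# All one-dimensional subspaces (different u can span the same subspace).
lines = {span(u) for u in V if u != (0, 0)}

def criterion_holds():
    """Compare set equality with the criterion for every choice of data."""
    for U1, U2 in product(lines, repeat=2):
        for v1, v2 in product(V, repeat=2):
            same_set = shift(v1, U1) == shift(v2, U2)
            criterion = (U1 == U2) and (sub(v1, v2) in U1)
            if same_set != criterion:
                return False
    return True

print(len(lines))
print(criterion_holds())
```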

Given a subspace $U$, we can now find out whether two displacements by $v$ or $w$ give the same affine subspace. We can thus construct a kind of "new equality" by considering $v$ and $w$ to be "equal" if they produce the same affine subspace. Such new equalities behave reasonably if they are equivalence relations.

Recall the definition of an equivalence relation.

Definition (Equivalence relation)

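An equivalence relation is a homogeneous binary relation $\sim$ on a set $M$ that has the following properties:

  • reflexive: $a \sim a$ for all $a \in M$
  • symmetric: $a \sim b$ implies $b \sim a$ for all $a, b \in M$
  • transitive: $a \sim b$ and $b \sim c$ imply $a \sim c$ for all $a, b, c \in M$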

Two elements that are related by an equivalence relation are called equivalent. If two elements $a$ and $b$ are equivalent to each other with respect to an equivalence relation $R$, one often writes $a \sim_R b$ or simply $a \sim b$.

To formally write down the "new equality" mentioned above, we define a relation $\sim_U$ on $V$ given by $v \sim_U w :\iff v - w \in U$. Intuitively, our relation should be an equivalence relation, since it says when two shifted subspaces are equal. We now check this formally:

Theorem ($\sim_U$ is an equivalence relation)

The relation $\sim_U$ defined by $v \sim_U w :\iff v - w \in U$ is an equivalence relation. This means that the relation is reflexive, symmetric and transitive.

How to get to the proof? ($\sim_U$ is an equivalence relation)

To show the assertion, we need to verify the three axioms of an equivalence relation: reflexivity, symmetry and transitivity. For reflexivity, we have to show for all $v \in V$ that $v \sim_U v$ is satisfied. By definition of $\sim_U$ we must show that $v - v \in U$. Now $v - v = 0$ and $U$ is a subspace. Therefore $0 \in U$ and hence reflexivity holds.

For transitivity and symmetry we proceed in the same way: we insert the definition and infer the desired property from the fact that $U$ is a subspace.

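Proof ($\sim_U$ is an equivalence relation)

Proof step: Reflexivity

Since $U$ is a subspace, we have $0 \in U$. For an arbitrary vector $v \in V$ we have $v - v = 0 \in U$. By definition of the relation, it follows that $v \sim_U v$ for all $v \in V$.

Proof step: Symmetry

We want to show that from $v \sim_U w$ we can infer $w \sim_U v$. So let $v \sim_U w$. Therefore, $v - w \in U$. As $U$ is a subspace, $U$ must be closed under taking additive inverses. So $-(v - w) \in U$. But this is equivalent to $w - v \in U$. Hence, $w \sim_U v$.

Proof step: Transitivity

Finally, we need to show that from $v \sim_U w$ and $w \sim_U x$ we get $v \sim_U x$. So let $v \sim_U w$, i.e., $v - w \in U$, and $w \sim_U x$, i.e., $w - x \in U$. Since $U$ is a subspace, $U$ must be closed under addition. So in particular $(v - w) + (w - x) \in U$. As $(v - w) + (w - x) = v - x$, we also have $v - x \in U$ and hence $v \sim_U x$.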

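We can now consider the equivalence classes of this relation, that is, for $v \in V$ we consider the set $[v] := \{w \in V \mid w \sim_U v\}$. So the set $[v]$ consists of all vectors $w$ that displace $U$ to the same affine subspace $w + U = v + U$. How else can we characterise these equivalence classes? We have

$$[v] = \{w \in V \mid w \sim_U v\} = \{w \in V \mid w - v \in U\} = \{w \in V \mid \exists u \in U : w = v + u\} = \{v + u \mid u \in U\} = v + U$$

That is, the equivalence classes of our relation are precisely the cosets $v + U$.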
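For a small finite example, both the three axioms and the description of the equivalence classes can be verified by brute force. The sketch below assumes the vector space $(\mathbb{Z}/5\mathbb{Z})^2$ and the subspace spanned by $(1, 2)$; both choices, as well as the function names `related`, `eq_class` and `coset`, are hypothetical and only illustrate the statement.

```python
from itertools import product

P = 5
V = list(product(range(P), repeat=2))
# Hypothetical subspace U = span{(1, 2)} of (Z/5Z)^2.
U = {(t % P, (2 * t) % P) for t in range(P)}

def related(v, w):
    """The relation from the text: v ~ w if and only if v - w lies in U."""
    return ((v[0] - w[0]) % P, (v[1] - w[1]) % P) in U

def eq_class(v):
    """Equivalence class of v: all w that are related to v."""
    return {w for w in V if related(w, v)}

def coset(v):
    """The shifted subspace v + U."""
    return {((v[0] + x) % P, (v[1] + y) % P) for x, y in U}

reflexive = all(related(v, v) for v in V)
symmetric = all(related(w, v) for v in V for w in V if related(v, w))
transitive = all(related(v, x)
                 for v in V for w in V if related(v, w)
                 for x in V if related(w, x))
classes_are_cosets = all(eq_class(v) == coset(v) for v in V)

print(reflexive, symmetric, transitive, classes_are_cosets)
```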

Just as we can look at an equivalence relation and its equivalence classes, we can also construct a space in which the "new equality" of the equivalence relation becomes a real equality. This is the set of equivalence classes to which we now want to give a special name.

Definition: set of cosets of a subspace

Definition (Set of cosets of a subspace)

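Let $V$ be a $K$-vector space and $U$ a subspace of $V$, so $U \subseteq V$. Define the relation $\sim_U$ on $V$ by $v \sim_U w :\iff v - w \in U$. Then $\sim_U$ is an equivalence relation on $V$, and the equivalence class of an element $v \in V$ is the set $v + U = \{v + u \mid u \in U\}$. We call this the coset generated by $v$ with respect to $U$.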

We define

$$V/U := \{v + U \mid v \in V\}$$

to be the set of cosets of $U$.
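For a finite vector space the set of cosets can be computed explicitly as a set of sets. The following sketch again assumes $(\mathbb{Z}/5\mathbb{Z})^2$ with the hypothetical subspace spanned by $(1, 2)$; the name `quotient` is chosen only for this illustration. Different vectors may generate the same coset, so the 25 vectors collapse to fewer elements.

```python
from itertools import product

P = 5
V = list(product(range(P), repeat=2))
# Hypothetical subspace U = span{(1, 2)} of (Z/5Z)^2.
U = frozenset((t % P, (2 * t) % P) for t in range(P))

def coset(v):
    """The coset v + U as an immutable set, so it can be an element of a set."""
    return frozenset(((v[0] + x) % P, (v[1] + y) % P) for x, y in U)

# The set of cosets: duplicates disappear automatically.
quotient = {coset(v) for v in V}

print(len(quotient))                    # number of distinct cosets
print(coset((3, 0)) == coset((4, 2)))   # two representatives of one coset
```

Here `(3, 0)` and `(4, 2)` differ by `(1, 2)`, which lies in `U`, so they represent the same element of the set of cosets.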


We have defined the set of cosets $V/U$ as the set of equivalence classes with respect to $\sim_U$. In the last section we saw that the equivalence class generated by $v$ is given exactly by the affine subspace $v + U$. Thus an equivalence class with respect to $\sim_U$ is the same as a displaced version of $U$. This provides two equivalent views of the set $V/U$: on the one hand, $V/U$ is the set of equivalence classes with respect to $\sim_U$; on the other hand, it is the set of displaced versions of $U$.

Hint

Depending on which of the two interpretations of the elements of $V/U$ one uses, one uses different names for the elements $v + U$ of $V/U$. If one uses the equivalence relation to divide the elements of $V$ into different sets, one speaks of cosets. If, on the other hand, one considers a displacement of $U$, one speaks of an affine subspace.

Examples for cosets

Example (Physics: Change of potential energy)

We consider three-dimensional space with a gravitational field, with a positive gravitational constant $g$. As coordinates, we use $x$, $y$ and $z$. Such a space can be, for example, the room where you are reading this article. We put our origin at some point on your table, so we define the potential energy at that point as $0$. From that point, you can move an object to different points, assigning to each destination point the potential energy of a point particle moved there, which depends only on its height above the table. Equivalently, we assign to each movement from the origin its change in potential energy. Let the table lie within the $x$-$y$-plane. For a point particle of mass $m$, the potential energy at $(x, y, z)^T$, or equivalently the change of the potential energy caused by a movement from the origin to $(x, y, z)^T$, is thus:

$$E_{\text{pot}}(x, y, z) = m \cdot g \cdot z$$

We want to classify the possible rectilinear displacements from the origin by their change in potential energy, and call two displacements equivalent if they change the potential energy of a point particle by the same amount. Equivalent displacements are grouped into one class. The mass and the gravitational constant are fixed for our point particle. Therefore, two displacements cause the same change in potential energy exactly if they have the same change in height. The displacements are therefore in the same class if their $z$-values match.

Let us now abstract our illustrative example. Our space is the $\mathbb{R}$-vector space $\mathbb{R}^3$. Mathematically, displacements from the origin are described by vectors. Displacements that cause the same change in potential energy for a point particle move it from the origin in $\mathbb{R}^3$ to the same plane parallel to the $x$-$y$-plane, since exactly the point particles on this plane have the same potential energy. For a given change in potential energy, we can choose any of the vectors causing such a change as a representative.

In the chapter on subspaces we have already seen that the $x$-$y$-plane is a subspace $U$ of $\mathbb{R}^3$. In our physical example, we saw that the planes obtained by shifting $U$ along the $z$-axis are the equivalence classes with respect to the change of potential energy. These classes are also called cosets.
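The grouping of displacements by height can be sketched in a few lines. The mass, the value of the gravitational constant and the list of displacement vectors below are hypothetical sample values; the function names are likewise chosen only for illustration.

```python
from collections import defaultdict

M = 2.0   # hypothetical mass of the point particle
G = 9.81  # hypothetical value of the gravitational constant

def potential_energy_change(d, m=M, g=G):
    """Change in potential energy for a displacement d = (x, y, z)."""
    x, y, z = d
    return m * g * z

def equivalent(d1, d2):
    """d1 and d2 are equivalent if d1 - d2 lies in the x-y-plane (z-part 0)."""
    return d1[2] - d2[2] == 0

# Hypothetical displacements from the origin.
displacements = [(1, 0, 2), (0, 5, 2), (-3, 4, 2), (1, 1, 0), (7, -2, 0), (0, 0, -1)]

# Group the displacements by their z-value: each group is part of one coset.
classes = defaultdict(list)
for d in displacements:
    classes[d[2]].append(d)

for z, members in sorted(classes.items()):
    print(z, members, potential_energy_change(members[0]))
```

All displacements in one group end on the same plane parallel to the table and therefore share one value of the energy change.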

Example (Finance: Change in the balance of two accounts)

Let us assume that each person always has exactly two bank accounts. Now we want to know how much money each person has in total. So we are interested in the sum of all the money each person has in their bank accounts. We look at the two bank accounts that Anna has. In these she has saved amounts of $a_1$ and $a_2$ respectively. So the total money saved by Anna is $a_1 + a_2$.

Let us now consider two people, Emma and Fritz. Emma has the balances $e = (e_1, e_2)^T$ on her accounts. Fritz has the balances $f = (f_1, f_2)^T$ on his accounts. Emma and Fritz therefore have exactly the same amount of money if $e_1 + e_2 = f_1 + f_2$. We call the pairs of accounts $e$ and $f$ equivalent if there is the same amount of money on them, i.e. if $e_1 + e_2 = f_1 + f_2$.

With this definition, the following pairs of accounts are equivalent, for example:

 

The reason is  .

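The two accounts of Emma and Fritz are therefore equivalent if $e_1 + e_2 = f_1 + f_2$, i.e., $(e_1 - f_1) + (e_2 - f_2) = 0$. We define the difference of the vectors $e$ and $f$ as

$$e - f = (e_1 - f_1, e_2 - f_2)^T$$

The vectors $e$ and $f$ are equivalent if and only if the components of $e - f$ sum to zero.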

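In other words, the sum of the funds from two accounts is given by the following linear map:

$$s \colon \mathbb{R}^2 \to \mathbb{R}, \quad (x_1, x_2)^T \mapsto x_1 + x_2$$

Thus the kernel of $s$ is the set of pairs of accounts whose sum is zero. So two pairs of accounts are equivalent if they differ only by a vector of $\ker(s)$. We can further transform the kernel of $s$:

$$\ker(s) = \{(x_1, x_2)^T \in \mathbb{R}^2 \mid x_1 + x_2 = 0\} = \{(x_1, -x_1)^T \mid x_1 \in \mathbb{R}\} = \operatorname{span}\{(1, -1)^T\}$$

The equivalence classes with respect to the sum of the account balances are thus exactly the cosets of the subspace $\ker(s)$. All cosets are of the form

$$(a, 0)^T + \ker(s)$$

with $a \in \mathbb{R}$.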

We can also think of it like this: We want to look at the summed balance of the two accounts. In the process of summation, information on the single accounts is lost. We still know how much money a person has in total, but no longer how the money is distributed between the two accounts.
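The account example can be sketched as follows. The balances are hypothetical numbers chosen only for illustration, and the function names `total`, `equivalent` and `representative` are assumptions. The representative puts the whole sum on the first account, which picks one fixed element from each coset.

```python
def total(accounts):
    """Sum of the balances of a pair of accounts."""
    return accounts[0] + accounts[1]

def equivalent(e, f):
    """Two pairs are equivalent if their difference has component sum zero."""
    d = (e[0] - f[0], e[1] - f[1])
    return d[0] + d[1] == 0

def representative(accounts):
    """A fixed representative of the coset: everything on the first account."""
    return (total(accounts), 0)

# Hypothetical balances.
emma = (100, 50)
fritz = (70, 80)
other = (10, 20)

print(equivalent(emma, fritz))   # same total, different distribution
print(equivalent(emma, other))
print(representative(emma), representative(fritz))
```

Passing to the representative forgets how the money is split between the two accounts and keeps only the total.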

Example (Two switches for one light source)

We consider the following scenario: A hallway has two light switches, both of which are to control the ceiling lamp. For this purpose, one can use multiway switching. We want to model the behaviour of this circuit using vector spaces.

 
A light switch that points upwards.

We start with the light switches. Every light switch has two states: it points either up or down. So we can model it with $\mathbb{Z}/2\mathbb{Z} = \{0, 1\}$. The $1$ stands for the light switch in the upper position and the $0$ for the light switch in the lower position. When you flip the light switch, the state changes from $0$ to $1$ or from $1$ to $0$, depending on the current position of the light switch. This change of state corresponds to the mathematical operation $x \mapsto x + 1$ within the field $\mathbb{Z}/2\mathbb{Z}$.

Now that we know how to model a light switch and the flipping of a light switch, we process all the input data of the multiway switching: these are the two light switches. We have four states, as each light switch has the state up ($1$) and down ($0$). Thus $(\mathbb{Z}/2\mathbb{Z})^2$ is a suitable vector space for modelling the states. The state of the first light switch is given by the first component and the state of the second one by the second component. In this modelling, flipping the first light switch then corresponds to $x \mapsto x + (1, 0)^T$ and flipping the second light switch corresponds to $x \mapsto x + (0, 1)^T$. If both light switches are in the upper position (this corresponds to $(1, 1)^T$) and we flip the first light switch, we arrive at the state $(1, 1)^T + (1, 0)^T = (0, 1)^T$.

In order to extract from this model whether the lamp is on or off, we must first understand how multiway switching works. The wiring leaves us a choice at this point: We can build it in such a way that both switches have to be in the same position for the lamp to be on. We can also build it such that the lamp is on when the two switches are in different positions. For this example, we consider the case where the lamp is on exactly when both switches are in different positions. Thus we identify the states $(1, 0)^T$ and $(0, 1)^T$ with a lamp being on. We identify the other states $(0, 0)^T$ and $(1, 1)^T$ with a switched-off lamp. The latter form a subspace $U = \{(0, 0)^T, (1, 1)^T\}$. If we shift this by $(1, 0)^T$ or $(0, 1)^T$, we get the states of a switched-on lamp $(1, 0)^T + U = \{(1, 0)^T, (0, 1)^T\}$. These are all cosets of $U$ in $(\mathbb{Z}/2\mathbb{Z})^2$. Thus we can summarise the multiway switching situation as follows: Given a switch state $x$, we get the information whether the lamp is on by determining the coset $x + U$ generated by $x$. If this is $U$, the lamp is off. If it is $(1, 0)^T + U$, the lamp is on.

With this understanding of the lamp state, we can also describe the influence that flipping a switch has. Each of the operations $x \mapsto x + (1, 0)^T$ and $x \mapsto x + (0, 1)^T$ acts on the set of lamp states $\{U, (1, 0)^T + U\}$ by changing the state of the lamp. If the lamp is in the off state $U$ and the second switch is flipped with $x \mapsto x + (0, 1)^T$, it is then in the on state $(0, 1)^T + U$. If the first switch is now flipped, the lamp changes back to the switched-off state. Mathematically, this is represented by $(1, 0)^T + ((0, 1)^T + U) = (1, 1)^T + U = U$.
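The switch model is small enough to simulate directly. The sketch below encodes the two switch positions as 0 and 1 (which of them means "up" does not matter for the lamp) and assumes the wiring in which the lamp is on exactly when the switches are in different positions; the names `add`, `coset`, `lamp_is_on`, `FLIP_FIRST` and `FLIP_SECOND` are hypothetical.

```python
# Switch states in which the lamp is off: both switches in the same position.
U = frozenset({(0, 0), (1, 1)})

def add(a, b):
    """Componentwise addition modulo 2."""
    return ((a[0] + b[0]) % 2, (a[1] + b[1]) % 2)

def coset(state):
    """The coset state + U."""
    return frozenset(add(state, u) for u in U)

def lamp_is_on(state):
    """The lamp is on exactly when the coset of the state is not U itself."""
    return coset(state) != U

FLIP_FIRST = (1, 0)   # flipping the first switch adds (1, 0)
FLIP_SECOND = (0, 1)  # flipping the second switch adds (0, 1)

state = (1, 1)
print(state, lamp_is_on(state))
state = add(state, FLIP_SECOND)
print(state, lamp_is_on(state))
state = add(state, FLIP_FIRST)
print(state, lamp_is_on(state))
```

Each flip moves the state into the other coset, so every flip toggles the lamp regardless of which switch is used.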

Properties of equivalence classes applied to cosets

We have seen above that cosets of a one-dimensional subspace in $\mathbb{R}^2$ are parallel straight lines. We can also explain this by characterising cosets as equivalence classes: Two equivalence classes, as sets, are either equal or disjoint. For us, this means that two cosets, i.e. two straight lines, are either equal or have no point of intersection. The latter means that they are parallel.

Furthermore, we know about equivalence classes that they cover the whole space, i.e. the union of all equivalence classes is the whole set. From this we conclude that the union of all cosets (in our case parallel straight lines) gives the whole of $\mathbb{R}^2$. We can therefore decompose the vector space into the cosets, like leaves. This decomposition is also called a partition. So the cosets partition the vector space. In our example, this means that we can decompose $\mathbb{R}^2$ into displaced versions of a line $U$ through the origin. This is illustrated in the following picture:

 
$V = \mathbb{R}^2$ is partitioned by the cosets of a line

Both points mentioned also hold in general (not only in $\mathbb{R}^2$), since we have not used any special property of $\mathbb{R}^2$ in any of our arguments. It is therefore true for a vector space $V$ and a subspace $U$ that:

$V$ is the union of the cosets $v + U$ with $v \in V$, and any two distinct cosets are disjoint.
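In the plane, the partition into parallel lines can be made concrete by giving every point a label that identifies its coset. The following sketch assumes a line through the origin spanned by a non-zero vector; the function name `coset_label` and the sample points are hypothetical. The label is the scalar product with a vector perpendicular to the direction vector, which is constant along each parallel line and differs between different lines.

```python
def coset_label(p, u):
    """Label of the coset p + U for U = span{u} in R^2, assuming u != (0, 0).

    n = (-u[1], u[0]) is perpendicular to u, so p and q get the same label
    exactly when p - q is a multiple of u, i.e. when p + U == q + U.
    """
    return -u[1] * p[0] + u[0] * p[1]

u = (2, 1)  # hypothetical direction vector
points = [(0, 0), (2, 1), (4, 2), (0, 1), (2, 2), (1, 0)]

for p in points:
    print(p, coset_label(p, u))
```

Every point receives exactly one label, which reflects that each point lies in exactly one coset.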

Cosets occur when solving systems of linear equations: The solutions of the associated homogeneous system of equations $Ax = 0$ form a subspace $U$. If the linear system of equations $Ax = b$ has a solution $x_0$, its solutions form the affine subspace $x_0 + U$.
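Over a finite field this statement can be checked by listing all solutions. The sketch below assumes a small system over $\mathbb{Z}/5\mathbb{Z}$; the coefficient matrix `A`, the right-hand side `b` and the helper `apply` are hypothetical choices for illustration.

```python
from itertools import product

P = 5
A = [[1, 2, 0],
     [0, 1, 3]]  # hypothetical coefficient matrix over Z/5Z
b = [4, 1]       # hypothetical right-hand side

def apply(A, x):
    """Matrix-vector product A x with entries reduced modulo P."""
    return [sum(a * xi for a, xi in zip(row, x)) % P for row in A]

vectors = list(product(range(P), repeat=3))
solutions = {x for x in vectors if apply(A, x) == b}       # solutions of A x = b
kernel = {x for x in vectors if apply(A, x) == [0, 0]}     # solutions of A x = 0

x0 = min(solutions)  # any particular solution can serve as the shift
shifted_kernel = {tuple((x0[i] + k[i]) % P for i in range(3)) for k in kernel}

print(sorted(solutions))
print(solutions == shifted_kernel)
```

The choice of `x0` is arbitrary: any other solution differs from it by an element of the kernel and therefore generates the same coset.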
