Solving Linear Systems

Section 1.1 Solving Linear Systems

Subsection 1.1.1 Some Initial Thoughts

Investigation 1.1.1. A 20% Solution (A First Simple Example).

Statement:

Given a solution that is 75% alcohol (call it \(A\)) and distilled water (call it \(W\)), mix one liter of solution with a concentration of 20% alcohol.

Analysis:

Total solution 1 liter:

\begin{equation*} A+W=1 \end{equation*}

Total alcohol in solution 0.2 liters (20%):

\begin{equation*} 0.75A=0.2 \end{equation*}

Solution 1:

From the second equation we derive:

\begin{equation*} A=20/75=4/15\approx 0.267 \end{equation*}

Substituted into the first equation this gives:

\begin{equation*} 4/15+W=1 \end{equation*}

\begin{equation*} W=11/15\approx0.733 \end{equation*}

So, a solution containing 267 milliliters of the original alcohol solution and 733 milliliters of the distilled water will yield 1 liter of a 20% alcohol solution.

Solution 2:

Consider the equations together:

\begin{align*} A + W \amp = 1\\ 0.75 A \amp = 0.2 \end{align*}

When we do this we refer to the set of equations as a system of equations and to \(A\) and \(W\) as the unknowns. Subtracting 0.75 times the first equation from the second we get:

\begin{equation*} -0.75 W = -0.55 \end{equation*}

\begin{equation*} W = 55/75 \approx 0.733 \end{equation*}

As before, using the second equation or substitution, we get \(A=20/75\approx 0.267\) so that a solution containing 267 milliliters of the original alcohol solution and 733 milliliters of the distilled water will yield 1 liter of a 20% alcohol solution.

Both of the above are examples of what can be termed elimination and back substitution. The goal is to isolate one variable to find its value, and then work back from there to find the values of the rest. This is a quick, easy way to find solutions when there are only two variables, but it becomes cumbersome as soon as there are three or more.
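
The arithmetic in this investigation can be checked with a few lines of Python. This is only a sketch: the function name `mix` and its parameters are hypothetical, and exact fractions are used so that the results match \(4/15\) and \(11/15\) rather than rounded decimals.

```python
from fractions import Fraction

def mix(stock, target, total):
    """Return (liters of stock solution, liters of water) for the mix.

    stock  -- alcohol concentration of the stock solution (0.75 here)
    target -- desired alcohol concentration (0.2 here)
    total  -- desired total volume in liters (1 here)
    """
    # Alcohol balance: stock * A = target * total
    a = target * total / stock
    # Volume balance: A + W = total
    w = total - a
    return a, w

A, W = mix(Fraction(3, 4), Fraction(1, 5), Fraction(1))
print(A, W)                  # 4/15 11/15
print(float(A), float(W))    # decimal approximations in liters
```

The first line of the function is the elimination step (solving for \(A\)) and the second is the back substitution (recovering \(W\)).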

Investigation 1.1.2. Name the Parabola (A Not So Simple Example).

Statement:

Given a parabola with a \(y\)-intercept of 7, which passes through the point \((1,6)\) and has a slope of 8 at \(x=-1\text{,}\) find the coefficients \(a,b,c\) in \(y=ax^2+bx+c\text{.}\)

Figure 1.1.1. A Parabola?

Analysis:

First if we let \(f(x)=ax^2+bx+c\text{,}\) so that its derivative is \(f'(x)=2ax+b\text{,}\) then from the information given we know:

\begin{align*} f(1) \amp = \amp a+b+c \amp = 6\\ f'(-1) \amp = \amp -2a +b \amp = 8 \\ f(0) \amp = \amp c \amp = 7 \end{align*}

giving us a system of three equations and three unknowns.

Solution 1:

Using elimination and back substitution we first see that \(c=7\text{.}\) Then plugging that into the first equation above we can simplify the problem to solving:

\begin{align*} a+b \amp = -1\\ -2a +b \amp = 8 \end{align*}

We can now subtract the second equation from the first to find \(3a=-9\) or \(a=-3\text{.}\) Finally, substituting this value for \(a\) into either of the previous equations we derive \(b=2\text{.}\) Thus using the information given we know that the desired parabola is given by \(f(x)=-3x^2+2x+7\text{.}\)
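
As a quick sanity check, the three given conditions can be tested directly against the coefficients found above. The helper names `f` and `fprime` below are chosen for illustration only and simply encode \(f(x)=ax^2+bx+c\) and \(f'(x)=2ax+b\text{.}\)

```python
def f(x, a=-3, b=2, c=7):
    """The candidate parabola f(x) = a x^2 + b x + c."""
    return a * x**2 + b * x + c

def fprime(x, a=-3, b=2):
    """Its derivative f'(x) = 2 a x + b."""
    return 2 * a * x + b

# y-intercept, value at x = 1, slope at x = -1
print(f(0), f(1), fprime(-1))   # 7 6 8
```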

Solution 2:

Alternately, we can use mostly just elimination. If we consider our original equations

\begin{align*} a+b+c \amp = 6\\ -2a +b \amp = 8 \\ c \amp = 7 \end{align*}

then we might first subtract row three from row one, eliminating the \(c\text{.}\) This reduces our system to

\begin{align*} a+b \amp = -1\\ -2a +b \amp = 8 \\ c \amp = 7 \end{align*}

which we can similarly reduce by subtracting the second row from the first

\begin{align*} 3a \amp = -9\\ -2a +b \amp = 8 \\ c \amp = 7 \end{align*}

Finally, we simplify the first row by dividing by 3 (giving \(a=-3\)) and then add two copies of that to the second row to get

\begin{align*} a \amp = -3\\ b \amp = 2 \\ c \amp = 7 \end{align*}

Thus we arrive at the same solution: using the information given, the desired parabola is \(f(x)=-3x^2+2x+7\text{.}\)

Solution 3:

Notice that in the work above the variables \(a,b,c\text{,}\) the equal signs, and even the plus signs served no computational purpose; they just kept things neat. All that really mattered were the coefficients and the constants to the right of the equal signs. This hints at a better solution. Consider the following array containing just the coefficients and constants from the system of equations

\begin{equation*} A=\left[ \begin{array}{rrr|r} 1 \amp 1 \amp 1 \amp 6 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right] \end{equation*}

which we will call an augmented matrix (note that the left hand portion is called the coefficient matrix for this system). Now let us perform all the same steps we used in the solution above, only now with the matrix.

First we subtract the third row from row one

\begin{equation*} \left[ \begin{array}{rrr|r} 1 \amp 1 \amp 1 \amp 6 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right] \stackrel{R_1-R_3}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 1 \amp 1 \amp 0 \amp -1 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right]. \end{equation*}

We can reduce this further by subtracting the second row from the first

\begin{equation*} \left[ \begin{array}{rrr|r} 1 \amp 1 \amp 0 \amp -1 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right] \stackrel{R_1-R_2}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 3 \amp 0 \amp 0 \amp -9 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right]. \end{equation*}

Now we can divide row one by 3

\begin{equation*} \left[ \begin{array}{rrr|r} 3 \amp 0 \amp 0 \amp -9 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right] \stackrel{\frac{1}{3}R_1}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 1 \amp 0 \amp 0 \amp -3 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right] \end{equation*}

and add two copies of this to the second row to get

\begin{equation*} \left[ \begin{array}{rrr|r} 1 \amp 0 \amp 0 \amp -3 \\ -2 \amp 1 \amp 0 \amp 8 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right] \stackrel{R_2+2R_1}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 1 \amp 0 \amp 0 \amp -3 \\ 0 \amp 1 \amp 0 \amp 2 \\ 0 \amp 0 \amp 1 \amp 7 \end{array} \right]. \end{equation*}

Following the same steps as before we get the same answer: the last column gives \(a=-3\text{,}\) \(b=2\text{,}\) \(c=7\text{,}\) so the desired parabola is \(f(x)=-3x^2+2x+7\text{.}\)
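
The four row operations used in this solution can be replayed mechanically. The following is a minimal sketch, not a general solver: the helper names `add_multiple` and `scale` are hypothetical, rows are indexed from 0 (so \(R_1\) is row 0), and the matrix is stored as a list of lists of exact fractions.

```python
from fractions import Fraction

def add_multiple(M, target, source, factor):
    """R_target <- R_target + factor * R_source (rows are 0-indexed)."""
    M[target] = [t + factor * s for t, s in zip(M[target], M[source])]

def scale(M, row, factor):
    """R_row <- factor * R_row."""
    M[row] = [factor * x for x in M[row]]

# Augmented matrix for a+b+c=6, -2a+b=8, c=7
M = [[Fraction(v) for v in row] for row in
     [[1, 1, 1, 6], [-2, 1, 0, 8], [0, 0, 1, 7]]]

add_multiple(M, 0, 2, -1)       # R1 - R3
add_multiple(M, 0, 1, -1)       # R1 - R2
scale(M, 0, Fraction(1, 3))     # (1/3) R1
add_multiple(M, 1, 0, 2)        # R2 + 2 R1

# The last column now holds the solution.
a, b, c = (row[-1] for row in M)
print(a, b, c)                  # -3 2 7
```

Each call corresponds to one arrow in the displays above, which makes it easy to compare the intermediate matrices by printing `M` after each step.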

Subsection 1.1.2 Further Examples

Take a look at the next few examples to get a feel for what happens when you have different combinations of equations and unknowns.

Example 1.1.2. Another Plane but not so Simple Example.

Statement:

Given three planes:

\begin{equation*} 2x+3y-z=1 \end{equation*}

\begin{equation*} -2x+y+z=1 \end{equation*}

\begin{equation*} y-z=1 \end{equation*}

find the point \((x,y,z)\) at which all three intersect.

Figure 1.1.3. Planes

Solution 1:

As before we have a system of three equations and three unknowns

\begin{align*} 2x + 3y - z \amp= 1 \\ -2x + y + z \amp= 1 \\ y - z \amp= 1 \end{align*}

Now we can add row one to row two to get

\begin{align*} 2x + 3y - z \amp= 1 \\ 4y \amp= 2 \\ y - z \amp= 1 \end{align*}

Multiplying the second row by \(1/4\) and subtracting the result from row three we get

\begin{align*} 2x + 3y - z \amp= 1 \\ y \amp= 1/2 \\ - z \amp= 1/2 \end{align*}

and multiplying row three by \(-1\) gives

\begin{align*} 2x + 3y - z \amp= 1 \\ y \amp= 1/2 \\ z \amp= -1/2 \end{align*}

If we then subtract three copies of row two from row one and add one copy of row three to row one we are left with

\begin{align*} 2x \amp= -1 \\ y \amp= 1/2 \\ z \amp= -1/2 \end{align*}

which gives the solution

\begin{align*} x \amp= -1/2 \\ y \amp= 1/2 \\ z \amp= -1/2 \end{align*}

If we substitute these into the original equations

\begin{align*} 2\left(-\frac{1}{2}\right) + 3\left(\frac{1}{2}\right) - \left(-\frac{1}{2}\right) \amp= 1 \\ -2\left(-\frac{1}{2}\right) + \left(\frac{1}{2}\right) + \left(-\frac{1}{2}\right) \amp= 1 \\ \left(\frac{1}{2}\right) - \left(-\frac{1}{2}\right) \amp= 1 \end{align*}

we can verify that the solution is correct. Therefore the answer to the original question is that the single point at which all three planes intersect is

\begin{equation*} (x,y,z)=\left(-\frac{1}{2},\frac{1}{2},-\frac{1}{2}\right). \end{equation*}

Solution 2:

Alternately we can rewrite the system as an augmented matrix like so

\begin{equation*} \left[ \begin{array}{rrr|r} 2 \amp 3 \amp -1 \amp 1 \\ -2 \amp 1 \amp 1 \amp 1 \\ 0\amp 1 \amp -1 \amp 1 \\ \end{array} \right]. \end{equation*}

If we now follow the steps we used before we should get the same solution. First add row 1 to row 2:

\begin{equation*} \left[ \begin{array}{rrr|r} 2 \amp 3 \amp -1 \amp 1 \\ -2 \amp 1 \amp 1 \amp 1 \\ 0\amp 1 \amp -1 \amp 1 \\ \end{array} \right] \stackrel{R_2+R_1}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 2 \amp 3 \amp -1 \amp 1 \\ 0 \amp 4 \amp 0 \amp 2 \\ 0\amp 1 \amp -1 \amp 1 \\ \end{array} \right]. \end{equation*}

Next we multiply row 2 by \(1/4\) and then subtract that from row 3:

\begin{equation*} \left[ \begin{array}{rrr|r} 2 \amp 3 \amp -1 \amp 1 \\ 0 \amp 4 \amp 0 \amp 2 \\ 0\amp 1 \amp -1 \amp 1 \\ \end{array} \right] \stackrel{1/4\times R_2,\\ R_3-R_2}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 2 \amp 3 \amp -1 \amp 1 \\ 0 \amp 1 \amp 0 \amp 1/2 \\ 0\amp 0 \amp -1 \amp 1/2 \\ \end{array} \right]. \end{equation*}

The latter matrix is said to be in row echelon form since all the entries below the diagonal are 0. Now, as before, we multiply row 3 by \(-1\text{,}\) then subtract three copies of row 2 from row 1 and add one copy of row 3 to row 1:

\begin{equation*} \left[ \begin{array}{rrr|r} 2 \amp 3 \amp -1 \amp 1 \\ 0 \amp 1 \amp 0 \amp 1/2 \\ 0\amp 0 \amp -1 \amp 1/2 \\ \end{array} \right] \stackrel{-R_3,\\ R_1-3R_2+R_3}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 2 \amp 0 \amp 0 \amp -1 \\ 0 \amp 1 \amp 0 \amp 1/2 \\ 0\amp 0 \amp 1 \amp -1/2 \\ \end{array} \right]. \end{equation*}

Finally, we multiply row 1 by 1/2:

\begin{equation*} \left[ \begin{array}{rrr|r} 2 \amp 0 \amp 0 \amp -1 \\ 0 \amp 1 \amp 0 \amp 1/2 \\ 0\amp 0 \amp 1 \amp -1/2 \\ \end{array} \right] \stackrel{1/2\times R_1}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 1 \amp 0 \amp 0 \amp -1/2 \\ 0 \amp 1 \amp 0 \amp 1/2 \\ 0\amp 0 \amp 1 \amp -1/2 \\ \end{array} \right]. \end{equation*}

This last form of the matrix is called reduced row echelon form since the diagonal entries of the coefficient part are all 1 and its off-diagonal entries are all 0. Having followed the same steps we get the same solution:

\begin{equation*} (x,y,z)=\left(-\frac{1}{2},\frac{1}{2},-\frac{1}{2}\right). \end{equation*}
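
The reduction to reduced row echelon form can also be automated. The function `rref` below is a hypothetical helper written for this illustration (a plain Gauss-Jordan sweep with exact fractions), and it assumes the last column of the input is the augmented column. It scales each pivot to 1 before clearing its column, so its intermediate matrices differ from the ones shown above even though the final matrix is the same.

```python
from fractions import Fraction

def rref(rows):
    """Return the reduced row echelon form of an augmented matrix."""
    M = [[Fraction(v) for v in row] for row in rows]
    n_rows, n_cols = len(M), len(M[0])
    pivot_row = 0
    for col in range(n_cols - 1):       # skip the augmented column
        # Find a row at or below pivot_row with a nonzero entry in this column.
        pivot = next((r for r in range(pivot_row, n_rows)
                      if M[r][col] != 0), None)
        if pivot is None:
            continue                    # no pivot here: a free variable
        M[pivot_row], M[pivot] = M[pivot], M[pivot_row]
        # Scale the pivot row so its leading entry is 1.
        lead = M[pivot_row][col]
        M[pivot_row] = [x / lead for x in M[pivot_row]]
        # Clear this column in every other row.
        for r in range(n_rows):
            if r != pivot_row and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [x - factor * p for x, p in zip(M[r], M[pivot_row])]
        pivot_row += 1
        if pivot_row == n_rows:
            break
    return M

planes = [[2, 3, -1, 1], [-2, 1, 1, 1], [0, 1, -1, 1]]
R = rref(planes)
x, y, z = (row[-1] for row in R)
print(x, y, z)   # -1/2 1/2 -1/2
```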

Example 1.1.4. An Underdetermined System.

Consider the system of equations:

\begin{align*} t-4s+r \amp =0\\ s-2r \amp =0 \end{align*}

This system is called underdetermined because there are more variables than equations. However, we can still use the same strategies to try to solve it. First we form an augmented matrix:

\begin{equation*} \left[ \begin{array}{rrr|r} 1 \amp -4 \amp 1 \amp 0\\ 0 \amp 1 \amp -2 \amp 0 \end{array} \right]. \end{equation*}

Then we can try to simplify this as before:

\begin{equation*} \left[ \begin{array}{rrr|r} 1 \amp -4 \amp 1 \amp 0\\ 0 \amp 1 \amp -2 \amp 0 \end{array} \right] \stackrel{R_1+4R_2}{\Huge \leadsto} \left[ \begin{array}{rrr|r} 1 \amp 0 \amp -7 \amp 0\\ 0 \amp 1 \amp -2 \amp 0 \end{array} \right]. \end{equation*}

But we can't go any further than this because there are only two equations for our three unknowns. Consequently the solution we get is that \(t=7r\text{,}\) \(s=2r\text{,}\) and \(r\) may be any value we like. For example if \(r=7\) then \(t=49\) and \(s=14\text{,}\) which we can see satisfies the original equations:

\begin{align*} 49-4(14)+7 \amp =0\\ 14-2(7) \amp =0 \end{align*}

However we could also let \(r=1\) so that \(t=7\) and \(s=2\text{,}\) or let \(r=-2\) so that \(t=-14\) and \(s=-4\text{.}\) In fact we have infinitely many solutions since there are no restrictions on \(r\text{.}\) In this situation we say that \(r\) is a free variable. Also, note that all the equations were set equal to zero. We call this a homogeneous system, and these will come to play a special role in finding general solutions to systems of equations.
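
The one-parameter family of solutions is easy to spot-check numerically. In this sketch the function name `solution` is hypothetical; it just encodes \(t=7r\) and \(s=2r\) and then substitutes back into both original equations for a few choices of the free variable.

```python
def solution(r):
    """Return (t, s, r) for the free variable r, using t = 7r and s = 2r."""
    return 7 * r, 2 * r, r

for r in (7, 1, -2):
    t, s, _ = solution(r)
    # Both original left-hand sides should evaluate to zero.
    print(r, (t, s), t - 4 * s + r, s - 2 * r)
```

Every line printed ends in two zeros, whatever value of \(r\) is chosen, which is what having a free variable means in practice.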

Example 1.1.5. An Overdetermined System.

Of course if there can be underdetermined systems, then there can be overdetermined systems as well. Let's look at this system of three equations but only two unknowns:

\begin{align*} 3x+2y \amp =7\\ x-y \amp = 2\\ 4y \amp =16 \end{align*}

Note that the equations are not set equal to 0, so this system is non-homogeneous. We could use the same technique of setting up an augmented matrix like so

\begin{equation*} \left[ \begin{array}{rr|r} 3 \amp 2 \amp 7\\ 1 \amp -1 \amp 2\\ 0 \amp 4 \amp 16 \end{array} \right], \end{equation*}

however in this case we can really solve this by inspection, that is, by just looking at it for a minute. From the last equation we conclude \(y=4\text{.}\) Substituting that into the second equation we get \(x=6\text{.}\) However, if we substitute \(y=4\) into the first equation we get \(x=-1/3\text{.}\) Since \(x\) can't have two different values, we conclude that this system of equations doesn't have a solution. When a system doesn't have a solution we say that it is inconsistent; otherwise it is consistent.
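
The inspection argument can be written out as a short computation. This is only an illustrative sketch with hypothetical variable names: it solves the last equation for \(y\text{,}\) then solves each of the other two equations for \(x\) and compares the results.

```python
from fractions import Fraction

y = Fraction(16, 4)              # from 4y = 16
x_from_second = 2 + y            # from x - y = 2
x_from_first = (7 - 2 * y) / 3   # from 3x + 2y = 7

print(x_from_second, x_from_first)          # 6 -1/3
consistent = x_from_second == x_from_first
print(consistent)                           # False
```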

Example 1.1.6. Consistent and Inconsistent Systems.

Just so that you don't get the idea that overdetermined is synonymous with inconsistent, consider the homogeneous version of the system we just looked at:

\begin{align*} 3x+2y \amp = 0\\ x-y \amp = 0\\ 4y \amp = 0 \end{align*}

Note that if \(x=y=0\) then all three equations are true and we have a consistent system. This illustrates an important property of homogeneous systems: they always have at least one solution, and so are consistent, since setting all the variables to 0 satisfies every equation.

You should also be careful not to think that all underdetermined systems have infinitely many solutions, or even that they are consistent. If we look at the system

\begin{align*} s+t+r\amp = 2\\ -2s-2t-2r\amp = -10 \end{align*}

and then multiply the second equation by -1/2 we get

\begin{align*} s+t+r\amp = 2\\ s+t+r\amp = 5 \end{align*}

Clearly the sum of the three variables cannot simultaneously be 2 and 5, so the system is inconsistent.
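
In augmented matrix form, an inconsistency of this kind shows up as a row whose coefficients are all 0 but whose right-hand side is not, that is, a row that reads \(0=c\) with \(c\neq 0\text{.}\) The sketch below uses a different row operation from the one in the text (it adds two copies of row 1 to row 2 rather than rescaling row 2), and the helper name `is_inconsistent_row` is hypothetical.

```python
from fractions import Fraction

def is_inconsistent_row(row):
    """True when the row reads 0 = c with c nonzero."""
    *coeffs, rhs = row
    return all(c == 0 for c in coeffs) and rhs != 0

# Augmented matrix for s+t+r=2, -2s-2t-2r=-10
M = [[Fraction(v) for v in row] for row in
     [[1, 1, 1, 2], [-2, -2, -2, -10]]]

# R2 + 2 R1 eliminates every coefficient in the second row.
M[1] = [b + 2 * a for a, b in zip(M[0], M[1])]

print([str(v) for v in M[1]])                    # ['0', '0', '0', '-6']
print(any(is_inconsistent_row(r) for r in M))    # True
```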

Section Vocabulary.

Homogeneous, Non-homogeneous, Consistent, Inconsistent, Overdetermined, Underdetermined, Augmented Matrix, Coefficient Matrix, System of Equations, Row Echelon form, Reduced Row Echelon Form, Elimination and Back Substitution.