Ch 1.4 The Matrix Equation Ax = b

1.4 The Matrix Equation $Ax = b$

Equivalence of System, Vector, and Matrix Equations

Connection to Span (from Section 1.3)

A vector equation is equivalent to a system of linear equations.
Example (Page 1): A system involving a vector $y$ and vectors ${V1, V2, V3}$ can be represented by an augmented matrix.
- Row reduction of the augmented matrix:
  $\begin{bmatrix} 1 & 5 & -3 & -4 \ -2 & -7 & 0 & h \ 3 & -6 & -5 & -8 \ \end{bmatrix} \sim \begin{bmatrix} 1 & 5 & -3 & -4 \ 0 & 1 & -2 & -1 \ 0 & 0 & 1 & h-5 \ \end{bmatrix}$
- The system is consistent if and only if there's no pivot in the last (fourth) column. This means $(h-5)$ must be zero.
- Therefore, $y$ is in $Span{V1, V2, V3}$ if and only if $h = 5$ .
Important Note: The presence of a free variable does not guarantee that a system is consistent.

Linear Combinations and Span Properties

If vectors $u$ and $v$ are in $Span{W1, W2, W3}$ , then there exist scalars $C1, C2, C3$ and $d1, d2, d3$ such that:
- $u = C1W1 + C2W2 + C3W3$
- $v = d1W1 + d2W2 + d3W3$
Then, their sum $u + v$ is also in $Span{W1, W2, W3}$ :
- $u + v = (C1W1 + C2W2 + C3W3) + (d1W1 + d2W2 + d3W3)$
- $u + v = (C1 + d1)W1 + (C2 + d2)W2 + (C3 + d3)W3$
- Since $(C1 + d1)$ , $(C2 + d2)$ , and $(C3 + d3)$ are also scalars, this shows $u + v$ is a linear combination of ${W1, W2, W3}$ .

Definition of the Matrix-Vector Product $Ax$

A fundamental idea in linear algebra is to view a linear combination of vectors as the product of a matrix and a vector.
Definition (Page 1): If $A$ is an $m \times n$ matrix with columns $a1, \dots, an$ , and if $x$ is in $R^n$ (a column vector with $n$ entries), then the product of $A$ and $x$ , denoted by $Ax$ , is the linear combination of the columns of $A$ using the corresponding entries in $x$ as weights.
- $Ax = \begin{bmatrix} a1 & a2 & \dots & an \end{bmatrix} \begin{bmatrix} x1 \ x2 \ \vdots \ xn \end{bmatrix} = x1a1 + x2a2 + \dots + xnan$
Condition: $Ax$ is defined only if the number of columns of $A$ (which is $n$ ) equals the number of entries in $x$ (also $n$ ).
Example 1 (Page 1):
- a.
  $\begin{bmatrix} 1 & 2 & 3 \ 4 & 5 & 6 \ \end{bmatrix} \begin{bmatrix} 4 \ 3 \ 7 \ \end{bmatrix} = 4 \begin{bmatrix} 1 \ 4 \ \end{bmatrix} + 3 \begin{bmatrix} 2 \ 5 \ \end{bmatrix} + 7 \begin{bmatrix} 3 \ 6 \ \end{bmatrix} = \begin{bmatrix} 4 \ 16 \ \end{bmatrix} + \begin{bmatrix} 6 \ 15 \ \end{bmatrix} + \begin{bmatrix} 21 \ 42 \ \end{bmatrix} = \begin{bmatrix} 31 \ 73 \ \end{bmatrix}$
- b.
  $\begin{bmatrix} 8 & 3 \ 5 & 1 \ \end{bmatrix} \begin{bmatrix} 4 \ 7 \ \end{bmatrix} = 4 \begin{bmatrix} 8 \ 5 \ \end{bmatrix} + 7 \begin{bmatrix} 3 \ 1 \ \end{bmatrix} = \begin{bmatrix} 32 \ 20 \ \end{bmatrix} + \begin{bmatrix} 21 \ 7 \ \end{bmatrix} = \begin{bmatrix} 53 \ 27 \ \end{bmatrix}$
Example 2 (Page 2): To write the linear combination $3v1 - 5v2 + 7v3$ as a matrix times a vector:
- Place the vectors $v1, v2, v3$ into the columns of a matrix $A$ .
- Place the weights $3, -5, 7$ into a vector $x$ .
- Thus, $3v1 - 5v2 + 7v3 = \begin{bmatrix} v1 & v2 & v3 \end{bmatrix} \begin{bmatrix} 3 \ -5 \ 7 \ \end{bmatrix} = Ax$

The Matrix Equation $Ax = b$

Any system of linear equations can be written as an equivalent matrix equation in the form $Ax = b$ .
Example (System to Matrix Equation, Page 2):
- System of linear equations:
  $x1 - 2x2 - x3 = 4$
  $-5x2 + 3x3 = 1$
- Equivalent vector equation:
  $x1 \begin{bmatrix} 1 \ 0 \ \end{bmatrix} + x2 \begin{bmatrix} -2 \ -5 \ \end{bmatrix} + x3 \begin{bmatrix} -1 \ 3 \ \end{bmatrix} = \begin{bmatrix} 4 \ 1 \ \end{bmatrix}$
- Equivalent matrix equation:
  $\begin{bmatrix} 1 & -2 & -1 \ 0 & -5 & 3 \ \end{bmatrix} \begin{bmatrix} x1 \ x2 \ x3 \ \end{bmatrix} = \begin{bmatrix} 4 \ 1 \ \end{bmatrix}$
- The matrix here is the coefficient matrix of the system.

Theorem 3 (Page 2): Equivalence of Three Forms

If $A$ is an $m \times n$ matrix with columns $a1, \dots, an$ , and $b$ is in $R^m$ , then the matrix equation $Ax = b$ has the same solution set as:
- The vector equation: $x1a1 + x2a2 + \dots + xnan = b$
- The system of linear equations whose augmented matrix is:
  $\begin{bmatrix} a1 & a2 & \dots & an & b \ \end{bmatrix}$
Significance: This theorem provides a powerful tool, allowing problems to be viewed in three equivalent ways: as a matrix equation, a vector equation, or a system of linear equations. This flexibility aids in constructing mathematical models and solving problems through row reduction of the augmented matrix.

Existence of Solutions

Key Fact (Page 3): The equation $Ax = b$ has a solution if and only if $b$ is a linear combination of the columns of $A$ . Equivalently, $Ax = b$ is consistent if and only if $b$ is in $Span{a1, \dots, an}$ .

Determining Consistency for All Possible $b$

Example 3 (Page 3): Let $A = \begin{bmatrix} 1 & 3 & 4 \ -4 & 2 & -6 \ -3 & -2 & -7 \ \end{bmatrix}$ . Is the equation $Ax = b$ consistent for all possible $b = \begin{bmatrix} b1 \ b2 \ b3 \ \end{bmatrix}$ ?
- Row reduce the augmented matrix $[A \ b]$ :
  $\begin{bmatrix} 1 & 3 & 4 & b1 \ -4 & 2 & -6 & b2 \ -3 & -2 & -7 & b3 \ \end{bmatrix} \sim \begin{bmatrix} 1 & 3 & 4 & b1 \ 0 & 14 & 10 & b2+4b1 \ 0 & 7 & 5 & b3+3b1 \ \end{bmatrix} \sim \begin{bmatrix} 1 & 3 & 4 & b1 \ 0 & 14 & 10 & b2+4b1 \ 0 & 0 & 0 & b3+3b1 - \frac{1}{2}(b2+4b1) \ \end{bmatrix}$
- The third entry in the augmented column is $b3+3b1 - \frac{1}{2}b2 - 2b1 = b1 - \frac{1}{2}b2 + b3$ .
- The system $Ax=b$ is not consistent for every $b$ because some choices of $b$ can make $(b1 - \frac{1}{2}b2 + b3)$ nonzero, leading to an inconsistent system.
- The reduced matrix shows that $Ax = b$ is consistent if and only if the entries in $b$ satisfy:
  $b1 - \frac{1}{2}b2 + b3 = 0$
- This equation describes a plane through the origin in $R^3$ , which is the set of all linear combinations of the columns of $A$ . The equation fails to be consistent for all $b$ because the echelon form of $A$ has a row of zeros.

Theorem 4 (Page 3): Equivalence of Four Statements

Let $A$ be an $m \times n$ matrix. The following statements are logically equivalent (all true or all false):
- a. For each $b$ in $R^m$ , the equation $Ax = b$ has a solution.
- b. Each $b$ in $R^m$ is a linear combination of the columns of $A$ .
- c. The columns of $A$ span $R^m$ . (This means $Span{v1, \dots, vp} = R^m$ if ${v1, \dots, vp}$ are the columns of $A$ ).
- d. The matrix $A$ has a pivot position in every row.
Warning (Page 4): Theorem 4 is about the coefficient matrix $A$ , not the augmented matrix $[A \ b]$ . If an augmented matrix $[A \ b]$ has a pivot position in every row, the equation $Ax = b$ may or may not be consistent (e.g., if the pivot is in the augmented column).
Proof Sketch (Page 6):
- Statements (a), (b), and (c) are equivalent by definition.
- To show (a) and (d) are equivalent: Let $U$ be an echelon form of $A$ .
  - If (d) is true (pivot in every row of $U$ ), then $[U \ d]$ cannot have a pivot in the augmented column, so $Ax = b$ has a solution for any $b$ (meaning (a) is true).
  - If (d) is false (last row of $U$ is all zeros), then we can construct a vector $d$ with a $1$ in its last entry. The system $[U \ d]$ represents an inconsistent system. Since row operations are reversible, this implies an original system $[A \ b]$ that is also inconsistent, thus (a) is false.

Computation of $Ax$ : The Row-Vector Rule

This rule provides a more efficient method for calculating entries in $Ax$ by hand.
Example 4 (Page 4): Compute $Ax$ , where $A = \begin{bmatrix} 2 & 3 & 4 \ -1 & 5 & -3 \ 6 & -2 & 8 \ \end{bmatrix}$ and $x = \begin{bmatrix} x1 \ x2 \ x3 \ \end{bmatrix}$ .
- By definition: $Ax = x1 \begin{bmatrix} 2 \ -1 \ 6 \ \end{bmatrix} + x2 \begin{bmatrix} 3 \ 5 \ -2 \ \end{bmatrix} + x3 \begin{bmatrix} 4 \ -3 \ 8 \ \end{bmatrix} = \begin{bmatrix} 2x1 + 3x2 + 4x3 \ -x1 + 5x2 - 3x3 \ 6x1 - 2x2 + 8x3 \ \end{bmatrix}$
- Notice that the first entry in $Ax$ ( $2x1 + 3x2 + 4x3$ ) is the sum of products of entries from the first row of $A$ and the entries in $x$ .
- Similarly, the second entry ( $-x1 + 5x2 - 3x3$ ) is from the second row of $A$ and $x$ . And so on.
Row-Vector Rule for Computing $Ax$ (Page 4): If the product $Ax$ is defined, then the $i^{th}$ entry in $Ax$ is the sum of the products of corresponding entries from row $i$ of $A$ and from the vector $x$ .
Example 5 (Page 5):
- a.
  $\begin{bmatrix} 2 & -3 \ 8 & 0 \ -5 & 2 \ \end{bmatrix} \begin{bmatrix} 4 \ 7 \ \end{bmatrix} = \begin{bmatrix} (2)(4) + (-3)(7) \ (8)(4) + (0)(7) \ (-5)(4) + (2)(7) \ \end{bmatrix} = \begin{bmatrix} 8 - 21 \ 32 + 0 \ -20 + 14 \ \end{bmatrix} = \begin{bmatrix} -13 \ 32 \ -6 \ \end{bmatrix}$
- b. $\begin{bmatrix} 1 & 0 & 0 \ 0 & 1 & 0 \ 0 & 0 & 1 \ \end{bmatrix} \begin{bmatrix} r \ s \ t \ \end{bmatrix}$
  - This specific matrix is called an identity matrix and is denoted by $I$ .
  - The calculation shows that $I \begin{bmatrix} r \ s \ t \ \end{bmatrix} = \begin{bmatrix} (1)(r) + (0)(s) + (0)(t) \ (0)(r) + (1)(s) + (0)(t) \ (0)(r) + (0)(s) + (1)(t) \ \end{bmatrix} = \begin{bmatrix} r \ s \ t \ \end{bmatrix}$ , so $Ix = x$ for every $x$ in $R^3$ .
  - There is an analogous $n \times n$ identity matrix, often written as $In$ , for which $Ix = x$ for every $x$ in $R^n$ .

Properties of the Matrix-Vector Product (Theorem 5)

If $A$ is an $m \times n$ matrix, $u$ and $v$ are vectors in $R^n$ , and $c$ is a scalar, then:
- a. Distributive Property: $A(u + v) = Au + Av$
- b. Scalar Multiplication Property: $A(cu) = c(Au)$
Proof Sketch (Page 5, for $n = 3$ ):
- Let $A = \begin{bmatrix} a1 & a2 & a3 \end{bmatrix}$ , and let $ui$ and $vi$ be the $i^{th}$ entries of $u$ and $v$ respectively.
- For (a):
  $A(u + v) = \begin{bmatrix} a1 & a2 & a3 \end{bmatrix} \begin{bmatrix} u1 + v1 \ u2 + v2 \ u3 + v3 \ \end{bmatrix}$
  $= (u1 + v1)a1 + (u2 + v2)a2 + (u3 + v3)a3$
  $= (u1a1 + u2a2 + u3a3) + (v1a1 + v2a2 + v3a3)$
  $= Au + Av$
- For (b):
  $A(cu) = \begin{bmatrix} a1 & a2 & a3 \end{bmatrix} \begin{bmatrix} cu1 \ cu2 \ cu3 \ \end{bmatrix}$
  $= (cu1)a1 + (cu2)a2 + (cu3)a3$
  $= c(u1a1) + c(u2a2) + c(u3a3)$
  $= c(u1a1 + u2a2 + u3a3)$
  $= c(Au)$

Numerical Note on Computation of $Ax$ (Page 6)

To optimize computer algorithms for $Ax$ , the sequence of calculations should involve data stored in contiguous memory locations.
Fortran: Stores matrices as sets of columns. Algorithms in Fortran compute $Ax$ as a linear combination of the columns of $A$ (consistent with the definition).
C: Stores matrices by rows. Algorithms in C should compute $Ax$ using the row-vector rule (multiplication of rows of $A$ by vector $x$ ).

Practice Problems (Page 6)

Examples of exercises include:
- Exhibiting $b$ as a linear combination of columns of $A$ given that $x$ is a solution to $Ax=b$ .
- Verifying Theorem 5(a) for specific matrices and vectors.
- Constructing matrices $A$ and vectors $b, c$ such that $Ax=b$ has a solution but $Ax=c$ does not.
- Computing matrix-vector products using both the definition and the row-vector rule.
- Converting between matrix equations and vector equations.

Ch 1.4 The Matrix Equation Ax = b

1.4 The Matrix Equation Ax=bAx = bAx=b

Equivalence of System, Vector, and Matrix Equations

Connection to Span (from Section 1.3)

Linear Combinations and Span Properties

Definition of the Matrix-Vector Product AxAxAx

The Matrix Equation Ax=bAx = bAx=b

Theorem 3 (Page 2): Equivalence of Three Forms

Existence of Solutions

Determining Consistency for All Possible bbb

Theorem 4 (Page 3): Equivalence of Four Statements

Computation of AxAxAx: The Row-Vector Rule

Properties of the Matrix-Vector Product (Theorem 5)

Numerical Note on Computation of AxAxAx (Page 6)

Practice Problems (Page 6)

1.4 The Matrix Equation $Ax = b$

Definition of the Matrix-Vector Product $Ax$

The Matrix Equation $Ax = b$

Determining Consistency for All Possible $b$

Computation of $Ax$ : The Row-Vector Rule

Numerical Note on Computation of $Ax$ (Page 6)