Ch 1.4 The Matrix Equation Ax = b
1.4 The Matrix Equation
Equivalence of System, Vector, and Matrix Equations
Connection to Span (from Section 1.3)
A vector equation is equivalent to a system of linear equations.
Example (Page 1): A system involving a vector and vectors can be represented by an augmented matrix.
Row reduction of the augmented matrix:
The system is consistent if and only if there's no pivot in the last (fourth) column. This means must be zero.
Therefore, is in if and only if .
Important Note: The presence of a free variable does not guarantee that a system is consistent.
Linear Combinations and Span Properties
If vectors and are in , then there exist scalars and such that:
Then, their sum is also in :
Since , , and are also scalars, this shows is a linear combination of .
Definition of the Matrix-Vector Product
A fundamental idea in linear algebra is to view a linear combination of vectors as the product of a matrix and a vector.
Definition (Page 1): If is an matrix with columns , and if is in (a column vector with entries), then the product of and , denoted by , is the linear combination of the columns of using the corresponding entries in as weights.
Condition: is defined only if the number of columns of (which is ) equals the number of entries in (also ).
Example 1 (Page 1):
a.
b.
Example 2 (Page 2): To write the linear combination as a matrix times a vector:
Place the vectors into the columns of a matrix .
Place the weights into a vector .
Thus,
The Matrix Equation
Any system of linear equations can be written as an equivalent matrix equation in the form .
Example (System to Matrix Equation, Page 2):
System of linear equations:
Equivalent vector equation:
Equivalent matrix equation:
The matrix here is the coefficient matrix of the system.
Theorem 3 (Page 2): Equivalence of Three Forms
If is an matrix with columns , and is in , then the matrix equation has the same solution set as:
The vector equation:
The system of linear equations whose augmented matrix is:
Significance: This theorem provides a powerful tool, allowing problems to be viewed in three equivalent ways: as a matrix equation, a vector equation, or a system of linear equations. This flexibility aids in constructing mathematical models and solving problems through row reduction of the augmented matrix.
Existence of Solutions
Key Fact (Page 3): The equation has a solution if and only if is a linear combination of the columns of . Equivalently, is consistent if and only if is in .
Determining Consistency for All Possible
Example 3 (Page 3): Let . Is the equation consistent for all possible ?
Row reduce the augmented matrix :
The third entry in the augmented column is .
The system is not consistent for every because some choices of can make nonzero, leading to an inconsistent system.
The reduced matrix shows that is consistent if and only if the entries in satisfy:
This equation describes a plane through the origin in , which is the set of all linear combinations of the columns of . The equation fails to be consistent for all because the echelon form of has a row of zeros.
Theorem 4 (Page 3): Equivalence of Four Statements
Let be an matrix. The following statements are logically equivalent (all true or all false):
a. For each in , the equation has a solution.
b. Each in is a linear combination of the columns of .
c. The columns of span . (This means if are the columns of ).
d. The matrix has a pivot position in every row.
Warning (Page 4): Theorem 4 is about the coefficient matrix , not the augmented matrix . If an augmented matrix has a pivot position in every row, the equation may or may not be consistent (e.g., if the pivot is in the augmented column).
Proof Sketch (Page 6):
Statements (a), (b), and (c) are equivalent by definition.
To show (a) and (d) are equivalent: Let be an echelon form of .
If (d) is true (pivot in every row of ), then cannot have a pivot in the augmented column, so has a solution for any (meaning (a) is true).
If (d) is false (last row of is all zeros), then we can construct a vector with a in its last entry. The system represents an inconsistent system. Since row operations are reversible, this implies an original system that is also inconsistent, thus (a) is false.
Computation of : The Row-Vector Rule
This rule provides a more efficient method for calculating entries in by hand.
Example 4 (Page 4): Compute , where and .
By definition:
Notice that the first entry in () is the sum of products of entries from the first row of and the entries in .
Similarly, the second entry () is from the second row of and . And so on.
Row-Vector Rule for Computing (Page 4): If the product is defined, then the entry in is the sum of the products of corresponding entries from row of and from the vector .
Example 5 (Page 5):
a.
b.
This specific matrix is called an identity matrix and is denoted by .
The calculation shows that , so for every in .
There is an analogous identity matrix, often written as , for which for every in .
Properties of the Matrix-Vector Product (Theorem 5)
If is an matrix, and are vectors in , and is a scalar, then:
a. Distributive Property:
b. Scalar Multiplication Property:
Proof Sketch (Page 5, for ):
Let , and let and be the entries of and respectively.
For (a):
For (b):
Numerical Note on Computation of (Page 6)
To optimize computer algorithms for , the sequence of calculations should involve data stored in contiguous memory locations.
Fortran: Stores matrices as sets of columns. Algorithms in Fortran compute as a linear combination of the columns of (consistent with the definition).
C: Stores matrices by rows. Algorithms in C should compute using the row-vector rule (multiplication of rows of by vector ).
Practice Problems (Page 6)
Examples of exercises include:
Exhibiting as a linear combination of columns of given that is a solution to .
Verifying Theorem 5(a) for specific matrices and vectors.
Constructing matrices and vectors such that has a solution but does not.
Computing matrix-vector products using both the definition and the row-vector rule.
Converting between matrix equations and vector equations.