LU Factorization Notes
LU Factorization
Introduction
These notes cover LU factorization based on section 2.5 of LAE, focusing on the pure LU factorization method. This method does not involve switching rows. The key is to understand under what conditions an LU factorization exists for a given matrix .
Existence of LU Factorization
For an matrix to have an LU factorization, it must be reducible to echelon form using only type one row operations, following the standard Gaussian elimination procedure.
This means:
Pivots must be in their initial state in the top row.
No row swaps are allowed to avoid zeros in pivot positions.
Use each pivot to eliminate all entries below it.
No single row scaling.
If these conditions are met, can be factored into , where:
is the echelon form obtained from the above procedure.
is a square matrix of size , where is the number of rows of .
Properties of L
has the following properties:
Lower triangular: zeros everywhere above its main diagonal.
Main diagonal consists of ones.
Solving Ax = b using LU Factorization
If has an LU factorization, solving becomes computationally efficient:
Solve using forward substitution.
Solve using backward substitution.
This two-step process yields the solution to .
Example
Consider a matrix that can be reduced to echelon form using type one row operations. Suppose the row operations:
result in the echelon form . In this case, is given by:
Notice that the entries in the lower portion of are the negatives of the coefficients used in reducing to . This is because is composed of inverse elementary matrices.
The key point: .
Solving Ax = b
To solve , we first solve .
Given
and
,
then
implies
Solving this system yields:
So, .
Next, solve .
Using the previously determined and given some (echelon form), we solve via backward substitution.
Backward Substitution
Given an upper triangular matrix (echelon form) , solve from the bottom up.
For example, assume looks like this:
then implies solve:
From the bottom equation, .
Rearrange the second equation: , implies .
Rearrange the first equation: , implies .
Remember to present the solution in the correct order: .
Computational Efficiency
Solving as involves first solving for in and then solving for in .
This is a computationally efficient approach, especially when solving for multiple different vectors.
Finding the LU Factorization
To find the LU factorization, we need to reduce to its echelon form using only type one row operations, maintaining the order of pivots from top to bottom.
If this is not possible, then a pure LU factorization does not exist.
As row reduction is performed, track the row operations applied.
Determining L
Take the negatives of the coefficients used in the row reduction and place them in the corresponding lower triangular entries in . This makes the process simple.
For example:
Given row operations:
The lower triangular matrix will have ones on its diagonal. Use the negatives of the given coefficients, 1, -2 and -2 in operation equations, to get:
Lay Textbook Reference
The Lay textbook presents the algorithm for finding differently but yields the same outcome. See example 2 on page 128 of the fifth edition, section 2.5.
Why the Algorithm Works
Inverses of Elementary Matrices
Understanding how inverses of elementary matrices work is crucial.
Type One Row Operations: If replaces row with itself plus copies of row , then replaces row with itself minus copies of row .
Row Swap Operations: Performing the same row swap twice returns the original matrix. Thus, the inverse is the matrix itself.
Row Scaling: If scales row by (where ), then scales row by .
The inverses of elementary matrices correspond exactly to the inverse row operations.
Sequence of Row Operations
If can be reduced to echelon form using type one row operations, then a sequence of elementary matrices can be applied:
Taking the inverses in reverse order:
Here, . Thus, .
Action of Inverse Row Operations
The inverse row operations modify the lower triangular portion of (identity matrix), leaving the diagonal unchanged.
Filling in the negatives of the original row operation coefficients into the corresponding lower triangular positions gives .
By applying these operations in reverse order, we maintain the structure of the matrix and correctly compute .
Example
Suppose the row operations E1 to E6 in sequence convert A to echelon form U:
Then by multiplying the equation by the inverses of these elementary matrices we obtain these expressions:
Which yields: A = LU
We want to justify whether this process for the specific example actually gives us a lower triangular matrix with ones down its digonal, as well as what happens to the raw operation.