Linear Transformations and Matrix Theory in Vector Spaces
Linear Transformations on F-spaces: Definitions and Examples
In the study of vector spaces over a field F, a central concept is the mapping between these spaces that preserves their algebraic structure. Let V and W be F-spaces. A mapping T:V→W is defined as a linear transformation if it satisfies two fundamental properties. First, it must satisfy additivity, meaning T(u+v)=T(u)+T(v) for all vectors u and v in V. Second, it must satisfy homogeneity of degree one, meaning T(αu)=αT(u) for all scalars α∈F and for all vectors u∈V. Alternatively, these two requirements can be combined into a single condition: T(αu+v)=αT(u)+T(v) for all u,v∈V and all α∈F. If the transformation maps a space into itself, such that T:V→V, it is referred to as a linear operator on V.
Consider Example 2.1.2, where a mapping T:F(3)→F(2) is defined by T(x1,x2,x3)=(x1+x2,x3). To verify linearity, let u=(x1,x2,x3) and v=(y1,y2,y3) be elements of F(3), and let α∈F. The sum of the vectors is u+v=(x1+y1,x2+y2,x3+y3). Applying the transformation, T(u+v)=(x1+y1+x2+y2,x3+y3), which can be rearranged as (x1+x2,x3)+(y1+y2,y3)=T(u)+T(v). Similarly, for scalar multiplication, αu=(αx1,αx2,αx3), and T(αu)=(αx1+αx2,αx3). Factoring out the scalar yields α(x1+x2,x3)=αT(u). Since both conditions hold, T is a linear transformation.
In contrast, Example 2.1.3 presents a mapping T:F(3)→F(2) defined by T(x1,x2,x3)=(x1+1,x3). Using Method 1 to disprove linearity, let u=(1,0,0) and v=(2,0,0). Then u+v=(3,0,0). Calculating the values, T(u+v)=T(3,0,0)=(4,0), while T(u)=(2,0) and T(v)=(3,0). Adding the results, T(u)+T(v)=(5,0). Because (4,0)=(5,0), the additivity condition fails. Method 2 demonstrates failure of homogeneity: let u=(1,0,0) and α=2. Then αu=(2,0,0). Here, T(αu)=(3,0) but αT(u)=2(2,0)=(4,0). Since (3,0)=(4,0), the mapping is not linear.
Additional examples include matrix-based transformations and special trivial cases. Example 2.1.4 defines T:Mm×n(F)→Mm×n(F) by T(A)=MAN, where M and N are fixed m×m and n×n matrices respectively. For matrices A and B, T(A+B)=M(A+B)N=MAN+MBN=T(A)+T(B), and T(αA)=M(αA)N=α(MAN)=αT(A), proving linearity. Example 2.1.5 introduces the identity linear transformation I:V→V, where I(u)=u. It satisfies I(u+v)=u+v=I(u)+I(v) and I(αu)=αu=αI(u). Example 2.1.6 defines the zero linear transformation Z:V→W by Z(u)=0. It satisfies Z(u+v)=0=0+0=Z(u)+Z(v) and Z(αu)=0=α⋅0=αZ(u).
Certain remarks and lemmas further characterize these transformations. Remark 2.1.7 notes that for any linear transformation T:V→W, it is always true that T(0)=0. Furthermore, the transformation respects linear combinations: T(α1u1+α2u2+⋯+αnun)=α1T(u1)+α2T(u2)+⋯+αnT(un). Lemma 2.1.8 establishes the existence and uniqueness of linear transformations: if V has a basis B={v1,v2,…,vn}, and S={w1,w2,…,wn} is any set of vectors in W, there exists a unique linear transformation T:V→W such that T(vj)=wj for each j=1,2,…,n.
Matrix of Linear Transformation On F-spaces
A linear transformation between finite-dimensional vector spaces can be represented as a matrix. Let V and W be F-spaces with dimensions n and m respectively. Let B={v1,v2,…,vn} be a basis for V and S={w1,w2,…,wm} be a basis for W. For a linear transformation T:V→W, each image vector T(vj) can be expressed as a unique linear combination of the basis vectors in S: T(vj)=∑i=1maijwi. The matrix A=(aij)∈Mm×n(F) formed by these coordinates is called the matrix of T relative to the bases B and S, denoted as A=[T]B,S.
Example 2.2.2 illustrates this process. Let T:F(3)→F(2) be defined by T(x1,x2,x3)=(x1+x2,2x1+x3). To find the matrix A relative to standard bases B={e1,e2,e3} and S={f1,f2}, we calculate: T(e1)=T(1,0,0)=(1,2)=1f1+2f2, T(e2)=T(0,1,0)=(1,0)=1f1+0f2, and T(e3)=T(0,0,1)=(−1,1)=−1f1+1f2. The resulting matrix is:
A=(1210−11)
In part (2) of the same example, we find a different matrix B relative to different bases B′={v1=(1,0,−1),v2=(1,1,1),v3=(1,0,0)} and S′={w1=(0,1),w2=(1,0)}. We compute: T(v1)=T(1,0,−1)=(1,1)=1w1+1w2, T(v2)=T(1,1,1)=(2,3)=3w1+2w2, and T(v3)=T(1,0,0)=(1,2)=2w1+1w2. The resulting matrix is:
B=(113221)
Note: The coordinate vectors for the images are placed as columns in the matrix B.
The Effect of Change of Bases
When the bases of the vector spaces are changed, the matrix representing the linear transformation also changes. If V has bases B and B′, and W has bases S and S′, the relationship between the matrix A=[T]B,S and B=[T]B′,S′ involves change-of-basis matrices. Lemma 2.1.8 implies the existence of linear transformations P:V→V and Q:W→W related to these coordinate shifts. Let C be the matrix of P relative to B and B′, and D be the matrix of Q relative to S and S′. In the general case, B=DAC−1. For a linear operator T:V→V, where the same basis transition applies to both the domain and codomain, D=C, leading to the relation B=C−1AC.
Example 2.3.2 demonstrates this with a linear transformation T:V→V where dim(V)=3 and the original matrix relative to basis B={u,v,w} is:
A=1−10110011
We wish to find matrix B relative to a new basis B′={u+v,u−2v+w,v−w}. The change-of-basis matrix C is constructed from the coordinates of the new basis vectors relative to the old basis:
C=1101−2101−1
To find C−1, we follow several steps. First, calculate the determinant det(C)=1(2−1)−1(−1−0)+0=1+1=2. Next, determine the matrix of cofactors: c11=1,c12=1,c13=1,c21=1,c22=−1,c23=−1,c31=1,c32=−1,c33=−3. The adjoint matrix is the transpose of the cofactor matrix:
Adj(C)=1111−1−11−1−3
The inverse is C−1=21Adj(C). Finally, the new matrix is obtained via B=C−1AC:
B=211111−1−11−1−31−101100111101−2101−1=⋯=0000310−20
The Kernel and Image of a Linear Transformation
For a linear transformation T:V→W, two critical subspaces are defined: the kernel and the image. The kernel of T, denoted ker(T), consists of all vectors u∈V such that T(u)=0. The image of T, denoted im(T), consists of all vectors w∈W such that w=T(u) for some u∈V. Lemma 2.4.2 provides the proof that ker(T) is a subspace of V and im(T) is a subspace of W. For the kernel, the zero vector is included because T(0)=0. If u,v∈ker(T), then T(u+v)=T(u)+T(v)=0+0=0, proving closure under addition. Similarly, T(αu)=αT(u)=α(0)=0, proving closure under scalar multiplication. Similar logic applies to the image subspace.
Definition 2.4.3 introduces dimensionality terms: the nullity of T is dim(kerT), and the rank of T is dim(imT). Theorem 2.4.5, known as the Rank-Nullity Theorem, states that for a transformation T:V→W, nullity(T)+rank(T)=dim(V).
In Example 2.4.6, T:F(3)→F(3) is defined by T(x,y,z)=(x+2z,2x+y,−2y+2z). To find a basis for im(T), we look at the images of standard basis vectors: v1=(1,2,0), v2=(0,1,−2), and v3=(2,0,2). These generate the image. Checking for independence via the determinant: det(…)=0, indicating dependence. Testing the subset S={(1,2,0),(0,1,−2)}, we find they are linearly independent. Thus, rank(T)=2. By the Rank-Nullity Theorem, nullity(T)=3−2=1. To find the basis for the kernel, we solve T(x,y,z)=(0,0,0). The resulting system yields x=−2z and y=4z. Thus (x,y,z)=z(−2,4,1), and a basis for ker(T) is B={(−2,4,1)}.
Rank of Matrices
The rank of a matrix is a fundamental property related to its invertible sub-structures. The column rank of a matrix A∈Mm×n(F) is the maximum number of linearly independent column vectors in A. The row rank is the maximum number of linearly independent row vectors. Example 2.5.2 and 2.5.4 compute these for a specific matrix A. For:
A=121202−13−1
By applying row operations such as R2−2R1 and R3−R1, the matrix is reduced to a form showing only two rows are linearly independent. Thus, the row rank is 2. Similarly, the column rank is 2. Theorem 2.5.5 states that row equivalent matrices have the same column rank, and Theorem 2.5.6 confirms that for any matrix A, rank(A)=column rank(A)=row rank(A).
Solution of Systems of Linear Equations
Systems of linear equations can be subdivided into homogeneous and non-homogeneous categories. A homogeneous system is of the form AX=0. Theorem 2.6.1 states that the solutions form a subspace of F(n) with dimension n−rank(A). A non-zero solution exists if and only if n>rank(A). If n=rank(A), only the trivial zero solution exists. Example 2.6.2 shows a system where n=4 and rank(A)≤3, thus non-zero solutions exist. Example 2.6.3 shows a system where n=3 and rank(A)=3, meaning only the zero solution exists.
A non-homogeneous system is of the form AX=B. Theorem 2.6.4 states that a solution exists if and only if rank(A∣B)=rank(A), where (A∣B) is the augmented matrix. Example 2.6.5 (first instance) provides a system where rank(A∣B)=3=rank(A), and the solution is found to be x1=1,x2=2,x3=1,x4=−3,x5=1. However, in a second instance of Example 2.6.5, a system is presented where rank(A∣B)=3 but rank(A)=2. Since the ranks are not equal, the system is inconsistent and has no solution.