Linear Transformations and Matrix Theory in Vector Spaces

Linear Transformations on FF-spaces: Definitions and Examples

In the study of vector spaces over a field FF, a central concept is the mapping between these spaces that preserves their algebraic structure. Let VV and WW be FF-spaces. A mapping T:VWT: V \rightarrow W is defined as a linear transformation if it satisfies two fundamental properties. First, it must satisfy additivity, meaning T(u+v)=T(u)+T(v)T(u + v) = T(u) + T(v) for all vectors uu and vv in VV. Second, it must satisfy homogeneity of degree one, meaning T(αu)=αT(u)T(\alpha u) = \alpha T(u) for all scalars αF\alpha \in F and for all vectors uVu \in V. Alternatively, these two requirements can be combined into a single condition: T(αu+v)=αT(u)+T(v)T(\alpha u + v) = \alpha T(u) + T(v) for all u,vVu, v \in V and all αF\alpha \in F. If the transformation maps a space into itself, such that T:VVT: V \rightarrow V, it is referred to as a linear operator on VV.

Consider Example 2.1.2, where a mapping T:F(3)F(2)T: F^{(3)} \rightarrow F^{(2)} is defined by T(x1,x2,x3)=(x1+x2,x3)T(x_1, x_2, x_3) = (x_1 + x_2, x_3). To verify linearity, let u=(x1,x2,x3)u = (x_1, x_2, x_3) and v=(y1,y2,y3)v = (y_1, y_2, y_3) be elements of F(3)F^{(3)}, and let αF\alpha \in F. The sum of the vectors is u+v=(x1+y1,x2+y2,x3+y3)u + v = (x_1 + y_1, x_2 + y_2, x_3 + y_3). Applying the transformation, T(u+v)=(x1+y1+x2+y2,x3+y3)T(u + v) = (x_1 + y_1 + x_2 + y_2, x_3 + y_3), which can be rearranged as (x1+x2,x3)+(y1+y2,y3)=T(u)+T(v)(x_1 + x_2, x_3) + (y_1 + y_2, y_3) = T(u) + T(v). Similarly, for scalar multiplication, αu=(αx1,αx2,αx3)\alpha u = (\alpha x_1, \alpha x_2, \alpha x_3), and T(αu)=(αx1+αx2,αx3)T(\alpha u) = (\alpha x_1 + \alpha x_2, \alpha x_3). Factoring out the scalar yields α(x1+x2,x3)=αT(u)\alpha(x_1 + x_2, x_3) = \alpha T(u). Since both conditions hold, TT is a linear transformation.

In contrast, Example 2.1.3 presents a mapping T:F(3)F(2)T: F^{(3)} \rightarrow F^{(2)} defined by T(x1,x2,x3)=(x1+1,x3)T(x_1, x_2, x_3) = (x_1 + 1, x_3). Using Method 1 to disprove linearity, let u=(1,0,0)u = (1, 0, 0) and v=(2,0,0)v = (2, 0, 0). Then u+v=(3,0,0)u + v = (3, 0, 0). Calculating the values, T(u+v)=T(3,0,0)=(4,0)T(u + v) = T(3, 0, 0) = (4, 0), while T(u)=(2,0)T(u) = (2, 0) and T(v)=(3,0)T(v) = (3, 0). Adding the results, T(u)+T(v)=(5,0)T(u) + T(v) = (5, 0). Because (4,0)(5,0)(4, 0) \neq (5, 0), the additivity condition fails. Method 2 demonstrates failure of homogeneity: let u=(1,0,0)u = (1, 0, 0) and α=2\alpha = 2. Then αu=(2,0,0)\alpha u = (2, 0, 0). Here, T(αu)=(3,0)T(\alpha u) = (3, 0) but αT(u)=2(2,0)=(4,0)\alpha T(u) = 2(2, 0) = (4, 0). Since (3,0)(4,0)(3, 0) \neq (4, 0), the mapping is not linear.

Additional examples include matrix-based transformations and special trivial cases. Example 2.1.4 defines T:Mm×n(F)Mm×n(F)T: M_{m \times n}(F) \rightarrow M_{m \times n}(F) by T(A)=MANT(A) = MAN, where MM and NN are fixed m×mm \times m and n×nn \times n matrices respectively. For matrices AA and BB, T(A+B)=M(A+B)N=MAN+MBN=T(A)+T(B)T(A + B) = M(A + B)N = MAN + MBN = T(A) + T(B), and T(αA)=M(αA)N=α(MAN)=αT(A)T(\alpha A) = M(\alpha A)N = \alpha(MAN) = \alpha T(A), proving linearity. Example 2.1.5 introduces the identity linear transformation I:VVI: V \rightarrow V, where I(u)=uI(u) = u. It satisfies I(u+v)=u+v=I(u)+I(v)I(u + v) = u + v = I(u) + I(v) and I(αu)=αu=αI(u)I(\alpha u) = \alpha u = \alpha I(u). Example 2.1.6 defines the zero linear transformation Z:VWZ: V \rightarrow W by Z(u)=0Z(u) = 0. It satisfies Z(u+v)=0=0+0=Z(u)+Z(v)Z(u + v) = 0 = 0 + 0 = Z(u) + Z(v) and Z(αu)=0=α0=αZ(u)Z(\alpha u) = 0 = \alpha \cdot 0 = \alpha Z(u).

Certain remarks and lemmas further characterize these transformations. Remark 2.1.7 notes that for any linear transformation T:VWT: V \rightarrow W, it is always true that T(0)=0T(0) = 0. Furthermore, the transformation respects linear combinations: T(α1u1+α2u2++αnun)=α1T(u1)+α2T(u2)++αnT(un)T(\alpha_1 u_1 + \alpha_2 u_2 + \dots + \alpha_n u_n) = \alpha_1 T(u_1) + \alpha_2 T(u_2) + \dots + \alpha_n T(u_n). Lemma 2.1.8 establishes the existence and uniqueness of linear transformations: if VV has a basis B={v1,v2,,vn}B = \{v_1, v_2, \dots, v_n\}, and S={w1,w2,,wn}S = \{w_1, w_2, \dots, w_n\} is any set of vectors in WW, there exists a unique linear transformation T:VWT: V \rightarrow W such that T(vj)=wjT(v_j) = w_j for each j=1,2,,nj = 1, 2, \dots, n.

Matrix of Linear Transformation On FF-spaces

A linear transformation between finite-dimensional vector spaces can be represented as a matrix. Let VV and WW be FF-spaces with dimensions nn and mm respectively. Let B={v1,v2,,vn}B = \{v_1, v_2, \dots, v_n\} be a basis for VV and S={w1,w2,,wm}S = \{w_1, w_2, \dots, w_m\} be a basis for WW. For a linear transformation T:VWT: V \rightarrow W, each image vector T(vj)T(v_j) can be expressed as a unique linear combination of the basis vectors in SS: T(vj)=i=1maijwiT(v_j) = \sum_{i=1}^m a_{ij} w_i. The matrix A=(aij)Mm×n(F)A = (a_{ij}) \in M_{m \times n}(F) formed by these coordinates is called the matrix of TT relative to the bases BB and SS, denoted as A=[T]B,SA = [T]_{B,S}.

Example 2.2.2 illustrates this process. Let T:F(3)F(2)T: F^{(3)} \rightarrow F^{(2)} be defined by T(x1,x2,x3)=(x1+x2,2x1+x3)T(x_1, x_2, x_3) = (x_1 + x_2, 2x_1 + x_3). To find the matrix AA relative to standard bases B={e1,e2,e3}B = \{e_1, e_2, e_3\} and S={f1,f2}S = \{f_1, f_2\}, we calculate: T(e1)=T(1,0,0)=(1,2)=1f1+2f2T(e_1) = T(1, 0, 0) = (1, 2) = 1f_1 + 2f_2, T(e2)=T(0,1,0)=(1,0)=1f1+0f2T(e_2) = T(0, 1, 0) = (1, 0) = 1f_1 + 0f_2, and T(e3)=T(0,0,1)=(1,1)=1f1+1f2T(e_3) = T(0, 0, 1) = (-1, 1) = -1f_1 + 1f_2. The resulting matrix is: A=(111201)A = \begin{pmatrix} 1 & 1 & -1 \\ 2 & 0 & 1 \end{pmatrix}

In part (2) of the same example, we find a different matrix BB relative to different bases B={v1=(1,0,1),v2=(1,1,1),v3=(1,0,0)}B' = \{v_1=(1, 0, -1), v_2=(1, 1, 1), v_3=(1, 0, 0)\} and S={w1=(0,1),w2=(1,0)}S' = \{w_1=(0, 1), w_2=(1, 0)\}. We compute: T(v1)=T(1,0,1)=(1,1)=1w1+1w2T(v_1) = T(1, 0, -1) = (1, 1) = 1w_1 + 1w_2, T(v2)=T(1,1,1)=(2,3)=3w1+2w2T(v_2) = T(1, 1, 1) = (2, 3) = 3w_1 + 2w_2, and T(v3)=T(1,0,0)=(1,2)=2w1+1w2T(v_3) = T(1, 0, 0) = (1, 2) = 2w_1 + 1w_2. The resulting matrix is: B=(132121)B = \begin{pmatrix} 1 & 3 & 2 \\ 1 & 2 & 1 \end{pmatrix} Note: The coordinate vectors for the images are placed as columns in the matrix BB.

The Effect of Change of Bases

When the bases of the vector spaces are changed, the matrix representing the linear transformation also changes. If VV has bases BB and BB', and WW has bases SS and SS', the relationship between the matrix A=[T]B,SA = [T]_{B,S} and B=[T]B,SB = [T]_{B',S'} involves change-of-basis matrices. Lemma 2.1.8 implies the existence of linear transformations P:VVP: V \rightarrow V and Q:WWQ: W \rightarrow W related to these coordinate shifts. Let CC be the matrix of PP relative to BB and BB', and DD be the matrix of QQ relative to SS and SS'. In the general case, B=DAC1B = D A C^{-1}. For a linear operator T:VVT: V \rightarrow V, where the same basis transition applies to both the domain and codomain, D=CD = C, leading to the relation B=C1ACB = C^{-1} A C.

Example 2.3.2 demonstrates this with a linear transformation T:VVT: V \rightarrow V where dim(V)=3\dim(V) = 3 and the original matrix relative to basis B={u,v,w}B = \{u, v, w\} is: A=(110111001)A = \begin{pmatrix} 1 & 1 & 0 \\ -1 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix} We wish to find matrix BB relative to a new basis B={u+v,u2v+w,vw}B' = \{u+v, u-2v+w, v-w\}. The change-of-basis matrix CC is constructed from the coordinates of the new basis vectors relative to the old basis: C=(110121011)C = \begin{pmatrix} 1 & 1 & 0 \\ 1 & -2 & 1 \\ 0 & 1 & -1 \end{pmatrix}

To find C1C^{-1}, we follow several steps. First, calculate the determinant det(C)=1(21)1(10)+0=1+1=2\det(C) = 1(2-1) - 1(-1-0) + 0 = 1 + 1 = 2. Next, determine the matrix of cofactors: c11=1,c12=1,c13=1,c21=1,c22=1,c23=1,c31=1,c32=1,c33=3c_{11}=1, c_{12}=1, c_{13}=1, c_{21}=1, c_{22}=-1, c_{23}=-1, c_{31}=1, c_{32}=-1, c_{33}=-3. The adjoint matrix is the transpose of the cofactor matrix: Adj(C)=(111111113)\text{Adj}(C) = \begin{pmatrix} 1 & 1 & 1 \\ 1 & -1 & -1 \\ 1 & -1 & -3 \end{pmatrix} The inverse is C1=12Adj(C)C^{-1} = \frac{1}{2} \text{Adj}(C). Finally, the new matrix is obtained via B=C1ACB = C^{-1} A C: B=12(111111113)(110111001)(110121011)==(000032010)B = \frac{1}{2} \begin{pmatrix} 1 & 1 & 1 \\ 1 & -1 & -1 \\ 1 & -1 & -3 \end{pmatrix} \begin{pmatrix} 1 & 1 & 0 \\ -1 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 1 & 0 \\ 1 & -2 & 1 \\ 0 & 1 & -1 \end{pmatrix} = \dots = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 3 & -2 \\ 0 & 1 & 0 \end{pmatrix}

The Kernel and Image of a Linear Transformation

For a linear transformation T:VWT: V \rightarrow W, two critical subspaces are defined: the kernel and the image. The kernel of TT, denoted ker(T)\text{ker}(T), consists of all vectors uVu \in V such that T(u)=0T(u) = 0. The image of TT, denoted im(T)\text{im}(T), consists of all vectors wWw \in W such that w=T(u)w = T(u) for some uVu \in V. Lemma 2.4.2 provides the proof that ker(T)\text{ker}(T) is a subspace of VV and im(T)\text{im}(T) is a subspace of WW. For the kernel, the zero vector is included because T(0)=0T(0) = 0. If u,vker(T)u, v \in \text{ker}(T), then T(u+v)=T(u)+T(v)=0+0=0T(u+v) = T(u) + T(v) = 0 + 0 = 0, proving closure under addition. Similarly, T(αu)=αT(u)=α(0)=0T(\alpha u) = \alpha T(u) = \alpha(0) = 0, proving closure under scalar multiplication. Similar logic applies to the image subspace.

Definition 2.4.3 introduces dimensionality terms: the nullity of TT is dim(kerT)\dim(\text{ker} T), and the rank of TT is dim(imT)\dim(\text{im} T). Theorem 2.4.5, known as the Rank-Nullity Theorem, states that for a transformation T:VWT: V \rightarrow W, nullity(T)+rank(T)=dim(V)\text{nullity}(T) + \text{rank}(T) = \dim(V).

In Example 2.4.6, T:F(3)F(3)T: F^{(3)} \rightarrow F^{(3)} is defined by T(x,y,z)=(x+2z,2x+y,2y+2z)T(x, y, z) = (x+2z, 2x+y, -2y+2z). To find a basis for im(T)\text{im}(T), we look at the images of standard basis vectors: v1=(1,2,0)v_1 = (1, 2, 0), v2=(0,1,2)v_2 = (0, 1, -2), and v3=(2,0,2)v_3 = (2, 0, 2). These generate the image. Checking for independence via the determinant: det()=0\det(\dots) = 0, indicating dependence. Testing the subset S={(1,2,0),(0,1,2)}S = \{(1, 2, 0), (0, 1, -2)\}, we find they are linearly independent. Thus, rank(T)=2\text{rank}(T) = 2. By the Rank-Nullity Theorem, nullity(T)=32=1\text{nullity}(T) = 3 - 2 = 1. To find the basis for the kernel, we solve T(x,y,z)=(0,0,0)T(x, y, z) = (0, 0, 0). The resulting system yields x=2zx = -2z and y=4zy = 4z. Thus (x,y,z)=z(2,4,1)(x, y, z) = z(-2, 4, 1), and a basis for ker(T)\text{ker}(T) is B={(2,4,1)}\text{B} = \{(-2, 4, 1)\}.

Rank of Matrices

The rank of a matrix is a fundamental property related to its invertible sub-structures. The column rank of a matrix AMm×n(F)A \in M_{m \times n}(F) is the maximum number of linearly independent column vectors in AA. The row rank is the maximum number of linearly independent row vectors. Example 2.5.2 and 2.5.4 compute these for a specific matrix AA. For: A=(121203121)A = \begin{pmatrix} 1 & 2 & -1 \\ 2 & 0 & 3 \\ 1 & 2 & -1 \end{pmatrix} By applying row operations such as R22R1R_2 - 2R_1 and R3R1R_3 - R_1, the matrix is reduced to a form showing only two rows are linearly independent. Thus, the row rank is 2. Similarly, the column rank is 2. Theorem 2.5.5 states that row equivalent matrices have the same column rank, and Theorem 2.5.6 confirms that for any matrix AA, rank(A)=column rank(A)=row rank(A)\text{rank}(A) = \text{column rank}(A) = \text{row rank}(A).

Solution of Systems of Linear Equations

Systems of linear equations can be subdivided into homogeneous and non-homogeneous categories. A homogeneous system is of the form AX=0AX = 0. Theorem 2.6.1 states that the solutions form a subspace of F(n)F^{(n)} with dimension nrank(A)n - \text{rank}(A). A non-zero solution exists if and only if n>rank(A)n > \text{rank}(A). If n=rank(A)n = \text{rank}(A), only the trivial zero solution exists. Example 2.6.2 shows a system where n=4n = 4 and rank(A)3\text{rank}(A) \leq 3, thus non-zero solutions exist. Example 2.6.3 shows a system where n=3n = 3 and rank(A)=3\text{rank}(A) = 3, meaning only the zero solution exists.

A non-homogeneous system is of the form AX=BAX = B. Theorem 2.6.4 states that a solution exists if and only if rank(AB)=rank(A)\text{rank}(A|B) = \text{rank}(A), where (AB)(A|B) is the augmented matrix. Example 2.6.5 (first instance) provides a system where rank(AB)=3=rank(A)\text{rank}(A|B) = 3 = \text{rank}(A), and the solution is found to be x1=1,x2=2,x3=1,x4=3,x5=1x_1=1, x_2=2, x_3=1, x_4=-3, x_5=1. However, in a second instance of Example 2.6.5, a system is presented where rank(AB)=3\text{rank}(A|B) = 3 but rank(A)=2\text{rank}(A) = 2. Since the ranks are not equal, the system is inconsistent and has no solution.