Consider two integers $m$ and $n$, with $m > n$, and let $A$, $x$ and $b$ be real matrices and vectors. In the case $Ax = b$ with $A$ of dimension $m \times n$ (and therefore $x$ of dimension $n \times 1$ and $b$ of dimension $m \times 1$), the pseudoinverse can be interpreted through a projection of $b$ onto the column space of $A$: $A^+ b$ is the $x$ that minimizes the L2 norm of the residual $Ax - b$.
However, in the case where $A$ is of dimension $n \times m$ (and $A$ has full rank), infinitely many linear combinations of the columns of $A$ reproduce the vector $b$. Applying $A^+$ to $b$ ($A^+$ being the pseudoinverse of $A$) gives only a single vector among those.
How can the pseudoinverse be interpreted in that case? What are the properties of the vector $A^+ b$?
Answer
If $A$ is a linear mapping between two vector spaces, i.e. $A: U \to V$, then $A^+(v) = u$ iff $u \in (\ker A)^\perp$ and $Au = P_{\operatorname{rng} A}\, v$.
Thus, you get $u$ from $v$ by first projecting $v$ orthogonally onto the range of $A$, then finding some $u'$ with $A(u') = P_{\operatorname{rng} A}\, v$, and finally projecting that $u'$ onto the orthogonal complement of the kernel of $A$. It doesn't matter which $u'$ you picked: for two possible choices $u_1, u_2$ we have $u_1 - u_2 \in \ker A$, and thus $P_{(\ker A)^\perp}(u_1 - u_2) = 0$, i.e. they get projected onto the same element $u$ in the end.
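The three steps above can be checked numerically. The following is a minimal sketch, assuming NumPy and a randomly generated rank-deficient matrix (so that both projections are non-trivial); the names `P_range`, `P_kerperp` and `u_prime` and the tolerance `1e-10` are illustrative choices, not part of the original answer.

```python
import numpy as np

rng = np.random.default_rng(0)

# A 4x3 matrix of rank 2, so ker A != {0} and rng A != R^4 (assumed example).
A = rng.standard_normal((4, 2)) @ rng.standard_normal((2, 3))
v = rng.standard_normal(4)

# Orthonormal bases of rng A and (ker A)^perp from the SVD.
U, s, Vt = np.linalg.svd(A)
r = int(np.sum(s > 1e-10 * s[0]))      # numerical rank
P_range = U[:, :r] @ U[:, :r].T        # orthogonal projector onto rng A
P_kerperp = Vt[:r].T @ Vt[:r]          # orthogonal projector onto (ker A)^perp

# Step 1: project v onto the range of A.
w = P_range @ v

# Step 2: pick *some* u' with A u' = w. A kernel vector is added on purpose
# to show that the particular choice of u' does not matter.
kernel_vec = Vt[-1]                    # spans ker A here, since rank 2 < 3
u_prime = np.linalg.lstsq(A, w, rcond=1e-10)[0] + 5.0 * kernel_vec

# Step 3: project u' onto the orthogonal complement of the kernel.
u = P_kerperp @ u_prime

print(np.allclose(u, np.linalg.pinv(A, rcond=1e-10) @ v))
```

The final comparison is against `np.linalg.pinv`, which computes the pseudoinverse via the SVD.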
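You can also look at this another way. If you restrict $A$ to $(\ker A)^\perp$, then this restricted $A$ is injective and hence has an inverse, defined on $\operatorname{rng} A$ and with range $(\ker A)^\perp$. You can then define the pseudoinverse as $A^+ := \left(A|_{(\ker A)^\perp}\right)^{-1} P_{\operatorname{rng} A}$.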
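As a rough illustration of this second definition, the sketch below (assuming NumPy; the matrix and the names `Q_range`, `Q_kerperp` and `A_restricted` are made up for the example) writes the restricted map as an invertible $r \times r$ matrix in orthonormal bases of $(\ker A)^\perp$ and $\operatorname{rng} A$, inverts it, and composes it with the projection onto the range.

```python
import numpy as np

rng = np.random.default_rng(1)

# A 5x4 matrix of rank 2 (assumed example).
A = rng.standard_normal((5, 2)) @ rng.standard_normal((2, 4))

U, s, Vt = np.linalg.svd(A)
r = int(np.sum(s > 1e-10 * s[0]))      # numerical rank
Q_range = U[:, :r]                     # columns: orthonormal basis of rng A
Q_kerperp = Vt[:r].T                   # columns: orthonormal basis of (ker A)^perp

# Matrix of A restricted to (ker A)^perp, expressed in these two bases.
# It is square (r x r) and invertible because the restriction is a bijection
# from (ker A)^perp onto rng A.
A_restricted = Q_range.T @ A @ Q_kerperp

# A^+ = (restricted A)^{-1} composed with the projection onto rng A.
# Q_range.T takes the coordinates of the projected vector, and Q_kerperp
# maps the result back into the ambient space.
A_plus = Q_kerperp @ np.linalg.inv(A_restricted) @ Q_range.T

print(np.allclose(A_plus, np.linalg.pinv(A, rcond=1e-10)))
```

With bases taken from the SVD, `A_restricted` comes out as the diagonal matrix of nonzero singular values, which is why this construction agrees with the usual SVD formula for the pseudoinverse.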
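Your two cases amount to either $P_{(\ker A)^\perp}$ or $P_{\operatorname{rng} A}$ being the identity, i.e. to either $\ker A = \{0\}$ (the full-rank $m \times n$ case) or $\operatorname{rng} A = V$ (the full-rank $n \times m$ case). In the second case, $A^+ b$ is therefore the unique solution of $Ax = b$ that lies in $(\ker A)^\perp$, which is also the solution of smallest L2 norm.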
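Both cases can be tried out on small random matrices. This is only a sketch, assuming NumPy and that the random matrices have full rank (which holds with probability 1 for Gaussian entries); all variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 5, 3

# Case 1: A is m x n with full column rank, so ker A = {0}.
# A^+ b is the least-squares solution, given by the normal equations.
A_tall = rng.standard_normal((m, n))
b_tall = rng.standard_normal(m)
x_ls = np.linalg.pinv(A_tall) @ b_tall
x_normal_eq = np.linalg.solve(A_tall.T @ A_tall, A_tall.T @ b_tall)
print(np.allclose(x_ls, x_normal_eq))

# Case 2: A is n x m with full row rank, so rng A is the whole space.
# A^+ b solves A x = b exactly and lies in (ker A)^perp, the row space of A.
A_wide = rng.standard_normal((n, m))
b_wide = rng.standard_normal(n)
x_min = np.linalg.pinv(A_wide) @ b_wide
y = np.linalg.solve(A_wide @ A_wide.T, b_wide)
print(np.allclose(A_wide @ x_min, b_wide))   # exact solution
print(np.allclose(x_min, A_wide.T @ y))      # x_min = A^T y is in the row space

# Any other solution differs by a kernel vector and has a larger norm.
_, _, Vt = np.linalg.svd(A_wide)
kernel_vec = Vt[-1]                          # in ker A, since rank 3 < 5
x_other = x_min + 2.0 * kernel_vec
print(np.linalg.norm(x_other) > np.linalg.norm(x_min))
```

Because `x_min` is orthogonal to the kernel, the squared norm of `x_other` is the squared norm of `x_min` plus that of the added kernel component, which is why every other solution is longer.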