next up previous contents index
Next: Conditioning Up: Singular Value Decomposition Previous: Equivalences   Contents   Index


Decompositions

Define $\Sigma$ as the $m$ by $n$ matrix whose top $n$ rows contain ${\rm diag}(\sigma_1 ,\ldots, \sigma_n )$ and whose bottom $m-n$ rows are zero. Define the $m$ by $m$ matrix $U = [u_1,\ldots,u_m]$ and the $n$ by $n$ matrix $V = [v_1,\ldots,v_n]$. $U$ is called the left singular vector matrix of $A$, and $V$ is called the right singular vector matrix of $A$. Since the $u_i$ are orthogonal unit vectors, we see that $U^*U=I$; i.e., $U$ is a unitary matrix. If $A$ is real then the $u_i$ are real vectors, so $U^TU = I$, and we also say that $U$ is an orthogonal matrix. The same discussion applies to $V$. The $n+m$ equalities $Av_i = \sigma_i u_i$ and $A^* u_i = \sigma_i v_i$ for $i=1,\ldots,n$ and $A^*u_i = 0$ for $i=n+1,\ldots,m$ may also be written $AV = U \Sigma$ and $A^* U= V \Sigma^*$, or $A = U \Sigma V^*$. The factorization

\begin{displaymath}A = U \Sigma V^*\end{displaymath}

is called the SVD of $A$. In other words, $A$ is unitarily (orthogonally) equivalent to the diagonal matrix $\Sigma$.
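As a concrete check of this factorization, the following sketch (assuming NumPy is available; the matrix and its dimensions are arbitrary choices for illustration) computes the full SVD of a real $m$ by $n$ matrix, assembles $\Sigma$ exactly as defined above, and verifies $A = U \Sigma V^*$ and the orthogonality of $U$ and $V$:

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 5, 3                      # illustrative sizes with m > n
A = rng.standard_normal((m, n))

# Full SVD: U is m-by-m, Vh (= V^*) is n-by-n, s holds sigma_1,...,sigma_n.
U, s, Vh = np.linalg.svd(A, full_matrices=True)

# Build the m-by-n Sigma: diag(sigma_1,...,sigma_n) on top, zero rows below.
Sigma = np.zeros((m, n))
Sigma[:n, :n] = np.diag(s)

assert np.allclose(A, U @ Sigma @ Vh)    # A = U Sigma V^*
assert np.allclose(U.T @ U, np.eye(m))   # U orthogonal (real case)
assert np.allclose(Vh @ Vh.T, np.eye(n)) # V orthogonal
```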

There are several ``smaller'' versions of the SVD that are often computed. Let $U_t = [u_1,\ldots,u_t]$ be an $m$ by $t$ matrix of the first $t$ left singular vectors, $V_t = [v_1,\ldots,v_t]$ be an $n$ by $t$ matrix of the first $t$ right singular vectors, and $\Sigma_t = {\rm diag}(\sigma_1 ,\ldots, \sigma_t)$ be a $t$ by $t$ matrix of the first $t$ singular values. Then we can make the following definitions.

Thin SVD.
$A = U_n \Sigma_n V_n^*$ is the thin (or economy-sized) SVD of $A$. The thin SVD is much smaller to store and faster to compute than the full SVD when $n \ll m$.

Compact SVD.
$A = U_{r} \Sigma_{r} V_{r}^*$ is the compact SVD of $A$. The compact SVD is much smaller to store and faster to compute than the thin SVD when $r \ll n$.

Truncated SVD.
$A_t = U_{t} \Sigma_{t} V_{t}^*$ is the rank-$t$ truncated (or partial) SVD of $A$, where $t < r$. Among all rank-$t$ matrices $B$, $B=A_t$ is the unique minimizer of $\Vert A - B \Vert _F$ and also minimizes (perhaps not uniquely) $\Vert A - B \Vert _2$. The truncated SVD is much smaller to store and cheaper to compute than the compact SVD when $t \ll r$, and is the most common form of the SVD computed in applications.
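The three definitions above can be illustrated with NumPy (a sketch under the assumption of a real test matrix whose rank $r$ is known by construction; the sizes $m$, $n$, $r$, $t$ are arbitrary). The thin SVD comes from `full_matrices=False`, the compact SVD keeps only the nonzero singular values, and the truncated SVD achieves the Frobenius-norm error $(\sum_{i>t} \sigma_i^2)^{1/2}$ predicted by its optimality:

```python
import numpy as np

rng = np.random.default_rng(1)
m, n, r = 8, 5, 3
# Build a rank-r matrix so the compact SVD is genuinely smaller.
A = rng.standard_normal((m, r)) @ rng.standard_normal((r, n))

# Thin SVD: U_n is m-by-n, Sigma_n is n-by-n, V_n is n-by-n.
Un, s, Vnh = np.linalg.svd(A, full_matrices=False)
assert np.allclose(A, Un @ np.diag(s) @ Vnh)

# Compact SVD: keep only the r singular values above a roundoff tolerance.
tol = max(m, n) * np.finfo(float).eps * s[0]
r_num = int(np.sum(s > tol))
Ur, sr, Vrh = Un[:, :r_num], s[:r_num], Vnh[:r_num, :]
assert r_num == r and np.allclose(A, Ur @ np.diag(sr) @ Vrh)

# Truncated SVD: the best rank-t approximation in the Frobenius norm.
t = 2
At = Un[:, :t] @ np.diag(s[:t]) @ Vnh[:t, :]
assert np.isclose(np.linalg.norm(A - At, "fro"),
                  np.sqrt(np.sum(s[t:] ** 2)))
```

The roundoff tolerance used to decide the numerical rank is one common convention, not the only one.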

The thin SVD may also be written $A = \sum_{i=1}^n \sigma_i u_i v_i^*$. Each $(\sigma_i, u_i, v_i)$ is called a singular triplet. The compact and truncated SVDs may be written similarly (the sum going from $i=1$ to $r$, or $i=1$ to $t$, respectively).
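The sum-of-triplets form can be checked directly: each term $\sigma_i u_i v_i^*$ is a rank-one matrix, and their sum reconstructs $A$. A minimal NumPy sketch (matrix sizes chosen arbitrarily):

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((6, 4))
U, s, Vh = np.linalg.svd(A, full_matrices=False)

# Rebuild A as the sum of rank-one terms sigma_i * u_i * v_i^*.
A_sum = sum(s[i] * np.outer(U[:, i], Vh[i, :]) for i in range(len(s)))
assert np.allclose(A, A_sum)
```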

If $A$ is $m$ by $n$ with $m<n$, then its SVD is $A = U \Sigma V^*$, where $U$ is $m$ by $m$, $\Sigma$ is $m$ by $n$ with ${\rm diag}(\sigma_1 ,\ldots, \sigma_m)$ in its first $m$ columns and zeros in columns $m+1$ through $n$, and $V$ is $n$ by $n$. Its thin SVD is $A= U_m \Sigma_m V_m^*$, and the compact SVD and truncated SVD are as above.
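For the wide case $m < n$, the shapes change as just described: $\Sigma$ now has its diagonal block in the first $m$ columns. A short sketch (again assuming NumPy; the sizes are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)
m, n = 3, 5                      # wide matrix: m < n
A = rng.standard_normal((m, n))

U, s, Vh = np.linalg.svd(A, full_matrices=True)
Sigma = np.zeros((m, n))
Sigma[:m, :m] = np.diag(s)       # diag(s) in columns 1..m, zeros in m+1..n

assert U.shape == (m, m) and Vh.shape == (n, n)
assert np.allclose(A, U @ Sigma @ Vh)
```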

More generally, if we take a subset of $k$ columns of $U$ and $V$
(say $\hat{U}= U(:,[2,3,5])$ = columns 2, 3, and 5, and $\hat{V}= V(:,[2,3,5])$), then these columns span a pair of singular subspaces of $A$. If we take the corresponding submatrix $\hat{\Sigma}= {\rm diag}(\sigma_2 , \sigma_3 , \sigma_5 )$ of $\Sigma$, then we can write the corresponding partial SVD $\hat{U}^* A \hat{V}= \hat{\Sigma}$. If the columns in $\hat{U}$ and $\hat{V}$ are replaced by $k$ different orthonormal vectors spanning the same singular subspaces, then we get a different partial SVD $\check{U}^* A \check{V}= \check{A}$, where $\check{A}$ is a $k$ by $k$ matrix whose singular values are those of $\hat{\Sigma}$, though $\check{A}$ may not be diagonal.
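Both identities can be verified numerically. The sketch below (assuming NumPy; the matrix, the column subset, and the orthogonal rotation $Q$ are illustrative choices) first checks $\hat{U}^* A \hat{V}= \hat{\Sigma}$, then rotates both bases by the same orthogonal $Q$ so that $\check{A} = \check{U}^* A \check{V}$ is generally not diagonal but keeps the same singular values:

```python
import numpy as np

rng = np.random.default_rng(4)
A = rng.standard_normal((7, 6))
U, s, Vh = np.linalg.svd(A)

cols = [1, 2, 4]                 # zero-based: singular triplets 2, 3, 5
Uhat, Vhat = U[:, cols], Vh[cols, :].T
Sighat = np.diag(s[cols])
assert np.allclose(Uhat.T @ A @ Vhat, Sighat)   # partial SVD

# Replace the columns by different orthonormal bases of the same subspaces.
Q, _ = np.linalg.qr(rng.standard_normal((3, 3)))
Ucheck, Vcheck = Uhat @ Q, Vhat @ Q
Acheck = Ucheck.T @ A @ Vcheck   # generally not diagonal...
assert np.allclose(np.sort(np.linalg.svd(Acheck, compute_uv=False)),
                   np.sort(s[cols]))            # ...same singular values
```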


Susan Blackford 2000-11-20