next up previous contents index
Next: Implicitly Restarted Arnoldi Method Up: Arnoldi Method   Y. Saad Previous: Explicit Restarts   Contents   Index


Deflation

So far we have described algorithms that compute only one eigenpair. When several eigenpairs are sought, there are two possible options; the implementation we now consider incorporates a deflation process.

The first is to take $v_1$ to be a linear combination of the approximate eigenvectors when we restart. For example, if we need to compute the $p$ rightmost eigenvectors, we may take

\begin{displaymath}\hat v_1 = \sum_{i=1}^p \rho_i \tilde u_i, \end{displaymath}

where the eigenvalues are numbered in decreasing order of their real parts. The vector $v_1$ is then obtained by normalizing $ \hat v_1
$. The simplest choice for the coefficients $ \rho_i $ is to take $
\rho_i = 1, i=1,\ldots,p$. There are several drawbacks to this approach, the most important of which is that there is no systematic way of choosing the coefficients $ \rho_i $. As a result, convergence is difficult to achieve for hard problems.
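As an illustration of this restart strategy, the following numpy sketch forms $\hat v_1$ with the simple choice $\rho_i = 1$. Since no preceding Arnoldi run is available here, the exact eigenvectors stand in for the approximate ones $\tilde u_i$; that substitution is purely for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 3
A = rng.standard_normal((n, n))

# In practice the u_i would be approximate eigenvectors from a previous
# Arnoldi run; here we take the exact eigenvectors belonging to the p
# eigenvalues with largest real part, purely for illustration.
w, V = np.linalg.eig(A)
idx = np.argsort(-w.real)[:p]
U = V[:, idx]

rho = np.ones(p)                        # simplest choice: rho_i = 1
v1_hat = U @ rho                        # linear combination of the u_i
v1 = v1_hat / np.linalg.norm(v1_hat)    # normalized restart vector
```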

A more reliable alternative is to compute one eigenpair at a time and use deflation. The matrix $A$ can be deflated explicitly by constructing progressively the first $k$ Schur vectors. If a previous orthogonal basis $U_{k-1} = [u_1,\ldots,u_{k-1}]$ of the invariant subspace has already been computed, then to compute the eigenvalue $\lambda_{k}$, we can work with the matrix

\begin{displaymath}
\tilde A = A - U_{k-1} \Sigma U_{k-1}^{\ast}, \end{displaymath}

in which $\Sigma = \mathrm{diag}(\sigma_i)$ is a diagonal matrix of shifts. The eigenvalues of $\tilde A$ consist of two groups. Those associated with the Schur vectors $u_1, \ldots, u_{k-1}$ are shifted to $\tilde \lambda_i = \lambda_i - \sigma_i$, while the others remain unchanged. If the eigenvalues with largest real parts are sought, the shifts are selected so that $\lambda_k$ becomes the eigenvalue of $\tilde A$ with largest real part. It is also possible to deflate by simply projecting out the components associated with the invariant subspace spanned by $U_{k-1}$; this leads to operating with the matrix

\begin{displaymath}
\tilde A = A (I - U_{k-1} U_{k-1}^{\ast}).
\end{displaymath}

Note that if $A U_{k-1} = U_{k-1} R_{k-1} $ is the partial Schur decomposition associated with the first $k-1$ Ritz values, then $ \tilde A = A - U_{k-1} R_{k-1} U_{k-1}^{\ast}$. Those eigenvalues associated with the Schur vectors $u_1, \ldots, u_{k-1}$ will now all be moved to zero.
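Both deflation formulas are easy to check numerically. The sketch below assumes a symmetric $A$, so that its orthonormal eigenvectors double as Schur vectors and the partial Schur factor $R_{k-1}$ is diagonal; this simplification, and the particular choice of shifts, are assumptions made only for the demonstration.

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 6, 3
M = rng.standard_normal((n, n))
A = (M + M.T) / 2                  # symmetric: eigenvectors are Schur vectors

w, V = np.linalg.eigh(A)
order = np.argsort(-w)             # decreasing eigenvalues
w, V = w[order], V[:, order]
U = V[:, :k - 1]                   # first k-1 Schur vectors

# Shift deflation: lambda_i -> lambda_i - sigma_i for i < k, rest unchanged.
# The shifts chosen here move lambda_1, ..., lambda_{k-1} to the bottom of
# the spectrum, so lambda_k becomes the largest eigenvalue of A_shift.
sigma = w[:k - 1] - w.min()
A_shift = A - U @ np.diag(sigma) @ U.T

# Schur-based deflation: A U = U R with R = diag(lambda_1,...,lambda_{k-1})
# in the symmetric case; those eigenvalues are all moved to zero.
R = np.diag(w[:k - 1])
A_schur = A - U @ R @ U.T
```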

A better implementation of deflation, which fits well with the Arnoldi procedure, is to work with a single basis $v_1, v_2, \ldots , v_m $ whose first vectors are the Schur vectors that have already converged. Suppose that $k-1$ such vectors have converged and call them $v_1, v_2
,\ldots,v_{k-1}$. Then we start by choosing a vector $v_k$ which is orthogonal to $v_1,\ldots,v_{k-1} $ and of norm 1. Next we perform $m-k$ steps of an Arnoldi procedure in which orthogonality of the vector $v_j$ against all previous $v_i$'s, including $v_1,\ldots,v_{k-1}$, is enforced. This generates an orthogonal basis of the subspace

\begin{displaymath}
{\rm span}\{ v_1,\ldots, v_{k-1} , v_k, A v_k, \ldots, A^{m-k} v_k \} \ .
\end{displaymath} (125)

Thus, the dimension of this modified Krylov subspace is constant and equal to $m$ in general. A sketch of this implicit deflation procedure combined with the Arnoldi method appears in the following.


\begin{algorithm}{Explicitly Restarted Arnoldi Method with
Deflation for NHEP}
{
\begin{tabbing}
xxxx \= xxx \= xxx \= \kill
\quad \ldots \\
\> \> \> {\bf go to} (3) \\
{\rm (16)} \> \> \> {\bf end if}
\end{tabbing}
}
\end{algorithm}
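The core of this procedure can be sketched in a few lines of numpy: the locked vectors sit at the front of the basis, and every new Arnoldi vector is orthogonalized against them as well as against the active vectors. All names below are illustrative, not taken from the algorithm's numbered steps.

```python
import numpy as np

def deflated_arnoldi(A, V_locked, v_start, steps):
    """Arnoldi steps that keep every new vector orthogonal to the locked
    Schur vectors as well as to the earlier active vectors (a sketch)."""
    n = A.shape[0]
    k = V_locked.shape[1]
    V = np.zeros((n, k + steps + 1))
    V[:, :k] = V_locked
    # Start vector orthogonal to the locked set, normalized.
    v = v_start - V_locked @ (V_locked.T @ v_start)
    V[:, k] = v / np.linalg.norm(v)
    H = np.zeros((k + steps + 1, k + steps))
    for j in range(k, k + steps):
        w = A @ V[:, j]
        for i in range(j + 1):             # includes the locked vectors
            H[i, j] = V[:, i] @ w
            w -= H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        V[:, j + 1] = w / H[j + 1, j]
    return V, H

# Usage sketch: lock an orthonormal set, then extend the basis by Arnoldi.
rng = np.random.default_rng(5)
n, k1, steps = 10, 2, 4                    # k1 locked (converged) vectors
A = rng.standard_normal((n, n))
Q, _ = np.linalg.qr(rng.standard_normal((n, k1)))   # stand-in Schur vectors
V, H = deflated_arnoldi(A, Q, rng.standard_normal(n), steps)
```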

Note that in the loop, the Schur vectors associated with the eigenvalues $ \lambda_1,\ldots,\lambda_{k-1} $ will not be touched in subsequent steps. They are sometimes referred to as ``locked vectors.'' Similarly, the portion of the upper triangular matrix corresponding to these vectors is also locked.

\begin{displaymath}
\underbrace{\left[v_1, v_2, \ldots, v_{k-1}\right.}_{Locked},
\underbrace{\left. v_k, v_{k+1}, \ldots v_m \right] }_{Active}
\end{displaymath}

When a new Schur vector converges, step (10) computes the $k$th column of $R$ associated with this new basis vector. In the subsequent steps, the approximate eigenvalues are the eigenvalues of the $ m \times m $ Hessenberg matrix $H_m$ defined in the algorithm and whose $k \times k$ principal submatrix is upper triangular. For example, when $m=6$ and after the second Schur vector, $k=2$, has converged, the matrix $H_m$ will have the form

\begin{displaymath}
H_m ~ = ~
\left[ \begin{array}{cccccc}
* & * & * & * & * & * \\
  & * & * & * & * & * \\
  &   & * & * & * & * \\
  &   & * & * & * & * \\
  &   &   & * & * & * \\
  &   &   &   & * & * \\
\end{array} \right].
\end{displaymath} (126)

In the subsequent steps, only the eigenvalues not associated with the $2 \times 2$ upper triangular matrix need to be considered.

It can be shown that, in exact arithmetic, the $(m-k) \times (m-k) $ Hessenberg matrix in the lower right block is the same matrix that would be obtained from an Arnoldi run applied to the matrix

\begin{displaymath}
\tilde A = (I-U_{k-1} U_{k-1}^{\ast} ) A.
\end{displaymath}

Thus, we are implicitly projecting out the invariant subspace already computed from the range of $A$.
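This equivalence can be checked numerically by running the deflated recurrence alongside a plain Arnoldi run on $(I-U_{k-1}U_{k-1}^{\ast})A$ from the same starting vector. The sketch below uses a symmetric matrix so that exact eigenvectors supply the invariant subspace; the whole setup is illustrative only.

```python
import numpy as np

def arnoldi(B, v1, steps):
    """Plain Arnoldi recurrence on B from a unit vector v1 (a sketch)."""
    n = B.shape[0]
    V = np.zeros((n, steps + 1))
    V[:, 0] = v1
    H = np.zeros((steps + 1, steps))
    for j in range(steps):
        w = B @ V[:, j]
        for i in range(j + 1):
            H[i, j] = V[:, i] @ w
            w -= H[i, j] * V[:, i]
        H[j + 1, j] = np.linalg.norm(w)
        V[:, j + 1] = w / H[j + 1, j]
    return H

rng = np.random.default_rng(4)
n, k, steps = 12, 3, 4
M = rng.standard_normal((n, n))
A = (M + M.T) / 2                      # symmetric: eigenvectors = Schur vectors
w_, V_ = np.linalg.eigh(A)
U = V_[:, -(k - 1):]                   # an exact invariant subspace U_{k-1}

v = rng.standard_normal(n)
v -= U @ (U.T @ v)
v /= np.linalg.norm(v)                 # v_k, orthogonal to the locked vectors

# Deflated Arnoldi: orthogonalize new vectors against the locked vectors
# and the active vectors alike; store only the active (trailing) block.
W = np.zeros((n, k - 1 + steps + 1))
W[:, :k - 1] = U
W[:, k - 1] = v
H_defl = np.zeros((steps + 1, steps))
for j in range(steps):
    w = A @ W[:, k - 1 + j]
    for i in range(k - 1 + j + 1):     # includes the locked vectors
        h = W[:, i] @ w
        w -= h * W[:, i]
        if i >= k - 1:                 # only active rows enter the block
            H_defl[i - (k - 1), j] = h
    H_defl[j + 1, j] = np.linalg.norm(w)
    W[:, k - 1 + j + 1] = w / H_defl[j + 1, j]

# The trailing Hessenberg block matches Arnoldi applied to (I - U U^T) A.
B = (np.eye(n) - U @ U.T) @ A
```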


Susan Blackford 2000-11-20