We also examined numerical methods such as the Runge-Kutta methods, which are used to solve initial-value problems for ordinary differential equations. However, these problems focused only on solving nonlinear equations in a single variable, rather than nonlinear equations in several variables. The goal of this paper is to examine iterative methods for solving such systems of equations.
In computational mathematics, an iterative method is a mathematical procedure that uses an initial guess to generate a sequence of improving approximate solutions for a class of problems, in which the n-th approximation is derived from the previous ones. A specific implementation of an iterative method, including the termination criteria, is an algorithm of the iterative method. An iterative method is called convergent if the corresponding sequence converges for given initial approximations. A mathematically rigorous convergence analysis of an iterative method is usually performed; however, heuristic-based iterative methods are also common.
In contrast, direct methods attempt to solve the problem by a finite sequence of operations. In the absence of rounding errors, direct methods would deliver an exact solution (like solving a linear system of equations by Gaussian elimination). Iterative methods are often the only choice for nonlinear equations. However, iterative methods are often useful even for linear problems involving many variables (sometimes of the order of millions), where direct methods would be prohibitively expensive (and in some cases impossible) even with the best available computing power.[1]
Attractive fixed points
If an equation can be put into the form $f(x) = x$, and a solution $x$ is an attractive fixed point of the function $f$, then one may begin with a point $x_1$ in the basin of attraction of $x$, and let $x_{n+1} = f(x_n)$ for $n \geq 1$; the sequence $\{x_n\}_{n \geq 1}$ will converge to the solution $x$. Here $x_n$ is the $n$-th approximation or iterate of $x$, and $x_{n+1}$ is the next or $(n+1)$-th iterate of $x$. Alternatively, superscripts in parentheses are often used in numerical methods, so as not to interfere with subscripts that have other meanings (for example, $x^{(n+1)} = f(x^{(n)})$). If the function $f$ is continuously differentiable, a sufficient condition for convergence is that the spectral radius of the derivative is strictly bounded by one in a neighborhood of the fixed point. If this condition holds at the fixed point, then a sufficiently small neighborhood (basin of attraction) must exist.
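For illustration, here is a minimal Python sketch of this iteration (the function name `fixed_point` and the stopping rule are my own choices, not a standard API). It uses $f = \cos$, whose fixed point near $0.739$ is attractive because $|\cos'(x)| < 1$ in its neighborhood:

```python
import math

def fixed_point(f, x0, tol=1e-10, max_iter=100):
    """Iterate x_{n+1} = f(x_n) until successive iterates agree to within tol."""
    x = x0
    for _ in range(max_iter):
        x_next = f(x)
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    raise RuntimeError("fixed-point iteration did not converge")

# cos(x) = x has an attractive fixed point near 0.739; starting anywhere
# in its basin of attraction, the iterates converge to it.
print(fixed_point(math.cos, x0=1.0))  # ~0.7390851332
```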
Linear systems
In the case of a system of linear equations, the two main classes of iterative methods are the stationary iterative methods, and the more general Krylov subspace methods.
Stationary iterative methods
Introduction
Stationary iterative methods solve a linear system with an operator approximating the original one and, based on a measurement of the error in the result (the residual), form a 'correction equation' for which this process is repeated. While these methods are simple to derive, implement, and analyze, convergence is only guaranteed for a limited class of matrices.
Definition
An iterative method is defined by

$$\mathbf{x}^{k+1} := \Psi(\mathbf{x}^k), \quad k \geq 0,$$

and for a given linear system $A\mathbf{x} = \mathbf{b}$ with exact solution $\mathbf{x}^*$ the error by

$$\mathbf{e}^k := \mathbf{x}^k - \mathbf{x}^*, \quad k \geq 0.$$

An iterative method is called linear if there exists a matrix $C \in \mathbb{R}^{n \times n}$ such that

$$\mathbf{e}^{k+1} = C \mathbf{e}^k \quad \forall k \geq 0,$$

and this matrix is called the iteration matrix. An iterative method with a given iteration matrix $C$ is called convergent if the following holds:

$$\lim_{k \to \infty} C^k = 0.$$

An important theorem states that a given iterative method with iteration matrix $C$ is convergent if and only if its spectral radius $\rho(C)$ is smaller than unity, that is,

$$\rho(C) < 1.$$

The basic iterative methods work by splitting the matrix $A$ into

$$A = M - N,$$

where the matrix $M$ should be easily invertible. The iterative methods are now defined as

$$M \mathbf{x}^{k+1} = N \mathbf{x}^k + \mathbf{b}, \quad k \geq 0.$$

From this it follows that the iteration matrix is given by

$$C = I - M^{-1} A.$$
Examples
Basic examples of stationary iterative methods use a splitting of the matrix $A$ such as

$$A = D + L + U, \qquad D := \operatorname{diag}\big((a_{ii})_i\big),$$

where $D$ is only the diagonal part of $A$, $L$ is the strict lower triangular part of $A$, and $U$ is the strict upper triangular part of $A$.

- Richardson method: $M := \frac{1}{\omega} I \quad (\omega \neq 0)$
- Jacobi method: $M := D$
- Damped Jacobi method: $M := \frac{1}{\omega} D \quad (\omega \neq 0)$
- Gauss–Seidel method: $M := D + L$
- Successive over-relaxation method (SOR): $M := \frac{1}{\omega} D + L \quad (\omega \neq 0)$
- Symmetric successive over-relaxation (SSOR): $M := \frac{1}{\omega(2 - \omega)} (D + \omega L) D^{-1} (D + \omega U) \quad (\omega \notin \{0, 2\})$
Linear stationary iterative methods are also called relaxation methods.
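As an illustration, the following Python sketch (the function name and structure are my own, not a library API) implements the generic splitting iteration $M\mathbf{x}^{k+1} = N\mathbf{x}^k + \mathbf{b}$ in the equivalent correction form $\mathbf{x}^{k+1} = \mathbf{x}^k + M^{-1}(\mathbf{b} - A\mathbf{x}^k)$, here with the Gauss–Seidel choice $M = D + L$:

```python
import numpy as np

def stationary_solve(A, b, M, tol=1e-10, max_iter=500):
    """Generic stationary iteration from the splitting A = M - N:
    x^{k+1} = x^k + M^{-1} (b - A x^k)."""
    x = np.zeros_like(b)
    for _ in range(max_iter):
        r = b - A @ x                     # residual of the current iterate
        if np.linalg.norm(r) < tol:
            return x
        x = x + np.linalg.solve(M, r)     # M should be cheap to invert (here: triangular)
    raise RuntimeError("stationary iteration did not converge")

A = np.array([[4.0, 1.0], [2.0, 5.0]])    # diagonally dominant, so the iteration converges
b = np.array([1.0, 2.0])
M = np.tril(A)                            # Gauss-Seidel splitting: M = D + L
print(stationary_solve(A, b, M))          # ~[0.1667, 0.3333]
```

Swapping in `M = np.diag(np.diag(A))` gives the Jacobi method, illustrating that these methods differ only in the choice of $M$.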
Krylov subspace methods
Krylov subspace methods work by forming a basis of the sequence of successive matrix powers times the initial residual (the Krylov sequence). The approximations to the solution are then formed by minimizing the residual over the subspace formed. The prototypical method in this class is the conjugate gradient method (CG), which assumes that the system matrix is symmetric positive-definite. For symmetric (and possibly indefinite) matrices one works with the minimal residual method (MINRES). In the case of non-symmetric matrices, methods such as the generalized minimal residual method (GMRES) and the biconjugate gradient method (BiCG) have been derived.
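As a sketch of the prototypical method in this class, here is a bare-bones conjugate gradient iteration in Python, assuming a symmetric positive-definite matrix; it follows the usual textbook recurrence and omits everything a production solver would add (preconditioning, breakdown checks):

```python
import numpy as np

def conjugate_gradient(A, b, tol=1e-10):
    """Minimal CG sketch for symmetric positive-definite A."""
    x = np.zeros_like(b)
    r = b - A @ x                      # initial residual
    p = r.copy()                       # initial search direction
    rs = r @ r
    for _ in range(len(b)):            # at most n steps in exact arithmetic
        Ap = A @ p
        alpha = rs / (p @ Ap)          # step length along p
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p      # next direction, A-conjugate to the previous ones
        rs = rs_new
    return x

A = np.array([[4.0, 1.0], [1.0, 3.0]])  # symmetric positive-definite
b = np.array([1.0, 2.0])
print(conjugate_gradient(A, b))         # ~[1/11, 7/11]
```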
Convergence of Krylov subspace methods
Since these methods form a basis, it is evident that the method converges in N iterations, where N is the system size. However, in the presence of rounding errors this statement does not hold; moreover, in practice N can be very large, and the iterative process often reaches sufficient accuracy much earlier. The analysis of these methods is hard; convergence depends on a complicated function of the spectrum of the operator.
Preconditioners
The approximating operator that appears in stationary iterative methods can also be incorporated in Krylov subspace methods such as GMRES (alternatively, preconditioned Krylov methods can be considered as accelerations of stationary iterative methods), where they become transformations of the original operator to a presumably better conditioned one. The construction of preconditioners is a large research area.
History
Probably the first iterative method for solving a linear system appeared in a letter of Gauss to a student of his. He proposed solving a 4-by-4 system of equations by repeatedly solving the component in which the residual was the largest.
The theory of stationary iterative methods was solidly established with the work of D.M. Young starting in the 1950s. The Conjugate Gradient method was also invented in the 1950s, with independent developments by Cornelius Lanczos, Magnus Hestenes and Eduard Stiefel, but its nature and applicability were misunderstood at the time. Only in the 1970s was it realized that conjugacy based methods work very well for partial differential equations, especially the elliptic type.
References
- ^ Amritkar, Amit; de Sturler, Eric; Świrydowicz, Katarzyna; Tafti, Danesh; Ahuja, Kapil (2015). "Recycling Krylov subspaces for CFD applications and a new hybrid recycling solver". Journal of Computational Physics. 303: 222. arXiv:1501.03358. Bibcode:2015JCoPh.303..222A. doi:10.1016/j.jcp.2015.09.040.
Definitions and Basics
A linear equation system is a set of linear equations to be solved simultaneously. A linear equation takes the form

$$a_1 x_1 + a_2 x_2 + \cdots + a_n x_n = b,$$

where the coefficients $a_i$ and $b$ are constants and $x_i$ are the $n$ unknowns. Following the notation above, a system of linear equations is denoted as

$$\begin{aligned} a_{11} x_1 + a_{12} x_2 + \cdots + a_{1n} x_n &= b_1 \\ a_{21} x_1 + a_{22} x_2 + \cdots + a_{2n} x_n &= b_2 \\ &\ \,\vdots \\ a_{m1} x_1 + a_{m2} x_2 + \cdots + a_{mn} x_n &= b_m. \end{aligned}$$

This system consists of $m$ linear equations, each with $n$ coefficients, and has $n$ unknowns which have to fulfill the set of equations simultaneously. To simplify notation, it is possible to rewrite the above equations in matrix notation:

$$A \mathbf{x} = \mathbf{b}.$$

The elements $a_{ij}$ of the matrix $A$ are the coefficients of the equations, and the vectors $\mathbf{x}$ and $\mathbf{b}$ have the elements $x_i$ and $b_i$, respectively. In this notation each row forms a linear equation.
Over- and Under-Determined Systems
In order for a solution to be unique, there must be at least as many equations as unknowns. In terms of matrix notation this means that $m \geq n$. However, if a system contains more equations than unknowns ($m > n$) it is very likely (not to say the rule) that there exists no solution at all. Such systems are called over-determined since they have more equations than unknowns. They require special mathematical methods to solve approximately. The most common one is the least-squares method, which aims at minimizing the sum of the squared errors made in each equation when trying to solve the system. Such problems commonly occur in measurement or data-fitting processes.
Example
Assume an arbitrary triangle, and suppose one measures all three inner angles with limited precision. Furthermore, assume that the lengths of the sides $a$, $b$ and $c$ are known exactly. From trigonometry it is known that, using the law of cosines, one can compute an angle or the length of a side if all the other sides and angles are known. But as is known from geometry, the inner angles of a planar triangle must always add up to $180^\circ$. So we have three laws of cosines and the rule for the sum of angles. That makes a total of four equations for three unknowns, which gives an over-determined problem.
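For a concrete over-determined example in code, the following sketch uses NumPy's `np.linalg.lstsq` to fit a line through three points, minimizing the sum of squared residuals (the data values here are made up for illustration):

```python
import numpy as np

# Over-determined system (3 equations, 2 unknowns): fit a line y = p0 + p1*t
# through three measured points; lstsq minimizes the sum of squared errors.
A = np.array([[1.0, 0.0],
              [1.0, 1.0],
              [1.0, 2.0]])
b = np.array([0.1, 0.9, 2.1])
x, residual_ss, rank, sv = np.linalg.lstsq(A, b, rcond=None)
print(x)  # least-squares estimate of [p0, p1]
```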
On the other hand, if $m < n$, the problem arises that the solution is not unique, as at least one unknown can be freely chosen. Such systems are called under-determined. Again, mathematical methods exist to treat such problems; however, they will not be covered in this text.
This chapter will mainly concentrate on the case where $m = n$, and assumes so unless mentioned otherwise.
Exact Solution of Linear Systems
Solving a system $A\mathbf{x} = \mathbf{b}$ in terms of linear algebra is easy: just multiply the system with $A^{-1}$ from the left, resulting in $\mathbf{x} = A^{-1}\mathbf{b}$. However, finding $A^{-1}$ is (except for trivial cases) very hard. The following sections describe methods to find an exact (up to rounding errors) solution to the problem.
Diagonal and Triangular Systems
A diagonal matrix $D$ has nonzero entries only on the main diagonal:

$$D = \begin{pmatrix} d_{11} & & \\ & \ddots & \\ & & d_{nn} \end{pmatrix}.$$

The inverse of $D$ in such a case is simply a diagonal matrix with inverted entries, meaning

$$D^{-1} = \begin{pmatrix} 1/d_{11} & & \\ & \ddots & \\ & & 1/d_{nn} \end{pmatrix}.$$

It follows that a diagonal system $D\mathbf{x} = \mathbf{b}$ has the solution $x_i = b_i / d_{ii}$, which is very easy to compute.
An upper triangular system $U\mathbf{x} = \mathbf{b}$ is defined as

$$U = \begin{pmatrix} u_{11} & u_{12} & \cdots & u_{1n} \\ & u_{22} & \cdots & u_{2n} \\ & & \ddots & \vdots \\ & & & u_{nn} \end{pmatrix},$$

and a lower triangular system $L\mathbf{x} = \mathbf{b}$ as

$$L = \begin{pmatrix} l_{11} & & & \\ l_{21} & l_{22} & & \\ \vdots & \vdots & \ddots & \\ l_{n1} & l_{n2} & \cdots & l_{nn} \end{pmatrix}.$$

Backward substitution is the process of solving an upper triangular system, starting from the last row and working upwards:

$$x_i = \frac{1}{u_{ii}} \left( b_i - \sum_{j=i+1}^{n} u_{ij} x_j \right), \quad i = n, n-1, \ldots, 1.$$

Forward substitution, on the other hand, is the same procedure for lower triangular systems, starting from the first row and working downwards:

$$x_i = \frac{1}{l_{ii}} \left( b_i - \sum_{j=1}^{i-1} l_{ij} x_j \right), \quad i = 1, 2, \ldots, n.$$
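The two substitution formulas translate directly into Python; this is a minimal sketch with no pivoting or singularity checks, and the function names are my own:

```python
import numpy as np

def backward_substitution(U, b):
    """Solve U x = b for upper triangular U, working from the last row up."""
    n = len(b)
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (b[i] - U[i, i+1:] @ x[i+1:]) / U[i, i]
    return x

def forward_substitution(L, b):
    """Solve L x = b for lower triangular L, working from the first row down."""
    n = len(b)
    x = np.zeros(n)
    for i in range(n):
        x[i] = (b[i] - L[i, :i] @ x[:i]) / L[i, i]
    return x

U = np.array([[2.0, 1.0], [0.0, 3.0]])
print(backward_substitution(U, np.array([4.0, 6.0])))  # [1.0, 2.0]
```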
Gauss-Jordan Elimination
Instead of finding $A^{-1}$, this method relies on row operations. According to the laws of linear algebra, the rows of an equation system can be multiplied by a nonzero constant without changing the solution. Additionally, the rows can be added to and subtracted from one another. This leads to the idea of changing the system in such a way that $A$ has a structure which allows for easy solving for $\mathbf{x}$. One such structure would be a diagonal matrix, as mentioned above.
Gauss-Jordan elimination brings the matrix into diagonal form. To simplify the procedure, one often uses an adapted scheme. First, the matrix and the right-hand-side vector are combined into the augmented matrix

$$(A \mid \mathbf{b}) = \left( \begin{array}{ccc|c} a_{11} & \cdots & a_{1n} & b_1 \\ \vdots & & \vdots & \vdots \\ a_{n1} & \cdots & a_{nn} & b_n \end{array} \right).$$
To illustrate, consider how an easy-to-understand yet efficient algorithm can be built from four basic components:
- gelim: the main function; it iterates through a stack of reduced equations, building up the complete solution one variable at a time through a series of partial solutions.
- stack: calls reduce repeatedly, producing a stack of reduced equations, ordered from smallest (2 elements, such as <ax = b>) to largest.
- solve: solves for one variable, given a reduced equation and a partial solution. For example, given the reduced equation <aw + bx + cy = d> and the partial solution <x, y>, then w = (d - bx - cy)/a. Now the partial solution <w, x, y> is available for the next round, e.g. for <au + bv + cw + dx = e>.
- reduce: takes the first equation off the top and pushes it onto the stack; then produces a residual - a reduced matrix - by subtracting the elements of the normalized first equation from the corresponding elements of the remaining, normalized lower equations, e.g. b[j][k]/b[j][0] - a[k]/a[0]. This eliminates the first element in each of the lower equations by subtracting one from one, so only the remaining elements need be kept - ultimately, the residual is an output matrix with one less row and one less column than the input matrix. It is then used as the input for the next iteration.
It should be noted that multiplication could also be used in place of division; however, on larger matrices (e.g. n = 10) this has a cascading effect, producing overflows and NaNs. The division, looked at statistically, has the effect of normalizing the reduced matrices - producing numbers with a mean closer to zero and a smaller standard deviation; for randomly generated data, this produces reduced matrices with entries in the vicinity of ±1.
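Rendered in Python, the four components above might look as follows. This is a sketch under the stated assumptions: there is no pivoting, so every leading element must be nonzero, and the `stack` component is folded into the main loop of `gelim`:

```python
def reduce(m):
    """Pop the first equation; eliminate its leading variable from the rest.
    Each remaining row is normalized by its leading element and the normalized
    first row is subtracted, so the residual has one fewer row and column."""
    first, rest = m[0], m[1:]
    residual = [[row[k] / row[0] - first[k] / first[0] for k in range(1, len(row))]
                for row in rest]
    return first, residual

def solve_one(eq, partial):
    """Solve a reduced equation a*w + b*x + ... = rhs for its first variable,
    given the already-known trailing variables in `partial`."""
    coeffs, rhs = eq[:-1], eq[-1]
    return (rhs - sum(c * v for c, v in zip(coeffs[1:], partial))) / coeffs[0]

def gelim(m):
    """Gaussian elimination via a stack of reduced equations."""
    stack = []
    while len(m) > 1:
        first, m = reduce(m)
        stack.append(first)
    stack.append(m[0])                 # smallest equation: <ax = b>
    solution = []
    while stack:                       # solve one variable per reduced equation
        solution.insert(0, solve_one(stack.pop(), solution))
    return solution

# 2x + y = 5, x + 3y = 10  ->  x = 1, y = 3
print(gelim([[2.0, 1.0, 5.0], [1.0, 3.0, 10.0]]))
```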
Continuation still to be written
As it stands, this shows that it is not necessary to bring a system into full diagonal form: it is sufficient to bring it into triangular (either upper or lower) form, since the system can then be solved by backward or forward substitution, respectively.
LU-Factorization
- This section needs to be written
Approximate Solution of Linear Systems
- This section needs to be written
Jacobi Method
The Jacobi method is an iterative scheme in which each unknown is solved for using the values of all other unknowns from the previous iterate:

$$x_i^{(k+1)} = \frac{1}{a_{ii}} \Big( b_i - \sum_{j \neq i} a_{ij} x_j^{(k)} \Big), \quad i = 1, \ldots, n.$$
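A minimal NumPy sketch of this update (the function name and convergence test are my own choices):

```python
import numpy as np

def jacobi(A, b, tol=1e-10, max_iter=500):
    """Jacobi iteration: every component is updated from the previous iterate only."""
    D = np.diag(A)                     # diagonal entries a_ii
    R = A - np.diagflat(D)             # off-diagonal remainder
    x = np.zeros_like(b)
    for _ in range(max_iter):
        x_new = (b - R @ x) / D        # x_i <- (b_i - sum_{j != i} a_ij x_j) / a_ii
        if np.linalg.norm(x_new - x) < tol:
            return x_new
        x = x_new
    raise RuntimeError("Jacobi iteration did not converge")

A = np.array([[4.0, 1.0], [2.0, 5.0]])   # strictly diagonally dominant, so Jacobi converges
b = np.array([1.0, 2.0])
print(jacobi(A, b))                      # ~[0.1667, 0.3333]
```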
Gauss-Seidel Method
- This section needs to be written
SOR Algorithm
SOR is an abbreviation for successive over-relaxation. It is an iterative scheme that uses a relaxation parameter $\omega$ and is a generalization of the Gauss-Seidel method, which is recovered in the special case $\omega = 1$.
Given a square system of $n$ linear equations with unknown $\mathbf{x}$:

$$A \mathbf{x} = \mathbf{b},$$

where:

$$A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn} \end{pmatrix}, \qquad \mathbf{x} = \begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix}, \qquad \mathbf{b} = \begin{pmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{pmatrix}.$$

Then $A$ can be decomposed into a diagonal component $D$, and strictly lower and upper triangular components $L$ and $U$:

$$A = D + L + U,$$

where

$$D = \begin{pmatrix} a_{11} & 0 & \cdots & 0 \\ 0 & a_{22} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & a_{nn} \end{pmatrix}, \quad L = \begin{pmatrix} 0 & 0 & \cdots & 0 \\ a_{21} & 0 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \cdots & 0 \end{pmatrix}, \quad U = \begin{pmatrix} 0 & a_{12} & \cdots & a_{1n} \\ 0 & 0 & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 0 \end{pmatrix}.$$

The system of linear equations may be rewritten as:

$$(D + \omega L) \mathbf{x} = \omega \mathbf{b} - [\omega U + (\omega - 1) D] \mathbf{x}$$

for a constant $\omega > 1$, called the relaxation factor.

The method of successive over-relaxation is an iterative technique that solves the left hand side of this expression for $\mathbf{x}$, using the previous value for $\mathbf{x}$ on the right hand side. Analytically, this may be written as:

$$\mathbf{x}^{(k+1)} = (D + \omega L)^{-1} \big( \omega \mathbf{b} - [\omega U + (\omega - 1) D] \mathbf{x}^{(k)} \big).$$

However, by taking advantage of the triangular form of $(D + \omega L)$, the elements of $\mathbf{x}^{(k+1)}$ can be computed sequentially using forward substitution:

$$x_i^{(k+1)} = (1 - \omega) x_i^{(k)} + \frac{\omega}{a_{ii}} \Big( b_i - \sum_{j < i} a_{ij} x_j^{(k+1)} - \sum_{j > i} a_{ij} x_j^{(k)} \Big), \quad i = 1, 2, \ldots, n.$$
The choice of relaxation factor $\omega$ is not necessarily easy, and depends upon the properties of the coefficient matrix. For symmetric, positive-definite matrices it can be proven that $0 < \omega < 2$ will lead to convergence, but we are generally interested in faster convergence rather than just convergence.
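The forward-substitution form of the SOR sweep translates directly into Python; the following is a minimal sketch (function name, the sample value of $\omega$, and the stopping rule are my own choices):

```python
import numpy as np

def sor(A, b, omega=1.25, tol=1e-10, max_iter=500):
    """One SOR sweep per outer iteration: the Gauss-Seidel update for each
    component is blended with the old value via omega (omega = 1 is Gauss-Seidel)."""
    n = len(b)
    x = np.zeros_like(b)
    for _ in range(max_iter):
        x_old = x.copy()
        for i in range(n):
            # sum_{j<i} a_ij x_j^(k+1)  +  sum_{j>i} a_ij x_j^(k)
            sigma = A[i, :i] @ x[:i] + A[i, i+1:] @ x_old[i+1:]
            x[i] = (1 - omega) * x_old[i] + omega * (b[i] - sigma) / A[i, i]
        if np.linalg.norm(x - x_old) < tol:
            return x
    raise RuntimeError("SOR did not converge")

A = np.array([[4.0, 1.0], [1.0, 3.0]])   # symmetric positive-definite
b = np.array([1.0, 2.0])
print(sor(A, b))                         # ~[1/11, 7/11]
```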
Conjugate Gradients
- This section needs to be written
Multigrid Methods
- This section needs to be written