Broyden's method

In numerical analysis, Broyden's method is a quasi-Newton method for finding roots in $k$ variables. It was originally described by C. G. Broyden in 1965.^[1]

Newton's method for solving $\mathbf{f}(\mathbf{x}) = \mathbf{0}$ uses the Jacobian matrix, $\mathbf{J}$, at every iteration. However, computing this Jacobian can be difficult and expensive. The idea behind Broyden's method is to compute the whole Jacobian at most once, at the first iteration, and to apply rank-one updates at all later iterations.

In 1979 Gay proved that when Broyden's method is applied to a linear system of size $n \times n$, it terminates in at most $2n$ steps,^[2] although, like all quasi-Newton methods, it may not converge for nonlinear systems.

Description of the method

Solving a single-variable equation

In the secant method, we replace the first derivative $f'$ at $x_{n}$ with the finite-difference approximation:

f'(x_{n})\simeq {\frac {f(x_{n})-f(x_{n-1})}{x_{n}-x_{n-1}}},

and proceed as in Newton's method:

x_{n+1}=x_{n}-{\frac {f(x_{n})}{f^{\prime }(x_{n})}}

where $n$ is the iteration index.
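The secant iteration above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `secant`, the tolerance, the iteration cap, and the test function $x^2 - 2$ are hypothetical choices, not part of the original description.

```python
def secant(f, x_prev, x_curr, tol=1e-12, max_iter=50):
    """Secant iteration: Newton's method with f' replaced by a finite difference."""
    f_prev, f_curr = f(x_prev), f(x_curr)
    for _ in range(max_iter):
        # Stop when the residual is small or the slope would be zero.
        if abs(f_curr) < tol or f_curr == f_prev:
            break
        # Finite-difference approximation of f'(x_n).
        slope = (f_curr - f_prev) / (x_curr - x_prev)
        # Newton-like step using the approximate derivative.
        x_prev, x_curr = x_curr, x_curr - f_curr / slope
        f_prev, f_curr = f_curr, f(x_curr)
    return x_curr


# Illustrative use: the positive root of x^2 - 2, starting from 1 and 2.
root = secant(lambda x: x * x - 2.0, 1.0, 2.0)
```

Two starting points are needed because the first finite difference requires both $x_{n}$ and $x_{n-1}$.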

Solving a system of nonlinear equations

Consider a system of $k$ nonlinear equations

\mathbf {f} (\mathbf {x} )=\mathbf {0} ,

where $\mathbf{f}$ is a vector-valued function of the vector $\mathbf{x}$:

\mathbf {x} =(x_{1},x_{2},x_{3},\dotsc ,x_{k}),

\mathbf {f} (\mathbf {x} )={\big (}f_{1}(x_{1},x_{2},\dotsc ,x_{k}),f_{2}(x_{1},x_{2},\dotsc ,x_{k}),\dotsc ,f_{k}(x_{1},x_{2},\dotsc ,x_{k}){\big )}.

For such problems, Broyden gives a generalization of the one-dimensional Newton's method, replacing the derivative with the Jacobian $\mathbf{J}$. The Jacobian matrix is determined iteratively, based on the secant equation in the finite-difference approximation:

\mathbf {J} _{n}(\mathbf {x} _{n}-\mathbf {x} _{n-1})\simeq \mathbf {f} (\mathbf {x} _{n})-\mathbf {f} (\mathbf {x} _{n-1}),

where $n$ is the iteration index. For clarity, let us define:

\mathbf {f} _{n}=\mathbf {f} (\mathbf {x} _{n}),

\Delta \mathbf {x} _{n}=\mathbf {x} _{n}-\mathbf {x} _{n-1},

\Delta \mathbf {f} _{n}=\mathbf {f} _{n}-\mathbf {f} _{n-1},

so the above may be rewritten as

\mathbf {J} _{n}\Delta \mathbf {x} _{n}\simeq \Delta \mathbf {f} _{n}.

The above equation is underdetermined when $k$ is greater than one. Broyden suggests using the current estimate of the Jacobian matrix $\mathbf{J}_{n-1}$ and improving upon it by taking the solution to the secant equation that is a minimal modification to $\mathbf{J}_{n-1}$:

\mathbf {J} _{n}=\mathbf {J} _{n-1}+{\frac {\Delta \mathbf {f} _{n}-\mathbf {J} _{n-1}\Delta \mathbf {x} _{n}}{\|\Delta \mathbf {x} _{n}\|^{2}}}\Delta \mathbf {x} _{n}^{\mathrm {T} }.

Among all matrices $\mathbf{J}_{n}$ that satisfy the secant equation, this choice minimizes the following Frobenius norm:

\|\mathbf {J} _{n}-\mathbf {J} _{n-1}\|_{\rm {F}}.

We may then proceed in the Newton direction:

\mathbf {x} _{n+1}=\mathbf {x} _{n}-\mathbf {J} _{n}^{-1}\mathbf {f} (\mathbf {x} _{n}).
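The rank-one update and the Newton-like step can be combined into a short loop. The following NumPy sketch is one possible arrangement, not a reference implementation: the name `broyden_good`, the stopping rule, and the two-equation test system (with the exact Jacobian at the starting point used as the initial estimate) are assumptions made for illustration.

```python
import numpy as np


def broyden_good(f, x0, J0, tol=1e-10, max_iter=100):
    """Broyden iteration that updates the Jacobian estimate J directly."""
    x = np.asarray(x0, dtype=float)
    J = np.array(J0, dtype=float)  # copy, so the caller's matrix is untouched
    fx = f(x)
    for _ in range(max_iter):
        if np.linalg.norm(fx) < tol:
            break
        # Newton-like step: solve J dx = -f(x) rather than forming J^{-1}.
        dx = np.linalg.solve(J, -fx)
        x_new = x + dx
        f_new = f(x_new)
        df = f_new - fx
        # Rank-one update; afterwards J @ dx equals df (the secant equation).
        J += np.outer(df - J @ dx, dx) / (dx @ dx)
        x, fx = x_new, f_new
    return x, J


# Hypothetical test system: x + y = 3 and x^2 + y^2 = 9.
def f(v):
    return np.array([v[0] + v[1] - 3.0, v[0] ** 2 + v[1] ** 2 - 9.0])


x0 = np.array([1.0, 5.0])
J0 = np.array([[1.0, 1.0], [2.0 * x0[0], 2.0 * x0[1]]])  # exact Jacobian at x0
root, J_final = broyden_good(f, x0, J0)
```

Only the initial estimate `J0` requires derivative information; every later iteration costs one evaluation of `f` plus a linear solve.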

Broyden also suggested using the Sherman–Morrison formula to update the inverse of the Jacobian matrix directly:

\mathbf {J} _{n}^{-1}=\mathbf {J} _{n-1}^{-1}+{\frac {\Delta \mathbf {x} _{n}-\mathbf {J} _{n-1}^{-1}\Delta \mathbf {f} _{n}}{\Delta \mathbf {x} _{n}^{\mathrm {T} }\mathbf {J} _{n-1}^{-1}\Delta \mathbf {f} _{n}}}\Delta \mathbf {x} _{n}^{\mathrm {T} }\mathbf {J} _{n-1}^{-1}.

This first method is commonly known as the "good Broyden's method".
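As a sanity check, the inverse update can be compared numerically with the inverse of the updated Jacobian. The sketch below uses small hand-picked values; the function names and the numbers are illustrative assumptions only.

```python
import numpy as np


def update_jacobian(J, dx, df):
    """Rank-one ("good Broyden") update of the Jacobian estimate."""
    return J + np.outer(df - J @ dx, dx) / (dx @ dx)


def update_inverse(H, dx, df):
    """Sherman-Morrison form of the same update, applied to H = J^{-1}."""
    H_df = H @ df
    return H + np.outer(dx - H_df, dx @ H) / (dx @ H_df)


J = np.array([[2.0, 1.0], [1.0, 3.0]])
dx = np.array([1.0, 0.5])
df = np.array([2.0, 1.0])

J_new = update_jacobian(J, dx, df)
H_new = update_inverse(np.linalg.inv(J), dx, df)

# Both routes should describe the same matrix: H_new is the inverse of J_new.
agree = np.allclose(H_new @ J_new, np.eye(2))
```

Working with the inverse turns the step $\mathbf{x}_{n+1}=\mathbf{x}_{n}-\mathbf{J}_{n}^{-1}\mathbf{f}(\mathbf{x}_{n})$ into a matrix-vector product, so no linear system has to be solved at each iteration.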

A similar technique can be derived by using a slightly different modification to $\mathbf{J}_{n-1}$. This yields a second method, the so-called "bad Broyden's method" (but see ^[3]):

\mathbf {J} _{n}^{-1}=\mathbf {J} _{n-1}^{-1}+{\frac {\Delta \mathbf {x} _{n}-\mathbf {J} _{n-1}^{-1}\Delta \mathbf {f} _{n}}{\|\Delta \mathbf {f} _{n}\|^{2}}}\Delta \mathbf {f} _{n}^{\mathrm {T} }.

This minimizes a different Frobenius norm:

\|\mathbf {J} _{n}^{-1}-\mathbf {J} _{n-1}^{-1}\|_{\rm {F}}.
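The "bad" update can be sketched the same way. The check below confirms that the updated inverse estimate satisfies the secant equation in inverse form, $\mathbf{J}_{n}^{-1}\Delta\mathbf{f}_{n}=\Delta\mathbf{x}_{n}$; the function name and the numerical values are hypothetical.

```python
import numpy as np


def bad_broyden_update(H, dx, df):
    """"Bad Broyden" update applied directly to the inverse estimate H."""
    return H + np.outer(dx - H @ df, df) / (df @ df)


H = np.array([[0.6, -0.2], [-0.2, 0.4]])  # current estimate of J^{-1}
dx = np.array([1.0, 0.5])
df = np.array([2.0, 1.0])

H_new = bad_broyden_update(H, dx, df)

# Inverse secant equation: the new estimate maps df back to dx.
residual = H_new @ df - dx
```

Compared with the Sherman–Morrison form, the correction here is taken along $\Delta\mathbf{f}_{n}$ rather than along $\Delta\mathbf{x}_{n}^{\mathrm{T}}\mathbf{J}_{n-1}^{-1}$.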

Many other quasi-Newton schemes have been suggested in optimization, where one seeks a maximum or minimum by finding the root of the first derivative (the gradient in multiple dimensions). The Jacobian of the gradient is called the Hessian and is symmetric, which adds further constraints to its update.

The Broyden class of methods

In addition to the two methods described above, Broyden defined a whole class of related methods.^[1]^: 578 In general, methods in the Broyden class are given in the form^[4]^: 150

\mathbf {J} _{k+1}=\mathbf {J} _{k}-{\frac {\mathbf {J} _{k}s_{k}s_{k}^{T}\mathbf {J} _{k}}{s_{k}^{T}\mathbf {J} _{k}s_{k}}}+{\frac {y_{k}y_{k}^{T}}{y_{k}^{T}s_{k}}}+\phi _{k}\left(s_{k}^{T}\mathbf {J} _{k}s_{k}\right)v_{k}v_{k}^{T},

where

y_{k}:=\mathbf {f} (\mathbf {x} _{k+1})-\mathbf {f} (\mathbf {x} _{k}),

s_{k}:=\mathbf {x} _{k+1}-\mathbf {x} _{k},

and

v_{k}=\left[{\frac {y_{k}}{y_{k}^{T}s_{k}}}-{\frac {\mathbf {J} _{k}s_{k}}{s_{k}^{T}\mathbf {J} _{k}s_{k}}}\right],

and $\phi _{k}\in \mathbb {R}$ for each $k=1,2,\dotsc$. The choice of $\phi _{k}$ determines the method.
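The general update can be written as a single function of $\phi_{k}$. The sketch below is an illustration under stated assumptions: the function name and the sample values are hypothetical, and $\mathbf{J}_{k}$ is taken to be symmetric, as in the optimization setting. It checks that the secant equation $\mathbf{J}_{k+1}s_{k}=y_{k}$ holds for several values of $\phi_{k}$, which follows from $v_{k}^{T}s_{k}=0$.

```python
import numpy as np


def broyden_class_update(J, s, y, phi):
    """One update from the Broyden class; phi selects the member."""
    Js = J @ s
    sJs = s @ Js
    ys = y @ s
    v = y / ys - Js / sJs
    return (
        J
        - np.outer(Js, s @ J) / sJs      # removes the old curvature along s
        + np.outer(y, y) / ys            # adds the curvature observed in y
        + phi * sJs * np.outer(v, v)     # phi-dependent correction
    )


J = np.eye(2)
s = np.array([1.0, 0.0])
y = np.array([2.0, 1.0])

# The secant equation J_new @ s == y holds for every phi tried here.
secant_ok = all(
    np.allclose(broyden_class_update(J, s, y, phi) @ s, y)
    for phi in (0.0, 0.5, 1.0)
)

J_phi1 = broyden_class_update(J, s, y, 1.0)  # phi = 1
```

Because $v_{k}$ is orthogonal to $s_{k}$, changing $\phi_{k}$ alters the update only in directions that do not affect the secant equation.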

Other methods in the Broyden class have been introduced by other authors.

The Davidon–Fletcher–Powell (DFP) method is the only member of this class that was published before the two methods defined by Broyden.^[1]^: 582 For the DFP method, $\phi _{k}=1$.^[4]^: 150
Schubert's or sparse Broyden algorithm – a modification for sparse Jacobian matrices.^[5]
Klement (2014) – uses fewer iterations to solve many equation systems.^[6]^[7]

References

[1] Broyden, C. G. (October 1965). "A Class of Methods for Solving Nonlinear Simultaneous Equations". Mathematics of Computation. 19 (92). American Mathematical Society: 577–593. doi:10.1090/S0025-5718-1965-0198670-6. JSTOR 2003941.
[2] Gay, D. M. (August 1979). "Some convergence properties of Broyden's method". SIAM Journal on Numerical Analysis. 16 (4). SIAM: 623–630. doi:10.1137/0716047.
[3] Kvaalen, Eric (November 1991). "A faster Broyden method". BIT Numerical Mathematics. 31 (2). SIAM: 369–372. doi:10.1007/BF01931297.
[4] Nocedal, Jorge; Wright, Stephen J. (2006). Numerical Optimization. Springer Series in Operations Research and Financial Engineering. Springer New York. doi:10.1007/978-0-387-40065-5. ISBN 978-0-387-30303-1.
[5] Schubert, L. K. (1970-01-01). "Modification of a quasi-Newton method for nonlinear equations with a sparse Jacobian". Mathematics of Computation. 24 (109): 27–30. doi:10.1090/S0025-5718-1970-0258276-9. ISSN 0025-5718.
[6] Klement, Jan (2014-11-23). "On Using Quasi-Newton Algorithms of the Broyden Class for Model-to-Test Correlation". Journal of Aerospace Technology and Management. 6 (4): 407–414. doi:10.5028/jatm.v6i4.373. ISSN 2175-9146.
[7] "Broyden class methods – File Exchange – MATLAB Central". www.mathworks.com. Retrieved 2016-02-04.