Barrier function

Summary

In constrained optimization, a field of mathematics, a barrier function is a continuous function whose value at a point increases to infinity as the point approaches the boundary of the feasible region of an optimization problem.[1][2] Such functions are used to replace inequality constraints with a penalty term in the objective function that is easier to handle. A barrier function is also called an interior penalty function, because it is a penalty function that forces the solution to remain in the interior of the feasible region.

The two most common types of barrier functions are inverse barrier functions and logarithmic barrier functions. Interest in logarithmic barrier functions was renewed by their connection with primal-dual interior point methods.

Motivation

Consider the following constrained optimization problem:

minimize f(x)
subject to x ≤ b

where b is some constant. If one wishes to remove the inequality constraint, the problem can be re-formulated as

minimize f(x) + c(x),
where c(x) = ∞ if x > b, and zero otherwise.

This problem is equivalent to the first. It gets rid of the inequality, but introduces the issue that the penalty function c, and therefore the objective function f(x) + c(x), is discontinuous, preventing the use of calculus to solve it.

A barrier function is a continuous approximation g to c that tends to infinity as x approaches b from below, that is, from inside the feasible region. Using such a function, a new optimization problem is formulated, viz.

minimize f(x) + μ g(x)

where μ > 0 is a free parameter. This problem is not equivalent to the original, but as μ approaches zero, it becomes an ever-better approximation.[3]
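The effect of shrinking μ can be seen numerically. The following is a minimal sketch, not a reference implementation: the objective f(x) = (x − 2)², the bound b = 1, the search interval, and the helper `golden_section_min` are all illustrative assumptions, and the logarithmic barrier is used as one possible choice of g.

```python
import math

def f(x):
    # Hypothetical objective; its unconstrained minimum (x = 2) violates x <= b.
    return (x - 2.0) ** 2

def log_barrier(x, b):
    # One possible choice of g: -log(b - x) for x < b, +infinity otherwise.
    return -math.log(b - x) if x < b else math.inf

def golden_section_min(phi, lo, hi, tol=1e-10):
    # Derivative-free minimization of a unimodal function on [lo, hi].
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    while hi - lo > tol:
        if phi(c) < phi(d):
            hi = d
        else:
            lo = c
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
    return (lo + hi) / 2.0

def barrier_minimizer(mu, b=1.0):
    # Minimize f(x) + mu * g(x) on an assumed search interval [b - 10, b].
    return golden_section_min(lambda x: f(x) + mu * log_barrier(x, b), b - 10.0, b)

for mu in (1.0, 0.1, 0.01, 0.001):
    print(mu, barrier_minimizer(mu))
```

For this particular f, each printed minimizer stays strictly below b and should move toward b = 1, the solution of the original constrained problem, as μ decreases.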

Logarithmic barrier function

For logarithmic barrier functions, g(x, b) is defined as −log(b − x) when x < b and ∞ otherwise (in one dimension; see below for a definition in higher dimensions). This essentially relies on the fact that log(t) tends to negative infinity as t tends to 0.

This introduces a gradient to the function being optimized which favors less extreme values of x (in this case values lower than b), while having relatively low impact on the function away from these extremes.
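The size of this gradient, and its effect on the minimizer, can be worked out by hand for the logarithmic barrier −log(b − x). The derivation below is an illustrative sketch: the objective (x − 2)² and the bound b = 1 are assumptions chosen so that the stationarity condition reduces to a quadratic.

```latex
\begin{aligned}
&\frac{d}{dx}\bigl[-\log(b - x)\bigr] = \frac{1}{b - x}, \qquad x < b \\[4pt]
&\text{minimize } (x-2)^2 - \mu \log(1 - x), \qquad x < 1 \\[4pt]
&2(x-2) + \frac{\mu}{1-x} = 0
  \;\Longrightarrow\; 2x^2 - 6x + (4-\mu) = 0 \\[4pt]
&x^\ast(\mu) = \frac{3 - \sqrt{1 + 2\mu}}{2}
  \;\longrightarrow\; 1 \quad \text{as } \mu \to 0^+
\end{aligned}
```

The derivative 1/(b − x) is small far from the boundary and unbounded close to it, which is why the barrier term matters mainly near x = b. Of the two roots of the quadratic, only the one shown satisfies x < 1.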

Logarithmic barrier functions may be favored over less computationally expensive inverse barrier functions depending on the function being optimized.
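The two barrier types can be compared directly. The sketch below assumes the usual one-dimensional inverse barrier 1/(b − x); the function names and the sample gaps are illustrative choices, not taken from the cited sources.

```python
import math

def log_barrier(x, b):
    # -log(b - x) for x < b, +infinity otherwise.
    return -math.log(b - x) if x < b else math.inf

def inverse_barrier(x, b):
    # Assumed inverse barrier: 1 / (b - x) for x < b, +infinity otherwise.
    return 1.0 / (b - x) if x < b else math.inf

b = 1.0
for gap in (1.0, 0.1, 0.01, 0.001):
    x = b - gap  # a point at distance `gap` inside the boundary
    print(gap, log_barrier(x, b), inverse_barrier(x, b))
```

Both functions blow up as the gap shrinks, but the inverse barrier grows like 1/gap while the logarithmic barrier grows only like log(1/gap). The logarithmic barrier can also take negative values far from the boundary, whereas the inverse barrier is always positive inside the feasible region.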

Higher dimensions

Extending to higher dimensions is simple, provided each dimension is independent. For each variable x_i which should be limited to be strictly lower than b_i, add −log(b_i − x_i).
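A minimal sketch of this per-coordinate construction follows. The names `box_log_barrier` and `barrier_objective` are hypothetical, and the inputs are assumed to be plain Python sequences of equal length.

```python
import math

def box_log_barrier(x, b):
    # Sum of -log(b_i - x_i) over all coordinates;
    # +infinity if any coordinate fails x_i < b_i.
    if len(x) != len(b):
        raise ValueError("x and b must have the same length")
    if any(xi >= bi for xi, bi in zip(x, b)):
        return math.inf
    return sum(-math.log(bi - xi) for xi, bi in zip(x, b))

def barrier_objective(f, x, b, mu):
    # Objective with the barrier term added: f(x) + mu * sum_i -log(b_i - x_i).
    return f(x) + mu * box_log_barrier(x, b)

# Example use with a hypothetical quadratic objective.
value = barrier_objective(lambda v: sum(t * t for t in v), [0.5, 0.0], [1.0, 2.0], 0.1)
print(value)
```

Returning +∞ for infeasible points lets a line search or derivative-free method reject any step that leaves the feasible region.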

Formal definition

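Minimize c^T x subject to a_i^T x ≤ b_i, i = 1, …, m

Assume strictly feasible: {x | Ax < b} ≠ ∅

Define logarithmic barrier Φ(x) = −Σ_{i=1}^{m} log(b_i − a_i^T x) if Ax < b, and Φ(x) = +∞ otherwise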
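As a hedged illustration, the sketch below evaluates a logarithmic barrier and its gradient under the assumption that the constraints are linear inequalities a_i^T x ≤ b_i, with the rows a_i collected in a matrix A. The function names are hypothetical, and plain Python lists stand in for vectors and matrices.

```python
import math

def slacks(A, b, x):
    # Slack b_i - a_i^T x of each constraint; positive means strictly feasible.
    return [bi - sum(aij * xj for aij, xj in zip(ai, x)) for ai, bi in zip(A, b)]

def polyhedral_log_barrier(A, b, x):
    # Phi(x) = -sum_i log(b_i - a_i^T x), or +infinity if x is not strictly feasible.
    s = slacks(A, b, x)
    if any(si <= 0.0 for si in s):
        return math.inf
    return -sum(math.log(si) for si in s)

def polyhedral_log_barrier_grad(A, b, x):
    # Gradient of Phi: sum_i a_i / (b_i - a_i^T x); defined only for strictly feasible x.
    s = slacks(A, b, x)
    if any(si <= 0.0 for si in s):
        raise ValueError("x must be strictly feasible")
    return [sum(ai[j] / si for ai, si in zip(A, s)) for j in range(len(x))]

# Unit square 0 <= x_1, x_2 <= 1 written as A x <= b (an illustrative example).
A = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
b = [1.0, 1.0, 0.0, 0.0]
center = [0.5, 0.5]
print(polyhedral_log_barrier(A, b, center))
print(polyhedral_log_barrier_grad(A, b, center))
```

At the center of the square the contributions of opposite faces cancel, so the gradient vanishes there: the barrier alone is minimized at the point that is, in this sense, farthest from all constraints.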

See also

References

  1. Nesterov, Yurii (2018). Lectures on Convex Optimization (2 ed.). Cham, Switzerland: Springer. p. 56. ISBN 978-3-319-91577-7.
  2. Nocedal, Jorge; Wright, Stephen (2006). Numerical Optimization (2 ed.). New York, NY: Springer. p. 566. ISBN 0-387-30303-0.
  3. Vanderbei, Robert J. (2001). Linear Programming: Foundations and Extensions. Kluwer. pp. 277–279.

External links

  • Lecture 14: Barrier method from Professor Lieven Vandenberghe of UCLA