Assuming that has an inverse in a neighbourhood of and that its derivative at that point is non-zero, its inverse is guaranteed to be differentiable at and have a derivative given by the above formula.
The inverse function rule may also be expressed in Leibniz's notation. As that notation suggests,
This relation is obtained by differentiating the equation in terms of x and applying the chain rule, yielding that:
considering that the derivative of x with respect to x is 1.
Let be an invertible (bijective) function,
let be in the domain of , and
let be in the codomain of .
This also means that is in the domain of ,
and that is in the codomain of .
(Sidenote: since f is a bijective function, being in the
codomain of the function, , it means that
is in the range of the function, .)
Since is an invertible function, we know that:
Which equation should be used for the derivation?
Technically speaking, since and
are inverses, either one will work, and you can do some careful
manipulation of swapping the instances of and ,
however, that may cause confusion. So, which equation should be used?
Taking a look at both equations, it looks like, in order to do anything useful, we would have to
differentiate both sides of the chosen equation.
Due to the fact that there are nested functions, the chain rule will also be in use.
We want our final formula to give the derivative of the inverse function,
with respect to the input to the inverse function (),
rather than with respect to another function. So, thinking about how the chain rule works,
we know that we will have to multiply by the derivative of the "inside".
If we use the first equation (), the "inside" function
will be , and according to the chain rule, we would get
not nested within any other function.
That being said, let's start.
Great! Rather than using as the variable, we can rewrite this equation
using as the input for , and we get the following:
(for positive x) has inverse .
At , however, there is a problem: the graph of the square root function becomes vertical, corresponding to a horizontal tangent for the square function.
This is only useful if the integral exists. In particular we need to be non-zero across the range of integration.
It follows that a function that has a continuous derivative has an inverse in a neighbourhood of every point where the derivative is non-zero. This need not be true if the derivative is not continuous.
Another very interesting and useful property is the following:
Where denotes the antiderivative of .
The inverse of the derivative of f(x) is also of interest, as it is used in showing the convexity of the Legendre transform.
Let then we have, assuming :
This can be shown using the previous notation . Then we have:
By induction, we can generalize this result for any integer , with , the nth derivative of f(x), and , assuming :
The chain rule given above is obtained by differentiating the identity with respect to x. One can continue the same process for higher derivatives. Differentiating the identity twice with respect to x, one obtains
that is simplified further by the chain rule as
Replacing the first derivative, using the identity obtained earlier, we get