Mathematical methods for economic theory

Martin J. Osborne

2.3 Derivatives of functions defined implicitly

One parameter

The equilibrium value of a variable x in some economic models is the solution of an equation of the form
f(xp) = 0,
where f is a function and p is a parameter. In such a case, we would sometimes like to know how the equilibrium value of x depends on the parameter. For example, does it increase or decrease when the value of the parameter increases?

Typically, we make assumptions about the form of the function f—for example, we might assume that it is increasing in x and decreasing in p—but do not assume that it takes a specific form. Thus typically we cannot solve explicitly for x as a function of p (i.e. we cannot write x = g(p) for some function g).

We say that the equation

f(xp) = 0 for all p
defines x implicitly as a function of p. That is, for any value of p, the corresponding value of x is g(p), where f(g(p), p) = 0 for all p.

Before trying to determine how a solution for x depends on p, we should ask whether, for each value of p, the equation has a solution. Certainly not all such equations have solutions. The equation x2 + 1 = 0, for example, has no (real) solution. Even a single linear equation may have no solution in the relevant range. If, for example, the value of x is restricted to be nonnegative number (perhaps it is the quantity of a good), then for p > 0 the equation x + p = 0 has no solution.

One tool that we may be able to use to show that a single equation in a single variable has a solution is the Intermediate Value Theorem. Assume that the function f is continuous, the possible values of x lie between x1 and x2, and for some value of p we have f(x1p) < 0 and f(x2p) > 0, or alternatively f(x1p) > 0 and f(x2p) < 0. Then the Intermediate Value Theorem tells us that there exists a value of x between x1 and x2 for which f(xp) = 0. (Note that even if these conditions are not satisfied, the equation may have a solution.)

If we cannot appeal to the Intermediate Value Theorem (because, for example, f is not continuous, or does not satisfy the appropriate conditions), we may be able to argue that a solution exists by appealing to the particular features of our equation.

Putting aside the question of whether the equation has a solution, consider the question of how a solution, if one exists, depends on the parameter p. If x0 is a solution for the parameter value p0, so that f(x0p0) = 0, and the function g that the equation f(g(p), p) = 0 implicitly defines is differentiable at p0, then we can find its derivative g'(p) by differentiating both sides of the equation. Using the chain rule, we get

f'1(g(p), p)g'(p) + f'2(g(p), p) = 0,
so that
g'(p) = −f'2(g(p), p)/f'1(g(p), p)
if f'1(g(p), p) ≠ 0. Notice that even though x cannot be isolated in the original equation, after differentiating the equation the derivative of g can be isolated in terms of the partial derivatives of f.

This calculation tells us, for example, that if f is an increasing function of both its arguments (f'1(xp) > 0 and f'2(xp) > 0 for all (xp)), then x is a decreasing function of p.

Conditions under which the function g is differentiable are given by the following result.

Proposition 2.3.1 (Implicit function theorem)  
Let f be a continuously differentiable function of two variables defined on an open set S. If f'1(x0p0) ≠ 0 then there exists a continuously differentiable function g of a single variable defined on an open interval I containing p0 such that f(g(p), p) = f(x0p0) for all p ∈ I, and
g'(p0)  = −
f'2(x0p0)
f'1(x0p0)
.
Source  
For a proof, see Rudin (1964), Theorem 9.18 (p. 196), Rudin (1976), Theorem 9.28 (p. 224), or Apostol (1974), Theorem 13.7 (p. 374).

Application: slopes of level curves

The equation f(xy) = c of the level curve of the function f for the value c defines y implicitly as a function of x: we can write
f(xg(x)) = c for all x.
By the implicit function theorem, if f(x0y0) = c, the slope of the level curve through (x0y0) at this point is thus given by
g'(x0) = −
f'1(x0y0)
f'2(x0y0)
.
(Note that we are expressing the second argument as a function of the first argument, rather than the first as a function of the second as in the theorem, so the indices 1 and 2 have to be interchanged.)

We deduce that the equation of the tangent to the level curve at (x0y0) is

y − y0 = −
f'1(x0y0)
f'2(x0y0)
·(x − x0).
(Remember that the equation of a line through (x0y0) with slope m is given by y − y0 = m(x − x0).) Thus the equation of the tangent may alternatively be written as
f'1(x0y0)(x − x0) + f'2(x0y0)(y − y0) = 0,
or
(f'1(x0y0), f'2(x0y0))
left parenthesis x − x0 right parenthesis
y − y0
 = 0.

The vector (f'1(x0y0), f'2(x0y0)) is called the gradient vector of f at (x0y0) and is denoted ∇f(x0y0).

Let (xy) ≠ (x0y0) be a point on the tangent of the level curve at (x0y0). Then the vector

left parenthesis x − x0 right parenthesis
y − y0
is parallel to the tangent. The previous displayed equation, in which the product of this vector with the gradient vector is 0, shows that the two vectors are orthogonal (the angle between them is 90°). Thus the gradient vector is orthogonal to the tangent, as illustrated in the following figure.

x y (x0, y0) (f1'(x0, y0), f2'(x0, y0)) f(x, y) = c y = g(x)

One can compute the second derivative of the level curve as well as the first derivative, by differentiating once again.

Many parameters

Suppose that the equilibrium value of the variable x is the solution of an equation of the form
f(xp) = 0,
where p is a vector of parameters—say p = (p1, ..., pn). By differentiating the equation with respect to pi, holding all the other parameters fixed, we may determine how x varies with pi.

Assume that the function g defined implicitly as follows is differentiable:

f(g(p), p) = 0 for all p.
Then differentiating this identity with respect to pi we have
f'x(g(p), p)g'i(p) + f'pi(g(p), p) = 0
so that if f'x(g(p), p) ≠ 0 we have
g'i(p) = −
f'pi(g(p), p)
f'x(g(p), p)
.

Example 2.3.1
Consider the competitive firm studied previously that uses a single input to produce a single output with the differentiable production function f, facing the price w for the input and the price p for output. Denote by z(wp) its profit-maximizing input for any pair (wp). We know that z(wp) satisfies the first-order condition
pf'(z(wp)) − w = 0 if z(wp) > 0.
How does z depend on w and p?

Differentiating with respect to w the equation that z(wp) satisfies we get

pf"(z(wp))z'w(wp) − 1 = 0.
Thus if f"(z(wp)) ≠ 0 then
z'w(wp) = 
1
pf"(z(wp))
.
We know that f"(z(w,p)) ≤ 0 given that z(wp) is a maximizer, so that if f"(z(wp)) ≠ 0 we conclude that z'w(wp) < 0, which makes sense: as the input price increases, the amount of the input the firm optimally uses decreases (and hence the firm's optimal output also decreases).

A similar calculation yields

z'p(wp) = −
f'(z(wp))
pf"(z(wp))
,
which for the same reason is positive.