Mathematical methods for economic theory: 2.3 Derivatives of functions defined implicitly

2.3 Derivatives of functions defined implicitly

One parameter

The equilibrium value of a variable x in some economic models is the solution of an equation of the form

f(x, p) = 0,

where f is a function and p is a parameter. In such a case, we would sometimes like to know how the equilibrium value of x depends on the parameter. For example, does it increase or decrease when the value of the parameter increases?

Typically, we make assumptions about the form of the function f—for example, we might assume that it is increasing in x and decreasing in p—but do not assume that it takes a specific form. Thus typically we cannot solve explicitly for x as a function of p (i.e. we cannot write x = g(p) for some function g).

We say that the equation

f(x, p) = 0 for all p

defines x implicitly as a function of p. That is, for any value of p, the corresponding value of x is g(p), where f(g(p), p) = 0 for all p.

Before trying to determine how a solution for x depends on p, we should ask whether, for each value of p, the equation has a solution. Certainly not all such equations have solutions. The equation x² + 1 = 0, for example, has no (real) solution. Even a single linear equation may have no solution in the relevant range. If, for example, the value of x is restricted to be nonnegative number (perhaps it is the quantity of a good), then for p > 0 the equation x + p = 0 has no solution.

One tool that we may be able to use to show that a single equation in a single variable has a solution is the Intermediate Value Theorem. Assume that the function f is continuous, the possible values of x lie between x₁ and x₂, and for some value of p we have f(x₁, p) < 0 and f(x₂, p) > 0, or alternatively f(x₁, p) > 0 and f(x₂, p) < 0. Then the Intermediate Value Theorem tells us that there exists a value of x between x₁ and x₂ for which f(x, p) = 0. (Note that even if these conditions are not satisfied, the equation may have a solution.)

If we cannot appeal to the Intermediate Value Theorem (because, for example, f is not continuous, or does not satisfy the appropriate conditions), we may be able to argue that a solution exists by appealing to the particular features of our equation.

Putting aside the question of whether the equation has a solution, consider the question of how a solution, if one exists, depends on the parameter p. If x₀ is a solution for the parameter value p₀, so that f(x₀, p₀) = 0, and the function g that the equation f(g(p), p) = 0 implicitly defines is differentiable at p₀, then we can find its derivative g'(p) by differentiating both sides of the equation. Using the chain rule, we get

f'₁(g(p), p)g'(p) + f'₂(g(p), p) = 0,

so that

g'(p) = −f'₂(g(p), p)/f'₁(g(p), p)

if f'₁(g(p), p) ≠ 0. Notice that even though x cannot be isolated in the original equation, after differentiating the equation the derivative of g can be isolated in terms of the partial derivatives of f.

This calculation tells us, for example, that if f is an increasing function of both its arguments (f'₁(x, p) > 0 and f'₂(x, p) > 0 for all (x, p)), then x is a decreasing function of p.

Conditions under which the function g is differentiable are given by the following result.

Proposition 2.3.1 (Implicit function theorem)

Let f be a continuously differentiable function of two variables defined on an open set S. If f'₁(x₀, p₀) ≠ 0 then there exists a continuously differentiable function g of a single variable defined on an open interval I containing p₀ such that f(g(p), p) = f(x₀, p₀) for all p ∈ I, and

g'(p₀)

= −

f'₂(x₀, p₀)

f'₁(x₀, p₀)

Source: For a proof, see Rudin (1964), Theorem 9.18 (p. 196), Rudin (1976), Theorem 9.28 (p. 224), or Apostol (1974), Theorem 13.7 (p. 374).

Application: slopes of level curves

The equation f(x, y) = c of the level curve of the function f for the value c defines y implicitly as a function of x: we can write

f(x, g(x)) = c for all x.

By the implicit function theorem, if f(x₀, y₀) = c, the slope of the level curve through (x₀, y₀) at this point is thus given by

g'(x₀) = −

f'₁(x₀, y₀)

f'₂(x₀, y₀)

(Note that we are expressing the second argument as a function of the first argument, rather than the first as a function of the second as in the theorem, so the indices 1 and 2 have to be interchanged.)

We deduce that the equation of the tangent to the level curve at (x₀, y₀) is

y − y₀ = −

f'₁(x₀, y₀)

f'₂(x₀, y₀)

·(x − x₀).

(Remember that the equation of a line through (x₀, y₀) with slope m is given by y − y₀ = m(x − x₀).) Thus the equation of the tangent may alternatively be written as

f'₁(x₀, y₀)(x − x₀) + f'₂(x₀, y₀)(y − y₀) = 0,

(f'₁(x₀, y₀), f'₂(x₀, y₀))

	x − x₀
	y − y₀

= 0.

The vector (f'₁(x₀, y₀), f'₂(x₀, y₀)) is called the gradient vector of f at (x₀, y₀) and is denoted ∇f(x₀, y₀).

Let (x, y) ≠ (x₀, y₀) be a point on the tangent of the level curve at (x₀, y₀). Then the vector

	x − x₀
	y − y₀

is parallel to the tangent. The previous displayed equation, in which the product of this vector with the gradient vector is 0, shows that the two vectors are orthogonal (the angle between them is 90°). Thus the gradient vector is orthogonal to the tangent, as illustrated in the following figure.

One can compute the second derivative of the level curve as well as the first derivative, by differentiating once again.

Many parameters

Suppose that the equilibrium value of the variable x is the solution of an equation of the form

f(x, p) = 0,

where p is a vector of parameters—say p = (p₁, ..., p_n). By differentiating the equation with respect to p_i, holding all the other parameters fixed, we may determine how x varies with p_i.

Assume that the function g defined implicitly as follows is differentiable:

f(g(p), p) = 0 for all p.

Then differentiating this identity with respect to p_i we have

f'_x(g(p), p)g'_i(p) + f'_{p_i}(g(p), p) = 0

so that if f'_x(g(p), p) ≠ 0 we have

g'_i(p) = −

f'_{p_i}(g(p), p)

f'_x(g(p), p)

Example 2.3.1

Consider the competitive firm studied previously that uses a single input to produce a single output with the differentiable production function f, facing the price w for the input and the price p for output. Denote by z(w, p) its profit-maximizing input for any pair (w, p). We know that z(w, p) satisfies the first-order condition

pf'(z(w, p)) − w = 0 if z(w, p) > 0.

How does z depend on w and p?

Differentiating with respect to w the equation that z(w, p) satisfies we get

pf"(z(w, p))z'_w(w, p) − 1 = 0.

Thus if f"(z(w, p)) ≠ 0 then

z'_w(w, p) =

pf"(z(w, p))

We know that f"(z(w,p)) ≤ 0 given that z(w, p) is a maximizer, so that if f"(z(w, p)) ≠ 0 we conclude that z'_w(w, p) < 0, which makes sense: as the input price increases, the amount of the input the firm optimally uses decreases (and hence the firm's optimal output also decreases).

A similar calculation yields

z'_p(w, p) = −

f'(z(w, p))

pf"(z(w, p))

which for the same reason is positive.

Your first name*
Your last name*
Your email address*
Comment*
Enter the first six letters of the alphabet*	(to help establish that you are human)