Mathematical methods for economic theory: 7.3 Optimization with inequality constraints: the sufficiency of the Kuhn-Tucker conditions

7.3 Optimization with inequality constraints: the sufficiency of the Kuhn-Tucker conditions

We saw previously that for both an unconstrained maximization problem and a maximization problem with an equality constraint the first-order conditions are sufficient for a global optimum when the objective and constraint functions satisfy appropriate concavity/convexity conditions. The same is true for an optimization problem with inequality constraints.

Concave objective function

Proposition 7.3.1 (Sufficiency of Kuhn-Tucker conditions for concave objective function)

Let f and g_j for j = 1, ..., m be continuously differentiable functions of many variables defined on an open convex set S and let c_j for j = 1, ..., m be constants. Suppose that

f is concave
and g_j is quasiconvex for j = 1, ..., m.

If there exists an m-vector of numbers λ = (λ₁, ..., λ_m) and a point x* ∈ S such that (x*, λ) satisfies the Kuhn-Tucker conditions then x* solves the problem

max_x∈Sf(x) subject to g_j(x) ≤ c_j for j = 1, ..., m.

Proof

The fact that (x*, λ) satisfies the Kuhn-Tucker conditions means that

	f'_i(x) − ∑m j=1λ_jg'_ji(x) = 0 for i = 1, ..., n
	λ_j ≥ 0, g_j(x) ≤ c_j and λ_j[g_j(x) − c_j] = 0 for j = 1, ..., m

(where g'_ji(x*) is the derivative of g_j with respect to its ith argument evaluated at x*).

Let x be such that g_j(x) ≤ c_j for all j. We need to show that f(x) ≤ f(x*).

Let j be an index such that g_j(x*) = c_j. (That is, the jth constraint is binding at x*.) Then g_j(x) ≤ g_j(x*), so that by a characterization of quasiconvex functions, ∑n
i=1g'_ji(x*)·(x_i − x*_i) ≤ 0. Thus given λ_j ≥ 0 we have λ_j∑n
i=1g'_ji(x*)·(x_i − x*_i) ≤ 0.

Now let j be an index such that g_j(x*) < c_j. Then by the complementary slackness condition for constraint j in the Kuhn-Tucker conditions, λ_j = 0.

Thus for every constraint j we have

λ_j∑n
i=1g'_ji(x*)·(x_i − x*_i) ≤ 0

and hence

∑m
j=1λ_j∑n
i=1g'_ji(x*)·(x_i − x*_i) ≤ 0,

or, reversing the summations,

∑n
i=1∑m
j=1λ_jg'_ji(x*)·(x_i − x*_i) ≤ 0.

Now, multiplying the ith first-order condition in the Kuhn-Tucker conditions by x_i − x*_i and summing over i we get

∑n
i=1f'_i(x*)·(x_i − x*_i) − ∑n
i=1∑m
j=1λ_jg'_ji(x*)·(x_i − x*_i) = 0.

By the previous inequality, the second term is nonpositive, so

∑n
i=1f'_i(x*)·(x_i − x*_i) ≤ 0.

Thus from characterization of concave functions, f(x) ≤ f(x*), completing the proof.

This result together with Proposition 7.2.1, giving conditions under which the Kuhn-Tucker conditions are necessary, and the fact that a convex function (and hence a linear one) is quasiconvex yields the following useful corollary.

Corollary 7.3.1 (Necessity and sufficiency of Kuhn-Tucker conditions for concave objective function)

Let f and g_j for j = 1, ..., m be continuously differentiable functions of many variables defined on an open convex set S and let c_j for j = 1, ..., m be constants. Suppose that f is concave and

either each g_j is linear
or each g_j is convex and there exists x ∈ S such that g_j(x) < c_j for j = 1, ..., m.

Then x* ∈ S solves the problem

max_x∈Sf(x) subject to g_j(x) ≤ c_j for j = 1, ..., m

if and only if there exists an m-vector of numbers λ = (λ₁, ..., λ_m) such that (x*, λ) satisfies the Kuhn-Tucker conditions.

Quasiconcave objective function

The condition that the objective function be concave is a bit too strong to be useful in some economic applications. For example, the assumption we would like to impose on a consumer's utility function is that it be quasiconcave. The next result is useful in this case.

Proposition 7.3.2 (Sufficiency of Kuhn-Tucker conditions for quasiconcave objective function)

Let f and g_j for j = 1, ..., m be continuously differentiable functions of n variables defined on an open convex set S and let c_j for j = 1, ..., m be constants. Suppose that

f is quasiconcave
and g_j is quasiconvex for j = 1, ..., m.

If there exists an m-vector of numbers λ = (λ₁, ..., λ_m) and x* ∈ S such that (x*, λ) satisfies the Kuhn-Tucker conditions and it is not the case that f'_i(x*) = 0 for i = 1, ..., n then x* solves the problem

max_x∈Sf(x) subject to g_j(x) ≤ c_j for j = 1, ..., m.

Proof

The proof of the previous result, Proposition 7.3.1, uses the concavity of f only in the last sentence. In particular, under the assumption that f is quasiconcave we can conclude that

∑n
i=1f'_i(x*)·(x_i − x*_i) ≤ 0,

as in that proof. Now suppose, contrary to the claim that x* solves the maximization problem, that f(x) > f(x*) for some x that satisfies the constraints.

Denote the vector of partial derivatives of f by ∇f. Given the continuity of f, the fact that f(x) > f(x*) means that there exists t > 0 (small enough) such that f(x − t∇f(x*)) > f(x*), and the conclusion from the proof of the Proposition 7.3.1 may be written

∇f(x*)·(x − x*) ≤ 0.

Thus

∇f(x)·(x − t∇f(x) − x*)	=	−t∇f(x)·∇f(x) + ∇f(x)(x − x)
	≤	−t∑n i=1(f'_i(x*))² < 0,

given the assumption that ∇f(x*) ≠ (0, ..., 0). But now, by a characterization of quasiconcave functions, this inequality implies that f(x − t∇f(x*)) < f(x*), contradicting f(x − t∇f(x*)) > f(x*). Thus in fact f(x) ≤ f(x*).

The constraints in a standard consumer's optimization problem are linear, so the following implication of this result and Proposition 7.2.1, giving conditions under which the Kuhn-Tucker conditions are necessary, is useful.

Corollary 7.3.2 (Necessity and sufficiency of Kuhn-Tucker conditions for quasiconcave objective function): Let f and g_j for j = 1, ..., m be continuously differentiable functions of n variables defined on an open convex set S and let c_j for j = 1, ..., m be constants. Suppose that f is quasiconcave and each g_j is linear. If x* ∈ S solves the problem
max_x∈Sf(x) subject to g_j(x) ≤ c_j for j = 1, ..., m
then there exists a unique m-vector λ such that (x*, λ) satisfies the Kuhn-Tucker conditions, and if there exists an m-vector λ and a point x* ∈ S such that (x*, λ) satisfies the Kuhn-Tucker conditions and it is not the case that f'_i(x*) = 0 for i = 1, ..., n then x* solves the problem.

If you have a minimization problem, remember that you can transform it to a maximization problem by multiplying the objective function by −1. Thus for a minimization problem the condition on the objective function in the first result above is that it be convex, and the condition in the second result is that it be quasiconvex.

Examples

The next two simple examples illustrate how to use these results and the ones in Section 7.2 to solve optimization problems.

Example 7.3.1

Consider the problem

max_x∈S[−(x − 2)²] subject to x ≥ 1,

where S is the set of all numbers. This problem is illustrated in the following figure.

Written in the standard format, this problem is

max_x∈S[−(x − 2)²] subject to 1 − x ≤ 0.

The set S is open and convex, and the objective and constraint functions are continuously differentiable on this set. The objective function is concave and the constraint function is linear, and hence both concave and quasiconvex. Thus by Proposition 7.2.1 the Kuhn-Tucker conditions are necessary (if x* solves the problem then there is a number λ such that (x*, λ) satisfies the Kuhn-Tucker conditions) and by Proposition 7.3.1 they are sufficient (if (x*, λ) satisfies the Kuhn-Tucker conditions then x* solves the problem).

The Lagrangean is the function L defined by

L(x) = −(x − 2)² − λ(1 − x).

The Kuhn-Tucker conditions are

−2(x − 2) + λ	= 0
1 − x ≤ 0, λ ≥ 0, and λ(1 − x)	= 0.

From the last condition we have either λ = 0 or x = 1.

x = 1: 2 + λ = 0, or λ = −2, which violates λ ≥ 0.
λ = 0: −2(x − 2) = 0; the only solution is x = 2.

Thus the Kuhn-Tucker conditions have a unique solution, (x, λ) = (2, 0). Hence x = 2 is the unique solution of the problem.

Example 7.3.2

Consider the problem

max_x∈S[−(x − 2)²] subject to x ≥ 3,

where S is the set of all numbers. This problem is illustrated in the following figure.

Written in the standard format, this problem is

max_x∈S[−(x − 2)²] subject to 3 − x ≤ 0.

As in the previous example, 7.3.1, the objective function is concave and the constraint function is linear, so that x* is a solution of the problem if and only if there is a number λ such that (x*, λ) is a solution of the Kuhn-Tucker conditions.

The Lagrangean is

L(x) = −(x − 2)² − λ(3 − x).

The Kuhn-Tucker conditions are

−2(x−2) + λ	= 0
3 − x ≤ 0, λ ≥ 0, and λ(3 − x)	= 0.

From the last conditions we have either λ = 0 or x = 3.

x = 3: −2 + λ = 0, or λ = 2.
λ = 0: −2(x − 2) = 0; since x ≥ 3 this has no solution compatible with the other conditions.

Thus the Kuhn-Tucker conditions have a single solution, (x, λ) = (3, 2). Hence x = 3 is the unique solution of the problem.

These two examples illustrate a procedure for finding solutions of the Kuhn-Tucker conditions that is useful in many problems. First, look at the complementary slackness conditions, which imply that either a Lagrange multiplier is zero or a constraint is binding. Then follow through the implications of each case, using the other equations. In the two examples, this procedure is very easy to follow. In the following examples doing so is more complicated.

Example 7.3.3

Consider the problem

max_(x₁,_x₂)∈S [−(x₁ − 4)² − (x₂ − 4)²] subject to x₁ + x₂ ≤ 4 and x₁ + 3x₂ ≤ 9,

where S is the set of all pairs of numbers. The set S is open and convex, and the objective and constraint functions are continuously differentiable on this set. The objective function is concave and each constraint function is linear, and hence both concave and quasiconvex. Thus by Proposition 7.2.1 the Kuhn-Tucker conditions are necessary (if x* solves the problem then there is a number λ such that (x*, λ) satisfies the Kuhn-Tucker conditions) and by Proposition 7.3.1 they are sufficient (if (x*, λ) satisfies the Kuhn-Tucker conditions then x* solves the problem).

In Example 7.1.1 we previously found that the Kuhn-Tucker conditions for this problem are

−2(x₁ − 4) − λ₁ − λ₂	= 0
−2(x₂ − 4) − λ₁ − 3λ₂	= 0
x₁ + x₂ ≤ 4, λ₁ ≥ 0, and λ₁(x₁ + x₂ − 4)	= 0
x₁ + 3x₂ ≤ 9, λ₂ ≥ 0, and λ₂(x₁ + 3x₂ − 9)	= 0.

What are the solutions of these conditions? Start by looking at the two conditions λ₁(x₁ + x₂ − 4) = 0 and λ₂(x₁ + 3x₂ − 9) = 0. These two conditions yield the following four cases.

x₁ + x₂ = 4 and x₁ + 3x₂ = 9: In this case we have x₁ = 3/2 and x₂ = 5/2. Then the first two equations are

5 − λ₁ − λ₂ = 0

3 − λ₁ − 3λ₂ = 0

which imply that λ₁ = 6 and λ₂ = −1, which violates the condition λ₂ ≥ 0.
x₁ + x₂ = 4 and x₁ + 3x₂ < 9, so that λ₂ = 0: Then first two equations imply x₁ = x₂ = 2 and λ₁ = 4. All the conditions are satisfied, so (x₁, x₂, λ₁, λ₂) = (2, 2, 4, 0) is a solution.
x₁ + x₂ < 4 and x₁ + 3x₂ = 9, so that λ₁ = 0: Then the first two equations imply x₁ = 33/10 and x₂ = 19/10, violating x₁ + x₂ < 4.
x₁ + x₂ < 4 and x₁ + 3x₂ < 9, so that λ₁ = λ₂ = 0: Then first two equations imply x₁ = x₂ = 4, violating x₁ + x₂ < 4.

So (x₁, x₂, λ₁, λ₂) = (2, 2, 4, 0) is the single solution of the Kuhn-Tucker conditions. Hence the unique solution of problem is (x₁, x₂) = (2, 2).

The next two examples involve problems of the form

max_x u(x) subject to p·x ≤ w, x ≥ 0,

where u is a function of many variables, p is a vector, and w is a scalar. A standard consumer's maximization problem in economic theory takes this form; the techniques used in the example may be used also in problems with other specifications of the function u.

Example 7.3.4

Consider the problem

max_(x,y)∈S xy subject to x + y ≤ 6, x ≥ 0, and y ≥ 0,

where S is the set of all pairs of numbers. This set is open and convex, and the objective and constraint functions are differentiable on it. Each constraint function is linear, and hence concave. Thus by Proposition 7.2.1 the Kuhn-Tucker conditions are necessary (if x* solves the problem then there is a vector λ such that (x*, λ) satisfies the Kuhn-Tucker conditions). Also, the objective function is continuous and the constraint set is compact, so by the extreme value theorem the problem has a solution. Thus x* is a solution of the problem if and only if there is a vector λ* such that (x*, λ*) satisfies the Kuhn-Tucker conditions and f(x*) ≥ f(x) for all values of x for which there is a vector λ such that (x, λ) satisfies the Kuhn-Tucker conditions.

(Note that we cannot use Proposition 7.3.1 because the objective function is not concave, and we cannot use Proposition 7.3.2 because although the objective function is quasiconcave on the set of vectors x with x ≥ 0, it is not quasiconcave on any open set that contains this set.)

The Lagrangean is given by

L(x, y) = xy − λ₁(x + y − 6) + λ₂x + λ₃y.

The Kuhn-Tucker conditions are

y − λ₁ + λ₂	= 0
x − λ₁ + λ₃	= 0
λ₁ ≥ 0, x + y ≤ 6, λ₁(x + y − 6)	= 0
λ₂ ≥ 0, x ≥ 0, λ₂x	= 0
λ₃ ≥ 0, y ≥ 0, λ₃y	= 0.

If x > 0 and y > 0 then λ₂ = λ₃ = 0, so that λ₁ = x = y from the first two conditions. Hence x = y = λ₁ = 3 from the third condition. These values satisfy all the conditions.
If x = 0 and y > 0 then λ₃ = 0 from the last condition and hence λ₁ = x = 0 from the second condition. But now from the first condition λ₂ = −y < 0, contradicting λ₂ ≥ 0.
If x > 0 and y = 0 then λ₂ = 0, and a symmetric argument yields a contradiction.
If x = y = 0 then λ₁ = 0 from the third set of conditions, so that λ₂ = λ₃ from the first and second conditions. These values satisfy all the conditions.

We conclude that the Kuhn-Tucker conditions have two solutions, (x, y, λ₁, λ₂, λ₃) = (3, 3, 3, 0, 0) and (0, 0, 0, 0, 0). We have f(3, 3) = 9 and f(0, 0) = 0, so the solution of the problem is (3, 3).

In this example, the objective function is defined on an open convex set that contains the constraint set and is differentiable there. In the next example, there is no such open convex set.

Example 7.3.5

Consider the problem

max_(x,y)∈S x^1/2 + y subject to px + y ≤ I, x ≥ 0, and y ≥ 0

where S is the set of pairs (x, y) of numbers with x ≥ 0, and p > 0 and I > 0 are parameters. The set S is not open, and the objective function is defined only for x ≥ 0, so it is not defined on any open set that contains the constraint set. Thus the argument in the previous example cannot be used.

Instead, first consider the problem

max_(x,y)∈S' x^1/2 + y subject to px + y ≤ I and y ≥ 0

where S' = {(x, y): x > 0}. The set S' is open and convex, the objective and constraint functions are continuously differentiable on this set, the objective function is quasiconcave, and each constraint function is linear, so by Corollary 7.3.2 if x* ∈ S' solves the problem then there is a vector λ such that (x*, λ) satisfies the Kuhn-Tucker conditions and if there is a vector λ and a point x* ∈ S' such that (x*, λ) satisfies the Kuhn-Tucker conditions and not all the partial derivatives of the objective function are 0 at x* then x* solves the problem.

The Lagrangean of the modified problem is given by

L(x, y) = x^1/2 + y − λ₁(px + y − I) + λ₂y.

The Kuhn-Tucker conditions are

(1/2)x^−1/2 − λ₁p	= 0
1 − λ₁ + λ₂	= 0
λ₁ ≥ 0, px + y ≤ I, and λ₁(px + y − I)	= 0
λ₂ ≥ 0, −y ≤ 0, and λ₂y	= 0.

From the first condition, λ₁ > 0 and x = 1/(2λ₁p)². From the last condition, either λ₂ = 0 or y = 0.

If λ₂ = 0 then λ₁ = 1 from the second condition, and hence x = 1/(4p²) and y = I − 1/(4p). We have 1/(4p²) > 0, so (x, y, λ₁, λ₂) = (1/(4p²), I − 1/(4p), 1, 0) is a solution of the Kuhn-Tucker conditions if I − 1/(4p) ≥ 0, or p ≥ 1/(4I).
If y = 0 then x = I/p, λ₁ = 1/(2(pI)^1/2), and λ₂ = λ₁ − 1, so that λ₂ ≥ 0 if and only if λ₁ ≥ 1, or p ≤ 1/(4I).

Thus for each pair (p, I) of values of the parameters, the Kuhn-Tucker conditions have a single solution,

	(I/p, 0)	if p ≤ 1/(4I)
	(1/(4p²),I − 1/(4p))	if p > 1/(4I).

Neither partial derivative of the objective function is 0 for any point (x, y), so we conclude that the solution of the modified problem for each value of (p, I) is given by this expression.

Now return to the original problem. The maximal value of the objective function when x = 0 is I. At the solution of the modified problem, the value of the objective function is (I/p)^1/2 if p ≤ 1/(4I), and 1/(2p) + I − 1/(4p) = I + 1/(4p) if p > I/(4p). If p ≤ 1/(4I) then (I/p)^1/2 ≥ 2I, so in both cases the value of the objective function exceeds I. We conclude that the solution of the original problem is the solution of the modified problem.

Your first name*
Your last name*
Your email address*
Comment*
Enter the first six letters of the alphabet*	(to help establish that you are human)