Announcements & Reminders
Well done on the Online Test on Monday April 22
Starting with a review and back to quadratic forms
📖 Unconstrained optimization#
⏱ 21 min | 2104 words
A standard optimization problem#
Example
Consider a (monopolistic) firm that is facing a market demand for its products
To maximize its profit
Plugging in the functions
Components of the optimization problem#
Objective function: function to be maximized or minimized, also known as maximand
In the example above profit function to be maximizedDecision/choice variables: the variables that the agent can control in order to optimize the objective function, also known as controls
In the example above price and quantity variables that the firm can choose to maximize its profitEquality constraints: restrictions on the choice variables in the form of equalities
In the example aboveInequality constraints (weak and strict): restrictions on the choice variables in the form of inequalities
In the example above andParameters: the variables that are not controlled by the agent, but affect the objective function and/or constraints
In the example above , and are parameters of the problemValue function: the “optimized” value of the objective function as a function of parameters
In the example above is the value function
Definition
The general form of the optimization problem is
where:
is an objective function are decision/choice variables are parameters where , are equality constraints where , are inequality constraints is a value function
Definition
The set of admissible choices (admissible set) contains all the choices that satisfy the constraints of the optimization problem.
Note
Sometimes the equality constraints are dropped from the definition of the optimization problem, because they can always be represented as a pair of inequality constraints
Note
Note that strict inequality constraints are not present in the definition above, although they may be present in the economic applications. You already know that this has to do with the intention to keep the set of admissible choices closed, such that the solution of the problem (has a better chance to) exist. Sometimes they are added to the definition.
A roadmap for formulating an optimization problem (in economics)
Determine which variables are choice variables and which are parameters according to what the economic agent has control over
Determine whether the optimization problem is a maximization or a minimization problem
Determine the objective function of the economic agent (and thus the optimization problem)
Determine the constraints of the optimization problem: equality and inequality, paying particular attention to whether inequalities should be strict or weak (the latter has huge implications for the existence of the solution)
Example
Consider a decision maker who is deciding how to divide the money they have between food and services, bank deposit and buying some crypto. Discuss and Write down the corresponding optimization problem. [class exercise]
Classes of the optimization problems#
Static optimization: finite number of choice variables
singe instance of choice
deterministic finite horizon dynamic choice models can be represented as static
our main focus in this course
Dynamic programming: some choice variables are infinite sequences, solved using similar techniques as static optimization
will touch upon in the end of the course
Deterministic optimal control: some “choice variables” are functions, completely new theory is needed
Stochastic optimal control: “choice variables” are functions, objective function is a stochastic process, yet more theory is needed
Review of one-dimensional optimization#
Let
takes and returns numberderivative
exists for all with
Differentiability implies that
Reminder of definitions
A point
maximizer of
on if for allminimizer of
on if for all
Point
A stationary point of

Fig. 73 Both
Fact
If
Algorithm for finding maximizers/minimizers:
Locate stationary points
Evaluate
for each stationary and for ,Pick point giving largest
value
First oder conditions (FOC)#
Necessary
In this lecture we focus on the unconstrained optimization problems of the form
where
Note
Twice continuously differentiable functions are said to be
Every point in the whole space
is interior, therefore all maximizers/minimizers have to be stationary pointsAssuming differentiability implies we can focus on derivative based conditions
Definition
Given a function
Definition
Given a function
If the inequality is strict, then
A maximizer/minimizer (global) must also be a local one, but the opposite is not necessarily true.
Fact (Necessary condition for optima)
Let
Then
Example
Consider quadratic form
Solving the FOC
The point
Show code cell source
import numpy as np
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D
from matplotlib import cm
A = np.array([[1,.5],[.5,2]])
f = lambda x: x@A@x
x = y = np.linspace(-5.0, 5.0, 100)
X, Y = np.meshgrid(x, y)
zs = np.array([f((x,y)) for x,y in zip(np.ravel(X), np.ravel(Y))])
Z = zs.reshape(X.shape)
fig = plt.figure(dpi=160)
ax1 = fig.add_subplot(111, projection='3d')
ax1.plot_surface(X, Y, Z,
rstride=2,
cstride=2,
cmap=cm.jet,
alpha=0.7,
linewidth=0.25)
plt.setp(ax1,xticks=[],yticks=[],zticks=[])
fig = plt.figure(dpi=160)
ax2 = fig.add_subplot(111)
ax2.set_aspect('equal', 'box')
ax2.contour(X, Y, Z, 50,
cmap=cm.jet)
plt.setp(ax2, xticks=[],yticks=[])
fig = plt.figure(dpi=160)
ax3 = fig.add_subplot(111, projection='3d')
ax3.plot_wireframe(X, Y, Z,
rstride=2,
cstride=2,
alpha=0.7,
linewidth=0.25)
f0 = f(np.zeros((2)))+0.1
ax3.scatter(0, 0, f0, c='black', marker='o', s=10)
ax3.plot([-3,3],[0,0],[f0,f0],color='black')
ax3.plot([0,0],[-3,3],[f0,f0],color='black')
plt.setp(ax3,xticks=[],yticks=[],zticks=[])
plt.show()
Example
Consider quadratic form
Solving the FOC
The point
Show code cell source
import numpy as np
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D
from matplotlib import cm
A = np.array([[1,.5],[.5,-2]])
f = lambda x: x@A@x
x = y = np.linspace(-5.0, 5.0, 100)
X, Y = np.meshgrid(x, y)
zs = np.array([f((x,y)) for x,y in zip(np.ravel(X), np.ravel(Y))])
Z = zs.reshape(X.shape)
fig = plt.figure(dpi=160)
ax1 = fig.add_subplot(111, projection='3d')
ax1.plot_surface(X, Y, Z,
rstride=2,
cstride=2,
cmap=cm.jet,
alpha=0.7,
linewidth=0.25)
plt.setp(ax1,xticks=[],yticks=[],zticks=[])
fig = plt.figure(dpi=160)
ax2 = fig.add_subplot(111)
ax2.set_aspect('equal', 'box')
ax2.contour(X, Y, Z, 50,
cmap=cm.jet)
plt.setp(ax2, xticks=[],yticks=[])
fig = plt.figure(dpi=160)
ax3 = fig.add_subplot(111, projection='3d')
ax3.plot_wireframe(X, Y, Z,
rstride=2,
cstride=2,
alpha=0.7,
linewidth=0.25)
f0 = f(np.zeros((2)))+0.1
ax3.scatter(0, 0, f0, c='black', marker='o', s=10)
ax3.plot([-3,3],[0,0],[f0,f0],color='black')
ax3.plot([0,0],[-3,3],[f0,f0],color='black')
plt.setp(ax3,xticks=[],yticks=[],zticks=[])
plt.show()
This is an example of a saddle point where the FOC hold, yet the point is not a local maximizer/minimizer!
Similar to
in : derivative is zero, yet the point is not an optimizerHow to distinguish saddle points from optima? Key insight: the function has different second order derivatives in different directions!
Second order conditions (SOC)#
allow to establish whether the stationary point is a local maximizer/minimizer or a saddle point
help to determine whether an optimizer is a maximizer or a minimizer
do not give definitive answer in all cases, unfortunately
Fact (necessary SOC)
Let
has a local maximum at is negative semi-definite has a local minimum at is positive semi-definite
recall the definition of semi-definiteness
Fact (sufficient SOC)
Let
if for some
(FOC satisfied) and is negative definite, then is a strict local maximum ofif for some
(FOC satisfied) and is positive definite, then is a strict local minimum of
observe that SOC are only necessary in the “weak” form, but are sufficient in the “strong” form
this leaves room for ambiguity when we can not arrive at a conclusion — particular stationary point may be a local maximum or minimum
but we can rule out saddle points for sure, in this case neither semi-definiteness nor definiteness can be established, the Hessian is indefinite
Example
Consider a one dimensional function
Point
Treating
Example
Consider a one dimensional function
Point
Treating
Example
Consider a one dimensional function
Point
Treating
Example
Consider a one dimensional function
Point
Treating
Establishing definiteness of Hessian in case#
Recall that a symmetric matrix
positive definite
all eigenvalues are strictly positivenegative definite
all eigenvalues are strictly negativenonpositive definite
all eigenvalues are nonpositivenonnegative definite
all eigenvalues are nonnegativeindefinite
there are both positive and negative eigenvalues
Let
Hence the the two eigenvalues
From the Viets’s formulas for a quadratic polynomial we have
Applying this result to a Hessian of a function
Fact
Given a twice continuously differentiable function
if
and and , is positive definite, has a strict local minimum at
if
and and , is negative definite, has a strict local maximum at
if
and and , is positive semi-definite (nonnegative definite), may have a local minimumundeceive!
if
and and , is negative semi-definite (nonpositive definite), may have a local maximumundeceive!
if
and have different signs, is indefinite, is a saddle point of
Example
Consider a two dimensional function
Point
Therefore at
Point
Therefore at
Point
Therefore again, at
References and reading#
References
Simon & Blume: 13.1, 13.2, 13.3, 14.1, 14.3, 14.4, 14.5, 14.8, whole of chapter 17
Sundaram: 1.4, 1.5, 2.1, 2.2, 4.1 to 4.4