Chapter 9 Stationary Points

9.1 Definition of Stationary Points

Recall that for a univariate function \(y=f(x)\), a stationary point is a value \(x_0\) for \(x\) at which \(f'(x_0)=0\). Graphically this is a point on the curve at which the tangent line is horizontal.


Now consider a function of two variables \(z=f(x,y)\).

A point \((a,b)\) at which \(f_x (a,b) = f_y (a,b) = 0\) is a stationary point of \(f(x,y)\).

Calculate the stationary points of the function \(f(x,y)=x^2 + y^2\).

Calculating the first order partial derivatives one obtains

\[\begin{align*} f_x &= 2x, \\ f_y &= 2y. \end{align*}\]

So

\[\begin{align*} &f_x = 0, \text{ and } f_y = 0 \\[5pt] \iff \qquad &2x = 0, \text{ and } 2y = 0 \\[5pt] \iff \qquad &x=0, \text{ and } y=0. \end{align*}\]

Therefore \(f\) has a unique stationary point at \((0,0)\).


Calculate the stationary points of the function \(f(x,y)=6 x^2 y -3x^3+ 2y^3 -150y\).

Calculating the first order partial derivatives one obtains \[\begin{align*} f_x &= 12xy - 9x^2, \\ f_y &= 6 x^2 +6 y^2 -150. \end{align*}\] So \[\begin{align*} &f_x = 0, \text{ and } f_y = 0 \\[5pt] \iff \qquad &12xy - 9x^2 = 0, \text{ and } 6 x^2 +6 y^2 -150 = 0 \\[5pt] \iff \qquad &3x(4y-3x)=0, \text{ and } x^2 +y^2 =25. \end{align*}\] The equation \(3x(4y-3x)=0\) implies either \(x=0\) or \(y=\frac{3}{4}x\). If \(x=0\), the equation \(x^2 +y^2 =25\) becomes \(y^2 = 25\), which has solutions \(y=5\) and \(y=-5\). Therefore there are stationary points at \((0,5)\) and \((0,-5)\).
Alternatively if \(y=\frac{3}{4}x\), the equation \(x^2 +y^2 =25\) becomes \(\frac{25}{16} x^2 = 25\), which has solutions \(x=4\) and \(x=-4\). At \(x=4\), one has \(y = \frac{3}{4} (4) = 3\), and at \(x=-4\), one has \(y=-3\). Therefore there are stationary points at \((4,3)\) and \((-4,-3)\).

In total there are four stationary points \((0,5)\), \((0,-5)\), \((4,3)\) and \((-4,-3)\).


Suppose \((a,b)\) is a stationary point of a function \(f(x,y)\). Graphically one can take two cross-sections of the surface \(z=f(x,y)\) through the planes \(x=a\) and \(y=b\) respectively. This will describes two curves, one given in the \((y,z)\)-plane by the univariate function \(z=f(a,y)\), and the other given in the \((x,z)\)-plane by the univariate function \(z=f(x,b)\).These curves will have stationary points, in the context of univariate functions, at \(y=b\) and \(x=a\) respectively.


Describing the surface of \(f\) using an implicit function, namely \(F(x,y,z) = z - f(x,y)=0\), allows us to calculate a normal vector as per Section 7.1. Specifically a normal is given by \[\begin{align*} \nabla F &= \left( \frac{\partial F}{\partial x}, \frac{\partial F}{\partial y}, \frac{\partial F}{\partial z} \right) \\ &= \left( -f_x, -f_y, 1 \right) \\ &= (0,0,1). \end{align*}\]

It follows that the tangent plane to \(z=f(x,y)\) is horizontal at a stationary point.


Definition 9.1.1 generalises to any multivariate function.

A stationary point of \(f(x_1, x_2, \ldots , x_n)\) is a point such that \(f_{x_i} =0\), for all \(i = 1,2,\ldots n\).

There are three types of stationary points:

  • Local maximum;

  • Local minimum;

  • Saddle point.

We will study each of these in turn in the following sections.

9.2 Local Maxima and Minima

Let \((a,b)\) be a stationary point of a function of two variables \(f(x,y)\).

The value \(f(a,b)\) of \(f\) at \((a,b)\) is a local maximum if

\[f(a,b) \geq f(x,y), \qquad \text{for all } (x,y) \text{ in } D,\]

where \(D\) is some open disc with center \((a,b)\).


The value \(f(a,b)\) of \(f\) at \((a,b)\) is a local minimum if

\[f(a,b) \leq f(x,y), \qquad \text{for all } (x,y) \text{ in } D,\]

where \(D\) is some open disc with center \((a,b)\).


Any size of the disc \(D\) in Definition 8.2.1 and Definition 8.2.2 will do, no matter how small. It is also worth noting the terminology local: there may be other points at which the function is smaller than a local minimum or greater than a local maximum, but these will be a distance away from the stationary point.

Consider the function \(f(x,y) = x^2 + y^2\) from Example 9.1.2. We know there is a unique stationary point at \((0,0)\). Calculate that

\[\begin{align*} &f(0,0) = 0, \\ &f(x,y) = x^2 + y^2 > 0, \qquad \text{for } (x,y) \neq (0,0). \end{align*}\]

Therefore \((0,0)\) is a minimum stationary point.

9.3 Saddle Points

Let \((a,b)\) be a stationary point of a function of two variables \(f(x,y)\).

The point \((a,b)\) is a saddle point of \(f\) if for every disc \(D\) with center \((a,b)\) there exists

  • a point \((x_1,y_1) \in D\) such that \(f(x_1,y_1) > f(a,b)\);

  • a point \((x_2,y_2) \in D\) such that \(f(x_2,y_2) < f(a,b)\).



Equivalently a saddle point is a stationary point that is neither a local maximum or a local minimum.

9.4 Classification of Stationary Points

Suppose \(f(x,y)\) has a stationary point at \((a,b)\). How can one tell if this stationary point is a local maximum, a local minimum or a saddle point?

Let \(f\) be a function with continuous second order partial derivatives. The Hessian of \(f\) is \[H(x,y) = f_{xx} f_{yy} - f_{xy}^{2}\]


Let \(f\) be a function with a stationary point at \((a,b)\), and continuous second order partial derivatives in a disc centered at \((a,b)\). Then

  • If \(H(a,b)< 0\), then \(f\) has a saddle point at \((a,b)\);

  • If \(H(a,b) >0\) and \(f_{xx}(a,b)>0\), then \(f\) has a local minimum at \((a,b)\);

  • If \(H(a,b) >0\) and \(f_{xx}(a,b)<0\), then \(f\) has a local maximum at \((a,b)\);

  • If \(H(a,b) = 0\), then no conclusion can be made about the classification of \((a,b)\).

Let \(h,k\) be suitably small variables so that \((a+h,b+k)\) is contained in the disc centered at \((a,b)\) in which \(f\) has continuous second order partial derivatives


By Taylor’s theorem with \(n=1\), one has \[f(a+h,b+k) = f(a,b) + h f_x (a,b) + k f_y (a,b) + R_1,\] where \[R_1 = \frac{1}{2} \bigg( h^2 f_{xx}(a+th,b+tk) + 2hk f_{xy}(a+th,b+tk) + k^2 f_{yy}(a+th,b+tk) \bigg),\] for some \(t \in [0,1]\). Substituting in that \(f_{x}(a,b) = f_{y}(a,b)=0\) and rearranging, one obtains \[f(a+h,b+k) - f(a,b) = R_{1}.\] Hence the sign of \(R_1\) is key to the classification of the stationary point \((a,b)\) since it determines whether \(f\) increases or decreases away from \(f\). Specifically for a fixed value of \(h,k\):

  • \(R_1 \left( \frac{h}{k} \right) >0, \quad \implies \quad f(a+h,b+k) > f(a,b)\);

  • \(R_1 \left( \frac{h}{k} \right) <0, \quad \implies \quad f(a+h,b+k) < f(a,b)\).

Rearranging:

\[\begin{align*} R_1 &= \frac{1}{2} \bigg( h^2 f_{xx} + 2hk f_{xy} + k^2 f_{yy} \bigg) \\ &= \frac{k^2}{2} \bigg( f_{xx} \left(\frac{h}{k}\right)^2 + 2f_{xy} \frac{h}{k} + f_{yy} \bigg). \end{align*}\]

So \(R_1\) can be thought of a quadratic function in the single variable \(\frac{h}{k}\). Since we are only interested in the sign of \(R_1\), and \(\frac{k^2}{2}\) is always positive, it is enough to consider the quadratic function \[\widetilde{R_1} \left( \frac{h}{k} \right) = f_{xx} \left(\frac{h}{k}\right)^2 + 2f_{xy} \frac{h}{k} + f_{yy}\] Quadratic functions are well-understood. In particular for a general quadratic \(Q(x) = ax^2 + bx + c\), the roots of \(Q\) are governed by the discriminant \(\Delta_Q = b^2 -4ac\). With this in mind, define: \[\begin{align*} \Delta &= \Delta_{\widetilde{R_1} \left(\frac{h}{k}\right)} \\ &= \left( 2 f_{xy} \right)^2 - 4 f_{xx} f_{yy} \\ &= 4 \left( f_{xy}^{2} - f_{xx}f_{yy} \right). \end{align*}\]

Suppose that \(\Delta>0\). Then \(\widetilde{R_1}\) has two real roots.


Specifically there are two distinct values for \(\frac{h}{k}\) for which \(R_1=0\). At both of these roots \(R_1\) changes sign, and \((a,b)\) is therefore a saddle point.


Alternatively suppose \(\Delta <0\), that is that \(\widetilde{R_1}\) has no real roots. Then \(R_1\) will always have the same sign, be that positive or negative. Therefore \((a,b)\) is either a local minimum or a local maximum.



Since in this case the sign of \(R_{1}\) does not change, it can be found by looking at a single point. Specifically when \(k=0\), we have \(R_{1} = \frac{h^2}{2} f_{xx}\). Hence

  • \(f_{xx} >0 \quad \implies \quad R_{1} > 0 \quad \implies \quad (a,b)\) is a local minimum;

  • \(f_{xx} <0 \quad \implies \quad R_{1} < 0 \quad \implies \quad (a,b)\) is a local maximum.

Noting that \(\Delta = -4 H\) gives the desired result.

Find and classify the stationary points of \[f(x,y) = 6x^2 - 2 x^3 + 3y^2 + 6xy.\]
Equating the two first order partial derivatives of \(f\) to zero, one obtains \[\begin{align*} f_x &= 12x -6x^2 + 6y = 0 \tag{$\star$}, \\ f_y &= 6y + 6x = 0 \tag{$\star \star$}. \end{align*}\]

Substituting \((\star \star)\) into \((\star)\) gives

\[\begin{align*} 12x - 6 x^2 - 6x &= 0, \\ x(1-x) &= 0, \\ \implies \quad x=0 \text{ or } x&=1. \end{align*}\]

When \(x=0\), equation \((\star \star)\) dictates \(y=0\), and when \(x=1\), equation \((\star)\) dictates \(y=-1\). Therefore \(f\) has two stationary points: \((0,0)\) and \((1,-1)\).

Calculating the second order partial derivatives of \(f\), one obtains:

\[\begin{align*} f_{xx} &= 12 -12x, \\ f_{xy} &= 6, \\ f_{yy} &= 6. \end{align*}\]

The Hessian of \(f\) is then given by

\[H(x,y) = f_{xx} f_{yy} - f_{xy}^{2} = 6(12-12x) - 36= 36 - 72x.\]

By Theorem 9.4.2, the stationary points are classified as follows:

  • \(H(0,0) =36>0\) and \(f_{xx} =12 >0, \quad \implies \quad\) the stationary point \((0,0)\) is a local minimum;

  • \(H(1,-1) =-36<0, \quad \implies \quad\) the stationary point \((1,-1)\) is a saddle point.