CA4: The Directional Derivative

Chapter 4 CA4: The Directional Derivative

We have noted previously that the instantaneous rate of change of a function \(z = f(x,y)\) at the point \((x,y) = (x_0,y_0)\) will depend on the direction in which the independent variables are changing.

Example 4.1.

Consider the function \(f(x,y) = x^2-y^2\text{.}\) The graph of this function is shown below. At \((x,y)=(0,0)\text{,}\) \(f=0\text{.}\) As we can see by looking at the graph, as we move away from the origin along the positive \(x\)-axis the value of \(f\) is increasing, i.e. the rate of change of the function will be positive. However, if we move away from the origin along the positive \(y\)-axis the value of \(f\) is decreasing, i.e. the rate of change of the function will be negative.

Figure 4.2. 3D plot of \(f(x,y) = x^2-y^2\text{.}\)

In the case that the direction is parallel to the positive \(x\)-axis we already know that the slope is given by the partial derivative \(f_x(x_0,y_0)\text{,}\) and in the case that the direction is parallel to the positive \(y\)-axis the slope is given by \(f_y(x_0,y_0)\text{.}\) In this section we will look at the problem of finding the slope of the function if we move away from the point \((x,y) = (x_0,y_0)\) in any direction.

Section 4.1 Directional Derivatives

Firstly, note that \(2D\) vectors are a convenient way to specify directions in the \(xy\)-plane. For example, we could say the slope of the function in the direction of the vector \(\mathbf{i} = \langle 1, 0 \rangle\) is \(f_x(x_0,y_0)\) while in the direction of the vector \(\mathbf{j} = \langle 0,1 \rangle\) it is \(f_y(x_0,y_0)\text{.}\) Thus the problem we are looking at is that of finding the slope of the function at the point \((x,y) = (x_0,y_0)\) in the direction given by some vector \(\mathbf{u} = \langle u_1, u_2 \rangle\text{.}\) Mathematically, we would say that we are trying to find the directional derivative of the function \(f(x,y)\) at the point \((x_0,y_0)\) in the direction \(\mathbf{u}\text{.}\) The notation that we use to denote this directional derivative is

\begin{equation*} D_{\mathbf{u}}f(x_0,y_0)\text{.} \end{equation*}

One way to approach the problem of finding the directional derivative \(D_{\mathbf{u}}f(x_0,y_0)\) is to use the tangent plane to the function at the point \((x_0,y_0)\text{,}\) i.e.

\begin{equation*} L(x,y) = f(x_0,y_0) + f_x (x_0,y_0)(x - x_0) + f_y(x_0,y_0) (y-y_0)\text{.} \end{equation*}

Then the slope of the function \(f(x,y)\) in the direction of \(\mathbf{u}\) is the slope of \(L(x,y)\) in that direction. If \(\hat{\mathbf{u}} = \langle u_1, u_2 \rangle\) is a unit vector in the direction of \(\mathbf{u}\) then moving from \((x_0,y_0)\) to \((x_0+u_1,y_0+u_2)\) covers a horizontal distance of \(\| \hat{\mathbf{u}} \| = 1\text{.}\) The required slope (rise over run) is therefore simply the amount by which the value of \(L\) changes between these two points, i.e.

\begin{align*} D_{\hat{\mathbf{u}}} f(x_0,y_0) \amp = L(x_0+u_1, y_0+u_2) - L(x_0,y_0)\\ \amp = \bigg[ f(x_0,y_0) + f_x(x_0,y_0)(x_0+u_1 - x_0) + f_y(x_0,y_0)(y_0+u_2-y_0) \bigg] - f(x_0,y_0)\\ \amp = f_x(x_0,y_0)u_1 + f_y(x_0,y_0)u_2\text{.} \end{align*}

Example 4.3.

Consider the function \(z(x,y) = 5 - \dfrac{x^2+y^2}{2}\text{.}\) Figure 4.4 shows the graph of this function along with its tangent plane, \(L(x,y)\text{,}\) at \((x,y) = (2,1)\text{.}\) Also shown on the diagram are the vectors \(\mathbf{u} = \langle 2,2 \rangle\) and \(\mathbf{v} = \langle 2,-1 \rangle\) drawn in the \(xy\)-plane with their tails at the point \((x,y) = (2,1)\text{.}\) Then the directional derivative \(D_{\mathbf{u}} z(2,1)\) will be the slope of the line joining the points \((2,1,z(2,1))\) and \((4,3,L(4,3))\) while the directional derivative \(D_{\mathbf{v}} z(2,1)\) will be the slope of the line joining the points \((2,1,z(2,1))\) and \((4,0,L(4,0))\text{.}\)

Figure 4.4. 3D plot of \(z(x,y) = 5 - \dfrac{x^2+y^2}{2}\) and the tangent plane at \((2,1)\text{.}\)
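The two slopes described in Example 4.3 can be checked with a few lines of Python. This is only a sketch: the helper names `L` and `slope_along` are illustrative choices, and the partial derivatives \(z_x = -x\) and \(z_y = -y\) are worked out by hand rather than taken from the text. Each slope is computed as the rise of the tangent plane divided by the horizontal length of the vector.

```python
import math

def z(x, y):
    return 5 - (x**2 + y**2) / 2

def L(x, y):
    # Tangent plane at (2, 1), using z_x = -x = -2 and z_y = -y = -1 there.
    return z(2, 1) + (-2) * (x - 2) + (-1) * (y - 1)

def slope_along(vector):
    """Rise of the tangent plane over the horizontal run along `vector` from (2, 1)."""
    run = math.hypot(*vector)
    rise = L(2 + vector[0], 1 + vector[1]) - L(2, 1)
    return rise / run

slope_u = slope_along((2, 2))    # direction of u = <2, 2>
slope_v = slope_along((2, -1))   # direction of v = <2, -1>
print(slope_u, slope_v)
```

Dividing by the length of the vector is what makes the result independent of how long \(\mathbf{u}\) or \(\mathbf{v}\) is drawn; only the direction matters.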

Example 4.5.

The Sage cell below computes the tangent plane to the surface

\begin{equation*} xy+yz^2+xz^3=54 \end{equation*}

at the point \((2,0,3)\) (shown in red). This surface is a level surface of the function \(f(x,y,z)=xy+yz^2+xz^3\text{.}\) The gradient vector is then a normal vector for the surface. Since we have a point on the surface, we can then determine an equation for the tangent plane:

\begin{equation*} L:f_x(2,0,3)(x-2)+f_y(2,0,3)(y-0)+f_z(2,0,3)(z-3)=0 \end{equation*}

Example 4.6.

The Sage cell below computes the tangent plane to a "rugby ball" at the point \((x_0,y_0)\) and the corresponding normal vector to the surface at this point (shown in red).

To summarise:

Definition 4.7. Directional Derivative.

The directional derivative of the differentiable function \(f(x,y)\) at the point \((x_0,y_0)\) in the direction of the unit vector \(\hat{\mathbf{u}} = \langle u_1, u_2 \rangle\) is given by

\begin{equation*} D_{\hat{\mathbf{u}}} f(x_0,y_0) = f_x(x_0,y_0)u_1 + f_y (x_0,y_0)u_2\text{.} \end{equation*}
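As a rough numerical sketch of Definition 4.7, the function below normalises a direction vector and combines it with estimates of the two partial derivatives. The name `directional_derivative`, the step size `h`, and the use of central differences in place of exact partial derivatives are all assumptions made for illustration. The test function is \(f(x,y) = x^2-y^2\) from Example 4.1, and the evaluation point \((1,2)\) is an arbitrary choice.

```python
import math

def directional_derivative(f, x0, y0, direction, h=1e-6):
    """Approximate f_x*u1 + f_y*u2 at (x0, y0), with `direction` scaled to unit length."""
    norm = math.hypot(direction[0], direction[1])
    if norm == 0:
        raise ValueError("direction must be a non-zero vector")
    u1, u2 = direction[0] / norm, direction[1] / norm
    # Central differences stand in for the exact partial derivatives.
    fx = (f(x0 + h, y0) - f(x0 - h, y0)) / (2 * h)
    fy = (f(x0, y0 + h) - f(x0, y0 - h)) / (2 * h)
    return fx * u1 + fy * u2

def f(x, y):
    return x**2 - y**2  # the function from Example 4.1

along_x = directional_derivative(f, 1.0, 2.0, (1, 0))   # compare with f_x = 2x = 2
along_y = directional_derivative(f, 1.0, 2.0, (0, 1))   # compare with f_y = -2y = -4
diagonal = directional_derivative(f, 1.0, 2.0, (3, 4))  # compare with 2(3/5) - 4(4/5) = -2
print(along_x, along_y, diagonal)
```

Normalising inside the function means a caller can pass any non-zero vector, which mirrors the step of finding \(\hat{\mathbf{u}}\) before applying the formula.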

Example 4.8.

Find the directional derivative of \(f(x,y) = y\ln(x)\) at \((1,-3)\) in the direction \(\mathbf{u} = \langle -4, 3 \rangle\text{.}\)

Answer.

\(D_{\hat{\mathbf{u}}} f(1,-3)=\dfrac{12}{5}\text{.}\)

Solution.

For the given function

\begin{equation*} f_x = \dfrac{y}{x} \: \text{ and } \: f_y = \ln(x)\text{.} \end{equation*}

Thus

\begin{equation*} f_x(1,-3) = -3 \: \text{ and } \: f_y(1,-3) = 0\text{.} \end{equation*}

Now the unit vector in the direction of \(\langle -4,3 \rangle\) is

\begin{equation*} \hat{\mathbf{u}} = \left \langle -\dfrac{4}{5}, \dfrac{3}{5} \right \rangle\text{.} \end{equation*}

Thus the required directional derivative is

\begin{equation*} D_{\hat{\mathbf{u}}} f(1,-3) = (-3) \left( -\dfrac{4}{5} \right) + (0) \left( \dfrac{3}{5} \right) = \dfrac{12}{5}\text{.} \end{equation*}
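The value in Example 4.8 can be cross-checked in Python. The sketch below evaluates the formula with the partial derivatives from the solution and, as an independent check, a difference quotient of \(f\) along the unit direction. The variable names and the step `t` are illustrative assumptions.

```python
import math

def f(x, y):
    return y * math.log(x)

x0, y0 = 1.0, -3.0
fx, fy = y0 / x0, math.log(x0)   # partials from the solution: y/x and ln(x)
norm = math.hypot(-4, 3)         # length of <-4, 3>
u1, u2 = -4 / norm, 3 / norm
formula_value = fx * u1 + fy * u2

# Symmetric difference quotient of f along the unit direction.
t = 1e-6
quotient = (f(x0 + t * u1, y0 + t * u2) - f(x0 - t * u1, y0 - t * u2)) / (2 * t)
print(formula_value, quotient)
```

Both numbers should agree with \(12/5\) to within rounding and truncation error.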

Example 4.9.

Find the directional derivative of \(f(x,y) = \sin(x+2y)\) in the direction of the angle (from the positive \(x\)-axis) \(\theta = \dfrac{3 \pi}{4}\text{.}\)

Answer.

\(D_{\hat{\mathbf{u}}} f(x,y)=\dfrac{1}{\sqrt{2}} \cos(x+2y)\)

Solution.

For the given function

\begin{equation*} f_x = \cos(x+2y) \: \text{ and } \: f_y = 2\cos(x+2y)\text{.} \end{equation*}

Now, the unit vector in the direction of the angle \(\theta = \dfrac{3 \pi}{4}\) is

\begin{equation*} \hat{\mathbf{u}} = \left \langle - \dfrac{1}{\sqrt{2}}, \dfrac{1}{\sqrt{2}} \right \rangle\text{.} \end{equation*}

Thus the required directional derivative is

\begin{align*} D_{\hat{\mathbf{u}}} f(x,y) \amp = \cos(x+2y) \left( - \dfrac{1}{\sqrt{2}} \right) + 2\cos(x+2y) \left( \dfrac{1}{\sqrt{2}} \right)\\ \amp = \dfrac{1}{\sqrt{2}} \cos(x+2y) \end{align*}
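When a direction is given as an angle \(\theta\) from the positive \(x\)-axis, the unit vector is \(\langle \cos\theta, \sin\theta \rangle\text{.}\) The following sketch uses this to compare the formula in Example 4.9 with the simplified answer at a few sample points. The function names and the sample points are arbitrary choices for illustration.

```python
import math

theta = 3 * math.pi / 4
u1, u2 = math.cos(theta), math.sin(theta)   # unit vector at angle theta

def directional(x, y):
    # f_x*u1 + f_y*u2 with f_x = cos(x+2y) and f_y = 2cos(x+2y)
    c = math.cos(x + 2 * y)
    return c * u1 + 2 * c * u2

def closed_form(x, y):
    return math.cos(x + 2 * y) / math.sqrt(2)

sample_points = [(0.0, 0.0), (1.0, -0.5), (2.0, 3.0)]
max_gap = max(abs(directional(x, y) - closed_form(x, y)) for x, y in sample_points)
print(max_gap)   # expected to be at rounding-error level
```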

Note that the directional derivative \(D_\mathbf{u} f(x_0,y_0)\) can be expressed in terms of the scalar product if we use the following definition.

Definition 4.10. Gradient Vector.

The vector

\begin{equation*} \nabla f(x_0,y_0) = \langle f_x(x_0,y_0), f_y(x_0,y_0) \rangle \end{equation*}

is called the gradient vector of \(f(x,y)\) at \((x_0,y_0)\text{.}\)

With this definition the directional derivative can be written as:

\begin{align*} D_{\mathbf{u}} f(x_0,y_0) \amp = \langle f_x(x_0,y_0),f_y(x_0,y_0) \rangle \cdot \langle u_1, u_2 \rangle\\ \amp = \nabla f(x_0,y_0) \cdot \hat{\mathbf{u}} \end{align*}

Example 4.11.

The Sage cell below computes the gradient vector \(\nabla f(x,y)\) (shown by the orange arrow) at some location \((x,y)\) for the function

\begin{equation*} f(x,y)=\cos(x)\sin(y). \end{equation*}

The gradient vector points in the direction of steepest ascent on the surface \(z=f(x,y)\text{.}\) The unit vector \(\mathbf{\hat{u}}\) in the direction of some angle is shown by the red arrow. (Note that when the angle is zero, the unit vector is parallel to the gradient vector.) The tangent line to the surface is plotted in green. The gradient of that tangent line is the directional derivative.

Example 4.12.

Find the gradient vector for the function \(f(x,y) = e^{-x} \sin(y)\text{.}\) Hence find \(\nabla f(0,\pi/3)\) and the directional derivative in the direction of the origin.

Answer.

\(\nabla f = \left \langle -e^{-x}\sin(y), e^{-x}\cos(y) \right \rangle\)

\(\nabla f(0, \pi/3) = \left \langle - \dfrac{\sqrt{3}}{2} , \dfrac{1}{2} \right \rangle\)

\(D_{\hat{\mathbf{u}}} f \left( 0, \dfrac{\pi}{3} \right)=-\dfrac{1}{2}\text{.}\)

Solution.

For the given function

\begin{equation*} f_x = -e^{-x} \sin(y) \: \text{ and } \: f_y = e^{-x}\cos(y)\text{,} \end{equation*}

and so the gradient vector is

\begin{equation*} \nabla f = \left \langle -e^{-x}\sin(y), e^{-x}\cos(y) \right \rangle\text{.} \end{equation*}

Thus

\begin{equation*} \nabla f(0, \pi/3) = \left \langle -e^0 \sin \left( \dfrac{\pi}{3} \right), e^0 \cos \left( \dfrac{\pi}{3} \right) \right \rangle = \left \langle - \dfrac{\sqrt{3}}{2} , \dfrac{1}{2} \right \rangle\text{.} \end{equation*}

Now, the unit vector in the direction of the origin from the point \(\left( 0, \dfrac{\pi}{3} \right)\) is

\begin{equation*} \hat{\mathbf{u}} = \langle 0, -1 \rangle\text{.} \end{equation*}

Thus the required directional derivative is

\begin{equation*} D_{\hat{\mathbf{u}}} f \left( 0, \dfrac{\pi}{3} \right) = \left \langle -\dfrac{\sqrt{3}}{2} , \dfrac{1}{2} \right \rangle \cdot \langle 0, -1 \rangle = - \dfrac{1}{2}\text{.} \end{equation*}
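Example 4.12 specifies the direction by a target point rather than by a vector. A small sketch of that step is given below; the helper name `unit_towards` is hypothetical, and the gradient is the one found in the solution.

```python
import math

def unit_towards(start, target):
    """Unit vector pointing from `start` to `target`."""
    dx, dy = target[0] - start[0], target[1] - start[1]
    length = math.hypot(dx, dy)
    return dx / length, dy / length

def grad_f(x, y):
    # Gradient of f(x, y) = exp(-x) * sin(y), as found in the solution.
    return -math.exp(-x) * math.sin(y), math.exp(-x) * math.cos(y)

point = (0.0, math.pi / 3)
u = unit_towards(point, (0.0, 0.0))
g = grad_f(*point)
d = g[0] * u[0] + g[1] * u[1]   # scalar product of the gradient with u
print(u, g, d)
```

The direction is always "target minus start"; reversing the subtraction would flip the sign of the result.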

The gradient vector has some interesting facts associated with it. Note that in the following remarks, we are assuming that \(\nabla f \neq \langle 0, 0 \rangle\text{.}\)

Remark 4.13.

\(\nabla f\) points in the direction in which the directional derivative takes on its largest value. To see this, note that

\begin{align*} D_{\mathbf{u}} f(x,y) \amp = \nabla f(x,y) \cdot \hat{\mathbf{u}}\\ \amp = \| \nabla f \| \| \hat{\mathbf{u}} \| \cos(\theta)\\ \amp = \| \nabla f \| \cos(\theta) \end{align*}

At a given point \(\| \nabla f \|\) is fixed and so the largest value of \(D_{\mathbf{u}} f(x,y)\) will occur when \(\cos(\theta) = 1\text{,}\) i.e. when \(\theta = 0\) or put another way, when \(\hat{\mathbf{u}}\) is parallel to \(\nabla f\text{.}\) We can also see from this that the largest value that the directional derivative can take is \(\| \nabla f \|\text{.}\)

Similarly, the directional derivative takes on its smallest value in the direction of \(-\nabla f\) and has value \(- \| \nabla f \|\text{.}\)
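Remark 4.13 can be explored numerically by sweeping the unit vector through a full turn and recording where \(\nabla f \cdot \hat{\mathbf{u}}\) is largest. In the sketch below the gradient value \(\langle 3, 4 \rangle\) is made up for illustration and is not taken from the text; the number of steps is likewise an arbitrary choice.

```python
import math

grad = (3.0, 4.0)            # illustrative gradient value, not from the text
norm = math.hypot(*grad)

steps = 3600
best_value, best_angle = -math.inf, None
worst_value = math.inf
for k in range(steps):
    angle = 2 * math.pi * k / steps
    # Directional derivative for the unit vector <cos(angle), sin(angle)>.
    value = grad[0] * math.cos(angle) + grad[1] * math.sin(angle)
    if value > best_value:
        best_value, best_angle = value, angle
    worst_value = min(worst_value, value)

grad_angle = math.atan2(grad[1], grad[0])   # angle of the gradient itself
print(best_value, norm, best_angle, grad_angle, worst_value)
```

Up to the resolution of the sweep, the largest value should match \(\| \nabla f \|\) and occur at the angle of the gradient, while the smallest value should match \(-\| \nabla f \|\text{.}\)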

Definition 4.14.

For the function \(z=f(x,y)\text{,}\) the level curve passing through the point \((x_0,y_0)\) is given by

\begin{equation*} f(x,y)=f(x_0,y_0). \end{equation*}

Remark 4.15.

\(\nabla f(x_0,y_0)\) is orthogonal (i.e. at right angles) to the level curve passing through \((x_0,y_0)\text{.}\) To see this, run the Sage cell below, which plots the level curves of the function \(f(x,y)=xy+y^2-x^3\) and the corresponding gradient vectors \(\nabla f(x,y)\) in red.

Remark 4.16.

As shown in Figure 4.17, a vector parallel to the tangent to this curve at the point \((x_0,y_0)\) will be \(\left \langle 1, \dfrac{dy}{dx} \right \rangle\text{.}\)

Figure 4.17. Plot of \(f(x,y) = k\) (blue) and the tangent vector (red) at the point \((x_0,y_0)\text{.}\)

Thus a vector normal to the curve at the point \((x_0,y_0)\) will be \(\left\langle -\dfrac{dy}{dx},1\right\rangle\text{.}\) We will see subsequently, via implicit differentiation, that for the curve \(f(x,y)=k\text{,}\)

\begin{equation*} \frac{dy}{dx}=-\frac{f_x}{f_y} \end{equation*}

and so a vector normal to the curve at the point \((x_0,y_0)\) will be \(\left \langle \dfrac{f_x(x_0,y_0)}{f_y(x_0,y_0)}, 1 \right \rangle\text{,}\) which is parallel to \(\nabla f(x_0,y_0)\text{.}\)
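To make the final step explicit, and assuming \(f_y(x_0,y_0) \neq 0\) so that the quotient is defined, multiplying this normal vector by the scalar \(f_y(x_0,y_0)\) gives

\begin{equation*} f_y(x_0,y_0) \left \langle \dfrac{f_x(x_0,y_0)}{f_y(x_0,y_0)}, 1 \right \rangle = \langle f_x(x_0,y_0), f_y(x_0,y_0) \rangle = \nabla f(x_0,y_0)\text{.} \end{equation*}

Since one vector is a non-zero scalar multiple of the other, the two are parallel. If instead \(f_y(x_0,y_0) = 0\text{,}\) the tangent to the level curve is vertical and \(\nabla f(x_0,y_0) = \langle f_x(x_0,y_0), 0 \rangle\) is horizontal, so the orthogonality still holds.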

Notice that since \(\nabla f(x_0,y_0)\) is orthogonal to the level curve passing through the point \((x_0,y_0)\) and since \(\nabla f (x_0,y_0)\) is the direction in which the directional derivative takes on its largest value, the “path of steepest ascent” on any surface \(z=f(x,y)\) is always at right angles to its contours. To see this, run the Sage cell below. This generates a 2D contour plot of \(z=f(x,y)\text{.}\) The unit vector \(\hat{\mathbf{u}}\) starting at some point \((x,y)\) and pointing in the direction of some angle is shown by the red arrow. The gradient vector at the point \((x,y)\) is shown in orange.

Example 4.18.

For the function \(f(x,y) = x^2y^3-3x\) find the directions in which the directional derivative at the point \((-2,4)\) is maximised, minimised and \(0\text{.}\)

Answer.

Maximised in the direction \(\mathbf{u} = \langle -259, 192 \rangle\text{;}\) minimised in the direction \(\mathbf{u} = \langle 259, -192 \rangle\text{;}\) and \(0\) when \(\mathbf{u} = \langle 192, 259 \rangle\)

Solution.

For the given function

\begin{equation*} f_x = 2xy^3-3 \: \text{ and } \: f_y = 3x^2y^2\text{,} \end{equation*}

and so

\begin{equation*} \nabla f(-2,4) = \langle -259,192 \rangle\text{.} \end{equation*}

Thus the directional derivative, \(D_{\hat{\mathbf{u}}} f(-2,4)\text{,}\) will be maximised in the direction

\begin{equation*} \mathbf{u} = \nabla f(-2,4) = \langle -259, 192 \rangle \end{equation*}

and minimised in the direction

\begin{equation*} \mathbf{u} = -\nabla f(-2,4) = \langle 259, -192 \rangle\text{.} \end{equation*}

Finally \(D_{\hat{\mathbf{u}}} f(-2,4)\) will be \(0\) when

\begin{align*} \nabla f(-2,4) \cdot \mathbf{u} \amp = 0\\ -259u_1 + 192u_2 \amp = 0 \end{align*}

i.e. when

\begin{equation*} \mathbf{u} = \langle 192, 259 \rangle \end{equation*}

or some scalar multiple of this.
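The three directions in Example 4.18 can be reproduced with integer arithmetic. In this sketch the zero-derivative direction is obtained by rotating the gradient through a right angle, \(\langle a, b \rangle \mapsto \langle b, -a \rangle\text{;}\) that rotation rule and the variable names are choices made here, not something stated in the text.

```python
def grad_f(x, y):
    # Partials of f(x, y) = x^2 y^3 - 3x from the solution.
    return 2 * x * y**3 - 3, 3 * x**2 * y**2

g = grad_f(-2, 4)
steepest_ascent = g
steepest_descent = (-g[0], -g[1])
level_direction = (g[1], -g[0])   # gradient rotated by 90 degrees
dot = g[0] * level_direction[0] + g[1] * level_direction[1]
print(steepest_ascent, steepest_descent, level_direction, dot)
```

The rotated vector has zero scalar product with the gradient by construction, so the directional derivative vanishes along it and along its negative.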

Example 4.19.

For the function \(f(x,y) = x^2-y\) find the level curve, the tangent line and the gradient vector at the point \((-3,1)\text{.}\)

Answer.

The level curve is \(y=x^2-8\text{.}\)

The tangent line is \(y=-6x-17\text{.}\)

The gradient vector is \(\nabla f(-3,1) = \langle -6, -1 \rangle\)

Solution.

Since \(f(-3,1)=8\) the level curve through the point \((-3,1)\) is \(x^2-y=8\) or \(y=x^2-8\text{.}\) We can find the equation of the tangent by standard calculus to obtain

\begin{equation*} y=-6x-17\text{.} \end{equation*}

Next, the gradient vector is

\begin{equation*} \nabla f(-3,1) = \langle f_x(-3,1), f_y(-3,1) \rangle = \langle -6, -1 \rangle\text{.} \end{equation*}

As can be seen in the diagram below, the gradient vector is orthogonal to the level curve.
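The orthogonality in Example 4.19 can also be confirmed by arithmetic. The sketch below checks that the tangent line and the level curve meet at \((-3,1)\) and that the gradient has zero scalar product with the tangent direction \(\left \langle 1, \dfrac{dy}{dx} \right \rangle\text{.}\) The function and variable names are illustrative.

```python
def level_curve(x):
    return x**2 - 8        # y = x^2 - 8, i.e. f(x, y) = 8

def tangent_line(x):
    return -6 * x - 17

x0 = -3
gradient = (2 * x0, -1)            # <f_x, f_y> = <2x, -1>
tangent_direction = (1, 2 * x0)    # <1, dy/dx> with dy/dx = 2x on the level curve
dot = gradient[0] * tangent_direction[0] + gradient[1] * tangent_direction[1]
touches = level_curve(x0) == tangent_line(x0) == 1
print(gradient, tangent_direction, dot, touches)
```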

Example 4.21.

Suppose you are climbing a hill whose shape is given by the equation

\begin{equation*} z = 1000-0.01x^2 - 0.02y^2 \end{equation*}

and you are standing at the point with coordinates \((60,100,764)\text{.}\)

In which direction should you proceed initially in order to be ascending most rapidly?
If you climb in that direction, at what angle to the horizontal will you be climbing initially?

Answer.

Head in the direction of \(\nabla f (60,100) = \langle-1.2,-4 \rangle\text{.}\)
The angle to the horizontal will be \(\theta = \tan^{-1} (4.18) \simeq 1.34^{c}\text{.}\)

Solution.

Since we want to travel on the path of steepest ascent we will want to head in the direction of \(\nabla f(60,100)\text{.}\) Now
\begin{equation*} \nabla f(x,y) = \langle -0.02x, -0.04y \rangle \end{equation*}
and hence
\begin{equation*} \nabla f (60,100) = \langle-1.2,-4 \rangle\text{.} \end{equation*}
In this direction we know that
\begin{equation*} D_{\hat{\mathbf{u}}} f(60,100) = \| \nabla f(60,100) \| = \sqrt{17.44} \simeq 4.18\text{.} \end{equation*}
Thus the angle, \(\theta\text{,}\) to the horizontal will be
\begin{equation*} \theta = \tan^{-1} (4.18) \simeq 1.34^{c}\text{.} \end{equation*}
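The arithmetic in Example 4.21 is easy to reproduce. The sketch below computes the steepest slope as the length of the gradient and converts it to an angle, reported in both radians and degrees. The variable names are illustrative.

```python
import math

gx, gy = -0.02 * 60, -0.04 * 100   # gradient <-0.02x, -0.04y> at (60, 100)
slope = math.hypot(gx, gy)         # steepest slope = length of the gradient
angle_rad = math.atan(slope)       # angle whose tangent is that slope
angle_deg = math.degrees(angle_rad)
print(round(slope, 3), round(angle_rad, 3), round(angle_deg, 1))
```

The angle comes from \(\tan\theta = \text{rise}/\text{run}\text{,}\) where the run is one unit in the horizontal plane and the rise is the directional derivative.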

Exercises Example Tasks

1.

Find the directional derivative for \(f(x,y) = (x-2y)^2 + 5x^2\) at the point \((-3,1)\) in the direction of the point \((1,4)\text{.}\)

2.

Find the maximum value of the rate of change of \(h(s,t) = \dfrac{1}{\sqrt{s^2+t^2}}\) at \((3,4)\text{.}\)

3.

For the curve \(e^x \ln (y) - xy = 0\) use the gradient vector of a two variable function to find the tangent line and the normal line at the point \((2,e^2)\text{.}\)

4.

For the following contour plot for some unspecified function of two variables estimate the sign of the directional derivatives at:

The point \(A\) and in the direction of \(\mathbf{u} = \langle 1,2 \rangle\text{.}\)
The point \(B\) and in the direction of \(\mathbf{w} = \langle -1,-1 \rangle\text{.}\)
The point \(A\) and in the direction of the origin.
The point \(B\) and in the direction of the origin.

Section 4.2 In Three Variables

The concepts of the directional derivative and the gradient vector extend to functions of more than two variables. In this section we will look at some examples for functions of three variables.

Example 4.23.

Find the rate of change of the function \(f(x,y,z) = xy+yz^2+xz^3\) at the point \((2,0,3)\) in the direction \(\hat{\mathbf{u}} = \left \langle -\dfrac{2}{3}, -\dfrac{1}{3}, \dfrac{2}{3} \right \rangle\text{.}\)

Answer.

\(D_{\hat{\mathbf{u}}} f(2,0,3) = \dfrac{43}{3}\text{.}\)

Solution.

The gradient vector for the given function is

\begin{align*} \nabla f \amp = \left \langle f_x, f_y, f_z \right \rangle\\ \amp = \left \langle y +z^3, x+z^2, 2yz+3xz^2 \right \rangle\text{.} \end{align*}

Thus

\begin{equation*} \nabla f (2,0,3) = \langle 27, 11, 54 \rangle \end{equation*}

and so the required directional derivative is

\begin{equation*} D_{\hat{\mathbf{u}}} f(2,0,3) = \langle 27, 11, 54 \rangle \cdot \left \langle -\dfrac{2}{3}, -\dfrac{1}{3}, \dfrac{2}{3} \right \rangle = \dfrac{43}{3}\text{.} \end{equation*}
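The same scalar product works unchanged in three variables. A sketch using exact rational arithmetic for Example 4.23 follows; the choice of Python's `fractions` module and the variable names are illustrative.

```python
from fractions import Fraction

def grad_f(x, y, z):
    # Partials of f = xy + yz^2 + xz^3 from the solution.
    return (y + z**3, x + z**2, 2 * y * z + 3 * x * z**2)

u = (Fraction(-2, 3), Fraction(-1, 3), Fraction(2, 3))
u_length_squared = sum(c * c for c in u)   # equals 1 when u is a unit vector
g = grad_f(2, 0, 3)
rate = sum(gi * ui for gi, ui in zip(g, u))
print(g, u_length_squared, rate)           # rate is an exact Fraction
```

Checking the length of \(\hat{\mathbf{u}}\) first guards against applying the formula to a vector that has not been normalised.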

Example 4.24.

The temperature at the point \((x,y,z)\) is given by the function

\begin{equation*} T(x,y,z) = 200e^{-x^2-3y^2-9z^2}\text{.} \end{equation*}

Find the rate of change of temperature at the point \(P = (2,-1,2)\) in the direction \(\overrightarrow{PQ}\) where \(Q = (3,-3,3)\text{.}\)
In which direction does the temperature increase the fastest at \(P\text{?}\)
Find the maximum rate of increase at \(P\text{.}\)

Answer.

\(D_{\hat{\mathbf{u}}} T(2,-1,2) = \dfrac{-10400}{\sqrt{6}} e^{-43}\text{.}\)
\(\nabla T (2,-1,2) = 400e^{-43} \langle -2, 3, -18 \rangle\text{.}\)
\(\| \nabla T (2,-1,2) \| = \dfrac{400 \sqrt{337}}{e^{43}}\text{.}\)

Solution.

The gradient vector for the given function is

\begin{align*} \nabla T \amp = \left \langle T_x, T_y, T_z \right \rangle\\ \amp = \left \langle -400xe^{-x^2-3y^2-9z^2}, -1200ye^{-x^2-3y^2-9z^2},-3600ze^{-x^2-3y^2-9z^2} \right \rangle\text{.} \end{align*}

Thus

\begin{equation*} \nabla T (2,-1,2) = 400e^{-43} \langle -2, 3, -18 \rangle\text{.} \end{equation*}

Since \(\overrightarrow{PQ} = \langle 1, -2, 1 \rangle\text{,}\) the required rate of change is given by the directional derivative
\begin{equation*} D_{\hat{\mathbf{u}}} T(2,-1,2) = 400 e^{-43} \langle -2, 3, -18 \rangle \cdot \dfrac{1}{\sqrt{6}} \langle 1, -2, 1 \rangle = \dfrac{-10400}{\sqrt{6}} e^{-43}\text{.} \end{equation*}
The direction in which the temperature increases the fastest at \(P\) is
\begin{equation*} \nabla T (2,-1,2) = 400e^{-43} \langle -2, 3, -18 \rangle\text{.} \end{equation*}
The maximum rate of increase at \(P\) is the maximum value of \(D_{\mathbf{u}} T(2,-1,2)\) which is
\begin{equation*} \| \nabla T (2,-1,2) \| = \dfrac{400 \sqrt{337}}{e^{43}}\text{.} \end{equation*}
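The quantities in Example 4.24 can be evaluated numerically as a check. This sketch uses the gradient from the solution and builds the unit vector along \(\overrightarrow{PQ}\) directly from the two points. The names are illustrative, and the printed values are extremely small because of the factor \(e^{-43}\text{.}\)

```python
import math

def grad_T(x, y, z):
    # Gradient of T = 200 exp(-x^2 - 3y^2 - 9z^2) from the solution.
    e = math.exp(-x**2 - 3 * y**2 - 9 * z**2)
    return (-400 * x * e, -1200 * y * e, -3600 * z * e)

P, Q = (2, -1, 2), (3, -3, 3)
pq = tuple(q - p for p, q in zip(P, Q))           # vector from P to Q
length = math.sqrt(sum(c * c for c in pq))
u = tuple(c / length for c in pq)                 # unit vector along PQ

g = grad_T(*P)
rate_along_pq = sum(gi * ui for gi, ui in zip(g, u))
max_rate = math.sqrt(sum(gi * gi for gi in g))    # length of the gradient
print(rate_along_pq, max_rate)
```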

Example 4.25.

Find the equation of the tangent plane to the level surface of \(f(x,y,z) = x^2+y^2 - z + \cos(z)\) at the point \((-1,1,0)\text{.}\)

Answer.

\(2x-2y+z = -4\text{.}\)

Solution.

Since \(f(-1,1,0) = 3\text{,}\) the level surface for this function satisfies the equation

\begin{equation*} x^2+y^2 - z + \cos(z) = 3\text{.} \end{equation*}

A normal to this surface at the point \((-1,1,0)\text{,}\) and hence to the tangent plane at this point, is given by \(\nabla f (-1,1,0)\text{.}\) Now,

\begin{equation*} \nabla f = \left \langle 2x, 2y, -1-\sin(z) \right \rangle\text{,} \end{equation*}

and so

\begin{equation*} \nabla f (-1,1,0) = \langle-2, 2, -1 \rangle\text{.} \end{equation*}

Thus the equation of the tangent plane is

\begin{equation*} \langle -2, 2, -1 \rangle \cdot \left( \langle x, y, z \rangle - \langle -1, 1, 0 \rangle \right) = 0 \end{equation*}

which simplifies to

\begin{equation*} 2x-2y+z = -4\text{.} \end{equation*}
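The tangent plane in Example 4.25 can be checked by writing \(\mathbf{n} \cdot (\mathbf{r} - \mathbf{p}) = 0\) as \(\mathbf{n} \cdot \mathbf{r} = \mathbf{n} \cdot \mathbf{p}\) and computing the right-hand side. The sketch below does this with the gradient from the solution; the variable names are illustrative.

```python
import math

def f(x, y, z):
    return x**2 + y**2 - z + math.cos(z)

def grad_f(x, y, z):
    return (2 * x, 2 * y, -1 - math.sin(z))

p = (-1.0, 1.0, 0.0)
n = grad_f(*p)                                # normal to the level surface at p
rhs = sum(ni * pi for ni, pi in zip(n, p))    # plane is n . r = n . p
print(f(*p), n, rhs)
```

The output corresponds to the plane \(-2x+2y-z = 4\text{,}\) which is the equation in the solution multiplied through by \(-1\text{.}\)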

Exercises Example Tasks

1.

Find the directional derivative of \(g(x,y,z) = \dfrac{z-x}{z+y}\) at \((1,0,-3)\) in the direction \(\mathbf{a} = -6\mathbf{i} + 3\mathbf{j} - 2\mathbf{k}\text{.}\)

2.

By thinking of level surfaces to a function of \(3\) variables show that the normal lines to a sphere pass through its centre.
