Chapter 3 Introduction to the Derivative

If I have seen further it is by standing on the shoulders of giants.

— Isaac Newton

3.1 Definition of a Derivative

Imagine you are driving in your parent’s car (or perhaps you are driving your own car). You can check the speedometer to read the current speed. Suppose we wanted to determine how the car is calculating this speed. We could start by using the typical formula learned in basic physics:

\[ v = \frac{\Delta x}{\Delta t} = \frac{x_f - x_i}{t_f - t_i} = \frac{\text{Change in Position}}{\text{Change in Time}} \]

How do we measure the change in position or the change in time? We can calculate how far your car has moved over the past 2 seconds, the past 20 seconds, the past minute, or the past 30 minutes! Which of these times would allow us to compute the value reported by your speedometer?

We make a distinction between two types of speed:

Average Speed allows one to compute the average rate at which a car is changing position over time. To compute this value, simply divide how far the car has traveled by the time it took to travel that distance (the formula above).
Instantaneous Speed is the rate at which one is moving at a single moment in time.

What we want is the instantaneous speed, not the average speed. Computing the instantaneous speed is a bit more subtle than calculating the average speed. If we are calculating the instantaneous speed, we are essentially asking what the speed of the car will be in a single moment, so the difference in time (the denominator in the equation above) will be 0! This seems like a contradiction: how can we expect the car to travel (and thus have a speed) over no time? The way out of this conundrum is an application of the limiting process described in the previous two chapters. In particular, we need a notion called the derivative.

The interactive figure below illustrates the difference between average speed and instantaneous speed. Whenever the two points are not overlapping, that indicates we are computing the average speed, which is the slope of the line defined by two non-overlapping points on the plot. In a mathematical context, such a line is called a secant line. On the other hand, when the points overlap, then the slope of the line represents the instantaneous speed of the car at that moment. In a mathematical context, such a line is called the tangent to the curve at that point.

Now we have some context of the problem. We also have a nice picture. How do we translate these thoughts and these pictures into mathematics? Fundamentally, what we want is the slope of the line that results when we bring the points \((a, f(a))\) and \((b, f(b))\) very close together. As previously mentioned, this involves a limit. In particular, if we want the slope of the curve at a particular point on some graph (which is the instantaneous speed, or derivative) we can compute the following limit:

Definition 3.1 (Derivative Definition 1) \[\begin{equation} f'(b) = \lim_{a \rightarrow b} \frac{f(b) - f(a)}{b - a} \end{equation}\]

Great! We have a definition we can work with! It should be noted, however, that a definition we will use more often is the following fraction limit:

Definition 3.2 (Derivative Definition 2) \[ f'(x) = \lim_{h \rightarrow 0} \frac{f(x + h) - f(x)}{h} \]

This definition is identical to the one above if we let \(a = x\) and \(b = x + h\). Note that \(h\) is the horizontal distance between \(a\) and \(b\) in our interactive figure above. Therefore, the previous definitions means: “Calculate the slope of the line between \((x, f(x))\) and \((x + h, f(x + h))\) as we let the distance between those points go to zero”.

Note that there are other notations which are frequently used to denote the derivative of a function. The following all denote the same concept:

\[ f'(x) \hspace{7mm} D_x f(x) \hspace{7mm} \dot{f}(x) \hspace{7mm} \frac{d}{dx}f(x)\] The first and last are the notations that will be used most frequently throughout this book. The second is typically reserved for derivatives of functions involving multiple variables. The third notation is frequently used in physics to indicate a derivative with respect to time. Each notation type has a name: the first is Lagrange’s Notation, the second is Euler’s Notation, the third is Newton’s Notation, and the last is Leibniz’s Notation.

3.2 Examples Applying the Limit Definition

3.2.1 Derivative of Constants

3.2.2 Derivative of Lines

Now that we have a mathematical definition, we want to apply that definition to functions. Let’s start with a really simple function: a straight line!

Notice that no matter which points are selected on a line, they always return the same slope, namely, the slope of the line itself! Therefore, if we put the formula of a line in the limit definition above, we ought to get the slope of the line back. Recall that any line can be written in slope intercept form as: \(f(x) = ax + b\), where \(a\) is the slope of the line and \(b\) is the y-intercept of that line. Therefore, we have:

\[ f(x + h) = a(x + h) + b \]

\[ f(x) = ax + b \]

Therefore,

\[ f'(x) = \lim_{h \rightarrow 0} \frac{f(x + h) - f(x)}{h} = \lim_{h \rightarrow 0}\frac{a(x + h) + b - (ax + b)}{h} \] \[ = \lim_{h \rightarrow 0} \frac{ax + ah + b - ax - b}{h} = \lim_{h \rightarrow 0} \frac{ah}{h} = \lim_{h\rightarrow 0} a = a \] Wonderful! The definition returned the value we expected (the slope of the line)!

3.2.3 Derivative of Monomials

Next, let’s try to compute the derivative of \(f(x) = x^2\), a parabola:

\[ f(x) = x^2 \] \[ f(x + h) = (x + h)^2 \] Therefore,

\[ f'(x) = \lim_{h \rightarrow 0} \frac{f(x + h) - f(x)}{h} = \lim_{h \rightarrow 0} \frac{(x + h)^2 - x^2}{h}\] \[ = \lim_{h \rightarrow 0} \frac{(x^2 + 2xh + h^2) - x^2}{h} = \lim_{h \rightarrow 0} \frac{2xh + h^2}{h} = \lim_{h \rightarrow 0} 2x + h = 2x.\] The interactive graph below illustrates the relationship between the tangent lines of a parabola (on the left) and the value of the slope (the point on the line \(y = 2x\)).

We can also find the derivative of the function \(f(x) = x^3\) using an argument identical to the one just provided. We have:

\[f(x) = x^3 \hspace{2.5mm}\text{and}\hspace{2.5mm}f(x + h) = (x + h)^3 \]

Using the limit definition, we have

\[f'(x) = \lim_{h\rightarrow 0}\frac{f(x + h) - f(x)}{h} = \lim_{h\rightarrow 0}\frac{(x + h)^3 - x^3}{h}\]

Expanding \((x + h)^3\), we have

\[(x + h)^3 = (x + h)^2 (x+h) = (x^2 + 2xh + h^2)(x + h) = x^3 + 3x^2 h + 3x h^2 + h^3\]

Therefore, the derivative becomes

\[f'(x) = \lim_{h\rightarrow 0}\frac{\color{red}{x^3} + 3x^2 h + 3xh^2 + h^3 - \color{red}{x^3}}{h} = \lim_{h\rightarrow 0}\frac{3x^2 h + 3xh^2 + h^3}{h} = \lim_{h\rightarrow 0}\left(3x^2 + 3xh + h^2\right)\]

Therefore, the derivative is given by

\[f'(x) = 3x^2\]

An illustration of this slope is given by

Now suppose we wanted to find the derivatives of \(x^4\) and \(x^5\) and … and so on. It would take a long time for us to compute the derivative for all of these curves! One thing to notice is that for both \(x^2\) and \(x^3\), the exponent became a coefficient. After bringing the exponent down, we reduced the exponent by one. Therefore we might expect the following:

\[\text{If}\hspace{1.5mm} f(x) = x^n \Rightarrow \frac{df}{dx} = nx^{n - 1}\]

This turns out to be true. Proving that this is true requires the use of the Binomial Theorem, which the reader may not know. The reader is not obligated to understand the proof of this fact; however, the reader must be familiar with the formula, as it will arise frequently later.

We will let the exponent be \(n\), which will stand for any positive integer (for now):

\[f(x) = x^n\]

\[f(x + h) = (x + h)^n\]

Now, there was a very valuable theorem called the Binomial Theorem that you learned (but may have forgotten) in Algebra. The theorem is the following:

\[ (a + b)^n = \sum_{x = 0}^n {n \choose x} a^x b^{n - x}\] Therefore, we have:

\[ f'(x) = \lim_{h \rightarrow 0}\frac{f(x + h) - f(x)}{h} = \lim_{h \rightarrow 0}\frac{(x + h)^n - x^n}{h} = \lim_{h \rightarrow 0}\frac{\sum_{i = 0}^{n} {n \choose x} x^i h^{n-i} - x^n}{h}\] \[ = \lim_{h \rightarrow 0}\frac{x^n + \sum_{i = 0}^{n - 1} {n \choose i} x^ih^{n - i} - x^n}{h} = \lim_{h \rightarrow 0} \frac{\sum_{i = 0}^{n - 1} {n \choose i} x^i h^{n - i}}{h} = \lim_{h\rightarrow 0}\sum_{i = 0}^{n - 1} {n \choose i} x^i h^{(n - 1) - i}\] Now, all of the terms in the last quantity but one will be zero since we are taking the limit as \(h \rightarrow 0\). The only term for which is this not true is when the index \(i = n - 1\), since \(h^{(n - 1) - (n - 1)} = h^0 = 1\):

\[ = \lim_{h \rightarrow 0} \left({n \choose n - 1} x^{n - 1} + \sum_{i = 0}^{n - 2} {n \choose i} x^i h^{(n - 1) - i} \right) = {n \choose n-1} x^{n - 1} = nx^{n - 1}\]

This formula is so important that it will receive its own number.

\[\begin{equation} \frac{d}{dx} x^n = nx^{n-1} \tag{3.1} \end{equation}\]

Example 3.1 (Monomial Derivatives 1) Compute the derivative of the function \(f(x) = x^{27}\).

This is a simple application of the formula above. For this example, \(n = 27\). Therefore, we have:

\[\frac{d}{dx} x^{27} = 27 x^{26}\]

Example 3.2 (Monomial Derivatives 2) Compute the derivative of the function \(f(x) = 10x^9\).

We know that constants go through derivatives. In particular, the constant \(10\) is not affected by the derivative. Therefore, we have:

\[\frac{df}{dx} = \frac{d}{dx}\left( 10 x^9\right) = 10\frac{d}{dx}x^9 = 10\left(9 x^8\right) = 90x^8\]

3.2.4 Derivative of Square Root

We will illustrate the limit definition one last time, then we will proceed to rules regarding the derivative.

Suppose we seek the derivative of \(f(x) = \sqrt{x}\). Then we have the following:

\[ f(x) = \sqrt{x} \] and

\[ f(x + h) = \sqrt{x + h}\] Therefore, we have:

\[f'(x) = \lim_{h\rightarrow 0}\frac{f(x + h) - f(x)}{h} = \lim_{h \rightarrow 0}\frac{\sqrt{x + h} - \sqrt{x}}{h}\] Notice that if we naïvely allow \(h \rightarrow 0\) before manipulating the fraction, we end up with \(\frac{\sqrt{x} - \sqrt{x}}{0} = \frac{0}{0}\). This isn’t very helpful; we will not be able to apply that result to any problem we encounter in the real world. We’ll have to use a little trick to cancel out the \(h\) from the denominator:

\[ = \lim_{h\rightarrow 0}\frac{\sqrt{x + h} - \sqrt{x}}{h}\cdot\frac{\sqrt{x + h} + \sqrt{x}}{\sqrt{x + h} + \sqrt{x}} = \lim_{h\rightarrow 0}\frac{x + h + \sqrt{x}\cdot\sqrt{x+h} - \sqrt{x}\cdot\sqrt{x + h} - x}{h(\sqrt{x + h} + \sqrt{x})} \]

\[ \lim_{h\rightarrow 0}\frac{h}{h(\sqrt{x + h} + \sqrt{x})} = \lim_{h\rightarrow 0}\frac{1}{(\sqrt{x + h} + \sqrt{x})} = \frac{1}{2\sqrt{x}} \]

Note that \(\frac{d}{dx}\sqrt{x} = \frac{1}{2\sqrt{x}} = \frac{1}{2}x^{1/2 - 1}\) also satisfies the formula \(\frac{d}{dx} x^n = nx^{n - 1}\) where we assumed \(n\) was assumed to be an integer. The same proof we provided for the case when \(n\) is an integer can be extended to show that the formula \(\frac{d}{dx} x^n = nx^{n - 1}\) holds for any real number \(n\).

3.3 Derivatives of log(x), exp(x), sin(x), and cos(x)

3.3.1 Log(x)

First, we will provide a sketch demonstrating how to determine the derivative of the function \(f(x) = \ln(x)\) using the definition 3.2.

\[\frac{d}{dx}\left( \ln(x) \right) = \lim_{h \rightarrow 0}\frac{\ln(x + h) - \ln(x)}{h}\]

Now, an important rule involving logs states: \(\ln(a) - \ln(b) = \ln(\frac{a}{b})\). Therefore, we have:

\[ = \lim_{h\rightarrow 0}\frac{1}{h}\ln\left(\frac{x + h}{x}\right) = \lim_{h\rightarrow 0}\frac{1}{h}\ln\left(1 + \frac{h}{x}\right)\]

Another important rule involving logs: \(a\cdot\ln(b) = \ln(b^a)\). Hence, the limit becomes:

\[ = \lim_{h\rightarrow 0} \ln\left(1 + \frac{h}{x}\right)^{\frac{1}{h}} \]

We take the common definition of Euler’s number (as opposed to Euler’s Constant) \(e = \lim_{h \rightarrow \infty} \left(1 + \frac{1}{h}\right)^h\). Therefore, rearranging the limit above, we have:

\[ \lim_{h\rightarrow 0} \ln\left(1 + \frac{1/x}{1/h} \right)^{1 / h} = \ln\left(e^{1/x} \right) = \frac{1}{x}.\] Therefore,

\[\begin{equation} \frac{d}{dx}\left( \ln(x)\right) = \frac{1}{x} \tag{3.2} \end{equation}\]

An illustration of this plot is provided below.

3.3.2 Exp(x)

Next, we will find the derivative of the function \(f(x) = e^x\). We will use a clever trick involving the chain rule, 3.3. The proof goes like this:

\[f(x) = e^x\] Recall that \(\ln(e^x) = x\) since \(\ln(x)\) and \(e^x\) are inverse functions. Thus, taking the natural log of both sides of the equation above, we have:

\[\ln\left( f(x)\right) = x\]

Now we can take a derivative with respect to \(x\). By the chain rule, we first take the derivative of the natural log part, then we will take the derivative of \(f(x)\). Furthermore, the derivative of \(x\) with respect to \(x\) is just one. Therefore, we have:

\[\frac{1}{f(x)}\frac{d}{dx}f(x) = 1\]

Multiplying both sides of this equation by \(f(x)\), we have:

\[\frac{d}{dx} f(x) = f(x) = e^x\]

Therefore, the derivative of \(f(x) = e^x\) is itself! That’s an amazing thing; indeed, \(e^x\) is the only function for which this holds! An illustration of this remarkable fact is provided below.

3.3.3 Sin(x)

We will now compute the derivative of the sine function, \(\sin x\). As usual, we begin with the definition of a limit:

\[\frac{d}{dx}\sin x = \lim_{h\rightarrow 0}\frac{\sin(x + h) - \sin(x)}{h}\] We will now expand \(\sin(x + h)\). To do so, we will be using formula (1.7). Using this result, we have:

\[ = \lim_{h\rightarrow 0}\frac{\cos(x)\sin(h) + \sin(x)\cos(h) - \sin(x)}{h} = \cos(x)\lim_{h\rightarrow 0}\frac{\sin(h)}{h} + \sin(x)\lim_{h\rightarrow 0}\frac{\cos(h) - 1}{h} \]

\[ = \cos(x)\lim_{h\rightarrow 0}\frac{\sin(h)}{h} - \sin(x)\lim_{h\rightarrow 0}\frac{1 - \cos(h)}{h}\]

We now have two limits we must compute. The first is \(\lim_{h\rightarrow 0}\frac{\sin(h)}{h}\) and the second is \(\lim_{h\rightarrow 0}\frac{1 - \cos(h)}{h}\).

These are limits computed in the previous chapter.

\[ \frac{d}{dx} \sin(x) = \cos(x)\lim_{h\rightarrow 0}\frac{\sin(h)}{h} - \sin(x)\lim_{h\rightarrow 0}\frac{1 - \cos(h)}{h} \] Applying the limits we just computed, we have:

\[\begin{equation} \frac{d}{dx}\sin(x) = \cos(x) \tag{3.3} \end{equation}\]

The interactive below illustrates that the derivative of sine is indeed cosine:

3.3.4 Cos(x)

Computing the derivative of cosine is almost identical to the derivative of sine. Once again, we apply the definition of the derivative:

\[\frac{d}{dx}\cos(x) = \lim_{h\rightarrow 0}\frac{\cos(x + h) - \cos(x)}{h} \] Now we must apply the cosine sum identity (1.9):

\[= \lim_{h\rightarrow 0}\frac{\cos(x)\cos(h) - \sin(x)\sin(h) - \cos(x)}{h}\]

\[=\cos(x)\lim_{h\rightarrow 0}\frac{\cos(h) - 1}{h} - \sin(x)\lim_{h\rightarrow 0}\frac{\sin(h)}{h}\]

Those limits should look familiar! Indeed, we have already proved that \(\lim_{h\rightarrow 0}\frac{\cos(h) - 1}{h} = 0\) and \(\lim_{h\rightarrow 0}\frac{\sin(h)}{h} = 1\). Therefore, we have:

\[\begin{equation} \frac{d}{dx}\cos(x) = -\sin(x) \tag{3.4} \end{equation}\]

An Interactive Calculus Textbook

Chapter 3 Introduction to the Derivative

3.1 Definition of a Derivative

3.2 Examples Applying the Limit Definition

3.2.1 Derivative of Constants

3.2.2 Derivative of Lines

3.2.3 Derivative of Monomials

3.2.4 Derivative of Square Root

3.3 Derivatives of log(x), exp(x), sin(x), and cos(x)

3.3.1 Log(x)

3.3.2 Exp(x)

3.3.3 Sin(x)

3.3.4 Cos(x)

3.4 Derivative Rules

3.4.1 Constant Rule

3.4.2 Product Rule

3.4.3 Chain Rule

3.4.4 Quotient Rule

3.4.5 Tan(x)