C.1 Chain Rule - Developing a Conceptual Understanding
Calculus students often hear that "the Chain Rule" is scary or difficult. Let's break it down, and learn how to find the most complicated derivatives. On this screen we'll develop a conceptual understanding of the Chain Rule, and then on the next screen we'll present the rule and work some initial practice problems.
You might have noticed that, so far in this "Calculating Derivatives" chapter, we have only found the derivative of simple functions like
Note: Some students initially find abstract discussion of the Chain Rule difficult to understand. If you're one of them, we encourage you to jump to the Check Questions at the bottom of next screen to see how easy the Chain Rule actually is to use in practice. You can then proceed to the "Basic Practice" screen to develop your problem-solving skills further. Once you see how to use the Chain Rule routinely, you may find the discussion of why it works the way it does easier to follow.
Why we need a new rule
To begin, let's quickly consider two examples to illustrate why we need a new rule at all.
First, consider the function
Naively, looking at
As a second example, consider the function
Again thinking naively, you might simply bring the power of 2 down in front of the parentheses:
As you'll see below, the Chain Rule resolves this discrepancy, and will let us — easily, with practice! — find the derivatives of functions that are quite complicated.
In order to understand the Chain Rule, we first need to make sure we're clear about compound functions.
Compound (Composite) Functions Review
Recall that a compound function, also known as a composite function, is a function comprised of one or more functions inside it.
For instance,
As another example,
As yet another example,
"How can I tell what the inner and outer functions are?"
Here's a foolproof method to determine the inner and outer functions: Imagine calculating the numerical output of the function for a particular input value of x and identify the steps you would take, because you'll always automatically start with the inner function and work your way out to the outer function.
For example, imagine computing
This imaginary computational process works every time to identify correctly what the inner and outer functions are.
Example 1: Identify Inner and Outer Functions of Compound Functions
Each function below can be thought of as a composition of functions,
Note: Often there is more than one way to define the inside and outside functions, and even to determine how many "layers deep" the functions go. Our solution below may not be the only correct possibility.
𝑝 ( 𝑥 ) = ( 3 𝑥 − 1 ) 2 . (We'll view this as being comprised of three functions.)𝑠 ( 𝑥 ) = 1 1 + 𝑒 − 𝑥 . 𝑓 ( 𝑡 ) = 𝐴 c o s ( 𝑏 𝑡 ) .
Solution.
In the tables below we present three different ways of describing each function's decomposition: verbal description, "box notation," and more common "function notation" using x, u, t and such.
(a) Given
| Inside | Outside | |
|---|---|---|
| description | multiply the input by 3, and subtract 1 | square the input |
| boxes | ||
| function notation |
So, composing our outside and inside functions, we get
"f of g of x,
Alternate notation: You may see the compound (or composite) function
Now please set aside this quick review of compound functions. It'll be imporant again in a bit, but first we're going to develop an intuitive understanding of the Chain Rule before we present it formally.
Developing a Conceptual Understanding of the Chain Rule: A balloon ascends and cools
Before we present the official Chain Rule, let's consider an example situation to illustrate the basic idea.
Imagine a balloon that travels straight upward at a rate given in m/s.
As you may know, as you move upward away from the Earth's surface, the temperature of the air decreases. Specifically, the air around it gets cooler and cooler at a rate given in
To keep our focus on the key point here, we're going to pretend that the two rates are constant. Specifically, let's imagine that the balloon ascends such that its elevation, E, changes at the constant rate with respect to time:
Let's also imagine that as the balloon travels upward from the Earth's surface, its temperature, T, changes at the constant rate with respect to elevation:
Here's the question that gets to the core of the Chain Rule:
What is
For instance, in 1 second, how much does the balloon's temperature change?
[Do you have an answer in mind? If not, please stop and develop one for yourself. In particular, imagine what happens over 1 second: the balloon travels upward ___ m, which means its temperature changes by . . . . ]
If your instinct was simply to multiply the two rates, then great! Hold onto that intuition, because it is perfectly correct and is at the core of the Chain Rule.
If not, then think about what happens over the course of a 1-second time change. Given
Now shift focus. When the balloon's elevation increases by 0.004 meters, what temperature change does it undergo? Since the temperature rate of change is
Having focused on what happens in 1 second, let's return to the time-rate at which the balloon's temperature T changes,
If that all makes sense, you have the fundamental idea behind the Chain Rule.
Recasting the balloon scenario in terms of functions
To see how this scenario relates to compound functions, let's recast the balloon's temperature change in function notation.
We know that variations in time result in corresponding variations in elevation, so elevation is a function of time. We denote this functional dependence by writing
Similarly, variations in elevation result in corresponding variations in temperature, so temperature is a function of elevation. We denote this by writing
Putting the pieces together, we can write temperature T as a function of time t as the composition of the functions T and
Returning to the question of
The key thing to notice: to find the rate-of-change of the overall function
The Upshot
- A compound (or composite) function is comprised of an outer function and an inner function.
- Typically, without much thought, to find the rate-of-change of a compound function in an everyday scenario you would automatically multiply the rate-of-change of the outer function with the rate-of-change of the inner function. That is the Chain Rule.