Proving the Fundamental Theorems of Calculus
Date: 02/02/2001 at 10:22:32 From: Junior Wellington Subject: The Fundamental Theorem of Calculus Good morning to you all. There appears to be some confusion in regard to the first and second fundamental theorems of calculus. In some books, the first theorem is given, and in others, the second is. What is the correct order? Also, could you please provide a proof of the second fundamental theorem? Yours sincerely, Junior
Date: 02/12/2001 at 16:04:10 From: Doctor Jordi Subject: Re: The Fundamental Theorem of Calculus Hi, Junior - thank you for writing to Dr. Math. There is often a trade-off to be made between rigor and clarity, which is one of the reasons that different authors present the same material differently. I will try to give you a rigorous proof. Before reading it, you should consult the following explanation from our archives: Riemann Sums and the Integral http://mathforum.org/dr.math/problems/brian1.18.99.html This is not a rigorous proof, but only a motivation for the proof that I'm going to provide. In particular, the explanation states that "by definition, the inverse of integration is differentiation," something that is true only for a specific kind of integration. Historically, indefinite integration has always been defined to be the inverse of differentiation, i.e. the indefinite integral of f(x) is the collection of all possible antiderivatives of f(x), which happen to differ only by a constant. But definite integration, motivated by the problem of finding areas under curves, was originally defined as a limit of Riemann sums. Only later was it discovered that the limits of these Riemann sums can actually be computed with antiderivatives, leading to our modern FTC. The Fundamental Theorem of Calculus (FTC) is usually presented as a pair of statements, sometimes with one as a corollary of the other: IF: f(t) is continuous on an open interval I; a and b are constants contained in I; x and t are in I; and F(t) is any antiderivative of f(t); THEN: 1) Int[a,x](f(t) dt) = F(x); and 2) Int[a,b](f(t) dt) = F(b) - F(a) Statement (2) is sometimes called the "Evaluation Theorem" because it allows us to evaluate the definite integral. (Note that I've used "Int[a,b](f(t) dt)" to mean "The definite integral of f(t) from a to b." Sometimes I will only write Int[a,b] if I'm only concerned with the limits of integration and the function is implied or understood.) So, how do we prove these statements? Statement 1: ------------ Let's give a definition of A(x). We will define, for constant a, A(x) = Int[a, x](f(t) dt) You can think of this function A(x) as the area under the curve from a up to x, for fixed a and variable x. Look at the diagram here to see more clearly what I mean: The main motivation for this proof is that dA/dx = f(x). Is this true? Well, if so, then by the definition of the derivative, dA A(x + h) - A(x) -- = lim[h -> 0] --------------- dx h which we think is equal to f(x). The general idea of the proof I am about to show you is to think of h as an increment in x, make that increment infinitesimal, and see what implications follow from that. Let's look in the numerator of the definition above. For any h > 0 such that x + h is in I, A(x + h) = Int[a, x + h](f(t) dt) and thus A(x + h) - A(x) = Int[a, x + h](f(t) dt) - Int[a, x](f(t) dt) = Int[x, a](f(t) dt) + Int[a, x + h](f(t) dt) (true, since Int[r, s] = - Int[s, r]) = Int[x, x + h](f(t) dt) (*) (true, since Int[r, d] + Int[d, s] = Int[r, s]) Now, can you picture what that last integral looks like? Remember, we are about to make h infinitely small (h -> 0), so you can think of the integral Int[x, x + h] as a very thin rectangle under the graph of f(t) of width h and height somewhere between f(x) and f(x + h). Look at the above diagram again to see what I'm talking about. The integral (*) is nothing more than a small increment in the area function, A(x). What else can we say about that little integral, that slim rectangle? Just how narrow is it, and more importantly, how tall is it? Well, we can definitely give some restrictions on how tall it can be, and thus we can also give some restrictions on its area. We can do this by using a theorem from calculus that places limits on the value of a definite integral. Let m be the minimum value of f(t) in the interval [x, x + h] and M be the corresponding maximum. Thus, the area of our integral Int[x, x + h](f(t) dt) must be greater than or equal to hm and less than or equal to hM. That is, hm <= Int[x, x + h](f(t) dt) <= hM Divide through by h and get: Int[x, x + h](f(t) dt) m <= ---------------------- <= M h Hey, this is really getting somewhere! Do you recognize that little fellow, bounded by m and M? We certainly have met him before, but he looked a little different, something more like A(x + h) - A(x) m <= --------------- <= M h Great - we're almost there. We now finally let h -> 0 and see what happens. The bounded quantity above will be our derivative, dA/dx. What about m and M, though? What values will they take now? Well, remember they were defined as the minimum and maximum values of f(t) in the interval [x, x + h]. As h gets smaller and smaller, m and M have fewer and fewer values to choose from, until all that's left is f(x). So, m and M both tend to f(x) as h tends to 0. Using the Squeeze Theorem for limits, since both m and M tend to the same limit, f(x), the quantity bounded by them must also tend to that limit. Thus, dA/dx = f(x). Therefore, an antiderivative of f(x) is A(x). Thus Int[a,x](f(t)dt) = F(x) This completes the proof of statement 1. The exchange of F(x) for A(x) in that last step is justified because we specified that they are both antiderivatives of f(x), although they may differ by a constant. The value of this constant, in fact, will be crucial for proving statement 2. Statement 2: ------------ The second part of the FTC follows immediately from the first part, to the point where it is practically just a corollary. We want to know what Int[a, b](f(t) dt) evaluates to. The natural way to do this is by noting that we already know two possible ways to express the integral Int[a, x](f(t) dt) with its antiderivatives A(x) and F(x). Since A(x) and F(x) have the same derivative, f(x), they must differ only by a constant. That is, A(x) = F(x) + C (**) We want to now know what that constant is. We do a little trick and let x = a, so that A(a) = 0, because A(a) = Int[a, a](f(t) dt) = 0. We see that: 0 = F(a) + C so C = - F(a) Substitute back in (**). A(x) = F(x) - F(a) We now replace x by b and get the conclusion of Statement 2: A(b) = Int[a,b](f(t)dt) = F(b) - F(a) And we have proved everything! **************************************** One more comment. It is possible to prove Statement 2 without relying on Statement 1, by using the definition of the integral and the mean value theorem for derivatives. The general flavor of this alternate proof is as follows: 1) For each subinterval in the partition used for the Riemann sum in the definition of the integral, use the mean value theorem for derivatives. 2) Each subinterval will have an expression of the form F(j) - F(k). When you add up all those expressions, you will ultimately end up with F(b) - F(a) where a and b are your limits of integration. 3) You will find that for any partition there exists a Riemann sum that is equal to F(b) - F(a), so that, therefore, the integral that is defined as the limit of these Riemann sums will also have that value. Perhaps this is the alternate stream of logic you have seen in your textbooks, and it is also perfectly okay. Which one is correct? It depends on what you want it for. The second possible proof I outlined for proving statement 2 is in a sense stronger than the complete proof I gave before. It is very sly to note this, but this second proof says nothing about the continuity of the function we are integrating, since it goes directly into the definition of the integral. Notice how I had to include the continuity of f in the hypothesis to the FTC as I presented it to you. That is why, if we use the second proof, we have a stronger form of statement 2 that only requires the function to be integrable (all continuous functions in an interval can be integrated, but not all integrable functions are continuous). As a final note, I will say that in mathematics there is not always a perfectly "correct" answer. Sometimes we do things one way to emphasize one point; sometimes we do them another way, concentrating and being very careful and pedantic about another point. It is all part of the game. Write back if you have questions about anything. I hope I addressed all your questions adequately. Best regards, - Doctor Jordi, The Math Forum http://mathforum.org/dr.math/
Search the Dr. Math Library:
Ask Dr. MathTM
© 1994- The Math Forum at NCTM. All rights reserved.