Substitution

Substitution is the most powerful and at the same time perhaps the easiest method of evaluating integrals, which is why it is our method of choice. It has one disadvantage: sometimes it fails. Therefore we not only need to learn the mechanics of substitution itself, but also acquire some feeling for when the substitution works and when it fails. This comes with experience. The general idea of sustitution (not just in integral) is described in this note.

Direct substitution

is based on this mathematical theorem. In the ideal case it can be expressed by this procedure:

That is, we choose a transformation y = g(x) and use it to literally replace all appearances of g(x) with the symbol y. We also need to replace dx. For this we use the equality we deduced in the second row, according to it we replace g′(x)dx with dy.

If this substitution succeeds, we obtain a new integral and with a bit of luck we can solve it. Then one has to also do a so-called back substitution, that is, return back to the original variable, but that is simple, we have a formula for it.

How does this tie in with the formal theorem on substitution? We have a function f (y) = e^y and we are interested in in an integral that features f composed with g(x) = sin(x). This more complicated integral is considered on the interval J of all real numbers, since we always want the integral on the largest possible set and there is no reason to restrict ourselves to something smaller. The function g maps this interval onto the interval I = [−1,1]. According to the theorem on direct substitution, it is enough to find an indefinite integral of the function f on the interval I, which is the middle part in the calculation above. Into the formula that we obtained we substitute g and get the answer to our original question, valid on the original interval I.

The beauty of the procedure we use above above is that it allows us to dispose of all this theoretical reasoning, we do not worry about J and I, we simply want to exchange one expression for another. We also need not remember that according to that theorem we also need to have the derivative of g in that complicated integral with composed f. The substituting procedure forces us to replace not dx but g′(x)dx. If we do not find it there (that is, if we did not have that cosine in the integral), then this substitution can't be done. Thus when doing practical calculations people do not worry about that theorem, the procedure we have shown above guarantees that everything is fine.

The main weakness is that we need to have that g′(x) next to dx and that x must appear exclusively in the form of g(x), otherwise the chosen substitution with g would not be possible. However, this is rarely possible. Fortunately, one can help it, which makes substitution one of the most powerful tools. How does it work?

The basic requirement is that after the substitution, all x must disappear from the integral. The only tool available is the chosen transformation y = g(x) and the related equality y = g′(x)dx, but we are also allowed to change these equalities algebraically in order to obtain formulas that allow us to replace expressions featuring x with expressions featuring y. This gives us tremendous freedom, but one should remember that in such a case we do not use the quoted theorem as stated, so then we are on a shaky ground and the answer may come out wrong. It is therefore strongly recommended to check that our result is correct (by differentiating it) - actually, we should do this with every integral we solve. Actually, there is rarely any trouble unless you get really wild with those formulas. We will now show one example of this more general approach.

Linear substitution.
By this we mean any substitution of the form y = Ax + B, which leads to the equality dy = A⋅dx. Here we do not have to rely on the expression A⋅dx appearing in the integral, since we can always rearrange this equality to dx = (1/A)dy and then replace dx in the integral. This means that we can always do any linear substitution. We will show one fairly important example.

The integral is of course only valid for a > 0, otherwise the generalized exponential does not exist. Many people consider this integral an elementary one and memorize it.

Indirect substitution

is based on the following mathematical theorem. In the ideal case it can be expressed by this procedure:

It looks as if we followed the above direct substitution in the opposite direction and there is something to it. There we tried to simplify the integral by replacing a (complicated) expression with a single letter, here we make the integral more complicated, but the basic mechanism stays the same: We choose a transformation and then use it to change the integral, we also get an equality for changing the defferential int he same way, by taking derivative of g and using it as a coefficient. However, deep inside it is a different procedure, which can be seen for instance from the fact that here requirements on g are more strict (in particular, it must be invertible, unlike the situation in direct substitution).

There is also a difference in difficulty of various stages. The direct substitution tight at the start, especially with replacing the dx which often makes the chosen substitution impossible, but the back substitution is clear. On the other hand, the indirect substitution has no trouble with dx, from x = g(t) we immediately get x = g′(t)dt, but the back substitution can be quite an adventure, especially if we want the answer in a nice form.

Now why would anyone want to make an integral more complicated then it already is? Not surprisingly, the indirect substitution is used quite rarely, but there are specific types of integrals where it does help and experienced integrators know beforehand that certain indirect substitutions will in the end magically simplify (see integrals with roots in Methods Survey - Integration). We will show a very simple example of this kind.

Actually, this is an elementary integral that one should remember.

Note that even this simple example is already good enough to illustrate that the indirect substitution is not as simple as it looks. A careful reader should get mightily suspicious at the first integral in the second line: Shouldn't there be an absolute value in the denominator? That is a very good question and the answer depends heavily on how we actually do this indirect substitution. According to the theorem we are supposed to take a function g = sin(t) that goes from some interval J to our interval (−1,1). This interval J must be chosen in such a way that g is 1-1 there and the image of J under g must be the whole interval (−1,1). The most natural choice is the interval , so the new integral, after the substitution, is considered on this interval. There the cosine is always positive, so indeed, there is no need to use an absolute value in the denominator when cancelling the square root and the square power there. Note, however, that if we chose a different interval for J, then we might be forced to use the absolute value. As we see, this is another difference, in direct substitution we need not worry about intervals used in the procedure.

Mixed substitution

is a combination of direct and indirect substitution. We use it to shorten calculations when we have to use several substitutions in a row. Experienced integrators can save quite a lot of time this way. It starts with a transformation h(y) = g(x) and the second important equality is dediced using the usual idea, we differentiate on both sides and attach differentials: h′(y)dy = g′(x)dx. We present one example here that can be also solved by applying two simpler substitutions one after another (try it, it is a good practice):

Also here one has to be careful on which interval the new variable lives, in this particular example one has to decide between intervals [0,∞) and (−∞,0], since y² is supposed to be 1-1 on the working interval. It will not make any difference at the end, but it may strongly influence calculations that get us there, we will see some of it below.

Substitution in a definite integral

This is basically the same as using substitution in an indefinite integral. We state that the basic rule in substitution is to change everything from the language of x to the language of y, and that include also the limits. How do we transform them? As usual, using our basic transformation y = g(x). Example:

Similarly we change limits in the indirect and mixed substitution, but there it is somewhat more difficult. For instance, we had a substitution y² = x³ + 8 above. If it were a definite integral and one of its limits was x = 2, then we would substitute it into the transformation equation and then look for corresponding y. In this case there are two possibilities, plus and minus 4, the right one is determined by the choice of the interval J in the substitution.

For practical hints and examples see Substitution in Methods Survey - Methods of integration.

Next section: integration by parts
Back to Theory - Integration