2 Calculus
In this chapter, we will review/collect a large number of results that you should know and be familiar with from calculus. I’m not going to prove them or provide a bunch of details and explanations and graphs, so if anything strikes you as unfamiliar or you want more details, please consult your calculus textbook.
Calculus is important to statistics for lots of reasons, but I would like to point out three major ones before we begin the review.
Finding most likely solutions
A statistical analysis typically begins with some sort of model for how the data (which I’ll call ) depends on an unknown parameter (which I’ll call ). We observe the data, but what’s ? To estimate it, we typically create some function (which I’ll call ) that is large when is in agreement with the data we’ve seen. To find the “best” value of , we can take the derivative of to find the optimal value. Note that this is actually a partial derivative, since would be a function of both and .
Probability and density
For continuous quantities such as height, the distribution of likely values is specified in terms of a probability density . Calculating the probability from a probability density involves integration. For example, if we wanted to know the probability that a person’s height was between 63 and 66 inches:
Independent observations
Suppose we are interested in the probability of events . If those events are independent, this is given by
However, it is almost always easier to deal with this kind of quantity after taking the log:
To see why, go ahead and multiply a bunch of probabilities together and see how useful the result is to work with. The same trick can be used with dependent terms as well, although the results are messier.
It is hard to overstate how often you will do this. This isn’t some occasional trick – this is standard operating procedure, so it is critical that you know the properties of exponents and logs extremely well.