In the applications of probability theory to real phenomena two results of the mathematical theory of probability play a conspicuous role. These results are known as the law of large numbers and central limit theorem. At this point in this book we have sufficient mathematical tools available to show how to apply these basic results. In Chapters 9 and 10 we develop the additional mathematical tools required to prove these theorems with a sufficient degree of generality.
A set of
be the sum of the observations. Their arithmetic mean
is called the sample mean .
By (4.1), (4.6), and (4.7), we obtain the following expressions for the mean, variance, and moment-generating function of
From (5.4) we obtain the striking fact that the variance of the sample mean
However, even more can be proved about the sample mean than that it tends to be equal to the mean. One can approximately evaluate, for any interval about the mean, the probability that the sample mean will have an observed value in that interval, since the sample mean is approximately normally distributed. More generally, it may be shown that if
In words, (5.5) may be expressed as follows: the sum of a large number of independent identically distributed random variables with finite means and variances, nomalized to have mean zero and variance 1, is approximately normally distributed . Equation (5.5) represents a rough statement of one of the most important theorems of probability theory. In 1920 G. Polya gave this theorem the name “the central limit theorem of probability theory”. This name continues to be used today, although a more apt description would be “the normal convergence theorem”. The central limit theorem was first proved by De Moivre in 1733 for the case in which
It may be of interest to outline the basic idea of the proof of (5.5), even though the mathematical tools are not at hand to justify the statements made. To prove (5.5) it suffices to prove that the moment-generating function
satisfies for
in which
where the remainder
where
It then follows that
where
From (5.11) and (5.12) one obtains (5.7). Our heuristic outline of the proof of (5.5) is now complete.
Given any random variable
The standardization
The central limit theorem of probability theory can now be formulated: The standardization
Example 5A . Reliability . Evaluation of the reliability of rockets is a problem of obvious importance in the space age. By the reliability of a rocket one means the probability
Solution
Let
If
Example 5B . Brownian motion and random walk . A particle (of diameter
in which
The result given by (5.15) is especially important; it states that the mean square displacement
Thus the mean-square displacement of a particle undergoing a random walk is proportional to the number of steps
in which
Exercises
5.1. Which of the following sets of evidence throws more doubt on the hypothesis that new born babies are as likely to be boys as girls: (i) of 10,000 new born babies, 5100 are male; (ii) of 1000 new born babies, 510 are male.
Answer
(i) throws more doubt than (ii).
5.2. The game of roulette is described in example 1D. Find the probability that the total amount of money lost by a gambling house on 100,000 bets made by the public on an odd outcome at roulette will be negative.
5.3. As an estimate of the unknown mean
Answer
62.
5.4. A man plays a game in which his probability of winning or losing a doliar is
(i) Find
(ii) Find approximately the probability that after 10,000 plays of the game the change in the man’s fortune will be between -50 and 50 dollars.
5.5. Consider a game of chance in which one may win 10 dollars or lose
Answer
25 or more.
5.6. A certain gambler’s daily income (in dollars) is a random variable
(i) Find approximately the probability that after 100 days of independent play he will have won more than 200 dollars.
(ii) Find the quantity
(iii) Determine the number of days the gambler can play in order to have a probability greater than
5.7. Add 100 real numbers, each of which is rounded off to the nearest integer. Assume that each rounding-off error is a random variable uniformly distributed between
Answer
5.8. If each strand in a rope has a breaking strength, with mean 20 pounds and standard deviation 2 pounds, and the breaking strength of a rope is the sum of the (independent) breaking strengths of all the strands, what is the probability that a rope made up of 64 strands will support a weight of (i) 1280 pounds, (ii) 1240 pounds.
5.9. A delivery truck carries loaded cartons of items. If the weight of each carton is a random variable, with mean 50 pounds and standard deviation 5 pounds, how many cartons can the truck carry so that the probability of the total load exceeding 1 ton will be less than
Answer
38.
5.10. Consider light bulbs, produced by a machine, whose life
(i) Find approximately the probability that a sample of 100 bulbs selected at random from the output of the machine will contain between 30 and 40 bulbs with a lifetime greater than 1020 hours.
(ii) Find approximately the probability that the sum of the lifetimes of 100 bulbs selected randomly from the output of the machine will be less than 110,000 hours.
5.11. The apparatus known as Galton’s quincunx is described in exercise 2.10 of Chapter 6. Assume that in passing from one row to the next the change
Answer
5.12. A man invests a total of
(i) If it is desired to hold the risk to a minimum, what amounts
(ii) What is the amount of risk that must be taken in order to achieve a portfolio whose mean return is equal to 400 dollars?
(iii) By means of Chebyshev’s inequality, find an interval, symmetric about 400 dollars, that, with probability greater than