The fundamental empirical fact upon which are based all applications of the theory of probability is expressed in the empirical law of large numbers, first formulated by Poisson (in his book, Recherches sur le probabilité des jugements , 1837):
In many different fields, empirical phenomena appear to obey a certain general law, which can be called the Law of Large Numbers. This law states that the ratios of numbers derived from the observation of a very large number of similar events remain practically constant, provided that these events are governed partly by constant factors and partly by variable factors whose variations are irregular and do not cause a systematic change in a definite direction. Certain values of these relations are characteristic of each given kind of event. With the increase in length of the series of observations the ratios derived from such observations come nearer and nearer to these characteristic constants. They could be expected to reproduce them exactly if it were possible to make series of observations of an infinite length.
In the mathematical theory of probability one may prove a proposition, called the mathematical law of large numbers, that may be used to gain insight into the circumstances under which the empirical law of large numbers is expected to hold. For an interesting philosophical discussion of the relation between the empirical and the mathematical laws of large numbers and for the foregoing quotation from Poisson the reader should consult Richard von Mises, Probability, Statistics, and Truth , second revised edition, Macmillan, New York, 1957, pp. 104–134.
A sequence of jointly distributed random variables,
in some mode of convergence as
We consider first the case of independent random variables with finite means. We prove in section 3 that a sequence of independent identically distributed random variables obeys the weak law of large numbers if the common mean
In theoretical exercise 4.2 we indicate the proof of the law of large numbers for independent, not necessarily identically distributed, random variables with finite means: if, for some
then
Equation (2.2) is known as Markov’s condition for the validity of the weak law of large numbers for independent random variables.
In this section we consider the case of dependent random variables
between the
Let us examine the possible behavior of
If the random variables
More generally, let us consider random variables
It is clear that an orthogonal sequence of random variables (in which all the random variables have the same variance
We now show that under condition (2.5) a necessary and sufficient condition for the sample mean
Theorem 2A. A sequence of jointly distributed random variables
Proof
Since
To prove (2.9), we write the familiar formula
from which (2.9) follows by dividing through by
To see (2.11), note that for any
Letting first
If it is known that
Theorem 2B. A sequence of jointly distributed random variables
Remark: For a stationary sequence of random variables [in which case
Proof
If (2.13) holds, then (assuming, as we may, that
By (2.15) and (2.9), it follows that for some constant
Choose now any integer
By (2.16), the sequence
If we sum (2.18) over all
Therefore, by theorem 1A, it follows that
We have thus shown that a properly selected subsequence
We claim it is clear, in view of (2.20), that to show that
In view of theorem 1A, to show that (2.22) holds, it suffices to show that
We prove that (2.23) holds by showing that for some constants
To prove (2.24), we note that
from which it follows that
in which we use
Consequently, (2.26) implies the first part of (2.24). Similarly,
from which one may infer the second part of (2.24). The.proof of theorem 2B is now complete.
Exercises
2.1. Random digits. Consider a discrete random variable
2.2. The distribution of digits in the decimal expansion of a random number. Let
Prove that the random variables
2.3. Convergence of the sample distribution function and the sample characteristic function of dependent random variables . Let
Show that
Show that
Prove that (2.30) and (2.31) hold if the random variables
2.4. The law of large numbers does not hold for Cauchy distributed random variables. Let
2.5. Let
Consequently, conclude that
2.6. A probabilistic proof of Weierstrass’ theorem: Extend (2.34) to show that to any continuous function