In probability theory, two random variables A and B are considered independent if their joint probability function (or joint probability distribution) is a product of their individual probability functions (or probability distributions). Mathematically, this can be expressed as:
\( Pr(A \cap B) = Pr(A) \times Pr(B) \)
We start by investigating sums of random variables that are not only independent, but also identically distributed. This means that they will have the same values for their means and standard deviations.
Consider, for example, the numbers observed when two six-sided dice are rolled. Let \( X_1 \) be the number observed when the first die is rolled, and \( X_2 \) be the number observed when the second die is rolled. The two random variables \( X_1 \) and \( X_2 \) are independent and have identical distributions.
What can we say about the distribution of \( X_1 + X_2 \)?
Since the rolling of these two dice can be considered as independent events, we can find probabilities associated with the sum by multiplying probabilities associated with each individual random variable. For example:
Pr(\( X_1 + X_2 = 2 \)) = Pr(\( X_1 = 1, X_2 = 1 \))
= Pr(\( X_1 = 1 \)) × Pr(\( X_2 = 1 \))
= \( \frac{1}{6} \) × \( \frac{1}{6} \) = \( \frac{1}{36} \)
Let \( X \) be a random variable with mean \( \mu \) and variance \( \sigma^2 \). Then if \( X_1 \) and \( X_2 \) are independent random variables with identical distributions to \( X \), we have:
\( E(X_1 + X_2) = E(X_1) + E(X_2) = 2\mu \)
\( \text{Var}(X_1 + X_2) = \text{Var}(X_1) + \text{Var}(X_2) = 2\sigma^2 \)
\( \text{sd}(X_1 + X_2) = \sqrt{\text{Var}(X_1 + X_2)} = \sqrt{2\sigma^2} \)
Note: Since \( \text{sd}(X_1) + \text{sd}(X_2) = 2\sigma \), we see that \( \text{sd}(X_1 + X_2) > \text{sd}(X_1) + \text{sd}(X_2) \) for \( \sigma > 0 \).
We can easily determine that \( E(X_1) = 1 \) and \( E(X_2) = \frac{1}{2} \), and we know that \( E(2X_1 + 3X_2) = \frac{7}{2} \).
Thus, we have:
\( E(2X_1 + 3X_2) = 2 E(X_1) + 3 E(X_2) \)
We can calculate \( \text{Var}(X_1) = \frac{2}{3} \) and \( \text{Var}(X_2) = \frac{1}{4} \), and we know that \( \text{Var}(2X_1 + 3X_2) = \frac{59}{12} \).
Thus, we have:
\( \text{Var}(2X_1 + 3X_2) = 2^2 \text{Var}(X_1) + 3^2 \text{Var}(X_2) \)
Let \( X \) be a random variable with mean \( \mu \) and variance \( \sigma^2 \). Then if \( X_1, X_2, \ldots, X_n \) are independent random variables with identical distributions to \( X \), we have:
\( E(X_1 + X_2 + \ldots + X_n) = E(X_1) + E(X_2) + \ldots + E(X_n) = n\mu \)
\( \text{Var}(X_1 + X_2 + \ldots + X_n) = \text{Var}(X_1) + \text{Var}(X_2) + \ldots + \text{Var}(X_n) = n\sigma^2 \)
\( \text{sd}(X_1 + X_2 + \ldots + X_n) = \sqrt{\text{Var}(X_1 + X_2 + \ldots + X_n)} = \sqrt{n\sigma^2} \)
Note: The result for the expected value holds even if the random variables \( X_1, X_2, \ldots, X_n \) are not independent.
Let \( X_1, X_2, \ldots, X_n \) be independent random variables with means \( \mu_1, \mu_2, \ldots, \mu_n \) and variances \( \sigma_{1}^2, \sigma_{2}^2, \ldots, \sigma_{n}^2 \) respectively. Then if \( a_1, a_2, \ldots, a_n \) are constants, we have:
\( E(a_1X_1 + a_2X_2 + \ldots + a_nX_n) = a_1 E(X_1) + a_2 E(X_2) + \ldots + a_n E(X_n) \)
\( = a_1\mu_1 + a_2\mu_2 + \ldots + a_n\mu_n \)
\( \text{Var}(a_1X_1 + a_2X_2 + \ldots + a_nX_n) = a_1^2 \text{Var}(X_1) + a_2^2 \text{Var}(X_2) + \ldots + a_n^2 \text{Var}(X_n) \)
\( = a_1^2 \sigma_{1}^2 + a_2^2 \sigma_{2}^2 + \ldots + a_n^2 \sigma_{n}^2 \)
\( \text{sd}(a_1X_1 + a_2X_2 + \ldots + a_nX_n) = \sqrt{a_1^2 \sigma_{1}^2 + a_2^2 \sigma_{2}^2 + \ldots + a_n^2 \sigma_{n}^2} \)
Note: The result for the expected value holds even if the random variables \( X_1, X_2, \ldots, X_n \) are not independent.
Let \( X_1, X_2, \ldots, X_n \) be independent normal random variables, and let \( a_1, a_2, \ldots, a_n \) be constants. Then the random variable \( a_1X_1 + a_2X_2 + \ldots + a_nX_n \) is also normally distributed.
The time taken to prepare a house for painting is known to be normally distributed with a mean of 10 hours and a standard deviation of 4 hours. The time taken to paint the house is independent of the preparation time and is normally distributed with a mean of 20 hours and a standard deviation of 3 hours.
We are asked to find the probability that the total time taken to prepare and paint the house is more than 35 hours.
Let \( X \) represent the time taken to prepare the house, and \( Y \) the time taken to paint the house. Since \( X \) and \( Y \) are independent normal random variables, the distribution of \( X + Y \) is also normal, with:
Therefore, we have:
\( Pr(X + Y > 35) = Pr\left(Z > \frac{35 - 30}{5}\right) = Pr(Z > 1) = 0.1587 \)