In statistics, confidence intervals play a crucial role in estimating population parameters based on sample data.
When it comes to estimating the population mean, confidence intervals provide a range of values within which the true population mean is likely to fall. This statistical tool offers insights into the precision and reliability of our estimates, allowing us to make informed decisions and draw meaningful conclusions.
A confidence interval for the population mean is a range of values calculated from sample data that is likely to contain the true population mean with a certain level of confidence. This level of confidence, often denoted by \(1 - \alpha\), where \(\alpha\) represents the significance level, reflects the probability that the interval contains the true population parameter. Commonly used confidence levels are 90%, 95%, and 99%.
Suppose, for example, we are interested in the mean IQ score of all Year 12 mathematics students in Australia. The value of the population mean \( \mu \) is unknown. Collecting information about the whole population is not feasible, and so a random sample must suffice.
What information can be obtained from a single sample? Certainly, the sample mean \( \bar{x} \) gives some indication of the value of the population mean \( \mu \), and can be used when we have no other information.
The value of the sample mean \( \bar{x} \) can be used to estimate the population mean \( \mu \). Since this is a single-valued estimate, it is called a point estimate of \( \mu \).
Thus, if we select a random sample of 100 Year 12 mathematics students and find that their mean IQ is 108.6, then the value \( \bar{x} = 108.6 \) serves as an estimate of the population mean \( \mu \).
The value of the sample mean \( \bar{x} \) obtained from a single sample is going to change from sample to sample, and while sometimes the value will be close to the population mean \( \mu \), at other times it will not. To use a single value to estimate \( \mu \) can be rather risky. What is required is an interval that we are reasonably sure contains the parameter value \( \mu \).
An interval estimate for the population mean \( \mu \) is called a confidence interval for \( \mu \).
An approximate 95% confidence interval for \( \mu \) is given by
\( \left[\bar{x} - 1.9600 \frac{\sigma}{\sqrt{n}}, \bar{x} + 1.9600 \frac{\sigma}{\sqrt{n}}\right] \)
where:
Let \( X \) be any random variable, with mean \( \mu \) and standard deviation \( \sigma \). Then, provided that the sample size \( n \) is large enough, the distribution of the sample mean \( \bar{X} \) is approximately normal with mean \( E(\bar{X}) = \mu \) and standard deviation \( \text{sd}(\bar{X}) = \frac{\sigma}{\sqrt{n}} \).
Central limit theorem tells us that, whatever the underlying distribution of the random variable \( X \), if the sample size \( n \) is large, then the sampling distribution of \( \bar{X} \) is approximately normal with \( E(\bar{X}) = \mu \) and \( \text{sd}(\bar{X}) = \frac{\sigma}{\sqrt{n}} \).
For the standard normal random variable \( Z \), we have \( \text{Pr}(-1.9600 < Z < 1.9600) = 0.95 \).
So we can state that, for large \( n \):
\( \text{Pr}\left(-1.9600 < \frac{\bar{X} - \mu}{\frac{\sigma}{\sqrt{n}}} < 1.9600\right) \approx 0.95 \)
Multiplying through gives:
\( \text{Pr}\left(-1.9600 \frac{\sigma}{\sqrt{n}} < \bar{X} - \mu < 1.9600 \frac{\sigma}{\sqrt{n}}\right) \approx 0.95 \)
Further simplifying, we obtain:
\( \text{Pr}\left(\bar{X} - 1.9600 \frac{\sigma}{\sqrt{n}} < \mu < \bar{X} + 1.9600 \frac{\sigma}{\sqrt{n}}\right) \approx 0.95 \)
This final expression gives us an interval which, with 95% probability, will contain the value of the population mean \( \mu \) (which we do not know).
Note:Often when determining a confidence interval for the population mean, the population standard deviation \( \sigma \) is unknown. If the sample size is large (say \( n \geq 30 \)), then we can use the sample standard deviation \( s \) in this formula as an approximation to the population standard deviation \( \sigma \).
We can find an approximate confidence interval with a level of confidence other than 95% by using the same principles. For example, since we know that \( \text{Pr}(-1.6449 < Z < 1.6449) = 0.90 \), an approximate 90% confidence interval for \( \mu \) is given by
\( \left[\bar{x} - 1.6449 \frac{\sigma}{\sqrt{n}}, \bar{x} + 1.6449 \frac{\sigma}{\sqrt{n}}\right] \)
We can generalize these two examples as follows:
An approximate \( C\% \) confidence interval for \( \mu \) is given by
\( \left[\bar{x} - z \frac{\sigma}{\sqrt{n}}, \bar{x} + z \frac{\sigma}{\sqrt{n}}\right] \)
where:
The values of \( z \) (to four decimal places) for commonly used confidence intervals are:
The 95% confidence interval found in Example 1 should not be interpreted as meaning that \( \text{Pr}(105.66 < \mu < 111.54) = 0.95 \). Since \( \mu \) is a constant, the value either does or does not lie in the stated interval.
The correct interpretation of a 95% confidence interval is that we expect approximately 95% of such intervals to contain the population mean \( \mu \). Whether or not the particular confidence interval obtained contains the population mean \( \mu \) is generally not known.
If we were to repeat the process of taking a sample and calculating a confidence interval many times, the result would be something like that indicated in the diagram.
The diagram shows the confidence intervals obtained when 20 different samples were drawn from the same population.
The value of the population mean \( \mu \) is indicated by the vertical line, and it is, of course, constant.
It is quite easy to see from the diagram that none of the values of the sample estimate is exactly the same as the population mean, but that all the intervals except one (19 out of 20, or 95%) have captured the value of the population mean, as would be expected in the case of a 95% confidence interval.
We saw in Example 2 that increasing the level of confidence increases the width of the confidence interval. The width of a confidence interval is important, as for a confidence interval to be useful it should not be too wide. The distance between the sample mean and the endpoints of a confidence interval is called the margin of error. The smaller the margin of error, the better the estimate of the population mean.
Since the width of the confidence interval is inversely proportional to the square root of the sample size, it makes sense that a better way to decrease the width of the confidence interval is to increase the sample size.