[A, SfS] Chapter 5: Confidence Intervals: 5.5: CI for proportion
Confidence Interval for a Population Proportion
Confidence Interval for a Population Proportion
In this lesson, you will learn how to estimate the proportion of a population having a specified characteristic.
Now suppose is a binary variable measured on a population in which an unknown proportion of the population meets some condition of interest, with if the subject meets that condition and otherwise. Thus:
Let denote planned measurements of on a random sample of size from the population. Then:
Let . We saw previously that:
Moreover, we learned previously that when is large, the Central Limit Theorem implies that has an approximate
Unfortunately, we cannot use this distribution to make a CI for , because the variance of depends on the unknown value of .
But now consider .
It has been found (Agresti and Coull, 1998) that when is large, the distribution of is well-approximated by the
Confidence Interval for a Population Proportion
Suppose .
Let denote planned measurements of on a random sample of size from the population. Then:
Furthermore, let:
When is large, an approximate confidence interval for the population proportion is:
If the lower limit is negative, we replace it with , and if the upper limit is larger than , we replace it with , since must fall into the interval .
A study in 2008 investigated the use of nicotine patches among HIV-positive smokers. They surveyed a random sample of HIV-positive smokers and found that reported that they used a nicotine patch.
Thus:
The margin of error of a CI for the proportion of all HIV-positive smokers who use a nicotine patch is then:
(Here we recalled that .)
So the CI for is then:
If you are planning a study and you prefer a CI for to have a margin of error no larger than some while maintaining the same confidence level , you must determine the minimum sample size necessary to achieve this goal.
However, the variance of depends on , which is unknown to you before data are collected. But this variance is maximized when is maximized, that is, when .
Controlling the Margin of Error
To guarantee that the margin of error of a confidence interval for a population proportion will not exceed and ensure at least confidence that your CI includes , you can assume .
That is
requires that
You would then round up to the nearest integer.
Continuing the example above, if we want the margin of error of the confidence interval to be no larger than , then we would need to survey a random sample of at least:
One-sided Confidence Intervals for a Population Proportion
It is also possible to construct one-sided confidence intervals for a population proportion.
A lower CI for would be:
where is the quantile of the distribution such that when .
An upper CI for would be:
Or visit omptest.org if jou are taking an OMPT exam.