Suppose you take several samples each of size n from the population and for each you calculate
± 1.96
then, on average, 95% of the intervals will contain the true but unknown value µ and 5% will not.
If you plotted the intervals vertically they might look like this
Note: The intervals vary from sample to sample.
On average 95% of "95% confidence intervals" will contain µ.
provides
a point estimate (i.e. single value approximation) for µ
e.g. In the example about women's heights,
= 165cms
so µ is approximately 165cms.
e.g. with 95% confidence the population mean height µ for these women is in the interval (164, 166) cms
i.e. (164, 166) cms.
Suppose the sample size had been n=40 but the mean and standard
deviation were still
= 165
and s = 5. Then a 95% confidence interval for
µ is
= 165 ± 1.55
which gives (163.5, 166.5).
Notice that increasing the sample size increases the precision of
the estimate by
not
.
e.g. width of 95% confidence interval
+ 1.96
) - (
- 1.96
)
So if n = 100, width =
= 0.392 s
or if n = 25, width =
=
0.784 s.
If you increase the sample size by 4 you decrease the width of the
confidence interval by ½. Precision of the estimate depends on
the term
in the standard
error SE
=
A sample of 50 bolts has a mean length of
= 14.85 cms. Does this suggest that the average length of all
bolts is not 15 cms?
Sampling distribution of sample mean is
In this case s = 0.3, n = 50,
= 14.85 and we want to know if
µ = 15 is plausible.
i.e. (14.77, 14.93)cms
Interpretation - with 95% confidence the interval (14.77, 14.93) contains the population mean µ of all bolts produced by the process. As the interval does not contain 15.0, the data are not consistent with the hypothesis that µ = 15. That is, the data do not support the hypothesis that the average length of all bolts is 15cms.
For the above example a 99% confidence interval is
i.e. (14.74, 14.96)
e.g. ZINTERVAL 95 0.3 C1
here 95 = required confidence level (as a percentage)
The second approach is to test the hypothesis (i.e. µ = 15cm) more directly as follows
as far away as or further from the assumed
population mean as was observed (i.e.
=
14.85)
The shaded areas represent
values as far away
as or further from µ = 15 as the
observed value
= 14.85
This is called the "p-value". In this case
p-value = P(
14.85 or
15.15)
If ( ~ N(15,
) then Z =
Hence P(
14.85 or 
15.15).
=
= P(Z < - 3.5 or Z > 3.5) < 0.001 from tables
MINITAB can be used to calculate the p-value for this example as follows
e.g. in the above example µ = 15 so that
~ N(15,
), taking s = 0.3
= 14.85)
= P(obtaining a value which is as extreme or more extreme than the calculated statistic if the hypothesis is true)
e.g. in the above example
p-value = Pr( 
14.85 or 
15.15)
| ... Previous page | Next page ... |