Elementary Statistics Ch.9
Elementary Statistics Ch.9
Chi-Square Tests
and the F-
Distribution
§ 9.1
Goodness of Fit
Multinomial Experiments
A multinomial experiment is a probability experiment
consisting of a fixed number of trials in which there are more than
two possible outcomes for each independent trial. (Unlike the
binomial experiment in which there were only two possible
outcomes.)
Example:
A researcher claims that the distribution of favorite pizza
toppings among teenagers is as shown below.
Topping Frequency, f
Each outcome is Cheese 41% The probability for
classified into Pepperoni 25% each possible
categories. Sausage 15% outcome is fixed.
Mushrooms 10%
Onions 9%
Chi-Square Goodness-of-Fit
Test
A Chi-Square Goodness-of-Fit Test is used to test whether
a frequency distribution fits an expected distribution.
To calculate the test statistic for the chi-square goodness-of-
fit test, the observed frequencies and the expected
frequencies are used.
The observed frequency O of a category is the frequency for the category observed in the sample data.
The expected frequency E of a category is the calculated frequency for the category. Expected frequencies are obtained assuming the specified (or
hypothesized) distribution. The expected frequency for the ith category is
Ei = npi
where n is the number of trials (the sample size) and pi is the assumed probability of the ith category.
Observed and Expected
Frequencies
Example:
200 teenagers are randomly selected and asked what their favorite
pizza topping is. The results are shown below.
Find the observed frequencies and the expected
frequencies.
Continued.
Expected Frequency
Example continued:
Age
Gender 16 – 20 21 – 30 31 – 40 41 – 50 51 – 60 61 and Total
older
Male 32 51 52 43 28 10 216
Female 13 22 33 21 10 6 105
Total 45 73 85 64 38 16 321
F
1 2 3 4
Critical Values for the F-
Distribution
Finding Critical Values for the F-Distribution
1. Specify the level of significance .
2. Determine the degrees of freedom for the numerator,
d.f.N.
3. Determine the degrees of freedom for the denominator,
d.f.D.
4. Use Table 7 in Appendix B to find the critical value. If
the hypothesis test is 1
b. two-tailed, use the 2 F-table.
a. one-tailed, use the F-table.
Critical Values for the F-
Distribution
Example:
Find the critical F-value for a right-tailed test when
= 0.05, d.f.N = 5 and d.f.D = 28.
Appendix B: Table 7: F-Distribution
d.f.D: = 0.05
Degrees of d.f.N: Degrees of freedom, numerator
freedom,
denominator
1 2 3 4 5 6
1 161.4 199.5 215.7 224.6 230.2 234.0
27
2 4.21 19.00
18.51 3.35 2.96
19.16 2.73
19.25 2.57
19.30 2.46
19.33
28 4.20 3.34 2.95 2.71 2.56 2.45
29 4.18 3.33 2.93 2.70 2.55 2.43
1 2 3 4 5 6
1 161.4 199.5 215.7 224.6 230.2 234.0
2 18.51 19.00 19.16 19.25 19.30 19.33
3 10.13 9.55 9.28 9.12 9.01 8.94
4 7.71 6.94 6.59 6.39 6.26 6.16
5 6.61 5.79 5.41 5.19 5.05 4.95
The critical
6 value is F05.14
5.99 = 4.53.
4.76 4.53 4.39 4.28
7 5.59 4.74 4.35 4.12 3.97 3.87
Two-Sample F-Test for
Variances
Two-Sample F-Test for Variances
A two-sample F-test is used to compare two
population σ12variances
and σ22 when a sample is
randomly selected from each population. The
populations must be independent and normally
distributed.
s12 The test statistic is
F 2
s2
s12 and s22
s12 s22.
where represent the sample variances with
The degrees of freedom for the numerator is
d.f.N = n1 – 1 and the degrees of freedom for the
2
s
denominator is d.f.D = n12 – 1, where n1 is the size of
2
the sample having s2. variance and n2 is the size of
the sample having variance
Two-Sample F-Test for
Variances
σ12 and σ22
Using a Two-Sample F-Test to Compare
In Words In Symbols
1. Identify the claim. State the State H0 and Ha.
null and alternative
hypotheses.
Identify .
2. Specify the level of
significance. d.f.N = n1 – 1
d.f.D = n2 – 1
3. Identify the degrees of
freedom. Use Table 7 in
Appendix B.
Continued.
4. Determine the critical value.
Two-Sample F-Test for
Variances
σ12 and σ22
Using a Two-Sample F-Test to Compare
In Words In Symbols
5. Determine the rejection
region. s12
F 2
6. Calculate the test statistic. s2
If F is in the
7. Make a decision to reject or rejection region,
fail to reject the null reject H0.
hypothesis. Otherwise, fail to
reject H0.
1. The variance between samples MSB measures the differences related to the treatment given to
each sample and is sometimes called the mean square between.
2. The variance within samples MSW measures the differences related to entries within the same
sample. This variance, sometimes called the mean square within, is usually due to sampling
error.
One-Way ANOVA
One-Way Analysis of Variance Test
If the conditions listed are satisfied, then the
sampling distribution for the test is approximated
by the F-distribution. The test statistic is
MS B
F .
MSW
If F is in the
7. Make a decision to reject or rejection region,
fail to reject the null reject H0.
hypothesis. Otherwise, fail to
reject H0.
Sum of Degrees
Mean
Variation square of F
squares
s freedom
SS B
Between SSB d.f.N MS B MS B MSW
d.f.N
SSW
Within SSW d.f.D MSW
d.f.D
Performing a One-Way ANOVA
Test
Example:
The following table shows the salaries of randomly
selected individuals from four large metropolitan
areas. At = 0.05, can you conclude that the mean
salary is different in at least one of the areas?
(Adapted from US Bureau of Economic Analysis)
d.f.D = N – k = 20 – 4 = 16