0% found this document useful (0 votes)
14 views

Lecture 4-5 (1)

Uploaded by

tahsinefeedu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

Lecture 4-5 (1)

Uploaded by

tahsinefeedu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 79

Chapter 8

Fundamental
Sampling
Distributions and
Data
Descriptions

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Section 8.1
Random
Sampling

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Definition 8.1 & 8.2

• A Population is the set of all items or individuals


of interest
– Examples: All likely voters in the next election
All parts produced today
All sales receipts for November

• A Sample is a subset of the population


– Examples: 1000 voters selected at random for interview
A few parts selected for destructive
testing
Random receipts selected for audit

Copyright © 2020 Pearson Education Ltd. All Rights Reserved.


Definition 8.3

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8-4


Section 8.2
Some Important
Statistics

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Definition 8.4

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8-6


In Chapter 1 we introduced several measures of sample variability,
including
the sample variance, sample standard deviation, and sample
range. In
this chapter, we will focus mainly on the sample variance.

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8-7


Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8-8
Theorem 8.1

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8-9


Proof

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 10


Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 11
Section 8.3
Sampling
Distributions

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Sampling Distributions

• A sampling distribution is a probability


distribution of all of the possible values of a
statistic for a given size sample selected
from a population

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 13


Developing a
Sampling Distribution

• Assume there is a population …


D
A B C
• Population size N=4
• Random variable, X,
is age of individuals
• Values of X:
18, 20, 22, 24 (years)
Developing a
Sampling Distribution
(continued)
Summary Measures for the Population Distribution:

μ
 X i P(x)
N
.25
18  20  22  24
 21
4
0

σ
 (X  μ)
i
2

2.236
18
A B
20
C
22
D
24 x
N
Uniform Distribution
Developing a
Sampling Distribution
Now consider all possible samples of size n = 2
1st 2nd Observation 16 Sample
Obs 18 20 22 24 Means
18 18,18 18,20 18,22 18,24
1st 2nd Observation
20 20,18 20,20 20,22 20,24 Obs 18 20 22 24
22 22,18 22,20 22,22 22,24 18 18 19 20 21
24 24,18 24,20 24,22 24,24 20 19 20 21 22
16 possible 22 20 21 22 23
samples (sampling
with replacement)
24 21 22 23 24
Developing a
Sampling Distribution
Sampling Distribution of All Sample
Means
16 Sample Means Sample Means
Distribution
1st 2nd Observation _
Obs 18 20 22 24 P(X)
.3
18 18 19 20 21
.2
20 19 20 21 22
.1
22 20 21 22 23
0 _
24 21 22 23 24 18 19 20 21 22 23 24 X
(no longer uniform)
Section 8.4
Sampling
Distribution of
Means and the
Central Limit
Theorem

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Sample Mean

• Let X1, X2, . . . Xn represent a random sample from a


population

• The sample mean value of these observations is defined


as

1 n
X   Xi
n i1
Standard Error of the Mean

• Different samples of the same size from the same population


will yield different sample means
• A measure of the variability in the mean from sample to
sample is given by the Standard Error of the Mean:

σ
σX 
n
• Note that the standard error of the mean decreases as the
sample size increases
Developing a
Sampling Distribution (continued)

Summary Measures of this Sampling Distribution:

E(X) 
 X i

18  19  21   24
21 μ
N 16

σX 
 ( X i  μ) 2

N
(18 - 21) 2  (19 - 21) 2    (24 - 21) 2
 1.58
16
Comparing the Population
with its Sampling Distribution
Population Sample Means
N=4 Distribution
μ 21 σ 2.236 n=2
_ μX 21 σ X 1.58
P(X) P(X)
.3 .3

.2 .2

.1 .1

0
18 20 22 24 X
0
18 19 20 21 22 23 24
_
X
A B C D
If the Population is Normal

• If a population is normal with mean μ and


standard deviation σ, the sampling distribution
of X is also normally distributed with
σ
μ X μ σX 
and
n
Z-value for Sampling Distribution
of the Mean

• Z-value for the sampling distribution of X

(X  μ) (X  μ)
Z 
σX σ
n
Sampling Distribution Properties

Normal Population

E[X] μ Distribution

μ x
(i.e. x is unbiased ) Normal Sampling
Distribution

(both distributions have the same mean)

μx
x
Copyright © 2020 Pearson Education Ltd. All Rights Reserved.
Sampling Distribution Properties
(continued)

σ Normal Population
σx  Distribution

n
μ x
(i.e. x is unbiased ) Normal Sampling
Distribution

(the distribution of x has a reduced standard


deviation
μx
x
Copyright © 2020 Pearson Education Ltd. All Rights Reserved.
Sampling Distribution Properties
(continued)

As n increases, Larger
sample size
σ x decreases

Smaller
sample size

μ x
Copyright © 2020 Pearson Education Ltd. All Rights Reserved.
Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 28
Figure 8.2 Area for Example 8.4

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 29


Recall Theorem 7.11

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 7 - 30


Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 31
Theorem 8.2

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 32


Central Limit Theorem

the sampling
As the n↑
distribution
sample
becomes
size gets
almost
large
normal
enough…
regardless of
shape of
population

x
Copyright © 2020 Pearson Education Ltd. All Rights Reserved.
If the Population is not Normal
(continued)

Population Distribution
Sampling distribution
properties:

Central Tendency

μ x μ μ x
Variation Sampling Distribution
σ (becomes
increases)normal as n
σx  Larger
n Smaller
sample size
sample
size

Copyright © 2020 Pearson Education Ltd. All Rights Reserved. μx x


How Large is Large Enough?

• For most distributions, n > 30 will give a


sampling distribution that is nearly normal
• For normal population distributions, the
sampling distribution of the mean is always
normally distributed

Copyright © 2020 Pearson Education Ltd. All Rights Reserved.


Figure 8.1 Illustration of the Central
_
Limit Theorem (distribution of X for
n = 1, moderate n, and large n)

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 36


If the Population is not Normal

• We can apply the Central Limit Theorem:

– Even if the population is not normal, sample

means from the population will be approximately

normal as long as the sample size is large enough.


– Properties of the sampling distribution:
σ
μ x μ σx 
and n
Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 38
Figure 8.4 Area for Example 8.5

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 39


Exercise 8.23
The random variable X, representing the number of cherries in a
cherry puff, has the following probability distribution:

a) Find the mean μ and the variance σ2 of X.


b) Find the mean and the variance of the sample mean for random
samples of 36 cherry puffs.
c) Find the probability that the average number of cherries in 36
cherry puffs will be less than 5.5.

8 - 40
Exercise 8.24
If a certain machine makes electrical resistors having a mean
resistance of 40 ohms and a standard deviation of 2 ohms, what is
the probability that a random sample of 36 of these resistors will
have a combined resistance of more than 1458 ohms?

8 - 41
Exercise 8.26
The amount of time that a vehicle spends in a petrol bunk is a random
variable with the mean μ = 4.5 minutes and a standard deviation σ = 1.8
minutes. If a random sample of 24 vehicles is observed, find the
probability that its mean time at the petrol bunk is
(a) at most 3.6 minutes
(b) more than 5.5 minutes
(c) at least 3.2 minutes but less than 4 minutes.

8 - 42
Sampling Distribution of the
Difference of the Two Means

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 43


Theorem 8.3

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 45
Figure 8.6 Area for Example 8.6

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 46


Example: Aircraft Engine Life

The effective life of a component used


in jet-turbine aircraft engine is a
random variable with mean 5000 and Figure 7-6 The sampling distribution
SD 40 hours and is close to a normal of X2 − X1
distribution. The engine manufacturer
Process
introduces an improvement into the
Old (1) New (2) Diff (2-1)
Manufacturing process for this
x -bar = 5,000 5,050 50
component that changes the
s= 40 30
parameters to 5050 and 30. Random
n= 16 25
samples of size 16 and 25 are
Calculations
selected.
s / √n = 10 6 11.7
z= -2.14
What is the probability that the
P(xbar2-xbar1 > 25) = P(Z > z) = 0.9840
difference in the two sample means is
= 1 - NORMSDIST(z)
at least 25 hours?
Sampling
Distribution of
Proportions

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Sampling Distributions of
Sample Proportions

P = the proportion of the population having


some characteristic
• Sample proportion (p̂) provides an estimate
of P:
X number of items in the sample having the characteristic of interest
pˆ  
n sample size

• 0 ≤ p̂ ≤ 1
• p̂ has a binomial distribution, but can be approximated by a
normal distribution when nP(1 – P) > 5
Sampling Distribution of p ^

• Normal approximation:
Sampling
P(Pˆ ) Distribution
.3
.2
.1
0
0 .2 .4 .6 8 1 P̂

Properties:  X  P(1 P)
E(pˆ ) P
2
σ Var   

and n n
(where P = population proportion)
Z-Value for Proportions

Standardizep̂ to a Z value with the


formula:
pˆ  P pˆ  P
Z 
σ pˆ P(1 P)
n
Example

• If the true proportion of voters who support


Proposition A is P = .4, what is the
probability that a sample of size 200 yields a
sample proportion between .40 and .45?
 i.e.: if P = .4 and n = 200, what is
P(.40 ≤p̂ ≤ .45) ?
Example
(continued)

• if P = .4 and n = 200, what is


P(.40 ≤p̂ ≤ .45) ?
P(1  P)
σp̂ 
n
Find σ pˆ:

ˆ .45)
P(.40 p
Convert to
standard
normal:
Example
(continued)

• if P = .4 and n = 200, what is


P(.40 ≤p̂ ≤ .45) ?
Use standard normal table: P(0 ≤ Z ≤ 1.44) =

Sampling Standardized
Distribution Normal Distribution

Standardize

.40 .45 p̂ 0 1.44


Z
Example

According to the US Census Bureau's American Community


Survey, 87%, percent of Americans over the age of 25 have earned a
high school diploma. Suppose we are going to take a random sample
of 200 Americans in this age group and calculate what proportion of
the sample has a high school diploma.
What is the probability that the proportion of people in the
sample with a high school diploma is less than 85 percent?
10.3
Two Population Proportions

Goal: For the difference between


Population
proportions
two population proportions,
Px – Py

Assumptions:
Both sample sizes are large,

nP(1 – P) > 5
Two Population Proportions
(continued)

Population • The random variable


proportions

(pˆ x  pˆ y )  (p x  p y )
Z
pˆ x (1 pˆ x ) pˆ y (1 pˆ y )

nx ny

is approximately normally
distributed
Test Statistic for
Two Population Proportions

Population
proportions

z
pˆ x  pˆ y 
pˆ 0 (1 pˆ 0 ) pˆ 0 (1 pˆ 0 )

nx ny

n xpˆ x  n ypˆ y
Where pˆ 0 
nx  ny
Example:
Two Population Proportions
What is the probability that there is a significant difference
between the proportion of men and the proportion of women
who will vote Yes on Proposition A?

• In a random sample, 36 of 72 men and 31 of 50


women indicated they would vote Yes
Section 8.5
Sampling
Distribution of S2

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Theorem 8.4

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 61


Figure 8.7 The chi-squared
distribution

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 62


The Chi-square Distribution

• The chi-square distribution is a family of distributions,


depending on degrees of freedom:
• d.f. = n – 1

0 4 8 12 16 20 24 28 2 0 4 8 12 16 20 24 28 2 0 4 8 12 16 20 24 28 2
d.f. = 1 d.f. = 5 d.f. = 15

Copyright © 2020 Pearson Education Ltd. All Rights Reserved.


Degrees of Freedom (df)
Idea: Number of observations that are free to vary
after sample mean has been calculated

Example: Suppose the mean of 3 numbers is 8.0

If the mean of these three values is


Let X1 = 7 8.0,
Let X2 = 8 then X3 must be 9
(i.e., X3 is not free to vary)
What is X3?
Here, n = 3, so degrees of freedom = n – 1 = 3 – 1 = 2
(2 values can be any numbers, but the third is not free to
vary for a given mean)
Copyright © 2020 Pearson Education Ltd. All Rights Reserved.
Chi-square Example

• A commercial freezer must hold a selected temperature


with little variation. Specifications call for a standard
deviation of no more than 4 degrees (a variance of 16
degrees2).
 A sample of 14 freezers is to be tested
 What is the upper limit (K) for the sample
variance such that the probability of exceeding
this limit, given that the population standard
deviation is 4, is less than 0.05?

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


Finding the Chi-square Value

2
(n  1)s Is chi-square distributed with (n – 1) = 13 degrees
χ2  of freedom
σ2

• Use the the chi-square distribution with area 0.05 in the


upper tail:

213 = 22.36 (α = .05 and 14 – 1 = 13 d.f.)

probability
α = .05

2

213 = 22.36
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Chi-square Example
(continued)
213 = 22.36 (α = .05 and 14 – 1 = 13 d.f.)

 (n  1)s2 2 
So: 2

P(s  K) P  χ13  0.05
 16 
(n  1)K
or 22.36 (where n = 14)
16

(22.36)(16)
so K 27.52
(14  1)

If s2 from the sample of size n = 14 is greater than 27.52,


there is strong evidence to suggest the population variance
exceeds 16.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall
Example

• A manufacturer of car batteries guarantees that the


batteries will last, on average, 3 years with a standard
deviation of 1 year. If five of these batteries have
lifetimes of 1.9, 2.4, 3.0, 3.5, and 4.2 years, should the
manufacturer still be convinced that the batteries have a
standard deviation of 1 year? Assume that the battery
lifetime follows a normal distribution.

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 68


Example

A particular type of vacuum-packed coffee packet


contains an average of 16 oz. It has been observed that the
number of ounces of coffee in these packets is normally
distributed with σ = 1.41 oz. A random sample of 15 of
these coffee packets is selected, and the observations are
used to calculate s. Find the numbers a and b such
that P(a ≤ S2 ≤ b) = 0.90.

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 69


Example

• A psychologist claims that the mean age at which


female children start walking is 11.4 months. If 20
randomly selected female children are found to have
started walking at a mean age of 12 months with
standard deviation of 2 months, would you agree with
the psychologist's claim? Assume that the sample came
from a normal population.

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 70


Section 8.7
F-Distribution

Copyright © 2017 Pearson Education, Ltd. All rights reserved.


Theorem 8.6

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 72


Figure 8.11 Typical F-distributions

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 73


Figure 8.12 Illustration of the f for
the F-distribution

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 74


Theorem 8.7

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 75


Theorem 8.8

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 76


Example

Let S12 denote the sample variance for a random sample of size 10
from Population I and let S22 denote the sample variance for a
random sample of size 8 from Population II. The variance of
Population I is assumed to be three times the variance of Population
II. Find two numbers a and b such that P(a ≤ S12/S22 ≤ b) = 0.90
assuming S12 to be independent of S22.

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 77


Solution

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 78


Example

Copyright © 2017 Pearson Education, Ltd. All rights reserved. 8 - 79

You might also like