0% found this document useful (0 votes)
180 views58 pages

Chapter 8 Introduction To Hypothesis Testing

This document provides an overview of hypothesis testing including: - The four main steps: state hypotheses, set decision criteria, collect data, make a decision - Types of errors like Type I (rejecting a true null hypothesis) and Type II (failing to reject a false null hypothesis) - Key concepts like critical regions, significance levels, and interpreting statistically significant results - Factors that influence hypothesis test outcomes like sample size, variability, and difference from the hypothesized value The goal is to use sample data and probability to evaluate whether a hypothesis about a population should be rejected or retained.

Uploaded by

de
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
180 views58 pages

Chapter 8 Introduction To Hypothesis Testing

This document provides an overview of hypothesis testing including: - The four main steps: state hypotheses, set decision criteria, collect data, make a decision - Types of errors like Type I (rejecting a true null hypothesis) and Type II (failing to reject a false null hypothesis) - Key concepts like critical regions, significance levels, and interpreting statistically significant results - Factors that influence hypothesis test outcomes like sample size, variability, and difference from the hypothesized value The goal is to use sample data and probability to evaluate whether a hypothesis about a population should be rejected or retained.

Uploaded by

de
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 58

Chapter 8

Introduction to Hypothesis Testing


PowerPoint Lecture Slides
Essentials of Statistics for the
Behavioral Sciences
Eighth Edition
by Frederick J. Gravetter and Larry B. Wallnau
Chapter 8 Learning Outcomes
1 • Understand logic of hypothesis testing

2 • State hypotheses and locate critical region(s)

3 • Conduct z-test and make decision

4 • Define and differentiate Type I and Type II errors

5 • Understand effect size and compute Cohen’s d

6 • Make directional hypotheses and conduct one-tailed test


Tools You Will Need
• z-scores (Chapter 5)
• Distribution of sample means (Chapter 7)
– Expected value
– Standard error
– Probability and sample means
8.1 Hypothesis Testing Logic
• A hypothesis test is an inferential procedure that
uses sample data to evaluate the credibility of a
hypothesis about a population
• Here we’ll apply what we know about z-scores and
probability to inform our inferences (i.e., how likely is
it that a given sample came from a population?)
• We’ll also set criteria for making decisions about
whether a sample represents a population or not
Logic of Hypothesis Test
• State a hypothesis about a population
• Predict the expected characteristics of the
sample based on the hypothesis
• Obtain a random sample from the population
• Compare the obtained sample data with the
prediction made from the hypothesis
– If consistent, hypothesis is reasonable
– If discrepant, hypothesis is rejected
Figure 8.1
Basic Experimental Design
Figure 8.2 Unknown Population in
Basic Experimental Design
Four Steps in Hypothesis Testing

Step 1: State the hypotheses


Step 2: Set the criteria for a decision
Step 3: Collect data; compute sample statistics
Step 4: Make a decision
Step 1: State Hypotheses
• Null hypothesis (H0) states that, in the general
population, there is no change, no difference,
or is no relationship

• Alternative hypothesis (H1) states that there is


a change, a difference, or there is a
relationship in the general population
Step 2: Set the Decision Criterion
• Distribution of sample outcomes is divided
– Those likely if H0 is true
– Those “very unlikely” if H0 is true
• Alpha level, or significance level, is a probability
value used to define “very unlikely” outcomes
• Critical region(s) consist of the extreme sample
outcomes that are “very unlikely”
• Boundaries of critical region(s) are determined by
the probability set by the alpha level
Figure 8.3 Note “Unlikely” Parts of
Distribution of Sample Means
Figure 8.4
Critical region(s) for α = .05
Learning Check
• A sports coach is investigating the impact of a
new training method. In words, what would
the null hypothesis say?
• The new training program produces different
A results from the existing one
• The new training program produces results
B about like the existing one
• The new training program produces better
C results than the existing one
• There is no way to predict the results of the
D new training program
Learning Check - Answer
• A sports coach is investigating the impact of a
new training method. In words, what would
the null hypothesis say?
• The new training program produces different
A results from the existing one
• The new training program produces results
B about like the existing one
• The new training program produces better
C results than the existing one
• There is no way to predict the results of the
D new training program
Learning Check
• Decide if each of the following statements
is True or False.

• If the alpha level is decreased, the size


T/F of the critical region decreases

T/F • The critical region defines unlikely


values if the null hypothesis is true
Learning Check - Answers

• Alpha is the proportion of the area


True in the critical region(s)

True • This is the definition of “unlikely”


Step 3: Collect Data (and…)

• Data always collected after hypotheses stated


• Data always collected after establishing
decision criteria
• This sequence assures objectivity
Step 3: (continued)…
Compute Sample Statistics
• Compute a sample statistic (z-score) to show
the exact position of the sample

M 
z
M

• In words, z is the difference between the


observed sample mean and the hypothesized
population mean divided by the standard
error of the mean
Step 4: Make a decision

• If sample statistic (z) is located in the critical


region, the null hypothesis is rejected

• If the sample statistic (z) is not located in the


critical region, the researcher fails to reject the
null hypothesis
Jury Trial:
Hypothesis Testing Analogy
• Trial begins with the null hypothesis “not guilty”
(defendant’s innocent plea)
• Police and prosecutor gather evidence (data)
relevant to the validity of the innocent plea
• With sufficient evidence against, jury rejects null
hypothesis innocence claim to conclude “guilty”
• With insufficient evidence against, jury fails to
convict, i.e., fails to reject the “not guilty” claim
(but does not conclude defendant is innocent)
Learning Check
• Decide if each of the following statements
is True or False.

• When the z-score is quite


T/F extreme, it shows the null
hypothesis is true
• A decision to retain the null
T/F hypothesis means you proved that
the treatment has no effect
Learning Check - Answer

• An extreme z-score is in the critical


False region—very unlikely if H0 is true

• Failing to reject H0 does not prove it


False true; there is just not enough evidence
to reject it
8.2 Uncertainty and Errors
in Hypothesis Testing
• Hypothesis testing is an inferential process
– Uses limited information from a sample to make a
statistical decision, and then from it a general
conclusion
– Sample data used to make the statistical decision
allows us to make an inference and draw a
conclusion about a population

• Errors are possible


Type I Errors
• Researcher rejects a null hypothesis that is
actually true
• Researcher concludes that a treatment has an
effect when it has none
• Alpha level is the probability that a test will
lead to a Type I error
Type II Errors

• Researcher fails to reject a null hypothesis


that is really false
• Researcher has failed to detect a real
treatment effect
• Type II error probability is not easily identified
Table 8.1

Actual Situation

No Effect = Effect Exists =


H0 True H0 False
Type I error
Reject H0 Decision correct
Researcher’s (α)
Decision Type II error
Fail to reject H0 Decision correct
(β)
Figure 8.5 Location of
Critical Region Boundaries
Learning Check
• Decide if each of the following statements
is True or False.

• A Type I error is like convicting an


T/F innocent person in a jury trial

• A Type II error is like convicting a


T/F guilty person in a jury trial
Learning Check - Answer

• Innocence is the “null hypothesis”


True for a jury trial; conviction is like
rejecting that hypothesis
• Convicting a guilty person is not an
False error; but acquitting a guilty
person would be like Type II error
8.3 Hypothesis Testing Summary
• Step 1: State hypotheses and select alpha level
• Step 2: Locate the critical region
• Step 3: Collect data; compute the test statistic
• Step 4: Make a probability-based decision
about H0: Reject H0 if the test statistic is
unlikely when H0 is true—called a “significant”
or “statistically significant” result
In the Literature
• A result is significant or statistically significant
if it is very unlikely to occur when the null
hypothesis is true; conclusion: reject H0
• In APA format
– Report that you found a significant effect
– Report value of test statistic
– Report the p-value of your test statistic
Figure 8.6
Critical Region for Standard Test
8.3 Assumptions for
Hypothesis Tests with z-Scores

• Random sampling
• Independent Observation
• Value of σ is not changed by the treatment
• Normally distributed sampling distribution
Factors that Influence the
Outcome of a Hypothesis Test
• Size of difference between sample mean and
original population mean
– Larger discrepancies  larger z-scores
• Variability of the scores
– More variability  larger standard error
• Number of scores in the sample
– Larger n  smaller standard error
Learning Check
• A researcher uses a hypothesis test to evaluate
H0: µ = 80. Which combination of factors is most
likely to result in rejecting the null hypothesis?

A • σ = 5 and n = 25
B • σ = 5 and n = 50
C • σ = 10 and n = 25
D • σ = 10 and n = 50
Learning Check - Answer
• A researcher uses a hypothesis test to evaluate
H0: µ = 80. Which combination of factors is most
likely to result in rejecting the null hypothesis?

A • σ = 5 and n = 25
B • σ = 5 and n = 50
C • σ = 10 and n = 25
D • σ = 10 and n = 50
Learning Check
• Decide if each of the following statements
is True or False.

• An effect that exists is more likely


T/F to be detected if n is large

• An effect that exists is less likely to


T/F be detected if σ is large
Learning Check - Answers

• A larger sample produces a


True smaller standard error and larger z

• A larger standard deviation


True increases the standard error and
produces a smaller z
8.4 Directional Hypothesis Tests

• The standard hypothesis testing procedure is


called a two-tailed (non-directional) test
because the critical region involves both tails
to determine if the treatment increases or
decreases the target behavior
• However, sometimes the researcher has a
specific prediction about the direction of the
treatment
8.4 Directional Hypothesis Tests
(Continued)
• When a specific direction of the treatment
effect can be predicted, it can be incorporated
into the hypotheses
• In a directional (one-tailed) hypothesis test,
the researcher specifies either an increase or
a decrease in the population mean as a
consequence of the treatment
Figure 8.7 Example 8.3
Critical Region (Directional)
One-tailed and Two-tailed Tests
Compared
• One-tailed test allows rejecting H0 with
relatively small difference provided the
difference is in the predicted direction
• Two-tailed test requires relatively large
difference regardless of the direction of the
difference
• In general two-tailed tests should be used
unless there is a strong justification for a
directional prediction
Learning Check
• A researcher is predicting that a treatment will
decrease scores. If this treatment is evaluated
using a directional hypothesis test, then the
critical region for the test.
• would be entirely in the right-hand tail of
A the distribution
• would be entirely in the left-hand tail of
B the distribution
• would be divided equally between the two tails
C of the distribution
• cannot answer without knowing the value of
D the alpha level
Learning Check - Answer
• A researcher is predicting that a treatment will
decrease scores. If this treatment is evaluated
using a directional hypothesis test, then the
critical region for the test.
• would be entirely in the right-hand tail of
A the distribution
• would be entirely in the left-hand tail of
B the distribution
• would be divided equally between the two tails
C of the distribution
• cannot answer without knowing the value of
D the alpha level
8.5 Hypothesis Testing Concerns:
Measuring Effect Size
• With a large n, small effects (or differences)
can be statistically significant
• Effect size measures the absolute magnitude
of a treatment effect, independent of sample
size
• Cohen’s d measures effect size simply and
directly in a standardized way
Cohen’s d : Measure of Effect Size

mean difference treatment   no treatment


Cohen' s d  
standard deviation 

Magnitude of d Evaluation of Effect Size


d = 0.2 Small effect
d = 0.5 Medium effect
d = 0.8 Large effect
Figure 8.8 When is a 15-point
Difference a “Large” Effect?
Learning Check
• Decide if each of the following statements
is True or False.

• Increasing the sample size will also


T/F increase the effect size

• Larger differences between the


T/F sample and population mean
increase effect size
Learning Check -Answers

• Sample size does not affect


False Cohen’s d

• The mean difference is in the


True numerator of Cohen’s d
8.6 Statistical Power
• The power of a test is the probability that the
test will correctly reject a false null hypothesis
– It will detect a treatment effect if one exists
– Power = 1 – β [where β = probability of a Type II
error]
• Power usually estimated before starting study
– Requires several assumptions about factors that
influence power
Figure 8.9
Measuring Statistical Power
Influences on Power
• Increased Power
– As effect size increases, power also increases
– Larger sample sizes produce greater power
– Using a one-tailed (directional) test increases power
(relative to a two-tailed test)
• Decreased Power
– Reducing the alpha level (making the test more
stringent) reduces power
– Using two-tailed (non-directional) test decreases
power (relative to a one-tailed test)
Figure 8.10
Sample Size Affects Power
Learning Check
• The power of a statistical test is the
probability of _____

A • rejecting a true null hypothesis


B • supporting true null hypothesis
C • rejecting a false null hypothesis
D • supporting a false null hypothesis
Learning Check - Answer
• The power of a statistical test is the
probability of _____

A • rejecting a true null hypothesis


B • supporting true null hypothesis
C • rejecting a false null hypothesis
D • supporting a false null hypothesis
Learning Check
• Decide if each of the following statements
is True or False.

• Cohen’s d is used because alone, a


T/F hypothesis test does not measure
the size of the treatment effect
• Lowering the alpha level from .05
T/F to .01 will increase the power of a
statistical test
Answer

• Differences might be significant


True but not of substantial size

• It is less likely that H0 will be


False rejected with a small alpha
Equations?

Concepts
?

Any
Questions
?

You might also like