STAT453 Study Guide
STAT453 Study Guide
Nils DM
February 18, 2022
1 Introduction
1. Statistical Design of Experiments: The process of planning the
experiment so that appropriate data will be collected and analysed by
statistical methods.
(a) Randomization
(b) Replication
(c) Blocking (and the factorial principle)
1
4. Strategy of Experimentation:
2
2 Simple Comparative Experiments
1. Run: An observation.
yij = µi + ϵij
Where:
We then have:
3
4. Two Independent Sample T-Test: Assumptions:
H0 : µ1 = µ2
H1 : µ1 ̸= µ2
Test Statistic:
y − y2
t0 = q1
sp n11 + n12
Where:
(n1 − 1)s21 + (n2−1 )s22
sp =
n1 + n2 − 2
If H0 is true, then t0∼tn1 +n2 −2
Confidence Interval:
r
1 1
(y 1 − y 2 ) ± t α2 ,n1 +n2 −2 sp +
n1 n2
H0 : µ1 = µ2
H1 : µ1 ̸= µ2
Test Statistic:
y −y
t0 = q 1 2 2 2
s1 s
n1
+ n22
Confidence Interval:
s
s21 s2
(y 1 − y 2 ) ± t α2 ,v sp + 2
n1 n2
4
(c) Case 3: σ12 and σ22 is known
Hypotheses:
H0 : µ1 = µ2
H1 : µ1 ̸= µ2
Test Statistic:
If σ12 ̸= σ22 , then the test statistic is:
y −y
Z0 = q 1 2 2 2
σ1 σ
n1
+ n22
Under H0 , Z0 ∼ (0, 1)
Confidence Interval:
s
σ12 σ22
(y 1 − y 2 ) ± z α2 +
n1 n2
H0 : µ = µ0
H1 : µ ̸= µ0
H0 ∼ N (0, 1)
Confidence Interval:
σ
y ± z α2 √
n
If the sample size is not large and the variance is unknown:
y − µ0
t0 =
√s
n
5
5. Checking for equal variance: The way that we check if two in-
dependent normally distributed populations have equal variance is to
check the ratio of their sample variances which are F distributed:
s21
∼ Fn1 −1,n2 −1
s22
6
3 Experiments with a Single Factor: The Anal-
ysis of Variance
1. Treatments: The different levels of a factor.
a × n = N Runs.
If the runs are chosen in a random order, than the design is called a
completely randomized design (CRD).
4. ANOVA model:
We specify the model as:
Yij = µ + τi + ϵij
For:
(a) i = 1, . . . , a
(b) j = 1, . . . , n
µi = µ + τ i
7
5. Model Adequacy: The following tests are used to determine if the
model is acceptable:
8
4 Randomized Blocks, Latin Squares, and Re-
lated Designs
1. Nuisance Factors: factors that affect the response variable but we
are not interested in them. How do we address this issue?
9
ii. The observations in the Latin square should be taken in ran-
dom order.
(c) Model:
Yijk = µ + αi + τj + βk + ϵijk
(d) Degrees of Freedom:
(a) Assume we have a treatments and b blocks where each block con-
tains k < a treatments.
(b) Each treatment a occurs r times in the design, therefore there are:
N = ar = bk
total observations.
(c) The number of times each pair of treatments appears in the same
block is:
r(k − 1)
λ=
a−1
5. Model:
yij = µ + τi + βj + ϵij
Same assumptions as the RCBD
6. Degrees of freedom:
10
7. Test Statistic:
M ST reatments
F0 = ∼ Fα,a−1,N −a−b+1
M SE
11