Imp Formula (stats 2)
Uploaded by mishkatchougule
Stats 2 Formula Sheet - Summary
Programming and Data Science (Indian Institute of Technology Madras)


Statistics for Data Science - 2


Formula file

Discrete random variables:

Uniform(A), A = {a, a + 1, ..., b}, n = b − a + 1
  PMF: fX(k) = 1/n, k = a, a + 1, ..., b
  CDF: FX(x) = 0 for x < a; (k − a + 1)/n for k ≤ x < k + 1, k = a, ..., b − 1; 1 for x ≥ b
  E[X] = (a + b)/2, Var(X) = (n² − 1)/12

Bernoulli(p)
  PMF: fX(1) = p, fX(0) = 1 − p
  CDF: FX(x) = 0 for x < 0; 1 − p for 0 ≤ x < 1; 1 for x ≥ 1
  E[X] = p, Var(X) = p(1 − p)

Binomial(n, p)
  PMF: fX(k) = nCk p^k (1 − p)^(n−k), k = 0, 1, ..., n
  CDF: FX(x) = 0 for x < 0; Σ_{i=0}^{k} nCi p^i (1 − p)^(n−i) for k ≤ x < k + 1, k = 0, ..., n − 1; 1 for x ≥ n
  E[X] = np, Var(X) = np(1 − p)

Geometric(p)
  PMF: fX(k) = (1 − p)^(k−1) p, k = 1, 2, ...
  CDF: FX(x) = 0 for x < 1; 1 − (1 − p)^k for k ≤ x < k + 1, k = 1, 2, ...
  E[X] = 1/p, Var(X) = (1 − p)/p²

Poisson(λ)
  PMF: fX(k) = e^(−λ) λ^k / k!, k = 0, 1, ...
  CDF: FX(x) = 0 for x < 0; e^(−λ) Σ_{i=0}^{k} λ^i / i! for k ≤ x < k + 1, k = 0, 1, ...
  E[X] = λ, Var(X) = λ

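As a quick sanity check of the Binomial row, the PMF can be evaluated directly with the standard library; n = 10 and p = 0.3 are arbitrary example values, not part of the formula sheet.

```python
# Sanity check of the Binomial(n, p) formulas: the PMF sums to 1,
# the mean equals n*p and the variance equals n*p*(1-p).
# n = 10, p = 0.3 are arbitrary example values.
from math import comb

n, p = 10, 0.3
pmf = [comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)]

total = sum(pmf)
mean = sum(k * f for k, f in enumerate(pmf))
var = sum((k - mean)**2 * f for k, f in enumerate(pmf))
print(round(total, 6), round(mean, 6), round(var, 6))  # 1.0 3.0 2.1
```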
Downloaded by Mishkat Chougule ([email protected])


lOMoARcPSD|31573472

Continuous random variables:

Uniform[a, b]
  PDF: fX(x) = 1/(b − a), a ≤ x ≤ b
  CDF: FX(x) = 0 for x ≤ a; (x − a)/(b − a) for a < x < b; 1 for x ≥ b
  E[X] = (a + b)/2, Var(X) = (b − a)²/12

Exp(λ)
  PDF: fX(x) = λ e^(−λx), x > 0
  CDF: FX(x) = 0 for x ≤ 0; 1 − e^(−λx) for x > 0
  E[X] = 1/λ, Var(X) = 1/λ²

Normal(µ, σ²)
  PDF: fX(x) = (1/(σ√(2π))) exp(−(x − µ)²/(2σ²)), −∞ < x < ∞
  CDF: no closed form
  E[X] = µ, Var(X) = σ²

Gamma(α, β)
  PDF: fX(x) = (β^α/Γ(α)) x^(α−1) e^(−βx), x > 0
  E[X] = α/β, Var(X) = α/β²

Beta(α, β)
  PDF: fX(x) = (Γ(α + β)/(Γ(α)Γ(β))) x^(α−1) (1 − x)^(β−1), 0 < x < 1
  E[X] = α/(α + β), Var(X) = αβ/((α + β)²(α + β + 1))
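The Exp(λ) row can be checked numerically: integrating the PDF over [0, x] should reproduce the closed-form CDF. A minimal standard-library sketch; λ = 2 and x = 1.5 are arbitrary example values.

```python
# Numeric check of the Exp(lambda) CDF: the trapezoidal integral of the
# PDF over [0, x] should match 1 - e^(-lambda*x).
from math import exp

lam = 2.0
pdf = lambda t: lam * exp(-lam * t)

def cdf_numeric(x, steps=100_000):
    h = x / steps  # trapezoidal rule on [0, x]
    return h * (0.5 * pdf(0.0) + sum(pdf(i * h) for i in range(1, steps)) + 0.5 * pdf(x))

x = 1.5
closed_form = 1 - exp(-lam * x)
print(abs(cdf_numeric(x) - closed_form) < 1e-6)  # True
```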

1. Markov's inequality: Let X be a discrete random variable taking non-negative values with a finite mean µ. Then,

   P(X ≥ c) ≤ µ/c
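For instance, with X ~ Poisson(3) and c = 7 (arbitrary example values), the exact tail probability sits well below the bound; a standard-library sketch:

```python
# Markov's inequality check: for X ~ Poisson(mu), P(X >= c) <= mu/c.
from math import exp, factorial

mu, c = 3.0, 7
tail = 1 - sum(exp(-mu) * mu**k / factorial(k) for k in range(c))  # P(X >= c)
bound = mu / c
print(round(tail, 4), "<=", round(bound, 4))  # the exact tail is far below 3/7
```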
2. Chebyshev's inequality: Let X be a discrete random variable with a finite mean µ and a finite variance σ². Then,

   P(|X − µ| ≥ kσ) ≤ 1/k²
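The same Poisson(3) example (µ = 3, σ² = 3; values are illustrative) confirms the Chebyshev bound for k = 2:

```python
# Chebyshev check for X ~ Poisson(3) with k = 2:
# P(|X - mu| >= k*sigma) must be at most 1/k^2 = 0.25.
from math import exp, factorial, sqrt

mu = 3.0
sigma = sqrt(mu)  # Poisson variance equals its mean
k = 2.0
lo, hi = mu - k * sigma, mu + k * sigma
pmf = lambda i: exp(-mu) * mu**i / factorial(i)
# sum the PMF over integers outside (mu - k*sigma, mu + k*sigma);
# terms beyond i = 60 are negligible for mu = 3
prob = sum(pmf(i) for i in range(60) if i <= lo or i >= hi)
print(prob <= 1 / k**2)  # True
```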

3. Weak Law of Large Numbers: Let X1, X2, ..., Xn ~ iid X with E[X] = µ and Var(X) = σ². Define the sample mean X̄ = (X1 + X2 + ... + Xn)/n. Then,

   P(|X̄ − µ| > δ) ≤ σ²/(nδ²)
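The bound can be watched in a small simulation with Bernoulli(0.5) samples (so µ = 0.5, σ² = 0.25); n, δ, the seed, and the trial count are arbitrary choices for illustration.

```python
# WLLN simulation: the empirical frequency of |Xbar - mu| > delta
# should not exceed sigma^2 / (n * delta^2).
import random

random.seed(0)
n, delta, trials = 1000, 0.05, 2000
mu, var = 0.5, 0.25
exceed = 0
for _ in range(trials):
    xbar = sum(random.random() < 0.5 for _ in range(n)) / n
    if abs(xbar - mu) > delta:
        exceed += 1
empirical = exceed / trials
bound = var / (n * delta**2)  # 0.1
print(empirical <= bound)  # True
```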

4. Using CLT to approximate probability: Let X1, X2, ..., Xn ~ iid X with E[X] = µ, Var(X) = σ². Define Y = X1 + X2 + ... + Xn. Then,

   (Y − nµ)/(σ√n) ≈ Normal(0, 1).
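As an example of the approximation (all numbers are illustrative), P(Y ≤ 520) for a sum of 1000 Bernoulli(0.5) draws can be read off the standard normal CDF and compared with a simulation:

```python
# CLT approximation of P(Y <= 520) for Y = sum of 1000 Bernoulli(0.5) draws,
# using the standardized normal CDF, checked against a simulation.
import random
from math import erf, sqrt

random.seed(1)
n, p = 1000, 0.5
mu, sigma = p, sqrt(p * (1 - p))

def normal_cdf(z):
    return 0.5 * (1 + erf(z / sqrt(2)))

approx = normal_cdf((520 - n * mu) / (sigma * sqrt(n)))
sim = sum(sum(random.random() < p for _ in range(n)) <= 520
          for _ in range(2000)) / 2000
print(abs(approx - sim) < 0.05)  # True
```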


5. Bias of an estimator: Bias(θ̂, θ) = E[θ̂] − θ.
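A standard illustration (not from the formula file): the plug-in variance estimator that divides by n, rather than n − 1, has expectation (n − 1)/n · σ², so its bias is −σ²/n. A small simulation sketch, with arbitrary n and seed:

```python
# Bias illustration: the plug-in variance estimator (1/n) * sum (Xi - Xbar)^2
# underestimates sigma^2, with E[theta_hat] = (n-1)/n * sigma^2.
import random

random.seed(2)
n, trials = 5, 20_000
avg = 0.0
for _ in range(trials):
    xs = [random.gauss(0, 1) for _ in range(n)]  # true sigma^2 = 1
    xbar = sum(xs) / n
    avg += sum((x - xbar)**2 for x in xs) / n
avg /= trials
print(round(avg, 2))  # close to (n-1)/n = 0.8, not 1.0
```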


6. Method of moments: Sample moments: Mk(X1, X2, ..., Xn) = (1/n) Σ_{i=1}^{n} Xi^k
   Procedure for one parameter θ:

   • Sample moment: m1
   • Distribution moment: E[X] = f(θ)
   • Solve f(θ) = m1 for θ in terms of m1.
   • θ̂: replace m1 by M1 in the above solution.
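The procedure above can be sketched for Exp(λ), where E[X] = 1/λ, so solving 1/λ = m1 gives λ̂ = 1/M1. The true λ and sample size below are arbitrary example values.

```python
# Method-of-moments estimate for Exp(lambda): lambda_hat = 1 / M1.
import random

random.seed(3)
true_lam = 2.0
xs = [random.expovariate(true_lam) for _ in range(50_000)]
m1 = sum(xs) / len(xs)   # first sample moment
lam_hat = 1 / m1
print(round(lam_hat, 1))  # close to 2.0
```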

7. Likelihood of i.i.d. samples: The likelihood of a sample x1, x2, ..., xn is

   L(x1, ..., xn) = Π_{i=1}^{n} fX(xi; θ1, θ2, ...)

8. Maximum likelihood (ML) estimation:

   (θ1*, θ2*, ...) = arg max_{θ1, θ2, ...} Π_{i=1}^{n} fX(xi; θ1, θ2, ...)
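For Bernoulli(p), the log-likelihood Σᵢ [xi log p + (1 − xi) log(1 − p)] is maximized at the sample mean. A sketch that locates the argmax on a grid and recovers the closed form; the sample is an arbitrary example.

```python
# ML estimation for Bernoulli(p): the grid argmax of the log-likelihood
# coincides with the closed-form MLE, the sample mean.
from math import log

xs = [1, 0, 1, 1, 0, 1, 0, 1, 1, 1]  # example sample with mean 0.7

def loglik(p):
    return sum(x * log(p) + (1 - x) * log(1 - p) for x in xs)

grid = [i / 1000 for i in range(1, 1000)]
p_star = max(grid, key=loglik)
print(p_star)  # 0.7
```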

9. Bayesian estimation: Let X1, ..., Xn ~ i.i.d. X with parameter Θ.

   Prior distribution of Θ: Θ ~ fΘ(θ).
   Samples S: (X1 = x1, ..., Xn = xn)
   Posterior: Θ | (X1 = x1, ..., Xn = xn)
   Bayes' rule: Posterior ∝ Prior × Likelihood
   Posterior density ∝ fΘ(θ) × P(X1 = x1, ..., Xn = xn | Θ = θ)
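A concrete conjugate case (the prior and data below are example values): a Beta(a, b) prior on a Bernoulli parameter combined with k ones in n samples yields a Beta(a + k, b + n − k) posterior.

```python
# Bayesian update sketch: Beta prior x Bernoulli likelihood -> Beta posterior.
a, b = 2, 2                      # example Beta prior
xs = [1, 1, 0, 1, 0, 1, 1, 1]    # example Bernoulli sample
n, k = len(xs), sum(xs)
post_a, post_b = a + k, b + n - k
post_mean = post_a / (post_a + post_b)  # mean of Beta(post_a, post_b)
print(post_a, post_b, round(post_mean, 4))  # 8 4 0.6667
```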

10. Normal samples with unknown mean and known variance:
    X1, ..., Xn ~ i.i.d. Normal(M, σ²), with prior M ~ Normal(µ0, σ0²).

    Posterior mean: µ̂ = (nσ0²/(nσ0² + σ²)) X̄ + (σ²/(nσ0² + σ²)) µ0
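The posterior mean is a weighted average of the sample mean and the prior mean; a numeric check with illustrative data, prior, and known variance:

```python
# Numeric check of the posterior-mean formula above.
xs = [4.9, 5.2, 5.1, 4.8, 5.0]   # data, known variance sigma^2 = 0.25
mu0, sigma0_sq = 4.0, 1.0        # prior Normal(mu0, sigma0^2)
sigma_sq = 0.25
n = len(xs)
xbar = sum(xs) / n
w = n * sigma0_sq / (n * sigma0_sq + sigma_sq)
mu_hat = w * xbar + (1 - w) * mu0
print(round(mu_hat, 4))  # 4.9524, pulled slightly from xbar toward the prior
```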


11. Hypothesis Testing

    • Test for mean

    Case (1): When the population variance σ² is known (z-test). Test statistic: Z = (X̄ − µ0)/(σ/√n)

      Test          H0       HA       Rejection region
      right-tailed  µ = µ0   µ > µ0   X̄ > c
      left-tailed   µ = µ0   µ < µ0   X̄ < c
      two-tailed    µ = µ0   µ ≠ µ0   |X̄ − µ0| > c

    Case (2): When the population variance σ² is unknown (t-test). Test statistic: t_{n−1} = (X̄ − µ0)/(S/√n), where S is the sample standard deviation

      Test          H0       HA       Rejection region
      right-tailed  µ = µ0   µ > µ0   X̄ > c
      left-tailed   µ = µ0   µ < µ0   X̄ < c
      two-tailed    µ = µ0   µ ≠ µ0   |X̄ − µ0| > c
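The t statistic can be computed by hand; the sample below and H0: µ = 5 are illustrative values, not from the formula sheet.

```python
# One-sample t statistic (Xbar - mu0) / (S / sqrt(n)) for an example sample.
from math import sqrt

xs = [5.3, 5.1, 4.8, 5.6, 5.4, 5.2]
mu0 = 5.0
n = len(xs)
xbar = sum(xs) / n
s = sqrt(sum((x - xbar)**2 for x in xs) / (n - 1))  # sample standard deviation
t = (xbar - mu0) / (s / sqrt(n))
print(round(t, 2))  # about 2.09, with n - 1 = 5 degrees of freedom
```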


    • χ²-test for variance: Test statistic: T = (n − 1)S²/σ0² ~ χ²_{n−1}

      Test          H0       HA       Rejection region
      right-tailed  σ = σ0   σ > σ0   S² > c²
      left-tailed   σ = σ0   σ < σ0   S² < c²
      two-tailed    σ = σ0   σ ≠ σ0   S² > c_R² where α/2 = P(S² > c_R²), or S² < c_L² where α/2 = P(S² < c_L²)
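The statistic reduces to Σ(xi − x̄)²/σ0²; a hand computation on an example sample, testing H0: σ = 1 (so σ0² = 1, an arbitrary choice):

```python
# Chi-square variance-test statistic (n - 1) * S^2 / sigma0^2.
xs = [1.2, -0.4, 2.1, 0.3, -1.5, 0.8, -0.9, 1.7]  # example sample
sigma0_sq = 1.0
n = len(xs)
xbar = sum(xs) / n
s_sq = sum((x - xbar)**2 for x in xs) / (n - 1)   # sample variance
T = (n - 1) * s_sq / sigma0_sq
print(round(T, 3))  # compare against chi-square quantiles with n - 1 = 7 df
```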

    • Two-sample z-test for means: if H0 is true, X̄ − Ȳ ~ Normal(0, σ1²/n1 + σ2²/n2)

      Test          H0        HA        Statistic    Rejection region
      right-tailed  µ1 = µ2   µ1 > µ2   T = X̄ − Ȳ    X̄ − Ȳ > c
      left-tailed   µ1 = µ2   µ1 < µ2   T = Ȳ − X̄    Ȳ − X̄ > c
      two-tailed    µ1 = µ2   µ1 ≠ µ2   T = X̄ − Ȳ    |X̄ − Ȳ| > c
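Standardizing T by its null standard deviation gives a z value; a sketch on example samples with (assumed) known variances:

```python
# Two-sample z statistic for means with known variances:
# under H0, Xbar - Ybar ~ Normal(0, sigma1^2/n1 + sigma2^2/n2).
from math import sqrt

x = [5.1, 4.9, 5.3, 5.0]        # sample 1, known sigma1^2 = 0.04 (example)
y = [4.6, 4.8, 4.7, 4.5, 4.9]   # sample 2, known sigma2^2 = 0.09 (example)
sigma1_sq, sigma2_sq = 0.04, 0.09
xbar, ybar = sum(x) / len(x), sum(y) / len(y)
z = (xbar - ybar) / sqrt(sigma1_sq / len(x) + sigma2_sq / len(y))
print(round(z, 2))  # about 2.24
```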

    • Two-sample F-test for variances: Test statistic: T = S1²/S2² ~ F(n1−1, n2−1)

      Test                H0        HA        Rejection region
      one-tailed (right)  σ1 = σ2   σ1 > σ2   S1²/S2² > 1 + c
      one-tailed (left)   σ1 = σ2   σ1 < σ2   S1²/S2² < 1 − c
      two-tailed          σ1 = σ2   σ1 ≠ σ2   S1²/S2² > 1 + cR where α/2 = P(T > 1 + cR), or S1²/S2² < 1 − cL where α/2 = P(T < 1 − cL)
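The F statistic is just the ratio of the two sample variances; a hand computation on example samples:

```python
# Two-sample F statistic S1^2 / S2^2; under H0 it follows
# F with (n1 - 1, n2 - 1) degrees of freedom.
def sample_var(xs):
    xbar = sum(xs) / len(xs)
    return sum((x - xbar)**2 for x in xs) / (len(xs) - 1)

x = [2.1, 1.8, 2.5, 2.0, 1.6]        # example sample 1
y = [3.0, 2.2, 2.6, 2.9, 2.4, 2.7]   # example sample 2
T = sample_var(x) / sample_var(y)
print(round(T, 3))  # compare against F(4, 5) quantiles
```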
