Lecture 01
Sample space: $S = \{1, 2, 3, 4, 5, 6\}$
pmf: $f(x) = 1/6$ for $x \in S$
i.e. $f(1) = f(2) = \cdots = f(6) = 1/6$
pmf: $f(2) = \frac{2}{10} = 0.2$, $f(5) = \frac{3}{10} = 0.3$, $f(10) = \frac{5}{10} = 0.5$
Properties:
1. $0 \le F(x) \le 1$
2. $F(x)$ is a non-decreasing function, i.e. $F(x_2) \ge F(x_1)$
if $x_2 > x_1$
CDF: $F(x) = P(X \le x) = \frac{x}{6}$, for $x = 1, 2, \ldots, 6$
$F(1) = P(X \le 1) = f(1) = \frac{1}{6}$
$F(2) = P(X \le 2) = f(1) + f(2) = \frac{2}{6}$
…
$F(6) = P(X \le 6) = f(1) + f(2) + \cdots + f(6) = \frac{6}{6}$
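The running sums above can be reproduced with a short sketch. The variable names are illustrative only, and `Fraction` is used so the values stay exact (note that it prints reduced fractions, e.g. 2/6 appears as 1/3).

```python
from fractions import Fraction
from itertools import accumulate

# pmf of a fair six-sided die: f(x) = 1/6 for x = 1, ..., 6
outcomes = [1, 2, 3, 4, 5, 6]
pmf = {x: Fraction(1, 6) for x in outcomes}

# F(x) = P(X <= x) is the running total of the pmf values up to x
cdf = dict(zip(outcomes, accumulate(pmf[x] for x in outcomes)))

for x in outcomes:
    print(f"F({x}) = {cdf[x]}")
```

The last value, F(6), equals 1, and the values never decrease as x grows.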
$E(X) = \sum_{i=1}^{n} x_i \cdot f(x_i)$
$= 1 \cdot \frac{1}{6} + 2 \cdot \frac{1}{6} + 3 \cdot \frac{1}{6} + 4 \cdot \frac{1}{6} + 5 \cdot \frac{1}{6} + 6 \cdot \frac{1}{6}$
$= 3.5$
That means that when a fair die is rolled a large number of times,
the average of the numbers showing up is approximately 3.5.
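A quick simulation illustrates this long-run interpretation. This is a minimal sketch: the seed and the number of rolls are arbitrary choices, not values from the lecture.

```python
import random
from fractions import Fraction

outcomes = [1, 2, 3, 4, 5, 6]

# Exact expected value: sum of x * f(x) with f(x) = 1/6
expected = sum(x * Fraction(1, 6) for x in outcomes)

# Simulated rolls of a fair die (seed and sample size are assumptions)
rng = random.Random(0)
n_rolls = 100_000
rolls = [rng.randint(1, 6) for _ in range(n_rolls)]
sample_mean = sum(rolls) / n_rolls

print("exact E(X):", float(expected))
print("sample mean:", sample_mean)
```

The sample mean will not be exactly 3.5, but it should get closer as `n_rolls` increases.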
$E(X) = \sum_{i=1}^{n} x_i \cdot f(x_i)$
$= 2(0.2) + 5(0.3) + 10(0.5)$
$= \$6.9$
$\sigma^2 = \mathrm{Var}(X) = \sum_{i=1}^{n} (x_i - \mu)^2 \cdot f(x_i)$
or equivalently
$\sigma^2 = \mathrm{Var}(X) = \sum_{i=1}^{n} x_i^2 \cdot f(x_i) - \mu^2$
Since the variance does not have the same unit as the random variable
$X$, we define the standard deviation as the square root of the variance.
$\sigma = \sqrt{\mathrm{Var}(X)} = \sqrt{\sigma^2}$
• If most of the $x_i$'s are far from the mean $\mu$, we will have
a large variance.
$\sigma^2 = \mathrm{Var}(X) = \sum_{i=1}^{n} x_i^2 \cdot f(x_i) - \mu^2 = 2^2(0.2) + 5^2(0.3) + 10^2(0.5) - 6.9^2 = 10.69$
$\sigma = \sqrt{\sigma^2} \approx \$3.270$
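The arithmetic above can be checked with a short sketch that evaluates the variance in both forms (the definition and the shortcut formula). The dictionary `pmf` simply holds the three values and probabilities from the example; the variable names are illustrative.

```python
import math

# pmf from the example: values in dollars mapped to their probabilities
pmf = {2: 0.2, 5: 0.3, 10: 0.5}

# Mean: sum of x * f(x)
mu = sum(x * p for x, p in pmf.items())

# Variance by definition: sum of (x - mu)^2 * f(x)
var_def = sum((x - mu) ** 2 * p for x, p in pmf.items())

# Variance by the shortcut: sum of x^2 * f(x), minus mu^2
var_alt = sum(x ** 2 * p for x, p in pmf.items()) - mu ** 2

# Standard deviation: square root of the variance
sigma = math.sqrt(var_def)

print(round(mu, 4), round(var_def, 4), round(var_alt, 4), round(sigma, 3))
```

Both forms agree up to floating-point rounding, which is why the results are rounded before printing.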
Properties:
1. $0 \le F(x) \le 1$
2. $F(x)$ is a non-decreasing function, i.e. $F(x_2) \ge F(x_1)$
if $x_2 > x_1$
SEHH2311 Foundations of Data Science
Uniform Distribution
A random variable $X$ with a uniform distribution $U(a, b)$ has a
constant pdf on a given interval $[a, b]$. The pdf $f(x)$ has the
following form:
$$f(x) = \begin{cases} \dfrac{1}{b - a} & \text{for } a \le x \le b \\ 0 & \text{for } x < a \text{ or } x > b \end{cases}$$
$$F(x) = \begin{cases} 0 & \text{for } x < a \\ \dfrac{x - a}{b - a} & \text{for } a \le x \le b \\ 1 & \text{for } x > b \end{cases}$$
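The two piecewise formulas translate directly into code. The sketch below is illustrative only: the function names `uniform_pdf` and `uniform_cdf` and the interval [2, 6] in the usage lines are assumptions, not part of the lecture.

```python
def uniform_pdf(x: float, a: float, b: float) -> float:
    """pdf of U(a, b): constant 1/(b - a) on [a, b], zero elsewhere."""
    if b <= a:
        raise ValueError("requires a < b")
    return 1.0 / (b - a) if a <= x <= b else 0.0


def uniform_cdf(x: float, a: float, b: float) -> float:
    """CDF of U(a, b): 0 below a, (x - a)/(b - a) on [a, b], 1 above b."""
    if b <= a:
        raise ValueError("requires a < b")
    if x < a:
        return 0.0
    if x > b:
        return 1.0
    return (x - a) / (b - a)


# Hypothetical interval [2, 6]
print(uniform_pdf(3, 2, 6))  # height of the pdf inside the interval
print(uniform_cdf(3, 2, 6))  # P(X <= 3)
print(uniform_cdf(7, 2, 6))  # P(X <= 7), beyond the interval
```

Libraries such as SciPy provide the same distribution, but writing it out by hand keeps the link to the formulas visible.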
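$$f(x) = \begin{cases} 0 & \text{for } x < -1 \\ 1 + x & \text{for } -1 \le x \le 0 \\ 1 - x & \text{for } 0 < x \le 1 \\ 0 & \text{for } x > 1 \end{cases}$$
$$F(x) = \begin{cases} 0 & \text{for } x < -1 \\ x + x^2/2 + 1/2 & \text{for } -1 \le x \le 0 \\ x - x^2/2 + 1/2 & \text{for } 0 < x \le 1 \\ 1 & \text{for } x > 1 \end{cases}$$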
$P(X \le -0.5) = F(-0.5) = -0.5 + \frac{(-0.5)^2}{2} + \frac{1}{2} = 0.125$
$P(X \le 0.2) = F(0.2) = 0.2 - \frac{(0.2)^2}{2} + \frac{1}{2} = 0.68$
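These two probabilities can be cross-checked numerically. The sketch below evaluates the closed-form CDF and compares it with a midpoint-rule approximation of the area under the pdf from -1 up to the given point. The function names and the step count are illustrative assumptions.

```python
def tri_pdf(x: float) -> float:
    """Triangular pdf on [-1, 1] from the example."""
    if -1 <= x <= 0:
        return 1 + x
    if 0 < x <= 1:
        return 1 - x
    return 0.0


def tri_cdf(x: float) -> float:
    """Closed-form CDF from the example."""
    if x < -1:
        return 0.0
    if x <= 0:
        return x + x ** 2 / 2 + 0.5
    if x <= 1:
        return x - x ** 2 / 2 + 0.5
    return 1.0


def integrate_pdf(upper: float, steps: int = 10_000) -> float:
    """Midpoint-rule approximation of the integral of tri_pdf over [-1, upper]."""
    width = (upper + 1) / steps
    return sum(tri_pdf(-1 + (k + 0.5) * width) for k in range(steps)) * width


for point in (-0.5, 0.2):
    print(point, round(tri_cdf(point), 6), round(integrate_pdf(point), 6))
```

The closed-form and numerical values should agree to several decimal places, which confirms that F is the integral of f.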
$\sigma = \sqrt{\mathrm{Var}(X)} = \sqrt{\sigma^2}$
$E\left(\frac{X + Y + Z}{3}\right) = ?$
$\mathrm{Var}\left(\frac{X + Y + Z}{3}\right) = ?$
However, $E\left(\frac{X}{Y}\right) \ne \frac{E(X)}{E(Y)}$ in general!
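A small counterexample makes the warning concrete. The setup is hypothetical and not from the lecture: X and Y are assumed independent, each taking the values 1 and 2 with probability 1/2, so every (x, y) pair has probability 1/4.

```python
from fractions import Fraction
from itertools import product

# Hypothetical example: X and Y independent, each uniform on {1, 2}
values = [1, 2]
pairs = list(product(values, values))
p = Fraction(1, len(pairs))  # each (x, y) pair has probability 1/4

e_x = sum(x * p for x, _ in pairs)                  # E(X)
e_y = sum(y * p for _, y in pairs)                  # E(Y)
e_ratio = sum(Fraction(x, y) * p for x, y in pairs) # E(X/Y)

print("E(X/Y)    =", e_ratio)
print("E(X)/E(Y) =", e_x / e_y)
```

Here E(X/Y) = 9/8 while E(X)/E(Y) = 1, so the two quantities differ even in this very simple case.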