Al-Andalus International School (A.A.I.S)
(American Division) Math Department
Answers - Revision packets
Statistics
For Grade (12) – T (3)
Correlation & Regression
Academic Year: 2023-2024
Student Name : ……………………………..
Class : ……………………….
Topics of Trimester Three
Correlation and Regression
(Unit one) - Correlation
Introduction
The meaning of correlation
The scatter diagram
Types of correlation: −1 ≤ r ≤ 1 (Perfect – Strong – Weak)
(i) If r is positive: direct correlation
(ii) If r is negative: inverse correlation
Pearson's Correlation Coefficient of Ungrouped Data
$$r_p = \frac{n\sum XY - (\sum X)(\sum Y)}{\sqrt{n\sum X^2 - (\sum X)^2}\;\sqrt{n\sum Y^2 - (\sum Y)^2}}$$
Spearman's Rank Correlation Coefficient
$$r_s = 1 - \frac{6\sum D^2}{n(n^2 - 1)}$$
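(Unit two) Regression:
The regression line
The equation of the regression line of Y on X: Y = a + bX
where
$$b = \frac{n\sum XY - (\sum X)(\sum Y)}{n\sum X^2 - (\sum X)^2}, \qquad a = \frac{\sum Y - b\sum X}{n}$$
The error in the value of y = |Y − Ŷ|, where Y is the observed (table) value and Ŷ is the value given by the regression equation.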
Correlation
Types of correlation: −1 ≤ r ≤ 1
(1) The strongest correlation coefficient of the following (the one with the largest absolute value) is:
(a)* −0.8    (b) −0.5    (c) 0.4    (d) 0.7
(2) Complete the following
Pearson's Correlation Coefficient
(3) Find the Pearson correlation coefficient between the two variables X and Y and identify its type, where: ∑x = 68, ∑y = 36, ∑xy = 348, n = 8, ∑x² = 620, ∑y² = 204.
Solution:
$$r_p = \frac{8 \times 348 - 68 \times 36}{\sqrt{8 \times 620 - (68)^2}\;\sqrt{8 \times 204 - (36)^2}} = \frac{336}{\sqrt{336}\,\sqrt{336}} = 1$$
The type is perfect direct correlation.
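The substitution in exercise (3) can be checked with a short script. This is only a sketch: the function name `pearson_from_sums` and its parameter names are illustrative and do not come from the packet.

```python
from math import sqrt

def pearson_from_sums(n, sum_x, sum_y, sum_xy, sum_x2, sum_y2):
    """Pearson's r for ungrouped data, computed from the summary sums."""
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt(n * sum_x2 - sum_x ** 2) * sqrt(n * sum_y2 - sum_y ** 2)
    return numerator / denominator

# Sums given in exercise (3)
r = pearson_from_sums(n=8, sum_x=68, sum_y=36, sum_xy=348, sum_x2=620, sum_y2=204)
print(round(r, 2))  # 1.0
```

A value of 1 (up to floating-point rounding) agrees with the answer "perfect direct correlation".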
(4) Calculate the Pearson correlation coefficient between x and y and determine its type.
x    6   5   7   8   7   6
y    4   7   5   6   8   7
∑X = 39, ∑Y = 37, ∑X² = 259, ∑Y² = 239, ∑XY = 240
Solution:
$$r_p = \frac{6 \times 240 - 39 \times 37}{\sqrt{6 \times 259 - (39)^2}\;\sqrt{6 \times 239 - (37)^2}} = \frac{-3}{\sqrt{33}\,\sqrt{65}} \approx -0.06$$
The type is weak inverse correlation.
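When only the raw values are given, as in exercise (4), the five sums have to be built first. The sketch below does this with plain Python lists; the variable names are illustrative.

```python
from math import sqrt

# Data from exercise (4)
x = [6, 5, 7, 8, 7, 6]
y = [4, 7, 5, 6, 8, 7]
n = len(x)

# Column totals of the usual X, Y, X^2, Y^2, XY working table
sum_x, sum_y = sum(x), sum(y)
sum_x2 = sum(v * v for v in x)
sum_y2 = sum(v * v for v in y)
sum_xy = sum(a * b for a, b in zip(x, y))

r = (n * sum_xy - sum_x * sum_y) / (
    sqrt(n * sum_x2 - sum_x ** 2) * sqrt(n * sum_y2 - sum_y ** 2)
)

print(sum_x, sum_y, sum_x2, sum_y2, sum_xy)  # 39 37 259 239 240
print(round(r, 2))                           # -0.06
```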
(5)
(6)
(7) If ∑x = 41, ∑y = 55, ∑xy = 362, n = 8, ∑x² = 256, ∑y² = 523, then calculate the Pearson correlation coefficient.
Solution: r ≈ 0.98, so the correlation is direct.
Regression line: y = a + bx, with b ≈ 1.7 and a ≈ −1.8 (a computed with the rounded value b = 1.7), so y = −1.8 + 1.7x
(8) If ∑x = 49, ∑y = 45, ∑xy = 320, n = 7, ∑x² = 359, ∑y² = 523, then find the equation of the regression line.
$$b = \frac{n\sum XY - (\sum X)(\sum Y)}{n\sum X^2 - (\sum X)^2}, \qquad a = \frac{\sum Y - b\sum X}{n}$$
The equation of the regression line of Y on X is Y = a + bX
$$b = \frac{7 \times 320 - 49 \times 45}{7 \times 359 - (49)^2} = \frac{35}{112} = \frac{5}{16} \approx 0.31, \qquad a = \frac{45 - 0.3125 \times 49}{7} \approx 4.24$$
The equation of the regression line of Y on X is: Y = 4.24 + 0.31X
Correlation : Spearman ranks correlation
(9)
------------------------------------------------------------------------------------------------------
(10) Find the Spearman rank correlation coefficient.
Math         very good   excellent   weak   good   weak   pass   weak
Statistics   pass        very good   weak   pass   good   pass   weak
Solution:
(11)
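(12) From the following data, the Spearman rank correlation coefficient is ………
X      10    7     8    7     6     4
Y      5     8     7    9     9     10
Solution: rank each variable from largest (rank 1) to smallest; tied values share the average of their ranks.
R(X)   1     3.5   2    3.5   5     6
R(Y)   6     4     5    2.5   2.5   1
|D|    5     0.5   3    1     2.5   5
∑D² = 5² + 0.5² + 3² + 1² + 2.5² + 5² = 66.5
$$r_s = 1 - \frac{6\sum D^2}{n(n^2 - 1)} = 1 - \frac{6 \times 66.5}{6(6^2 - 1)} = -0.9$$
The type is inverse correlation.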
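The ranking step in exercise (12) is where most slips happen, so a small check may help. The sketch below assumes ranks are assigned from largest (rank 1) to smallest and that tied values share the average rank, which reproduces the ∑D² in the solution. The helper names `average_ranks` and `spearman` are illustrative.

```python
def average_ranks(values):
    """Rank from largest (rank 1) to smallest; ties share the mean rank."""
    ordered = sorted(values, reverse=True)
    ranks = {}
    for v in set(values):
        first = ordered.index(v) + 1        # first 1-based position of v
        count = ordered.count(v)            # how many times v occurs
        ranks[v] = first + (count - 1) / 2  # mean of the tied positions
    return [ranks[v] for v in values]

def spearman(x, y):
    """Return (sum of D^2, r_s) using the 6*sum(D^2) formula."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    sum_d2 = sum((p - q) ** 2 for p, q in zip(rx, ry))
    return sum_d2, 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

# Data from exercise (12)
sum_d2, r = spearman([10, 7, 8, 7, 6, 4], [5, 8, 7, 9, 9, 10])
print(sum_d2, round(r, 2))  # 66.5 -0.9
```

Library routines such as `scipy.stats.spearmanr` apply Pearson's formula to the ranks, so with tied values they can return a slightly different number from the 6∑D² formula used in this packet.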
(13) Calculate the Spearman rank correlation coefficient between X and Y and state its type.
R(X)   2         2         4      6           5         2
X      V. good   V. good   good   weak        pass      V. good
Y      good      pass      good   excellent   V. good   pass
R(Y)   3.5       5.5       3.5    1           2         5.5
Solution: ∑D² (sum of squared rank differences) = 1.5² + 3.5² + 0.5² + 5² + 3² + 3.5² = 61
$$r_s = 1 - \frac{6\sum D^2}{n(n^2 - 1)} = 1 - \frac{6 \times 61}{6(6^2 - 1)} \approx -0.7$$
The type is inverse correlation.
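For descriptive grades, as in exercise (13), the same calculation works once the grades are put in order. The sketch below assumes the order excellent > V. good > good > pass > weak, which matches the R(X) and R(Y) rows of the table. The numeric scores in `GRADE_SCORE` are arbitrary placeholders used only for sorting.

```python
# Assumed ordering of the grades (higher score = better grade)
GRADE_SCORE = {"excellent": 5, "V. good": 4, "good": 3, "pass": 2, "weak": 1}

def average_ranks(scores):
    """Rank from best (rank 1) to worst; tied scores share the mean rank."""
    ordered = sorted(scores, reverse=True)
    return [ordered.index(s) + 1 + (ordered.count(s) - 1) / 2 for s in scores]

# Data from exercise (13)
x = ["V. good", "V. good", "good", "weak", "pass", "V. good"]
y = ["good", "pass", "good", "excellent", "V. good", "pass"]

rx = average_ranks([GRADE_SCORE[g] for g in x])
ry = average_ranks([GRADE_SCORE[g] for g in y])
n = len(x)
sum_d2 = sum((p - q) ** 2 for p, q in zip(rx, ry))
r = 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

print(rx)                    # [2.0, 2.0, 4.0, 6.0, 5.0, 2.0]
print(ry)                    # [3.5, 5.5, 3.5, 1.0, 2.0, 5.5]
print(sum_d2, round(r, 1))   # 61.0 -0.7
```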
(14)
(15) Find the equation of the regression line of Y on X if: ∑x = 79, ∑y = 179, ∑x² = 1331, ∑xy = 2899, n = 7
Solution:
$$b = \frac{n\sum XY - (\sum X)(\sum Y)}{n\sum X^2 - (\sum X)^2}, \qquad a = \frac{\sum Y - b\sum X}{n}$$
The equation of the regression line of Y on X is: Y = a + bX
$$b = \frac{7 \times 2899 - 79 \times 179}{7 \times 1331 - (79)^2} = \frac{6152}{3076} = 2, \qquad a = \frac{179 - 2 \times 79}{7} = 3$$
The equation is: Y = 3 + 2X
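The two formulas for b and a translate directly into code. This is a minimal sketch with an illustrative function name, checked against the sums of exercise (15).

```python
def regression_from_sums(n, sum_x, sum_y, sum_xy, sum_x2):
    """Return (a, b) for the regression line of Y on X: Y = a + b*X."""
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    a = (sum_y - b * sum_x) / n   # a uses the slope b just computed
    return a, b

# Sums given in exercise (15)
a, b = regression_from_sums(n=7, sum_x=79, sum_y=179, sum_xy=2899, sum_x2=1331)
print(a, b)  # 3.0 2.0
```

Note that b must be computed first, because the formula for a depends on it.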
(16) Find the equation of the regression line of Y on X.
x    6   5   7   8   10   6   7
y    4   7   5   6   8    7   8
Solution:
X     Y     X²    Y²    XY
6     4     36    16    24
5     7     25    49    35
7     5     49    25    35
8     6     64    36    48
10    8     100   64    80
6     7     36    49    42
7     8     49    64    56
Sum:  ∑X = 49, ∑Y = 45, ∑X² = 359, ∑Y² = 303, ∑XY = 320
$$b = \frac{n\sum XY - (\sum X)(\sum Y)}{n\sum X^2 - (\sum X)^2}, \qquad a = \frac{\sum Y - b\sum X}{n}$$
The equation of the regression line of Y on X is: Y = a + bX
$$b = \frac{7 \times 320 - 49 \times 45}{7 \times 359 - (49)^2} = \frac{35}{112} = \frac{5}{16} \approx 0.31, \qquad a = \frac{45 - 0.3125 \times 49}{7} \approx 4.24$$
The equation of the regression line of Y on X is: Y = 4.24 + 0.31X
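The working table of exercise (16) can be reproduced from the raw data to confirm the column sums and the fitted line. The sketch below uses illustrative variable names.

```python
# Data from exercise (16)
x = [6, 5, 7, 8, 10, 6, 7]
y = [4, 7, 5, 6, 8, 7, 8]
n = len(x)

# Column totals of the X, Y, X^2, XY working table
sum_x, sum_y = sum(x), sum(y)
sum_x2 = sum(v * v for v in x)
sum_xy = sum(p * q for p, q in zip(x, y))

b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
a = (sum_y - b * sum_x) / n

print(sum_x, sum_y, sum_x2, sum_xy)  # 49 45 359 320
print(b, round(a, 2))                # 0.3125 4.24
```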
(17) Solution: 1(b) – 2(b) – 3(a) – 4(a) – 5(d) – 6(d)
(18) Find the regression line equation of Y on X if:
The equation of the regression line of Y on X: Y = a + bX
where
$$b = \frac{n\sum XY - (\sum X)(\sum Y)}{n\sum X^2 - (\sum X)^2} = 0.9 \qquad \text{and} \qquad a = \frac{\sum Y - b\sum X}{n} = -0.2$$
The equation is: Y = −0.2 + 0.9X
(19) Solution: 1(a) – 2(c) – 3(a) – 4(d) – 5(b) – 6(r = −1)
(20) Solution: 1(b) – 2(c) – 3(a) – 4(c) – 5(a) – 6(c)
(3)
(4)
(5)
(6)
(7) The scatter diagram representing inverse correlation is figure:
(8 )
(9)
(Perfect inverse)
(10)
Complete:
(1) If the regression line equation of Y on X is Ŷ = 0.2x + 3 and the value of Y when x = 5 is 4.6, then the error in the value of y = ……
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
Ŷ = 0.2(5) + 3 = 4, so E = |4.6 − 4| = 0.6
(2) If the regression line equation of Y on X is Ŷ = 0.2x + 4 and the value of Y when x = 10 is 3.8, then the error in the value of y = ……
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
Ŷ = 0.2(10) + 4 = 6, so E = |3.8 − 6| = 2.2
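Exercises (1) and (2) follow the same two steps: evaluate the regression equation at x, then take the absolute difference from the observed value. A small sketch, with an illustrative function name and parameter names:

```python
def prediction_error(a, b, x, y_observed):
    """Error in y: |observed Y - fitted value| for the line Y = a + b*x."""
    y_fitted = a + b * x
    return abs(y_observed - y_fitted)

e1 = prediction_error(a=3, b=0.2, x=5, y_observed=4.6)    # exercise (1)
e2 = prediction_error(a=4, b=0.2, x=10, y_observed=3.8)   # exercise (2)
print(round(e1, 1), round(e2, 1))  # 0.6 2.2
```

The results are rounded only to hide floating-point noise; the exact answers are 0.6 and 2.2.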
(3) If the regression line equation is y = 4.2 + 0.3x, then the error in the value of y at the point (10, 8) is ……
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
Ŷ = 4.2 + 0.3(10) = 7.2, so E = |8 − 7.2| = 0.8
(4) If the error at the point (8, 6) is 0.3, then the value of y satisfying the regression equation is …….
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
0.3 = |6 − Ŷ|, so Ŷ = 6.3
(5) If the error at the point (10, k) is 0.4 and the value of y satisfying the regression equation is 12.6, then the value of k is ……..
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
0.4 = |k − 12.6|, so k = 13
(6) If the regression line equation is y = 0.4x + 3, then the error in the value of y at the point (5, 3.2) is ……
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
Ŷ = 0.4(5) + 3 = 5, so E = |3.2 − 5| = 1.8
(7) If the error at the point (5, m) is 0.6 and the value of y satisfying the regression equation is 14.5, then the value of m is ……..
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
0.6 = |m − 14.5|, so m = 15.1
(8) If the error at the point (2, 8) is 1.4, then the value of y satisfying the regression equation is ……
Solution: (the error of y) E = |Y (table point) − Ŷ (regression equation)|
1.4 = |8 − Ŷ|, then Ŷ = 6.6
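Exercises (4), (5), (7) and (8) work backwards from a known error. Because the error is an absolute value, an equation of the form |known − v| = E is satisfied by two values, v = known − E and v = known + E. The answers above give one of the two in each case; which one applies depends on whether the point lies above or below the regression line, which these exercises do not state. The sketch below simply lists both candidates; the function name is illustrative.

```python
def candidate_values(known, error):
    """Both values v that satisfy |known - v| = error (rounded for display)."""
    return (round(known - error, 10), round(known + error, 10))

print(candidate_values(6, 0.3))     # exercise (4): (5.7, 6.3)
print(candidate_values(12.6, 0.4))  # exercise (5): (12.2, 13.0)
print(candidate_values(14.5, 0.6))  # exercise (7): (13.9, 15.1)
print(candidate_values(8, 1.4))     # exercise (8): (6.6, 9.4)
```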
(9) If the regression line equation of Y on X is Ŷ = 0.2x + 3 and the value of Y when x = 5 is 4.6, then the error in the value of y = ……
Solution: Error = |4.6 − 4| = 0.6
(10) If D is the difference between the ranks of corresponding values of two variables x and y, and ∑D² = 0, then the correlation coefficient (r) between x and y equals …..
Solution: r = 1 − 6(0) / (n(n² − 1)) = 1
(11) If the equation of the regression line is ŷ = 4.2 + 0.3x, the error at x = 10 (observed y = 8) is …..
Solution: ŷ = 4.2 + 0.3(10) = 7.2, so Error = |8 − 7.2| = 0.8
(12) If the error at the point (8, 6) is 0.3, then the value satisfying the regression equation is ……
Solution: Error = |y (table value) − ŷ (regression equation)|
0.3 = |6 − ŷ|, then ŷ (regression equation) = 6.3
(13) If the error at the point (10, k) is 0.4 and the value satisfying the regression equation is 12.6, then the value of k = ……
Solution: Error = |y (table value) − ŷ (regression equation)|
0.4 = |k − 12.6|, then k = 13