Greenwood High School 2021 - 2022 Mathematics - Project 2: Aarav Batra Grade 9, B
2021 – 2022
Mathematics – Project 2
Statistical Representation of Surveyed Data
~ Aarav Batra
Grade 9th, B.
This project was not done only for grades; it has also built my
skills, as I did a lot of research. It taught me the real meaning
of hard work. I have gained a lot of knowledge and have become
more aware of mathematical concepts.
Objectives
Problem Description
Introduction
Procedure
Observation
Conclusion
Further Study
Bibliography
Objectives
Now let’s look at some statistical tools I will be using further in this project:
o Arrangement: Arrangement of the raw data in an ascending or descending order
makes it look more organized.
o Tabulation: Tabulation of all the data (i.e. entering all values into a table) makes
performing tasks and calculations easier.
o Central Tendencies: In statistics, a single value that is used to represent an entire
set of data is known as a measure of central tendency. Mean, Median and Mode are
central tendencies of data calculated by different methods; the Range, covered
alongside them, measures the spread of the data instead. Here are the methods to
calculate all of these:
▪ Mean: The mean of a data set is also known as the average of all terms. To calculate
it, we add up the values of all the terms and then divide by the number of terms.
▪ Median: The median of a distribution is the middle term of the data arranged in
an ascending order.
i. If the frequency (the number of terms) is odd, then the central value is the
median. To find it, we divide the successor of the frequency by two,
and the term at that place is the median.
ii. If the frequency is an even number, then we divide it by 2 and find the
mean of the two terms at that place when counted from both ends.
Example:
1,2,3,4,5,6,7,8,9,10
f = 10
10/2=5
Median = mean of the 5th terms counted from both ends
= (5+6)/2
= 11/2
= 5.5
Therefore, median is 5.5
▪ Mode: The mode of the data is the observation (value) that has the highest frequency
(occurs the highest number of times). A rough value for the mode can be calculated
as the difference between thrice the median and twice the mean. This is known
as the empirical formula; it gives only an approximate value of the mode
and not the mode itself.
▪ Range: The range of a distribution with a discrete random variable is the
difference between the maximum value and the minimum value.
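As a rough illustration, the four measures described above can be sketched in Python using only the standard library. This is not part of the original survey work; the function name `summarize` and the dictionary keys are my own choices, and the empirical value is only the approximation 3 × median − 2 × mean.

```python
from statistics import mean, median, multimode


def summarize(values):
    """Return the mean, median, mode(s), range and empirical mode of a data set."""
    ordered = sorted(values)            # arrangement in ascending order
    avg = mean(ordered)                 # sum of all terms / number of terms
    mid = median(ordered)               # middle term, or mean of the two middle terms
    modes = multimode(ordered)          # every value that has the highest frequency
    spread = ordered[-1] - ordered[0]   # maximum value - minimum value
    empirical = 3 * mid - 2 * avg       # rough estimate of the mode only
    return {
        "mean": avg,
        "median": mid,
        "modes": modes,
        "range": spread,
        "empirical_mode": empirical,
    }


# The worked median example from the text: the terms 1 to 10.
example = summarize([1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
print(example["median"])  # 5.5
```

`multimode` returns a list because a data set can have more than one value with the highest frequency.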
Raw data
Through my survey I have collected the weight, height and family size (number of
members in a family) of 28 of my classmates, which are presented below in a table.
Name Weight (kg) Height (cm) Family Members
Aarav Batra 50 165 4
Aarnav Verma 60 175 3
Aditi Sathish 50 167 4
Aishwarya Garine 72 156 5
Aman Thimmaiah 55 181 5
Arjun Jagannathan 50 155 3
Arnav Gupta 56 163 4
Charvi M 41 157 7
Deepthi Menon 51 162 4
Dishita Bajaj 70 170 4
Harshitha Reddy 55 160 6
Jivika Dialani 46 162 3
Maayan Hazra 51 174 3
Mahi Rajne 60 165 3
Manik Bhatia 55 168 4
Mihir Halapeth 50 164 4
Palak Suri 68 164 3
Rishika Reddy 51 165 4
Saketh Shuntipadi 45 164 3
Shlok Rajiv 69 170 6
Shourya Sinha 65 170 8
Shruti Vijay Kumar 62 167 5
Shubhra Chatterjee 47 162 5
Sohan Jasti 51 175 4
Sohan Shanbhag 47 163 4
Sonit Saraf 51 175 3
Tanush Bhaumik 58 157 4
Trisha Shub 53 168 4
Thus, here are the observations collected:
• Height- 165, 175, 167, 156, 181, 155, 163, 157, 162, 170, 160, 162, 174, 165,
168, 164, 164, 165, 164, 170, 170, 167, 162, 175, 163, 175, 157, 168.
• Weight- 50, 60, 50, 72, 55, 50, 56, 41, 51, 70, 55, 46, 51, 60, 55, 50, 68, 51,
45, 69, 65, 62, 47, 51, 47, 51, 58, 53.
• Family Size- 4, 3, 4, 5, 5, 3, 4, 7, 4, 4, 6, 3, 3, 3, 4, 4, 3, 4, 3, 6, 8, 5, 5, 4, 4, 3,
4, 4.
Tabulation of data and calculation of central tendencies
Step 1- First let’s arrange all the raw data in ascending order.
• Height- 155, 156, 157, 157, 160, 162, 162, 162, 163, 163, 164, 164, 164, 165, 165,
165, 167, 167, 168, 168, 170, 170, 170, 174, 175, 175, 175, 181.
• Weight- 41, 45, 46, 47, 47, 50, 50, 50, 50, 51, 51, 51, 51, 51, 53, 55, 55, 55, 56, 58,
60, 60, 62, 65, 68, 69, 70, 72.
• Family Size- 3, 3, 3, 3, 3, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 5, 5, 6, 6, 7, 8.
Step 2- Creating frequency tables to find central tendencies.
Height
Xi Fi FiXi Cumulative
155 1 155 1
156 1 156 2
157 2 314 4
160 1 160 5
162 3 486 8
163 2 326 10
164 3 492 13
165 3 495 16
167 2 334 18
168 2 336 20
170 3 510 23
174 1 174 24
175 3 525 27
181 1 181 28
∑Fi = 28 ∑FiXi = 4644
Mean = ∑FiXi / ∑Fi
=> 4644/28 = 165.86.
Median = mean of the (no. of terms/2)th terms from both sides.
=> 28/2 = 14, so mean of the 14th and 15th terms.
=> (165+165)/2 = 165.
Mode: 162, 164, 165, 170 and 175 each occur 3 times, so the data has no single mode.
Using the empirical formula, mode = 3(165)-2(165.86) = 163.29
Range = 181-155 = 26
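The height frequency table can be rebuilt from the raw observations with a short Python sketch. It is only a cross-check of the tabulation; the helper name `frequency_table` is my own, and the list simply copies the 28 surveyed heights.

```python
from collections import Counter
from itertools import accumulate

# The 28 surveyed heights (cm), in the order they were collected.
heights = [165, 175, 167, 156, 181, 155, 163, 157, 162, 170, 160, 162, 174, 165,
           168, 164, 164, 165, 164, 170, 170, 167, 162, 175, 163, 175, 157, 168]


def frequency_table(values):
    """Return rows of (Xi, Fi, Fi*Xi, cumulative frequency) sorted by Xi."""
    counts = sorted(Counter(values).items())          # (value, frequency) pairs
    cumulative = accumulate(f for _, f in counts)     # running total of Fi
    return [(x, f, f * x, c) for (x, f), c in zip(counts, cumulative)]


table = frequency_table(heights)
total_f = sum(f for _, f, _, _ in table)       # sum of Fi
total_fx = sum(fx for _, _, fx, _ in table)    # sum of Fi*Xi
mean_height = total_fx / total_f

# Every value sharing the highest frequency counts as a mode.
highest = max(f for _, f, _, _ in table)
modes = [x for x, f, _, _ in table if f == highest]

print(total_f, total_fx, round(mean_height, 2))  # 28 4644 165.86
print(modes)  # [162, 164, 165, 170, 175]
```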
Weight
Xi Fi FiXi Cumulative
41 1 41 1
45 1 45 2
46 1 46 3
47 2 94 5
50 4 200 9
51 5 255 14
53 1 53 15
55 3 165 18
56 1 56 19
58 1 58 20
60 2 120 22
62 1 62 23
65 1 65 24
68 1 68 25
69 1 69 26
70 1 70 27
72 1 72 28
∑Fi = 28 ∑FiXi = 1539
Mean = ∑FiXi / ∑Fi
=> 1539/28 = 54.96
Median = mean of the (no. of terms/2)th terms from both sides.
=> 28/2 = 14, so mean of the 14th and 15th terms.
=> (51+53)/2 = 52.
Mode = 51.
Using the empirical formula, mode = 3(52)-2(54.96) = 46.07.
Range = 72-41 = 31
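The cumulative column is what lets us locate the 14th and 15th terms without writing out all 28 values. A minimal Python sketch of that lookup, using the weight table, is given below; the helper names `nth_term` and `median_from_table` are my own and only illustrate the idea.

```python
# (Xi, Fi) pairs copied from the weight frequency table.
weight_table = [(41, 1), (45, 1), (46, 1), (47, 2), (50, 4), (51, 5), (53, 1),
                (55, 3), (56, 1), (58, 1), (60, 2), (62, 1), (65, 1), (68, 1),
                (69, 1), (70, 1), (72, 1)]


def nth_term(table, n):
    """Return the n-th term (counting from 1) using the running cumulative frequency."""
    running = 0
    for value, freq in table:
        running += freq
        if n <= running:        # the n-th term falls inside this row
            return value
    raise ValueError("n is larger than the number of observations")


def median_from_table(table):
    """Median of the data described by a sorted (value, frequency) table."""
    total = sum(freq for _, freq in table)
    if total % 2 == 1:          # odd: the single middle term
        return nth_term(table, (total + 1) // 2)
    # even: mean of the two middle terms
    return (nth_term(table, total // 2) + nth_term(table, total // 2 + 1)) / 2


median_weight = median_from_table(weight_table)
mean_weight = sum(x * f for x, f in weight_table) / sum(f for _, f in weight_table)
empirical_mode = 3 * median_weight - 2 * mean_weight

print(median_weight)             # 52.0
print(round(empirical_mode, 2))  # 46.07
```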
Class Distribution
We can also create a class distribution or frequency table and divide the data entries
into intervals and classes.
Frequency Table (Height)
Class Count Class Mark
150-155 0 152.5
155-160 4 157.5
160-165 9 162.5
165-170 7 167.5
170-175 4 172.5
175-180 3 177.5
180-185 1 182.5
185-190 0 187.5
*We can use the class mark to find mean, median, mode & range.
Xi Fi FiXi Cumulative
152.5 0 0 0
157.5 4 630 4
162.5 9 1462.5 13
167.5 7 1172.5 20
172.5 4 690 24
177.5 3 532.5 27
182.5 1 182.5 28
187.5 0 0 28
∑Fi = 28 ∑FiXi = 4670
Mean = ∑FiXi / ∑Fi
=> 4670/28 = 166.79
Median = mean of the (no. of terms/2)th terms from both sides.
=> 28/2 = 14, so mean of the 14th and 15th terms.
=> (167.5+167.5)/2 = 167.5.
Mode = 162.5.
Using the empirical formula, mode = 3(167.5)-2(166.79) = 168.92
Range = 182.5-157.5 = 25
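The grouping step can also be sketched in Python. The sketch assumes that each class includes its lower limit and excludes its upper limit (so 165 is counted in 165-170), which is the convention that reproduces the counts in the height table; the function name `grouped_table` and its parameters are my own.

```python
# The 28 surveyed heights (cm).
heights = [165, 175, 167, 156, 181, 155, 163, 157, 162, 170, 160, 162, 174, 165,
           168, 164, 164, 165, 164, 170, 170, 167, 162, 175, 163, 175, 157, 168]


def grouped_table(values, start, width, classes):
    """Return rows of (lower, upper, count, class mark) for equal-width classes."""
    rows = []
    for i in range(classes):
        lower = start + i * width
        upper = lower + width
        # Assumption: lower limit included, upper limit excluded.
        count = sum(1 for v in values if lower <= v < upper)
        rows.append((lower, upper, count, (lower + upper) / 2))
    return rows


rows = grouped_table(heights, start=150, width=5, classes=8)
counts = [count for _, _, count, _ in rows]
total_fx = sum(count * mark for _, _, count, mark in rows)   # sum of Fi*Xi
grouped_mean = total_fx / sum(counts)

print(counts)                  # [0, 4, 9, 7, 4, 3, 1, 0]
print(total_fx)                # 4670.0
print(round(grouped_mean, 2))  # 166.79
```

The grouped mean (166.79) differs slightly from the mean of the raw data (165.86) because every observation is replaced by its class mark.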
Frequency Table (Weight)
Class Count Class Mark
30-40 0 35
40-50 5 45
50-60 15 55
60-70 6 65
70-80 2 75
80-90 0 85
Xi Fi FiXi Cumulative
35 0 0 0
45 5 225 5
55 15 825 20
65 6 390 26
75 2 150 28
85 0 0 28
∑Fi = 28 ∑FiXi = 1590
Mean = ∑FiXi / ∑Fi
=> 1590/28 = 56.79
Median = mean of the (no. of terms/2)th terms from both sides.
=> 28/2 = 14, so mean of the 14th and 15th terms.
=> (55+55)/2 = 55.
Mode = 55.
Using the empirical formula, mode = 3(55)-2(56.79) = 51.42
Range = 75-45 = 30
Frequency Table (Family Size)
Class Count Class Mark
2-3 0 2.5
3-4 8 3.5
4-5 12 4.5
5-6 4 5.5
6-7 2 6.5
7-8 1 7.5
8-9 1 8.5
9-10 0 9.5
Xi Fi FiXi Cumulative
2.5 0 0 0
3.5 8 28 8
4.5 12 54 20
5.5 4 22 24
6.5 2 13 26
7.5 1 7.5 27
8.5 1 8.5 28
9.5 0 0 28
∑Fi = 28 ∑FiXi = 133
Mean = ∑FiXi / ∑Fi
=> 133/28 = 4.75
Median = mean of the (no. of terms/2)th terms from both sides.
=> 28/2 = 14, so mean of the 14th and 15th terms.
=> (4.5+4.5)/2 = 4.5
Mode = 4.5
Using the empirical formula, mode = 3(4.5)-2(4.75) = 4.
Range = 8.5-3.5 = 5
Height
Statistics
Lowest Observation 155
Highest Observation 181
Total Number of Observations 28
Number of Distinct Observations 14
Weight
Statistics
Lowest Observation 41
Highest Observation 72
Total Number of Observations 28
Number of Distinct Observations 17
Family Size
Statistics
Lowest Observation 3
Highest Observation 8
Total Number of Observations 28
Number of Distinct Observations 6
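These summary figures can be cross-checked against the raw observations with a few lines of Python. The sketch is only a verification aid; the function name `describe` and its dictionary keys are my own.

```python
# Raw observations copied from the survey.
weights = [50, 60, 50, 72, 55, 50, 56, 41, 51, 70, 55, 46, 51, 60, 55, 50, 68, 51,
           45, 69, 65, 62, 47, 51, 47, 51, 58, 53]
family_sizes = [4, 3, 4, 5, 5, 3, 4, 7, 4, 4, 6, 3, 3, 3, 4, 4, 3, 4, 3, 6, 8, 5,
                5, 4, 4, 3, 4, 4]


def describe(values):
    """Return the lowest, highest, total and distinct counts for a data set."""
    return {
        "lowest": min(values),
        "highest": max(values),
        "total": len(values),
        "distinct": len(set(values)),   # a set keeps each value only once
    }


weight_stats = describe(weights)
family_stats = describe(family_sizes)

print(weight_stats)  # {'lowest': 41, 'highest': 72, 'total': 28, 'distinct': 17}
print(family_stats)  # {'lowest': 3, 'highest': 8, 'total': 28, 'distinct': 6}
```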
Histograms and frequency polygons.
*We add one class before the lowest class and one after the highest class to make the graph.
[Figures: histograms and frequency polygons for Height, Weight and Family Size]
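Since the graphs themselves are images, here is a small Python sketch of the numbers behind them for the height classes. It uses no plotting library: it lists the frequency polygon points (class mark, count), including the extra empty class at each end, and draws a simple text histogram. The function names `polygon_points` and `text_histogram` are my own.

```python
# ((lower, upper), count) for each height class, including the empty end classes.
height_classes = [((150, 155), 0), ((155, 160), 4), ((160, 165), 9), ((165, 170), 7),
                  ((170, 175), 4), ((175, 180), 3), ((180, 185), 1), ((185, 190), 0)]


def polygon_points(classes):
    """Return the frequency polygon vertices as (class mark, frequency) pairs."""
    return [((lower + upper) / 2, count) for (lower, upper), count in classes]


def text_histogram(classes):
    """Return one text line per class, with one '#' per observation."""
    return [f"{lower}-{upper} | {'#' * count} {count}"
            for (lower, upper), count in classes]


points = polygon_points(height_classes)
bars = text_histogram(height_classes)

print(points[0], points[-1])  # (152.5, 0) (187.5, 0)
for line in bars:
    print(line)
```

The first and last points have a frequency of 0, which is why the polygon starts and ends on the horizontal axis.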
The mean using the Step-Deviation Method has come out to be 4.25, which is
exactly the same as the mean we found earlier.
*Please note: in all three scenarios the mean came out exactly the same as when calculated
earlier using the proper method, but this might not always be the case, and thus the value of the
mean found using any of these methods may vary very slightly.
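A minimal Python sketch of the Step-Deviation Method for the family size data is shown below. The assumed mean A = 4 and the step h = 1 are my own choices for illustration, since the original working is not reproduced here; the frequencies are taken from the arranged family size data.

```python
# (Xi, Fi) pairs for family size, counted from the arranged data.
family_table = [(3, 8), (4, 12), (5, 4), (6, 2), (7, 1), (8, 1)]


def step_deviation_mean(table, assumed_mean, step):
    """Mean = A + h * (sum of Fi*ui / sum of Fi), where ui = (Xi - A) / h."""
    total_f = sum(f for _, f in table)
    total_fu = sum(f * (x - assumed_mean) / step for x, f in table)
    return assumed_mean + step * total_fu / total_f


def direct_mean(table):
    """Mean = sum of Fi*Xi / sum of Fi."""
    return sum(x * f for x, f in table) / sum(f for _, f in table)


# Assumption: A = 4 (the most frequent value) and h = 1.
mean_by_steps = step_deviation_mean(family_table, assumed_mean=4, step=1)
mean_direct = direct_mean(family_table)

print(mean_by_steps, mean_direct)  # 4.25 4.25
```

Here the sum of Fi*ui is 7, so the mean is 4 + 7/28 = 4.25, the same value as the direct mean 119/28.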
Bibliography
https://www.google.co.in/
https://quillbot.com/
https://www.socscistatistics.com/
https://statisticsbyjim.com/basics/importance-statistics/
https://www.ilovepdf.com/
https://www.iloveimg.com/
https://www.photopea.com/