0% found this document useful (0 votes)
25 views14 pages

Lecture 3

The document discusses measures of central tendency, variation, and properties of the normal distribution. It defines mean, mode, median, range, variance, and standard deviation. It explains that a normal distribution is symmetrical, unimodal, and bell-shaped with the mean, mode, and median in the center. It also discusses how to calculate what percentage of values fall within a given number of standard deviations from the mean.

Uploaded by

bida22-016
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views14 pages

Lecture 3

The document discusses measures of central tendency, variation, and properties of the normal distribution. It defines mean, mode, median, range, variance, and standard deviation. It explains that a normal distribution is symmetrical, unimodal, and bell-shaped with the mean, mode, and median in the center. It also discusses how to calculate what percentage of values fall within a given number of standard deviations from the mean.

Uploaded by

bida22-016
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Lesson 3

Recap – Normal Distribution


Stewart Muchuchuti
Charles Kaondera

Botswana Accountancy College


Outline
• There are three different MEASURES of CENTRAL TENDENCY
• Mean=arithmetic average of a set of scores
• (add all of the scores and divide by the number of scores you used)
• (i.e., 3, 11, 3, 4, 7, 8 –> 3 + 11 + 3 + 4 + 7 + 8 = 36 –> 36 divided by 6
(number of scores) = 6)
• Mode= score that appears most often
• (i.e., 3, 11, 3, 4, 7, 8 –> there are more 3s than any other value so 3 is the
mode)
• Median= middle score when the scores are put in order of value
• (i.e., 3, 11, 3, 4, 7, 8 –> 3, 3, 4, 7, 8, 11 (scores in order) –> the middle is
between 4 and 7 so the median is 5.5 (half way between 4 and 7)
Big Data Analytics, 2
Outline
• MEASURES of VARIATION (Spread)

• Range=difference between the high and low score (i.e., 3, 11, 3, 4, 7, 8 –>
3, 3, 4, 7, 8, 11 (scores in order) –> 11 (high) minus 3 (low) = 8
• Variance=average of the squared distance each score is from the mean
• (i.e., 3, 11, 3, 4, 7, 8)

Big Data Analytics, 3


Outline
• MEASURES of VARIATION (Spread)

• (The mean is 6 and we subtract 6 from each score and then square that
answer) –>
• 3 – 6 = -3 –> -3 X -3 = 9
• 11 – 6 = 5 –> 5 X 5 = 25
• 3 – 6 = -3 –> -3 X -3 = 9
• 4 – 6 = -2 –> -2 X -2 = 4
• 7 – 6 = 1 –> 1 X 1 = 1
• 8 – 6 = 2 –> 2 X 2 = 4

Big Data Analytics, 4


Outline
• MEASURES of VARIATION (Spread)

• Now we add all of the squared scores) –> 9 + 25 + 9 + 4 + 1 + 4 = 52


• (Next we divide this total (total of the squared distance each score was from
the mean) by the number of scores) –> 52 divided by 6 (number of scores) =
8.67
• NOTE: We divide by number of scores if the data is from the entire
population…we divide by the number of scores minus 1 (6-1=5) if the data is
from a sample.
• If our data were from a sample the variance would be 52 divided by 5 (52
divided by 5 = 10.4)

Big Data Analytics, 5


Outline
• MEASURES of VARIATION (Spread)

• Standard Deviation =square root of the variance


• (i.e., 3, 11, 3, 4, 7, 8 –> has a variance of 8.67 for a population so the
standard deviation the square root of 8.67 or 2.94 (use a calculator to find
it))
• (i.e., 3, 11, 3, 4, 7, 8 –> has a variance of 10.4 for a sample so the standard
deviation is the square root of 10.4 or 3.22 (use a calculator to find it))

Big Data Analytics, 6


Outline
• MEASURES of VARIATION (Spread)

Big Data Analytics, 7


Outline
• Properties of a Normal Distribution

Big Data Analytics, 8


Outline
• Properties of a Normal Distribution

1.Unimodal (one mode)


2.Symmetrical (left and right halves are mirror images)– just as many people
above the mean as below the mean
3.Bell shaped (maximum height (mode) at the mean)
4.Mean, Mode, and Median are all located in the center
5.Asymptotic (the further the curve goes from the mean, the closer it gets to
the X axis; but the curve never touches the X axis)

Big Data Analytics, 9


Outline
• Properties of a Normal Distribution

1.Assuming that we have a normal distribution, it is easy to calculate what


percentage of students who are between 1.5 standard deviations above the
mean and 2.5 standard deviations above the mean.
2.To do this, use the Area Under the Normal Curve Calculator at
http://davidmlane.com/hyperstat/z_table.html.

Big Data Analytics, 10


Outline
• Properties of a Normal Distribution

Big Data Analytics, 11


Outline
• Properties of a Normal Distribution

A z score indicates the number of standard deviations a corresponding raw score is


above or below the mean
z score –> subtract the mean from the raw score and divide that answer by the
standard deviation
(i.e., raw score=5, mean=8, standard deviation=2 –> 5 – 8 = -3 –> -3 divided by 2 = -
1.5)

T score (transformed score — a.k.a. Z score) –> multiple the z score by 10 and add 50
(i.e., z score=-1.5 –> -1.5 X 10 = -15 –> -15 + 50 = 35)

Big Data Analytics, 12


Outline
• Scores don’t always form a normal distribution

The skew is the tail. If the tail (skew) is on the left (negative side), we have a negatively skewed distribution. That means
that more of the subjects scored on the high end (because most of the people are not in the tail where the low scores
are)..

If the skew (tail) is on the right (positive side), we have a positive skew. That means more people scored low (because most
of the people are not in the tail where the high scores are

Big Data Analytics, 13


Outline
• Scores don’t always form a normal distribution

Sometimes most of the scores are in the middle, we then have a leptokurtic distribution.

Sometimes the scores have a large spread without a lot of people in the middle, we then have a platykurtic
distribution.

If a set of scores does not form a normal distribution (skewed), then the characteristics of the normal curve do not
apply. For example, 68% of the scores would not fall within one standard deviation of the mean if the distribution were
negatively skewed.
Big Data Analytics, 14

You might also like