0% found this document useful (0 votes)

38 views31 pages

Probabilistik Dan Proses Stokastik

The document discusses various statistical concepts for representing and analyzing data, including histograms, measures of central tendency (mean, median), measures of spread (range, interquartile range), and outliers. It provides examples to demonstrate how to calculate the median, quartiles, IQR, and identify outliers for a data set. Box and whisker plots are also introduced as a way to visually depict these aspects of a data distribution.

Uploaded by

faris

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views31 pages

Probabilistik Dan Proses Stokastik

Uploaded by

faris

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

Probabilistik dan Proses

Stokastik

Todays Agenda
Continue from data Representation
Histogram

Center and Spread of Data

Quartiles
Box and Whisker Plot
Outliers

Data Representation (Example)

89 84 87 81 89 86 91 90 78 89 87 99 83 89
Sort this data
78 81 83 84 86 87 87 89 89 89 89 90 91 99
Group this data
Make 5 groups
Group

No of Elements

75 - 79

80 - 84

85 - 89

90 - 94

94 - 99

Data Representation (Example)

78 81 83 84 86 87 87 89 89 89 89 90 91 99
Representing the same data in stem and leaf
plot,

Stem

Leaf

134

6779999

Data Representation (Example)

78 81 83 84 86 87 87 89 89 89 89 90 91 99
Counting how many leaves a certain stem
has, we write that number in the left most
column, and call it absolute frequency
Absolute
frequency

Stem

Leaf

134

677999
9

Data Representation (Example)

78 81 83 84 86 87 87 89 89 89 89 90 91 99
To find the cumulative absolute frequency, we
add up the absolute frequencies up to the line
of the leaf
Cumulativ
e Absolute
frequency

Absolute
frequency

Group

No of
Elements

134

677999
9

Data Representation (Example)

Cumulative
Absolute
frequency

Absolute
frequency

Group

No of
Elements

134

6779999

Individual entries of left most column in stem

and leaf plot are called Cumulative Absolute
Frequency CAS, i. e. the sum of the absolute
frequencies of values up to the line of the
leaf.
For example, 11 shows that there are 11 values in
the data not exceeding 89.

Data Representation (Example)

Dividing the absolute frequency by n (total
number of entries in the data) gives Relative

class Frequency

In the present example there are total 14

entries, therefore, relative frequency is
calculated as
Group

Abs. Freq

Relative C.
Frequency

75 - 79

1/14

80 - 84

3/14

85 - 89

7/14

90 - 94

2/14

94 - 99

1/14

Relative frequency
How Relative class Frequency is used for data
representation?

Histogram
Area of the rectangles are proportional to the
relative frequency.
Grou
p

Abs.
Freq

Rel. Freq

75 79

1/14

0.07

80 84

3/14

0.21

85 89

7/14

0.50

90 94

2/14

0.14

0,10

94 99

1/14

0.07

0,00

0,60
0,50
0,40
0,30
0,20

75 - 79 80 - 84 85 - 89 90 - 94 94 - 99

Histogram
What information does Histogram?
The data was
78 81 83 84 86 87 87 89 89 89 89 90 91 99
0,60
0,50
0,40
0,30

0,20
0,10
0,00
75 - 79

80 - 84

85 - 89

90 - 94

94 - 99

Histogram
What information does Histogram?
It give us a clear picture where is the
concentration of data
Or we can say, which way the data is inclined

Progress so far?
We have studied,
absolute frequencies
Relative frequency
And how to use it in plotting histogram

Data
We have collected data and we want to
analyze it,
We take the previous data
89 84 87 81 89 86 91 90 78 89 87 99 83 89
Sorting this data we get
78 81 83 84 86 87 87 89 89 89 89 90 91 99

Center and Spread of Data

As a center of the location of data values we can
take a median.
78 81 83 84 86 87 87 89 89 89 89 90 91 99
There are total 14 values
As in the present data set we have even number
of values so there is no center value
But we have 87 and 89 as middle values (7th and
8th) so

We take the median as

(87+89)/2
=88
Therefore, The median is 88.

Remember Median may not be present in the data.

Median Cont..
Take another example
51 54 55 55 57 62 63 63 69
There are total 9 values
As in the present data set we have ODD
number of values so there is a center value
The center value is 57

Therefore, The median is 57.

Notice in this example Median is present in

the data.

Median Cont..
Take another example
51 54 55 55 56 57 62 63 63 69
There are total 10 values
As in the present data set we have even
number of values so there is no center value
But we have 56 and 57 as middle values (5th
and 6th) so

We take the median as

(56+57)/2
=56.5
Therefore, The median is 56.5.

Remember Median may have decimal places.

Spread of Data
Spread of data can be measured by the range
Spread is also called variability.
Spread = maximum value minimum value
Example data
78 81 83 84 86 87 87 89 89 89 89 90 91 99
In this case spread is 99 78 = 21.

Spread of Data
Example data
51 54 55 55 57 62 63 63 69
In this case spread is 69 51 = 18.

Example1
3, 13, 7, 5, 21, 23, 39, 23, 40, 23, 14, 12, 56, 2
3, 29
putting data in order
3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 39, 4
0, 56
Total value are 15, 8th value is in the middle.
The median value turns out to 23
The spread 56 3 = 53

Example1
3, 13, 7, 5, 21, 23, 23, 40, 23, 14, 12, 56, 23, 29
Here we have even number of elements in data.
Putting this data in order
3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 40, 56
n = 14
3, 5, 7, 12, 13, 14, 21, 23, 23, 23, 23, 29, 40, 56
Median is found by (21 + 23)/2 = 22 i.e. by taking
mean value of two middle values.
The spread 56 3 = 53
Median separates the data in two equal halves.

Quartiles
With Quartiles data is divided in 4 groups in
the same manner as we do for median.
There are three quartiles in data called
Lower Quartile ql (median of the lower half of the
data)
Middle Quartile qm(median of the data)
Upper Quartile qu (median of the upper half of the
data)

Interquartile Range IQR can be found by

IQR = qu - ql

Example2

78 81 83 84 86 87 87 89 89 89 89 90 91 99
Lower half of data is
78 81 83 84 86 87 87
Lower Quartile is 84
Upper half of data is
89 89 89 89 90 91 99
Lower Quartile is 89
Middle Quartile (same as median) is 88
IQR (interquartile range) = 89 84 = 5

Box and Whisker Plot

Also called Box Plot
Box plot is obtained by 5 values of data.
Minimum value of the data
Three quartiles
Maximum value of the data

Example2
78 81 83 84 86 87 87 89 89 89 89 90 91 99
Middle Quartile is 88
Lower half of data is
78 81 83 84 86 87 87
Lower Quartile is 84

Upper half of data is

89 89 89 89 90 91 99
Upper Quartile is 89
IQR = 89 84 = 5

Outliers
Lets say an experiment was performed in
which time was noted for a toy parachute to
land on the ground from a fixed height. The
experiment was repeated 10 times, under
similar conditions
The data was recorded as
14 13 15 16 5 27 16 11 12 22

Outliers
14 13 15 16 5 27 16 11 12 22
Sorting this data
5 11 12 13 14 15 16 16 22 27
Remember we said that the same experiment is
repeated 10 times under the same
conditions, then the time take should be same in
all the cases and we should have the same
number 10 times,
However due to unavoidable delay in the response
of the human in clicking the stop watch, we have
varied data,
But some of the data is completely out of sink
with the rest of the data.
The data which is not representative of the rest
of the data is called OUTLIERS

Outliers
An outlier is a value that appears to be
uniquely different from the rest of the data
set.
It might indicate that something went wrong
with the data collection process
The outlier is normally defined as a value
more than a distance of 1.5 IQR, from either
end of the box.

Outliers

Coming back to the data

14 13 15 16 5 27 16 11 12 22
Sorting this data
5 11 12 13 14 15 16 16 22 27
Middle quartile = 14.5
Lower quartile = 12
Upper quartile = 16
Spread = 27-5 = 22
IQR = 16-12 = 4
1.5xIQR = 1.5x4 = 6
Therefore all values above upper quartile +6
16+6 = 22, are outliers as is 27

Outliers

Coming back to the data

14 13 15 16 5 23 16 11 12 22
Sorting this data
5 11 12 13 14 15 16 16 22 23
Middle quartile = 14.5
Lower quartile = 12
Upper quartile = 16
Spread = 23-7 = 16
IQR = 16-12 = 4
1.5xIRQ = 1.5x4 = 6
Therefore all values below (lower quartile -6)
12-6 = 6, are outliers as is 5

References
1: Advanced Engineering Mathematics by E
Kreyszig 8th edition

Design of Experiments, Principles and Applications
100% (2)
Design of Experiments, Principles and Applications
350 pages
Statistics Full Report UTHM
100% (1)
Statistics Full Report UTHM
39 pages
Statistics and Probability Theory: Fasih Ur Rehman
No ratings yet
Statistics and Probability Theory: Fasih Ur Rehman
17 pages
DM Lec2 Getting To Know Your Data
No ratings yet
DM Lec2 Getting To Know Your Data
34 pages
G9 - Statistics - Cumulative Frequency Measuring The Spread Box Plot Freq Density
No ratings yet
G9 - Statistics - Cumulative Frequency Measuring The Spread Box Plot Freq Density
8 pages
Lecture 1
No ratings yet
Lecture 1
43 pages
Statistics Part 1 and 2
No ratings yet
Statistics Part 1 and 2
53 pages
Variability Final
No ratings yet
Variability Final
53 pages
Descriptive Statistics Week 2: L2 - Graphical Display of Data
No ratings yet
Descriptive Statistics Week 2: L2 - Graphical Display of Data
22 pages
Data Mining-5 - Getting Know Data 1
No ratings yet
Data Mining-5 - Getting Know Data 1
27 pages
fundamentals stats
No ratings yet
fundamentals stats
44 pages
Statistics Measure of Center
No ratings yet
Statistics Measure of Center
11 pages
S1 Chp3 RepresentationsOfData
No ratings yet
S1 Chp3 RepresentationsOfData
41 pages
4. Exploring Numerical Data_students
No ratings yet
4. Exploring Numerical Data_students
97 pages
Chapter 1 Descriptive Stats L2 Jan 2024
No ratings yet
Chapter 1 Descriptive Stats L2 Jan 2024
22 pages
Chapter 2 Final of Final
No ratings yet
Chapter 2 Final of Final
158 pages
Staticus: Math 103 Lecture 9 Class Notes
No ratings yet
Staticus: Math 103 Lecture 9 Class Notes
4 pages
2. Summarising data
No ratings yet
2. Summarising data
7 pages
Quantitative Methods For Management
No ratings yet
Quantitative Methods For Management
118 pages
Statistics - Lecture Slides 3 - For Lecture
No ratings yet
Statistics - Lecture Slides 3 - For Lecture
37 pages
First Week
No ratings yet
First Week
8 pages
7_2
No ratings yet
7_2
34 pages
Measure of Variation
No ratings yet
Measure of Variation
50 pages
Lecture-2 Descriptive Statistics-Box Plot Descriptive Measures
No ratings yet
Lecture-2 Descriptive Statistics-Box Plot Descriptive Measures
44 pages
A Detailed Lesson Plan in Mathematics 10: A. Preliminary/Routinary Activity
No ratings yet
A Detailed Lesson Plan in Mathematics 10: A. Preliminary/Routinary Activity
12 pages
Mean, Median, Mode, Standard Deviation (Descriptive Statistics)
No ratings yet
Mean, Median, Mode, Standard Deviation (Descriptive Statistics)
43 pages
Assignment 1 Midterm
No ratings yet
Assignment 1 Midterm
5 pages
Statistics Midterm Review
No ratings yet
Statistics Midterm Review
21 pages
Continuation Cahpter 4
No ratings yet
Continuation Cahpter 4
47 pages
Statistics For Css
No ratings yet
Statistics For Css
73 pages
bloxplots in data science
No ratings yet
bloxplots in data science
3 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
Business Statistics
No ratings yet
Business Statistics
106 pages
L3 Numerical Summary Measures
No ratings yet
L3 Numerical Summary Measures
44 pages
Data Handling
No ratings yet
Data Handling
18 pages
Central Tendency and Dispersion: A.Ramesh
No ratings yet
Central Tendency and Dispersion: A.Ramesh
58 pages
02data Part2
No ratings yet
02data Part2
34 pages
MATH& 146 Lesson 8: Averages and Variation
No ratings yet
MATH& 146 Lesson 8: Averages and Variation
30 pages
W4 D3 G9-12 Outliers Student
No ratings yet
W4 D3 G9-12 Outliers Student
4 pages
CHP 2
No ratings yet
CHP 2
52 pages
Data Preprocessing Data Basics
No ratings yet
Data Preprocessing Data Basics
86 pages
3 Stats Box and Whisker
No ratings yet
3 Stats Box and Whisker
35 pages
Unit 1
No ratings yet
Unit 1
26 pages
Math 11th Grade Lesson File 4 2021-22
No ratings yet
Math 11th Grade Lesson File 4 2021-22
6 pages
11-6D Quartiles, Percentiles and Boxplots and Histograms
No ratings yet
11-6D Quartiles, Percentiles and Boxplots and Histograms
24 pages
lecture_note_2
No ratings yet
lecture_note_2
7 pages
EDA 1 Continuation
No ratings yet
EDA 1 Continuation
10 pages
g11 10 Statistics
No ratings yet
g11 10 Statistics
49 pages
Business Statistics CH (7)
No ratings yet
Business Statistics CH (7)
37 pages
Answers IBS
No ratings yet
Answers IBS
13 pages
Notes Measures of Variation Range and Interquartile Range
No ratings yet
Notes Measures of Variation Range and Interquartile Range
11 pages
IZT_Lecture 5_S&Q
No ratings yet
IZT_Lecture 5_S&Q
10 pages
Basic 1
No ratings yet
Basic 1
60 pages
Data Analytics TB
No ratings yet
Data Analytics TB
1,944 pages
R22-UNIT2-CH2
No ratings yet
R22-UNIT2-CH2
28 pages
Note 02
No ratings yet
Note 02
31 pages
DM 02 01 Data Undrestanding
No ratings yet
DM 02 01 Data Undrestanding
35 pages
Statistical Analysis
No ratings yet
Statistical Analysis
15 pages
Stats1 Chapter 3::: Representations of Data
No ratings yet
Stats1 Chapter 3::: Representations of Data
40 pages
Interpretation of Test Results
No ratings yet
Interpretation of Test Results
27 pages
Powerpoint Workshop Introduction To Deep Learning - Statistics and Data Analysis
No ratings yet
Powerpoint Workshop Introduction To Deep Learning - Statistics and Data Analysis
26 pages
Simple Numbers
From Everand
Simple Numbers
Prasant
No ratings yet
New File Spss
No ratings yet
New File Spss
4 pages
Datasets - Bodyfat2 Fitness Newfitness Abdomenpred: Saseg 8B - Correlation Analysis
No ratings yet
Datasets - Bodyfat2 Fitness Newfitness Abdomenpred: Saseg 8B - Correlation Analysis
34 pages
Statistical Formula Sheet 1: X X N X N X F X N
No ratings yet
Statistical Formula Sheet 1: X X N X N X F X N
11 pages
Stat
No ratings yet
Stat
6 pages
Normal Distribution
No ratings yet
Normal Distribution
21 pages
Get Mathematical statistics basic ideas and selected topics Volume II Bickel P.J. PDF ebook with Full Chapters Now
100% (2)
Get Mathematical statistics basic ideas and selected topics Volume II Bickel P.J. PDF ebook with Full Chapters Now
55 pages
The Effect of Agility Training On Athletic Power Performance
No ratings yet
The Effect of Agility Training On Athletic Power Performance
8 pages
FINAL EXAM - Attempt Review PDF
No ratings yet
FINAL EXAM - Attempt Review PDF
9 pages
SPSS Manual
100% (3)
SPSS Manual
72 pages
How To 'Sum' A Standard Deviation?
No ratings yet
How To 'Sum' A Standard Deviation?
7 pages
Statistical Estimation: Prof GRC Nair
No ratings yet
Statistical Estimation: Prof GRC Nair
15 pages
Non Linear Regression
No ratings yet
Non Linear Regression
12 pages
Topic 5-20200317101441 PDF
No ratings yet
Topic 5-20200317101441 PDF
15 pages
Robust
No ratings yet
Robust
3 pages
Wa0003.
No ratings yet
Wa0003.
2 pages
THE EFFECT OF SOCIAL MEDIA ADDICTION ON MARRIAGE ROLE EXPECTATIONS
No ratings yet
THE EFFECT OF SOCIAL MEDIA ADDICTION ON MARRIAGE ROLE EXPECTATIONS
12 pages
177 Excel Time Series Analysis
No ratings yet
177 Excel Time Series Analysis
5 pages
Straightforward Statistics 1st Edition Bowen Test Bank download
100% (3)
Straightforward Statistics 1st Edition Bowen Test Bank download
54 pages
5.6 MMW
No ratings yet
5.6 MMW
27 pages
Bab V
No ratings yet
Bab V
29 pages
Introduction To SPC Documents
No ratings yet
Introduction To SPC Documents
33 pages
2.lines of Best Fit PDF
No ratings yet
2.lines of Best Fit PDF
6 pages
Uniform Distribution
No ratings yet
Uniform Distribution
10 pages
InternationalJournalFGCN_MohdEffendi_AZKe (4)
No ratings yet
InternationalJournalFGCN_MohdEffendi_AZKe (4)
15 pages
Practical Research 2 Fourth Summative Test-Second Quarter
No ratings yet
Practical Research 2 Fourth Summative Test-Second Quarter
2 pages
Regression Discontinuity Models: Pavel Coronado
No ratings yet
Regression Discontinuity Models: Pavel Coronado
63 pages
Wendorf ReportingStatistics
No ratings yet
Wendorf ReportingStatistics
6 pages
Perhitungan Regresi Dengan Excell Secara Manual
No ratings yet
Perhitungan Regresi Dengan Excell Secara Manual
7 pages