0% found this document useful (0 votes)

10 views6 pages

Chapter 5

Introduction to Statistics

Uploaded by

murad.ridwan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views6 pages

Chapter 5

Introduction to Statistics

Uploaded by

murad.ridwan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Chapter 5

Introduction to Statistics

5.1 Statistics
Statistics is the science which deals with methods of collecting, classifying,
presenting, comparing and interpreting numerical data collected to throw
light on any sphere of enquiry.

5.2 Frequency Distributions

Consider the marks (per 50)obtained by 60 students according to their ID
numbers:

38, 11, 40, 0, 26, 15, 5, 45, 7, 32, 2, 18, 42, 8, 31, 27, 4, 12, 35, 15, 0, 7, 28,
46, 9, 16, 29, 34, 10, 7, 5, 1, 17, 22, 35, 8, 36, 47, 11, 30, 19, 0, 16, 14, 14,
18, 41, 38, 2, 17, 42, 45, 48, 28, 7, 21, 8, 28, 5, 20.

The data does not give any useful information. These are called raw data or
ungrouped data. To bring out certain salient features of this data, we arrange
the data into classes or categories and determine the number of individuals
belonging to each class, called the class frequency. The resulting arrange-
ment is called a frequency distribution or frequency table. A graph for the
frequency distribution can be supplied by a histogram or by a polygon graph
connecting the middle points of of the tops in the histogram.

1
Class Notes on
5.3. MEASURES OF CENTRAL TENDENCY Applied Probability and Statistics ECEG-342

Table 5.1: Frequency Distribution

Marks Frequency
0-9 17
10 - 19 15
20 - 29 7
30 - 39 9
40 - 50 12

5.3 Measures of Central Tendency

Types of averages in common use:
1. Arithmetic average or mean
2. Median
3. Mode

1. Arithmetic Mean x̄
If x : x1 , x2 , . . . , xn then
x1 + x2 + . . . + x n 1X
x̄ = = xi
n n i

If the frequency distribution is given, i.e.,

x : x1 , x2 , . . . , xn
f : f1 , f2 , . . . , fn

then P
f 1 x1 + f 2 x2 + . . . + f n xn f i xi
x̄ = = Pi
f1 + f2 + . . . + fn i fi
Weighted Arithmetic Mean: If the individual data are not of equal
importance, we may attach to them weights w1 , w2 , . . . , wn as measure
of their importance
P
w 1 x1 + w 2 x2 + . . . + w n xn w i xi
x̄w = = Pi
w1 + w2 + . . . + wn i wi

Exercise 5.1: The mean of 200 items was 50. Later on it was discovered
that two items were misread as 92 and 8 instead of 192 and 88. Find out
the correct mean. (ans. 50.9)

Murad Ridwan, 2 of 6
Dep. of Electrical & Computer Engineering
AAiT, Addis Ababa University.
Aug 2010.
Class Notes on
5.4. MEASURES OF DISPERSION Applied Probability and Statistics ECEG-342

2. Median
It is the central value of the data when the values are arranged in
ascending or descending order of magnitude.
When the n data are arranged in ascending or descending order of
magnitude, i.e., x1 , x2 , . . . , xn

x(n+1)/2 , if n is odd;
median = xn/2 +x(n/2+1)
2
, if n is even.

3. Mode
Mode is the value which occurs most frequently. It does not always
exist. This is certainly true when all observations occur with the same
frequency.

Exercise 5.2: The nicotine contents for a random sample of 6 cigarettes

of a certain brand are found to be 2.3, 2.7, 2.5, 2.9, 3.1, and 1.9. milligrams.
Find the mean, median and mode. (ans. x = 2.57; median = 2.6; mode =
does not exist)

5.4 Measures of Dispersion

Measures of dispersion in common use are range, variance and standard
deviation.

1. Range
The range of x1 , x2 , . . . , xn is defined as x(n) − x(1) where x(n) and x(1)
are, respectively, the largest and smallest values.

2. Variance n
1X
S2 = (xi − x)2
n i=1

3. Standard Deviation: is the positive square root of variance, i.e., S.

5.5 Regression and Correlation

Often, in practice, we encounter experiments in which we observe or measure
two quantities simultaneously. In practice we may distinguish between two
kinds of experiments, as follows:

Murad Ridwan, 3 of 6
Dep. of Electrical & Computer Engineering
AAiT, Addis Ababa University.
Aug 2010.
Class Notes on
5.5. REGRESSION AND CORRELATION Applied Probability and Statistics ECEG-342

1. In correlation analysis both quantities are random variables and we

are interested in relations between them. For example, if X represents
the age of a used automobile and Y represents the retail book value of
the automobile, we should expect large values of X to correspond to
small values of Y and small values of X to correspond to large values
values of X. Correlation analalysis attempts to measure the strength of
such relationships between two variables by means of a single number
called correlation coefficient.

2. In regression analysis one of the two variables, call it x, can be

regarded as an ordinary variable, that is, it can be measured without
appreciable error. The other variable, Y , is a random variable. x is
called the independent or controlled variable, and one is interested in
the dependence of Y on x. Typical example may be the dependence of
the blood pressure Y on the age x of a person or, as we shall now say,
the regression of Y on x.

5.5.1 Linear Regression

Assume in an experiment we first select n values x1 , . . . , xn of x and
then observe Y at those values of x, so that we obtain a sample of the
form (x1 , y1 ), . . . , (xn , yn ). In regression analysis the mean µ of Y is
assumed to depend on x, i.e., µ = µ(x). The curve of µ(x) is called the
regression curve of Y on x. We consider the simplest case, when µ(x)
is a linear function, µ(x) = α + βx. Then we may want to plot the
sample values as n points in the xY -plane, fit a straight line through
them and use this line for estimating µ(x) for given values of x, so that
we know what values of Y we can expect if we choose certain certain
values of x.

A widely used mathematical model for fitting lines is the method of

least squares by Gauss. In this method, the straight line shroud be
fitted through the given points so that the sum of the squares of the
distances of those points from the straight line is minimum, where the
distance is measured in the vertical direction.

The vertical distance (in the y-direction) of a sample point (xi , yi ) from
a straight line y = a + bx is |yi − a − bxi |. Hence the sum of the squares

Murad Ridwan, 4 of 6
Dep. of Electrical & Computer Engineering
AAiT, Addis Ababa University.
Aug 2010.
Class Notes on
5.5. REGRESSION AND CORRELATION Applied Probability and Statistics ECEG-342

of these distances is
n
X
e= (yi − a − bxi )2
i=1

In the method of least squares we choose a and b such that the estima-
tion error e is minimum. A necessary condition for e to be minimum
is
∂e ∂e
= 0, and =0
∂a ∂b
Theorem 1. The least-squares line (linear regression) approximating
the set of points (x1 , y1 ), . . . , (xn , yn ) has the equation y = a+bx, where
the constants a and b are given by
n ni=1 xi yi − ( ni=1 xi )( ni=1 yi )
P P P
b =
n ni=1 x2i − ( ni=1 xi )2
P P
Pn Pn
i=1 yi − b i=1 xi
a =
n
Exercise 5.3: Verify the above theorem.

Exercise 5.4: A study was made on the amount of converted sugar in a

certain process at various temperatures. The data were coded and recorded
as follows:

Temperature, x Converted Sugar, y

1.0 8.1
1.1 7.8
1.2 8.5
1.3 9.8
1.4 9.5
1.5 8.9
1.6 8.6
1.7 10.2
1.8 9.3
1.9 9.2
2.0 10.5

(a) Plot a scatter diagram

(b) Estimate the linear regression line
(c) Estimate the mean amount of converted sugar produced when the
coded temperature is 1.75.
(ans. (b) 6.4136 + 1.8091x, (c) ŷ = 9.580.)

Murad Ridwan, 5 of 6
Dep. of Electrical & Computer Engineering
AAiT, Addis Ababa University.
Aug 2010.
Class Notes on
5.5. REGRESSION AND CORRELATION Applied Probability and Statistics ECEG-342

Exercise 5.5: The following data are the selling prices z of a certain make
and model of used cars w years old:

w years z dollars
1 6350
2 5695
2 5750
3 5395
5 4985
5 4895

(a) Plot a scatter diagram

(b) Fit a nonlinear sample regression curve of the form z = cdw .
Hint: Write

ln z = ln c + (ln d)w
= a + bw,

where a = ln c and b = ln d, and then estimate a and b by the formulas

for linear regression using the sample points (wi , ln zi )
(c) Estimate the selling price of such a car when it is 4 years old.
(ans. (b) z = 6461.392 × 0.947w (c) z = 5197)

Exercise 5.6: Construct a least-squares straight line which approximates

the data given below using

(a) x as independent variable,

(b) x as dependent variable.

x 1 3 4 6 8 9 11 14
y 1 2 4 4 5 7 8 9

Murad Ridwan, 6 of 6
Dep. of Electrical & Computer Engineering
AAiT, Addis Ababa University.
Aug 2010.

Elementary Statistical Methods 7th Edition - 678
No ratings yet
Elementary Statistical Methods 7th Edition - 678
383 pages
Linear Algebra Lecture Notes 01
No ratings yet
Linear Algebra Lecture Notes 01
7 pages
Final Report On Waterproofing
100% (2)
Final Report On Waterproofing
35 pages
Linear Regression and Correlation: Model
No ratings yet
Linear Regression and Correlation: Model
9 pages
Lecture4 Mech SU
No ratings yet
Lecture4 Mech SU
17 pages
BRM-Statistics in Research
No ratings yet
BRM-Statistics in Research
30 pages
Lesson 6 7 Basic Concept in Statistics Measures of Central Tendency
No ratings yet
Lesson 6 7 Basic Concept in Statistics Measures of Central Tendency
44 pages
V Unit NOTES_f6b2726b8f2fcb4e15e8323a20772509
No ratings yet
V Unit NOTES_f6b2726b8f2fcb4e15e8323a20772509
19 pages
Lectures 14 15
No ratings yet
Lectures 14 15
66 pages
2.stat & Proba 2
No ratings yet
2.stat & Proba 2
15 pages
Sta301 Ch.1 To 22 For Grand Quiz
No ratings yet
Sta301 Ch.1 To 22 For Grand Quiz
16 pages
Group 10 - Curve Fitting
No ratings yet
Group 10 - Curve Fitting
81 pages
Business Statistics
No ratings yet
Business Statistics
6 pages
statistics mcqs
No ratings yet
statistics mcqs
8 pages
Tut-1-Cvg4150-2016-For Posting
No ratings yet
Tut-1-Cvg4150-2016-For Posting
20 pages
Lecture Notes For STAT2602
No ratings yet
Lecture Notes For STAT2602
104 pages
Statistics Assignment 05
50% (2)
Statistics Assignment 05
14 pages
Ics 2328 Computer Oriented Statistical Modeling Assignment March 2024 Ms
No ratings yet
Ics 2328 Computer Oriented Statistical Modeling Assignment March 2024 Ms
6 pages
06 Regression
No ratings yet
06 Regression
18 pages
Basic Statistics
No ratings yet
Basic Statistics
44 pages
Statistical+Inference+1 Shaw2007
No ratings yet
Statistical+Inference+1 Shaw2007
66 pages
Statistics
No ratings yet
Statistics
20 pages
SAS 2130 Statistics 2021
No ratings yet
SAS 2130 Statistics 2021
212 pages
Module 11
No ratings yet
Module 11
21 pages
Solutions Chapter 5
No ratings yet
Solutions Chapter 5
14 pages
STATSECO-XI-5
No ratings yet
STATSECO-XI-5
9 pages
Normal Distribution and Regression Notes
No ratings yet
Normal Distribution and Regression Notes
71 pages
03 - Measures - of - Center - Variation
No ratings yet
03 - Measures - of - Center - Variation
45 pages
Business Statistics
No ratings yet
Business Statistics
20 pages
Data and overview Lecture 6
No ratings yet
Data and overview Lecture 6
22 pages
Chapter 4 Regression (2)-Unlocked
No ratings yet
Chapter 4 Regression (2)-Unlocked
97 pages
Linear Regression Analysis_1
No ratings yet
Linear Regression Analysis_1
18 pages
Lesson 12 - Introduction To Regression and Correlation Analysis Regression Analysis
No ratings yet
Lesson 12 - Introduction To Regression and Correlation Analysis Regression Analysis
39 pages
ODD - Solutions Chapter 5
No ratings yet
ODD - Solutions Chapter 5
9 pages
List of Formula For Unit-3 and Unit-4
No ratings yet
List of Formula For Unit-3 and Unit-4
7 pages
Document 8
No ratings yet
Document 8
10 pages
Formula Stables
No ratings yet
Formula Stables
29 pages
STB1003_Unit-3 bsc
No ratings yet
STB1003_Unit-3 bsc
12 pages
ABC Business Statistics
No ratings yet
ABC Business Statistics
12 pages
Correlation Regression Bivariate
No ratings yet
Correlation Regression Bivariate
12 pages
Frequency and Measures of Central Tendency and Variability
No ratings yet
Frequency and Measures of Central Tendency and Variability
108 pages
Stats Formula
No ratings yet
Stats Formula
6 pages
unit 4 business statistics
No ratings yet
unit 4 business statistics
12 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
8 pages
Untitled 472
No ratings yet
Untitled 472
13 pages
Module 1A - Review of Elementary Statistics
No ratings yet
Module 1A - Review of Elementary Statistics
13 pages
5 Introduction To Statistics
No ratings yet
5 Introduction To Statistics
12 pages
Mda-Session-7 Simple Linear Regression
No ratings yet
Mda-Session-7 Simple Linear Regression
75 pages
Introduction To Data Analysis: Professor David Richardson IIT Stuart School of Business
No ratings yet
Introduction To Data Analysis: Professor David Richardson IIT Stuart School of Business
31 pages
Location) .: Distribution Is The Purpose of Measure of Central
No ratings yet
Location) .: Distribution Is The Purpose of Measure of Central
13 pages
Chapter5
No ratings yet
Chapter5
14 pages
Correlation and Regression 2
No ratings yet
Correlation and Regression 2
24 pages
Biostatistics Notes
No ratings yet
Biostatistics Notes
47 pages
STAT630Slide Adv Data Analysis
No ratings yet
STAT630Slide Adv Data Analysis
238 pages
BST Numericals
No ratings yet
BST Numericals
77 pages
Handout 05 Regression and Correlation PDF
No ratings yet
Handout 05 Regression and Correlation PDF
17 pages
Statistics Mini Project Roll No 211 To 220 NEW 2 0
No ratings yet
Statistics Mini Project Roll No 211 To 220 NEW 2 0
28 pages
(Ebook) Graybill & Iyer 2004 Regression Analysis - Concepts & Applications - With SAS & Minitab
No ratings yet
(Ebook) Graybill & Iyer 2004 Regression Analysis - Concepts & Applications - With SAS & Minitab
648 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Generalized Fermat Equation
From Everand
Generalized Fermat Equation
Ran Van Vo
No ratings yet
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
From Everand
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
Gérard Blanchet
3/5 (1)
Week 3 - Illness and Indigeneity 2
No ratings yet
Week 3 - Illness and Indigeneity 2
13 pages
NBS#244
100% (1)
NBS#244
6 pages
Guidance For Propeller Blade Welding Rep
No ratings yet
Guidance For Propeller Blade Welding Rep
2 pages
February Newsletter
No ratings yet
February Newsletter
2 pages
Journal Dragon Fruit Skin
No ratings yet
Journal Dragon Fruit Skin
9 pages
Thyristor Converters: - Controlled Conversion of Ac Into DC
No ratings yet
Thyristor Converters: - Controlled Conversion of Ac Into DC
39 pages
Bord and Pillar
No ratings yet
Bord and Pillar
4 pages
Kumpulan Soal Bahasa Inggris Uts
100% (1)
Kumpulan Soal Bahasa Inggris Uts
7 pages
Applying Design Knowledge and Machine Learning To Scada Data For Classification of Wind Turbine Operating Regimes
No ratings yet
Applying Design Knowledge and Machine Learning To Scada Data For Classification of Wind Turbine Operating Regimes
8 pages
2008 Oct Olympic Dam Presentation
No ratings yet
2008 Oct Olympic Dam Presentation
65 pages
Your Personality and Successful Trading
100% (5)
Your Personality and Successful Trading
17 pages
AI Introduction
No ratings yet
AI Introduction
17 pages
Expression of Like and Dislike
No ratings yet
Expression of Like and Dislike
13 pages
Compiled By: Mohammad Ashraful Alam (Shuvro), Student of IBA (MBA 56D Batch)
No ratings yet
Compiled By: Mohammad Ashraful Alam (Shuvro), Student of IBA (MBA 56D Batch)
18 pages
ROintro 1
No ratings yet
ROintro 1
15 pages
Soliman Et Al. - 2015 - Formula SAE Aerodynamics Design Process With Focu
No ratings yet
Soliman Et Al. - 2015 - Formula SAE Aerodynamics Design Process With Focu
15 pages
Public International Law: Law of The Sea
No ratings yet
Public International Law: Law of The Sea
19 pages
Geotechnical Reuse of Waste Materialnew
No ratings yet
Geotechnical Reuse of Waste Materialnew
42 pages
MD Faruk ADIS Project Word File
100% (1)
MD Faruk ADIS Project Word File
93 pages
2019 May MA204-E - Ktu Qbank
No ratings yet
2019 May MA204-E - Ktu Qbank
3 pages
May16 Spinal Fusion
No ratings yet
May16 Spinal Fusion
1 page
Bollet Point To Lodha Letter
No ratings yet
Bollet Point To Lodha Letter
3 pages
CINCO - Safety Plan
100% (1)
CINCO - Safety Plan
113 pages
Personal Resume
No ratings yet
Personal Resume
1 page
Basic 32 Bit MCU Design and Troubleshooting Checklist DS70005439
No ratings yet
Basic 32 Bit MCU Design and Troubleshooting Checklist DS70005439
76 pages
Puri G.M. - Python Scripts For ABAQUS - Learn by Example
No ratings yet
Puri G.M. - Python Scripts For ABAQUS - Learn by Example
29 pages
RHN Owners Manual
No ratings yet
RHN Owners Manual
112 pages
Mending Wall Analysis
No ratings yet
Mending Wall Analysis
4 pages

Chapter 5

Uploaded by

Chapter 5

Uploaded by

Chapter 5

5.2 Frequency Distributions

Table 5.1: Frequency Distribution

5.3 Measures of Central Tendency

If the frequency distribution is given, i.e.,

Exercise 5.2: The nicotine contents for a random sample of 6 cigarettes

5.4 Measures of Dispersion

3. Standard Deviation: is the positive square root of variance, i.e., S.

5.5 Regression and Correlation

1. In correlation analysis both quantities are random variables and we

2. In regression analysis one of the two variables, call it x, can be

5.5.1 Linear Regression

A widely used mathematical model for fitting lines is the method of

Exercise 5.4: A study was made on the amount of converted sugar in a

Temperature, x Converted Sugar, y

(a) Plot a scatter diagram

(a) Plot a scatter diagram

where a = ln c and b = ln d, and then estimate a and b by the formulas

Exercise 5.6: Construct a least-squares straight line which approximates

(a) x as independent variable,

You might also like