0% found this document useful (0 votes)
53 views19 pages

Chapter 03

This document summarizes key concepts for describing relationships between two variables using bivariate data. Graphs like comparative pie charts, bar charts, and scatterplots can be used. The correlation coefficient r measures the strength and direction of a linear relationship between two quantitative variables, ranging from -1 to 1. A value of 0 indicates a weak relationship; values near 1 or -1 show a strong positive or negative relationship.

Uploaded by

Lucas Luther
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
53 views19 pages

Chapter 03

This document summarizes key concepts for describing relationships between two variables using bivariate data. Graphs like comparative pie charts, bar charts, and scatterplots can be used. The correlation coefficient r measures the strength and direction of a linear relationship between two quantitative variables, ranging from -1 to 1. A value of 0 indicates a weak relationship; values near 1 or -1 show a strong positive or negative relationship.

Uploaded by

Lucas Luther
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 19

Introduction to Probability

and Statistics
Twelfth Edition

Robert J. Beaver • Barbara M. Beaver • William Mendenhall

Presentation designed and written by:


Barbara M. Beaver
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Introduction to Probability
and Statistics
Twelfth Edition

Chapter 3
Describing Bivariate Data

Copyright ©2006 Brooks/Cole


Some images © 2001-(current year) www.arttoday.com A division of Thomson Learning, Inc.
Bivariate Data
• When two variables are measured on a single
experimental unit, the resulting data are
called bivariate data.
• You can describe each variable individually,
and you can also explore the relationship
between the two variables.
• Bivariate data can be described with
– Graphs
– Numerical Measures

Copyright ©2006 Brooks/Cole


A division of Thomson Learning, Inc.
Graphs for Qualitative
Variables
• When at least one of the variables is
qualitative, you can use comparative pie charts
or bar charts.
Do
Doyou
youthink
thinkthat
thatmen
menand
andwomen
womenareare Variable #1 = Opinion
treated
treatedequally
equallyin
inthe
theworkplace?
workplace? Variable #2 = Gender
Men Women

Copyright ©2006 Brooks/Cole


A division of Thomson Learning, Inc.
Comparative Bar Charts
120 Gender 70
120 Gender 70
Men
Men
Women 60
100 Women 60
100
50
50
80
80

Percent
40

Percent
Percent

40
Percent

60
60 30
30

40 20
40 20

10
20 10
20
0
Gender 0 Men Women Men Women Men Women
0 Gender Men Women Men Women Men Women
Opinion Agree Disagree No Opinion
Opinion 0 Agree Disagree No Opinion Opinion Agree Disagree No Opinion
Opinion Agree Disagree No Opinion

• Stacked Bar Chart • Side-by-Side Bar Chart


Describe the relationship between opinion and
gender:
More
More women
women than
than men
men feel
feel that
that they
they are
are
not
not treated
treated equally
equally in
in the
the workplace..
workplace..
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Two Quantitative Variables
When both of the variables are quantitative,
call one variable x and the other y. A single
measurement is a pair of numbers (x, y) that
can be plotted using a two-dimensional graph
called a scatterplot.
y
(2, 5)

y=5

x
x=2 Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Describing the Scatterplot
• What pattern or form do you see?
• Straight line upward or downward
• Curve or no pattern at all
• How strong is the pattern?
• Strong or weak
• Are there any unusual observations?
observations
• Clusters or outliers
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Examples

Positive linear - strong Negative linear -weak

Curvilinear No relationship
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Numerical Measures for
Two Quantitative Variables
• Assume that the two variables x and y
exhibit a linear pattern or form.
form
• There are two numerical measures to
describe
– The strength and direction of the
relationship between x and y.
– The form of the relationship.
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
The Correlation Coefficient
• The strength and direction of the relationship
between x and y are measured using the
correlation coefficient, r.

ssxyxy ((
 xxii)()(
 yyii))

 xxiiyyii 
rr  where ssxy  nn
ssxxssyy xy
nn11
ssxx == standard
standard deviation
deviation of
of the
the x’s
x’s
ssyy == standard
standard deviation
deviation of
of the
the y’s
y’s
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Example
• Living area x and selling price y of 5 homes.
Residence 1 2 3 4 5
x (thousand sq ft) 14 15 17 19 16
y ($000) 178 230 240 275 200

••The
The scatterplot
scatterplot
indicates
indicates aa positive
positive
linear
linear relationship.
relationship.

Copyright ©2006 Brooks/Cole


A division of Thomson Learning, Inc.
x
14
y
178
xy
2492
Example
15 230 3450
Calculate
17 240 4080
x  16.2 s x  1.924
19 275 5225
16 200 3200
y  224.6 s y  37.360
81 1123 18447

( xi )( yi ) s xy
 xi yi  r
s xy  n sx s y
n 1
63.6
(81)(1123)   .885
18447  1.924(37.36)
 5  63.6
4
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Interpreting r MY APPLET

•-1  r  1 Sign of r indicates direction of


the linear relationship.

•r  0 Weak relationship; random scatter


of points

•r  1 or –1 Strong relationship; either


positive or negative

All points fall exactly on a


•r = 1 or –1 straight line.
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
The Regression Line
• Sometimes x and y are related in a particular way
—the value of y depends on the value of x.
– y = dependent variable
– x = independent variable
• The form of the linear relationship between x and y
can be described by fitting a line as best we can
through the points. This is the regression line,
y = a + bx.
– a = y-intercept of the line MY APPLET

– b = slope of the line


Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
The Regression Line
• To find the slope and y-intercept of
the best fitting line, use:

ssyy
bb  rr
ssxx
aa  yy bbxx

• The least squares


• regression line is y = a + bx
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
x
14
y
178
xy
2492
Example
15 230 3450
Recall
17 240 4080 x  16.2 s x  1.9235
19 275 5225 y  224.6 s y  37.3604
16 200 3200
r  .885
81 1123 18447

sy 37.3604
br  (.885)  17.189
sx 1.9235
a  y  bx  224.6  17.189(16.2)  53.86
RegressionLine : y  53.86  17.189 x
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Example
• Predict the selling price for another
residence with 1600 square feet of living
area.

Predict: y  53.86  17.189 x


 53.86  17.189(16)  221.16 or $221,160
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Key Concepts
I. Bivariate Data
1. Both qualitative and quantitative variables
2. Describing each variable separately
3. Describing the relationship between the variables
II. Describing Two Qualitative Variables
1. Side-by-Side pie charts
2. Comparative line charts
3. Comparative bar charts
 Side-by-Side
 Stacked
4. Relative frequencies to describe the relationship between
the two variables.
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Key Concepts
III. Describing Two Quantitative Variables
1. Scatterplots
 Linear or nonlinear pattern
 Strength of relationship
 Unusual observations; clusters and outliers
2. Covariance and correlation coefficient
3. The best fitting line
 Calculating the slope and y-intercept
 Graphing the line
 Using the line for prediction
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.

You might also like