Lab 4 DONE New
Lab 4 DONE New
Introduction
Correlation
Correlation measures the linear association between two continuous variables. The explanatory variable
is x and the response variable is y. The values can fall between [-1, 1]. Where the closer the value is to |
1|, the stronger the linear relationship, and the sign (+. -) indicates if the relationship is positive: as x
increases y also increases, or negative: as x increases y decreases.
Correlation is very sensitive to outliers in a data set. Outliers that follow the trend of the data set (in
trend) tend to inflate the correlation (make it stronger). Outliers that are in the opposite trend from the
data set (out of trend) tend to deflate (weaken) the correlation.
Coefficient of Determination
The coefficient of determination is the r 2 value. This measures how much of the change in y is because
of the change in x. Interpretation is typically formatted as follows: The percent change in y is explained
by the change in x. Where the percentage, x, and y are replaced based on the problem’s context.
Regression
Linear regression is used for predicting y-values (response) for given x-values (explanatory). The
equation of the line is determined by fitting a line of best fit through the scatterplot of the data. The
prediction is only reliable if the data follow a linear trend (measured by the correlation)
Lab
Using the Large.FEV Data set, download the Excel file “Large.FEV” from Blackboard (In the Lab 4 folder).
The forced expiratory volume (FEV, measured in liters) is a primary indicator of lung function and
corresponds to the volume of air that can forcibly be blown out in the first second after full inspiration.
The large.FEV data file contains the FEV values of a large sample of children, age, height, and some
categorical descriptors of each. We will use the data to study growth patterns in children.
1
Correlation
Copy FEV data into the worksheet of Minitab.
2
Plot the relationship between age in years (explanatory, x-axis) and height in inches (response, y-axis) in
boys and girls, using the category of sex (where 0 = boys 1 = girls).
3
Yes, the growth patterns seem to have a linear growth for the most part, regardless of the sex, even
though sex 0 seems to have a steeper slope than sex 1. There seems to be a positive linear
correlation within the two.
Regression Line
Plot the relationship between height in inches (explanatory, x-axis) and FEV( an indicator of lung
function that corresponds to the volume of air), using the category of sex (where 0 = boys 1 = girls).
8) What percent of the variation in height is explained by these models for the boys(sex = 0)?
(Hint: Hover over the regression line to see it)
77.9 %
9) What percent of the variation in height is explained by these models for the girls(sex =1)?
(Hint: Hover over the regression line to see it)
69.6%
Plot the relationship between height and FEV(an indicator of lung function that corresponds to the
volume of air), and obtain the least-squares regression equation.
8) What are the least-squares regression equations for the fev and height?
11) Use the least-squares equation to predict the FEV of a child who is 60 inches tall.
Y= -5.433 + 0.1320(60) = 2.487