0% found this document useful (0 votes)

26 views16 pages

Assignment# 06

The document is an R Notebook detailing statistical analyses performed on two datasets: one concerning students' physical attributes and another related to mood assessments. It includes steps for loading data, checking dataset structures, calculating Pearson correlation coefficients, visualizing relationships, and conducting Shapiro-Wilk tests for normality. The findings indicate significant correlations between body weight and height, as well as between negative and positive moods, with both datasets showing non-normal distributions.

Uploaded by

shanza161199

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views16 pages

Assignment# 06

Uploaded by

shanza161199

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

11/29/24, 12:41 PM R Notebook

R Notebook
This is an R Markdown (http://rmarkdown.rstudio.com) Notebook. When you execute code within the notebook, the
results appear beneath the code.

Try executing this chunk by clicking the Run button within the chunk or by placing your cursor inside it and
pressing Ctrl+Shift+Enter.

plot(cars)

Add a new chunk by clicking the Insert Chunk button on the toolbar or by pressing Ctrl+Alt+I.

Assignment-06
Exercise-75
Step-01:Load the data

students<-read.delim("E:\\Statistics\\Datasets\\Students.txt",
stringsAsFactors=F)

Step-02:Check the dataset structure

file:///E:/Statistics/Exercises/Assignment-06.html 1/16
11/29/24, 12:41 PM R Notebook

summary(students)

## ID Sex Sex_coded Blood_group

## Min. : 1.00 Length:82 Min. :0.0000 Length:82
## 1st Qu.:21.25 Class :character 1st Qu.:0.0000 Class :character
## Median :41.50 Mode :character Median :1.0000 Mode :character
## Mean :41.50 Mean :0.6585
## 3rd Qu.:61.75 3rd Qu.:1.0000
## Max. :82.00 Max. :1.0000
## Blood_group_coded Rhesus_factor Rhesus_factor_coded Smoking
## Min. :0.0000 Length:82 Min. :0.0000 Length:82
## 1st Qu.:0.0000 Class :character 1st Qu.:1.0000 Class :character
## Median :1.0000 Mode :character Median :1.0000 Mode :character
## Mean :0.9512 Mean :0.8415
## 3rd Qu.:1.0000 3rd Qu.:1.0000
## Max. :3.0000 Max. :1.0000
## Smoking_coded Size_cm Weight_kg Points_exam
## Min. :0.0000 Min. :157.0 Min. :46.00 Min. : 1.000
## 1st Qu.:0.0000 1st Qu.:167.0 1st Qu.:56.25 1st Qu.: 6.250
## Median :0.0000 Median :170.0 Median :61.00 Median : 8.000
## Mean :0.3171 Mean :173.2 Mean :65.84 Mean : 7.988
## 3rd Qu.:1.0000 3rd Qu.:179.0 3rd Qu.:75.75 3rd Qu.:10.000
## Max. :1.0000 Max. :194.0 Max. :98.00 Max. :12.000
## Grade
## Min. :1.000
## 1st Qu.:2.000
## Median :3.000
## Mean :3.122
## 3rd Qu.:4.750
## Max. :5.000

Step-03:Calculate the Pearson correlation coefficient

correlation <- cor(students$Weight_kg, students$Size_cm, method = "pearson")

cat("Pearson Correlation Coefficient:", correlation, "\n")

## Pearson Correlation Coefficient: 0.7790491

Conclusion

file:///E:/Statistics/Exercises/Assignment-06.html 2/16
11/29/24, 12:41 PM R Notebook

# Is there any linear relationship between the variables?

# Hypotheses for Pearson Correlation:

#Null Hypothesis(H0): There is no linear relationship between body weight and body height (p=0).
#Alternative Hypothesis (H1): There is a linear relationship between body weight and body height
(p≠0).

# As p<0.05, we reject the null hypothesis. There is sufficient evidence to conclude that body w
eight and body height are significantly positively linearly related with a correlation coefficie
nt of 𝑟=0.7790491.

Step-04:Visualize the Relationship of Scatter

library(ggpubr)

## Loading required package: ggplot2

ggscatter(
students, x = "Weight_kg", y = "Size_cm",
color = "#1f77b4",
add = "reg.line",
conf.int = TRUE,
add.params = list(color = "#ff7f0e"),
cor.coef = TRUE, cor.method = "pearson",
xlab = "Weight (kg)", ylab = "Height (cm)"
)

file:///E:/Statistics/Exercises/Assignment-06.html 3/16
11/29/24, 12:41 PM R Notebook

Conclusion

#The scatter plot reveals a Strong positive linear relationship between coefficient of body weig
ht and body height in the data set students.

Step-05:Shapiro-Wilk tests

# Shapiro-Wilk test for body weight

shapiro_weight <- shapiro.test(students$Weight_kg)
cat("Shapiro-Wilk Test for Weight:\n")

## Shapiro-Wilk Test for Weight:

cat("W-statistic:", shapiro_weight$statistic, "\n")

## W-statistic: 0.9195322

cat("p-value:", shapiro_weight$p.value, "\n")

## p-value: 7.40539e-05

file:///E:/Statistics/Exercises/Assignment-06.html 4/16
11/29/24, 12:41 PM R Notebook

# Shapiro-Wilk test for body height

shapiro_height <- shapiro.test(students$Size_cm)
cat("Shapiro-Wilk Test for Height:\n")

## Shapiro-Wilk Test for Height:

cat("W-statistic:", shapiro_height$statistic, "\n")

## W-statistic: 0.958204

cat("p-value:", shapiro_height$p.value, "\n")

## p-value: 0.009213035

Step-06:Q-Q Plots

library(ggpubr)

# Q-Q plot for body weight

plot1 <- ggqqplot(students$Weight_kg, ylab = "Body Weight (kg)", color = "#FFA500")

# Q-Q plot for body height

plot2 <- ggqqplot(students$Size_cm, ylab = "Body Height (cm)", color = "#FFA500")

# Arrange the plots side by side

ggarrange(plot1, plot2, ncol = 2, nrow = 1,
labels = c("A", "B"), # Add labels to the plots
common.legend = TRUE, legend = "bottom") # Shared legend

file:///E:/Statistics/Exercises/Assignment-06.html 5/16
11/29/24, 12:41 PM R Notebook

Conclusion

# Test for significance of the correlation

#Hypotheses for Shapiro-Wilk Test

#Null Hypothesis (H0): The data is normally distributed.
#Alternative Hypothes is (H1): The data is not normally distributed.

#Shapiro-Wilk Test for Weight: W-statistic: 0.9195322,p-value: 7.40539e-05, as p<0.05 we reject

the null hypothesis (The data for body weight is not normally distributed).
#Shapiro-Wilk Test for height: W-statistic: 0.958204,p-value: 0.009213035 , as p<0.05 we reject
the null hypothesis (The data for body height is not normally distributed).

Exercise-76
Step 01:Load the data:

# Load the ICM dataset

ICM <- read.delim("E:\\Statistics\\Datasets\\ICM.txt", stringsAsFactors = FALSE)

# View the structure of the data to identify the columns for negative and positive mood
str(ICM)

file:///E:/Statistics/Exercises/Assignment-06.html 6/16
11/29/24, 12:41 PM R Notebook

## 'data.frame': 199 obs. of 23 variables:

## $ ID : int 75 90 173 189 100 155 63 48 76 165 ...
## $ Gender : chr "female" "female" "female" "female" ...
## $ Age : int 22 22 37 17 19 16 17 19 27 19 ...
## $ Englishfluent : chr "yes" "yes" "yes" "yes" ...
## $ Germanfluent : chr "no" "no" "yes" "yes" ...
## $ Transport : chr "PublicTransport" "PublicTransport" "Car" "Car" ...
## $ Highest_level_of_education: chr "College" "College" "University" "none" ...
## $ Do_you_smoke : chr "No" "No" "No" "No" ...
## $ Socialmediahours : chr "1.5-3hrs/day" "1.5-3hrs/day" "<1.5hrs/day" "1.5-3hrs/da
y" ...
## $ Timewithfriends : chr "2-5hrs/week" "2-5hrs/week" "5-10hrs/week" "10-20hrs/wee
k" ...
## $ Pet : chr "No" "No" "Yes" "Yes" ...
## $ Siblings : chr "Yes" "Yes" "No" "Yes" ...
## $ Children : chr "No" "No" "Yes" "No" ...
## $ Relationshipstatus : chr "Relationship" "Relationship" "Relationship" "Single" ...
## $ Activitieshours : int 10 10 20 40 20 10 10 20 10 20 ...
## $ NegativeMood : num NA NA NA 4 2.82 ...
## $ PositiveMood : num NA NA NA 0 0.333 ...
## $ Mentalhealth : num 2.667 2.667 3.5 1 0.833 ...
## $ Socialization : num NA NA NA 1 2.5 ...
## $ Activity : num 2.8 2.8 3.4 3.2 1.2 2.6 1.6 1.8 1.2 0.4 ...
## $ SocialSupport : num 4 4 2.333 0.667 2.333 ...
## $ Communication_open_direct : num NA NA 3.38 3.62 3.15 ...
## $ OHS : num 4.59 4.59 5.1 3.14 2.76 ...

# View the first few rows to check the data

head(ICM)

ID Gen… A… Englishfluent Germanfluent Transport Highest_level_of_education

1 75 female 22 yes no PublicTransport College

2 90 female 22 yes no PublicTransport College

3 173 female 37 yes yes Car University

4 189 female 17 yes yes Car none

5 100 female 19 yes yes Walk HighSchool

6 155 female 16 yes no Walk none

6 rows | 1-8 of 24 columns

Step 02:Check for missing values

# Check the number of missing values in both columns

sum(is.na(ICM$NegativeMood))

file:///E:/Statistics/Exercises/Assignment-06.html 7/16
11/29/24, 12:41 PM R Notebook

## [1] 5

sum(is.na(ICM$PositiveMood))

## [1] 3

ICM_clean <- na.omit(ICM[, c("NegativeMood", "PositiveMood")])

correlation <- cor(ICM_clean$NegativeMood, ICM_clean$PositiveMood, method = "pearson")
cat("Pearson Correlation Coefficient:", correlation, "\n")

## Pearson Correlation Coefficient: -0.6433565

Conclusion

# Is there any linear relationship between the variables?

#Null Hypothesis (H0):There is no linear relationship between Negative Mood and Positive Mood (p
=0)
# Alternative Hypothesis(H1): There is a linear relationship between Negative Mood and Positive
Mood (p is not equal to zero)

# As p<0.05, we reject the null hypothesis.There is a statistically significant negative linear

relationship between Negative Mood and Positive Mood.

Step 03:Test for Significance

cor_test <- cor.test(ICM_clean$NegativeMood, ICM_clean$PositiveMood, method = "pearson")

cat("Pearson Correlation Test:\n")

## Pearson Correlation Test:

print(cor_test)

##
## Pearson's product-moment correlation
##
## data: ICM_clean$NegativeMood and ICM_clean$PositiveMood
## t = -11.644, df = 192, p-value < 2.2e-16
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## -0.7190609 -0.5525618
## sample estimates:
## cor
## -0.6433565

Step 04:Visualize the Relationship (Scatter Plot)

file:///E:/Statistics/Exercises/Assignment-06.html 8/16
11/29/24, 12:41 PM R Notebook

library(ggpubr)
ggscatter(
ICM_clean, x = "NegativeMood", y = "PositiveMood",
color = "#1f77b4",
add = "reg.line", conf.int = TRUE,
cor.coef = TRUE, cor.method = "pearson",
xlab = "Negative Mood", ylab = "Positive Mood"
)

Step-05:Shapiro-Wilk tests

# Shapiro-Wilk Test for Normality

shapiro_negative <- shapiro.test(ICM_clean$NegativeMood)
cat("Shapiro-Wilk Test for Negative Mood:\n")

## Shapiro-Wilk Test for Negative Mood:

print(shapiro_negative)

file:///E:/Statistics/Exercises/Assignment-06.html 9/16
11/29/24, 12:41 PM R Notebook

##
## Shapiro-Wilk normality test
##
## data: ICM_clean$NegativeMood
## W = 0.97664, p-value = 0.002498

shapiro_positive <- shapiro.test(ICM_clean$PositiveMood)

cat("Shapiro-Wilk Test for Positive Mood:\n")

## Shapiro-Wilk Test for Positive Mood:

print(shapiro_positive)

##
## Shapiro-Wilk normality test
##
## data: ICM_clean$PositiveMood
## W = 0.98441, p-value = 0.03015

Step-06:Q-Q Plot

# Q-Q plot for Negative Mood

ggqqplot(ICM_clean$NegativeMood, ylab = "Negative Mood", color = "#1f77b4")

file:///E:/Statistics/Exercises/Assignment-06.html 10/16
11/29/24, 12:41 PM R Notebook

# Q-Q plot for Positive Mood

ggqqplot(ICM_clean$PositiveMood, ylab = "Positive Mood", color = "#1f77b4", )

file:///E:/Statistics/Exercises/Assignment-06.html 11/16
11/29/24, 12:41 PM R Notebook

Conclusion

#Test for significance of the correlation.

#Null Hypothesis (H0):The data is normally distributed.
# Alternative Hypothesis(H1):The data is not normally distributed.

# Shapiro-Wilk Test for Negative Mood: As p<0.05 , W = 0.97664, p-value = 0.002498,we reject the
null hypothesis. The data for Negative Mood is not normally distributed.
# Shapiro-Wilk Test for Positive Mood: As p<0.05 , W = 0.98441, p-value = 0.03015,we reject the
null hypothesis. The data for Positive Mood is not normally distributed.

Exercise-79
Step-01:Load the Dataset

# Load the students dataset

students <- read.delim("E:\\Statistics\\Datasets\\Students.txt", stringsAsFactors = FALSE)

# View the structure of the dataset to identify the columns for weight and height
str(students)

file:///E:/Statistics/Exercises/Assignment-06.html 12/16
11/29/24, 12:41 PM R Notebook

## 'data.frame': 82 obs. of 13 variables:

## $ ID : int 24 5 54 9 34 52 12 16 32 59 ...
## $ Sex : chr "M" "M" "F" "M" ...
## $ Sex_coded : int 0 0 1 0 1 1 0 0 1 1 ...
## $ Blood_group : chr "0" "0" "A" "0" ...
## $ Blood_group_coded : int 0 0 1 0 1 0 0 1 0 1 ...
## $ Rhesus_factor : chr "+" "+" "+" "+" ...
## $ Rhesus_factor_coded: int 1 1 1 1 1 1 1 1 1 0 ...
## $ Smoking : chr "no" "no" "no" "no" ...
## $ Smoking_coded : int 0 0 0 0 0 1 1 1 0 0 ...
## $ Size_cm : int 190 187 171 185 166 164 184 187 163 170 ...
## $ Weight_kg : int 98 81 54 70 53 55 74 75 46 63 ...
## $ Points_exam : int 1 2 2 3 3 3 4 4 4 4 ...
## $ Grade : int 5 5 5 5 5 5 5 5 5 5 ...

# View the first few rows of the dataset to check the data
head(students)

ID S… Sex_co… Blood_group Blood_group_coded Rhesus_factor Rhesus_factor_coded Sm

<int><chr> <int> <chr> <int> <chr> <int> <c

1 24 M 0 0 0 + 1 no

2 5 M 0 0 0 + 1 no

3 54 F 1 A 1 + 1 no

4 9 M 0 0 0 + 1 no

5 34 F 1 A 1 + 1 no

6 52 F 1 0 0 + 1 ye

6 rows | 1-9 of 14 columns

Step-02:Calculate Spearman’s rho

# Calculate Spearman's rank correlation coefficient between body weight and body height
spearman_corr <- cor(students$Weight_kg, students$Size_cm, method = "spearman")

# Display the Spearman correlation coefficient

cat("Spearman's rho:", spearman_corr, "\n")

## Spearman's rho: 0.7740172

Step-03:Test for Significance

# Perform the Spearman correlation test

cor_test <- cor.test(students$Weight_kg, students$Size_cm, method = "spearman")

file:///E:/Statistics/Exercises/Assignment-06.html 13/16
11/29/24, 12:41 PM R Notebook

## Warning in cor.test.default(students$Weight_kg, students$Size_cm, method =

## "spearman"): Cannot compute exact p-value with ties

# Print the result of the correlation test

cat("Spearman's rank correlation test result:\n")

## Spearman's rank correlation test result:

print(cor_test)

##
## Spearman's rank correlation rho
##
## data: students$Weight_kg and students$Size_cm
## S = 20764, p-value < 2.2e-16
## alternative hypothesis: true rho is not equal to 0
## sample estimates:
## rho
## 0.7740172

Conclusion

# Test for significance of the correlation

# Null Hypothesis (H0):There is no monotonic relationship between body weight and body height (p
=0)
# Alternative Hypothesis (H1):There is a monotonic relationship between body weight and body hei
ght (p is not equal to 0)

#S = 20764, p-value < 2.2e-16,as the p-value is less than 0.05, we reject the null hypothesis. T
herefore, we conclude that there is a statistically significant monotonic relationship between b
ody weight and body height with a Spearman’s rho of p=0.7740172.

Exercise-80
Step-01:Load and view the Dataset

ICM <- read.delim("E:\\Statistics\\Datasets\\ICM.txt", stringsAsFactors = FALSE)

# View the structure of the data to identify the columns for NegativeMood and OHS
str(ICM)

file:///E:/Statistics/Exercises/Assignment-06.html 14/16
11/29/24, 12:41 PM R Notebook

## 'data.frame': 199 obs. of 23 variables:

# View the first few rows of the dataset to check the data
head(ICM)

ID Gen… A… Englishfluent Germanfluent Transport Highest_level_of_education

1 75 female 22 yes no PublicTransport College

2 90 female 22 yes no PublicTransport College

3 173 female 37 yes yes Car University

4 189 female 17 yes yes Car none

5 100 female 19 yes yes Walk HighSchool

6 155 female 16 yes no Walk none

6 rows | 1-8 of 24 columns

Step:02-Calculate Spearman’s rho

file:///E:/Statistics/Exercises/Assignment-06.html 15/16
11/29/24, 12:41 PM R Notebook

# Remove rows with missing values in either NegativeMood or OHS

cleaned_data <- na.omit(ICM[, c("NegativeMood", "OHS")])

# Calculate Spearman's correlation on the cleaned data

spearman_corr <- cor(cleaned_data$NegativeMood, cleaned_data$OHS, method = "spearman")

# Display the Spearman correlation coefficient

cat("Spearman's rho:", spearman_corr, "\n")

## Spearman's rho: -0.5725575

Step-03:Test for Significance

# Perform the Spearman correlation test

cor_test <- cor.test(ICM$NegativeMood, ICM$OHS, method = "spearman")

## Warning in cor.test.default(ICM$NegativeMood, ICM$OHS, method = "spearman"):

## Cannot compute exact p-value with ties

# Print the result of the correlation test

cat("Spearman's rank correlation test result:\n")

## Spearman's rank correlation test result:

print(cor_test)

##
## Spearman's rank correlation rho
##
## data: ICM$NegativeMood and ICM$OHS
## S = 1453320, p-value < 2.2e-16
## alternative hypothesis: true rho is not equal to 0
## sample estimates:
## rho
## -0.5725575

Conclusion

# Test for significance of the correlation

# Null Hypothesis (H0): There is no monotonic relationship between negative mood and OHS (p=0)
# Alternative Hypothesis (H1):There is a monotonic relationship between negative mood and OHS
(p is not equal to 0)

#S = 1453320, p-value < 2.2e-16,as the p-value is less than 0.05, we reject the null hypothesis
and conclude that there is a statistically significant negative monotonic relationship between n
egative mood and OHS.

file:///E:/Statistics/Exercises/Assignment-06.html 16/16

Statistical Analysis Homework Guide
No ratings yet
Statistical Analysis Homework Guide
12 pages
ProbList5 24 SLN
No ratings yet
ProbList5 24 SLN
9 pages
Stroke Prediction Dataset
No ratings yet
Stroke Prediction Dataset
48 pages
R Programming Basics and Data Analysis
No ratings yet
R Programming Basics and Data Analysis
18 pages
Textbook Practice Problems 1
No ratings yet
Textbook Practice Problems 1
39 pages
Statistical Analysis of Health Data
No ratings yet
Statistical Analysis of Health Data
11 pages
Assignment # 07 (Updated)
No ratings yet
Assignment # 07 (Updated)
59 pages
Heart Disease Data Analysis Project
No ratings yet
Heart Disease Data Analysis Project
55 pages
R Based Project
No ratings yet
R Based Project
24 pages
Heart Disease Prediction Model
No ratings yet
Heart Disease Prediction Model
19 pages
7th Report
No ratings yet
7th Report
14 pages
R Statistical Analysis and Sampling Techniques
No ratings yet
R Statistical Analysis and Sampling Techniques
38 pages
Exploratory Data Analysis Homework
No ratings yet
Exploratory Data Analysis Homework
23 pages
Data Analysis and Modeling Techniques
No ratings yet
Data Analysis and Modeling Techniques
35 pages
An Introduction To The Psych Package: Part I: Data Entry and Data Description
No ratings yet
An Introduction To The Psych Package: Part I: Data Entry and Data Description
63 pages
Data Analysis Exam with R 2024
No ratings yet
Data Analysis Exam with R 2024
15 pages
Student Record Dataset Analysis in R
No ratings yet
Student Record Dataset Analysis in R
7 pages
Correlation Diploma
No ratings yet
Correlation Diploma
10 pages
SPSS Regression Modeling Guide
No ratings yet
SPSS Regression Modeling Guide
36 pages
R Programming Challenges for Data Analysis
No ratings yet
R Programming Challenges for Data Analysis
11 pages
Unit 3 Homework: Matched Pairs & ANOVA
No ratings yet
Unit 3 Homework: Matched Pairs & ANOVA
73 pages
Choosing and Performing Statistical Tests
No ratings yet
Choosing and Performing Statistical Tests
7 pages
Understanding Parametric and Non-Parametric Tests
No ratings yet
Understanding Parametric and Non-Parametric Tests
29 pages
Heart Disease Prediction Model
No ratings yet
Heart Disease Prediction Model
35 pages
Healthcare Analytics
No ratings yet
Healthcare Analytics
72 pages
R Software Data Entry and Analysis Guide
No ratings yet
R Software Data Entry and Analysis Guide
7 pages
Introduction to the psych Package
No ratings yet
Introduction to the psych Package
65 pages
ASSIGNMENT NO - 2, FDAS - SUMANYAKUMARI - Bfia
No ratings yet
ASSIGNMENT NO - 2, FDAS - SUMANYAKUMARI - Bfia
6 pages
Statistical Analysis of Body Metrics
No ratings yet
Statistical Analysis of Body Metrics
6 pages
BM-1, Applied Statistics, Lesson 2: Comparing Two Groups (And One Group)
No ratings yet
BM-1, Applied Statistics, Lesson 2: Comparing Two Groups (And One Group)
39 pages
Acupuncture Study on Lower Back Pain
No ratings yet
Acupuncture Study on Lower Back Pain
6 pages
Statistical Analysis of Teen Gambling Data
No ratings yet
Statistical Analysis of Teen Gambling Data
8 pages
R Statistical Measures for Hospital Data
No ratings yet
R Statistical Measures for Hospital Data
34 pages
R Cheat Sheet
No ratings yet
R Cheat Sheet
9 pages
R Data Management and Statistical Functions
No ratings yet
R Data Management and Statistical Functions
4 pages
Stata Commands for Data Analysis
No ratings yet
Stata Commands for Data Analysis
8 pages
Psychometrics Data Analysis Guide
No ratings yet
Psychometrics Data Analysis Guide
13 pages
MSc Epidemiology: Mixed Models Exercises
No ratings yet
MSc Epidemiology: Mixed Models Exercises
26 pages
Data Analysis of Health Metrics
No ratings yet
Data Analysis of Health Metrics
12 pages
PH6205 RTutorial 2
No ratings yet
PH6205 RTutorial 2
15 pages
Statistical Methods in Laboratory Evaluation
No ratings yet
Statistical Methods in Laboratory Evaluation
46 pages
Data Analysis in Animal Sciences
No ratings yet
Data Analysis in Animal Sciences
69 pages
ANOVA Analysis of Chicken and Bovine Data
No ratings yet
ANOVA Analysis of Chicken and Bovine Data
44 pages
SQQS1013 Elementary Statistics Group Assignment
0% (1)
SQQS1013 Elementary Statistics Group Assignment
13 pages
Statistical Modelling Lab Solutions
No ratings yet
Statistical Modelling Lab Solutions
14 pages
R Commands
No ratings yet
R Commands
5 pages
Unit 2 Homework: T-Tests & ANOVA
No ratings yet
Unit 2 Homework: T-Tests & ANOVA
48 pages
t-Test Analysis for Statistical Data
No ratings yet
t-Test Analysis for Statistical Data
23 pages
Statistical Tests and R Commands Guide
No ratings yet
Statistical Tests and R Commands Guide
5 pages
Biostat MBBS Project Final 231118 133415
No ratings yet
Biostat MBBS Project Final 231118 133415
51 pages
Analyzing BRFSS Data in R
No ratings yet
Analyzing BRFSS Data in R
7 pages
Quiz 2 Solution Id 22070144
No ratings yet
Quiz 2 Solution Id 22070144
10 pages
Exercise Solutions
No ratings yet
Exercise Solutions
30 pages
Abhay Biostats Assignment
No ratings yet
Abhay Biostats Assignment
11 pages
CB161 (R Lab Manual)
No ratings yet
CB161 (R Lab Manual)
32 pages
Chi-Squared Tests on Demographics and Health
No ratings yet
Chi-Squared Tests on Demographics and Health
10 pages
Summary Statistics and Data Analysis in R
No ratings yet
Summary Statistics and Data Analysis in R
11 pages
C8203 IRDA Class Support Handbook
No ratings yet
C8203 IRDA Class Support Handbook
53 pages
Python Functions for Various Tasks
No ratings yet
Python Functions for Various Tasks
2 pages
Cloud Computing Question Bank Guide
No ratings yet
Cloud Computing Question Bank Guide
5 pages
Quick Start Guide 5
No ratings yet
Quick Start Guide 5
2 pages
(The Only Proper) PDO Tutorial - Treating PHP Delusions PDF
No ratings yet
(The Only Proper) PDO Tutorial - Treating PHP Delusions PDF
121 pages
P7 User Manual Guide
No ratings yet
P7 User Manual Guide
7 pages
Southern Seminary Manual of Style 5.0
No ratings yet
Southern Seminary Manual of Style 5.0
98 pages
Iphone 13 Mini - Google Search
No ratings yet
Iphone 13 Mini - Google Search
1 page
Radioss Modeling Best Practices
No ratings yet
Radioss Modeling Best Practices
37 pages
Appendix d3 - Coordination Procedure
No ratings yet
Appendix d3 - Coordination Procedure
15 pages
Microcontroller Programming Basics
No ratings yet
Microcontroller Programming Basics
23 pages
Load Flow Analysis Using Forward and Backward Sweep, and Minimising Power Losses Using Genetic Algorithm
No ratings yet
Load Flow Analysis Using Forward and Backward Sweep, and Minimising Power Losses Using Genetic Algorithm
10 pages
Westinghouse Low-voltagePowerCircuitBreakers PDF
No ratings yet
Westinghouse Low-voltagePowerCircuitBreakers PDF
21 pages
Companion 3 Multimedia Speaker System: Product Description
No ratings yet
Companion 3 Multimedia Speaker System: Product Description
30 pages
Understanding Inheritance in C++
No ratings yet
Understanding Inheritance in C++
16 pages
Lecture 5 - Sorting and Order Statistics
No ratings yet
Lecture 5 - Sorting and Order Statistics
44 pages
Bank Database Design Homework Guide
No ratings yet
Bank Database Design Homework Guide
1 page
Manjunath B.S., Salembier P., Sikora T. - Introduction To MPEG 7. Multimedia Content Description Language
No ratings yet
Manjunath B.S., Salembier P., Sikora T. - Introduction To MPEG 7. Multimedia Content Description Language
400 pages
Event Management System Project Report
No ratings yet
Event Management System Project Report
95 pages
Bora Comfort
No ratings yet
Bora Comfort
17 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Site Quality Plan A-FA-I-001 Overview
No ratings yet
Site Quality Plan A-FA-I-001 Overview
8 pages
Best Practices
No ratings yet
Best Practices
13 pages
O RAN - WG1.O RAN Architecture Description v04.00
100% (1)
O RAN - WG1.O RAN Architecture Description v04.00
34 pages
Windows Phone 8S by HTC User Guide
No ratings yet
Windows Phone 8S by HTC User Guide
102 pages
C Programming Methodology Exam Guide
No ratings yet
C Programming Methodology Exam Guide
7 pages
ECDISNEWCEDRIC
No ratings yet
ECDISNEWCEDRIC
16 pages
Writing & Tracking Test Cases Guide
100% (1)
Writing & Tracking Test Cases Guide
3 pages
03-02-2023-1675414201-10th Maths em QR Code Questions by Way To Success Teachers Team
No ratings yet
03-02-2023-1675414201-10th Maths em QR Code Questions by Way To Success Teachers Team
33 pages
20240621060452CopyGame Log
No ratings yet
20240621060452CopyGame Log
25 pages
Four Alternative Input Devices
No ratings yet
Four Alternative Input Devices
7 pages

Assignment# 06

Uploaded by

Assignment# 06

Uploaded by

11/29/24, 12:41 PM R Notebook

Step-02:Check the dataset structure

## ID Sex Sex_coded Blood_group

Step-03:Calculate the Pearson correlation coefficient

correlation <- cor(students$Weight_kg, students$Size_cm, method = "pearson")

## Pearson Correlation Coefficient: 0.7790491

# **Is there any linear relationship between the variables?**

# Hypotheses for Pearson Correlation:

Step-04:Visualize the Relationship of Scatter

## Loading required package: ggplot2

# Shapiro-Wilk test for body weight

## Shapiro-Wilk Test for Weight:

cat("W-statistic:", shapiro_weight$statistic, "\n")

cat("p-value:", shapiro_weight$p.value, "\n")

# Shapiro-Wilk test for body height

## Shapiro-Wilk Test for Height:

cat("W-statistic:", shapiro_height$statistic, "\n")

cat("p-value:", shapiro_height$p.value, "\n")

# Q-Q plot for body weight

# Q-Q plot for body height

# Arrange the plots side by side

# **Test for significance of the correlation**

#Hypotheses for Shapiro-Wilk Test

#Shapiro-Wilk Test for Weight: W-statistic: 0.9195322,p-value: 7.40539e-05, as p<0.05 we reject

# Load the ICM dataset

## 'data.frame': 199 obs. of 23 variables:

# View the first few rows to check the data

ID Gen… A… Englishfluent Germanfluent Transport Highest_level_of_education

1 75 female 22 yes no PublicTransport College

2 90 female 22 yes no PublicTransport College

3 173 female 37 yes yes Car University

4 189 female 17 yes yes Car none

5 100 female 19 yes yes Walk HighSchool

6 155 female 16 yes no Walk none

6 rows | 1-8 of 24 columns

Step 02:Check for missing values

# Check the number of missing values in both columns

ICM_clean <- na.omit(ICM[, c("NegativeMood", "PositiveMood")])

## Pearson Correlation Coefficient: -0.6433565

# **Is there any linear relationship between the variables?**

# As p<0.05, we reject the null hypothesis.There is a statistically significant negative linear

Step 03:Test for Significance

cor_test <- cor.test(ICM_clean$NegativeMood, ICM_clean$PositiveMood, method = "pearson")

## Pearson Correlation Test:

Step 04:Visualize the Relationship (Scatter Plot)

# Shapiro-Wilk Test for Normality

## Shapiro-Wilk Test for Negative Mood:

shapiro_positive <- shapiro.test(ICM_clean$PositiveMood)

## Shapiro-Wilk Test for Positive Mood:

# Q-Q plot for Negative Mood

# Q-Q plot for Positive Mood

#Test for significance of the correlation.

# Load the students dataset

## 'data.frame': 82 obs. of 13 variables:

ID S… Sex_co… Blood_group Blood_group_coded Rhesus_factor Rhesus_factor_coded Sm

6 rows | 1-9 of 14 columns

Step-02:Calculate Spearman’s rho

# Display the Spearman correlation coefficient

## Spearman's rho: 0.7740172

Step-03:Test for Significance

# Perform the Spearman correlation test

## Warning in cor.test.default(students$Weight_kg, students$Size_cm, method =

# Print the result of the correlation test

## Spearman's rank correlation test result:

# **Test for significance of the correlation**

ICM <- read.delim("E:\\Statistics\\Datasets\\ICM.txt", stringsAsFactors = FALSE)

## 'data.frame': 199 obs. of 23 variables:

ID Gen… A… Englishfluent Germanfluent Transport Highest_level_of_education

1 75 female 22 yes no PublicTransport College

2 90 female 22 yes no PublicTransport College

3 173 female 37 yes yes Car University

4 189 female 17 yes yes Car none

5 100 female 19 yes yes Walk HighSchool

6 155 female 16 yes no Walk none

6 rows | 1-8 of 24 columns

Step:02-Calculate Spearman’s rho

# Is there any linear relationship between the variables?

# Test for significance of the correlation

# Is there any linear relationship between the variables?

# Test for significance of the correlation

# Test for significance of the correlation