Project name:
STREAMS
Instructor: Assoc. Prof. Dr. Kürşad Tosun
Statistical Computing CENG 3004
Ramadan Şanlı
120709042
Yavuz Selim ABAZAOĞLU
120709012
MARCH 09, 2016
Project Streams?
Researchers collected samples of water from 163 streams in the Adirondack
Mountains to investigate effects of acidic rain. They recorded a number of biological,
chemical, and physical variables, including the stream name, the substrate of the
stream (the composition of soil and rock over which they flow – limestone, shale, or
mixed, the pH (acidity) of the water, the temperature (° C) of the water, the BCI ( a
measure of biological diversity), and hardness of the water.
1. What does a scatterplot show about relationship between the water’s pH and its
hardness?
2.
2. Is it appropriate to summarize the strength of association with a correlation?
Explain.
3. Make a side by side boxplot of the pH of the streams by substrate(limestone,
mixed, or shale)
4. Describe what you see in the comparative boxplots. A lower pH means the water
is more acidic.
5. In this question you will compare the mean pH level of streams with limestone
and shale substrates by using two sample t-test.
a. Do you think the assumptions for inference are met (Independent Group
Assumption, Independent Assumption, and Nearly normal Condition?) Explain.
b. Perform a 4-step hypothesis test (two sample t-test)?
6. Make a scatterplot of BCI against pH. Describe what you see in the scatterplot .
7. Is there any evidence of an association between BCI and pH?
a. State the null and alternative hypothesis for this test.
b. Make a histogram of the residuals, Normal qq plot of residuals, and a
residuals plot (predicted vs residuals). Do you think the assumptions for the
inference are met(straight enough condition, Independence assumption, Does the
plot thicken? Condition, Nearly normal Condition)? Explain
c. State a conclusion based on the regression analysis( use an appropriate
4-step test.
8. Write the equation of the regression line to predict BCI from pH.
9.Predict the BCI if the pH is 7.2.????????????
Statistical Computing
Project Streams final report
1)
> plot(PH,HARD,main = "Scatterplot between Ph and Hardness",ylim = c(30,520))
2)
> abline(lm(HARD~PH),col=4)
> cor(PH,HARD)
[1] 0.709633
There is positive and strong correlation between Ph and hardness and it has some outliers.
3)
> substrates =split(PH, SUB)
> boxplot(substrates)
4) Here on the bottom of this picture L represents “limestone”, M represents “mixed”, and S represents
“shale”.
This picture shows that Limestone streams are less acidic. Mixed streams pH levels are nearly between
7.0 and 7.5 and Shale streams are more acidic.
5)
> L = substrates["L"] //”Type = list” We can not apply this code to t-test because it doesn’t return double type.
> L = PH[SUB == "L"] //Type = double
> S = PH[SUB == "S"] //Type = double
> M = PH[SUB == "M"] //Type = double
> t.test(L,s,alt=c("greater"))
Welch Two Sample t-test
data: L and s
t = 16.303, df = 133.13, p-value < 2.2e-16
alternative hypothesis: true difference in means is greater than 0
95 percent confidence interval:
0.6607197 Inf
sample estimates:
mean of x mean of y
7.681818 6.946377
a) This assumption is independent assumption. This streams are occurred in the same acidic rains but
their substrate are different.
b) 1. H0: μL – μS = 0 HA: μL – μS > 0
2. t = 16.303
3. p-value = 2.2 x 10−16
4. Because of the small P-value, we reject H0. The mean reading
comprehension score for the streams with Limestone is
significantly higher than the mean score for the shale.
6)
> plot(BCI~PH)
> abline(lm(BCI~PH),col=4)
This graph show that negative correlation between Ph and hardness biological diversity and it
has some outliers.
7)Kanıt var mı?
a)#Hipotez Kurulacak?#
b)
> Residual = resid(lm(BCI~PH))
> hist(Residual,main = "Histogram of residual(PH,BCI)")
> qqnorm(Residual,main = "Normal QQ Plot of residual")
> BCI.rmna <- na.omit(BCI)
> plot(BCI.rmna,Residual,xlab = "BCI",ylab = "Residual",main="Residual Plot BCI")
> abline(0,0)
This plot is thicken because of histogram of residuals is support this.
c)
8)
> coeffs = coefficients(lm(BCI~PH))
> coeffs
(Intercept) PH
2733.3662 -197.6937
9)
> predict.ph = coeffs[1] + coeffs[2]*7.2
> predict.ph
(Intercept)
1309.972
THANK YOU!

More Related Content

PDF
An Offshore Natural Gas Transmission Pipeline Model and Analysis for the Pred...
PDF
Advances in Rock Physics Modelling and Improved Estimation of CO2 Saturation,...
PDF
East Palatka_Nitrogen Modeling_Report
PDF
Measurement of slap2 and gisp 17 o and proposed vsmow slap normalization
DOCX
Lab 3 408 Marc VanPuymbrouck
PDF
Mesoscopic simulation of incompressible fluid flow in porous media
PDF
CO2QUEST - Fluid Properties and phase behaviour of CO2 with impurities - Geor...
PPTX
To Generate LLE data for Ternary system
An Offshore Natural Gas Transmission Pipeline Model and Analysis for the Pred...
Advances in Rock Physics Modelling and Improved Estimation of CO2 Saturation,...
East Palatka_Nitrogen Modeling_Report
Measurement of slap2 and gisp 17 o and proposed vsmow slap normalization
Lab 3 408 Marc VanPuymbrouck
Mesoscopic simulation of incompressible fluid flow in porous media
CO2QUEST - Fluid Properties and phase behaviour of CO2 with impurities - Geor...
To Generate LLE data for Ternary system

What's hot (19)

PDF
Mesoscopic simulation of incompressible fluid flow in
PDF
Pore scale dynamics and the interpretation of flow processes - Martin Blunt, ...
PDF
Oil and gas reserves sensitivity to log evaluation
PDF
A comparison of_two-phase_inflow_perform
PPT
Thesis_Presentation
DOCX
145_7
PDF
Electro kinetic fractal dimension for characterizing shajara reservoirs
PDF
Comsol conference presentation
PDF
Impact of Equation of State on Simulating CO2 Pipeline Decompression, Solomon...
DOCX
Chuyển động sinh học ở điều kiện số Reynolds
PDF
Research reproducibility - Code etc.
PPT
Dr. Mohammed Hamoda - Composting of Mixtures of Municipal Solid Wastes and Se...
PDF
PDF
Pressure head fractal dimension for characterizing shajara reservoirs of the ...
PDF
Numerical Simulation of Theories for Gas Absorption with Chemical Reaction
PDF
PDF
Applications of Differential Equations in Petroleum Engineering
PDF
Briaud2001
PPT
Ultracentrifugation Pervushin
Mesoscopic simulation of incompressible fluid flow in
Pore scale dynamics and the interpretation of flow processes - Martin Blunt, ...
Oil and gas reserves sensitivity to log evaluation
A comparison of_two-phase_inflow_perform
Thesis_Presentation
145_7
Electro kinetic fractal dimension for characterizing shajara reservoirs
Comsol conference presentation
Impact of Equation of State on Simulating CO2 Pipeline Decompression, Solomon...
Chuyển động sinh học ở điều kiện số Reynolds
Research reproducibility - Code etc.
Dr. Mohammed Hamoda - Composting of Mixtures of Municipal Solid Wastes and Se...
Pressure head fractal dimension for characterizing shajara reservoirs of the ...
Numerical Simulation of Theories for Gas Absorption with Chemical Reaction
Applications of Differential Equations in Petroleum Engineering
Briaud2001
Ultracentrifugation Pervushin
Ad

More from ramadansanli (12)

PPTX
UNIVERSITETI I PRISHTINËS ramadan.pptx
PDF
Celonis Snap Presentation Final Ramadan SANLI.pdf
PPTX
Ramadan Sanli kompjuteri
PPTX
Java ile programlamaya giris ramadan sanli
PPTX
Virusler
PPTX
Huazimet nga turqishtja
DOCX
RUMELİ'DEN ÇIKTIK YOLA ADLI ROMAN HAKKINDA DÜŞÜNCELER
DOCX
Resit Hanadan
PPTX
Smart searcher1
PPTX
Bioinformatics ramadan_sanli
PPTX
Linux vs windows
PDF
Anatomy of a super computer
UNIVERSITETI I PRISHTINËS ramadan.pptx
Celonis Snap Presentation Final Ramadan SANLI.pdf
Ramadan Sanli kompjuteri
Java ile programlamaya giris ramadan sanli
Virusler
Huazimet nga turqishtja
RUMELİ'DEN ÇIKTIK YOLA ADLI ROMAN HAKKINDA DÜŞÜNCELER
Resit Hanadan
Smart searcher1
Bioinformatics ramadan_sanli
Linux vs windows
Anatomy of a super computer
Ad

Recently uploaded (20)

PPTX
TORCH INFECTIONS in pregnancy with toxoplasma
PPTX
Platelet disorders - thrombocytopenia.pptx
PDF
THE-VITAL-ROLE-OF-MITOCHONDRIAL-RESPIRATION-IN-PLANT-GROWTH-AND-DEVELOPMENT.pdf
PPTX
endocrine - management of adrenal incidentaloma.pptx
PPTX
Spectroscopic Techniques for M Tech Civil Engineerin .pptx
PPTX
02_OpenStax_Chemistry_Slides_20180406 copy.pptx
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PDF
Sujay Rao Mandavilli IJISRT25AUG764 context based approaches to population ma...
PPTX
Preformulation.pptx Preformulation studies-Including all parameter
PPTX
BPharm_Hospital_Organization_Complete_PPT.pptx
PPTX
A powerpoint on colorectal cancer with brief background
PPT
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
PPTX
Introduction to Immunology (Unit-1).pptx
PDF
Science Form five needed shit SCIENEce so
PPTX
Substance Disorders- part different drugs change body
PPTX
Understanding the Circulatory System……..
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PPT
Enhancing Laboratory Quality Through ISO 15189 Compliance
PDF
Sumer, Akkad and the mythology of the Toradja Sa'dan.pdf
PDF
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
TORCH INFECTIONS in pregnancy with toxoplasma
Platelet disorders - thrombocytopenia.pptx
THE-VITAL-ROLE-OF-MITOCHONDRIAL-RESPIRATION-IN-PLANT-GROWTH-AND-DEVELOPMENT.pdf
endocrine - management of adrenal incidentaloma.pptx
Spectroscopic Techniques for M Tech Civil Engineerin .pptx
02_OpenStax_Chemistry_Slides_20180406 copy.pptx
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
Sujay Rao Mandavilli IJISRT25AUG764 context based approaches to population ma...
Preformulation.pptx Preformulation studies-Including all parameter
BPharm_Hospital_Organization_Complete_PPT.pptx
A powerpoint on colorectal cancer with brief background
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
Introduction to Immunology (Unit-1).pptx
Science Form five needed shit SCIENEce so
Substance Disorders- part different drugs change body
Understanding the Circulatory System……..
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
Enhancing Laboratory Quality Through ISO 15189 Compliance
Sumer, Akkad and the mythology of the Toradja Sa'dan.pdf
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...

Streams

  • 1. Project name: STREAMS Instructor: Assoc. Prof. Dr. Kürşad Tosun Statistical Computing CENG 3004 Ramadan Şanlı 120709042 Yavuz Selim ABAZAOĞLU 120709012 MARCH 09, 2016
  • 2. Project Streams? Researchers collected samples of water from 163 streams in the Adirondack Mountains to investigate effects of acidic rain. They recorded a number of biological, chemical, and physical variables, including the stream name, the substrate of the stream (the composition of soil and rock over which they flow – limestone, shale, or mixed, the pH (acidity) of the water, the temperature (° C) of the water, the BCI ( a measure of biological diversity), and hardness of the water. 1. What does a scatterplot show about relationship between the water’s pH and its hardness? 2. 2. Is it appropriate to summarize the strength of association with a correlation? Explain. 3. Make a side by side boxplot of the pH of the streams by substrate(limestone, mixed, or shale) 4. Describe what you see in the comparative boxplots. A lower pH means the water is more acidic. 5. In this question you will compare the mean pH level of streams with limestone and shale substrates by using two sample t-test. a. Do you think the assumptions for inference are met (Independent Group Assumption, Independent Assumption, and Nearly normal Condition?) Explain. b. Perform a 4-step hypothesis test (two sample t-test)? 6. Make a scatterplot of BCI against pH. Describe what you see in the scatterplot . 7. Is there any evidence of an association between BCI and pH? a. State the null and alternative hypothesis for this test. b. Make a histogram of the residuals, Normal qq plot of residuals, and a residuals plot (predicted vs residuals). Do you think the assumptions for the inference are met(straight enough condition, Independence assumption, Does the plot thicken? Condition, Nearly normal Condition)? Explain c. State a conclusion based on the regression analysis( use an appropriate 4-step test.
  • 3. 8. Write the equation of the regression line to predict BCI from pH. 9.Predict the BCI if the pH is 7.2.???????????? Statistical Computing Project Streams final report 1) > plot(PH,HARD,main = "Scatterplot between Ph and Hardness",ylim = c(30,520))
  • 4. 2) > abline(lm(HARD~PH),col=4) > cor(PH,HARD) [1] 0.709633 There is positive and strong correlation between Ph and hardness and it has some outliers.
  • 5. 3) > substrates =split(PH, SUB) > boxplot(substrates) 4) Here on the bottom of this picture L represents “limestone”, M represents “mixed”, and S represents “shale”. This picture shows that Limestone streams are less acidic. Mixed streams pH levels are nearly between 7.0 and 7.5 and Shale streams are more acidic. 5) > L = substrates["L"] //”Type = list” We can not apply this code to t-test because it doesn’t return double type. > L = PH[SUB == "L"] //Type = double > S = PH[SUB == "S"] //Type = double > M = PH[SUB == "M"] //Type = double > t.test(L,s,alt=c("greater")) Welch Two Sample t-test data: L and s t = 16.303, df = 133.13, p-value < 2.2e-16 alternative hypothesis: true difference in means is greater than 0 95 percent confidence interval: 0.6607197 Inf sample estimates:
  • 6. mean of x mean of y 7.681818 6.946377 a) This assumption is independent assumption. This streams are occurred in the same acidic rains but their substrate are different. b) 1. H0: μL – μS = 0 HA: μL – μS > 0 2. t = 16.303 3. p-value = 2.2 x 10−16 4. Because of the small P-value, we reject H0. The mean reading comprehension score for the streams with Limestone is significantly higher than the mean score for the shale. 6) > plot(BCI~PH) > abline(lm(BCI~PH),col=4) This graph show that negative correlation between Ph and hardness biological diversity and it has some outliers. 7)Kanıt var mı?
  • 7. a)#Hipotez Kurulacak?# b) > Residual = resid(lm(BCI~PH)) > hist(Residual,main = "Histogram of residual(PH,BCI)") > qqnorm(Residual,main = "Normal QQ Plot of residual")
  • 8. > BCI.rmna <- na.omit(BCI) > plot(BCI.rmna,Residual,xlab = "BCI",ylab = "Residual",main="Residual Plot BCI") > abline(0,0) This plot is thicken because of histogram of residuals is support this. c) 8) > coeffs = coefficients(lm(BCI~PH)) > coeffs (Intercept) PH 2733.3662 -197.6937 9) > predict.ph = coeffs[1] + coeffs[2]*7.2 > predict.ph (Intercept) 1309.972