0% found this document useful (0 votes)

41 views7 pages

Intro To R Software

outline of commands for simple basics stat like mean, standard deviation, anova, t-tests, correlation and regression analysis on R crane software

Uploaded by

Nabeela Ijaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views7 pages

Intro To R Software

outline of commands for simple basics stat like mean, standard deviation, anova, t-tests, correlation and regression analysis on R crane software

Uploaded by

Nabeela Ijaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Intro to R software:

>data entry

bt=c(1,2,3,4,5,5,4,2,3,1)

>length(nameofdata)

this command gives 'n' or 'number of observations

>nrow(nameofdatatable)

-to check number of rows in data table

>ncol(nameofdatatable)

-to check number of column in data table (data.frame)

>dataname[,-1]

-this command is used to remove column

>dataname[c(2,17,8),]

-this command is used to locate multiple rows data

>dataname[,c(6,4,12)]

-this command is used to locate different column data

>dataname[1:10,]

-this command gives rows from 1 to 10

>dataname[,1:10]

-this command gives coloumn from 1 to 10

>data.frame(nameofdata,variables)

this command gives data in tabular form

>nameofdata[,1]

-this command locate column in give data table

>mean(nameofdata)

to find mean of sindle variable,data

>mean(nameofdatatable[,2])

another method of calculating mean

>mean(nameofdatatable[,2][1:50])

-to calculate mean for column 2 from entries 1 to 50

>apply(datatablename,2,mean), same for median,sd,var

to find collective mean of data

2 position for coloumn

1 position for row

>aggregate(datatablename[,5]~datatablename[,3],data=datatablename,FUN=mean)

-this commands gives mean against each category in datatable

-place numeric column first then category column later

-~ 'tilta' sign is used to link one column to another

>aggregate(datatablename[,5]~datatablename[,3],data=datatablename,FUN=sd)

-to get sd against each category

>aggregate(datatablename[,5]~datatablename[,3],data=datatablename,FUN=summary)

-to get summary of these column

>sort(dataname)

to arrange data in ascending order

>unique(dataname)

to exclude repitition of same values

>plot(dataname)

graphical representation of data in Scatter plot

>plot(dataname,col="green")

to add colors in graph

>plot(dataname,col="red",xlab="bodytemperature",ylab="weights")

to add names on graph

>plot(dataname,col="red",xlab="bodytemperature",ylab="weights",main="graph of bt and weights")

to add name of graph

>boxplot(dataname,col="red",xlab="bt",ylab="weights",main="graph")

this command gives boxplot of data

>summary(nameofdataordatatable)

this commands gives information and range of data or quartiles

category and length of datatable

>apply(nameofdata,2,mean,na.rm=T)

if we have different lengths of data then we add "NA" in missing

positions to equal lengths of data

then in computing mean add "na.rm=T" command to remove NA during calculation

>data()

this commands gives us datasets already available in R software

>head(nameofdata)

this commands gives us some first values from data

>tail(nameofdata)

this command gives us some last values from data

>str(nameofdata)

-this command gives structure of dataset

-like number of observations/ number of variables

>getwd()

this command gives the directory of R

>setwd("D:/Nabeela/mphil 2 semester/stat/data sets")

this commands enters the directory where datasets are saved

>namethedata=read.csv("D:/Nabeela/mphil 2 semester/stat/data sets/iq_level.csv")

-this command read the file in drive saved in excel form

-but save this excel file in File Format CSV(Comma delimited)

>namethedata=read.table("D:/Nabeela/mphil 2 semester/stat/data sets/quail_partial_data.txt")

-this command import data file in R saved in notepad form

>namethedata=read.table("D:/Nabeela/mphil 2 semester/stat/data
sets/quail_partial_data.txt",header=T)

-this command removes header from the table in saved file

>windows()

-to have more windows for graphs on other windows

>par(mfrow=c(2,5))

-to divide windows to get more than one

>attach(dataname)

-this command is used to get any column from given data table

-after applying this command just type name of column you want to work

on and click enter

>nameofdata[,1]

-same as above

>nameofdata$nameofcolumn

-same as above

>cbind(nameofcolumn,nameofothercolumn)

-this command is used to bind two columnes of different datasets

>rbind(nameofcolumn,nameofothercolumn)

-this command is used to bind two coloumns and represent in row format

>table(dataname[,2])

-this command gives frequency of values in data

> plot(iris[,1],col="black",ylim=c(0,8))

> points(iris[,2],col="red")

> points(iris[,3],col="blue")

> points(iris[,4],col="green")

-to get plot and add further points on it

>givename=which(variablename=="valueofvariable")

-it split vaiable column with same values

-f=which(Maternal=="F1") OR p=Maternal[-f]

-> f

[1] 1 2 3 4 22 23 24 25 26 27 28 29 30 31 32 47 48 67 68

[20] 69 70 71 72 73 74 75 76 77 78 79 80 97 98 99 100 101 102 122

[39] 123 124 125 126 127 128 129 130 144 145 146 147 148 169 170 171 172 173 174

[58] 175 176 177 178 179 180 181 182

-> Maternal[f]

[1] "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1"
[16] "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1"

[31] "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1"

[46] "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1" "F1"

[61] "F1" "F1" "F1" "F1" "F1"

REGRESSION MODEL (SIMPLE OR LINEAR)

STEPS:

1-TO CHECK LINEARITY PLOT SCATTER PLOT BETWEEN DEPENDENT VARIABLE (Y) AND INDEPENDENT
VARIABEL (X)

2-DRAW HISTOGRAM TO CHECK NORMALITY.

3-THEN APPLY LINEAR MODEL FUNCTION IN R.

4-INTERPRET RESULTS BY TAKING SUMMARY OF Lm.

>lm(dependentvariable~independentvariable)

-this commands is for linear model function

>plot(independentvar,dependentvar)

-to get scatter plot for regresion model

>points(independentvar,yhat,col="anycolor")

-to mark points for line of best fit or regression line

>lines(independentvar,yhat,col="anycolor")

-to connect points through lines

>abline(lm(dependentvar~independentvar),col="red")

-to get regression line in scattor plot

>pred=predict(fit,newdata=nameofnewdata)

-to get prediction of new data set based on previous data lm results

>cor(nameofdata)

-to get corelation between variables

>givename=which(nameofcolumn=="nameofcategory")

-for example: Gender=which(Sex=="M")

>givename
-for example:

Gender

1 3 7 9 13 16 17 18 19 20 21 23 24

25 27 30 31 32 33 34 40 41 42 43 48 49 50 51 52 58

-this gives positions where M category is placed

>nameofcolumn[namegiven]

-for example: Gender[male]

"M" "M" "M" "M" "M" "M" "M" "M" "M" "M" "M" "M" "M" "M" "M"

-this commands give value of given data name

>cor(nameofdata[,-c(1:3)])

-this commands compute corelation but removing 1 to 3 column

-Ho=no relation ;H1=relation

> library(nameofpackage)

-to get package of test you want to perform

-for example

library(ppcor)

>pcor(nameofdata)

-to perform partial corelation

>cor(variableone,variable2,method="spearmen")

-to get rank corelation with spearmen method

>cor(variableone,variabletwo,method="kendal")

-to get rank corelation with kendal method

>cor.test(variableone,variabletwo,method="kendal")

-to get colrelation with p value

>cor(nameofdatatabel,method="spearman")

-to get table of spearman corelation analysis

>ad.test(nameofdata)

-to get anderson darling test for normality

-Ho=normal ;H1=not normal

>shapiro.test(nameofdata)

-to perform normality of test

-Ho=normal ;H1=not normal

testing of hypothesis:

TESTING OF HYPOTHESIS:

STEPS:

1:Normality

2:Homogenity

3:tests

normal and homogenous t-test, var true

normal and non homogenous t-test, var false

non normal wilcox test

Paired data t-test paired ture

#########################################AFTER MIDS ###########################

>dataname=rep(1:4,each=5)

-this command is used to repeat 1 to 4 counting 5 times

-for example treatment= (1,1,1,1,1,2,2,2,2,2,3,3,3,3,3,4,4,4,4,4)

>overallmean=mean(name of data $ treatment output name )

-to caculate overall mean

>txmean=tapply(name of data $ treatment output name , name of data $ treatment name ,mean)

-to calculate means of treatments individually

>duncanTest(fitt)

-to apply duncan test

>tukeyHSD(fitt)

-to apply tuckey honest significant difference test

R Syntax Examples 1
No ratings yet
R Syntax Examples 1
6 pages
Essential R Commands Guide
No ratings yet
Essential R Commands Guide
11 pages
BAN5
No ratings yet
BAN5
2 pages
R Course
No ratings yet
R Course
7 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
8 pages
Standard Deviation in RStudio Guide
No ratings yet
Standard Deviation in RStudio Guide
10 pages
Cours BI - R
No ratings yet
Cours BI - R
18 pages
R Programming Basics and Data Analysis
No ratings yet
R Programming Basics and Data Analysis
18 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
A Short List of Some Useful R Commands: Input and Display
No ratings yet
A Short List of Some Useful R Commands: Input and Display
2 pages
Essential R Studio Commands Guide
No ratings yet
Essential R Studio Commands Guide
5 pages
R File Code
No ratings yet
R File Code
16 pages
R Commands
No ratings yet
R Commands
5 pages
Summary of R Commands For Statistics 100
No ratings yet
Summary of R Commands For Statistics 100
3 pages
R Codes
No ratings yet
R Codes
5 pages
Rintro
No ratings yet
Rintro
42 pages
Simple Tutorial in R
No ratings yet
Simple Tutorial in R
15 pages
UL2
No ratings yet
UL2
2 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
R Lecture 2-1
No ratings yet
R Lecture 2-1
28 pages
R Cheat Sheet
No ratings yet
R Cheat Sheet
9 pages
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
No ratings yet
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
15 pages
Ds
No ratings yet
Ds
2 pages
R Reference Card
No ratings yet
R Reference Card
1 page
R Code
No ratings yet
R Code
9 pages
Questions With No Solutions
No ratings yet
Questions With No Solutions
20 pages
R Programming Basics: Vectors, Matrices, Dataframes
No ratings yet
R Programming Basics: Vectors, Matrices, Dataframes
13 pages
Lecture 1
No ratings yet
Lecture 1
167 pages
R Programming
No ratings yet
R Programming
4 pages
List of Experiments
No ratings yet
List of Experiments
5 pages
Data Manipulation and Visualization in R
No ratings yet
Data Manipulation and Visualization in R
58 pages
SELF GUIDE B in R
No ratings yet
SELF GUIDE B in R
4 pages
AMDA Practical - A048
No ratings yet
AMDA Practical - A048
35 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
Data Science
No ratings yet
Data Science
20 pages
Session Set Working Directory Choose Directlry
No ratings yet
Session Set Working Directory Choose Directlry
17 pages
R Studio Cheat Sheet
No ratings yet
R Studio Cheat Sheet
6 pages
Applied Statistics MAT1011
No ratings yet
Applied Statistics MAT1011
22 pages
R
No ratings yet
R
4 pages
Resumo Adp
No ratings yet
Resumo Adp
5 pages
Data Analytic R
No ratings yet
Data Analytic R
28 pages
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
No ratings yet
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
50 pages
Module - 4 (R Training) - Basic Stats & Modeling
No ratings yet
Module - 4 (R Training) - Basic Stats & Modeling
15 pages
Stata Commands for Data Analysis
No ratings yet
Stata Commands for Data Analysis
14 pages
Data Analysis with R: Tables & Plots
No ratings yet
Data Analysis with R: Tables & Plots
13 pages
R Studio
No ratings yet
R Studio
8 pages
Lecture 10 R
No ratings yet
Lecture 10 R
117 pages
Handy R Stuff
No ratings yet
Handy R Stuff
5 pages
RStudio Tips and Common Functions Guide
No ratings yet
RStudio Tips and Common Functions Guide
7 pages
R Programming-1
No ratings yet
R Programming-1
6 pages
STAT-2450 Assignment 1: Name:, Student ID: B00
No ratings yet
STAT-2450 Assignment 1: Name:, Student ID: B00
9 pages
ASSIGNMENT NO - 2, FDAS - SUMANYAKUMARI - Bfia
No ratings yet
ASSIGNMENT NO - 2, FDAS - SUMANYAKUMARI - Bfia
6 pages
R Practicals
No ratings yet
R Practicals
32 pages
R Statistical Analysis and Sampling Techniques
No ratings yet
R Statistical Analysis and Sampling Techniques
38 pages
R Intro 2011
No ratings yet
R Intro 2011
115 pages
R Examples
No ratings yet
R Examples
56 pages
R Console
No ratings yet
R Console
6 pages
Kruskal Wallis or H-Test
No ratings yet
Kruskal Wallis or H-Test
11 pages
Generalized Elliptical Distributions Theory and Applications (Thesis) - Frahm (2004)
No ratings yet
Generalized Elliptical Distributions Theory and Applications (Thesis) - Frahm (2004)
145 pages
Buku Metode Penelitian
No ratings yet
Buku Metode Penelitian
63 pages
Project Chapter 3
No ratings yet
Project Chapter 3
2 pages
Machine Learning Test 4
100% (1)
Machine Learning Test 4
7 pages
Density CPKO
No ratings yet
Density CPKO
18 pages
The Influence of Philosophical Mentality and Spiritual Intelligence On Creativity of Employees Mediated by Organizational Commitment
No ratings yet
The Influence of Philosophical Mentality and Spiritual Intelligence On Creativity of Employees Mediated by Organizational Commitment
18 pages
Input Modeling in Discrete-Event Simulation
No ratings yet
Input Modeling in Discrete-Event Simulation
7 pages
Time Study: Avg. Observed Time (Or Actual Time (AT) )
100% (1)
Time Study: Avg. Observed Time (Or Actual Time (AT) )
8 pages
Sampling Research-Instrument Data Collection
No ratings yet
Sampling Research-Instrument Data Collection
43 pages
Confidence Intervals For Kendall's Tau-B Correlation
No ratings yet
Confidence Intervals For Kendall's Tau-B Correlation
6 pages
Statistics Exam Analysis & Solutions
No ratings yet
Statistics Exam Analysis & Solutions
1 page
Prob. & Stati. 2024 PYQ
No ratings yet
Prob. & Stati. 2024 PYQ
6 pages
Eunji (Elly) Choi - Unit 3 FRQ Review
100% (1)
Eunji (Elly) Choi - Unit 3 FRQ Review
4 pages
Bi Is The Slope of The Regression Line Which Indicates The Change in The Mean of The Probablity Bo Is The Y Intercept of The Regression Line
No ratings yet
Bi Is The Slope of The Regression Line Which Indicates The Change in The Mean of The Probablity Bo Is The Y Intercept of The Regression Line
5 pages
Business Stats for MIS Students
No ratings yet
Business Stats for MIS Students
2 pages
Learning Objectives: 3 Introduction To Statistical Quality Control, 6 Edition by Douglas C. Montgomery
No ratings yet
Learning Objectives: 3 Introduction To Statistical Quality Control, 6 Edition by Douglas C. Montgomery
24 pages
R Programming for Students
No ratings yet
R Programming for Students
3 pages
Vce Chemistry Ga23
No ratings yet
Vce Chemistry Ga23
3 pages
Week 1 Activities
No ratings yet
Week 1 Activities
5 pages
Time Series Models and Forecasting and Forecasting
No ratings yet
Time Series Models and Forecasting and Forecasting
49 pages
Week1 Introduction
No ratings yet
Week1 Introduction
36 pages
Chapter 9 Statistics Review
No ratings yet
Chapter 9 Statistics Review
6 pages
The Effect of Agility Training On Athletic Power Performance
No ratings yet
The Effect of Agility Training On Athletic Power Performance
8 pages
Session3 Quadrant2 Module5 SkewnessKutosis Notes
No ratings yet
Session3 Quadrant2 Module5 SkewnessKutosis Notes
4 pages
Random Number Tests
No ratings yet
Random Number Tests
4 pages
Critical Value Z Value Calculation
No ratings yet
Critical Value Z Value Calculation
5 pages
Statistics & Probability Worksheet
100% (1)
Statistics & Probability Worksheet
5 pages
General Linear Models in Small Area Estimation: An Assessment in Agricultural Surveys
No ratings yet
General Linear Models in Small Area Estimation: An Assessment in Agricultural Surveys
23 pages
Holt D and Smith TMF, 1979 - Post Stratification. Journal of The Royal Statistical Society. Series A (General)
No ratings yet
Holt D and Smith TMF, 1979 - Post Stratification. Journal of The Royal Statistical Society. Series A (General)
15 pages