Setting up Environment for Machine Learning with R Programming

Last Updated : 22 Jul, 2020

Machine Learning is a subset of Artificial Intelligence (AI) used to create intelligent systems that can learn without being explicitly programmed. In machine learning, we create algorithms and models that an intelligent system uses to predict outcomes based on patterns or trends observed in the given data. Instead of relying on hand-written rules, machine learning uses data together with the known outcomes for that data to infer rules, which are stored in a model. This model is then used to predict outcomes for a different set of data. In R programming, the environment for machine learning can be set up easily through RStudio.

Setting up an environment for machine learning using Anaconda

Step 1: Install Anaconda (Linux, Windows) and launch Anaconda Navigator.

Step 2: In Anaconda Navigator, click the Install button for RStudio.

Step 3: After installation, create a new environment. Anaconda will prompt you to enter a name for the new environment. Then launch RStudio.

Running R commands

Method 1: R commands can be run from the console provided in RStudio. After opening RStudio, simply type R commands into the console.

Method 2: R commands can be stored in a file and executed from an Anaconda prompt, using the following steps.

Open an Anaconda prompt
Go to the directory where the R file is located
Activate the Anaconda environment by using the command:
```
conda activate <ENVIRONMENT_NAME>
```
Run the file by using the command:
```
Rscript <FILE_NAME>.R
```
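
As an illustration of Method 2, a script file could contain something like the sketch below. The file name hello_ml.R and the values in it are assumptions made for this example, not part of the original steps.

```r
# hello_ml.R (hypothetical file name used for this example)

# A few made-up numbers to summarise
values <- c(2, 4, 6, 8)

# Compute the mean and print it to the terminal
avg <- mean(values)
cat("Mean of values:", avg, "\n")
```

Saved in the current directory, such a file would be run with Rscript hello_ml.R after activating the environment.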

Installing machine learning packages in R

Packages make code easier to write because they contain sets of predefined functions that perform various tasks. Widely used machine learning packages include caret, e1071, nnet, kernlab, and randomForest. There are two methods for installing these packages.

Method 1: Installing Packages through RStudio

Open RStudio and click the Install Packages option under Tools in the menu bar.
Enter the names of all the packages you want to install, separated by spaces or commas, and then click Install.

Method 2: Installing Packages through Anaconda prompt/RStudio console

Open an Anaconda prompt.
Switch to the environment you used for RStudio using the command:
```
conda activate <ENVIRONMENT_NAME>
```
Enter the command R to open the R console (the command is case-sensitive on Linux).

Install the required packages using the command:

install.packages(c("<PACKAGE_1>", "<PACKAGE_2>", ..., "<PACKAGE_N>"))
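
Before installing, it can help to check which packages are already present. The sketch below is one possible way to do that; the helper name find_missing is hypothetical, and the package list simply repeats the packages named in this article. The install call is left commented out so that running the snippet does not trigger a download.

```r
# Hypothetical helper: return the packages from `pkgs`
# that are not yet installed in the current R library
find_missing <- function(pkgs) {
  setdiff(pkgs, rownames(installed.packages()))
}

# Packages mentioned in this article
required <- c("caret", "e1071", "nnet", "kernlab", "randomForest")

to_install <- find_missing(required)
if (length(to_install) > 0) {
  message("Not yet installed: ", paste(to_install, collapse = ", "))
  # Uncomment to install. Setting repos explicitly avoids the
  # mirror prompt when the code is run through Rscript.
  # install.packages(to_install, repos = "https://cloud.r-project.org")
} else {
  message("All required packages are already installed.")
}
```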

By default, install.packages() downloads the packages from CRAN.

Machine Learning packages in R

Example:

Preparing the Data Set:

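The examples below read the data set from a file named GenderClassification.csv, which should be placed in the working directory.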

# Load caret, which provides createDataPartition()
library(caret)

# Import the data set
Data <- read.csv("GenderClassification.csv",
                 stringsAsFactors = TRUE)

# Set the random seed so that the split is reproducible
set.seed(10)

# Cleaning the data set: convert the factor columns
# to their numeric level codes
Data$Favorite.Color <- as.numeric(Data$Favorite.Color)
Data$Favorite.Music.Genre <- as.numeric(Data$Favorite.Music.Genre)
Data$Favorite.Beverage <- as.numeric(Data$Favorite.Beverage)
Data$Favorite.Soft.Drink <- as.numeric(Data$Favorite.Soft.Drink)

# Split into train and test data set
# (TrainingSize holds the row indices of the training rows)
TrainingSize <- createDataPartition(Data$Gender,
                                    p = 0.8,
                                    list = FALSE)
TrainingData <- Data[TrainingSize,]
TestingData <- Data[-TrainingSize,]
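
The cleaning step relies on how R stores factors: as.numeric() applied to a factor returns the integer code of each level, and levels are sorted alphabetically by default. The small sketch below shows this on made-up values; the colors vector is hypothetical and not taken from the data set.

```r
# Hypothetical categorical values, stored as a factor
colors <- factor(c("Warm", "Cool", "Neutral", "Cool"))

# Levels are sorted alphabetically: "Cool" "Neutral" "Warm"
print(levels(colors))

# Each value is replaced by the position of its level: 3 1 2 1
codes <- as.numeric(colors)
print(codes)
```

The codes are only labels, so their numeric order carries no meaning of its own.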

CARET

# Using CARET package

# Importing the library
library(caret)

# Using the train() available in
# Caret package
model <- train(Gender ~ ., data = TrainingData, 
               method = "svmPoly",
               na.action = na.omit,
               preProcess = c("scale", "center"),
               trControl = trainControl(method = "none"),
               tuneGrid = data.frame(degree = 1, 
                                     scale = 1, 
                                     C = 1)
)
model.cv <- train(Gender ~ ., data = TrainingData,
                  method = "svmPoly",
                  na.action = na.omit,
                  preProcess = c("scale", "center"),
                  trControl = trainControl(method = "cv", 
                                           number = 6),
                  tuneGrid = data.frame(degree = 1, 
                                        scale = 1,
                                        C = 1)
)

# Printing the models
print(model)
print(model.cv)
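
Once a model is trained, it is usually checked against the held-out test rows. With the caret model above, predictions would come from predict(model, newdata = TestingData). The sketch below shows how such predictions can be compared with the true labels using base R only. The actual and predicted vectors are invented for illustration, and the "F"/"M" labels are an assumption about how Gender is coded in the data set.

```r
# Made-up true labels and predictions (not real model output)
actual    <- factor(c("M", "F", "F", "M", "F", "M"), levels = c("F", "M"))
predicted <- factor(c("M", "F", "M", "M", "F", "F"), levels = c("F", "M"))

# Confusion matrix: rows are predictions, columns are true labels
confusion <- table(Predicted = predicted, Actual = actual)
print(confusion)

# Accuracy: share of predictions that match the true label
accuracy <- sum(diag(confusion)) / sum(confusion)
cat("Accuracy:", round(accuracy, 3), "\n")
```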

ggplot2

The ggplot2 package is used for data visualization.

# Using ggplot2

# Importing the ggplot2 package
library(ggplot2)

# Creating a bar plot from the 
# Data's Favorite.Color attribute
ggplot(Data, aes(Favorite.Color)) +
  geom_bar(fill = "#0073C2FF")

randomForest

# Using randomForest

# Importing the randomForest package
library(randomForest)

# Using the randomForest function 
# From the randomForest package
model <- randomForest(formula = Gender ~ ., 
                      data = Data)
print(model)

nnet

# Using nnet

# Importing the nnet package
library(nnet)

# Using the nnet function
# In the nnet package 
model <- nnet(formula = Gender ~ ., 
              data = Data, 
              size = 30)
print(model)

e1071

# Using e1071

# Importing the e1071 package
library(e1071)

# Using the svm function 
# In the e1071 package
model <- svm(formula = Gender ~ ., 
             data = Data)
print(model)

rpart

# Using rpart

# Importing the rpart package
library(rpart)

# Using the rpart function
# to fit a decision tree
partition <- rpart(formula = Gender~., 
                   data = Data)
# Draw the tree and label its nodes
plot(partition)
text(partition)

dplyr

The dplyr package provides functions for manipulating data, such as filter, select, and arrange.

# Using dplyr

# Importing the dplyr package
library(dplyr)

# Using the filter function
# From the dplyr package 
Data %>% 
  filter(Gender == "M")
