0% found this document useful (0 votes)
71 views

LP 1 (DA) Question Bank

This document contains questions from four assignments related to data analysis topics for a data analytics course at Zeal College of Engineering and Research. The assignments cover datasets involving iris flowers, Pima Indians diabetes indicators, trip histories, and Twitter data. The questions ask students to explain concepts like datasets, data frames, data visualization, the naive Bayes algorithm, decision trees, sentiment analysis and how to use various functions in R and Python for tasks like loading data, splitting data, plotting and predicting.

Uploaded by

Dr Strange
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
71 views

LP 1 (DA) Question Bank

This document contains questions from four assignments related to data analysis topics for a data analytics course at Zeal College of Engineering and Research. The assignments cover datasets involving iris flowers, Pima Indians diabetes indicators, trip histories, and Twitter data. The questions ask students to explain concepts like datasets, data frames, data visualization, the naive Bayes algorithm, decision trees, sentiment analysis and how to use various functions in R and Python for tasks like loading data, splitting data, plotting and predicting.

Uploaded by

Dr Strange
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

ZEAL EDUCATION SOCIETY’S

ZEAL COLLEGE OF ENGINEERING AND RESEARCH


NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

LP -1(Data Analytics) Question Bank

Assignment 1: Iris flower dataset

1. What is dataset..?
2. What is data frame..?
3. How min, max ,mean, range, standard deviation, variance are calculated..? Also
explain the same.
4. Explain Histogram. Also Explain its applications.
5. Explain Boxplot. Also Explain its applications
6. What are outliers..?
7. What are quartiles in box pot..?
8. What do you mean by “read.csv” function..?
9. Explain print, summary and str function.
10. Explain “sapply” function…?
11. Explain the purpose of “hist”, “Sys.sleep” , “cat” fuctions.
12. Explain Code:

“boxplot(df[,1:4], main = "Boxplot of Iris")

“print(quantile(df[,n], probs = c(0, 0.25, .5,0.75, 1)))”

13. How to create boxplot for each feature in the dataset..?


14. How to create histogram.
15. What is Data visualization?
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

Assignment 2: Pima Indians Diabetes dataset, using Naïve Bayes


Algorithm

1. Explain Naïve Bayes Algorithm.


2. Explain Naïve Bayes Classifier.
3. Explain how to load “.csv” file.
4. What is conditional probability in Naïve Bayes Theorem…?
5. Explain application of Naïve Bayes Theorem.
6. What is confusion matrix?
7. Which function is used to split the dataset in R?
8. What is train and test data. Explain there purposes.
9. What are steps of Naïve Bayes algorithm..?
10. How to load or install packages like “library(“e1071”), Explain its methods.
11. What does “View” function do in r..?
12. Explain following functions in R
“sample.split”,
“subset”,
“dim”,
“naïve bayes”,
“predict”,
”cbind”
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

Assignment 3: Trip History Analysis

1. What is classification algorithm..?


2. Explain recursive partitioning and regression tress.
3. What is purpose of Split the dataset into Training and Test dataset..?
4. Explain following functions in “R”.
“sample.split”
“subset”
“summary”
“subset”
“head
“rpart”
“plot”
“text”
5. What is “predict” function…?
6. Explain following code:

“temp_field<-sample.split(subset_mydata,SplitRatio=0.9)”
ZEAL EDUCATION SOCIETY’S
ZEAL COLLEGE OF ENGINEERING AND RESEARCH
NARHE │PUNE -41 │ INDIA
DEPARTMENT OF COMPUTER ENGINEERING

Assignment 4: Twitter Data Analysis

1. What is Sentiment analysis..?


2. Which API is required to handle authentication.?
3. What is syntax of utility function..?
4. What is function of Text Blob library?
5. What is Re library in python?
6. Explain the concept “sentiment analysis of Twitter dataset”, How to classify the
tweets from dataset?
7. What do you mean by Categorization of tweets as positive and negative tweets..?
8. What is purpose of “sentimentr” package..?

You might also like