Big Data Analytics April 2023
IV B.Tech II Semester Regular Examinations, April – 2023
BIG DATA ANALYTICS
(Elective V for CSE & IT, Open Elective for Other Branches)
UNIT II
3 a) Explain filtering a stream in detail. [7]
b) Explain the stream data model and architecture. [8]
(OR)
4 a) Discuss the application of a Real Time Analytics platform to stock market predictions. [7]
b) Illustrate the stream processing model. [8]
UNIT III
5 a) Draw the architecture of HDFS and explain its components. [7]
b) Explain how Hadoop Streaming is suited to text processing. [8]
(OR)
6 a) Discuss the various MapReduce types and formats. [7]
b) Explain the various phases of a MapReduce job with an example. [8]
UNIT IV
7 a) Explain the key components of the Pig architecture. [7]
b) Write short notes on: i) HBase ii) ZooKeeper [8]
(OR)
8 a) How will you query data in Hive? [7]
b) Explain the two execution types (modes) in Pig. [8]
UNIT V
9 a) Distinguish between Regression and Classification. [7]
b) Explain the importance of predictive analytics for improving business. [8]
(OR)
10 a) How do businesses use Regression Analysis? [7]
b) Explain the multiple linear regression technique in detail. [8]
Code No: R194205E R19 Set No. 2
UNIT IV
7 a) Give a detailed note on HBase. [7]
b) Illustrate the architecture of Pig. [8]
(OR)
8 a) How do you create and manage databases and tables using Hive? [7]
b) Write brief notes on the distributed modes of running Pig scripts. [8]
UNIT V
9 a) Explain predictive analysis. [7]
b) Illustrate Simple Linear Regression. [8]
(OR)
10 a) Describe the importance of regression in data science and data analytics. [7]
b) How are the coefficients of multiple linear regression interpreted? Explain. [8]
Code No: R194205E R19 Set No. 3
Code No: R194205E R19 Set No. 4
UNIT II
3 a) Explain the Data streaming concept in detail. [7]
b) Write a short note on Decaying Window Algorithm. [8]
(OR)
4 a) Explain the different applications of data streams in detail. [7]
b) What is Real Time Analytics? Discuss its technologies in detail. [8]
UNIT III
5 a) Explain the anatomy of write operation in HDFS. [7]
b) Explain the MapReduce data flow with a single reduce task and with multiple reduce tasks. [8]
(OR)
6 a) Write a Java program to implement word count using the MapReduce paradigm. [7]
b) Explain the role of the combiner and partitioner phases in a MapReduce job. [8]
UNIT IV
7 a) What is HiveQL? Explain its features. [7]
b) What is ZooKeeper? Explain its features and applications. [8]
(OR)
8 a) Give a brief note on Querying Data in Hive. [7]
b) What is Apache Pig? Give its features, running modes and applications. [8]
UNIT V
9 a) How are p-values and coefficients interpreted in regression analysis? [7]
b) Explain Cross-Validation in Multiple linear regression. [8]
(OR)
10 a) Is multiple linear regression a form of predictive analytics? Justify. [7]
b) Give a brief note on Model Selection and Stepwise Regression. [8]