0% found this document useful (0 votes)
16 views2 pages

Data Analytics Important Questions

Uploaded by

TECH RISHABH 07
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views2 pages

Data Analytics Important Questions

Uploaded by

TECH RISHABH 07
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Important Questions of Data Analytics CS-503 (A)

1. What are the benefits of Big Data? Discuss challenger under Big Data.
2. Explain CRUD operations in Mongo DB.
3. How Big Data analytics can be useful in development of smart cities? (Discuss one
application).
4. What do you mean by inter and trans fire wall analytics.
5. Explain the process of data storage in Hadoop Distributed File System (HDFS) with the help
of a suitable example.
6. Explain the 4V’s of big data.
7. Explain the architecture and features of Hive.
8. Justify: SPARK is faster than Map reduce.
9. What is Apache pig and why we need it?
10. Discuss the applications of big data analytics in weather fore casting.
11. How to create collection in Mongo DB? Explain with its syntax.
12. Explain the term Pig Latin in detail
13. Explain working of Hive with proper steps and diagram.
14. Explain following for Mongo DB.
i) Indexing
ii) Aggregation
15. Write short note on any three of the following.
i) Capacity scheduler in Map Reduce
ii) 5P’s of Big Data
iii) Metastore in Hive
iv) Regression ANOVA
v) Term frequency
16. What is Big Data? Explain characteristics of Big Data.
17. Differentiate between Apache Pig and Map Reduce.
18. Explain replication and scaling features of Mongo DB.
19. Write down the goals of HDFS.
20. Define term frequency and inverse document frequency.
21. Explain 5 P’s of Big data in brief.
22. What is H Base? Explain storage mechanism of H Base with an example
23. Explain the concept of metastore in Hive.
24. How can we extract data for data storage?
25. What is Map Reduce programming model? Explain.
26. Explain Hadoop architecture and its components with proper diagram.
27. What is Zoo keeper? List the benefits of it.
28. Explain any three Hive QL DDL command with its syntax and example.
29. Write down the process of installing and running Hive.
30. Write short note on any three of the following.
i) Pig latin
ii) Information management
iii) Sharding process
iv) SPARK
v) 4 V’s of Big data
31. Suppose the weights of 800 male students are normally distributed with 28.8 kg and SD of
2.06 kg. Find the number of students whose weights are
i) Between 28.4 kg and 30.4 kg
ii) More than 31.3kg
32. A random variable has the following probability function:
x 0 1 2 3 4 5 6 7
P (x) 0 k 2k 2k 3k K2 2k2 7k2+k

Determine:
i) k
ii) mean
iii) variance

33. A sales tax officer has reported that the average sales of the 500 businesses that he has to deal
with during a year is Rs. 36,000 with a standard deviation of Rs. 10,000. Assuming that the
sales in these businesses are normally distributed, find
(i). The number of businesses as the sales of which are greater than Rs. 40,000.
(ii). The percentage of business the sales of which are likely to range between Rs. 30,000 and
Rs. 40,000.

34. Discuss the trends in big data generation and acquisition.


35. Explain the following:
a) Predictive analytics
b) Inter-and Trans-firewall analytics
c) Information management
d) Crowdsourcing analytics
36. With an example, explain the term social media analytics.
37. What are the various stages in big data analytics life cycle? Illustrate with a figure, explaining
each of them.
38. Brief about the main component of Map Reduce.
39. What is Hadoop? Describe the role of Hadoop in big data analysis, also explain core
components of Hadoop.
40. Describe the structure of HDFS in a Hadoop ecosystem using a diagram.
41. Why is finding similar items important in Big Data? Illustrate using two example applications.
42. Why to choose Hadoop for processing Big Data in detail and explain the concept of distributed
and parallel computing challenges?
43. Explain in detail the interacting process with Hadoop Ecosystem. List out various big data
processing technologies.
44. Explain Pig Data Model in detail and discuss how it will help for effective data flow?
45. Draw and explain architecture of APACHE HIVE. Explain various data insertion techniques in
HIVE with example.

You might also like